KR20070043926A

KR20070043926A - Data capture from rendered documents using handheld device

Info

Publication number: KR20070043926A
Application number: KR1020067022973A
Authority: KR
Inventors: 마틴 티. 킹; 클리포드 에이. 쿠실러; 프레이저 제임즈 퀀틴 스태포드; 데일 로렌스 그로버
Original assignee: 엑스비블리오 비.브이.
Priority date: 2004-04-01
Filing date: 2005-04-01
Publication date: 2007-04-26
Also published as: KR101178302B1

Abstract

스캐닝, 이미징 또는 다른 데이터 캡쳐링 능력을 갖는 휴대용 디바이스가 기술되어 있다. 일부의 경우에, 이 휴대가능한 디바이스는 소스 문서를 고유하게 식별하기 위해 충분한 정보가 캡쳐링된 때를 사용자에게 나타낼 수 있다. 일부의 경우에, 휴대가능한 디바이스가 언제 및 어디에서 데이터 캡쳐가 일어났는지를 지시하는 타임스탬프 및 위치스탬프를 계산한다. 일부의 경우에, 휴대가능한 디바이스가 제스쳐에 의해 제어된다. 일부의 경우에, 휴대가능한 스캐닝 디바이스는 연관된 빌링 및 컨텐트/서비스 가입 정보를 가지고 있다. Portable devices with scanning, imaging or other data capturing capabilities are described. In some cases, this portable device can indicate to the user when sufficient information has been captured to uniquely identify the source document. In some cases, the portable device calculates timestamps and location stamps indicating when and where data capture occurred. In some cases, the portable device is controlled by the gesture. In some cases, the portable scanning device has associated billing and content / service subscription information.

휴대가능한 디바이스, 스캐닝, 이미징, 데이터 캡쳐링, 타임스탬프, 위치스탬프, 제스쳐, 빌링, 컨텐트 Portable Device, Scanning, Imaging, Data Capturing, Timestamp, Location Stamp, Gesture, Billing, Content

Description

DATA CAPTURE FROM RENDERED DOCUMENTS USING HANDHELD DEVICE}

본 발명은 일반적으로 휴대가능한 데이터 캡쳐링 디바이스에 관련되고, 더 구체적으로는, 이미지 및/또는 오디오 클립을 캡쳐할 수 있는 휴대가능한 디바이스에 관한 것이다. The present invention generally relates to a portable data capturing device, and more particularly to a portable device capable of capturing images and / or audio clips.

컴퓨터의 시대에도 페이퍼 문서가 증가하는 것으로부터 알 수 있듯이, 페이퍼 문서에 대한 지속적인 수요가 있다. 오늘날은 그 어느 때 보다도 페이퍼 문서를 프린트하고 출판하는 것이 쉽다. 복제, 전사, 검색 및 편집을 함에 있어서 전자 문서가 더 간편함에도 불구하고 페이퍼 문서는 여전히 널리 이용되고 있다. There is a continuing demand for paper documents, as can be seen from the growing number of paper documents in the computer age. Today it is easier than ever to print and publish paper documents. Although electronic documents are simpler to copy, transcribe, search, and edit, paper documents are still widely used.

페이퍼 문서의 인기와 전자 문서의 장점을 고려할 때, 이 두 장점을 결합한다면 유용할 것이다. Given the popularity of paper documents and the advantages of electronic documents, it would be useful to combine these two advantages.

도1은 코어 시스템의 일 실시예에서의 정보 흐름을 도시하는 데이터 흐름도,1 is a data flow diagram illustrating the flow of information in one embodiment of a core system;

도2는 전형적인 작동환경의 컨텍스트에서 시스템의 전형적인 구현에 포함되는 구성요소들을 나타내는 도면, 2 illustrates the components involved in a typical implementation of a system in the context of a typical operating environment.

도3은 스캐너의 실시예를 나타내는 블록도, 3 is a block diagram showing an embodiment of a scanner;

도4는 휴대가능한 스캐닝 디바이스의 전형적인 사용을 도시한 개략도, 4 is a schematic diagram illustrating a typical use of a portable scanning device;

도5는 전형적인 휴대가능 스캐닝 디바이스의 실시예의 기능블록도,5 is a functional block diagram of an embodiment of a typical portable scanning device;

도6은 전형적으로 본 시스템에 의해 사용되는 데이터 기록용 포맷을 도시하는 데이터 구조도, Fig. 6 is a data structure diagram showing a format for recording data typically used by the present system;

도7은 휴대가능한 디바이스를 사용하여 문서가 스캐닝될 때의 위치 및/또는 시간에 대한 정보를 검출하고 저장하기 위해 본 시스템에서 전형적으로 실행되는 단계를 나타내는 흐름도, 7 is a flow diagram illustrating steps typically executed in the present system to detect and store information about the location and / or time when a document is scanned using a portable device;

도8은 사용자가 원형상 제스쳐를 하는 것을 검출하기 위해 본 시스템에 의해 전형적으로 실행되는 단계를 나타내는 흐름도, 8 is a flow diagram illustrating steps typically executed by the present system to detect a user making a circular gesture;

도9는 원형상 제스쳐를 행할 때의 사용자의 시도(attempt)에 대한 일례들을 나타내는 도면, FIG. 9 shows examples of an attempt of a user when performing a circular gesture; FIG.

도10은 러빙 제스쳐(rubbing gesture)를 검출하기 위해 시스템에 의해 전형적으로 실행되는 단계를 나타내는 흐름도, 10 is a flow diagram illustrating steps typically executed by a system to detect a rubbing gesture;

도11은 스캐너가 문서의 후방(우측에서 좌측으로) 방향으로 움직이는 것을 도시하는 도면, 11 is a diagram showing the scanner moving in the rear (right to left) direction of the document;

도12는 디바이스 근처에서 휴대가능한 스캐너와 연관시키기 위한 일 시스템 배치(configuration)의 블록도, 12 is a block diagram of one system configuration for associating with a portable scanner near a device;

도13은 스캐닝 디바이스 및 서비스 프로바이더에 연관된 전형적인 문의 세션(query session)을 도시하는 블록도,13 is a block diagram illustrating an exemplary query session associated with a scanning device and a service provider;

도14는 컨텐트를 스캐너-관련 디바이스로 제공하기 위해 본 시스템에 의해 디바이스들간에 전형적으로 실행되는 인터랙션을 도시하는 동작 흐름도,FIG. 14 is an operational flow diagram illustrating interactions typically executed between devices by the system to provide content to a scanner-related device;

도15는 문서의 두 라인으로부터 텍스트를 캡쳐하는 휴대가능한 스캐너를 나타내는 도면,15 illustrates a portable scanner for capturing text from two lines of a document;

도16은 캐릭터 오프셋을 결정하는 컨벌루션의 일 실시예를 나타내는 도면,16 illustrates an embodiment of a convolution for determining a character offset;

도17은 컨벌루션 프로세스를 개념화하기 위한 방법을 도시한 설명도,17 is an explanatory diagram illustrating a method for conceptualizing a convolution process;

도18은 또 다른 설명도로서, 메모리의 카피 위에 슬라이스 카피(slice copy)가 도시되어 있어서 매치가 왜 발견되는지를 더 확실히 알 수 있다,18 is another explanatory diagram in which a slice copy is shown on top of a copy of the memory to more clearly see why a match is found.

도19는 이미지 상에서 컨벌루션 프로세스를 실행하기 위해 본 시스템에 의해 전형적으로 실행되는 단계를 나타내는 흐름도, FIG. 19 is a flow diagram illustrating steps typically executed by the present system to execute a convolution process on an image;

도20은 마우스 아래 표면을 나타내기 위해 뷰잉 윈도우를 갖는 스캐너/마우스를 나타내는 도면, 20 illustrates a scanner / mouse with a viewing window to represent a surface under the mouse;

도21은 사용자가 무엇이 스캐닝되고 있는지 알 수 있도록 하우징 상부에 장착된 디스플레이(LCD, LED 등)를 갖는 스캐너/마우스를 나타내는 도면,FIG. 21 shows a scanner / mouse with a display (LCD, LED, etc.) mounted on top of the housing to allow the user to know what is being scanned;

도22는 종래의 기계식 x/y 메커니즘을 갖는 마우스 및 광학 스캐너 등과 같이, 별개의 포지션-감지 및 스캐닝 메커니즘을 갖는 마우스의 블록도, Fig. 22 is a block diagram of a mouse having separate position-sensing and scanning mechanisms, such as a mouse with a conventional mechanical x / y mechanism, an optical scanner, and the like;

도23은 렌더링된 문서로부터 x/y 이동을 검출하고 데이터를 스캐닝하기 위해 사용될 수 있는 광학 센서 어셈블리를 갖는 마우스를 나타내는 블록도, FIG. 23 is a block diagram illustrating a mouse having an optical sensor assembly that can be used to detect x / y movement from a rendered document and scan data; FIG.

도24는 스캐너 헤드 아래에 있는 것의 이미지를 뷰파인더로 반영하기 위해 일련의 거울을 사용하는 마우스/스캐너의 측면도, Figure 24 is a side view of the mouse / scanner using a series of mirrors to reflect an image of what is under the scanner head into the viewfinder;

도25는 감광성 반도체칩(CMOS, CCD 등)과 작동적으로 연결된 이미지 도 관(image conduit)을 사용하는 마우스/스캐너의 실시예를 나타내는 도면, FIG. 25 illustrates an embodiment of a mouse / scanner using an image conduit operatively connected with photosensitive semiconductor chips (CMOS, CCD, etc.);

도26은 스캐닝 헤드의 아래로 지나가는 텍스트를 사용자가 볼 수 있도록, 스캐닝 메커니즘의 어느 한쪽 측면상에 필수적으로 있는 윈도우인 뷰파인더를 갖는 마우스/스캐너의 상면도,Fig. 26 is a top view of the mouse / scanner with a viewfinder, which is a window essentially on either side of the scanning mechanism, so that the user can see text passing under the scanning head;

도27은 예시적인 핸드헬드 문서 데이터 캡쳐 디바이스의 모양을 보여주는 개략도,27 is a schematic diagram showing the shape of an exemplary handheld document data capture device;

도28은 주석 디바이스(annotator)의 일 실시예를 나타내는 블록도,28 is a block diagram illustrating one embodiment of an annotator;

도29는 전형적으로 USB 포트와 같은 통신 포트를 통해 PC 등과 같은 프로세싱 디바이스와 연결된 디바이스를 나타내는 도면,FIG. 29 illustrates a device typically coupled with a processing device such as a PC via a communication port such as a USB port; FIG.

도30은 본 시스템이 실행되고 있는 컴퓨터 시스템 및 다른 디바이스의 적어도 일부에 전형적으로 결합되어 있는 구성요소의 일부를 도시하는 블록도, 30 is a block diagram illustrating some of the components typically coupled to at least some of the computer systems and other devices on which the present system is running;

도31은 전자 문서를 주석(annotate)하기 위해 본 시스템에 의해 사용되는 전형적인 프로세스를 나타내는 흐름도, Fig. 31 is a flowchart showing an exemplary process used by the present system for annotating electronic documents.

도32는 사용자에 의해 입력된 주석을 나타내기 위해 본 시스템에 의해 사용되는 예시적인 주석 테이블을 도시한 테이블도 이다. FIG. 32 is a table diagram illustrating an exemplary annotation table used by the system to represent annotations entered by a user.

개요summary

렌더링된 문서에 포함된 텍스트 상에서 캡쳐하고 작동하기 위한 휴대가능한 디바이스("디바이스")가 설명되고, 몇몇 경우에는, 휴대가능한 디바이스에 의해 캡쳐된 텍스트를 처리하는 더 광범위한 시스템("시스템")의 일부로서 설명된다. A portable device (“device”) for capturing and operating on text contained in a rendered document is described, and in some cases, part of a broader system (“system”) that processes text captured by the portable device. It is described as.

일부 실시예에서, 스캐닝 능력을 갖춘 휴대가능 디바이스는 사용자가 문서를 다른 것과 구별될 정도로 인식하기에 충분한 텍스트 또는 다른 정보를 스캐닝하였다는 것을 사용자에게 알려준다. 일부 실시예에서는 휴대가능 디바이스가, 이미지를 획득하기 위한 이미지 캡쳐 디바이스, 이미지를 처리하기 위한 프로세서, 데이터 및/또는 (컴퓨터 프로그램 등과 같은) 로직을 저장하기 위한 메모리, 다른 디바이스들과 통신하기 위한 입력/출력 통신 인터페이스, 전원, 스캐닝되고 있는 정보를 설명하기 위한 일러스트레이션 소스, 및 로케이션 모듈을 포함하고 있다. In some embodiments, the portable device with scanning capability informs the user that the user has scanned enough text or other information to recognize the document to be distinguished from the others. In some embodiments, the portable device is an image capture device for acquiring an image, a processor for processing the image, a memory for storing data and / or logic (such as a computer program), an input for communicating with other devices. Includes / output communication interface, power source, illustration source for describing the information being scanned, and location module.

몇몇 실시예에서는, 비주얼 스캐닝 능력 외에 혹은 이에 더하여, 휴대가능 디바이스가 오디오 텍스트 캡쳐 능력을 가지고 있어서, 사용자가 렌더링된 문서로부터 큰 소리로 읽고 있는 동안 디바이스가 오디오 클립을 캡쳐할 수 있도록 한다. 전형적으로 이 시스템은 오디오 클립에 음성인식기술을 적용함으로써 오디오 텍스트 캡쳐 동작으로부터 텍스트 콘텐트를 도출한다. In some embodiments, in addition to or in addition to visual scanning capabilities, the portable device has audio text capture capabilities, allowing the device to capture audio clips while the user is reading aloud from the rendered document. Typically the system derives the text content from the audio text capture operation by applying speech recognition technology to the audio clip.

비록 여기에서 언급하고 있는 휴대가능 텍스트 캡쳐 디바이스가 때로는 구체적으로 비주얼 스캐닝에 대한 것이긴 하지만, 당업자라면 그러한 언급이 오디오 텍스트 캡쳐 등과 같은 다른 텍스트 캡쳐 기술을 사용한 휴대가능 텍스트 캡쳐 디바이스에도 동일하게 적용될 수 있음을 이해할 것이다. Although the portable text capture device referred to herein is sometimes specifically related to visual scanning, those skilled in the art can equally apply to portable text capture devices using other text capture techniques such as audio text capture. Will understand.

일부 실시예에서, 스캐너에 의해 스캔된 텍스트 또는 심볼이 사용되어 스캐너의 제어 로직 또는 제어 소프트웨어에 의해 제어 커맨드로 해석되어, 스캐너로 하여금 소프트웨어 프로그램을 실행하게 하거나, 혹은 그렇지 않으면 미리 지정된 어느 특정 액션(예컨대 메모리로부터의 데이터 삭제, 턴 온/오프, 금융 인터랙션의 개시 및/또는 완료 등)을 실행하게 한다. In some embodiments, text or symbols scanned by the scanner are used to be interpreted as control commands by the control logic or control software of the scanner to cause the scanner to execute a software program, or otherwise specify some predefined action ( For example, deleting data from memory, turning on / off, initiating and / or completing financial interactions, and the like.

일부 실시예에서 휴대가능 스캐너는, 문서를 식별하기에 충분한 텍스트가 스캐닝되어 전자 카피가 위치지정될 수 있다는 것을 사용자에게 알려준다. 휴대가능 스캐너는 스캐닝된 정보의 양을 소정의 임계 레벨과 비교하여 충분한 정보가 스캐닝되었는지 여부를 결정한다. (이러한 임계 방법은 스캐너가 컴퓨터와 통신하고 있지 않을 때 특히 유용하다.) 휴대가능 스캐너가 원격 컴퓨터와 통신할 때, 원격 컴퓨터는 텍스트가 스캐닝되고 있는 문서를 식별하였다는 것을 나타내는 메시지를 스캐너에게 전송할 수 있다. 이 메시지의 수신에 응답하여, 휴대가능 스캐너는 문서가 식별되었고 사용자가 스캐닝을 중단할 수 있다는 것을 사용자에게 알려준다. 다양한 실시예에서, 이러한 알림은 시각적(예컨대 발광디바이스(LED), 디스플레이 등), 청각적(예컨대 스피커, 비퍼 등), 또는 촉각적(촉감을 자극하는 것)이다. In some embodiments, the portable scanner informs the user that sufficient text to identify the document can be scanned so that the electronic copy can be positioned. The portable scanner compares the amount of information scanned with a predetermined threshold level to determine whether enough information has been scanned. (This critical method is particularly useful when the scanner is not communicating with a computer.) When the portable scanner communicates with a remote computer, the remote computer sends a message to the scanner indicating that the text has identified the document being scanned. Can be. In response to receiving this message, the portable scanner informs the user that the document has been identified and that the user can stop scanning. In various embodiments, such notifications are visual (eg, light emitting devices (LEDs), displays, etc.), acoustic (eg, speakers, beepers, etc.), or tactile (stimulating tactile).

일부 실시예에서, 휴대가능 스캐너는 위치 및/또는 시간 결정능력을 가지고 있어서, 스캐닝된 데이터와 함께 스캔이 언제 및/또는 어디에서 일어나는지에 대한 위치 및/또는 시간 정보를 저장할 수 있다. 시간 정보는 특정 스캐닝 이벤트의 타임 스탬프(time stamp)일 수 있다. 위치 정보는 특정 스캐닝 이벤트의 위치 스탬프 일 수 있다. In some embodiments, the portable scanner has location and / or time determining capability to store location and / or time information about when and / or where a scan occurs with the scanned data. The time information may be a time stamp of a particular scanning event. The location information may be a location stamp of a particular scanning event.

일부 실시예에서, 휴대가능 스캐너와 같은 휴대가능 디바이스의 동작은 스캔의 특성, 예컨대 속도, 반복성, 방향 등에 의해 제어된다. 또한, 스캐너의 제어 프로그램 또는 로직이 특정 심볼에 반응될 수 있다. 이러한 특정 심볼은 휴대가능 디바이스에 의해 수행되어야 할 특정 액션 또는 실행되어야 할 프로그램에 연관될 수 있다. In some embodiments, the operation of a portable device, such as a portable scanner, is controlled by the characteristics of the scan, such as speed, repeatability, direction, and the like. In addition, the scanner's control program or logic can be reacted to a particular symbol. This particular symbol may be associated with a particular action to be performed by the portable device or a program to be executed.

일부 실시예에서, 휴대가능 스캐너가 빌링(billing), 가입, 및/또는 디바이스 식별자 정보를 메모리에 저장해 놓을 수 있다. 가입 정보는 식별된 문서의 전자 카피에 대한 사용자의 액세스 권한을 인증하는데 사용될 수 있다. 빌링 정보는 식별된 문서의 전자 카피에 대한 액세스를 위해 비용을 지불하는데 사용될 수 있다. 디바이스 식별자는 사용자 식별을 인증하는데 도움을 위해 보안 특성으로서 사용될 수 있다. In some embodiments, the portable scanner may store billing, subscription, and / or device identifier information in memory. Subscription information may be used to authenticate a user's access to an electronic copy of the identified document. Billing information can be used to pay for access to an electronic copy of the identified document. The device identifier can be used as a security feature to help authenticate the user identification.

Ⅰ부 - Part I- 서 론Introduction

1.One. 시스템의 특성Characteristics of the system

그 대응되는 전자 부본(electronic counterpart)을 갖는 모든 페이퍼 문서의 각각에 대해, 전자 부본을 식별할 수 있는 페이퍼 문서 내에는 이산된 량의 정보가 존재한다. 일부 실시예에서, 본 시스템은 예컨대 핸드헬드 스캐너를 통해 페이퍼 문서에서 캡쳐된 텍스트의 샘플을 이용하여, 문서의 전자 부본을 식별하고 위치결정(locate) 한다. 대부분의 경우, 문서 텍스트의 약간의 단어가 페이퍼 문서에 대한 식별자로서 및 그의 전자 부본과의 링크로서 기능을 할 수 있다는 점에서, 기기에 의해 필요한 텍스트의 양은 매우 적다. 또한, 본 시스템은 이렇게 적은 단어들을 사용하여 문서를 식별할 뿐만 아니라 문서내에서의 위치도 식별할 수 있다. For each of all paper documents having a corresponding electronic counterpart, there is a discrete amount of information in the paper document from which the electronic copy can be identified. In some embodiments, the system identifies and locates an electronic copy of a document using, for example, a sample of text captured in a paper document via a handheld scanner. In most cases, the amount of text required by the device is very small in that some words of the document text can function as an identifier for the paper document and as a link to its electronic copy. In addition, the system can use these few words to identify the document as well as its location within the document.

따라서 페이퍼 문서 및 그의 디지털 부본은 이하에서 설명되는 본 시스템을 사용하여 많은 유용한 방식으로 연관될 수 있다. Thus, the paper document and its digital copy can be associated in many useful ways using the present system described below.

1.1. 전망에 대한 간단한 개요 1.1. Brief overview of the outlook

본 시스템이 일단 페이퍼 문서의 텍스트의 일부를 특정 디지털 엔티티와 연관시키는 것이 설정되면, 시스템은 그 연관성에 대해 대량의 기능성(functionality)을 구성할 수 있다. Once the system is set up to associate a portion of the text of a paper document with a particular digital entity, the system can configure a large amount of functionality for that association.

점차적으로 대부분의 페이퍼 문서가, 월드와이드웹 상에서 또는 일부 다른 온라인 데이터 베이스 또는 문서 전집으로부터 액세스 가능하거나 혹은 요금납부 또는 가입 등에 의해 액세스될 수 있는 전자 부본을 가지고 있는 것이 사실이다. 그러면, 가장 간단한 수준에서, 사용자가 페이퍼 문서의 약간의 단어를 스캔하면 본 시스템이 전자 문서 또는 그 일부분을 검색하거나, 디스플레이 하거나, 다른 사람에게 이메일로 전송하거나, 구매하거나, 프린트하거나, 또는 웹페이지에 게시할 수 있다. 부수적인 예로서, 어떤 사람이 아침식사 동안 읽고 있는 책의 일부 단어를 스캐닝함으로써 그 사람의 자동차에 있는 오디오 북 버전은 그가 회사에 가려고 운전을 시작하는 순간부터 읽기동작을 개시할 수 있고, 또는 프린트 카트리지의 일련번호를 스캐닝함으로써 교체를 지시하는 프로세스를 개시할 수 있을 것이다. Increasingly, it is true that most paper documents have electronic copies that can be accessed on the World Wide Web or from some other online database or document collection, or by payment or subscription. Then, at the simplest level, when a user scans a few words in a paper document, the system retrieves, displays, emails, purchases, prints, or webpages an electronic document or a portion thereof. Post to As an additional example, by scanning some words in a book that a person is reading during breakfast, the audiobook version in that person's car can start reading from the moment he starts driving to go to work, or print Scanning the cartridge's serial number may initiate a process to instruct replacement.

본 시스템은 이와 같은 그리고 "페이퍼/디지털 통합"의 많은 다른 예들을 현재의 문서 읽기, 프린트, 및 출판 과정에 변화를 요하지 않으면서도 구현하여, 이러한 종래 방식으로 렌더링된 문서에 디지털 기능성의 완전히 새로운 장을 제공한다. The system implements many of these and other examples of "paper / digital integration" without requiring changes to the current document reading, printing, and publishing process, thus creating a whole new chapter in digital functionality in such conventionally rendered documents. To provide.

1.2. 용어 1.2. Terms

본 시스템의 전형적인 사용은 페이퍼 문서로부터 텍스트를 스캔하는 광스캐너를 사용하는 것에서부터 시작한다. 하지만 다른 형태의 문서로부터의 다른 캡쳐 방법도 동등하게 적용될 수 있다는 것에 유의해야 한다. 따라서 본 시스템은 때로는 렌더링된 문서로부터 텍스트를 스캐닝 또는 캡쳐링 하는 것으로 설명되며, 이러한 용어는 다음과 같이 정의된다: Typical use of the system begins with the use of an optical scanner that scans text from a paper document. However, it should be noted that other capture methods from other types of documents may be equally applicable. Thus, the system is sometimes described as scanning or capturing text from a rendered document, which terms are defined as follows:

렌더링된 문서는 프린트된 문서 또는 디스플레이나 모니터 상에 보여지는 문서이다. 또한 이것은 영구적 형태이든 또는 일시적인 디스플레이 상의 것이든, 사람에게 인식가능한 문서이다. A rendered document is a printed document or a document shown on a display or monitor. It is also a document recognizable to a person, whether in permanent form or on a temporary display.

스캐닝 또는 캡쳐링은 렌더링된 문서로부터 정보를 얻기 위한 체계적 조사 프로세스이다. 이 프로세스는 스캐너 또는 카메라(예컨대 휴대폰에 장착된 카메라)를 이용하는 광학적 캡쳐를 포함할 수 있고, 문서를 소리내어 읽어 오디오 캡쳐 디바이스로 옮기는 것 또는 키패드나 키보드로 타이핑하는 것을 포함할 수 있다. 더 많은 예들은 섹션 15를 참조하라. Scanning or capturing is a systematic investigation process for obtaining information from a rendered document. This process may include optical capture using a scanner or camera (eg, a camera mounted on a mobile phone), and may include reading the document out loud and moving it to an audio capture device or typing on a keypad or keyboard. See section 15 for more examples.

2.2. 본 시스템의 도입Introduction of this system

이 부분은 페이퍼/디지털 통합을 위한 시스템을 구성하는 디바이스, 프로세스 및 시스템의 일부에 대해 설명한다. 다양한 실시예에서 본 시스템은 기본 기능성을 제공하는 이러한 기본 코어 위에서 매우 다양한 서비스와 애플리케이션을 구현한다. This section describes some of the devices, processes, and systems that make up a system for paper / digital integration. In various embodiments, the system implements a wide variety of services and applications on this basic core that provide basic functionality.

2.1. 프로세스 2.1. process

도1은 코어 시스템의 일 실시예에서의 정보 흐름을 도시하는 데이터 흐름도이다. 다른 실시예들은 여기에 도시된 모든 단계나 구성요소를 전부 사용하지 않을 수도 있고, 일부 실시예들은 그보다 많이 사용할 수도 있다. 1 is a data flow diagram illustrating the flow of information in one embodiment of a core system. Other embodiments may not use all of the steps or components shown herein, and some embodiments may use more.

전형적으로 광스캐너에 의한 광학적 형식으로 또는 보이스 레코더에 의한 오디오 형식으로, 렌더링된 문서로부터 텍스트가 캡쳐링되고(100), 이 이미지 또는 사운드 데이터는 캡쳐 프로세스의 인위적 요소를 제거하거나 신호대 잡음비를 향상시키기 위해 처리된다(102). 그 후 OCR, 속도인식, 또는 오토 코릴레이션(autocorrelation) 등과 같은 인식 프로세스(104)가, 일부 실시예에서 텍스트, 텍스트 오프셋, 또는 다른 심볼로 구성되는 기호(signature)로 데이터를 변환시킨다. 대안적으로, 본 시스템은 렌더링된 문서로부터 문서 기호를 추출하는 대안적 형태를 수행한다. 일부 실시예에서 기호는 한 세트의 가능한 텍스트 전사(trnascription)를 나타낸다. 이 프로세스는 다른 단계로부터의 피드백에 의해 영향을 받을 수 있는데, 예컨대, 검색 프로세스 및 컨텍스트 해석(100)이 캡쳐가 유래한 일부 후보 문서들(candidate documents)을 식별하였다면, 최초 캡쳐의 가능한 해석(interpretation)은 범위가 좁혀진다. Typically in optical format by an optical scanner or audio format by a voice recorder, text is captured from the rendered document (100), and this image or sound data can be used to remove artificial elements of the capture process or to improve the signal-to-noise ratio. Is handled 102. Recognition process 104, such as OCR, velocity recognition, autocorrelation, or the like, then transforms the data into a signature consisting of text, text offsets, or other symbols in some embodiments. Alternatively, the system performs an alternative form of extracting document symbols from the rendered document. In some embodiments the symbol represents a set of possible text trnascriptions. This process can be influenced by feedback from other steps, for example, if the search process and context interpretation 100 have identified some candidate documents from which the capture originated, possible interpretation of the original capture. ) Is narrowed.

후처리(106) 단계는 인식 처리의 출력을 취하여 이것을 필터링하거나 이에 대한 다른 작업을 수행하여 유용하게 한다. 구현되는 실시예에 따라, 이 단계에서 일부 직접 액션(107)이 후속 단계의 참조없이 즉시 취출되는데, 예컨대 사용자의 의도를 전달하기에 그 자체로 충분한 정보를 포함하는 프레이즈(phrase)나 심볼이 캡쳐된다. 이러한 경우 어떠한 디지털 부본 문서도 참조될 필요가 없고 심지어 본 시스템에 알려질 필요도 없다. Post-processing 106 takes the output of the recognition process and filters it or otherwise performs other tasks to make it useful. Depending on the embodiment implemented, at this stage some direct action 107 is immediately taken out without reference to a subsequent step, e.g. a phrase or symbol is captured that contains enough information per se to convey the user's intent. do. In this case no digital copy documents need to be referenced and even need not be known to the system.

그러나, 전형적으로 다음 단계는 검색에 사용하기 위한 문의(query) 또는 한 세트의 문의를 구성하는 것이 될 것이다(108). 문의 구성의 일부 측면은 사용된 검색 프로세스에 의존할 수 있고, 따라서 다음 단계까지는 수행되지 못할 것이다. 그러나 전형적으로, 명백하게 잘못 인식된 또는 전혀 관련없는 캐릭터의 제거와 같이 미리 수행될 수 있는 일부 동작이 있을 수 있다. Typically, however, the next step will be to construct a query or set of queries for use in the search (108). Some aspects of the construction of the query may depend on the search process used, and thus will not be performed until the next step. Typically, however, there may be some actions that can be performed in advance, such as the removal of obviously misrecognized or irrelevant characters.

그 후 문의 또는 문의들은 검색 및 컨텍스트 분석 단계(110)로 전달된다. 여기서, 시스템은 원본 데이터가 캡쳐된 문서를 식별하기 위해 선택적으로 시도한다. 그러기 위해, 시스템은 전형적으로 검색 인덱스 및 검색 엔진(112), 사용자에 대한 지식(114), 및 사용자의 컨텍스트 또는 캡쳐가 발생된 컨텍스트에 대한 지식(116)을 사용한다. 검색 엔진(112)은 구체적으로 렌더링된 문서에 대한, 이들의 디지털 부본 문서에 대한, 그리고 웹(인터넷)에 존재하고 있는 문서에 대한 정보를 사용 및/또는 인덱스 할 수 있다. 또한 이러한 많은 소스로부터 판독할 뿐만 아니라 이들을 기록할 수도 있고, 언급한 바와 같이, 예컨대 언어, 폰트, 렌더링, 및 후보 문서의 지식에 근거하여 가능한 다음 단어에 대한 정보를 인식 시스템(104)에게 제공함으로써, 정보를 프로세스의 다음 단계에 공급할 수 있다. The query or queries are then passed to a search and context analysis step 110. Here, the system optionally attempts to identify the document in which the original data was captured. To do so, the system typically uses the search index and search engine 112, knowledge of the user 114, and knowledge of the user's context or the context in which the capture occurred. The search engine 112 may use and / or index information about rendered documents, their digital copy documents, and about documents that exist on the web (internet). In addition to reading from many of these sources, it is also possible to record them and, as noted, by providing the recognition system 104 with information about the next possible word based on language, font, rendering, and knowledge of the candidate document, for example. The information can then be fed to the next step in the process.

일부 환경에서, 다음 단계는 식별되었던 문서 또는 문서들의 카피를 검색하는 것이다(120). 문서(124)의 소스는 예컨대 로컬 파일링 시스템 또는 데이터베이스 또는 웹 서버로부터 직접 액세스될 수 있거나, 혹은, 인증, 보안 또는 지불을 실행할 수 있는 몇몇 액세스 서비스(122)를 통해 연결될 수 있거나 또는 소정 포맷으로의 문서의 변환 등과 같은 다른 서비스를 제공할 수 있다. In some circumstances, the next step is to retrieve 120 the document or copy of documents that have been identified. The source of the document 124 may be accessed directly, for example, from a local filing system or database or web server, or may be connected via some access service 122 that may execute authentication, security or payment, or in a predetermined format. Other services such as document conversion can be provided.

시스템의 애플리케이션은 여분의 기능성이나 데이터와 문서의 전부 또는 일부분과의 연관성을 이용할 수 있다. 예컨대, 섹션 10.4에서 논의하는 바와 같이 광고(advertising) 애플리케이션은 특정 광고 메시지 또는 주제와 문서의 일부분과의 연관성을 이용할 수 있다. 이 여분의 연관된 기능성 또는 데이터는 문서 상의 하나 이상의 오버레이(overlay)로서 고려될 수 있고, 여기서 "마크업(markup)"으로 언급된다. 그 후, 프로세스(130)의 다음 단계는 캡쳐링된 데이터에 관련된 임의의 마크업을 식별하는 것이다. 이러한 마크업은 사용자, 문서의 창작자 또는 출판자, 또는 다른 누군가에 의해 제공될 수 있고, 일부 소스(132)로부터 직접 액세스되거나 혹은 일부 서비스(134)에 의해 생성될 수 있다. 다양한 실시예에서, 마크업은 렌더링된 문서 및/또는 렌더링된 문서의 디지털 부본, 또는 이들 문서들 중 어느 하나 또는 둘 다의 그룹에 적용되거나 이들에 연관될 수 있다. Applications in the system can take advantage of the extra functionality or association of data with all or part of the document. For example, as discussed in section 10.4, an advertising application may utilize the association of a particular advertising message or subject with a portion of the document. This extra associated functionality or data can be considered as one or more overlays on the document, referred to herein as " markup. &Quot; Then, the next step in process 130 is to identify any markup related to the captured data. Such markup may be provided by the user, the creator or publisher of the document, or someone else, and may be accessed directly from some source 132 or generated by some service 134. In various embodiments, markup may be applied to or associated with a rendered document and / or a digital copy of a rendered document, or a group of either or both of these documents.

마지막으로, 초기 단계들의 결과, 일부 액션이 취해진다(140). 이들은 발견된 정보를 단순히 기록하는 것과 같은 디폴트(default) 액션이거나, 데이터 또는 문서에 의존하는 것일 수 있고, 또는 마크업 분석으로부터 유도될 수도 있다. 때로는 이 액션은 데이터를 다른 시스템으로 단순히 전달하는 것이다. 몇몇의 경우, 렌더링된 문서에서의 특정 지점의 캡쳐에 적합한 다양한 가능한 액션이, 예커대 로컬 디스플레이(332), 컴퓨터 디스플레이(212), 또는 모바일 폰이나 PDA 디스플레이(216) 등과 같은 관련 디스플레이 상에 메뉴로서 사용자에게 제공될 것이다. 만일 사용자가 이 메뉴에 응답하지 않는다면, 디폴트 액션이 취해질 수 있다. Finally, as a result of the initial steps, some action is taken 140. These may be default actions, such as simply recording the found information, or may be dependent on data or documents, or may be derived from markup analysis. Sometimes this action is simply passing data to another system. In some cases, various possible actions suitable for capturing a specific point in a rendered document may be displayed on a menu on an associated display, such as a docker versus local display 332, a computer display 212, or a mobile phone or PDA display 216. Will be provided to the user. If the user does not respond to this menu, a default action can be taken.

2.2. 구성요소 2.2. Component

도2는 전형적인 작동환경의 컨텍스트에서 시스템의 전형적인 구현에 포함되는 구성요소들을 나타내는 도면이다. 도시된 바와 같이 작동환경은 하나 이상의 광학적 스캐닝 캡쳐 디바이스(202) 또는 보이스 캡쳐 디바이스(204)를 포함하고 있다. 일부 실시예에서는, 동일한 디바이스가 양쪽 기능을 모두 수행한다. 각각의 캡쳐 디바이스는 직통 유선 또는 무선 연결중 하나를 사용하거나 네트워크(220)를 통해 컴퓨터(212) 및 모바일 스테이션(216)(예컨대, 모바일 폰 또는 PDA) 등과 같은 시스템의 다른 부분과 통신할 수 있고, 유선 및 무선 연결을 사용하여 통신하는 것은 전형적으로 무선 기지국(214)을 포함한다. 일부 실시예에서, 캡쳐 디바이스는 모바일 스테이션과 통합되고, 음성통신 및 사진촬영용 디바이스에 사용되는 오디오 및/또는 광학 구성요소의 일부와도 선택적으로 공유한다. 2 illustrates the components involved in a typical implementation of a system in the context of a typical operating environment. As shown, the operating environment includes one or more optical scanning capture device 202 or voice capture device 204. In some embodiments, the same device performs both functions. Each capture device may communicate with other parts of the system, such as computer 212 and mobile station 216 (eg, mobile phone or PDA), using either a direct wired or wireless connection or via network 220. Communication using wired and wireless connections typically includes a wireless base station 214. In some embodiments, the capture device is integrated with the mobile station and optionally shares with some of the audio and / or optical components used in the voice communications and photography device.

컴퓨터(212)는 스캐닝 디바이스(202 및 204)로부터의 오더(order)를 처리하기 위한 컴퓨터실행가능 명령어를 포함하고 있는 메모리를 가질 수 있다. 일 예로서, 오더는 (스캐닝 디바이스(202/204)의 일련번호 또는 스캐너 사용자를 부분적으로 또는 완전히 구별하여 식별하는 식별자 등과 같은) 식별자, 스캐닝 컨텍스트 정보(예컨대, 스캔 시간, 스캔 위치 등), 및/또는 스캐닝되고 있는 문서를 다른 것과 구별하여 식별하는데 사용되는 (텍스트 스트링(string)과 같은) 스캐닝 정보를 포함할 수 있다. 대안적인 실시예에서, 작동 환경은 보다 작거나 많은 구성요소를 포함할 수도 있다. Computer 212 may have a memory containing computer executable instructions for processing orders from scanning devices 202 and 204. As an example, the order may include an identifier (such as the serial number of the scanning device 202/204 or an identifier that partially or completely distinguishes the scanner user), scanning context information (eg, scan time, scan location, etc.), and And / or include scanning information (such as a text string) used to distinguish the document being scanned from another. In alternative embodiments, the operating environment may include smaller or more components.

또한, 검색 엔진(232), 문서 소스(234), 사용자 계정(account) 서비스(236), 마크업 서비스(238), 및 다른 네트워크 서비스(239)가 네트워크(220) 상에서 이용가능하다. 네트워크(220)는 통합 인트라넷, 공중 인터넷, 모바일 폰 네트워크 또는 일부 다른 네트워크, 또는 이들의 임의의 상호접속일 수 있다. In addition, search engine 232, document source 234, user account service 236, markup service 238, and other network services 239 are available on network 220. The network 220 may be an integrated intranet, public internet, mobile phone network or some other network, or any interconnect thereof.

디바이스가 서로 어떻게 연결되는지에 관계없이, 이들은 공지된 상업적 트랜잭션 및 통신 프로토콜(예컨대 인터넷 프로토콜(IP))에 따라 작동할 수 있다. 다양한 실시예에서, 스캐닝 디바이스(202), 컴퓨터(212), 및 모바일 스테이션(216)의 기능 및 용량은 전적으로 또는 부분적으로 하나의 디바이스에 통합될 수 있다. 따라서 스캐닝 디바이스, 컴퓨터, 및 모바일 스테이션 이라는 용어들은, 디바이스가 스캐닝 디바이스(202), 컴퓨터(212), 및 모바일 스테이션(216)의 기능과 용량을 결합하는지 여부에 따라, 동일한 디바이스를 언급하는 것일 수 있다. 더욱이, 검색 엔진(232), 문서 소스(234), 사용자 계정 서비스(236), 마크업 서비스(238), 및 다른 네트워크 서비스(239)의 일부 또는 전체 기능이 임의의 디바이스의 및/또는 다른 미도시된 디바이스 상에 구현될 수도 있다. Regardless of how the devices are connected to each other, they can operate according to known commercial transaction and communication protocols (eg, Internet Protocol (IP)). In various embodiments, the functionality and capacity of the scanning device 202, the computer 212, and the mobile station 216 may be fully or partially integrated into one device. Thus, the terms scanning device, computer, and mobile station may refer to the same device, depending on whether the device combines the capabilities and capacities of scanning device 202, computer 212, and mobile station 216. have. Moreover, some or all of the functionality of search engine 232, document source 234, user account service 236, markup service 238, and other network services 239 may be of any device and / or other functionality. It may be implemented on the device shown.

2.3. 캡쳐 디바이스 2.3. capture device

상술하였듯이, 캡쳐 디바이스는, 렌더링된 문서로부터 이미지 데이터를 캡쳐할 수 있는 광학적 스캐너를 사용하거나 또는 사용자가 텍스트를 소리내어 읽는 것을 캡쳐하는 오디오 레코딩 디바이스를 사용하거나 또는 다른 방법을 사용하여 텍스트를 캡쳐할 수 있다. 캡쳐 디바이스의 일부 실시예는 바코드와 같이 기계판독 가능한 코드를 비롯하여 이미지, 그래픽 심볼, 아이콘 등을 캡쳐할 수 있다. 디바이스는 시스템내의 어딘가에 위치하는 다른 기능성들에 따라서, 트랜스듀서, 일부 저장장치, 및 데이터 인터페이스 정도로 구성되어 극히 간단할 수 있고, 더 많이 완비된 디바이스일 수도 있다. 설명을 위해, 이 섹션에서는 광학적 스캐너 등에 기초하고 적절한 수의 특징을 갖는 디바이스에 대해 설명한다. As noted above, the capture device may capture text using an optical scanner capable of capturing image data from the rendered document, or using an audio recording device that captures the user reading aloud the text, or using other methods. Can be. Some embodiments of the capture device may capture images, graphic symbols, icons, and the like, as well as machine readable code such as barcodes. The device may be extremely simple to configure as a transducer, some storage, and a data interface, depending on other functionalities located somewhere in the system, or may be a more complete device. For illustration purposes, this section describes devices based on optical scanners and the like with an appropriate number of features.

스캐너는 이미지를 캡쳐하고 디지털화하는 잘 알려진 디바이스 이다. 복사기 산업의 한 부류로서, 첫번째 스캐너는 한번에 전체 문서를 캡쳐하는 상대적으로 큰 디바이스 였고, 최근, 펜-타입 핸드헬드 디바이스 등과 같이 편리한 형태의 휴대가능 광학적 스캐너가 도입되었다. Scanners are well known devices for capturing and digitizing images. As a class in the copier industry, the first scanner was a relatively large device for capturing an entire document at one time, and recently, portable optical scanners of a convenient type, such as pen-type handheld devices, have been introduced.

일부 실시예에서, 휴대가능 스캐너는 렌더링된 문서로부터 텍스트, 그래픽, 또는 심볼을 스캔하기 위해 사용된다. 휴대가능 스캐너는 렌더링된 문서로부터 텍스트, 심볼, 그래픽 등을 캡쳐하는 스캐닝 소자를 포함한다. 종이로 프린트된 문서에 더하여, 일부 실시예에서는, 렌더링된 문서가 CRT 모니터 또는 LCD 디스플레이 등과 같은 스크린상에 디스플레이된 문서를 포함한다. In some embodiments, a portable scanner is used to scan text, graphics, or symbols from a rendered document. The portable scanner includes a scanning element that captures text, symbols, graphics, and the like from the rendered document. In addition to documents printed with paper, in some embodiments, the rendered document includes a document displayed on a screen, such as a CRT monitor or LCD display.

도3은 스캐너(302)의 일 실시예의 블록도이다. 스캐너(302)는 렌더링된 문서로부터 정보를 스캔하고 이것을 기계식(machine-compatible) 데이터로 변환하는 광학적 스캐닝 헤드(308), 및 전형적으로 렌즈, 개구, 또는 렌더링된 문서의 이미지를 스캐닝 헤드로 전달하기 위한 이미지 도관(conduit)인 광학적 경로(306)를 포함한다. 스캐닝 헤드(308)는 전하결합소자(CCD), 상보형 금속산화 반도체(CMOS) 이미지 디바이스, 또는 다른 형태의 광학 센서를 결합할 수 있다. 3 is a block diagram of one embodiment of a scanner 302. Scanner 302 is an optical scanning head 308 that scans information from a rendered document and converts it into machine-compatible data, and typically delivers an image of a lens, aperture, or rendered document to the scanning head. Optical path 306, which is an image conduit for the device. The scanning head 308 may combine a charge coupled device (CCD), a complementary metal oxide semiconductor (CMOS) image device, or other type of optical sensor.

마이크(310) 및 연관된 회로는 (소리내어 읽혀진 단어를 포함한) 주위환경의 사운드를 기계식 신호로 변환하며, 다른 입력 기능들도 버튼, 스크롤-휠 또는 터치패드(314)와 같은 다른 촉감센서의 형태로 존재한다. The microphone 310 and associated circuitry converts the sounds of the environment (including the words read aloud) into mechanical signals, and other input functions may also be in the form of other tactile sensors such as buttons, scroll-wheels or touchpads 314. Exists as.

사용자에 대한 피드백은 시각적 디스플레이 또는 지시등(332)를 통해, 또는 스피커나 다른 오디오 트랜스듀서(334)를 통해, 및 진동 모듈(336)을 통해 가능하 게 된다. Feedback to the user is possible via a visual display or indicator 332, or through a speaker or other audio transducer 334, and through the vibration module 336.

스캐너(302)는, 다른 포맷 및/또는 해석으로 수신된 신호를 처리하며 다른 다양한 구성요소들과 상호작용하는 로직(326)을 포함한다. 로직(326)은 RAM, ROM, 플래쉬, 또는 다른 적당한 메모리 등의 관련 저장장치(330)에 저장된 데이터 및 프로그램 명령어를 읽고 기록하도록 동작될 수 있고, 클록 유닛(328)로부터 시간 신호를 읽을 수 있다. 스캐너(302)는 스캐닝된 정보 및 다른 신호들을 네트워크 및/또는 관련 컴퓨팅 디바이스와 통신하는 인터페이스(316)를 또한 포함한다. 일부 실시예에서, 스캐너(302)는 온-보드 전원(332)를 가질 수 있다. 다른 실시예에서, 스캐너(302)는 유니버셜 시리얼 버스(USB) 연결과 같이 다른 디바이스와의 유선 연결로부터 전원이 공급될 수 있다. Scanner 302 includes logic 326 that processes signals received in other formats and / or interpretations and interacts with other various components. Logic 326 may be operable to read and write data and program instructions stored in associated storage 330 such as RAM, ROM, flash, or other suitable memory, and may read time signals from clock unit 328. . Scanner 302 also includes an interface 316 that communicates the scanned information and other signals with the network and / or associated computing device. In some embodiments, scanner 302 may have an on-board power source 332. In another embodiment, the scanner 302 may be powered from a wired connection with another device, such as a universal serial bus (USB) connection.

스캐너(302)의 사용에 대한 예시로서, 독자가 스캐너(302)를 사용하여 신문기사의 일부 텍스트를 스캔할 수 있다. 텍스트는 스캐닝 헤드(308)을 통해 비트맵 이미지로 스캔된다. 로직(326)은 비트맵 이미지가, 클록 유닛(328)으로부터 읽혀진 관련 타임-스탬프와 함께, 메모리(330)에 저장되도록 한다. 또한 로직(326)은 광학적 문자인식(OCR)을 수행하거나, 또는 비트맵 이미지를 텍스트로 변환하기 위한 스캔후(post-scan) 프로세스를 수행할 수 있다. 또한 로직(326)은, 예컨대 문자, 심볼, 또는 대상물의 반복적 발생을 발견하고 이들 반복되는 요소들 사이의 다른 문자, 심볼, 또는 대상물의 숫자 또는 거리를 결정하기 위해 컨벌루션과 유사한 프로세스를 행함으로써, 이미지로부터 기호를 선택적으로 추출할 수 있다. 그러면, 독자는 인터페이스(316)를 통해 관련 컴퓨터에 비트맵 이미지(스캔후 프로세스 가 로직(326)에 의해 행해진다면, 텍스트 또는 다른 기호)를 업로드할 수 있다. As an example of the use of the scanner 302, a reader may use the scanner 302 to scan some text in a newspaper article. The text is scanned into the bitmap image through the scanning head 308. Logic 326 causes the bitmap image to be stored in memory 330, with an associated time-stamp read from clock unit 328. Logic 326 may also perform optical character recognition (OCR), or may perform a post-scan process for converting a bitmap image into text. Logic 326 also performs a convolution-like process, e.g., to find repeated occurrences of a character, symbol, or object and to determine the number or distance of another character, symbol, or object between these repeated elements, You can selectively extract symbols from images. The reader can then upload the bitmap image (text or other symbol if the post-scan process is done by logic 326) to the relevant computer via interface 316.

스캐너(302)의 또다른 사용의 일 예로서, 마이크(310)를 음성 캡쳐 포트로 사용하여 독자가 신문기사의 일부 텍스트를 오디오 파일로서 캡쳐할 수 있다. 로직(326)은 오디오 파일이 메모리(328)에 저장되도록 한다. 로직(326)은 또한 오디오 파일을 텍스트로 변환하기 위해 음성인식 또는 다른 스캔후 프로세스를 수행할수 있다. 그 후, 위에 설명하였듯이 독자는 인터페이스(316)를 통해 관련 컴퓨터에 오디오 파일(또는, 로직(326)에 의해 수행된 스캔후 프로세스에 의해 생성된 텍스트)을 업로드할 수 있다. As another example of the use of the scanner 302, the microphone 310 can be used as a voice capture port to allow a reader to capture some text of a newspaper article as an audio file. Logic 326 causes the audio file to be stored in memory 328. Logic 326 may also perform speech recognition or other post-scan process to convert the audio file into text. The reader can then upload the audio file (or text generated by the post-scan process performed by logic 326) to the associated computer via the interface 316 as described above.

2부 - 코어 시스템의 범위의 개요Part 2-Overview of the scope of the core system

페이퍼-디지털 통합이 점점 흔해지면서, 이 통합을 더 잘 이용하기 위해 변화될 수 있거나 또는 보다 효율적으로 구현할 수 있게 하는 기존 기술의 많은 측면이 있다. 이 섹션은 이러한 이슈들 중 일부에 대한 것이다. As paper-to-digital integration becomes more common, there are many aspects of existing technologies that can be changed or better implemented to better utilize this integration. This section covers some of these issues.

3.3. 검색Search

문서 전집의 검색, 심지어는 월드와이드웹과 같이 거대한 전집에 대한 검색은, 검색엔진에 입력된 검색식을 만들기 위해 키보드를 사용하는 평범한 사용자에게도 이제는 흔한 일이 되었다. 이 섹션 및 다음 섹션은, 캡쳐에 의해 렌더링된 문서에서 유래된 문의(query)의 구성 및 이러한 문의를 다루는 검색엔진의 양 측면에 대해 논의한다. Searching through collections of documents, even large collections such as the World Wide Web, is now commonplace for ordinary users who use the keyboard to create search expressions entered into search engines. This section and the next section discuss the construction of queries derived from documents rendered by capture and both aspects of the search engine that handle these queries.

3.1. 검색 문의로서의 스캔/ 스피크 /타입 3.1. Scan / Speak / Type as Search Inquiry

상술한 시스템의 사용은 전형적으로 상기 섹션 1.2에서 언급한 것을 비롯한 여러 방법중 임의의 것을 사용하여 렌더링된 문서로부터 캡쳐되고 있는 몇몇 단어를 가지고 시작한다. 예컨대 OCR 또는 음성 입력의 경우와 같이 입력이 텍스트로의 변환을 위해 약간의 해석이 필요할 때, 인식 프로세스를 확장하기 위해 문서 전집이 사용되도록, 시스템내에 엔드-투-엔드(end-to-end) 피드백이 있을 수 있다. 엔드-투-엔드 피드백은, 인식 또는 해석의 근사를 수행함으로써, 하나 이상의 후보 매칭 문서의 세트를 식별함으로써, 그리고 그 후, 인식과 해석을 더 정제하고 제한하기 위해 후보 문서에서의 가능한 매치들로부터 정보를 사용함으로써, 적용될 수 있다. 후보 문서는 예상되는 관련성에 따라(예컨대, 이들 문서에서 스캔하였던 사람의 숫자, 또는 인터넷상의 인기도에 근거하여) 가중치가 정해질 수 있고, 이러한 가중치는 이러한 반복되는 인식 프로세스에서 적용될 수 있다. The use of the system described above typically begins with several words being captured from a document rendered using any of several methods, including those mentioned in section 1.2 above. End-to-end in the system so that document collection is used to extend the recognition process when input needs some interpretation for conversion to text, such as for OCR or voice input. There may be feedback. End-to-end feedback is derived from possible matches in the candidate document by performing an approximation of recognition or interpretation, by identifying one or more sets of candidate matching documents, and then further refine and limit recognition and interpretation. By using the information, it can be applied. Candidate documents may be weighted according to expected relevance (eg, based on the number of people scanned in these documents, or popularity on the Internet), and these weights may be applied in this iterative recognition process.

3.2. 단문 검색 3.2. Short search

단어들의 상대적 위치가 알려져 있을 때 일부 단어에 기초한 검색 문의의 선별능력이 매우 커지기 때문에, 본 시스템이 전집에서 텍스트의 위치를 식별하기 위해서는 적은 양의 텍스트만이 캡쳐되어도 된다. 매우 일반적으로, 짧은 문구와 같이 인접하는 시퀀스가 입력 텍스트가 될 수 있다. Since the ability to select search queries based on some words becomes very large when the relative positions of the words are known, only a small amount of text may be captured in order for the system to identify the position of the text in the collection. Very generally, adjacent sequences, such as short phrases, can be input text.

3.2.1. 단문 캡쳐로부터 문서 및 문서내의 위치를 발견 3.2.1. Find documents and locations within documents from short text captures

어떤 문구가 유래하였던 문서를 찾는 것에 더하여, 본 시스템은 그 문서 내에서 위치를 식별할 수 있고 또한 이 지식에 근거하여 액션을 취할 수 있다. In addition to finding the document from which the phrase originated, the system can identify a location within the document and also take action based on this knowledge.

3.2.2. 위치를 찾는 다른 방법 3.2.2. Alternative way to find location

또한 본 시스템은, 워터마크(watermark), 또는 렌더링된 문서상의 다른 특수 한 마크를 사용하는 것 등에 의해, 문서와 위치를 발견하는 다른 방법을 사용할 수 있다. The system can also use other methods of finding the document and location, such as by using a watermark or other special mark on the rendered document.

3.3. 다른 요소를 검색 문의에 결합 3.3. Combine other elements into search queries

캡쳐된 텍스트에 더하여, 다른 요소(즉, 사용자 식별, 프로파일, 및 컨텍스트에 관한 정보)가, 캡쳐 시간, 사용자의 지리적 위치 및 식별, 사용자의 습관 및 최근 활동에 대한 지식 등과 같이, 검색 문의의 일부를 형성할 수 있다. In addition to the captured text, other elements (ie, information about the user's identification, profile, and context) may be part of the search query, such as capture time, the user's geographic location and identification, knowledge of the user's habits, and recent activity. Can be formed.

특히 이러한 것들이 아주 최근이라면, 문서 식별, 및 이전의 캡쳐에 관한 다른 정보가 검색 문의의 일부를 형성할 수 있다. In particular, if these are very recent, other information about document identification and previous capture may form part of the search query.

사용자의 식별은 캡쳐링 디바이스, 및/또는 바이오메트릭(biometric) 또는 다른 보충적 정보(음성 속도, 지문 등)와 연관된 특유의 식별자로부터 결정될 수 있다. The identification of the user may be determined from a unique identifier associated with the capturing device and / or biometric or other supplemental information (voice speed, fingerprint, etc.).

3.4. 검색 문의의 신뢰불능 특성의 지식(OCR 에러 등) 3.4. Knowledge of the unreliable nature of search queries (OCR errors, etc.)

검색 문의는 사용되는 특정 캡쳐 방법에서 발생하기 쉬운 에러 타입을 고려하여 구성될 수 있다. 이것의 일 예로는 특정 문자의 인식시에 의심되는 에러를 지시하는 것인데, 이 경우 검색엔진은 이러한 문자들을 와일드 카드로서 처리하거나 또는 이들 문자에 더 낮은 우선순위를 부여할 수 있다. The search query can be constructed taking into account the type of error that is likely to occur in the particular capture method used. An example of this is to indicate a suspected error in the recognition of certain characters, in which case the search engine may treat these characters as wildcards or give them lower priority.

3.5. 실행/오프라인 사용을 위한 인덱스의 로컬 캐쉬 3.5. Local cache of indexes for run / offline use

때때로 캡쳐링 디바이스가 데이터 캡쳐를 할 때 검색엔진이나 전집과 통신하고 있지 않을 수 있다. 이러한 이유로, 디바이스의 오프라인 사용에 도움이 되는 정보가 미리 이 디바이스에 혹은 이 디바이스가 통신하는 일부 엔티티에 다운로드 될 수 있다. 몇몇 경우에, 전집에 관련된 인덱스의 상당 부분 혹은 전부가 다운로드 될 수 있다. 이에 대해서는 섹션 15.3에서 더 논의될 것이다. Sometimes the capturing device may not be communicating with search engines or collections when capturing data. For this reason, information that helps offline use of the device may be downloaded to this device in advance or to some entity with which the device communicates. In some cases, much or all of the index associated with the collection may be downloaded. This will be discussed further in Section 15.3.

3.6. 어떤 형태를 갖든 문의는 나중에 기록되거나 실행될 수 있다 3.6. Inquiries in any form can be recorded or executed later

만약 문의와 통신하거나 결과를 수신하는 것과 관련된 비용이나 지연이 있을 수 있다면, 이러한 사전-로드된(pre-loaded) 정보는 로컬 디바이스의 성능을 향상시키고, 통신 비용을 낮추며, 또한 유용하고 시의적절한 사용자 피드백을 제공할 수 있다. If there may be a cost or delay associated with communicating the query or receiving the results, this pre-loaded information can improve the performance of the local device, lower the communication cost, and also be useful and timely. User feedback can be provided.

어떠한 통신도 가능하지 않은 경우(로컬 디바이스가 "오프라인"인 경우), 문의가 저장되었다가 통신이 복구되었을 때 시스템의 나머지 부분으로 전송될 수 있다. If no communication is possible (when the local device is "offline"), the query can be saved and sent to the rest of the system when the communication is restored.

이러한 경우 각각의 문의와 함께 타임 스탬프를 전송하는 것이 중요하다. 문의를 해석함에 있어서 캡쳐 시간은 매우 중요한 요소가 될 수 있다. 예컨대, 섹션 13.1은 이전의 캡쳐와 관련하여 캡쳐 시간의 중요성에 대해 논의한다. 캡쳐 시간이 문의가 실행되는 시간과 항상 동일한 것은 아니라는 점에 유의해야 한다. In this case, it is important to send a time stamp with each query. Capture time can be a very important factor in interpreting queries. For example, section 13.1 discusses the importance of capture time with respect to previous captures. Note that the capture time is not always the same as the time the query is executed.

3.7. 병렬 검색 3.7. Parallel search

성능상의 이유로, 단일 캡쳐에 응답하여 다수개의 문의가 일렬 혹은 병렬 중 하나로 개시될 수 있다. 예컨대 새로운 단어가 캡쳐에 추가되거나 혹은 병렬로 다수의 검색엔진에 문의하기 위해, 여러개의 문의가 단일 캡쳐에 응답하여 전송될 수 있다. For performance reasons, multiple queries may be initiated in either serial or parallel in response to a single capture. Multiple queries may be sent in response to a single capture, for example, to add new words to the capture or to query multiple search engines in parallel.

예컨대, 일부 실시예에서, 본 시스템은 현재 문서에 대한 특별 인덱스의 문 의를, 로컬 머신 상의 검색엔진으로, 통합 네트워크 상의 검색엔진으로, 및 인터넷상의 원격 검색엔진으로 전송한다. For example, in some embodiments, the system sends queries of special indexes for the current document to a search engine on a local machine, to a search engine on a unified network, and to a remote search engine on the Internet.

어느 특별한 검색의 결과에 다른 것보다 더 높은 우선순위를 부여할 수도 있다. The result of one particular search may be given a higher priority than the others.

주어진 문의에 대한 응답이 현재 진행중인 문의가 불필요하다는 것을 나타낼 수도 있고, 이 경우 진행중인 문의는 완료전에 취소될 수 있다. The response to a given query may indicate that a query currently in progress is unnecessary, in which case the query in progress may be canceled before completion.

4.4. 페이퍼 및 검색엔진Paper and Search Engines

전통적인 온라인 문의를 다루는 검색엔진이 렌더링된 문서에서 유래하는 것들도 다루는 것이 바람직한 경우가 종종 있다. 상술된 본 시스템에서 사용하기에 더 적절하도록 만들기 위한 많은 방법에 의해서 기존의 검색엔진이 확장되거나 변경될 수 있다. It is often desirable for search engines that deal with traditional online queries to deal with those that come from the rendered document. Existing search engines can be extended or modified in many ways to make them more suitable for use with the present system described above.

본 시스템의 검색엔진 및/또는 다른 구성요소가, 여분의 또는 상이한 특성을 갖는 인덱스를 생성하고 유지할 수 있다. 본 시스템은 유입 페이퍼-유래된 문의를 변경하거나, 혹은 문의가 결과 검색에서 취급되는 방법을 변경할 수 있고, 따라서, 이러한 페이퍼-유래된 문의를 웹 브라우저 및 다른 소스에 타이핑 입력된 문의에서 유입된 것들과 구별할 수 있다. 또한, 다른 소스로부터의 것들과 비교하여 결과들이 페이퍼에서 유래된 검색에 의해 되돌아왔을 때, 본 시스템은 다른 액션을 취하거나 다른 옵션을 제공할 수 있다. 이러한 각각의 접근법이 이하에서 설명된다. The search engine and / or other components of the system may create and maintain indexes with extra or different characteristics. The system can change incoming paper-derived inquiries, or change the way inquiries are handled in the search for results, so that these paper-derived inquiries are typed into web browsers and other sources. Can be distinguished from In addition, when the results are returned by a search derived from a paper as compared to those from other sources, the system may take different actions or provide different options. Each of these approaches is described below.

4.1. 인덱싱 4.1. Indexing

종종, 페이퍼-유래된 문의 또는 전통적인 문의 중 하나를 사용하여 동일한 인덱스가 검색될 수 있지만, 다양한 방법으로 현재 시스템에서의 사용을 위해 인덱스가 강화될 수 있다. Often, the same index can be retrieved using either paper-derived or traditional queries, but the index can be enhanced for use in the current system in a variety of ways.

4.1.1. 페이퍼 형식에 대한 지식 4.1.1. Knowledge of the paper format

페이퍼-기반 검색의 경우에 도움을 줄 수 있는 이러한 인덱스에 여분의 필드가 추가될 수 있다. Extra fields can be added to this index that can be helpful in the case of paper-based searches.

페이퍼 형식에서 문서 유효성(availability)을 나타내는 인덱스 엔트리Index entry indicating document availability in paper format

첫번째 예는, 문서가 존재하거나 혹은 페이퍼 형식으로 배포되었다고 알려졌다는 것을 나타내는 필드이다. 문의가 페이퍼로부터 온 경우, 본 시스템은 이러한 문서에 더 높은 우선순위를 부여할 수 있다. The first example is a field indicating that the document is known to be present or distributed in paper form. If the query comes from paper, the system may give higher priority to such documents.

인기 페이퍼 형식의 지식Popular paper format knowledge

이 예에서는 페이퍼 문서의 인기도에 관련된(및, 선택적으로는, 이들 문서내에서의 세부 영역에 관련된) 통계적 데이터 -예컨대, 스캐닝 동작의 양, 출판자 또는 다른 소스에 의해 제공된 발행부수 등- 가 사용되어, 그러한 문서에 더 높은 우선순위를 부여하고, (예컨대, 브라우저-기반 문의 또는 웹 검색에 대한) 디지털 부본의 우선순위도 올려준다. In this example, statistical data related to the popularity of the paper documents (and, optionally, to specific areas within these documents), such as the amount of scanning operation, the number of publications provided by the publisher or other sources, etc. are used. It gives higher priority to such documents, and raises the priority of digital copies (eg, for browser-based queries or web searches).

렌더링된Rendered 포맷의 지식 Knowledge of the format

또다른 중요한 예는 문서의 특정 렌더링의 레이아웃에 관한 정보를 기록하는 것일 수 있다. Another important example may be to record information about the layout of a particular rendering of a document.

예컨대 서적의 어느 특정 판(edition)과 관련하여, 인덱스는, 어디에서 줄이 끊기고 페이지가 끊기는지, 어떤 폰트가 사용되었는지, 보통과 다른 대문자가 있는 지 등에 대한 정보를 포함할 수 있다. For example, with respect to any particular edition of a book, the index may include information about where the line breaks and page breaks, what fonts are used, and what other capitalizations are common.

또한 인덱스는, 이미지, 텍스트 박스, 표, 및 광고 등과 같은 페이지 상의 다른 항목과의 근접성에 대한 정보도 포함할 수 있다. The index may also include information about proximity to other items on the page, such as images, text boxes, tables, and advertisements.

원본에 대한 의미론적 정보의 사용Use of Semantic Information about the Original

마지막으로, 의미론적(semantic) 정보가 소스 마크업으로부터 추론될 수 있지만, 그러나 페이퍼 문서에서는 명확하지 않은데, 이를테면, 텍스트의 특정 조각이 판매를 위해 제공된 항목을 언급하거나 또는 어느 특정 문장이 프로그램 코드를 포함하고 있다는 사실이 또한 인덱스에 기록될 수 있다. Finally, semantic information can be inferred from the source markup, but it is not clear in the paper document, for example, that a particular piece of text refers to an item provided for sale, or any particular sentence refers to program code. The fact that it is included can also be recorded in the index.

4.1.2. 캡쳐 방법에 대한 지식으로 인덱싱 4.1.2. Index with knowledge of capture methods

인덱스의 특성을 변경할 수 있는 두번째 요소는, 사용될 가능성이 있는 캡쳐 타입에 대한 지식이다. 만약, 인덱스가 OCR 프로세스에서 쉽게 구별될 수 없는 문자를 고려한다거나 또는 문서에서 사용된 폰트에 대한 약간의 지식을 포함하고 있다면, 광학 스캔에 의해 개시된 검색을 하는 것이 유익할 것이다. 유사하게, 문의가 음성 인식에서 유래한다면, 유사한 사운드의 음소(phoneme)에 기초한 인덱스가 보다 더 효과적으로 검색될 것이다. 상술된 모델에서 인덱스의 사용에 영향을 줄 수 있는 또 다른 요소는, 인식 프로세스 동안의 반복 피드백의 중요성이다. 만약 텍스트가 캡쳐됨에 따라 검색엔진이 인덱스로부터 피드백을 제공할 수 있다면, 캡쳐의 정확성을 크게 향상시킬 수 있다. The second factor that can change the characteristics of the index is the knowledge of the capture types that may be used. If the index considers characters that cannot be easily distinguished in the OCR process, or contains some knowledge of the fonts used in the document, it would be beneficial to make a search initiated by optical scan. Similarly, if the query comes from speech recognition, an index based on the phoneme of similar sounds will be searched more effectively. Another factor that may affect the use of indexes in the model described above is the importance of iterative feedback during the recognition process. If the search engine can provide feedback from the index as the text is captured, it can greatly improve the accuracy of the capture.

오프셋을 이용한 인덱싱 Indexing with Offsets

인덱스가 섹션 9에서 상술한 오프셋-기반/오토코릴레이션 OCR 방법을 사용하 여 검색될 가능성이 있을 경우, 일부 실시예에서는, 본 시스템이 적절한 오프셋이나 인덱스의 기호 정보를 저장하고 있다. If the index is likely to be searched using the offset-based / autocorrelation OCR method described above in section 9, in some embodiments, the system stores symbol information of the appropriate offset or index.

4.1.3. 다수의 인덱스 4.1.3. Multiple indexes

마지막으로, 상술된 시스템에서, 통상적으로 다수의 인덱스에 대해 검색을 수행할 수도 있다. 인덱스는 여러 장치나 통합 네트워크 상에서 유지될 수 있다. 일부 인덱스는 캡쳐 디바이스로 다운로드 될 수 있고, 또는 캡쳐 디바이스에 가까운 장치로 다운로드 될 수도 있다. 특별한 관심, 습관, 또는 허가를 갖는 사용자 또는 사용자 그룹에 대해서는 별도의 인덱스가 생성될 수 있다. 인덱스는 각 파일 시스템, 각 디렉토리, 심지어 사용자의 하드디스크 상의 각 파일에 대해 존재할 수 있다. 인덱스는 사용자 및 시스템에 의해 공개되고 서명(subscribe)될 수 있다. 그러면, 배포되고, 업데이트되고, 합병되고, 및 효과적으로 분리될 수 있는 인덱스를 구성하는 것이 중요할 것이다. Finally, in the system described above, it may typically be possible to perform a search over multiple indices. Indexes can be maintained on multiple devices or on converged networks. Some indexes may be downloaded to the capture device, or may be downloaded to a device close to the capture device. Separate indexes may be created for users or groups of users with particular interests, habits, or permissions. An index can exist for each file system, each directory, and even for each file on a user's hard disk. Indexes can be published and subscribed by users and systems. It would then be important to construct indexes that can be distributed, updated, merged, and effectively separated.

4.2. 문의의 취급 4.2. Handling of Inquiries

4.2.1. 캡쳐가 페이퍼로부터 온 것을 앎 4.2.1. 것을 Capture is from paper

검색엔진이 검색 문의가 페이퍼 문서로부터 유래한 것을 인식하였을 때, 검색엔진은 다른 액션을 취할 수 있다. 검색엔진은, 예컨대 특정 캡쳐 방법에 나타날 가능성이 있는 에러 형태에 대해 더 많은 허용을 두는 방식으로 문의를 취급할 수 있다. When the search engine recognizes that the search query comes from a paper document, the search engine can take another action. Search engines can handle queries, for example, in a way that gives them more tolerance for the types of errors that may appear in a particular capture method.

검색엔진은 문의에 포함된 일부 지시자(예컨대 캡쳐의 특성을 나타내는 플래그)로부터 그것을 추론할 수 있고, 또는 문의 자체로부터도 추론할 수 있다(예컨 대, OCR 프로세스에 전형적인 에러나 불확실성을 인식할 수 있다). The search engine can infer it from some of the indicators included in the query (such as a flag indicating the nature of the capture), or it can also infer from the query itself (for example, it can recognize errors or uncertainties typical of the OCR process). ).

대안적으로, 캡쳐 디바이스로부터의 문의는 다른 소스로부터의 그것과는 다른 연결 채널이나 포트나 형태에 의해 엔진에 도달할 수 있고, 그러한 방식으로 구별될 수 있다. 예컨대, 본 시스템의 일부 실시예는 전용 게이트웨이를 통해 문의를 검색엔진으로 보낼 것이다. 따라서, 검색엔진은 전용 게이트웨이를 지나가는 모든 문의가 페이퍼 문서로부터 유래한 것임을 알고 있다. Alternatively, the query from the capture device may reach the engine by a different connection channel or port or type than from another source, and may be distinguished in that way. For example, some embodiments of the system will send a query to a search engine through a dedicated gateway. Thus, the search engine knows that all queries going through the dedicated gateway are from paper documents.

4.2.2. 컨텍스의 이용 4.2.2. The use of context

아래의 섹션 13은, 캡쳐된 텍스트 자체의 외부에 있지만 그러나 문서를 식별하는데 중요한 도움이 될 수 있는 다양한 다른 요소들에 대해 설명한다. 이러한 것들에는 최근 스캔의 이력, 특정 사용자의 장기간 독서 습관, 사용자의 지리적 위치, 및 특정 전자 문서에 대한 사용자의 최근 사용 등이 포함된다. 이러한 요소들은 여기서 "컨텍스트"라고 언급된다. Section 13 below describes various other elements that are outside of the captured text itself but can be of significant help in identifying the document. These include the history of recent scans, the long-term reading habits of a particular user, the geographic location of the user, the user's recent use of a particular electronic document, and the like. These elements are referred to herein as "contexts."

컨텍스트의 일부는 검색엔진 자체에 의해 취급될 수 있고, 검색 결과에 변영될 수도 있다. 예컨대, 검색엔진은 사용자의 스캔 이력을 추적할 수 있고, 이 스캔 이력을 기존의 키보드-기반 문의와 상호-참조(cross-reference)시킬 수도 있다. 그러한 경우, 검색엔진은 각각의 개별 사용자에 대해 대부분의 기존 검색엔진이 하는 것보다 더 많은 상태 정보를 유지하고 사용하고 있으며, 검색엔진과의 각각의 인터랙션은 오늘날 전형적인 것보다 더 긴 기간 및 여러 검색들에 걸쳐 확장한다고 생각될 수 있다. Part of the context can be handled by the search engine itself and can be translated into search results. For example, a search engine may track a user's scan history and may cross-reference this scan history with existing keyboard-based queries. In such cases, the search engine maintains and uses more state information for each individual user than most existing search engines do, and each interaction with the search engine is longer than typical and multiple search It can be thought of as extending across the fields.

컨텍스트의 일부는 검색 문의시 검색엔진으로 전송될 수 있고(섹션 3.3), 장 래의 문의에서 일부로서 역할하기 위해 엔진에 저장될 수도 있고, 따라서 검색엔진으로부터의 결과에 적용되는 필터 또는 부차적 검색이 된다. Part of the context can be sent to the search engine in search queries (section 3.3) and stored in the engine to serve as part of future queries, so that filters or secondary searches applied to results from the search engine are do.

검색에의 데이터-Data-to search 스트림Stream 입력 input

검색 프로세스로의 중요한 입력은, 어떻게 사용자 커뮤니티가 문서의 렌더링된 버전과 인터랙팅하는가에 대한 더 넓은 컨텍스트 이다 -예컨대, 어떤 문서가 가장 널리 읽혀지고 누구에게 읽혀지는가. 이것은, 가장 자주 링크되는 페이지나 지난번 검색결과로부터 가장 자주 선택되는 페이지를 복귀시키는 웹 검색과 유사하다. 이에 대한 더 자세한 논의는 섹션 13.4 및 14.2를 참조. An important input to the retrieval process is the wider context of how the user community interacts with the rendered version of the document-for example, which documents are most widely read and to whom. This is similar to a web search that returns the most frequently linked page or the most frequently selected page from the last search result. See sections 13.4 and 14.2 for a more detailed discussion of this.

4.2.3. 문서 서브-영역 4.2.3. Document sub-area

상술된 본 시스템은 전체로서 문서에 대한 정보 뿐만 아니라 문서의 서브-영역에 대한, 심지어 개개의 단어까지에 대한 정보도 발행하고(emit) 사용한다. 기존의 많은 검색엔진들은 단지 특정 문의에 관련된 파일이나 문서를 찾는 것에 관심을 집중하고 있다. 더 미세한 것(finer grain)에도 작용하여 문서 내에서의 위치까지도 식별할 수 있는 검색엔진은 상술한 본 시스템에 큰 이익을 제공할 것이다. The system described above issues and uses not only information about the document as a whole, but also information about sub-regions of the document, even individual words. Many existing search engines focus on finding files or documents related to a particular query. Search engines that work on finer grains and can even identify locations within a document will provide a significant benefit to the system described above.

4.3. 결과를 제공함 4.3. Provided results

검색엔진은, 제공된 결과에 영향을 주기 위해 현재 유지하고 있는 추가 정보 중 일부를 사용할 수 있다. The search engine may use some of the additional information it currently maintains to influence the results provided.

또한 본 시스템은, 단지 페이퍼 카피에 속하고 있다는 것의 결과로서 사용자가 액세스하고 있는 특정 문서를 제공할 수도 있다(섹션 7.4 참조). The system may also provide the particular document that the user is accessing as a result of just belonging to a paper copy (see section 7.4).

또한 검색엔진은, 텍스트의 간단한 검색을 넘어서, 상술한 시스템에 적합한 새로운 액션이나 옵션을 제공할 수도 있다. Search engines can also go beyond simple search of text and provide new actions or options suitable for the systems described above.

5.5. 마크업, 주석, 및 메타데이터Markup, Annotation, and Metadata

캡쳐-검색-발견(retrieve) 프로세스를 수행하는 것에 더하여, 상술한 본 시스템은 여분의 기능성을 문서와 연관시키고, 보다 구체적으로는, 문서 내의 텍스트의 특정 위치나 세그먼트와 연관시킨다. 이 여분의 기능성은 종종, 전적으로 그런 것은 아니지만, 그의 전자 부본과 연관됨으로 인해 렌더링된 문서와 연관된다. 일 예로서, 웹 페이지의 하이퍼링크는 그 웹페이지의 프린트 출력물이 스캔될 때 동일한 기능성을 가질 수 있다. 몇몇의 경우, 이러한 기능성은 전자 문서에 한정되지 않고, 다른 곳에서도 저장되거나 생성된다. In addition to performing a capture-search-retrieve process, the system described above associates extra functionality with a document, and more specifically with a particular location or segment of text in the document. This extra functionality is often associated with the rendered document due to, but not entirely, its electronic copy. As one example, a hyperlink of a web page may have the same functionality when the print output of that web page is scanned. In some cases, this functionality is not limited to electronic documents, but stored or created elsewhere.

이러한 부가 기능성의 층(layer)은 여기서 "마크업"이라고 언급된다. This layer of additional functionality is referred to herein as "markup."

5.1. 오버레이 , 정적 및 동적 5.1. Overlays , static and dynamic

마크업에 대해 생각하는 한가지 방법은 문서 상의 "오버레이"로서 인데, 이것은 문서 또는 문서의 일부분에 대한 추가 정보를 제공하거나, 문서 또는 문서의 일부분에 연관된 액션을 특정할 수 있다. 마크업은 사람이 판독할 수 있는 컨텐트를 포함할 수 있지만, 종종 사용자에게 보이지 않거나 및/또는 기계적 사용으로만 의도된다. 그러한 예로서, 사용자가 렌더링된 문서의 특정 영역으로부터 텍스트를 캡쳐하거나 혹은 특정 문구의 발음을 설명하는 오디오 샘플을 캡쳐할 때, 근처의 디스플레이 상에 팝업 메뉴가 디스플레이되는 옵션이 있다. One way to think about markup is as an "overlay" on a document, which can provide additional information about the document or part of the document, or specify an action associated with the document or part of the document. Markups may include human readable content, but are often intended to be invisible to the user and / or intended for mechanical use only. As such an example, when the user captures text from a specific area of the rendered document or captures an audio sample describing the pronunciation of a particular phrase, there is an option to display a pop-up menu on the nearby display.

5.1.1. 여러 소스로부터 가능한 여러 층 5.1.1. Multiple layers available from different sources

임의의 문서는 다수의 오버레이를 동시에 가질 수 있고, 이들은 다양한 위치 로부터 유래될 수 있다. 마크업 데이터는 문서의 작가, 사용자, 또는 또다른 측에 의해 생성되거나 공급될 수 있다. Any document can have multiple overlays at the same time and they can be derived from various locations. Markup data may be generated or supplied by the writer, user, or another side of the document.

마크업 데이터는 전자 문서에 첨부되거나 또는 그에 내장될 수 있다. 마크업 데이터는 종래의 위치에서(예컨대, 동일한 문서이지만 다른 파일네임 접미사를 갖는 곳에) 발견될 수 있다. 마크업 데이터는 원본 문서의 위치를 찾은 문의의 검색결과에 포함될 수 있고, 또는 동일한 또는 상이한 검색엔진에서의 별개의 문의에 의해 발견될 수 있다. 마크업 데이터는 최초의 캡쳐된 텍스트 및 다른 캡쳐 정보 또는 컨텍스트 정보를 사용하여 발견될 수 있고, 또는 캡쳐의 위치와 문서에 대한 기존에 추론된 정보를 사용하여 발견될 수 있다. 마크업 그 자체가 문서에 포함되어 있지 않더라도, 마크업 데이터는 문서에 특정되어 있는 위치에서 발견될 수 있다. Markup data may be attached to or embedded in an electronic document. Markup data can be found in conventional locations (eg, where the same document but with a different file name suffix). Markup data may be included in the search results of the query that locates the original document, or may be found by separate queries in the same or different search engines. Markup data can be found using the original captured text and other capture information or contextual information, or can be found using the location of the capture and existing inferred information about the document. Even if the markup itself is not included in the document, the markup data can be found at a location specific to the document.

종래의 html 웹페이지에 대한 링크가 html 문서 내에 정적 데이터(static data)로서 종종 내장되는 방식과 유사하게, 마크업은 문서에 대개 정적이고 특유한 것일 수 있다. 그러나 마크업은 많은 수의 문서에 대해 동적으로 생성 및/또는 적용 될 수도 있다. 동적 마크업의 일 예는, 문서에서 언급된 회사의 최신 주식가격을 포함하는 그 문서가 첨부되어 있는 정보이다. 널리 적용되는 마크업의 일 예는, 다수의 문서나 문서의 섹션에 대해 특정 언어로 자동적으로 이용가능한 번역 정보이다. Similar to the way in which links to conventional html web pages are often embedded as static data in html documents, markup can be mostly static and specific to the document. However, markup may be dynamically generated and / or applied to a large number of documents. One example of dynamic markup is information to which the document is attached, including the latest stock prices of the companies mentioned in the document. One example of markup that is widely applied is translation information that is automatically available in a particular language for multiple documents or sections of the document.

5.1.2. 개인 "플러그-인" 층 5.1.2. Personal "plug-in" floor

사용자는 또한 마크업 데이터를 인스톨하거나 그것의 특정 소스에 서명할 수 있고, 따라서 특정 캡쳐에 대한 시스템 반응을 개인화시킨다. The user can also install markup data or sign its specific source, thus personalizing the system response to a particular capture.

5.2. 키워드 및 문구, 상표 및 로고 5.2. Keywords and phrases, trademarks and logos

문서의 일부 요소는, 특정 문서내의 위치 보다는 그들 자체의 특성에 기초하여 그들과 연관되어 있는 기능성 또는 특정 "마크업"을 가질 수 있다. 예로서, 사용자에게 어느 조직에 관한 추가 정보를 링크시킬 수 있는 로고 및 상표 뿐만 아니라, 순수하게 스캔될 목적으로 문서에서 프린트되는 특정 마크가 있다. 동일한 원리가 텍스트 내의 "키워드" 또는 "키 문구"에도 적용된다. 조직들은 그들이 연관되어 있는 혹은 그들이 연관되고 싶어하는 특정 문구들을 등록할 수 있고, 그 문구가 어디에서 스캔되든지 간에 이용가능하게 되는 특정 마크업을 부착한다. Some elements of a document may have a specific "markup" or functionality associated with them based on their own characteristics rather than location within a particular document. By way of example, there are logos and trademarks that can link the user to additional information about an organization, as well as specific marks printed on the document for purely scanning purposes. The same principle applies to "keywords" or "key phrases" in text. Organizations can register specific phrases they are related to or want to be associated with and attach specific markup that becomes available wherever the phrase is scanned.

어떤 단어, 문구 등도 연관 마크업을 가질 수 있다. 예컨대, 본 시스템은, 사용자가 언제 단어 "책" 또는 책제목 또는 책에 관련된 토픽을 캡쳐하든지 간에, 특정 항목을 팝업 메뉴에 추가할 수 있다(예컨대 온라인 서점으로의 링크). 본 시스템의 일부 실시예에서, 단어 "책", 책 제목, 또는 책에 관련된 토픽 근처에서 캡쳐가 발생하였는지를 판단하기 위해 디지털 부본 문서 또는 인덱스가 참조될 수 있고, 시스템의 움직임이 키워드 요소의 이러한 근접성에 따라 변경된다. 앞서의 일예에서, 마크업으로 인해, 비상업적 텍스트 또는 문서로부터 캡쳐된 데이터가 상업적 트랜잭션을 유발할 수 있다는 점에 유의하라. Any word, phrase, etc. can have an associated markup. For example, the system may add a particular item to a pop-up menu (eg, a link to an online bookstore) whenever the user captures the word "book" or book title or topic related to the book. In some embodiments of the present system, a digital copy document or an index may be referenced to determine if a capture occurred near the word "book", book title, or topic associated with the book, and the movement of the system may be such a proximity of the keyword element. Will change accordingly. In the above example, note that due to markup, data captured from non-commercial text or documents can cause commercial transactions.

5.3. 유저-공급된 콘텐트 5.3. User -supplied content

5.3.1. 멀티미디어를 포함한, 유저 코멘트 및 주석 5.3.1. User comments and annotations , including multimedia

주석은 문서와 연관될 수 있느 다른 유형의 전자 정보이다. 예로서, 유저는 음성 주석으로서 이후의 검색을 위해 특정 문서에 대한 자신의 생각의 오디오 파일을 첨부할 수 있다. 멀티미디어 주석의 다른 예로서, 유저는 문서라 칭해지는 곳에 사진을 첨부할 수 있다. 유저는 문서를 위한 주석을 공급하지만 시스템은 다른 소스로부터의 주석을 연관시킬 수 있다(예로서, 워크 그룹내의 기타 유저를 공유한다).Annotation is another type of electronic information that can be associated with a document. As an example, a user may attach an audio file of his or her thoughts on a particular document for later retrieval as a voice annotation. As another example of a multimedia annotation, a user may attach a photo to what is called a document. The user supplies annotations for the document but the system can associate annotations from other sources (eg, share other users in the workgroup).

5.3.2. 프루프 -판독으로부터의 노트 5.3.2. Proof -notes from reading

유저-소스화된 마크업의 중요한 예는 프루프-판독, 편집 또는 검토 목적의 일부분으로서 종이 문서로 된 주석이다. An important example of user-sourced markup is a paper document annotation as part of proof-reading, editing or reviewing purposes.

5.4. 써드 -파티 콘텐트 5.4. Third -Party Content

상기한 바와 같이, 마크업 데이터는 문서의 다른 판독제에 의하는 바와 같이, 써드-파티에 의해 공급될 수 있다. 온라인 토의 및 검토인, 특정 작업, 자원봉사자에 의한 번역 및 설명에 관한 컴뮤니티-관리 정보와 같은 것은 좋은 예이다. As mentioned above, the markup data may be supplied by a third party, such as by another reading agent of the document. Good examples are community-managed information about online discussions and reviewers, specific tasks, translations and explanations by volunteers.

써드-파티 마크업의 다른 예는 광고주에 의해 제공된다. Another example of third-party markup is provided by an advertiser.

5.5. 유저의 데이터 스트림에 기초한 동적 마크업 5.5. Dynamic markup based on the user 's data stream

시스템의 여러 또는 모든 유저에 의해 문서로부터 획득된 데이터를 분석함에 의해, 마크업은 컴뮤니의 활동 및 관심사를 기초로 발생될 수 있다. 예로서는 "이 책을 즐긴 사람은 또한 ...도 즐긴다"라는 것을, 유저에게 말하는 ㅈ 또는 마크업을 생성하는 온라인 책방일 수 있다. 마크업은 익명성이 덜 할 수 있고, 유저에게 그의 계약 리스트의 어느 사람이 이 문서를 최근에 읽었는 지를 알려줄 수 있다. 데이터스트림 분석의 다른 예는 섹션 14에 포함되어 있다.By analyzing data obtained from documents by several or all users of the system, markup may be generated based on the activity and interests of the community. An example could be an online bookstore that generates a markup or tells the user that "the person who enjoys this book also enjoys ...". The markup can be less anonymous and can tell the user which person on his contract list has recently read this document. Another example of datastream analysis is included in section 14.

5.6. 외부 이벤트 및 데이터 소스에 기초한 마크업 5.6. Markup based on external events and data sources

마크업은 흔히, 통합된 데이터베이스로부터의 입력, 공중 인터넷으로부터의 정보 또는 로컹 운영체제에 의해 수집된 통계치와 같은, 데이터 소스 및 외부 이벤트에 기초한다. Markup is often based on data sources and external events, such as input from an integrated database, information from the public Internet, or statistics collected by the local operating system.

데이터 소스는 더욱 로컬일 수 있고, 특히 유저의 신분, 지역 및 활동과 같은 유저의 콘텍스트에 대한 정보를 제공할 수 있다. 예로서, 시스템은 유저의 모바일 폰과 통신할 수 있고 유저가 최근에 전화로 통화나 누군가에게 문서를 전송할 옵션을 부여하는 마크업 층을 제공한다. The data source may be more local and may provide information about the user's context, such as the user's identity, region, and activity in particular. By way of example, the system can provide a markup layer that can communicate with a user's mobile phone and give the user the option to send a document or call someone recently by phone.

6. 인증, 개인유별화 및 보안 6. Authentication, Personalization and Security

대부분의 상황에서, 유저의 신분은 공지되어진다. 때때로 이것은 "익명 신분"으로 되고, 요기서 유저는 예로서 캡쳐 디바이스의 일련번호에 의해서만 식별된다. 통상적으로, 시스템은 시스템을 개인유별화하는 데에 사용될 수 ㅣㅇㅆ고 활동 및 트랜잭션이 유저의 이름으로 수행될 수 있게 하기 위해, ㄱ에 대한 더욱 상세한 지식을 갖는 것이 예상된다. In most situations, the user's identity is known. Sometimes this becomes an "anonymous identity", where the user is identified only by the serial number of the capture device, for example. Typically, a system can be used to personalize the system and it is expected to have a more detailed knowledge of A in order to allow activities and transactions to be performed in the name of the user.

6.1. 유저 이력 및 "수명 라이브러리" 6.1. User history and "lifetime library"

시스템이 수행할 수 있는 가장 간명하고 유용한 기능중의 하나는 유저가 캡쳐한 텍스트에 대한 유저의 레코드 및, 발견된 임의의 문서에 대한 상세사항, 이 문서내의 위치 및 결과적으로 취해진 임의의 액션을 포함하는, 그 캡쳐에 관련된 임의 추가 정보를 유지하는 것이다. One of the most concise and useful functions that the system can perform is the user's record of the text captured by the user, details of any document found, its location within this document, and any action taken as a result. To retain any additional information related to the capture.

이 저장된 이력은 유저 및 시스템 모두에 유익하다. This stored history is beneficial for both the user and the system.

6.1.1 유저를 위해 6.1.1 for the user

유저에게는, 유저가 판독 및 캡쳐한 모든 것에 대한 레코드인, "수명 라이브러리"가 주어질 수 있다. 이것은 단순히 개인적인 관심사일 수 있지만, 예로서 그의 다음 페이퍼의 섹인을 위한 자료를 수집하는 학자에 의해 라이브러리에 사용될 수 도 있다. The user can be given a "life library", which is a record of everything the user has read and captured. This may simply be a personal concern, but may be used in a library by a scholar, for example, collecting data for the section of his next paper.

몇몇 환경에서, 유저는 다른 사람들이 판독하는 알고 관심사항을 발견할 수 있도록, 웹로그에 마찬가지 방식으로 웹상에서 그것을 출판함에 의해, 라이브러리가 공표되길 바랄 수 있다. In some circumstances, a user may wish to publish a library by publishing it on the Web in a similar way to a weblog, so that others can know and read about it.

마지막으로, 유저가 일부 텍스트를 획득하고 시스템이 이 캡쳐에 대해 즉시 작용할 수 없는 경우에(예로서, 문서의 전자 버젼이 아직 이용블가능하기 때문에) 캡쳐는 라이브러리에 저장될 수 있고 유저의 요구에 따라 또는 자동으로, 이후에 처리될 수 있다. 유저는 새로운 마크업 서비스에 가입할 수 있고 이것들을 이전에 캡쳐된 스캔에 적용할 수 있다. Finally, if the user acquires some text and the system cannot immediately act on this capture (eg, because an electronic version of the document is still available), the capture can be stored in a library and meet the needs of the user. Accordingly or automatically, afterwards. The user can subscribe to new markup services and apply them to previously captured scans.

6.1.2. 시스템을 위해 6.1.2. For the system

유저의 과거 캡쳐에 대한 레코드는 시스템을 위해서도 유용하다. 시스템 동작의 대다수 태양은 유저의 판독 습관 및 이력을 앎으로써 향상될 수 있다. 가장 간명한 예는 유저에 의해 행해진 임의의 스캔은 유저가 가장 최근에 스캔한 문서로부터 올 가능성이 높고 특히 이전 스캔이 최종 수분 전에 행해졌다면 그것은 동일한 문서로부터 올 가능성이 매우 높다. 마찬가지로, 문서는 시작-끈 순서로 판독될 가능성이 더 높다, 따라서, 예로서, 영어 문서인 경우, 나중의 스캔은 문서에서 더욱 멀리 아래로 발생할 가능성이 높다. 그러한 요인들은 모호성의 경우에 시스템이 캡쳐의 위치를 수립하는 데에 도움을 줄 수 있고, 또한 캡쳐되어야 할 텍스트의 양을 감소시킬 수 있다. Records of past captures of users are also useful for the system. Many aspects of system operation can be improved by knowing the user's reading habits and history. The most straightforward example is that any scan done by the user is likely to come from the document the user last scanned, especially if the previous scan was done a few minutes before the last one. Similarly, documents are more likely to be read in start-off order, so, for example, in the case of English documents, later scans are more likely to occur farther down in the document. Such factors can help the system locate the capture in the case of ambiguity and can also reduce the amount of text to be captured.

6.2 지불 , 아이덴티티 및 인증 디방스와 같은 스캐너 6.2 Scanners such as Payment , Identity and Authentication Defense

캡쳐 프로세스는 광학 스캐너 또는 음성 레코더와 같은 일정 종류의 디바이스로 시작하기 때문에, 이 디바이스는 유저를 식별하고 일정한 액션을 인증하는 키로서 사용될 수 있다. Since the capture process begins with some kind of device, such as an optical scanner or voice recorder, the device can be used as a key to identify the user and authenticate certain actions.

6.2.1. 스캐너와 전화 또는 기타 계정과의 연관 6.2.1. Association of scanner with phone or other account

디바이스는 모바일 폰에 내장되거나 또는 기타 방식으로 모바일 폰 계정과 연관될 수 있다. 예로서, 스캐너는 모바일 폰 계정과 연관된 SIM 카드를 스캐너에 삽입함에 의해 모바일 폰 계정과 연관될 수 있다. 마찬가지로, 디바이스는 크레딧 카드 또는 기타 지불 카드에 내장되거나 또는 그것에 연결되는 카드를 위한 기능설비를 가질 수 있다. 디바이스는 지불 토큰으로 사용될 수 있고, 지불 트랜잭션은 렌더링된 문서로부터의 캡쳐에 의해 개시될 수 있다. The device may be embedded in the mobile phone or otherwise associated with the mobile phone account. By way of example, the scanner may be associated with a mobile phone account by inserting a SIM card associated with the mobile phone account into the scanner. Similarly, the device may have functionality for a card embedded in or connected to a credit card or other payment card. The device can be used as a payment token and the payment transaction can be initiated by capture from the rendered document.

6.2.2. 인증을 위해 스캐너 입력 사용 6.2.2. Use scanner input for authentication

스캐너는 유저 또는 계정과 연관된 몇몇 토큰, 심볼 또는 텍스트를 스캐닝하는 프로세스를 통해 특정 유저또는 계정과도 연관될 수 있다. 또한, 스캐너는 예로서 유저의 지문을 스캐닝함에 의해, 생체정보를 위해 사용될 수 있다. 오디오-기잔 캡쳐 디바이스의 경우에, 시스템은 유저의 음성 패턴을 매칭함에 의해 또는 유저가 일정한 암호 또는 구절을 말하도록 요구함에 의해 유저를 식별할 수 있다. The scanner may also be associated with a particular user or account through the process of scanning for some tokens, symbols or text associated with the user or account. The scanner may also be used for biometric information, for example by scanning a user's fingerprint. In the case of an audio-based capture device, the system may identify the user by matching the user's voice pattern or by requiring the user to speak a certain password or phrase.

예로서, 유저가 책으로부터 일정한 인용구를 스캐너하거나 온라인 판매자로부터 책을 구입할 옵션이 제공된다면, 유저는 이 옵션을 선택할 수 있고, 그후 트랜잭션을 확인하기 위해 유저의 지문을 스캔하도록 프롬프팅한다. As an example, if the user is provided with the option to scan certain quotes from the book or purchase the book from an online merchant, the user may select this option and then prompt to scan the user's fingerprint to confirm the transaction.

섹션 15.5. 및 15.6.도 참조 하시요Section 15.5. See also 15.6.

6.2.3. 보안 스캐닝 디바이스 6.2.3. Security scanning devices

캡쳐 디바이스가 유저를 인증 및 식별하기 위해 그리고 유저를 대신하여 트랜잭션을 개시시키기 위해 사용되는 경우, 시스템의 다른 부분과 디바이스간의 통신이 보안유지되는 것이 매우 중요하다. 또한 다른 디바이스가 스캐너를 대역하는 상황, 또는 다른 컴포넌트와 디바이스간의 통신이 인터셉트되는 소위 "중간에 낀 사람" 공격에 대해 보호하는 것이 매우 중요하다. When a capture device is used to authenticate and identify a user and initiate a transaction on behalf of the user, it is very important that the communication between the device and other parts of the system is secured. It is also very important to protect against situations where other devices band the scanner, or so-called "intermediate" attacks where communication between other components and devices is intercepted.

그러한 보안을 제공하기 위한 기술은 당업계에서 양호하게 잘 이해되고; 다양한 실시예, 디바이스에서의 하드웨어 및 소프트웨어 및 시스템네의 다른 곳은 그런 기술을 구현하기 위해 구성된다. Techniques for providing such security are well understood in the art; Various embodiments, hardware and software in the device, and elsewhere in the system, are configured to implement such techniques.

7. 모델 및 엘리먼트 출판 7. Model and Element Publishing

상기한 시스템의 이점은 시스템의 다수의 이점을 얻기 위해 문서를 생성, 인쇄 및 출판하는 종래의 프로세스를 변경할 필요가 없다는 것이다. 문서의 생성자 또는 출판자-이후엔 단순히 "출판자"로서 참조됨-는 상기한 시스템을 지원하는 기능을 생성하길 원하는 이유가 있다. The advantage of such a system is that there is no need to change the conventional process of creating, printing and publishing documents in order to take advantage of the system's many advantages. The creator or publisher of the document, hereafter simply referred to as the "publisher", has a reason for wanting to create a function that supports the above system.

이 섹션은 주로 출판된 문서와 관련된다. 광고와 같은, 기타 상용 트랜잭션에 관한 정보에 대해서는 " P-커머스"라는 제목의 섹션 10을 참조하시요.This section is mainly related to published documents. See section 10 titled "P-Commerce" for information about other commercial transactions, such as advertisements.

7.1. 인쇄된 문서에 대한 전자 컴패니언 7.1. Electronic companion to the printed document

시스템은 문서가 연관된 전자 프레즌스를 갖는 것을 허용한다. 종ㅎ래에 출판자는 CD-ROM에 추가 디지털 정보, 교습 영화 및 기타 멀티미디어 데이터, 샘플 코드 또는 문서, 또는 추가의 기준 재료등을 포함하는 북을 탑재한다. 또한, 몇몇 출판자는 출판 시점 후 갱신될 수 있는 정보 및, 오자, 추가 코멘트, 갱산된 기준 재료, 색인 및 관련 데이터의 추가 소스, 및 다른 언어로의 번역과 같은, 그러한 재료를 제공하는 특정 출판물과 연관된 웹사이트를 유지한다. 온라인 포럼은 독자가 출판물에 대해 그들의 코멘트를 달 수 있도록 한다. The system allows the document to have an associated electronic presence. Often, publishers place books on CD-ROMs containing additional digital information, teaching movies, and other multimedia data, sample code or documents, or additional reference materials. In addition, some publishers may have information that may be updated after the time of publication, and certain publications that provide such material, such as typos, additional comments, refined reference materials, indexes, and additional sources of related data, and translations into other languages. Maintain an associated website. Online forums allow readers to comment on publications.

상기한 시스템은 그러한 재료들이 이전 것 보다 렌더링된 문서에 더욱 밀접하게 결합되는 것을 허용하고, 그리고 그들의 발견과 그들과의 상호작용이 유저에 대해 더욱 용이하게 될 수 있도록 한다. 문서로부터의 텍스트의 일부를 캡쳐함에 의해, 시스템은 자동적으로 유저를 문서와 연관된 디지털 재료에 자동적으로 연결시킬 수 있고, 더욱 상세히는 문서의 특정 부분과 연관된다. 마찬가지로, 유저는 텍스트의 그 섹션을 토의하는 온라인 커뮤니티에, 그리고 다른 독자에 의해 주석 및 해석에 연결될 수 있다. 과거에, 그러한 정보는 통상적으로 특정 쪽전호 또는 장을 탐색함에 의해 차아질 필요가 있곤 했다. The above system allows such materials to be more closely coupled to the rendered document than before, and their discovery and interaction with them can be made easier for the user. By capturing a portion of the text from the document, the system can automatically connect the user to the digital material associated with the document, more specifically associated with a particular portion of the document. Similarly, a user can be connected to comments and interpretation by an online community that discusses that section of text, and by other readers. In the past, such information would typically need to be filled by searching for a particular page or chapter.

이러한 예시적 애플리케이션은 학문적 교재 분야이다(섹션 17.5). This exemplary application is in the field of academic texts (section 17.5).

7.2. 인쇄된 문서에 대한 "가입" 7.2. "Sign Up" for Printed Documents

몇몇 출판자는 독자들이 새로운 관련 자료를 통지받기 원한다면 또는 책의 새로운 판이 출판되는 경우 가입할 수 있는 메일링 리스트를 가질 수 있다. 설명 된 시스템으로, 유저는 특정 문서 또는 문서의 일부분에서 관심사항을 등록할 수 있고, 출판자가 임의의 그러한 기능을 제공하는 것을 고려하기 이전인 경우에도 등록할 수 있다. 독자의 관심사항은 출판자에게 제공될 수 있고, 언제 및 어느 때 갱신본, 추가 정보, 세로운 판 또는 기존 책에서 관심있는 것으로 wmd명된 주제에 관한 완전히 새로운 출판물을 제공할 지에 대한 그들의 결정에 영향을 미칠 수 있다. Some publishers may have a mailing list that readers can subscribe to if they want to be notified of new relevant material or if a new edition of the book is published. With the described system, a user can register an interest in a particular document or portion of a document, and even before the publisher considers providing any such functionality. Readers' interests may be provided to publishers and may influence their decisions as to when and when to provide entirely new publications on updates, additional information, new editions, or topics that have been identified as interest in existing books. Can be.

7.3. 특정한 의미를 갖춘 또는 특정한 데이터를 포함하는 인쇄된 마크 7.3. Printed marks with specific meaning or containing specific data

시스템의 다수의 태양은 문서에 이미 존재하는 텍스트의 사용을 통해 단순히 인에이블된다. 문서가, 시스템과 연관지어 사용될 수 있다는 지식하에서 산출된다면, 여분의 기능은 특정 마크 형태로 여분의 정보를 인쇄함에 의해 추가될 수 있고, 이것은 텍스트 또는 필요한 액션을 식별하는 데에 사용될 수 있고, 또는 그렇지않으면 시스템과 문서의 상호작용을 향상시킨다. 가명하고 가장 중요한 예는 문서가 명확하게 시스템을 통해 액세스가능하다는 것을 독자에게 지시하는 것이다. 특정항 아이콘이, 그 문서가 그것과 연관된 온라인 포럼을 갖는다는 것을 지시하는 데에 사용될 수 있다. Many aspects of the system are simply enabled through the use of text already present in the document. If the document is produced under the knowledge that it can be used in conjunction with the system, the extra functionality can be added by printing the extra information in the form of a specific mark, which can be used to identify the text or action required, or Otherwise, it improves the system's interaction with the document. An obvious and most important example is to instruct the reader that the document is clearly accessible through the system. A particular term icon can be used to indicate that the document has an online forum associated with it.

그러한 심볼은 순전히 독자를 의도한 것일 수 있거나, 일정한 액션을 개시시키는 데에 사용되고 스캐닝된 경우 시스템에 의해 인식될 수 있다. 충ㅂ준한 데이터는 심볼 이상의 것을 식별하기 위해 심볼로 인코딩 될 수 있고; 그것은 또한 시에 의해 인식 및 판독될 수 있는, 심볼의 위치, 판, 문서에 관한 정보를 저장할 수 있다. Such symbols may be purely intended for the reader or may be recognized by the system when used and scanned to initiate certain actions. Full data can be encoded into symbols to identify more than symbols; It can also store information about the position of the symbol, the version, the document, which can be recognized and read by the poem.

7.4. 페이퍼 문서의 소유를 통한 인증 7.4. Authenticate by owning a paper document

인쇄 문서의 소유 또는 그에 대한 액세스가, 예로서 문서의 전자 복사본 또는 추가 재료로의 액세스와 같은, 일정한 특권을 유저에게 부여하는 몇몇 상황이 있다. 상기한 시스템으로, 그러한 특권은 유저가 문서로부터 텍스트의 일부분을 캡쳐링하거나, 또는 특정하게 인쇄된 심볼을 스캐닝함에 의한 결과로서 간단하게 허여될 수 있다. 시스템이, 유저가 전체 문서를 소유하고 있었다는 것을 보장할 것이 필요로 되는 경우에, 유저에게 특정 페이지로부터 특정 항목 또는 어구, 예로서 " 페이지 46의 두번째 라인"과 같은 것을 스캐닝하도록 프롬프팅할 수 있다. There are some situations in which ownership of or access to a printed document grants a user certain privileges, such as, for example, access to an electronic copy or additional material of the document. With such a system, such privileges can be granted simply as a result of the user capturing a portion of the text from the document, or scanning a particular printed symbol. If the system needs to ensure that the user has owned the entire document, it can prompt the user to scan a particular item or phrase, such as "second line of page 46," from a particular page. .

7.5. 만료하는 문서 7.5. Expired documents

인쇄 문서가 여분의 재료 및 기능으로의 게이트웨이이면, 그러한 특징으로의 액세스는 시간제약에 놓일 수 있다. 만료날짜 후, 유저는 요금을 지불하거나 상기와 같은 특징들을 다시 액세스하기 위해 문서의 새로운 버젼을 획득하는 것이 요구될 수 있다. 페이퍼 문서는 물론 여전히 사용가능하지만, 그 향상된 전자 기능의 일부를 손실하게 된다. 이것은, 출판자가 전자 재료로의 액세스를 위한 요금을 수납하는 경우, 또는 유저가 때때로 새로운 버젼을 획득힐 필요가 있는 경우, 이점이 있기 때문에, 또는 배포중 남아있는 인쇄된 문서의 오래된 버젼과 연관된 단점이 있기 때문에, 소망된다. If the printed document is a gateway to extra material and functionality, access to such features may be time limited. After the expiration date, the user may be required to obtain a new version of the document to pay a fee or to access such features again. Paper documents are still available, of course, but they lose some of their enhanced electronics. This is advantageous if the publisher receives a fee for access to the electronic material, or if the user sometimes needs to obtain a new version, or is associated with an older version of the printed document that remains during distribution. Because there is, it is hoped.

7.6. 인기도 분석 및 출판 결정 7.6. Popularity Analysis and Publishing Decisions

섹션 10.5는 광고 가격 및 저자의 보상에 영향을 미치는 시스템의 통계치의 사용을 토의한다. Section 10.5 discusses the use of the statistics of the system to influence advertising prices and author compensation.

몇몇 실시예에서, 시스템은 페이퍼 문서뿐 아니라 그것과 연관된 전자 커뮤니팅서의 활동으로부터 출판물의 인기도를 추론한다. 이들 요인들은 출판자 미래에 출판할 것에 대한 결정을 행하는 데에 조력한다. 기존의 책에서 장이 지나치게 인기 있는 것으로 판명되면, 별개의 출판으로 확대될 가치가 있을 것이다. In some embodiments, the system infers the popularity of the publication from the activity of the paper document as well as the electronic communication book associated with it. These factors assist in making decisions about what to publish in the future of the publisher. If the chapter turns out to be too popular in an existing book, it may be worth expanding to a separate publication.

8. 문서 액세스 서비스 8. Document Access Service

설명된 시스템의 중요한 태양은 문서의 렌더링된 카피에 대한 액세스를 갖는 유저에게 그 문서의 전자 버젼에의 액세스 능력을 부여하는 것이다. 몇몇 경우에, 문서는 유저가 액세스하는 개인 네트워크 또는 공중 네트워크상에서 자유로이 이용가능하다. 시스템은 문서를 식별, 탐지 및 검색하기 위해 캡쳐된 텍스트를 사용하고, 몇몇 경우엔 그것을 유저의 스크린에 디스플레이하거나 그것을 유저의 이메일 박스에 위치시킨다. An important aspect of the described system is to give a user with access to a rendered copy of a document the ability to access an electronic version of that document. In some cases, the document is freely available on a private or public network that the user has access to. The system uses the captured text to identify, detect, and retrieve the document, and in some cases displays it on the user's screen or places it in the user's email box.

몇몇 경우에, 문서는 전자 형태로 이용가능하지만 여러 이유로 유저에게 액세스불가능할 수 있다. 문서를 검색하는 데에 충분한 연결이 없을 수 있고, 유저는 그것을 검색할 권한이 없을 수 있고, 그것을 액세스하는 권한관과 연관된 비용이 들 수 있고, 또는 문서는 단지 여러 간으성을 지명하기위해, 새로운 버젼으로 대체가능하고 철회될 수 있다. 시스템은 통통상적으로 피드백을 이들 상황에 대해 유저에게 제공한다. In some cases, the document is available in electronic form but may be inaccessible to the user for various reasons. There may not be enough connections to retrieve the document, the user may not have permission to search it, there may be a cost associated with the authority to access it, or the document may be new, just to name several simplicity. It is replaceable with the version and can be withdrawn. The system typically provides feedback to the user about these situations.

섹션 7.4에 설명된 바와 같이, 특정 유저에게 허영된 액세스의 정도 또는 특성은 유저가 이미 문서의 인쇄된 카피에 대한 액세스를 갖는다면 상이할 수 있다. As described in section 7.4, the degree or nature of access granted to a particular user may be different if the user already has access to a printed copy of the document.

8.1. 인증된 문서 액세스 8.1. Authenticated document access

문서로의 액세스는 특정 유저 또는 특정 기준을 충족하는 유저에게로 제한적일 수 있거나, 유저가 보안 네트워크에 연결된 경우와 같은 일정 환경에서만 이용가능하다. 섹션 6은 유저의 신뢰도 및 스캐너가 수립될 수 있는 몇몇 방식을 설명한다. Access to the document may be limited to a particular user or a user who meets certain criteria, or may only be available in certain circumstances, such as when the user is connected to a secure network. Section 6 describes the user's trust and some ways in which the scanner can be established.

8.2. 문서 구입-복제권-소유 보상 8.2. Buying Documents -Clone-Ownership Reward

일반공중에 자유로이 사용가능한 전자 문서는 비용지불, 또는 출판자 또는 복제권자에 대한 보상으로 액세스가능할 수 있다. 유저은 지불 설비를 구현할 수 있거나 섹션 6.2에 설명된 것을 포함하여, 유저와 연관된 지불 방법을 사용할 수 있다. Electronic documents, which are freely available in the public, may be accessible as a payment or as a reward for the publisher or copyright holder. The user may implement a payment facility or use a payment method associated with the user, including those described in section 6.2.

8.3 문서 에스크로우 및 프로액티브 검색 8.3 Documentation Escrow and Proactive Search

전자 문서는 흔히 일시적이고; 렌더링된 문서의 디지털 소스 버젼은 현재 이용가능하지만 미래에는 액세스불가능할 수 있다. 시스템은 유저가 요구받지 않은 경우에도, 유저를 대신하여 기존 버젼을 검색 및 저장하고, 따라서 ㅅ가 미래에 그성르요구하면 그거의 이용가능성을 보장한다. 이것은 또한 미래 캡쳐를 식별하는 프로세스의 일부로서 탐색하는 것과 같은, 시스템의 사용에 이용가능하다. Electronic documents are often temporary; Digital source versions of the rendered document are currently available but may not be accessible in the future. The system retrieves and stores existing versions on behalf of the user, even if the user is not required, thus ensuring that availability will be available if you request it in the future. It is also available for use of the system, such as searching as part of the process of identifying future captures.

문서로의 액세스를 위해 지불이 요구되는 경우에, 신뢰된 "문서 에스크로우" 서비스는, 유저가 서비스로부터 문서를 요구해야한다면 복제권자가 미래에 충분히 보상된다는 보장으로, 가장 적합한 요금의 지불과 같은 경우에서와 같이, 유저를 대신하여 문서를 검색할 수 있다. In the case where payment is required for access to a document, a trusted "document escrow" service is such as in the case of payment of the most appropriate fee, with the guarantee that the copyright holder will be fully compensated in the future if the user must request the document from the service. As such, the document can be retrieved on behalf of the user.

이 주제에관한 변형들은 문서가 캡쳐시점에 전자 형태로 이용불가능하다면 구현될 수 있다. 유저는 전자 문서가 이후 날짜에 이용가능하게 되어야한다면 유저를 대신하여 문서를 위한 지불을 행하거나 요구를 제출하도록 하는 서비스를 허가할 수 있다. Variations on this subject can be implemented if the document is not available in electronic form at the time of capture. The user may authorize the service to make a payment or submit a request for the document on behalf of the user if the electronic document should be available at a later date.

8.4. 기타 가입 및 계정과의 연관 8.4. Associate with other subscriptions and accounts

때때로 지불은 다른 계정 또는 가입과의 기존 연관에 기초하여 포기, 감소 또는 충족될 수 있다. 신문의 인쇄된 버젼에 대한 가입자는 예로서, 전자 버젼을 검색할 자격이 자동으로 부여될 수 있다. Sometimes payments may be waived, reduced or met based on an existing association with another account or subscription. Subscribers to the printed version of the newspaper may be automatically entitled to retrieve the electronic version, for example.

그밖의 경우에, 연관은 덜 직접적일 수 있는 데; 유저는 그들의 고용주에 의해 수립된 계정에 기초하여, 또는 가입자인 친구에 의해 소우된 인쇄된 카피의 스캐닝을 기초로 하여 액세스권한이 부여될 수구 있다, In other cases, the association may be less direct; A user may be granted access based on an account established by their employer or based on scanning of a printed copy sourced by a friend who is a subscriber.

8.5 포토카핑을 스캔 및 인쇄로 대체하기 8.5 Replacing photocopying with scanning and printing

페이퍼 문서로부터 텍스트를 캡쳐링하고, 전자적 원본을 식별하고, 캡쳐와 연관된 원본의 일부 또는 원본을 인쇄된하는 프로세스는 다양한 이점을 지닌 채 종래의 포토카핑에 대한 대안을 형성한다. The process of capturing text from a paper document, identifying an electronic original, and printing a portion or original of the original associated with the capture form an alternative to conventional photocopying with various advantages.

·페이퍼 문서는 최종 인쇄본과 동일 위치에 있을 필요가 없고, 임의 경우에 동일 시간에 그곳에 있을 필요가 없다. The paper document does not have to be in the same position as the final printed copy, and in any case does not have to be there at the same time.

·포토카핑 프로세스에 의해 페이퍼 문서, 특히 오래되고, 약하고 값어치 있는 문서에 야기된 마모 및 손상은 방지도리 수 있다. The photocopying process can prevent wear and damage caused by paper documents, especially old, weak and valuable documents.

·복사본의 품질은 통상적으로 더욱 높다.The quality of the copy is usually higher.

·어느 문서 또는 가장 빈번히 복사되는 문서의 일부분에 관한 레코드가 유 지될 수 있다.A record may be kept of which document or part of the document is most frequently copied.

·지불은 프로세스의 일부로서 복제권자에 행해질 수 있다.Payment can be made to the proprietor as part of the process.

·승인되지 않은 복사는 금지된다. Unauthorized copying is prohibited.

8.6 포토카피로부터 귀중한 원본을 위치지정함 8.6 positioning valuable originals from photocopy

문서가 법적 문서 또는 역사적 또는 기타 특정 중요성을 가는 문서인 경우와 같이 특히 귀중한 경우에, 사람들은 원본은 안전한 위치에 보관한 채 흔히 ㅅ년 동안 이들 문서의 복사본을 이용한다. In particularly valuable cases, such as when documents are legal documents or documents of historical or other specific importance, people often use copies of these documents for four years, while keeping the originals in a safe place.

설명된 시스템은, 누군가가 보관된 원본 페이퍼 문서를 찾기위해 복사본에 대한 액세스를 용이하게 하는, 예로서 보관 창고에, 원본 문서의 위치를 기록하는 데이터베이스에 연결될 수있다. The described system can be connected to a database that records the location of the original document, such as in a storage warehouse, which facilitates access to the copy to find the original paper document stored by someone.

9. 텍스트 인식 기술 9. Text Recognition Technology

광학식 문자 인식(OCR) 기술은 전통적으로 전체 페이지를 캡쳐링하는 플랫-베드 스캐너로부터, 대량의 텍스트를 포함하는 이미지에 집중되어왔다. OCR 기술은 흔히 유용한 텍스트를 생성하기 위해 유저에 의한 상당한 훈련 및 보정을 필요로 한다. OCR 기술은 흔히 OCR을 수행하는 머신에 대한 상당한 처리 능력을 필요로 하는 한편, 다수의 시스템은 사전을 사용하는 데, 그들은 효과적으로 무한한 용어로 동작하는 것이 예상된다. Optical character recognition (OCR) technology has traditionally focused on images containing large amounts of text, from flat-bed scanners that capture entire pages. OCR techniques often require significant training and correction by the user to produce useful text. While OCR technology often requires significant processing power for machines that perform OCR, many systems use dictionaries, which are expected to operate effectively in infinite terms.

상기한 종래의 모든 특징들은 상기한 시스템으로 개선될 수 있다. All of the above features can be improved with the system described above.

이 섹션이 OCR에 대해 집중된 반면에, 토의된 다수의 이슈들은 특히 음성 이닛과 같은, 기타 인식 기술에 직접 매핑된다. 섹션3.1에서 설명된 바와 같이, 페이퍼로부터 클링하는 프로세스는 오디오를 캡쳐링하는 디바이스내로 텍스트를 판독함에 의해 유저에 의해 달성된다. 당업자는 이미지, 폰트 및 텍스트 조각에 대해 여기에서 설명된 원리가 흔히 오디오 샘플, 유저 음성 모델 및 음소에 대해 적용됨을 이해할 것이다. While this section is focused on OCR, many of the issues discussed map directly to other recognition technologies, especially voice inlets. As described in section 3.1, the process of clinging from the paper is accomplished by the user by reading text into the device capturing the audio. Those skilled in the art will appreciate that the principles described herein for images, fonts, and text fragments often apply to audio samples, user voice models, and phonemes.

9.1 적절한 디바이스를 위한 최적화 9.1 Optimization for the right device

상기한 시스템에 사용을 위한 스캐닝 디바이스는 소형, 휴대형 및 저전력이다. 스캐닝 디바이스는 한 타임에 단지 몇개의 워드만 캡쳐할 수 있고, 몇몇 구현에선 한번에 완전한 한 문자를 캡쳐하지 못할 수 있는 반면에 텍스트를 통해 한 수평 슬라이스를 캡쳐할 수 있는 데, 다수의 그러한 수평 슬라이스는 텍스트가 유추될 수 있는 인식가능한 신호를 함께 형성한다. 스캐닝 디바이스는 또한 매우 제한적인 프로세싱 파워 또는 저장을 갖고 따라서, 몇몇 실시예에서 모든 OCR 프로세스를 수행ㅎㄹ 수 있는 반면에, 다수의 실시예는 캡쳐된 신호를 텍스트로 변환하기 위해, 가능하면 나중에, 더욱 강력한 디바이스로의 연결에 좌우되게 된다. 마지막으로, 그것은 유저 상호작용을 위한 매우 제한된 기능을 갖고, 따라서 이후의 유저 입력을 위해 임의 요구를 지연시킬 필요가 있거나, 지금 보다 더욱 큰 정도로 "최선-추측" 모드로 동작할 필요가 있다. Scanning devices for use in such systems are compact, portable and low power. The scanning device can capture only a few words at a time, and in some implementations may not be able to capture one complete character at a time, while a single horizontal slice can be captured through text, many such horizontal slices being The text together forms a recognizable signal that can be inferred. The scanning device also has very limited processing power or storage and thus may perform all OCR processes in some embodiments, while many embodiments further, if possible, later to convert the captured signal into text. It depends on the connection to a powerful device. Finally, it has very limited functionality for user interaction, and therefore needs to delay any request for subsequent user input, or operate in "best-guessing" mode to a greater extent than now.

9.2 "일정치않은" OCR 9.2 " Uneven " OCR

상기한 시스템내의 OCR의 주요한 새로운 특징은 일반적으로, 그것이 디지털 형태로 검색될 수 있고 어느 곳에 존재하는 텍스트의 이미지를 검사할 것이라는 사실이다. 텍스트의 정확한 트랜스크립션은 반드시 OCR 엔진으로부터 요구되는 것은 아니다. OCR 시스템은 몇몇 경우엔 확률 가중치를 포함하는 가능한 매치의 행렬 또는 집합을 출력하고, 그것은 디지털 원본을 탐색하기 위해 사용될 수 있다. A major new feature of OCR in such systems is that in general, it can be retrieved in digital form and inspect an image of text present anywhere. Correct transcription of text is not necessarily required from the OCR engine. The OCR system outputs a matrix or set of possible matches including probability weights in some cases, which can be used to search for digital sources.

9.3 반복적 OCR-추측, 명확화, 추측... 9.3 Iterative OCR-guessing, disambiguation, speculation ...

인식을 수행하는 디바이스가 프로세싱시 문서 인덱스를 접촉할 수 있다면, OCR 프로세스는 그것이 진행함에 따라 문서 코퍼스의 콘텐츠에 의해 통지될 수 있고, 잠재적으로 상당히 큰 인식 정확도를 제공한다. If the device performing the recognition can contact the document index during processing, the OCR process can be notified by the contents of the document corpus as it proceeds, potentially providing significantly greater recognition accuracy.

그러한 연결은 충분한 텍스트가 디지털 소스를 식별하기 위해 캡쳐된 경우에 디바이스가 유저에게 통지할 수 있게 한다. Such a connection allows the device to notify the user if enough text has been captured to identify the digital source.

9.4. 유사한 렌더링 지식 이용 9.4. Use similar rendering knowledge

시스템이 문서의 유사하게 인쇄된 렌더링의 여러 태양-페이지의 레이아웃 또는 인쇄에 사용된 폰트 타이프페이스와 같은, 또는 어느 섹션이 이탤릭체인지와 같은-, 이것은 역시 인식 프로세스에 조력할 수 있다(섹션 4.1.1).This may also assist in the recognition process, such as the font typeface used in the layout or printing of several aspects of a similarly printed rendering of a document, or the font typeface used in printing (section 4.1). One).

9.5. 폰트 캐싱- 호스트상의 폰트 결정, 클라이언트로의 다운로드 9.5. Font caching- font determination on host , download to client

문서 커퍼스내에서의 후보자 소스 텍스트가 확인됨에 따라, 그것의 폰트, 또는 렌더링은 인식으로 도움을 주기위해 디바이스에 다운로드될 수 있다. As candidate source text in the document compass is identified, its font, or rendering, can be downloaded to the device to aid in recognition.

9.6. 자동보정 및 문자 오프셋트 9.6. Auto Correct and Character Offset

텍스트 프래그먼트의 컴포넌트 문자가 문서 서명으로서 사용될 수 있는 텍스트의 프래그먼트를 표현하는 가장 잘 인식된 방법일 수 있 반면에, 텍스트의 다른 표현들은, 텍스트 프래그먼트를 디지털 문서 및/또는 데이터베이스에 위치시키고 시도하는 경우, 또는 텍스트 프래그먼트의 표현을 판독가능형태로 명확하하는 경우 에, 텍스트 프래그먼트의 실제 텍스트가 사용될 필요가 없다는 것을 충분히 잘 나타낼 수 있다. 텍스트 프래그먼트의 기타 표현은 실제 텍스트 표현이 부족하다는 이점을 제공할 수 있다. 예로서, 텍스트 프래그먼트의 광학식 문자 인식은 흔히, 전체 프래그먼트에 대해 광학식 문자 인식에 의존함이 없이 텍스트 프래그먼트를 재생성하기 위해 탐색하는 데에 사용될 수 있는 캡쳐된 텍스트 프래그먼트의 기타 표현과는 다르게, 에러가 되기 쉬운 경향이 있다. 현재 시스템에 사용되는 몇몇 디바이스에 대해 더욱 적절할 수 있다. Whereas the component characters of a text fragment may be the best recognized way of representing a fragment of text that can be used as a document signature, other representations of text may be found when attempting to place the text fragment in a digital document and / or database. In the case of clarifying the representation of the text fragment, or, in a readable form, it can be well represented that the actual text of the text fragment need not be used. Other representations of text fragments can provide the advantage of a lack of actual text representation. By way of example, optical character recognition of text fragments is often an error, unlike other representations of captured text fragments that can be used to navigate to regenerate text fragments without relying on optical character recognition for the entire fragment. It tends to be easy. It may be more appropriate for some devices currently used in the system.

당업자는 텍스트 프래그먼트의 외양을 기술하는 많은 방법이 있다는 것을 인식할 것이다. 텍스트 프래그먼트의 그러한 특징화는 다음과 같은 것에 제한되진 않지만, 워드 길이, 상대적 워드 길이, 문자 높이, 문자 폭, 문자 형태, 문자 빈도, 토큰 빈도등과 같은 것을 포함한다. 몇몇 실시예에서, 매칭 텍스트 토큰간의 오프셋트(즉, 간섭 토큰의 수에다 일을 더한 것)는 텍스트의 프래그먼트를 특징화하는 데에 사용된다. Those skilled in the art will appreciate that there are many ways to describe the appearance of a text fragment. Such characterization of text fragments includes, but is not limited to, the following: word length, relative word length, character height, character width, character type, character frequency, token frequency, and the like. In some embodiments, the offset between the matching text tokens (ie, the number of interference tokens plus one) is used to characterize the fragment of the text.

종래의 OCR은 스캐닝된 텍스트의 문자를 결정하기 위한 시도로 폰트, 글자 구조 및 형태에 관한 지식을 이용한다. 본 발명의 실시예는 상이하고; 그것들은 인식 프로세스에서 조력하기 위해 렌더링된 텍스트 자체를 사용하는 다양한 방법을 채용한다. 이들 실시예는 "서로를 인식하기 위해" 문자(또는 토큰)을 사용한다. 그라러 자체-인식을 일컫는 한 예는 "템플릿 매칭", 이고 "컨볼루션"과 유사하다. 그러한 자체-인식을 수행하기 위해, 시스템은 텍스트의 복사본을 그 자체에 대해 수평으로 슬라이스하고 텍스트 이미지의 매칭 지역을 노트한다. 종래의 템플릿 매 칭 및 컨볼루션 기술은 다양한 관련 기술을 포함한다. 문자/토큰을 토큰화 및/또는 인식하기 위한 기술은 문자/토큰이 매칭하는 경우 그 자신의 컴포넌트와 상관시키는 데에 사용된다. Conventional OCR uses knowledge of fonts, character structures and forms in an attempt to determine the characters of the scanned text. Embodiments of the invention are different; They employ various ways of using the rendered text itself to assist in the recognition process. These embodiments use characters (or tokens) to “recognize each other”. One example of grading self-awareness is "template matching", which is similar to "convolution". To perform such self-awareness, the system slices a copy of the text horizontally relative to itself and notes the matching area of the text image. Conventional template matching and convolution techniques include a variety of related techniques. Techniques for tokenizing and / or recognizing characters / tokens are used to correlate the characters / tokens with their own components if they match.

자동보정의 경우, 매치하는 완전 연결된 지역은 관심있는 것이다. 이것은 문자(또는 문자의 그룹)이 동일 문자(또는 그룹)의 다른 인스턴스를 오버레이하는 경우 발생한다. 매치하는 완전 연결된 지역은 컴포넌트 토큰내에 텍스트의 토큰화를 자동으로 제공한다. 텍스트의 두 개의 복사본이 서로를 지나 슬라이딩됨에 따라, 완전한 매칭이 발생(즉, 수직 슬라이스의 모든 픽셀들이 매칭된다)하는 지역이 노트된다. 문자/토큰이 스스로 매칭하는 경우, 이 매칭의 수평 범위(예로서, 텍스트의 연결된 매칭부)도 매칭한다.In the case of autocalibration, the fully connected region that matches is of interest. This occurs when a character (or group of characters) overlays another instance of the same character (or group). Fully matched regions that match automatically provide tokenization of text within component tokens. As two copies of the text slide past each other, an area is noted where complete matching occurs (ie all pixels of the vertical slice are matched). If a character / token matches itself, it also matches the horizontal range of the match (eg, a linked match of text).

이 스테이지에서 각각의 토큰의 실제 아이덴티티즉, 토큰 이미지에 대응하는, 특정 글자, 숫자 또는 심볼, 또는 이들의 그룹)를 결정할 필요가 없고, 단지 스캐닝된 텍스트에서의 동일 토큰의 다음 발생에 대한 오프셋트만을 결정할 필요가 있다. 오프셋트 수는 동일 토큰의 다음 발생까지의 거리(토큰의 수)이다. 토큰이 텍스트 스트링내에서 고유하면, 오프셋트는 제로(0)이다. 이렇게 발생된 토큰 오프셋트의 시퀀스는 스캐닝된 텍스트를 식별하기 위해 사용될 수 있는 서명이다. At this stage there is no need to determine the actual identity of each token, i.e. a specific letter, number or symbol, or group thereof, corresponding to the token image, but only an offset for the next occurrence of the same token in the scanned text. Only need to decide. The offset number is the distance (number of tokens) to the next occurrence of the same token. If the token is unique within the text string, the offset is zero. The sequence of token offsets generated in this way is a signature that can be used to identify the scanned text.

몇몇 실시예에서, 스캐닝된 토큰의 스트링에 대하여 결정된 토큰 오프셋트는 그것들의 콘텐츠의 토큰 오프셋트에 기초하여 전자 문서의 코퍼스를 색인하는 인덱스에 비교된다(섹션 4.1.2). 다른 실시예에서, 스캐닝된 토큰의 스트링에 대하여 결정된 토큰 오프셋트는 텍스트로 변환되고, 그것들의 콘텐츠에 기초하여 전자 문 서의 코퍼스를 색인하는 더욱 종래의 인덱스에 비교된다. In some embodiments, the token offset determined for the string of scanned tokens is compared to an index that indexes the corpus of the electronic document based on the token offset of their content (section 4.1.2). In another embodiment, the token offset determined for the string of scanned tokens is converted to text and compared to a more conventional index that indexes the corpus of electronic documents based on their contents.

상기한 바와 같이, 유사한 토큰-상관 프로세스는 캡쳐 프로세스가 음성 워드의 오디오 샘플로 이루어지는 경우 음성 프래그먼트에 적용될 수 있다. As noted above, a similar token-correlation process may be applied to speech fragments when the capture process consists of audio samples of speech words.

9.7. 폰트/문자 "자기-인식" 9.7. Font / character "self-recognition"

종래의 템플릿-매칭 OCR은 문자 이미지의 라이브러리에 스캐닝된 이미지를 비교한다. 본질적으로, 알파벳은 각각의 폰트에 대해 저장되고 새로이 스캐닝된 이미지는 매칭 문자를 발견하기위해 저장된 이미지와 비교된다. 이 프로세스는 올바른 폰트가 식별될 때 까지 초기 지연이 계속된다. 그후, OCR 프로세스는 비교적 고속인데 이는 대부분의 문서가 동이 폰트를 전체적으로 사용하기 때문이다. 후속 이미지는 따라서 최근 식별된 폰트 라이브러리와의 비교에 의해 텍스트로 변환될 수 있다. Conventional template-matching OCR compares the scanned image to a library of character images. In essence, the alphabet is stored for each font and the newly scanned image is compared with the stored image to find matching characters. This process continues with an initial delay until the correct font is identified. After that, the OCR process is relatively fast because most documents use the same font throughout. Subsequent images can thus be converted into text by comparison with the recently identified font library.

가장 흔히 사용되는 폰트의 문자의 형태는 관련된다. 예로서 대부분의 폰트에서, 글자 "C" 및 글자 "e" 는 시각적으로 관련되고-"t" "f"등도 마찬가지이다. OCR 프로세스는 아직 스캐닝되지 않은 글자들에 대한 템플릿을 구성하기 위해 상기 관계를 사용함에 의해 향상된다. 예로서 판독기가 이전에 보지못한 폰트의 페이퍼 문서로부터 텍스트의 짧은 스트링을 스캔하여 시스템은 스캐닝된 이미지와 비교하는 이미지 템플릿의 셋트를 갖지 않는 경우, 시스템은 그것이 알파벳의 모든 글자를 아직 보지 못한 경우에도 폰트 템플릿 라이브러리를 구성하기 위해 일정한 문자들간에 가능한 관계를 레버리지할 수 있다. 시스템은 그러면 후속 스캐닝된 텍스트를 인식하기 위해 그리고 구성된 폰트 라이브러리를 더욱 정교하게 하기 위해 구 성된 폰트 템플릿 라이브러리를 사용할 수 있다. The most commonly used font's character form is relevant. As an example in most fonts, the letter "C" and the letter "e" are visually related-"t" "f" and so on. The OCR process is enhanced by using the relationship to construct a template for letters that have not yet been scanned. For example, if the reader scans a short string of text from a paper document of a font that has not been previously seen and the system does not have a set of image templates to compare with the scanned image, the system will not see all the letters of the alphabet yet. Possible relationships between certain characters can be leveraged to construct a font template library. The system can then use the configured font template library to recognize subsequent scanned text and to further refine the configured font library.

9.8. 인식되지 않은 것(그래픽을 포함한)은 어느 것이나 서버로 전송 9.8. Anything not recognized (including graphics) is sent to the server

이미지가 탐색 프로세스에서의 사용을 위한 적절한 형태로 기계 트랜스크립션될 수 없는 경우에, 이미지는 스스로 유저에 의한 나중의 사용을 위해, 가능한 수동 트랜스크립션을 위해, 또는 상이한 자원이 시스템에 이용가능할 수 있을 때 나중 날짜에서의 프로세싱을 위해, 보관될 수 있다. If the image cannot be machine transcribed into a suitable form for use in the search process, the image may be available for future use by the user on its own, for possible manual transcription, or with different resources available to the system. When possible, it can be archived for later processing.

10. P- 커머스 10. P- Commerce

시스템에 의해 가능한 행해진 다수의 액션들은 몇몇 상용 트랜잭션이 발생하는 결과로 된다. 어구 P-커머스는 본원에서 시스템을 통한 페이퍼로부터 개시된 상용 액티비티들을 설명하기 위해 사용된다. Many of the actions performed by the system are the result of several commercial transactions occurring. The phrase P-commerce is used herein to describe commercial activities initiated from paper through the system.

10.1. 물리적 인쇄된 복사본에 의한 문서의 판매 10.1. Sale of documents by physically printed copies

유저가 텍스트를 문서로부터 캡쳐하는 경우, 유저는 페이퍼 또는 전자 형태로 구입을 위한 그 문서가 제공된다. 유저는 페이퍼 문서에 인용된 또는 언급된 문서, 또는 동일 저자에 의한 문서 또는 유사한 주제와 같은 관련 문서들이 제공된다. When the user captures text from the document, the user is provided with the document for purchase in paper or electronic form. The user is provided with documents that are cited or mentioned in the paper document, or documents by the same author or similar subject matter.

10.2. 페이퍼에 의해 개시되거나 도움을 받은 것들의 판매 10.2. Sale of those initiated or assisted by paper

텍스트의 캡쳐는 다양한 방식으로 사용 액티비티에 링크될 수 있다. 캡쳐된 텍스트는 아이템을 판매하도록 디자인된 카탈로그일 수 있고, 이경우 텍스트는 아이템의 구입과 매우 직접적으로 연관될 수 있다(섹션 18.2). 텍스트는 또한 광고의 일부분일 수 있고, 이 경우 광고되는 아이템의 판매는 계속이어질 수 있다. Capture of text can be linked to the activity of use in a variety of ways. The captured text can be a catalog designed to sell the item, in which case the text can be very directly related to the purchase of the item (section 18.2). The text may also be part of an advertisement, in which case the sale of the advertised item may continue.

그 밖의 경우에, 유저는 상용 트랜잭션에서 그들의 잠재적 관심사항이 유추될 수 있는 기타 텍스트를 캡쳐한다. 특정 국가의 소설 셋트의 독자는 그곳의 휴일에 관심이 갈 수 있다. 유저는 그들에게 몇몇 상용 기회가 결과적으로 제시될 수 있것을 아는 텍스트의 특정 프래그먼트를 캡쳐할 수 있고, 또는 그것은 그들의 캡쳐 활동의 부수적인 것일 수 있다. In other cases, users capture other text whose potential interests may be inferred in commercial transactions. Readers of novel sets in a particular country may be interested in the holidays there. The user may capture a particular fragment of text that knows that some commercial opportunity may be presented to them as a result, or it may be ancillary to their capture activity.

10.3. 판매되어지는 아이템상의 레이블 , 아이콘, 일련번호, 바코드의 캡쳐 10.3. Being sold Item top Capture of labels , icons, serial numbers, barcodes

때때로 텍스트 또는 심볼은 실제로 아이템상에 또는 그것의 포장에 인쇄된다. 그 예로는 일련번호 또는 제품 id는 흔히 전자 방비의 피스의 바닥부 또는 후면상의 레이블에서 찾을 수 있다. 시스템은 유저에게 그 텍스트를 캡쳐링함에 의해 하나 이상의 동일 아이템을 구입하는 편리한 방법을 제공한다. Sometimes text or symbols are actually printed on an item or on its packaging. For example, the serial number or product id can often be found on a label on the bottom or back of the piece of electronic defense. The system provides the user with a convenient way to purchase one or more identical items by capturing the text.

10.4. 콘텍스추얼 광고 10.4. Contextual Advertising

광고로부터 텍스트의 직접적 캡쳐외에, 시스템은 렌더링된 문서에서 반드시 명시적일 필요가 없는 새로운 종류의 광고를 허용하지만, 그럼에도 그것은 사람들이 판독하는 것에 기초하고 있다. In addition to the direct capture of text from an advertisement, the system allows a new kind of advertisement that does not necessarily have to be explicit in the rendered document, but it is nevertheless based on what people read.

10.4.1. 스캔 콘텍스트 및 이력에 기초한 광고 10.4.1. Ads based on scan context and history

종래의 페이퍼 출판에서, 광고는 신문 기사의 텍스트에 비해 큰 공간을 소비하고, 그리고 이것들의 제한돈 수가 특정 기사 주변에 배치될 수 있다. 상기한 시스템에서, 광고는 개별적인 워드 또는 구와 연관될 수 있고, 그 텍스트를 캡쳐링함에 의해 그리고 과거 스캔에 대한 그들의 이력을 고려함에 의해 유저가 도시한 특정 관심사에 따라 선택될 수 있다. In conventional paper publishing, advertisements consume a large amount of space relative to the text of a newspaper article, and a limited number of these can be placed around a particular article. In the system described above, advertisements may be associated with individual words or phrases and may be selected according to the particular interests shown by the user by capturing the text and considering their history for past scans.

상기한 시스템으로, 특정 인쇄된 문서에 밀접하게되는 구입을 위해 그리고 특정 인쇄 출판물에서의 그들의 광고의 유효성에 대한 상당히 많은 피드백을 광고자가 얻는 것이 가능하다. With such a system, it is possible for advertisers to get a great deal of feedback on the validity of their advertisements for purchases that are closely tied to a particular printed document and for the effectiveness of their advertisement in a particular printed publication.

10.5. 보상의 모델 10.5. Model of compensation

시스템은 광고자 및 시장판매자에 대한 보상의 몇몇 새로운 모델을 가능케 한다. 광고를 포함하는 인쇄된 문서의 발행자는 그들의 문서로부터 발생된 구입으로부터 약간의 수입을 얻을 수 있다. 이것은 광고가 원본 인쇄된 형태에 존재하는 지의 여부에 관계없이 트루이고; 그것은 전자적으로 출판자에 의해 또는 강고주 또는 제3자에 의해 추가되어질 수 잇고, 그러한 광고의 소스는 유저에 의해 가입되어질 수 있다. The system enables several new models of rewards for advertisers and marketers. Publishers of printed documents containing advertisements may earn some revenue from purchases generated from their documents. This is true regardless of whether the advertisement is in the original printed form; It can be added electronically by the publisher or by a hard-liner or third party, and the source of such advertisements can be subscribed by the user.

10.5.1. 인기도-기반 보상 10.5.1. Popularity-based reward

시스템에 의해 발생된 통계치에 대한 분석은 출판물의 ㅇ리정 부분에 대한 인기도를 나타낼 수 있다(섹션 14.2.). 신문에서, 독자가 특정 페이지 또는 기사, 또는 특정 문서의 인기도를 보는 데 소비한 시간의 양을 나타낼 수 있다. 몇몇 환경에서, 작가 또는 출판업자는 반포된 복사본의 수 또는 기록된 워드와 같은 더욱 전통적인 메트릭스 보단 독자의 액티비티에 기초하여 보상을 받기에 적합할 수 있다. 그의 작품이 주제에 대한 권위가 빈번하게 판독되는 작가는 그의 착이 복사본 만큼 판매되지만 드믈게 열람되는 작가와는 미래의 계약에서 상이하게 고려될 수 있다. Analysis of the statistics generated by the system may indicate the popularity of the marginal portion of the publication (section 14.2). In a newspaper, it may indicate the amount of time a reader spent looking at a particular page or article, or the popularity of a particular document. In some circumstances, a writer or publisher may be eligible to be rewarded based on the activity of the reader rather than more traditional metrics, such as the number of copies distributed or the words recorded. An author whose work is frequently read for authority on the subject may be sold as a copy of his apparel, but may be considered different in future contracts with a rarely viewed writer.

10.5.2. 인기도-기반 광고 10.5.2. Popularity-based advertising

문서에서의 광고에 관한 결정은 독자관계에 대한 통계치에 기초할 수 있다. 가장 인기있는 컬럼니스트주변의 광고는 프리미엄 레이트로 판매될 수 있다. 광고자들은 문서가 그것이 어떻게 수용되는지에 대한 지식에 기초하여 출판된 후 일정 시간에 요금청구되거나 보상될 수 있다.The decision about advertising in the document may be based on statistics about the readership. Advertisements around the most popular columnists can be sold at a premium rate. Advertisers can be billed or compensated at certain times after a document is published based on knowledge of how it is accepted.

10.6. 수명 라이브러리에 기초한 마켓팅 10.6. Marketing based on lifetime library

섹션 6.1 및 16.1에 설명된 스캔 이력 또는 "수명 라이브러리"는 유저의 습관 또는 관심사항에 대한 정보의 극히 값어치 있는 소스일 수 있다. 적절한 동의 및 프라이버시 이슈에 종속하여, 그러한 데이터는 유저에게 상품 또는 서비스의 제공을 통지한다. 익명 형태의 경우에도, 수집된 통계치는 매우 유용할 수 있다. The scan history or "lifetime library" described in sections 6.1 and 16.1 can be an extremely valuable source of information about a user's habits or interests. Depending on the appropriate consent and privacy issues, such data notifies the user of the provision of goods or services. Even in the anonymous form, the statistics collected can be very useful.

10.7. (이용가능한 때)이후 날짜에서의 판매/정보 10.7. Sales / information on later date (when available)

상용 트랜잭션을우한 광고 및 기타 기회들은 텍스트 캡쳐시에 유저에게 즉시 제공되지 않을 수 있다. 예로서, 소설에 대한 시퀄을 구입할 기회는 유저가 소설을 읽는 시점에 이용불가능할 수 있지만, 시스템은 시퀄이 출판된 경우 그것들에게 기회를 제공할 수 있다. Advertisements and other opportunities for commercial transactions may not be immediately available to the user at the time of text capture. By way of example, the opportunity to purchase a qualifier for a novel may not be available at the time a user reads the novel, but the system may offer them an opportunity if the qualifier is published.

유저는 구입 또는 기타 상용 트랜잭션에 관련한 데이터를 캡쳐할 수 있지만, 캡쳐가 행해진 시점에서 트랜잭션을 개시 및/또는 완료하는 것을 선택하지 않을 수 있다. 몇몇 실시예에서, 캡쳐와 관련한 데이터는 수명 라이브러리에 저장되고, 이들 수명 라이브러리 엔트리는 "액티브"상태(즉, 캡쳐가 행해졌었던 시점에서 이용가능한 것들과 유사한 후속 상호작용할 수 있는)에 있을 수 있다. 따라서 유저는 어떤 나중 시점에 캡쳐를 리뷰할 수 있고, 선택적으로 그 캡쳐에 기초하여 트랜잭 션을 완료할 수 있다. 시스템은 원래 캡쳐가 언제 어디서 발생했는 지를 추적할 수 있기 때문에, 트랜잭션에 포함된 모든 당사자들은 적절하게 보상받을 수 있다. 예로서 유저가 데이터를 캡쳐하는 광고의 바로 다음에 나타나는 -이야기를 출판한 출판자- 및- 이야기를 쓴 작가는 유저가, 6개월 후, 그들의 수명 라이브러리를 방문한 경우, 이력으로부터 특정한 캡쳐를 선택한 경우, 및 팝업 메뉴로부터 "이 아이템을 Amazon으로부터 구입"을 결정함에 의해 보상될 수 있다(이것은 캡쳐의 시점에서 선택적으로 제시된 메뉴와 동일 또는 유사할 수 있다). The user may capture data relating to purchases or other commercial transactions, but may not choose to initiate and / or complete the transaction at the time the capture was made. In some embodiments, data relating to capture is stored in a lifespan library, and these lifespan library entries may be in an "active" state (ie, capable of subsequent interaction similar to those available at the time the capture was done). Thus, the user can review the capture at some later point in time and can optionally complete a transaction based on the capture. Because the system can track when and where the original capture occurred, all parties involved in the transaction can be properly compensated. For example, a writer who writes a story-a publisher who published a story-that appears immediately after an advertisement that captures data by the user, selects a particular capture from history, if the user visited their life library six months later, And by purchasing "purchase this item from Amazon" from the pop-up menu (which may be the same as or similar to the menu optionally presented at the time of capture).

11. 운영체제 및 애플리케이션 통합 11. Operating system and application integration

현대 운영체제(OS) 및 기타 소프트웨어 패키지는 상기 시스템으로 유익하게 이용될 수 있는 다수의 특징을 가지며, 그것의 사용을 위해 더욱 양호한 플랫폼을 제공하기 위해 다양한 방식으로도 수정될 수 있다. Modern operating systems (OSs) and other software packages have a number of features that can be beneficially used with the system, and can be modified in various ways to provide a better platform for its use.

11.1. 메타데이터 및 인덱싱에서 스캔 및 인쇄-관련된 정보의 통합 11.1. Integration of scan and print-related information in metadata and indexing

새롭고 머지않아 다가오는 파일 시스템 및 그들의 연관된 데이터베이스는 흔히 각각의 파일과 연관된 다양한 메타데이터를 저장할 능력을 갖는다. 통상적으로, 이 메타데이터는 파일을 생성한 유저의 ID, 생성일짜, 최종 수정 및 최종 사용과 같은 것들을 포함한다. 더욱 새로운 파일 시스템은 키워드, 이미지 특징, 문서 소스 및 유저 코멘트와 같은 여분의 정보들이 저장되는 것을 허용하고 몇몇 시스템에서 이 메타데이터는 임의의로 확장될 수 있다. 파일 시스템은 그러므로 현재 시스템을 구현하는 데에 유용한 정보를 저장하는 데에 사용할 수 있다. 예로서, 데이터는, 상기 시스템을 사용하여 페이퍼로부터 어느 텍스트가 그리고 언제 누구에 의해 캡쳐되었는지에 대한 상세히 나타낼 수 있는 바와 같은 데이터는, 주어진 문서가 최종 인쇄되었을 때 파일 시스템에 의해 저장될 수 있다. New and upcoming file systems and their associated databases often have the ability to store various metadata associated with each file. Typically, this metadata includes such things as the ID of the user who created the file, creation date, last modification, and last use. Newer file systems allow extra information such as keywords, image features, document sources and user comments to be stored and in some systems this metadata can be extended to arbitrary. The file system can therefore be used to store information useful for implementing the current system. By way of example, data may be stored by the file system when a given document was last printed, as the data may be used to describe in detail which text and from whom was captured from the paper using the system.

운영체제는 또는 유저가 로컬 파일을 더욱 용이하게 발견할 수 있게하는 탐색 엔진 기능을 통합하기 시작한다. 이들 기능들은 시스템에 의해 유익하게사용될 수 있다. 그것은 섹션 3 및 4에서 토의된, 다수의 탐색-관련 개념들은 오늘날의 인터넷-기반 및 유사한 탐색 엔진에 뿐만아니라 모든 개인용 컴퓨터에 적용되는 것을 의미한다. The operating system also begins incorporating search engine functionality that allows the user to more easily find local files. These functions can be beneficially used by the system. That means that many of the search-related concepts discussed in sections 3 and 4 apply to all personal computers as well as to today's Internet-based and similar search engines.

몇몇 경우에 특정 소프트웨어 애플리케이션은 OS에 의해 제공된 기능 이강 및 그것을 넘어서는 시스템을 위한 지원을 포함하게 된다. In some cases, a particular software application will include a set of capabilities provided by the OS and support for the system beyond it.

11.2. 캡쳐 디바이스를 위한 OS 지원 11.2. capture OS support for devices

펜 스캐너와 같은 캡쳐 디바이스의 사용이 점점 일반화됨에 따라, 마우스 및 프린터에 제공되는 지원과 많이 유사한 방식으로, 지원체계를 운영체제에 구축하는 것이 바람직한 데, 이는 캡쳐 디바이스의 적용가능성이 단일 소프트웨어 애플리케이션 범위를 넘어 확장하기 때문이다. 시스템의 동작에 대한 기타 태양에 대해서도 마찬가지로 옳다. 몇몇 샘플이 이하에 설명된다. 몇몇 실시예에서, 전체 설명된 시스템, 또는 그 핵심부분이 OS에 의해 제공된다. 몇몇 실시예에서, 시스템을 우한 지원은 시스템의 태양을 직접 구현하는 것을 포함하는, 기타 소프트웨어 패키지에 의해 사용될 수 있는 애플리케이션 프로그래밍 인터페이스(APIs)에 의해 제공된다. As the use of capture devices, such as pen scanners, is becoming more common, it is desirable to build a support system into the operating system in much the same way as the support provided for mice and printers, where the applicability of the capture device may be limited to a single software application. Because it extends beyond. The same is true for other aspects of the operation of the system. Some samples are described below. In some embodiments, the entire described system, or key portion thereof, is provided by the OS. In some embodiments, support for the system is provided by application programming interfaces (APIs) that can be used by other software packages, including directly implementing aspects of the system.

11.2.1. OCR 및 기타 인식 기술을 위한 지원 11.2.1. Support for OCR and Other Recognition Technologies

렌더링된 문서를 캡쳐링하기 위한 대부분의 방법은 시스템에 사용하기에 적합 텍스트와 같은, 스캐닝된 이미지 또는 몇몇 음성 워드등과 같은 소스 데이터를 해석하기 위해 몇몇 인식 소프트웨어를 필요로 한다. 어떤 OS는 그것이 OCR을 위한 지원을 포함하기엔 OS에 대해 덜 일반적일 지라도, 음성 또는 수기 인식을 위한 지원을 포함하는 데, 이는 과거에 OCR의 사용은 적은 범위의 애플리케이션에 제한되어왔기 때문이다. Most methods for capturing rendered documents require some recognition software to interpret the source data, such as scanned images or some spoken words, such as text suitable for use with the system. Some OSs include support for voice or handwriting recognition, although it is less common for the OS to include support for OCR, since the use of OCR in the past has been limited to a small range of applications.

인식 컴포넌트가 OS의 일부분이 되어짐에 따라, 드것들은 OS에 의해 제공된 기타 기능설비의 이점을 취할 수 있다. 대부분의 시스템은 철자 사전, 문법 분석 툴, 구제화 및 국부화 기능설비를 포함하는 데 이들 모두는 상기한 시스템의 인식 프로세스를 위해 상기 시스템에 의해 유익하게 채용될 수 있는 데, 이는 그것들이 특정 유저가 흔히 만나게 되는 단어 및 어구를 포함하도록 특정 유저를 위해 맞춤식으로 될 수 있기 때문이다. As the recognition component becomes part of the OS, they can take advantage of the other facilities provided by the OS. Most systems include spelling dictionaries, grammar analysis tools, phraseization and localization facilities, all of which can be beneficially employed by the system for the recognition process of the system, as they are specific users. Can be customized for a particular user to include words and phrases that are often encountered.

운영체제가 전체-텍스트 인덱싱 기능설비를 포함한다면, 이것들은 또한 섹션 9.3에 설명된 바와 같은, 인식 프로세스를 통지하는 데에 사용될 수 있다. If the operating system includes a full-text indexing facility, these may also be used to inform the recognition process, as described in section 9.3.

11.2.2. 스캔에 취해져야 할 액션 11.2.2. Actions to be Taken for Scanning

광학 스캔 또는 기타 캡쳐가 발생하고 OS에 주어진다면, 어떠한 다른 시스템도 캡쳐에 대한 소유권을 주장하지 않는 경우의 환경하에서 취해져야 할 디폴트 액션을 갖는다. 디폴트 액션의 예는 유저에게 대안 선택권을 제시하는 것이거나, 캡쳐된 텍스트를 OS의 내장된 탐색기능설비에 전송하는 것이다. If an optical scan or other capture occurs and is given to the OS, it has a default action to be taken under circumstances where no other system claims ownership of the capture. Examples of default actions are to present an alternative option to the user or to send captured text to the OS's built-in search facility.

11.2.3. OS는 특정 문서 또는 문서 유형에 대한 디폴트 액션을 갖는다 11.2.3. OS has default actions for specific documents or document types

렌더링된 문서의 디지털 소스가 발견된다면, OS는 그 특정 문서 또는 그 부류의 문서가 스캔되었을 때 취해질 표준 액션을 가질 수 있다. 애플리케이션 및 기타 서브시스템은 일정한 파일 유형을 취급하는 그들의 능력에 대해 애플리케이션에 의한 방송과 유사한 방식으로, 특정 캡쳐 유형의 잠재적 핸들러로서 OD네 등록할 수 있다. If a digital source of a rendered document is found, the OS may have a standard action to be taken when that particular document or class of documents is scanned. Applications and other subsystems can register ODs as potential handlers for certain capture types, in a manner similar to broadcast by applications for their ability to handle certain file types.

렌더링된 문서, 또는 문서로부터의 캡쳐와 연관된 마크업 데이터는 운영체제에 특정 애플리케이션을 런칭하고, 애플리케이션 인수, 파라미터, 또는 데이터등을 전달하게하는 명령을 포함할 수 있다. The markup data associated with the rendered document, or capture from the document, may include instructions to launch a particular application to the operating system and pass application arguments, parameters, data, and the like.

11.2.4. 표준 액션내로의 매핑 및 제스처의 해석 11.2.4. Into a standard action Mapping and Interpreting Gestures

섹션 12.1.3에서 "제스춰"의 사용이 설명되고, 특히 광학 스캐닝의 경우에, 핸드헬드 스캐너로 특정한 이동이 행해진 장소는 텍스트의 영역의 시작 및 끝을 표시하는 바와 같은 표준 액션을 표현한다.The use of "gestures" is described in section 12.1.3, and especially in the case of optical scanning, the place where a particular move is made with the handheld scanner represents a standard action, such as marking the beginning and end of an area of text.

이것은 텍스트의 영역을 선택하기 위해 커서 키를 사용하는 한편, 또는 문서를 스크롤하기 위해 마우스상의 휘을 사용하는 한편 키보드상의 시프트 키를 프레싱하는 바와 같은 액션과 유사하다. 유저에 의한 그러한 액션은 그것들이 OS에 의해 시스템-와이드 방식으로 해삭되는 충분히 표준이고, 이에따라 일정한 작용을 보장한다. 동일한 작용이 스캐너 제스춰 및 기타 스캐너-관련 액션에 바람직하다. This is similar to an action, such as using the cursor keys to select an area of text, or pressing the shift key on the keyboard while using the whee on the mouse to scroll the document. Such actions by the user are sufficiently standard that they are hacked in a system-wide manner by the OS, thus ensuring a constant action. The same action is desirable for scanner gestures and other scanner-related actions.

11.2.5. 표준(및 비-표준) 아이콘/텍스트 인쇄된 메뉴 아이템에 대한 셋트 응답 11.2.5. Set response to standard (and non-standard) icons / text printed menu items

마찬가지 방식으로, 어떤 텍스트의 아이템 또는 기타 심볼은, 스캐닝되었을 때, 표준 액션이 발생하게 하고, OS는 이들에 대한 선택을 제공할 수 있다. 예로서는 임의의 문서에서 텍스트 "[인쇄]"를 스캐닝하는 것은 OS로 하여금 그 문서의 복사본을 검색 및 인쇄하게 할 수 있다는 것이다. OS는 그러한 액션을 등록하는 방법을 제공하고 그것들을 특정한 스캔과 연관시킬 수 있다. In the same way, items of text or other symbols of text cause standard actions to occur when scanned, and the OS can provide a choice for them. By way of example, scanning the text "[print]" in any document may allow the OS to retrieve and print a copy of that document. The OS provides a way to register such actions and associate them with a particular scan.

11.3. 전형적인 스캔-개시된 액티비티를 위한 시스템 GUI 컴포넌트에서의 지원 11.3. Support in system GUI components for typical scan-initiated activities

대부분의 소프트웨어 애플리케이션은 OS에 의해 제공된 표준 그래픽 유저 인터페이스에 기초한다. Most software applications are based on standard graphical user interfaces provided by the OS.

디벨로퍼에 의한 이들 컴포넌트의 사용은, 예로서, 모든 프로그래머가 동일 기능을 독립적으로 이행함이 없이, 임의의 텍스트-에디팅 콘텍스에서의 좌측-커서의 누름이 그 커서를 좌측으로 이동시키는 바와 같은, 복수의 패키지에 걸쳐 일관된 작용을 보장하는 데에 도움을 준다. The use of these components by the developer is, for example, as pressing the left-cursor in any text-editing context moves the cursor to the left, without all programmers independently implementing the same functionality. It helps to ensure consistent behavior across multiple packages.

이들 컴포넌트에서의 우사한 일관성은 액티비티가 상기한 시스템의 텍스트-캡쳐 또는 기타 태양에 의해 개시되는 경우에 바람직하다. 몇몇 예가 하기에 주어진다. Similar consistency in these components is desirable when the activity is initiated by a text-capture or other aspect of the system described above. Some examples are given below.

11.3.1. 특정 텍스트 콘테트를 찾기 위한 인터페이스 11.3.1. Interface for finding specific text content

이 시스템의 전형적인 사용은 페이퍼 문서의 일정 영역을 유저를 위해 스캐닝하는 것일 수 잇고, 시스템을 위해 디스플레이 또는 편잡할 수 있는 소프트웨어 패키지에서 전자 카운터파트를 개봉하는 것과, 그 패키지가 스캐닝된 텍스트를 스크롤 및 하이라이트하게 하는 것일 수 있다(섹션 12.2.1.). 전자 문서를 발견 및 개방하는, 이 프로세스의 제1 부분은 통상적으로 OS에 의해 제공되고 패키지에 걸쳐 표준이다. 그러나, 제2 부분-문서내에 텍스트의 특정 부분을 위치시키고 패키지가 그것을 스크롤하고 하이라이트하게 하는-은 아직 표준화되지 않았고 흔히 각각의 패키지에 의해 상이하게 구현된다. 이 기능을 위한 표준 API의 이용가능성은 시스템의 이러한 태양의 동자을 상당히 향상시킨다. Typical use of this system may be scanning a portion of a paper document for a user, opening an electronic counterpart in a software package that can be displayed or manipulated for the system, scrolling and scanning the scanned text. May be highlighted (section 12.2.1.). The first part of this process of discovering and opening electronic documents is typically provided by the OS and is standard throughout the package. However, the second part, which places a particular part of the text in the document and causes the package to scroll and highlight it, has not yet been standardized and is often implemented differently by each package. The availability of standard APIs for this function significantly improves the motivation of this aspect of the system.

11.3.2. 텍스트 상호작용 11.3.2. Text interaction

텍스트의 일부분이 문서내에 위치되었다면, 시스템은 그 텍스트에 대해 다양한 동작을 수행하길 바랄 것이다. 예로서, 시스템은 주위 텍스트를 요구할 수 있고, 따라서 몇 개 워드에 대한 유저의 캡쳐는 시스템에서 그것들을 포함하는 전체 문장 또는 단락을 액세싱하는 결과로 된다. 다시, 이 기능은 텍스트를 취급하는 소프트웨어의 모든 부분에 구현되는 것에 의하기 보단 OS에 의해 유용하게 제공될 수 있다. If a piece of text is located in a document, the system will want to perform various actions on that text. By way of example, the system may require surrounding text, so the user's capture of several words results in accessing the entire sentence or paragraph containing them in the system. Again, this functionality can be usefully provided by the OS rather than being implemented in every piece of software that handles text.

11.3.3. 콘텍스추얼 ( 팝업 ) 메뉴 11.3.3. Contextual ( popup ) menu

시스템에 의해 인에이블되는 동작의 몇몇은 유저 피드백을 요구하고, 그갓은 데이터를 취급하는 애의 콘텍스트내에서 최적으로 요구될 수 있다. 몇몇 실시예에서, 시스템은 통상적으로 몇몇 텍스트상에 우측 마우스 버튼을 클림하는 것과 연관된 애플리케이션 팝업 메뉴를 사용한다. 시스템은 그러한 메뉴에 여분의 옵션을 삽입하고, 그것들이 페이퍼 문서를 스캐닝하는 바와 같은 액티비티의 결과로서 디스플레이되어지게 한다. Some of the operations enabled by the system require user feedback, which may be optimally desired within the context of the data handler. In some embodiments, the system typically uses an application popup menu associated with clicking the right mouse button over some text. The system inserts extra options into such menus and allows them to be displayed as a result of the activity as scanning a paper document.

11.4. 웹/네트워크 인터페이스 11.4. Web / network interface

오늘날의 증대하는 네트워크화된 세계에서, 개별적인 머신에서 이용가능한 기능의 대부분은 네트워크를 통하여 액세스될 수 있고, 상기한 시스템과 연관된 기능은 어떠한 예외도 없다. 예로서, 사무실 환경에서 유저에 의해 수신된 다수의 페이퍼 문서는 동일한 합동 네트워크상에서 다른 유저의 기게에 의해 인쇄되어질 수 있다. 한 컴퓨터상의 시스템은, 캡쳐에 응답하여, 적절한 허가 컨트롤에 종속되어, 그 캡쳐에 대응할 수 있는 문서에 대해 다른 머신에 질의할 수 있다. In today's growing networked world, most of the functionality available on individual machines can be accessed through the network, and the functionality associated with such systems is no exception. As an example, multiple paper documents received by a user in an office environment may be printed by other users' machines on the same joint network. In response to the capture, a system on one computer may, depending on the appropriate permission controls, query the other machine for a document that may correspond to the capture.

11.5. 보관을 야기하는 문서의 인쇄 11.5. Printing of Documents That Cause Archiving

페이퍼 및 문서의 통합에서의 중요한 한 요인은 둘사이에서의 변환에 대해 가능한한 많은 정보를 유지하는 것이다. 몇몇 실시예에서, OS는 문서가 언제 눅에 의해서 인쇄었는 지에 대한 단일 레코드를 유지한다. 몇몇 실시예에서, OS는 시스템의 사용에 더욱 양ㅎ하게 적합하게 하는 하나이상의 츠가 액션을 취한다. 그 예들은 다음사항들을 포함한다. One important factor in the integration of paper and documents is to keep as much information as possible about the conversion between the two. In some embodiments, the OS maintains a single record of when the document was printed by Luke. In some embodiments, the OS takes one or more actions that make it more suitable for use of the system. Examples include the following:

·문서가 인쇄되었던 소스에 대한 정보와 함께 인쇄된 모든 문서의 디지털 렌더링된 버젼을 보관 Keep a digitally rendered version of every printed document with information about the source from which the document was printed.

·미래의 스캔 해석에 도움을 줄 수 있는-예로서 사용된 폰트 및 라인 끊김이 발생한 곳-인쇄된 버젼에 대한 유용한 정보의 서브셋트를 보관Keep a subset of useful information about the printed version, as well as the fonts used and where line breaks occurred, as an example to aid future scan interpretation.

·임의의 인쇄된 복사본과 연관된 소스 문서의 버젼을 보관Keep a version of the source document associated with any printed copy;

· 미래의 탐색을 위해 결과를 인쇄 및 저장시 자동적으로 문서를 인덱싱Automatically index documents when printing and saving results for future navigation

11.6. 나의(인쇄된/ 스캐닝된 ) 문서 11.6. My (Printed / Scanned ) Documents

OS는 흔히 특정 중요도를 갖는 파일 또는 폴더의 일정한 카테고리를 유지한 다. 유저의 문서는, 정해진 방식에 의해 또는 설계에 의해, 예로서 "나의 문서" 폴더에서 발견될 수 있다. 표준 파일-열기 다이얼로그는 자동적으로 최근에 열려진 문서의 리스트를 포함할 수 있다.The OS often maintains a certain category of files or folders of particular importance. The user's document can be found by way of design or by way of example in the "My Documents" folder. The standard file-open dialog may automatically include a list of recently opened documents.

상기한 시스템에의 사용을 위해 최적화된 OS상에서, 그러한 카테고리는 저장된 파일의 페이퍼 버젼과 유저의 상호작용을 고려하는 방식으로 향상 또는 증대될 수 있다. "나의 문서" 또는 "나의 최근-판독 문서"와 같은 카테고리는 유용하게 식별될 수 있고 그의 동작에 통합될 수 있다. On an OS optimized for use with such a system, such categories can be enhanced or augmented in a manner that takes into account the user's interaction with the paper version of the stored file. Categories such as "My Documents" or "My Recently-Read Documents" can be usefully identified and incorporated into their operation.

11.7. OS-레벨 마크업 계층구조 11.7. OS-level markup hierarchy

시스템의 중용한 태양은 섹션5에서 토의된 "마크업" 개념을 이용하여 제공되므로, OS 자체 및 복수 애플리케이션에 액세스가능했던 방식으로 OS에 의해 제공된 그런 마크업을 위한 지원체계를 갖는 것이 유익하다. 또한, 마크업의 층들은 OS가 제공할 수 있는 기능 및 그것의 제어하에서 문서에 대한 그것의 지식에 기초하여, OS에 의해 제공될 수 있다. Since the critical aspects of the system are provided using the "markup" concept discussed in Section 5, it would be beneficial to have a support mechanism for such markup provided by the OS in a manner that was accessible to the OS itself and multiple applications. Further, the layers of markup may be provided by the OS, based on the functionality that the OS can provide and its knowledge of the document under its control.

11.8. OS DRM 기능의 사용 11.8. Use of OS DRM Features

증가하는 운영체제의 수는 "디지털 권한 관리"; 특정 유저에게 허여된 권한에 따라 특정 데이터의 사용에 대한 제어 능력, 소프트웨어 엔티티 또는 머신의 몇몇 형태를 지원한다. 예로서, 특정 문서의 비인가된 복사 또는 배포를 금지할 수 있다. The growing number of operating systems includes "digital rights management"; It supports some form of software entity or machine control over the use of specific data, depending on the privileges granted to specific users. As an example, unauthorized copying or distribution of certain documents may be prohibited.

12. 유저 인터페이스 12. User Interface

시스템의 유저 인터페이스는 캡쳐 디바이스가 상대적으로 기능이 떨어지거나 케이블에 의해 연결된다면 전체적으로 PC상에 있을 수 있고, 그것이 정교하거나 그것 자체가 상당한 프로세싱 능력을 갖는다면 전체적으로 디바이스상에 있을 수 있다. 시스템의 기능의 일부 또는 전부는 모바일 폰 또는 PDA와 같은 기타 디바이스상에서 구현될 수 있다. The user interface of the system may be entirely on the PC if the capture device is relatively poorly functioning or connected by cable, or it may be entirely on the device if it is sophisticated or itself has significant processing power. Some or all of the functionality of the system may be implemented on other devices such as mobile phones or PDAs.

다음 섹션에서의 기술은 어떤 구현에서 바람직할 수 있는 것인가에 대한 것이지만, 그것이 모든 경우에 반드시 적절한 것은 아니고 여러 방식으로 수정될 수 있다. The description in the next section is about what implementation may be desirable, but it is not necessarily appropriate in all cases and can be modified in many ways.

12.1. 캡쳐 디바이스에서 12.1. capture On the device

모든 캡쳐 디바이스로, 그러나 특히 광학 스캐너인 경우에, 유저의 주의는 일반적으로 그 디바이스 및 스캐닝시의 페이퍼에 있게된다. 스캐닝의 프로세스의 일부로서 필요로되는 피드백과 임의의 입력은, 필요로 되는 것 이상으로, 예로서 컴퓨터의 스크린에서와 같은 곳에, 유저의 주의를 돌릴 것을 필요로 하지 않는 것이 바람직하다. With all capture devices, but especially in the case of optical scanners, the user's attention is generally placed on the device and on the paper during scanning. The feedback and any input required as part of the scanning process is preferably not required to distract the user beyond what is necessary, such as on a screen of a computer, for example.

12.1.1. 스캐너에 의한 피드백 12.1.1. Feedback by the scanner

휴대형 스캐너는 특정 조건에 대해 유저에게 피드백을 제공하는 다양한 방식을 갖는다. 가장 분명한 유형은 스캐너가 인디케이터 라이트 또는 풀 디스플레이를 갖는 경우에, 다이렉트 비주얼 유형, 및 스캐너가 경보음, 클릭 또는 기타 사운드를 갖는 경우에, 청각적 유형이다. 중요한 대안은 스캐너가 진동, 신호음, 또는 그렇지않으면 유저의 터치 센스를 흉내내는 경우에서의, 촉각 피드백, 및 컬러화된 광 스폿 내지 정교한 디스플레이까지의 어떤 것을 페이퍼에 트사함에 의해 싱태를 지시하는 투사 피드백을 포함한다. Handheld scanners have a variety of ways of providing feedback to the user about specific conditions. The most obvious type is the direct visual type when the scanner has an indicator light or full display, and an audible type when the scanner has an alarm, click or other sound. An important alternative is tactile feedback, in which case the scanner simulates vibration, beeps, or otherwise the user's touch sense, and projection feedback indicating the state by stripping something from the colored light spot to the sophisticated display. It includes.

디바이스에 제공될 수 있는 중요한 직접적 피드백은 다음 것들을 포함한다. Important direct feedback that may be provided to the device includes the following.

·스캐닝 프로세스에 의한 피드백 - 유저의 스캐닝이 과속, 지나치게 큰 각도이거나, 또는 특정 라인에서 지나치게 높거나 낮게 표류함Feedback by the scanning process-the user's scanning is overspeed, too large an angle, or drifting too high or too low on a particular line

·충분한 콘텐트 - 연결해제 동작에 중요한- 매치가 존재한다면 하나의 매치에 대한 발견이 매우 확실한 정도가 되도록 충분히 스캐닝되었음 Sufficient content-important for disconnect actions-if a match exists, the discovery of one match has been scanned sufficiently to ensure a certain degree

·공지된 콘텍스트 - 텍스트의 소스가 찾아졌음Known context-source of text found

·공지된 고유 콘텍스트 - 텍스트의 하나의 고유 소스가 찾아졌음Known Unique Context-One unique source of text was found

·콘텐트의 이용가능성 - 콘텐트가 유저에게 무료로 또는 유료로 이용가능한 지에 대한 지시 Availability of content-an indication of whether the content is available to the user for free or for a fee

예컨대 문서의 일부나 전부를 디스플레이하는데 충분한 성능을 가지고 있다면 시스템의 이후의 단계와 정상적으로 연관된 사용자 상호작용의 많은 부분이, 캡처 장치에서 일어날 수도 있다.For example, if there is sufficient performance to display some or all of the document, much of the user interaction normally associated with later stages of the system may occur at the capture device.

12.1.2. 스캐너 제어12.1.2. Scanner control

본 장치는 사용자가 기본적인 텍스트 캡처에 더하여 입력을 하기 위한 다양한 방법을 제공할 수 있다. 본 장치가 키보드와 마우스 같은 입력 옵션을 갖는 호스트 장치와 밀접히 연관되어 있을 때조차, 예컨대 스캐너를 조작하고 마우스를 사용하는 것 사이의 전후를 사용자가 스위칭하는 것이 파괴적일 수 있다.The device may provide a variety of methods for the user to input in addition to basic text capture. Even when the device is closely associated with a host device having input options such as a keyboard and a mouse, it may be disruptive for the user to switch back and forth between, for example, operating the scanner and using the mouse.

핸드헬드 스캐너는 버튼, 스크롤/조그 휠, 터치 감응면, 및/또는 장치의 움직임을 감지하기 위한 가속도계를 포함할 수 있다. 이 중 몇몇은 스캐너를 계속 유지하면서 보다 풍부한 상호작용 세트를 가능하게 한다.Handheld scanners may include buttons, scroll / jog wheels, touch sensitive surfaces, and / or accelerometers for sensing device movement. Some of these enable a richer set of interactions while still maintaining the scanner.

예컨대, 어떠한 텍스트를 스캐닝하는데 응하여, 본 시스템은 사용자에게 몇몇 가능한 매칭 문서 세트를 제공한다. 사용자는 스캐너 측면의 스크롤 휠을 사용하여 리스트중 하나를 선택하고 버튼을 클릭하여 선택을 확인한다.For example, in response to scanning any text, the system provides the user with some possible set of matching documents. The user selects one of the lists using the scroll wheel on the side of the scanner and confirms the selection by clicking the button.

12.1.3. 제스처12.1.3. gesture

종이 가운데로 스캐너를 이동시키는 주된 이유는 텍스트를 캡처하기 위한 것이지만, 몇몇 움직임은 장치로 탐지될 수 있고 사용자의 다른 의도를 나타내는데 사용될 수 있다. 이러한 움직임을 본 명세서에서는 '제스처'라고 한다.The main reason for moving the scanner in the middle of the paper is to capture text, but some movements can be detected by the device and used to indicate other intentions of the user. This movement is referred to herein as a gesture.

예컨대, 사용자는 종래 좌-우 순으로 처음 몇 단어를 스캐닝하고 마지막 몇단어를 역순, 즉 우-좌 순으로 스캐닝함으로써 넓은 영역의 텍스트를 지시할 수 있다. 사용자는 또한 스캐너를 몇 라인 위에서 페이지 아래로 이동시킴으로써 대상 텍스트의 수직 범위를 지시할 수도 있다. 이후 스캔은 이전에 스캔한 동작의 취소를 지시할 수 있다.For example, a user may indicate a large area of text by scanning the first few words in conventional left-right order and the last few words in reverse, ie right-left order. The user may also indicate the vertical range of the target text by moving the scanner down the page a few lines up. The scan may then indicate the cancellation of the previously scanned operation.

12.1.4. 온라인/오프라인 동작12.1.4. Online / offline behavior

본 시스템의 많은 태양은 스캐너와 호스트 랩탑 같은 시스템 구성요소 사이 또는 회사의 데이터베이스와 인터넷 검색으로의 접속의 형태로 외부 세계와의 네트워크 접속성에 의존적일 수 있다. 그러나, 이러한 접속성은 항상 존재할 수 있는 것은 아니고 따라서 시스템의 일부 또는 전부가 "오프라인"이 되도록 간주될 수 있는 경우가 일을 것이다. 본 시스템을 이러한 상황에서 유용하게 계속 기능하도록 하는 것이 바람직하다.Many aspects of the system may rely on network connectivity between the system components, such as scanners and host laptops, or with the outside world in the form of connections to company databases and Internet searches. However, it will be the case that such connectivity may not always exist and thus may be considered to be "offline" of some or all of the system. It is desirable to keep the system useful in this situation.

본 장치는 다른 부분과 접촉하고 있지 않을 때 텍스트를 캡처하는데 사용될 수 있다. 매우 간단한 장치는 단순히 캡처, 이상적으로는 캡처된 때를 지시하는 타임스탬프와 연관된 이미지 또는 오디오 데이터를 저장할 수 있다. 본 장치가 본 시스템의 나머지 부분과 접촉하여 있을때 다양한 캡처가 여기에 업로드될 수 있고 이후 처리될 수 있다. 본 장치는 예컨대 광학 스캔과 연관된 음성 주석이나 위치 정보등 캡처와 연관된 기타 데이터를 업로드할 수도 있다.The device can be used to capture text when not in contact with another part. Very simple devices can simply store image or audio data associated with a timestamp that ideally indicates when it was captured. When the device is in contact with the rest of the system, various captures can be uploaded here and then processed. The device may also upload other data associated with the capture, such as voice annotations or location information associated with the optical scan.

보다 복잡한 장치는 연결되어 있지 않을때에도 시스템 동작의 일부 또는 전부를 스스로 수행할 수 있다. 이렇게 하는 성능을 개선시키기 위한 다양한 기술이 15.3.절에 설명되어 있다. 때로는 원하는 동작의 전부가 아닌 일부가 오프라인에 있는동안에 수행될 수 있는 경우가 있다. 예컨대, 텍스트는 인식될 수 있지만, 소스 식별은 인터넷 기반 검색 엔진과의 접속에 의존적일 수 있다. 따라서, 몇몇 실시예에서는, 본 장치는 본 시스템의 나머지 부분이 연결이 복구될때 효과적으로 진행하도록 각각의 동작이 얼마나 많이 진행했는지에 대한 충분한 정보를 저장한다.More complex devices can perform some or all of the system's operations themselves, even when not connected. Various techniques for improving the performance of this are described in Section 15.3. Sometimes some but not all of the desired actions can be performed while offline. For example, text may be recognized, but source identification may be dependent on a connection with an internet based search engine. Thus, in some embodiments, the apparatus stores enough information about how much each operation has progressed so that the rest of the system proceeds effectively when the connection is restored.

본 시스템의 동작은 일반적으로 즉시 이용가능한 연결에 의하여 유용하지만, 몇몇 캡처를 수행하여 이것을 배치로 처리하는 것이 이로울 수 있는 몇몇 상황이 있다. 예컨대, 아래 13절에서 설명되어 있는 바와 같이, 특정 캡처 원의 식별은 거의 동시에 사용자에 의해 수행된 다른 캡처를 검사함으로써 크게 강화될 수 있다. 사용자에게 라이브 피드백이 제공되는 완전 연결 시스템에서는, 시스템이 현재의 캡처를 처리할 때 과거의 캡처를 이용할 수 있을 뿐이다. 그러나 캡처가 오프라인에 있을 때 장치에 의해 저장된 배치 중 하나이면, 본 시스템은 이러한 분석 을 수행할때 이전의 것은 물론 이후의 캡처로부터 이용가능한 임의의 데이터를 고려할 수 있을 것이다.The operation of the system is generally useful by means of ready-to-use connections, but there are some situations where it may be beneficial to take some captures and process them in batches. For example, as described in section 13 below, the identification of a particular capture source can be greatly enhanced by examining other captures performed by the user at about the same time. In a fully connected system where live feedback is provided to the user, past captures can only be used when the system processes the current capture. However, if the capture is one of the batches saved by the device when offline, the system may consider any data available from previous and subsequent captures when performing this analysis.

12.2. 호스트 장치로12.2. As host device

스캐너는 종종 사용자와 보다 상세한 상호작용을 포함한 시스템의 많은 기능을 수행하기 위하여 PC, PDA, 전화기 또는 디지털 카메라등의 몇몇 다른 장치와 통신한다.Scanners often communicate with some other device such as a PC, PDA, telephone or digital camera to perform many of the functions of the system, including more detailed interactions with the user.

12.2.1. 캡처에 응해 수행된 동작12.2.1. Action taken in response to capture

호스트 장치가 캡처를 수신할 때, 다양한 동작을 개시할 수 있다. 로케이팅이후 시스템이 수행한 가능한 동작의 불완전 목록과 이러한 캡처와 연관된 전자 문서 사본 및 문서내 위치가 잇따른다.When the host device receives the capture, it can initiate various operations. Following locating, there is an incomplete list of possible actions the system has performed, along with a copy of the electronic document and location within the document associated with this capture.

● 캡처의 상세한 정보를 사용자 내역에 저장할 수 있다.(6.1 절)● Detailed information of the capture can be stored in the user history (Section 6.1).

● 로컬 스토리지 또는 원격지에서 문서를 검색할 수 있다.(8절)● Documents can be retrieved from local storage or remote (section 8).

● 동작 시스템의 메타데이터 및 문서와 연관된 기타 기록을 업데이트할 수 있다(11.1 절)Update the metadata of the operating system and other records associated with the document (section 11.1);

● 다음의 적절한 동작을 결정하기 위해 문서와 연관된 마크업을 검사할 수 있다.(5 절)• You can examine the markup associated with the document to determine the appropriate action to follow (section 5).

● 소프트웨어 애플리케이션을 개시하여 문서 편집, 보기 또는 기타 동작을 수행할 수 있다. 애플리케이션 선택은 소스 문서 또는 스캔의 콘텐츠이거나 캡처의 몇몇 다른 태양에 좌우된다.(11.2.2, 11.2.3 절)● Launch a software application to perform document editing, viewing, or other operations. Application selection depends on the content of the source document or scan, or on some other aspect of capture (Sections 11.2.2, 11.2.3).

● 애플리케이션을 스크롤하여 하이라이트하거나 삽입점을 이동시키거나 캡 처 위치를 지시할 수 있다.(11.3 절) ● You can scroll through the application to highlight it, move the insertion point, or indicate the capture location (section 11.3).

● 캡처된 텍스트의 정확한 경계를 수정하여, 예컨대 캡처된 텍스트 주위의 전체 단어, 문장 또는 절을 선택할 수 있다.(11.3.2 절)You can modify the exact boundaries of the captured text, for example to select whole words, sentences or phrases around the captured text (section 11.3.2).

● 사용자에게 캡처 텍스트를 클립보드에 복사하거나 기타 표준 동작 시스템 또는 이에 대한 애플리케이션 특정 동작을 수행하기 위한 옵션을 제공할 수 있다.● Give the user the option to copy the capture text to the clipboard or to perform other standard operating systems or application specific actions on it.

● 문서 또는 캡처된 텍스트에 주석을 연관시킬 수 있다. 이것은 사용자 입력을 통해 바로 되거나 예컨대 광학 스캔과 연관된 음성 주석의 경우에는 먼저 캡처되었을 수 있다.(19.4 절)● You can associate comments with documents or captured text. This may be done directly through user input or may have been captured first in the case of a voice annotation, eg associated with an optical scan (section 19.4).

● 사용자가 선택하는 또 다른 가능한 동작 세트를 결정하기 위하여 마크업을 검사할 수 있다.Markup can be examined to determine another possible set of actions for the user to select.

12.2.2. 상황 팝업 메뉴12.2.2. Context popup menu

때로는 시스템이 취하는 적당한 동작이 명확하지만, 어떤 경우는 사용자의 선택을 요한다. 이를 수행하는 하나의 좋은 방법은 "팝업 메뉴"를 사용하거나, 콘텐츠가 스크린상에 디스플레이되는 경우에는 콘텐츠에 가까이 나타나는 소위 "컨텍스트 메뉴"에 의하는 것이다.(11.3.3 절 참조). 몇몇 실시예에서는, 스캐너 장치가 종이 문서사아에 팝업 메뉴를 띄운다. 사용자는 키보드와 마우스등의 전통적인 방법을 사용하는 메뉴로부터 또는 캡처 장치상의 제어부를 사용함으로써(12.1.2 절), 제스처를 사용함으로써(12.1.3 절), 또는 스캐너를 사용하는 컴퓨터 디스플레이와의 상호작용에 의해서(12.2.4 절) 선택할 수 있다. 몇몇 실시예에서는, 캡처 결과로 나타날 수 있는 팝업 메뉴는 사용자가 응답하지 않으면 나타나는, 예컨대 사용자가 메뉴를 무시하고 또 다른 캡처를 수행하는 경우에 일어나는 동작을 나타내는 디폴트 항목을 포함한다.Sometimes the proper action the system takes is clear, but in some cases it's up to the user. One good way to do this is to use a "pop-up menu" or, if the content is displayed on the screen, a so-called "context menu" that appears close to the content (see section 11.3.3). In some embodiments, the scanner device displays a pop-up menu on the paper document. The user may interact with the computer display using a scanner, either from a menu using traditional methods such as a keyboard and mouse, or by using controls on the capture device (section 12.1.2), by using gestures (section 12.1.3), or by using a scanner. It can be chosen by action (Section 12.2.4). In some embodiments, the popup menu that may appear as a result of the capture includes a default item that appears if the user does not respond, e.g., an action that occurs when the user ignores the menu and performs another capture.

12.2.3. 명확화에 대한 피드백12.2.3. Feedback on clarification

사용자가 텍스트 캡처를 개시하면, 먼저 매칭할 수 있는 몇몇 문서나 기타 텍스트의 위치가 있을 것이다. 보다 많은 텍스트가 캡처되고 기타 요인을 고려하면(13절), 후보 위치의 수는 실제 위치가 식별될때까지 줄어들거나 사용자의 입력이 없이는 더이상의 명확화는 가능하지 않다. 몇몇 실시예에서, 시스템은, 문서 또는 예컨대 리스트, 썸네일 이미지 또는 텍스트 세그먼트 형태로 그리고 캡처가 계속될때 수를 줄이기 위해 디스플레이내의 엘리먼트 수에 대하여 발견되는 위치를 실시간으로 디스플레이한다. 몇몇 실시예에서는, 시스템은 모든 후보 문서의 썸네일을 디스플레이하는데, 이러한 썸네일의 크기와 위치는 정확한 매칭이 있는 확률에 좌우된다.When the user initiates text capture, there will be some document or other text location that can be matched first. As more text is captured and other factors are taken into account (Section 13), the number of candidate locations is reduced until the actual location is identified or no further clarification is possible without user input. In some embodiments, the system displays in real time the locations found for the number of elements in the display in the form of a document or for example in the form of a list, thumbnail image or text segment and to reduce the number as capture continues. In some embodiments, the system displays thumbnails of all candidate documents, the size and position of which thumbnails depend on the probability of an exact match.

캐처가 명확하게 식별되면, 예컨대 청각적 피드백을 사용하여 이 사실을 사용자에게 강조할 수 있다.Once the catcher is clearly identified, it can be emphasized to the user, for example using audio feedback.

때때로 캡처된 텍스트는 많은 문서에서 나타나고 인용구가 되도록 인식될 것이다. 본 시스템은 예컨대, 원본 소스 문서주위에 인용 참조를 포함하는 문서를 그룹화함으로써 스크린상에 이것을 지시할 수 있다.Sometimes the captured text will appear in many documents and will be recognized as a quotation. The system may indicate this on the screen, for example, by grouping documents containing cited references around the original source document.

12.2.4. 스크린으로부터 스캐닝12.2.4. Scanning from the screen

몇몇 광학 스캐너는 종이는 물론 스크린상에 디스플레이된 텍스트를 캡처할 수 있다. 따라서, '렌더링된 문서'라는 말은 본 명세서에서 종이 인쇄물이 렌더링 의 유일한 형태가 아니고 시스템에서 사용하기 위한 텍스트나 심볼의 캡처 또한 텍스트가 전자 디스플레이상에 디스플레이될 때 똑같이 가치있을 수 있다는 것을 나타내는데 사용된다.Some optical scanners can capture text displayed on the screen as well as paper. Thus, the term 'rendered document' is used herein to indicate that paper prints are not the only form of rendering and that the capture of text or symbols for use in the system may also be equally valuable when the text is displayed on an electronic display. do.

상기한 시스템의 사용자는 옵션 리스트를 선택하는 등 다양한 다른 이유로 컴퓨터 스크린과 상호작용할 필요가 있을 수 있다. 사용자가 스캐너를 두고 마우스나 키보드를 사용하여 개시하는 것은 불편할 수 있다. 다른 절에서 이러한 툴을 변화시킬 필요없이 입력하는 방법으로서 스캐너(12.1.2 절에서 설명) 또는 제스처(12.1.3절에서 설명)의 물리적 제어부를 설명했지만, 스크린 자체에 있는 스캐너를 사용하여 몇몇 텍스트나 심볼을 스캐닝하는 것은 본 시스템이 제공하는 중요한 대안이다.The user of such a system may need to interact with the computer screen for a variety of different reasons, such as selecting a list of options. It may be inconvenient for the user to place the scanner and initiate it using a mouse or keyboard. While other sections have described the physical controls of a scanner (described in section 12.1.2) or a gesture (described in section 12.1.3) as a way to enter these tools without having to change them, some text can be obtained using the scanner on the screen itself. B Scanning symbols is an important alternative provided by this system.

몇몇 실시예에서, 스캐너의 광학부에 의해 라이트펜과 마찬가지 방식으로 사용되어, 컴퓨터의 특별한 하드웨어나 소프트웨어의 도움으로 텍스트를 실제로 스캐닝할 필요없이 스크린상의 위치를 바로 감지할 수 있다.In some embodiments, the optics of the scanner can be used in the same manner as the light pen, so that with the help of the special hardware or software of the computer, the position on the screen can be detected directly without the need to actually scan the text.

13. 컨텍스트 설명13. Context Description

상기 시스템의 중요한 태양은 사용하는 문서를 식별하는데 도움이되도록 텍스트열의 단순 캡처이상의 다른 요인을 사용한다는 것이다. 적당한 양의 텍스트의 캡처는 종종 문서를 유일하게 식별할 수 있지만 많은 경우에는 몇몇 후보 문서를 식별할 것이다. 하나의 해결책은 사용자가 스캔되는 문서를 확인하도록 하는 것이지만 바람직한 대안의 방법은 자동적으로 가능성을 좁히도록 하는 기타 요인을 사용하는 것이다. 이러한 보충 정보에 의해 캡처를 요하는 문서의 양을 상당히 줄일 수 있고 그리고/또는 전자 사본의 위치가 식별될 수 있는 신뢰성과 속도를 상당히 증가시킬 수 있다. 이러한 여분의 재료를 "컨텍스트"라고 부르고 4.2.2.절에서 간단히 설명하였다. 이후 보다 깊이 알아보기로 한다.An important aspect of the system is that it uses other factors beyond simple capture of text strings to help identify the documents used. Capturing an appropriate amount of text will often uniquely identify the document, but in many cases will identify some candidate documents. One solution is to allow the user to confirm the document being scanned, but the preferred alternative is to use other factors that automatically narrow down the possibilities. Such supplemental information can significantly reduce the amount of documents requiring capture and / or significantly increase the reliability and speed with which the location of the electronic copy can be identified. This extra material is called "context" and is briefly described in Section 4.2.2. We will look deeper later.

13.1. 시스템 및 캡처 컨텍스트13.1. System and capture context

아마도 이러한 정보의 가장 중요한 예는 사용자의 캡처 내역일 것이다.Perhaps the most important example of this information is the user's capture history.

소정의 캡처는 이전의 것과 같은 문서에서 오거나, 이전의 캡처가 마지막 몇분내에 일어난다면 특히 연관된 문서에서 왔을 가능성이 높다(6.1.2 절). 반대로, 시스템이 두 스캔간에 폰트가 변했다는 것을 감지하면, 다른 문서에서 왔을 가능성이 높다.It is likely that a given capture comes from the same document as the previous one, or especially from the associated document if the previous capture occurred in the last few minutes (section 6.1.2). Conversely, if the system detects that the font has changed between two scans, it is likely that it is from another document.

사용자의 장기간의 캡처 내역 및 독서 습관 또한 유용하다. 이것은 사용자의 관심과 연관성 모델을 개발하는데 사용할 수도 있다.The user's long-term capture history and reading habits are also useful. It can also be used to develop a model of interest and association of users.

13.2. 사용자의 실세상 컨텍스트13.2. User's Real Context

유용한 컨텍스트의 또 다른 예는 사용자의 지리적 위치이다. 예컨대, 파리의 사용자는 시애틀 타임즈보다는 르몽드를 읽을 확율이 훨씬 더 높다. 따라서 문서의 인쇄 버전의 타이밍, 크기 그리고 지리적 배포는 중요할 수 있고, 시스템의 동작에서 어느 정도 추측할 수 있다.Another example of a useful context is the user's geographic location. For example, users in Paris are much more likely to read Le Monde than the Seattle Times. Thus, the timing, size, and geographical distribution of the printed version of the document can be important and can be speculated to some extent in the operation of the system.

예컨대 출근하는 동안 항상 하나의 타입의 출판물을 읽고, 점심시간이나 퇴근하는 동안 열차안에서 다른 것을 읽는 사용자의 경우에는 하루의 시간이 또한 관련될 수 있다.For example, the time of day may also be relevant for a user who always reads one type of publication while going to work and who reads something else on the train during lunchtime or leaving home.

13.3. 관련 디지털 컨텍스트13.3. Related digital context

보다 종래적 수단에 의해 써칭되고 검색된 것을 포함하는 사용자의 최근의 전자 문서 사용 또한 유용한 지시자일 수 있다.User's recent use of electronic documents, including those searched and retrieved by more conventional means, may also be useful indicators.

회사 네트워크등 몇몇 경우에는 다음과 같이 다른 요인이 유용한 것으로 간주될 수 있다.In some cases, such as a corporate network, other factors can be considered useful:

● 문서가 최근에 인쇄되었는가?● Has the document been recently printed?

● 문서가 최근에 회사 파일 서버에서 수정되었는가?● Has the document been recently modified on a corporate file server?

● 문서가 최근에 이메일로 송부되었는가?● Has the document been recently emailed?

이러한 예 전부는 사용자가 종이 버전 문서를 더 많이 읽는 것같다는 것을 암시하고 있을 수 있다. 반대로, 문서가 있는 매점이 인쇄되었을 수 있는 임의의 장소로 송부되었거나 결코 문서가 인쇄되지 않았음을 확인할 수 있으면, 종이에서 기원하는 임의의 검색에서 안전하게 제거될 수 있다.All of these examples may imply that the user is likely to read more paper version documents. Conversely, if a canteen with a document can be sent to any place where it may have been printed or can be confirmed that the document has never been printed, it can be safely removed from any search originating from the paper.

13.4. 기타 통계-글로벌 컨텍스트13.4. Miscellaneous Statistics-Global Context

14절은 종이 기반 검색에서 나온 데이터열의 분석을 다루고 있지만, 여기서 다른 독자가 있는 문서의 평판, 이러한 평판의 타이밍, 그리고 가장 빈번히 스캔되는 문서의 일부에 대한 통계의 전부는 검색 프로세스에 유용할 수 있는 더많은 요인의 예임을 인식하여야한다. 본 시스템은 종이 세상에 구글 형태의 페이지 랭킹을 가능하게 한다.Section 14 covers the analysis of data streams from paper-based retrieval, but here all of the statistics on the reputation of documents with other readers, the timing of those reputations, and some of the documents that are most frequently scanned may be useful for the retrieval process. It should be recognized that this is an example of more factors. The system enables Google to rank pages in the paper world.

검색 엔진에 대한 컨텍스트의 몇몇 다른 의미에 대하여 4.2.2절을 또한 참조하자.See also Section 4.2.2 for some other meanings of the context for search engines.

14. 데이터열 분석14. Data string analysis

본 시스템의 사용의 부작용으로서 지나치게 가치있는 데이터열을 생성한다. 이러한 데이터열은 사용자가 무엇을 언제 읽는지에 대한 기록이고 많은 경우에 있어서는 사용자가 읽고 있는 것에서 특히 가치있다고 발견하는 것에 대한 기록이다. 이러한 데이터는 이전에 종이 문서에서는 결코 진정으로 이용될 수 없었다.As a side effect of the use of this system, it generates excessively valuable data streams. This data string is a record of what the user reads when and in many cases a record of what the user finds especially valuable in what they are reading. Such data has never been truly available in paper documents before.

이러한 데이터가 시스템 및 시스템의 사용에 유용할 수 있는 몇가지 방식을 6.1절에서 설명하고 있다. 이 절에서는 기타 사용에 대하여 중점을 두고 있다. 물론 사람들이 읽고 있는 것에 대한 임의의 데이터 배포로 고려될 수 있는 중요한 프라이버시 문제가 있지만, 데이터의 익명을 보존하는 것과 같은 문제는 당업자에게 주지된 사실이다.Some of the ways in which this data can be useful for the system and its use are described in Section 6.1. This section focuses on other uses. Of course there are important privacy issues that can be considered as any data distribution to what people are reading, but issues such as preserving the anonymity of the data are well known to those skilled in the art.

14.1. 문서 추적14.1. Document tracking

소정의 사용자가 어떤 문서를 읽고 있는지를 시스템이 알고 있다면 누가 소정의 문서를 읽고 있는지 또한 시스템이 추론할 수 있다. 이것은 조직을 통한 문서의 추적을 가능하게 하여, 예컨대 누가 언제 문서를 읽는지, 얼마나 널리 배포되어 있는지, 배포에 얼마나 오래 걸리는지, 그리고 철지난 사본으로부터 다른 사람들이 작업하고 있는 한편 현재의 버전을 누가 보았는지의 분석을 가능하게 한다.If the system knows which document a given user is reading, the system can also infer who is reading a given document. This enables tracking of documents throughout the organization, for example, who reads the document, how widely distributed it takes, how long it takes to distribute, and who has seen the current version while others are working on it from an outdated copy. Enable analysis.

보다 넓은 분포를 갖는 출판 문서에 대하여, 개개의 사본의 추적은 보다 어렵지만, 독자층의 분포 분석은 여전히 가능하다.For published documents with a wider distribution, tracking individual copies is more difficult, but analysis of the distribution of readership is still possible.

14.2. 읽기 랭킹-문서 및 소구역의 인기14.2. Reading ranking-popularity of documents and subregions

사용자가 그들에게 특별한 관심이 있는 텍스트나 데이터를 캡처하는 상황에서, 본 시스템은 특정 문서의 인기와 이러한 문서의 소구역의 인기를 추론할 수 있 다. 이것은 시스템 자체(4.2.2 절)와 작가, 출판업자 그리고 광고자(7.6절, 10.5절)에 대한 중요한 정보원에 가치있는 입력을 형성한다. 이러한 데이터는 또한, 예컨대 렌더링된 문서로부터의 조회에 대한 검색 결과에 순위를 매기는 것을 돕고, 그리고/또는 웹브라우저내에 타이핑된 종래의 조회에 순위를 매기는 것을 돕기 위해 검색 엔진과 검색 인덱스에서 통합될 때 유용하다.In situations where a user captures text or data of particular interest to them, the system can infer the popularity of a particular document and the popularity of a subregion of that document. This forms valuable input to the system itself (Section 4.2.2) and important sources of information for writers, publishers and advertisers (Section 7.6, 10.5). Such data can also be integrated in search engines and search indexes, for example, to help rank search results for queries from rendered documents, and / or to rank prior queries typed within a web browser. Useful when

14.3. 사용자 분석-프로파일 생성14.3. User Analysis-Profile Creation

사용자가 무엇을 읽고 있는지를 앎으로써 시스템이 사용자의 관심과 활동의 상당히 상세한 모델을 생성하게 할 수 있게 한다. 이것은 개괄적 통계 기초에 유용할 수 있지만-예컨대, "이러한 신문을 구매하는 사용자의 35%가 그 작가의 최근의 책 또한 읽는다"따위, 하기하는 바와 같이 개인 사용자와 기타 상호작용을 가능하게할 수도 있다.Knowing what the user is reading allows the system to generate a fairly detailed model of the user's interests and activities. This may be useful for a general statistical basis—for example, it may enable other interactions with individual users, as described below, such as “35% of users who purchase such newspapers also read the author's recent books”. .

14.3.1. 사회적 네트워킹14.3.1. Social networking

일 예는 일 사용자를 관련된 관심을 가진 다른 사람과 연결하는 것이다. 이것은 사용자에게 이미 알려진 사람일 수 있다. 본 시스템은 한 대학 교수에게, "XYZ대학에 있는 당신 동료도 이 신문을 읽고 있다는 것을 알았습니까?"라고 물을 수 있다. 본 시스템은 사용자에게 "당신은 또한 제인에어를 읽고있는 당신의 이웃과 연결되기를 원합니까?"라고 물을 수 있다. 이러한 연결은 물리적인 세계에서나 온라인에서 북클럽과 친밀한 사회적 구조를 자동적으로 형성하는데 기반이 될 수 있다.One example is to connect one user with another person with a related interest. This may be a person already known to the user. The system can ask a university professor, "Did you know your colleague at XYZ University is reading this paper?" The system may ask the user, "Do you also want to connect with your neighbor who is reading Jane Air?" This connection can be the basis for the automatic formation of intimate social structures with book clubs, both in the physical world and online.

14.3.2. 마케팅14.3.2. Marketing

10.6절에서 이미 시스템과의 상호작용에 기초하여 제품과 서비스를 개별 사용자에게 제공하는 아이디어에 대하여 언급하였다. 예컨대 현재의 온라인 책판매자는 종종 책판매자와의 이전의 상호작용에 기초하여 사용자에게 추전한다. 이러한 추천은 실제의 책과의 상호작용에 기초될 때 훨씬 더 유용하게 된다.In Section 10.6, we have already mentioned the idea of providing products and services to individual users based on their interaction with the system. For example, current online book sellers often recommend users based on their previous interactions with the book seller. This recommendation becomes even more useful when based on the actual interaction with the book.

14.4. 데이터열의 기타 태양에 기초한 마케팅14.4. Marketing based on other aspects of the data stream

본 시스템이 문서를 출판하는 사람, 문서를 통해 광고하는 사람, 그리고 종이로 개시된 기타 판매에 영향을 미칠 수 있는 방법 중 몇 가지가 설명되었다(10 절). 몇몇 상업적 활동은 결국 종이 문서와 직접적 상호작용을 가지지는 않지만 영향을 받을 수는 있다. 예컨대, 하나의 커뮤니티의 사람들이 보다 많은 시간을 신문의 금융부문보다 스포츠부문을 읽는데 할애하고 있다는 것을 아는 것이 헬쓰클럽을 하려고하는 사람에게 관심이 있을 수 있다.Some of the ways in which the system can affect the publishing of documents, the advertising of the documents, and other sales initiated by paper have been described (v. 10). Some commercial activities eventually do not have direct interaction with paper documents, but may be affected. For example, it may be of interest to a person trying to do a health club to know that people in one community spend more time reading the sports sector than the financial sector of the newspaper.

14.5. 캡처될 수 있는 데이터 타입14.5. Data types that can be captured

누가 어떤 문서의 어느 정도를 언제 어디서 읽는지와 같은 상기 통계에 더하여, 문서가 위치되었는지 여부에 관계없이 캡처된 문서의 실제 콘텐츠를 검사하는 것이 관심있을 수 있다.In addition to the above statistics, such as who reads what extent and when of a document, it may be of interest to examine the actual content of the captured document regardless of whether the document is located.

많은 경우에, 사용자는 몇몇 문서를 캡처할 뿐만 아니라 결과로서 몇몇 활동이 일어나게 할 것이다. 이것은, 예컨대 아는 사람에게 문서에 대한 레퍼런스를 이메일로보내는 것일 수 있다. 사용자나 이메일 수신자의 식별 정보가 없을때에도 누군가 이 문서를 이메일보낼 가치가 있는 것으로 간주했다는 것을 아는 것은 매우 유용하다.In many cases, the user will not only capture some documents but also cause some activity to occur as a result. This could be, for example, emailing a reference to a document to an acquaintance. It is very useful to know that someone considered this document worth sending an email even when no user or email recipients were identified.

텍스트의 특정 문서나 일부의 가치를 추론하기 위하여 설명한 다양한 방법에 더하여, 몇몇 경우에는 사용자가 이에 등급을 할당함으로써 가치를 외부적으로 지시할 것이다.In addition to the various methods described for inferring the value of a particular document or portion of text, in some cases the user will externally indicate the value by assigning a rating to it.

마지막으로, 특정 사용자 세트가 그룹을 형성하기 위해 알려져 있다면, 예컨대 그 사용자들이 특정 회사의 사원에게 알려져 있다면, 상기 그룹의 집합적 통계가 그 그룹에 대한 특정 문서의 중요성을 추론하는데 사용될 수 있다.Finally, if a particular set of users is known to form a group, such as if the users are known to an employee of a particular company, then the collective statistics of that group can be used to infer the importance of a particular document for that group.

15. 장치 특징 및 기능15. Device features and functions

시스템에 사용하기 위한 캡처 장치는 문서의 렌더링된 버전으로부터 텍스트를 캡처하는 방식 정도만을 필요로 한다. 상기한 바와 같이(1.2 절), 이러한 캡처는 문서의 일부의 사진을 찍거나 이동 전화 키보드에 몇몇 문자를 타이핑하는 것을 포함한 다양한 방법을 통해 수행될 수 있다. 이러한 캡처는 한번에 한 라인 또는 두 텍스트를 기록할 수 있는 소형 핸드헬드 광학 스캐너이거나 사용자가 문서로부터 텍스트를 판독하는 음성 레코드 등의 음성 캡처 장치를 사용하여 수행될 수 있다. 사용된 장치는 이들, 예컨대 음성 주석 또한 기록할 수 있는 광학 스캐너,의 조합일 수 있고 캡처링 기능은 이동 전화, PDA, 디지털 카메라 또는 휴대용 뮤직 플레이어등의 몇몇 다른 장치에 내장될 수 있다.A capture device for use with the system only needs a way of capturing text from the rendered version of the document. As noted above (section 1.2), such capture can be performed through a variety of methods, including taking a picture of a portion of the document or typing some characters on a mobile phone keyboard. Such capture may be performed using a small handheld optical scanner capable of recording one line or two text at a time, or using a voice capture device such as a voice record in which the user reads text from a document. The device used can be a combination of these, such as an optical scanner, which can also record voice annotations, and the capturing function can be embedded in some other device such as a mobile phone, PDA, digital camera or portable music player.

15.1. 입출력15.1. I / O

본 장치용의 많은 유용한 부가 입출력 장치를 12.1절에서 설명하였다. 이것은 입력을 위하여 버튼, 스크롤 휠 및 터치 패드를 포함하고 출력을 위해 디스플레이, 지시광, 음성 및 촉각 변환기를 포함한다. 때때로 본 장치는 이들의 대부분을 부가하거나 거의 부가하지 않을 수 있다. 때로는 캡처 장치는, 예컨대 무선 링크를 사용하여 이것을 이미 구비하고 있는 다른 장치와 통신할 수 있을 것이고(15.6절), 때로는 캡처 기능이 이러한 기타 장치에 부가될 것이다(15.7절).Many useful additional I / O devices for this device are described in Section 12.1. It includes buttons, scroll wheels and touch pads for input and a display, indicator light, voice and tactile transducers for output. Sometimes the device may add most or little of them. Sometimes the capture device will be able to communicate with other devices already equipped with it, for example using a wireless link (section 15.6), and sometimes a capture function will be added to this other device (section 15.7).

15.2. 연결성15.2. Connectivity

몇몇 실시예에서, 본 장치는 시스템 자체의 대부분을 구현한다. 그러나 몇몇 실시예에서는 본 장치는 종종 PC나 다른 컴퓨팅 장치와 통신하고 통신 시설을 사용하여 보다 넓은 세계와 통신한다.In some embodiments, the apparatus implements most of the system itself. However, in some embodiments, the device often communicates with a PC or other computing device and uses a communications facility to communicate with the wider world.

대개 이러한 통신 시설은 이더넷, 802.11 또는 UWB등의 범용 데이터망의 형태이거나 USB, IEEE-1394(파이어와이어), 블루투스^TM 또는 적외선등의 표준 주변접속망의 형태로 된다. 파이어와이어나 USB등의 유선 접속이 사용되면, 본 장치는 동 접속을 통해 전기를 공급받을 수 있다. 몇몇 경우에는, 캡처 장치가 접속된 기계에 나타나서 USB저장 장치등 종래의 주변장치로 될 수 있다.Typically, these communication facilities are in the form of general-purpose data networks such as Ethernet, 802.11, or UWB, or in the form of standard peripheral networks such as USB, IEEE-1394 (Firewire), Bluetooth ^™, or infrared. When a wired connection such as Firewire or USB is used, the device can be supplied with electricity through the connection. In some cases, the capture device may appear on a connected machine and become a conventional peripheral such as a USB storage device.

마지막으로, 본 장치는 몇몇 경우에 이 장치와 결합하여 사용되거나 편리한 저장을 위해 또 다른 장치와 "도킹"할 수 있다.Finally, the device may in some cases be used in conjunction with this device or "dock" with another device for convenient storage.

15.3. 캐싱 및 기타 온라인/오프라인 기능15.3. Caching and Other Online / Offline Features

3.5절 및 12.1.4절은 분리된 동작을 주로 다루었다. 캡처 장치가 전체 시스템 기능의 제한된 서브셋을 가지고 시스템의 다른 부분과 통신하지 않는다면, 이용가능한 기능이 때로 줄어들더라도 본 장치는 여전히 유용할 수 있다. 가장 간단한 단계에서, 본 장치는 캡처되는 원 이미지나 음성 데이터를 기록할 수 있고 이후 이 것은 처리될 수 있다. 그러나 사용자를 위하여, 캡처된 데이터가 손작업에 충분할 것같은지 여부, 인식될 수 있는지 또는 인식될 수 있을 것 같은지 여부, 그리고 데이터원이 식별될 수 있는지 또는 이후 식별될 수 있을 것 같은지에 대하여 가능한 곳에 피드백을 제공하는 것이 중요할 수 있다. 사용자는 이후 캡처링 활동이 가치있는 것인지를 알 것이다. 상기한 모든것이 알려지지않더라도, 원데이터는 여전히 저장될 수 있고 따라서, 적어도 사용자는 이후 이것을 참조할 수 있다. 사용자는 예컨대 스캔이 OCR프로세스에 의해 인식될 수 없을때 스캔 이미지를 제공받을 수 있다.Sections 3.5 and 12.1.4 dealt mainly with separate operations. If the capture device does not communicate with other parts of the system with a limited subset of the overall system functionality, the device may still be useful even if the available functionality is sometimes reduced. In the simplest step, the device can record the original image or audio data to be captured and this can then be processed. However, for the user, feedback where possible about whether the captured data is likely to be sufficient for handwork, whether it can be recognized or likely to be recognized, and whether the data source can be identified or later identified. It may be important to provide The user will then know whether the capturing activity is valuable. Even if all of the above is unknown, the raw data can still be stored and thus at least the user can refer to it later. The user may be provided with the scanned image, for example, when the scan cannot be recognized by the OCR process.

이용가능한 옵션의 범위의 일부를 설명하기 위해, 오히려 최소 광학 스캐닝 장치와 이후 훨씬 많은 풀기능의 스캐닝 장치 둘을 이하에 설명하고 있다. 많은 장치는 이러한 두개 사이의 중간 지점을 차지하고 있다.To illustrate some of the range of options available, rather the following describes a minimal optical scanning device and then a much more full-featured scanning device. Many devices occupy a midpoint between these two.

15.3.1. 심플 스캐너-보급형 오프라인 예15.3.1. Simple Scanner-Affordable Offline Example

심플 스캐너는 텍스트 라인의 길이를 따라 이동할때 페이지에서 픽셀을 판독할 수 있는 스캐닝 헤드를 구비하고 있다. 이것은 페이지를 따라 움직임을 탐지하여 이러한 움직임에 대한 몇몇 정보로 픽셀을 기록할 수 있다. 또한 각각의 스캔이 타임스탬프되도록 할 수 있는 클록 또한 구비한다. 클록은 심플 스캐너가 연결되어있을때 호스트 장치와 동기된다. 클록은 하루의 실제 시간을 나타낼 수는 없지만 상대 시간이 결정될 수 있고 따라서 호스트는 스캔의 실제 시간을 추론할 수 있거나 최악의 경우 스캔간 경과 시간을 추론할 수 있다.Simple scanners have a scanning head that can read pixels on the page as they travel along the length of the text line. It can detect motion along the page and record pixels with some information about this motion. It also has a clock that allows each scan to be time stamped. The clock is synchronized with the host device when a simple scanner is connected. The clock cannot represent the actual time of the day, but relative time can be determined so that the host can infer the actual time of the scan or, in the worst case, the elapsed time between scans.

심플 스캐너는 OCR자체를 수행하기 위한 충분한 처리 전력을 가지고 있지 않 지만, 전형적인 단어길이, 단어 간격, 및 폰트 크기와의 관계에 대한 몇몇 기초 지식을 가지고 있다. 이것은 스캔이 판독가능할 수 있는지, 페이퍼를 가로질러 헤드가 너무 빨리, 너무 느리게 또는 너무 부적절하게 이동하고 있는지, 그리고 소정 크기의 충분한 단어가 식별될 문서에 대하여 스캔되었을것 같은 것을 결정하는때를 사용자에게 알려주는 기본적인 몇몇 지시광을 포함한다.Simple scanners do not have enough processing power to perform OCR itself, but they do have some basic knowledge of their relationship to typical word length, word spacing, and font size. This tells the user when the scan may be readable, whether the head is moving too quickly, too slowly or too inappropriately across the paper, and when enough words of a certain size may have been scanned for the document to be identified. It contains some basic indicator lights.

심플 스캐너는 USB접속을 갖고 컴퓨터상의 USB포트에 플러깅될 수 있고, 여기서 충전될 것이다. 컴퓨터에서는 심플 스캐너는 타임 스탬핑된 데이터 파일이 기로고딘 USB저장장치가 되도록 나타나고 시스템 소프트웨어의 나머지는 이 지점에서부터 인수한다.Simple Scanner has a USB connection and can be plugged into a USB port on your computer, where it will be charged. On the computer, a simple scanner appears to make the time stamped data file a Grogodin USB storage device, and the rest of the system software takes over from this point.

15.3.2. 수퍼스캐너-고급형 오프라인 예15.3.2. Super Scanner-Advanced Offline Example

수퍼스캐너 또한 그 전체 동작에 대하여 연결성에 좌우되지만, 이것은 오프라인인 동안 캡처되는 데이터에 대한 더나은 판단을 하는 것을 도울 수 있는 상당한 량의 온보드 저장 및 프로세싱을 구비한다.Superscanners also depend on connectivity for their overall operation, but this has a significant amount of onboard storage and processing that can help make better judgments about the data captured while offline.

수퍼스캐너가 텍스트의 라인을 따라 이동하면, 캡처된 픽셀은 함께 스티칭되어 텍스트를 인식하려고 시도하는 OCR엔진에 전달된다. 사용자의 가장 많이 판독된 출판물로부터 온 것을 포함하는 많은 폰트가 다운로드되어 PC상의 사용자 스펠링 체커 사전과 동기되고 따라서 사용자가 빈번히 마주치는 많은 단어를 포함하는 사전을 구비한 것 처럼, 이러한 작업을 수행하는데 도움을 준다. 또한 사용 빈도를 구비한 단어 및 표현 리스트가 스캐너상에 저장될 수 있는데 이것은 사전과 결합될 수 있다. 스캐너는 인식 프로세스에 도움을 주고 언제 충분한 량의 텍스트가 캡처되었는지에 대한 판단을 알려주기 위해 빈도 통계를 사용할 수 있다. 보다 빈번히 사용된 표현이 검색 질문의 기초로 덜 유용할 것 같다.As the superscanner moves along a line of text, the captured pixels are stitched together and passed to an OCR engine that attempts to recognize the text. Many fonts, including those from the user's most read publications, are downloaded and synchronized with the user spelling checker dictionary on the PC, thus helping to do this, as they have a dictionary containing many words that the user frequently encounters. Gives. In addition, a list of words and expressions with frequency of use can be stored on the scanner, which can be combined with a dictionary. The scanner can use frequency statistics to aid in the recognition process and to inform the decision as to when a sufficient amount of text was captured. More frequently used expressions are likely to be less useful as the basis for search queries.

또한, 사용자가 가장 일반적으로 읽었던 신문과 정기간행물의 최근 이슈의 기사에 대한 풀 인덱스를 사용자가 온라인 책판매자로부터 최근 구매한 책에 대한 또는 사용자가 지난 몇달 내에 무엇인가를 스캔한 인덱스인 것처럼 장치에 저장한다. 마지막으로, 시스템에 이용가능한 데이터를 갖는 수천개의 가장 인기있는 출판물의 제목이 저장되어 기타 정보가 없을때에 사용자가 제목을 스캔할 수 있고 특정 작업으로부터의 캡처가 이후 전자적 형태로 검색될 수 있을 것 같은지에 대한 좋은 생각을 가질 수 있다.In addition, a full index of the most commonly read newspapers and articles of the latest issue of the periodical is stored on the device as if it were an index of the books you recently purchased from an online book seller or if you scanned something in the last few months. Save it. Finally, the titles of thousands of the most popular publications with data available to the system are stored so that when there is no other information, the user can scan the title and capture from a specific task can then be retrieved electronically. You can have a good idea about the same thing.

스캐닝 프로세스동안, 본 시스템은 사용자에게 캡처된 데이터가 충분한 품질을 갖고 있고 전자 사본이 연결이 복구될때 검색될 수 있도록 하기 위해 충분한 특성을 가지고 있다고 알린다. 종종 본 시스템은 사용자에게 스캔이 성공적이었던 것으로 알려지고 컨텍스트가 온보드 인덱스중 하나에서 인식되었고, 또한 관심 출판물이 그 데이터가 시스템에서 이용가능한 것으로 알려지고 따라서 이후의, 검색이 성공적어야한다는 것을 알린다.During the scanning process, the system informs the user that the captured data is of sufficient quality and that the electronic copy has sufficient characteristics to be retrieved when the connection is restored. Often the system is informed to the user that the scan was successful and the context was recognized in one of the onboard indexes, and the publication of interest also informs that the data is known to be available in the system and that a subsequent, search must be successful.

수퍼스캐너는 PC의 파이어와이어나 USB포트에 연결된 크래들에 도킹되고, 이때 캡처된 데이터의 업로드에 더하여, 다양한 온보드 인덱스 및 기타 데이터베이스가 최근의 사용자의 활동과 새로운 출판물에 기초하여 업데이트된다. 또한 무선 공중망에 접속하거나 블루투스를 통해 이동 전화로 통신하기 위한 장치를 구비하고, 이 장치가 이용가능한 때 여기서 공중망과 통신할 수 있다.Superscanners dock to cradles connected to a PC's FireWire or USB port, and in addition to uploading captured data, various onboard indexes and other databases are updated based on recent user activity and new publications. Also provided is a device for connecting to a wireless public network or for communicating with a mobile phone via Bluetooth, which device can communicate with the public network when available.

15.4. 광학 스캐닝을 위한 구조15.4. Structure for Optical Scanning

광학 스캐너 장치에 특히 바람직할 수 있는 몇몇 구조를 이하 살펴본다.Some structures that may be particularly desirable for optical scanner devices are discussed below.

15.4.1. 플렉시블한 포지셔닝 및 편리한 광학기기15.4.1. Flexible positioning and convenient optics

종이가 계속 인기있는 이유중 하나는 예컨대 컴퓨터가 실용될 수 없거나 불편한 다양한 상황에서도 사용이 편리하기 때문이다. 따라서 종이와 사용자의 상호작용의 상당한 부분을 캡처하기 위한 장치 또한 마찬가지로 사용이 편리해야한다. 과거에는 스캐너의 경우는 이러하지 않았는데 가장 작은 핸드헬드 장치조차 다소 다루기가 힘들었다. 페이지와 접촉하게 되도록 설계된 것은 종이와 정확한 각으로 유지되어 스캔될 텍스트의 길이를 따라 매우 정확하게 이동되어야했다. 이것은 사무실 책상위에 비지니스 보고서를 스캐닝할때는 허용될 수 있지만, 기차를 기다리면서 소설의 표현을 스캐닝할때는 비실용적일 수 있다. 종이에서 떨어진 거리이ㅔ서 동작하는 카메라 타입 광학기기에 기초한 스캐너는 마찬가지로 몇몇 상황에서 유용할 수 있다.One reason why paper continues to be popular is that it is convenient to use in a variety of situations, for example, where computers are not practical or inconvenient. Thus, a device for capturing a significant portion of the user's interaction with the paper must likewise be easy to use. In the past, this was not the case with scanners, and even the smallest handheld device was somewhat unwieldy. Designed to come into contact with the page had to be kept at a precise angle with the paper and moved very accurately along the length of the text to be scanned. This may be acceptable when scanning a business report on an office desk, but may be impractical when scanning a novel's expression while waiting for a train. Scanners based on camera type optics operating at a distance from paper may likewise be useful in some situations.

본 시스템의 일부 실시예는 종이와 접촉하여 스캔하고 렌즈 대신 페이지에서 광센서 장치로 이미지를 전송하기 위한 광섬유 다발인 이미지 콘딧을 사용하는 스캐너를 사용한다. 이러한 장치는 자연적 위치에 유지되도록 할 수 있도록 형성될 수 있다. 예컨대, 몇몇 실시예에서, 페이지와 접촉하고 있는 부분이 쐐기형으로 되어있고, 이것은 사용자의 손이 하이라이터 펜의 사용과 마찬가지의 움직임으로 페이지 위에서 보다 자연적으로 움직이게 할 수 있다. 콘딧은 종이와 직접 접촉하거나 근접하여 있고, 발생할 수 있는 손상으로부터 이미지 콘딧을 보호할 수 있는 대체가능한 투명 팁을 구비할 수 있다. 12.2.4절에 설명한 바와 같이, 스캐너는 종이는 물론 스크린에서부터 스캔하는데 사용될 수 있고 팁의 재질은 이러한 디스플레이의 손상 가능성을 줄이도록 선택될 수 있다.Some embodiments of the system use a scanner that uses an image conduit, which is an optical fiber bundle, to scan in contact with paper and transfer an image from a page to an optical sensor device instead of a lens. Such a device may be configured to be held in its natural location. For example, in some embodiments, the portion that is in contact with the page is wedge shaped, which may allow a user's hand to move more naturally on the page in the same motion as using a highlighter pen. The conduit may be in direct contact with or in close contact with the paper and have a replaceable transparent tip that may protect the image conduit from possible damage. As described in Section 12.2.4, the scanner can be used to scan from the screen as well as paper and the tip material can be selected to reduce the likelihood of damage to these displays.

마지막으로, 본 장치의 몇몇 실시예는 스캐닝 프로세스동안 사용자가 너무 빠르게, 너무 느리게, 너무 비균일하게 스캐닝하거나 스캐닝 라인상에서 너무 높거나 낮게 드리프트할 때 빛, 소리 또는 촉각적 피드백을 사용함으로써 사용자에게 지시할 것이다.Finally, some embodiments of the device instruct the user by using light, sound or tactile feedback when the user scans too quickly, too slowly, too non-uniformly or drifts too high or too low on the scanning line during the scanning process. something to do.

15.5. 보안, 식별, 인증, 개인화 및 빌링15.5. Security, Identification, Authentication, Personalization, and Billing

6절에서 설명한 바와 같이, 캡처 장치는 보안 거래, 구매, 및 다양한 기타 동작에 대한 식별 및 인증의 중요한 부분을 형성할 수 있다. 따라서, 이러한 역할에 필요한 회로 및 소프트웨어에 더하여, 스마트카드 리더, RFID, 또는 PIN을 타이핑하기 위한 키패드 등 보다 보안을 강화할 수 있는 다양한 하드웨어 구조를 ㅜㅂ가할 수 있다.As described in section 6, the capture device may form an important part of the identification and authentication for secure transactions, purchases, and various other operations. Thus, in addition to the circuitry and software required for this role, a variety of hardware architectures can be added to enhance security, such as smart card readers, RFID, or keypads for typing PINs.

또한 사용자 식별을 돕기 위한 다양한 생체인식 센서를 포함할 수도 있다. 예컨대 광학 스캐너의 경우 스캐닝 헤드는 지문을 판독할 수도 있다. 음성 레코더에 대하여 사용자의 음성 패턴이 사용될 수 있다.It may also include various biometric sensors to aid in user identification. For example, in the case of an optical scanner, the scanning head may read a fingerprint. The voice pattern of the user can be used for the voice recorder.

15.6. 장치 연관15.6. Device association

몇몇 실시예에서, 본 장치는 기타 부근 장치와의 연관을 형성하여 자체 또는 그것들의 기능을 증가시킬 수 있다. 예컨대 몇몇 실시예에서, 본 장친,ㅡㄴ 동작에 대한 보다 상세한 피드백을 제공하기 위해 근처의 PC나 전화기의 디스플레이를 사용하거나 네트워크 접속을 사용한다. 한편 본 장치는 다른 장치에 의해 수행된 동작을 인정하기 위한 보안 및 식별 장치로서 역할을 수행할 수 있다. 또한 장치의 주변장치로서 기능하기 위하여 단순히 연관을 형성할 수 있다.In some embodiments, the device may form an association with other nearby devices to increase itself or their functionality. For example, in some embodiments, the display of a nearby PC or phone is used or a network connection is used to provide more detailed feedback on this operation. On the other hand, the device may serve as a security and identification device for acknowledging an operation performed by another device. It can also simply form an association to function as a peripheral of the device.

이러한 연관의 흥미로운 태양은 본 장치의 캡처 장치를 사용하여 개시되고 인증될 수 있다는 것이다. 예컨대 자신을 보안적으로 공중 컴퓨터 단말에 식별하기 원하는 사용자는 단말의 스크린의 특정 영역에 디스플레이된 코드나 심볼을 스캔하여 키 전송을 유효하게 하기 위하여 본 장치의 스캐닝 장치를 사용할 수 있다. 음성 기록 장치에 의해 추출된 음성 신호를 사용하여 유사한 프로세스를 수행할 수 있다.An interesting aspect of this association is that it can be initiated and authenticated using the capture device of the device. For example, a user who wants to securely identify himself or herself to a public computer terminal may use the scanning device of the apparatus to scan a code or symbol displayed on a specific area of the screen of the terminal to validate key transmission. A similar process can be performed using the speech signal extracted by the speech recording device.

15.7. 기타 장치와의 통합15.7. Integration with other devices

몇몇 실시예에서, 캡처 장치의 기능은 이미 사용중인 몇몇 다른 장치와 통합된다. 통합 장치는 전력공급, 데이터 캡처 및 저장 용량, 및 네트워크 인터페이스를 공유할 수 있다. 이러한 통합은 단순히 편리를 위함이거나, 비용을 줄이거나, 또는 이러한 통합이 없다면 이용가능하지 않을 기능을 가능하게 하기 위해 수행될 수 있다.In some embodiments, the functionality of the capture device is integrated with some other device already in use. The integrated device can share power supply, data capture and storage capacity, and network interface. Such integration may be performed simply for convenience, to reduce costs, or to enable functionality that would not be available without such integration.

캡처 기능이 통합될 수 있는 장치의 몇몇 예는 다음과 같다.Some examples of devices in which the capture function may be integrated are as follows.

● 마우스, 스타일러스, USB "웹캠" 카메라, 블루투스^TM 헤드셋 또는 원격 제어등의 기존 주변장치● Traditional peripherals such as a mouse, stylus, USB "webcam" camera, Bluetooth ^TM headset or remote control

● PDA, MP3플레이어, 음성 레코더, 디지털 카메라 또는 이동 전화등 또 다 른 프로세싱/저장 장치• Other processing / storage devices such as PDAs, MP3 players, voice recorders, digital cameras or mobile phones.

● 시계, 보석류, 펜, 자동차 키 장식물등 편리만을 위한 기타 휴대품● Watches, jewelry, pens, car key ornaments, etc.

15.7.1. 이동 전화 통합15.7.1. Mobile phone integration

통합의 장점의 예와 같이, 캡처 장치와 같은 수정된 이동 전화의 사용을 살펴본다.As an example of the benefits of integration, we look at the use of a modified mobile phone such as a capture device.

몇몇 실시예에서, 전화기 하드웨어는, 텍스트 캡처가 음성 인식을 통해 적절히 수행될 수 있는 경우, 전화기 자체에 의해 프로세싱될 수 있거나 전화 호의 타단부에서 시스템에 의해 처리되거나 또는 미래의 프로세싱을 위해 전화기의 메모리내에 저장될 수 있는 경우 처럼 본 시스템을 지원하도록 수정되지 않는다. 현대의 많은 전화기는 시스템의 일부를 구현할 수 있는 소프트웨어를 다운로드할 수 있는 기능을 갖고 있다. 그러나 이러한 음성 캡처는, 예를들어 상당한 배경 노이즈가 있는 많은 상황에서 차선책일 것 같고 정확한 음성 인식이 최적의 때에서도 어려운 일이다. 오디오 장치는 음성 주석을 캡처하는데 최적으로 이용될 수 있다.In some embodiments, the phone hardware may be processed by the phone itself, or processed by the system at the other end of the phone call or the memory of the phone for future processing, if text capture can be properly performed through speech recognition. It is not modified to support the system as it can be stored within. Many modern phones have the ability to download software to implement part of the system. However, such speech capture may be the next best option, for example in many situations with significant background noise, and even when accurate speech recognition is optimal. Audio devices can be optimally used to capture voice annotations.

몇몇 실시예에서, 많은 이동 전화기에 내장된 카메라는 텍스트의 이미지를 캡처하는데 이용된다. 보통 카메라의 뷰파인더로 기능하는 전화기 디스플레이는 이미지의 품질과 텍스트의 세그먼트가 캡처되는 OCR에 대한 적합성에 관한 정보 및 OCR이 전하기에서 수행될 수 있다면 텍스트의 사본을 라이브 카메라 이미지상에 겹칠 수 있다.In some embodiments, cameras embedded in many mobile phones are used to capture images of text. The phone display, which normally serves as the camera's viewfinder, can superimpose information on the quality of the image and suitability for the OCR in which the segment of text is captured and a copy of the text onto the live camera image if the OCR can be carried out.

몇몇 실시예에서, 전화기는 전용 캡처 장치를 부가하거나 전화기와 통신하는 별개의 블루투스 접속 주변장치나 클립온 어댑터로 이러한 기능을 제공하도록 수정 된다. 캡처 메커니즘의 특성이 어떠한 것이라도, 현대의 이동전화와의 통합은 많은 다른 장점을 제공한다. 전화기는 보다 광범위한 세계와 접속을 갖는데, 이것은 쿼리가 원거리 검색 엔진이나 시스템의 기타 부분으로 제공될 수 있고 즉시 저장이나 뷰잉을 위하여 문서 사본이 검색될 수 있음을 의미한다. 전화기는 일반적으로 시스템의 많은 기능이 로컬적으로 수행되도록 하는 충분한 프로세싱 전력과 합리적인 량의 데이터를 캡처하는데 충분한 저장장치를 구비하고 있다. 저장량은 또한 종종 사용자가 확장할 수 있다. 전화기는 합리적으로 양호한 디스플레이와 음성 장치를 구비하여 사용자 피드백 및 촉각적 피드백을 위한 진동 기능을 제공한다. 또한 양호한 전력 공급장치도 구비한다.In some embodiments, the phone is modified to provide this functionality with a separate Bluetooth connected peripheral or clip-on adapter that adds or communicates with a dedicated capture device. Whatever the nature of the capture mechanism, integration with modern mobile phones offers many other advantages. The phone has a wider world connection, which means that queries can be provided to remote search engines or other parts of the system and documents can be retrieved for immediate storage or viewing. Telephones generally have sufficient processing power to allow many of the system's functions to be performed locally and sufficient storage to capture a reasonable amount of data. The amount of storage can also often be extended by the user. The telephone has a reasonably good display and voice device to provide a vibration function for user feedback and tactile feedback. It also has a good power supply.

가장 중요한 점은, 전화기는 대부분의 사용자들이 이미 휴대하고 있는 장치라는 점이다.Most importantly, the phone is a device that most users already carry.

III장-본 시스템의 응용예Section III-Application Examples

본 장은 본 시스템의 사용과 이에 가능한 응용예의 리스트를 기술한다. 이러한 리스트는 순수하게 설명을 위한 것이고 이것만을 포함하는 것은 아니다.This chapter describes the use of the system and a list of possible applications. This list is purely illustrative and not exclusive.

16. 개인적 응용16. Personal Application

16.1. 라이프 도서관16.1. Life library

라이프 도서관(6.1.1절 참조)은 가입자가 저장하기를 바라는 임의의 중요한 문서의 디지털 아카이브이며 본 시스템의 서비스의 실시예의 세트이다. 중요한 책, 잡지 기사, 신문 클리핑등 전부를 라이프 도서관에 디지털 형태로 저장할 수 있다. 또한, 문서와 함께 가입자의 주석, 코멘트, 및 유의사항을 저장할 수 있다. 라이프 도서관은 인터넷과 월드와이드웹을 거쳐 접속될 수 있다.The Life Library (see Section 6.1.1) is a digital archive of any important document that the subscriber wishes to store and is a set of embodiments of the services of the system. Important books, magazine articles and newspaper clippings can all be stored digitally in the Life Library. In addition, it is possible to store the subscriber's comments, comments, and notes with the document. The Life Library can be accessed via the Internet and the World Wide Web.

본 시스템은 가입자를 위한 라이프 도서관 문서 아카이브를 생성하고 관리한다. 가입자는 문서로부터 정보를 스캔함으로써 또는 특정 문서가 가입자의 라이프 도서관에 부가되도록 시스템에 지시함으로써 어떤 문서를 자신의 라이프 도서관에 저장했었기를 바라는지 지시한다. 스캔된 정보는 일반적으로 문서에서온 텍스트이지만 문서를 식별하는 바코드나 기타 코드일 수도 있다. 본 시스템은 이러한 코드를 받아들이고 소스 문서를 식별하기 위해 이것을 사용한다. 문서가 식별된이후 시스템은 사용자의 라이프 도서관에 문서 사본을 저장하거나 문서를 얻을 수 있는 소스로의 링크를 저장할 수 있다.The system creates and manages a life library document archive for subscribers. The subscriber indicates what documents he or she wishes to store in his or her life library by scanning information from the document or by instructing the system to add a particular document to the subscriber's life library. The scanned information is typically text from a document, but can also be a barcode or other code that identifies the document. The system accepts this code and uses it to identify the source document. After the document is identified, the system can store a copy of the document in the user's life library or a link to a source from which the document can be obtained.

라이프 도서관 시스템의 일 실시예는 가입자가 전자 사본을 얻도록 승인되는지를 확인할 수 있다. 예컨대, 독자가 뉴욕 타임즈(NYT)내의 기사 사본으로부터 텍스트나 식별자를 스캔하여 그 기사가 독자의 라이프 도서관에 부가된다면, 라이프 도서관 시스템은 독자가 NYT의 온라인 버전에 가입되어 있는지 NYT에 확인할 것이고, 가입되어 있으면 독자는 자신의 라이프 도서관 계정에 저장되어 있는 기사 사본을 얻고 그렇지 않다면 문서를 식별하는 정보와 이를 주문하는 방법이 자신의 라이프 도서관 계정에 저장된다.One embodiment of the life library system may verify that a subscriber is authorized to obtain an electronic copy. For example, if a reader scans text or identifiers from a copy of an article in the New York Times (NYT) and the article is added to the reader's life library, the life library system will check with the NYT to see if the reader is subscribed to the online version of the NYT. If so, the reader gets a copy of the article stored in his or her life library account, and otherwise the information identifying the document and how to order it is stored in his or her life library account.

몇몇 실시예에서, 본 시스템은 접근 권한 정보를 포함하는 각각의 가입자를 위한 가입자 프로파일을 유지한다. 문서 접근 정보는 몇가지 방식으로 컴파일될 수 있는데, 그 중 두가지는 1) 가입자가 자신의 계정명과 암화등과 함께 라이프 도서관 시스템에 문서 접근 정보를 공급하는 것과 2) 라이프 도서관 서비스 프로바이 더가 가입자의 정보를 갖는 출판업자에게 조회하여 출판업자가 라이프 도서관 가입자가 기사에 접근하도록 승인되면 전자 사본으로의 접근을 제공함으로써 응답하는 것이다. 라이프 도서관 가입자가 문서의 전자 사본을 갖도록 승인되지 않으면, 출판업자는 라이프 도서관 서비스 프로바이더에게 가격을 제공하고, 이후 고객에게 전자 문서를 구매하는 옵션을 제공한다. 그럴 경우, 라이프 도서관 서비스 프로바이더는 출판업자에게 바로 지불하고 이후 라이프 도서관 고객에게 과금시키거나 라이프 도서관 서비스 프로바이더가 구매에 대하여 고객의 크레디트 카드로 즉시 과금한다. 라이프 도서관 서비스 프로바이더는 거래를 용이하게 하는데 대한 소액의 고정 비용 또는 구매 가격의 몇프로를 얻는다.In some embodiments, the system maintains a subscriber profile for each subscriber that includes access rights information. Document access information can be compiled in several ways, two of which are: 1) the subscriber providing document access information to the life library system, along with his / her account name and password, and 2) the life library service provider Inquiries to the information-bearing publisher and the publisher responds by providing access to the electronic copy when the life library subscriber is authorized to access the article. If the Life Library subscriber is not authorized to have an electronic copy of the document, the publisher provides a price to the Life Library service provider and then gives the customer the option to purchase the electronic document. If so, the Life Library Service Provider will pay the publisher directly and later charge the Life Library Customer or the Life Library Service Provider will immediately charge the customer's credit card for purchases. The life library service provider gets a small fixed cost or a few percent of the purchase price to facilitate the transaction.

본 시스템은 가입자의 개별 도서관 및/또는 가입자가 기록 보관 권한을 갖는임의의 다른 도서관내에 문서를 보관할 수 있다. 예컨대, 사용자가 인쇄된 문서에서 텍스트를 스캔하면, 라이프 도서관 시스템은 렌더링된 문서와 그 전자 사본을 식별할 수 있다. 원본 문서가 식별되면, 라이프 도서관 시스템은 사용자의 개별 도서관과 가입자가 기록 보관 권한을 갖는 그룹 도서관내에 원본 문서에 관한 정보를 기록할 수 있다. 그룹 도서관은 프로젝트에서 함께 작업하는 그룹, 학술 연구원 그룹, 그룹 웹로그등을 위한 문서 저장소와 같은 합동 아카이브이다.The system may store documents in a subscriber's individual library and / or any other library in which the subscriber has recordkeeping authority. For example, when a user scans text in a printed document, the life library system can identify the rendered document and its electronic copy. Once the original document is identified, the life library system can record information about the original document in the user's individual library and in a group library in which the subscriber has recordkeeping rights. Group libraries are joint archives such as document repositories for groups working together on projects, groups of academic researchers, group weblogs, and so on.

라이프 도서관은, 연대순으로, 토픽별로, 가입자 관심 수준별로, 출판 유형(신문, 책, 잡지, 기술 논문등)별로, 읽는 장소 또는 시간 별로, ISBN이나 십진 분류법등 많은 방식으로 조직될 수 있다. 일 대안으로, 본 시스템은 다른 가입자가 동 문서를 어떻게 분류했는지에 기초하여 분류법을 알 수 있다. 본 시스템은 사용 자에게 분류법을 제안하거나 사용자를 위하여 문서를 자동적으로 분류할 수 있다.Life libraries can be organized in chronological order, by topic, by subscriber level, by publication type (newspaper, book, magazine, technical paper, etc.), by reading place or time, or by ISBN or decimal classification. In one alternative, the system may know the taxonomy based on how other subscribers have classified the document. The system can suggest taxonomy to users or automatically classify documents for users.

다양한 실시예에서, 문서에 주석이 바로 삽입되거나 별개 파일로 유지될 수 있다. 예컨대, 가입자가 신문 기사에서 텍스트를 스캔하면, 그 기사는 하이라이트되고 스캔된 텍스트와 함께 자신의 라이프 도서관에 보관된다. 대안으로, 연관된 주석 파일과 함께 라이프 도서관에 기사를 보관한다(따라서 보관된 문서를 수정되지 않은 채 유지한다). 본 시스템의 실시예는 각각의 가입자의 도서관에 원본 문서 사본을, 많은 가입자가 접근할 수 있는 마스터 도서관에 사본을, 또는 출판업자에 의해 유지된 사본에 링크를 유지할 수 있다.In various embodiments, annotations can be inserted directly into the document or maintained as separate files. For example, when a subscriber scans text in a newspaper article, the article is highlighted and stored in its life library along with the scanned text. Alternatively, keep the article in the Life Library along with its associated annotation file (thus keeping the archived document unmodified). Embodiments of the system may maintain a link to an original document copy in each subscriber's library, a copy in a master library accessible to many subscribers, or a copy maintained by a publisher.

몇몇 실시예에서, 라이프 도서관은 문서 수정본(예컨대, 하이라이트한 부분등)과 (그밖에 어디에 저장된) 문서의 온라인 버전으로의 링크만을 저장한다. 본 시스템 또는 가입자는 가입자가 이어서 문서를 검색할때 변화와 문서를 합친다.In some embodiments, the Life Library only stores document revisions (eg, highlights, etc.) and links to online versions of documents (stored elsewhere). The system or subscriber merges the changes and documents when the subscriber subsequently retrieves the document.

주석이 별개 파일로 유지되면, 원본 문서와 주석 파일이 가입자에게 제공되고 가입자는 이들을 결합하여 수정된 문서를 생성한다. 대안으로, 본 시스템은 가입자에게 상기 두 파일을 제공하기 전에 결합한다. 또 다른 대안으로, 주석 파일은 문서 파일에 대한 오버레이이고 가입자 컴퓨터내의 소프트웨어에 의해 문서에 겹쳐질 수 있다.If the comments are kept in separate files, the original document and the comments file are provided to the subscriber and the subscriber combines them to create a modified document. Alternatively, the system combines before providing the two files to the subscriber. As another alternative, the annotation file is an overlay to the document file and can be overlaid on the document by software in the subscriber computer.

라이프 도서관 서비스의 가입자는 본 시스템이 가입자의 아카이브를 유지하도록 하기 위해 매월 요금을 지불한다. 대안으로, 가입자는 아카이브에 저장된 각 문서에 대하여 소액(예컨대, 소액 지불)을 지불한다. 대안으로, 가입자는 액세스당 요금으로 가입자의 아카이브에 접근하기 위하여 지불한다. 대안으로, 가입자는 도서관을 컴파일하여 다른 사람이 라이프 도서관 서비스 프로바이더와 저작권자와 같이 수익 공유 모델에 대한 기사/주석에 접근하게할 수 있다. 대안으로, 라이프 도서관 서비스 프로바이더는 라이프 도서관의 가입자가 문서를 주문할때 출판업자로부터 지불을 받는다(출판업자와의 수익 공유 모델. 여기서 라이프 도서관 서비스 프로바이더는 출판업자의 수익의 일부 몫을 얻는다).Subscribers to Life Library Services pay a monthly fee to ensure that the system maintains their archives. Alternatively, the subscriber pays a small amount (eg, a small payment) for each document stored in the archive. Alternatively, the subscriber pays to access the subscriber's archive at a fee per access. Alternatively, the subscriber can compile the library so that others can access articles / comments on the revenue sharing model, such as life library service providers and copyright holders. Alternatively, Life Library Service Providers receive payment from publishers when Life Library subscribers order documents (revenue sharing model with publishers, where Life Library Service Providers get some share of publisher revenue) .

몇몇 실시예에서, 라이프 도서관 서비스 프로바이더는 저작권있는 기사의 과금과 지불을 용이하게 하기 위하여 가입자와 저작권자(또는 저작권 청산 센터, 약자로 CCC)간의 중개자로 작용한다. 라이프 도서관 서비스 프로바이더는 가입자의 빌링 정보와 기타 사용자의 계정 정보를 사용하여 이러한 중간 서비스를 제공한다. 필수적으로, 라이프 도서관 서비스 프로바이더는 가입자와의 앞에 존재하는 관계를 레버리징하여 가입자 대신 저작권있는 기사의 구매를 가능하게 한다.In some embodiments, the life library service provider acts as an intermediary between the subscriber and the copyright holder (or copyright clearing center, abbreviated CCC) to facilitate billing and payment of copyrighted articles. The Life Library Service Provider provides this intermediary service by using the subscriber's billing information and other user's account information. Essentially, the life library service provider leverages the existing relationship with the subscriber to enable the purchase of copyrighted articles on behalf of the subscriber.

몇몇 실시예에서, 라이프 도서관 시스템은 문서로부터의 발췌문을 저장할 수 있다. 예컨대, 가입자가 종이 문서로부터 텍스트를 스캔할때, 라이프 도서관내에 전체 문서가 보관되기 보다는 스캔된 텍스트 주변의 영역은 발췌되고 라이프 도서관에 위치된다. 원본 스캔의 상태를 보존함으로써 가입자가 흥미있는 부분을 발견하기 위해 문서를 다시 읽는 것을 방지하기 때문에 문서가 긴 경우 이것은 특히 유용하다. 물론, 종이 문서의 전체 전자 사본으로의 하이퍼링크가 발췌 기사와 함께 포함될 수 있다.In some embodiments, the life library system can store excerpts from documents. For example, when a subscriber scans text from a paper document, the area around the scanned text is extracted and placed in the life library, rather than the entire document stored in the life library. This is particularly useful if the document is long because it preserves the state of the original scan, preventing the subscriber from rereading the document to find the portion of interest. Of course, a hyperlink to the full electronic copy of the paper document can be included with the excerpt.

몇몇 실시예에서, 본 시스템은 작가, 출판물 제목, 출판일, 출판업자, 저작권자(또는 저작권자의 라이센싱 대리인), ISBN, 문서의 공공의 주석으로의 링크, 리드랭크등, 라이프 도서관내의 문서에 대한 정보를 저장하기도 한다. 문서에 대한 이러한 부가적인 정보의 몇몇은 종이 문서 메타데이터의 형태이다. 일반 공중과 같은 자신들외의 사람에 의한 접근을 위하여 제3자는 공공의 주석을 생성할 수 있다. 문서상의 제3자의, 주석으로의 링크는 유용한데 이는 다른 사용자의 주석 파일을 읽음으로써 가입자의 문서의 이해를 강화하기 때문이다.In some embodiments, the system provides information about documents in the Life Library, such as the author, publication title, publication date, publisher, copyright holder (or the copyright holder's licensing agent), ISBN, link to the document's public annotation, leadrank, and so on. It also stores. Some of this additional information about the document is in the form of paper document metadata. A third party can create public annotations for access by anyone other than themselves, such as the general public. Links to comments by third parties on the document are useful because they enhance the subscriber's understanding of the document by reading other users' comment files.

몇몇 실시예에서, 본 시스템은 클래스별로 기사를 보관한다. 이러한 특징은 라이프 도서관 가입자가 각각의 종이 문서로의 접근없이 종이 문서의 전체 클래스로 전자 사본을 신속히 저장할 수 있게한다. 예컨대, 가입자가 내셔널 지오그래픽 잡지의 사본으로부터 몇몇 텍스트를 스캔할때, 시스템은 가입자게게 내셔널 지오그래픽의 모든 백 이슈를 보관하기 위한 옵션을 제공한다. 가입자가 모든 백 이슈를 보관하도록 결정하면, 라이프 도서관 서비스 프로바이더는 가입자가 이렇게 하도록 승인되어 있는지를 내셔널 지오그래픽 소사이어티에 확인한다. 승인되어 있지 않으면, 라이프 도서관 서비스 프로바이더는 내셔널 지오그래픽 잡지 컬렉션을 보관할 권리의 구매를 중개할 수 있다.In some embodiments, the system stores articles by class. This feature allows Life Library subscribers to quickly store electronic copies of the entire class of paper documents without access to each paper document. For example, when a subscriber scans some text from a copy of a National Geographic magazine, the system offers the subscriber the option to archive all back issues of National Geographic. If the subscriber decides to archive all back issues, the Life Library Service Provider checks with the National Geographic Society to see if the subscriber is authorized to do so. If not approved, Life Library Service Provider may broker the purchase of the right to keep National Geographic magazine collections.

16.2. 라이프 세이버16.2. Life saver

라이프 도서관 개념을 변형 또는 개선한 것이 "라이프 세이버"인데, 본 시스템은 다른 활동에 대하여 보다 많이 추론하기 위하여 사용자에 의해 캡처된 텍스트를 사용한다. 특정 식당의 메뉴, 특정 극장 공연의 프로그램, 특정 기차역에서의 시간표, 또는 지역 신문의 기사를 스캐닝함으로써 본 시스템이 사용자의 위치와 사회 활동에 대한 추론을 가능하게 하고, 예컨대 웹사이트로서 그들에 대한 자동 다 이어리를 구성할 수 있다. 사용자는 이 다이어리를 편집하고 수정하고, 사진등 부가 기사를 부가하며 스캔된 항목을 다시 볼 수 있을 것이다.A modification or improvement of the life library concept is the "life saver", which uses text captured by the user to reason more about other activities. By scanning menus at specific restaurants, programs at certain theater performances, timetables at specific train stations, or articles from local newspapers, the system enables inferences about the user's location and social activities, for example, as a website. You can organize a diary. The user will be able to edit and edit this diary, add additional articles such as photos and view the scanned item again.

17. 학술적 응용17. Academic Applications

상기한 시스템이 지원하는 휴대용 스캐너는 학술적 설정에 있어서 많은 어쩔 수 없는 용도를 갖는다. 이러한 스캐너는 학생/선생님간 상호작용을 향상시킬 수 있고 배우는 경험을 증대시킬 수 있다. 이러한 용도중, 학생들은 그들의 독특한 필요에 맞추기 위해 학습 기사에 주석을 달 수 있고 선생님들은 학업 성과를 모니터링할 수 있으며 학생들의 과제물에 인용된 소스 기사를 자동적으로 확인할 수 있다.The portable scanners supported by these systems have many compelling uses in academic settings. These scanners can enhance student / teacher interaction and enhance the learning experience. Among these uses, students can annotate learning articles to meet their unique needs, teachers can monitor academic performance and automatically identify source articles cited in their work.

17.1. 아이들용 책17.1. Children's books

책과 같은 종이 문서와 아이들의 상호작용은 본 시스템의 특정 실시예 세트를 채용하는 문해 습득 시스템에 의해 모니터링된다. 아이들은 문해 습득 시스템의 다른 엘리먼트와 통신하는 휴대용 스캐너를 사용한다. 휴대용 스캐너에 더하여, 문해 습득 시스템은 디스플레이 및 스피커를 구비한 컴퓨터와 이 컴퓨터에 의해 액세스가능한 데이터베이스를 포함한다. 본 스캐너는 컴퓨터와 (유선, 단거리 RF등으로) 연결된다. 아이가 책에서 모르는 단어를 보면, 그것을 스캐너로 스캔한다. 일 실시예에서, 문해 습득 시스템은 스캔된 텍스트와 데이터베이스내의 리소스를 비교하여 단어를 식별한다. 데이터베이스는 사전, 시소러스, 및/또는 멀티미디어 파일(예컨대 소리, 그래픽등)을 포함한다. 단어가 식별된후, 이 시스템은 컴퓨터 스피커를 사용하여 그 단어와 정의를 아이들에게 발음해준다. 또 다른 실시 예에서, 이 단어와 정의가 문해 습득 시스템에 의해 컴퓨터 모니터상에 디스플레이된다. 스캔된 단어에 대한 멀티미디어 파일은 컴퓨터 모니터와 스피커를 통해 재생될 수도 있다. 예컨대, "금발의 미녀와 세마리의 곰"을 읽고있는 아이가 단어 "곰"을 스캔하면, 시스템은 단어 "곰"을 발음하고 컴퓨터 모니터상에 곰에 대한 짧은 동영상을 재생할 수 있다. 이런식으로, 아이들은 씌여진 단어를 발음하는 것을 배우고 그 단어가 의미하는 바를 멀티미디어 소개를 통해 시각적으로 습득한다.Children's interaction with paper documents, such as books, is monitored by a literacy learning system employing a particular set of embodiments of the present system. Children use a handheld scanner that communicates with other elements of the literacy acquisition system. In addition to a handheld scanner, the literacy acquisition system includes a computer with a display and a speaker and a database accessible by the computer. The scanner is connected to a computer (by wire, short-range RF, etc.). If your child sees a word you don't know in a book, scan it with a scanner. In one embodiment, the literacy acquisition system compares the scanned text with resources in the database to identify words. The database may include dictionaries, thesaurus, and / or multimedia files (eg, sounds, graphics, etc.). After the words have been identified, the system uses computer speakers to pronounce the words and their definitions to the children. In another embodiment, these words and definitions are displayed on a computer monitor by a literacy acquisition system. The multimedia file for the scanned word may be played through the computer monitor and the speaker. For example, if a child reading "Blonde Beauty and Three Bears" scans the word "bear", the system may pronounce the word "bear" and play a short video of the bear on a computer monitor. In this way, children learn to pronounce the written word and visually learn what the word means through a multimedia introduction.

문해 습득 시스템은 배움 프로세스를 강화하기 위하여 즉각적인 청각적 및/또는 시각적 정보를 제공한다. 아이들은 이러한 보충 정보를 사용하여 기록된 문서의 보다 깊은 이해를 신속히 얻는다. 본 시스템은 초심 독자가 독서하는 것을 가르치는데 사용될 수 있고 아이들이 보다 넓은 어휘력등을 얻는데 돕기위해 사용될 수 있다. 이 시스템은 아이들에게 친밀하지 않은 단어나 아이들이 보다 많은 정보를 원하는 정보를 제공한다.A literacy acquisition system provides instant audio and / or visual information to enhance the learning process. Children use this supplemental information to quickly gain a deeper understanding of the written document. The system can be used to teach beginner readers to read and to help children gain broader vocabulary. This system provides words that are not familiar to children or information that children want more information about.

17.2. 문해 습득17.2. Literacy

몇몇 실시예에서, 본 시스템은 개인 사전을 컴파일한다. 독자가 새롭거나, 재미있거나, 특히 유용하거나 성가신 단어를 보면, 컴퓨터 파일로 (정의와 함께) 저장한다. 이러한 컴퓨터 파일은 독자의 개인화된 사전이 된다. 이러한 사전은 일반적으로 보통의 사전보다 사이즈가 작고 따라서 이동 단말 또는 연관된 장치로 다운로드될 수 있고 따라서 시스템이 즉시 액세스될 수 없는 경우에도 이용가능하게 된다. 몇몇 실시예에서, 개인 사전 입력은 적당한 단어의 발음을 돕기 위한 음성 파일과 단어가 스캔된 종이 문서를 식별하는 정보를 포함한다.In some embodiments, the system compiles a personal dictionary. When a reader sees a new, funny, especially useful or annoying word, save it as a computer file (with definitions). These computer files become reader's personalized dictionaries. Such dictionaries are generally smaller in size than ordinary dictionaries and thus can be downloaded to a mobile terminal or associated device and thus become available even if the system cannot be accessed immediately. In some embodiments, the personal dictionary entry includes a voice file to help pronunciation of the appropriate word and information identifying the paper document on which the word was scanned.

몇몇 실시예에서, 본 시스템은 학생들을 위한 커스터마이징된 스펠링 밍 어휘 테스트를 생성한다. 예컨대, 학생이 과제물을 읽을때, 그 학생은 휴대용 스캐너로 낯선 단어를 스캔할 수 있다. 이 시스템은 그 학생이 스캔한 모든 단어 목록을 저장한다. 이후, 시스템은 커스터마이징된 스펠링/어휘 테스트를 학생에게 연관된 모니터상에 제공한다(또는 이러한 테스트를 연관된 프린터에 인쇄한다).In some embodiments, the system generates customized spelling vocabulary tests for students. For example, when a student reads an assignment, the student can scan a strange word with a portable scanner. The system stores a list of all words scanned by the student. The system then presents the customized spelling / lexical test to the student on the associated monitor (or prints this test on the associated printer).

17.3. 음악 교습17.3. Music lessons

악보의 배치는 텍스트 라인에 문자를 배치하는 것과 유사하다. 본 시스템에서 텍스트를 캡처하기 위하여 상기한 같은 스캐닝 장치를 이용하여 악보를 캡처할 수 있고, 알려진 악곡의 데이터베이스에 검색를 구성하는 유사한 프로세스가 캡처가 일어나는 악곡이 식별될 수 있도록 하여 이후 검색, 재생, 또는 몇몇 다른 활동의 기초가 될 수 있게된다.The placement of sheet music is similar to placing text on a line of text. The system can capture sheet music using the same scanning device as described above to capture text, and a similar process of constructing a search in a database of known music allows a piece of music to be captured to be identified and subsequently retrieved, played back, or Can be the basis for some other activities.

17.4. 표절 탐지17.4. Plagiarism detection

선생님들은 본 시스템을 사용하여 학생들의 과제물에서 텍스트를 스캐밍하고 스캔된 텍스트를 본 시스템에 제공함으로써 표절을 탐지하거나 원본을 확인할 수 있다. 예컨대, 학생의 과제물에 있는 발췌문이 학생이 인용한 소스로부터 왔다는 것을 확인하기를 원하는 선생님은 그 발췌문의 일부를 스캔하고 시스템에 의해 식별된 문서의 제목과 학생이 인용한 문서의 제목을 비교할 수 있다. 마찬가지로, 본 시스템은 그 학생의 원래의 작업물로 제출된 과제물로부터의 텍스트의 스캔을 사용하여 그 텍스트가 카피되었는지를 보일 수 있다. Teachers can use this system to scan text from students' work and provide scanned text to the system to detect plagiarism or identify originals. For example, a teacher who wants to confirm that an excerpt in a student's work is from a source cited by the student can scan a portion of the excerpt and compare the title of the document cited by the student with the title of the document identified by the system. . Similarly, the system may use a scan of text from an assignment submitted to the student's original work to show if the text has been copied.

17.5 개선된 텍스트상자 17.5 improved textbox

일부 실시예에서, 학술 교재로부터 텍스트를 캡쳐하는 것은 학생 또는 스태프을 보다 상세한 설명, 추가적인 연습, 자료에 대한 학생 및 스태프의 논의, 관련된 모범적인 과거의 시험 문제, 주제에 대해 추가적인 논문, 주제에 대한 강의 기록, 등에 연결시킨다.(섹션 7.1을 보라)In some embodiments, capturing text from an academic text can provide a more detailed description of the student or staff, additional practice, discussion of the student and staff about the material, relevant exemplary past exam questions, additional papers on the subject, and lectures on the topic. To records, etc. (see section 7.1).

17.6 언어 학습 17.6 Language Learning

일부 실시예에서, 시스템은 외국어를 가르치기 위해 사용된다. 예를 들면 스페인어 단어를 스캐닝하는 것은 그 단어의 영어 정의와 함께 스페인어로 상기 단어를 크게 읽도록 할 수 있다.In some embodiments, the system is used to teach foreign languages. For example, scanning a Spanish word can cause the word to be read aloud in Spanish with an English definition of the word.

시스템은 새로운 언어 습득 프로세스를 개선시키기 위해 즉각적인 청각 및/또는 시각 정보를 제공한다. 독자는 자료의 보다 깊은 이해를 빨리 습득하기 위해 이러한 보충 정보를 이용한다. 시스템은 외국어를 읽으려하는 초보 학생을 가르치고, 학생들이 보다 많은 어휘 등을 습득하는 것을 돕는 데에 사용될 수 있다, 시스템은 독자에게 낯설거나 또는 독자가 더 많은 정보를 원하는 외국어 단어에 관한 정보를 제공한다.The system provides instant auditory and / or visual information to improve the new language acquisition process. The reader uses this supplemental information to quickly gain a deeper understanding of the data. The system can be used to teach a beginner student who wants to read a foreign language and to help students acquire more vocabulary, etc. The system provides the reader with information about foreign language words that are unfamiliar or that the reader wants more information about. do.

신문 또는 책과 같은 페이퍼 문서와의 독자의 상호작용은 언어 스킬 시스템에 의해 모니터링된다. 상기 독자는 언어 스킬 시스템과 통신하는 휴대가능한 스캐너를 갖는다. 일부 실시예에서, 언어 스킬 시스템은 디스플레이 및 스피커를 구비한 컴퓨터 및, 상기 컴퓨터에 의해 액세스가능한 데이터베이스를 포함한다. 상기 스캐너는 컴퓨터와 통신한다(배선에 의해 접속된, 단거리 범위 RF, 등). 독자 가 기사에서 모르는 단어를 보았을 때, 독자는 스캐너로 그것을 스캔한다. 데이터베이스는 외국어 사전, 유의어반의어사전, 및/또는 멀티미디어 파일(음성, 그래픽 등)을 포함한다. 일 실시예에서, 시스템은 스캔된 단어를 식별하기 위해 스캔된 텍스트를 자신의 데이터베이스에서의 리소스와 비교한다. 단어가 식별된 후에, 시스템은 컴퓨터 스피커를 사용하여 단어와 그 정의를 독자에게 전달한다. 일부 실시예에서, 단어와 그 정의 모두가 컴퓨터 모니터 상에 디스플레이된다. 스캔된 단어에 연관된 문법 팁에 관한 멀티미디어 파일 또한 컴퓨터 모니터와 스피커를 통해서 재생될 수 있다. 예를 들면, "to speak" 라는 단어가 스캔되면, 시스템은 "hablar"라는 단어를 발음하고, 적절한 스페인어 발음을 나타내는 짧은 오디오 클립을 재생시키고, "hablar"의 다양한 동사 변화형의 완벽한 목록을 디스플레이한다. 이러한 방식으로, 학생은 문자로 쓰여진 단어를 발음하는 것을 배우고, 멀티미디어 프리젠테이션에 의해 단어의 철자법을 시각적으로 배우고, 동사를 변화시키는 방법을 배운다. 시스템은 또한 공통적인 문구와 함께 적절한 "hablar"의 사용법에 관한 문법팁을 설명한다.Readers' interactions with paper documents, such as newspapers or books, are monitored by the language skill system. The reader has a portable scanner that communicates with the language skill system. In some embodiments, the language skill system includes a computer with a display and speakers, and a database accessible by the computer. The scanner communicates with a computer (connected by wiring, short range RF, etc.). When a reader sees an unknown word in an article, the reader scans it with a scanner. The database may include a foreign language dictionary, a dictionary of synonyms, and / or multimedia files (voice, graphics, etc.). In one embodiment, the system compares the scanned text with resources in its database to identify the scanned words. After the words are identified, the system uses computer speakers to convey the words and their definitions to the reader. In some embodiments, both words and their definitions are displayed on a computer monitor. Multimedia files on grammar tips associated with the scanned words can also be played through computer monitors and speakers. For example, when the word "to speak" is scanned, the system pronounces the word "hablar", plays a short audio clip indicating the proper Spanish pronunciation, and displays a complete list of the various verb variations of "hablar". do. In this way, the student learns to pronounce words written in letters, visually learns how to spell words by multimedia presentation, and learns how to change verbs. The system also describes grammar tips on the proper use of "hablar" with common phrases.

일부 실시예에서, 유저는 유저의 모국어(또는 유저가 합리적으로 잘아는 언어)가 아닌 언어로 된 렌더링된 문서로부터 단어 또는 짧은 문구를 스캔한다. 일부 실시예에서, 시스템은 유저의 "선호하는" 언어의 우선 순위 목록을 관리한다. 시스템은 렌더링된 문서의 전자 사본을 식별하고, 문서 내의 스캔 위치를 판정한다. 시스템은 또한 유저의 선호하는 언어 중 어느 하나로 번역된 문서의 제 2 전자 사본을 식별하고, 원문서에서의 스캔 위치에 상응하는 번역된 문서에서의 위치 를 판정한다. 상응 위치가 정확하게 알려지지 않을 때, 시스템은 스캔된 위치의 상응 위치를 포함하는 작은 영역(예를 들면, 문단)을 식별한다. 그런 다음, 상기 상응하는 번역된 위치가 유저에게 제시된다. 이것은, 흔히 단어-당-단어 기반으로 정확하게 번역하기 어려운 은어 또는 다른 관용어법을 포함하는, 스캔된 위치에서의 특정한 어법의 정확한 번역을 유저에게 제공한다.In some embodiments, a user scans a word or short phrase from a rendered document in a language other than the user's native language (or language that the user reasonably understands). In some embodiments, the system maintains a priority list of the user's "preferred" languages. The system identifies the electronic copy of the rendered document and determines the scan location within the document. The system also identifies a second electronic copy of the translated document in any of the user's preferred languages and determines a location in the translated document that corresponds to a scan location in the original document. When the corresponding location is not known exactly, the system identifies a small area (eg paragraph) containing the corresponding location of the scanned location. The corresponding translated location is then presented to the user. This provides the user with an accurate translation of a particular phrase at the scanned location, often including a slang or other idiom that is difficult to translate accurately on a word-per-word basis.

17.7 연구자료의 수집 17.7 Collection of Research Data

특정한 토픽을 연구하는 유저는, 그들이 일련의 개인적인 아카이브에 토픽에 관련된 것으로 기록하기를 원하는, 인쇄된 그리고 화면 상의 모든 종류의 자료를 만날 수 있다. 시스템은 이러한 프로세스가 임의의 하나의 자료에서 짧은 문구를 스캐닝하는 결과로써 자동화될 수 있고, 또한 주제에 대한 간행물에 삽입하기에 적합한 참고문헌을 생성할 수도 있다.Users who study a particular topic can find all sorts of printed and on-screen materials that they want to record as relevant to the topic in a series of personal archives. The system can be automated as a result of scanning short phrases in any one material, and can also generate references suitable for insertion into a publication on a subject.

18. 상업적 적용 18. Commercial application

명백하게, 상업적 활동이 본 문에서 논의된 거의 모든 프로세스로부터 만들어질 수 있지만, 여기서 우리는 소수의 명확한 수입 흐름에 집중하자.Obviously, commercial activities can be created from almost all the processes discussed in this article, but here we focus on a few clear income streams.

18.1 요금기반 검색 및 인덱싱 18.1 Price-Based Search and Indexing

종래 인터넷 검색 엔진은 일반적으로 전자문서의 무료 검색을 제공하고, 또한 인덱스에 컨텐트 공급자의 컨텐트를 포함하는 것에 대해 컨텐트 공급자에게 요 금을 청구하지 않는다. 일부 실시예에서, 시스템은 유저에 대한 비용 청구 및/또는 검색 엔진에 대한 비용지급 및/또는 시스템의 운용 및 사용과 연결하여 컨텐츠 공급자에 대한 비용을 청구한다.Conventional Internet search engines generally provide free retrieval of electronic documents and also do not charge the content provider for including the content provider's content in the index. In some embodiments, the system bills the content provider in connection with billing the user and / or paying the search engine and / or operating and using the system.

일부 실시예에서, 시스템 서비스에 대한 가입자는 페이퍼 문서의 스캔에 기원한 검색을 위한 요금을 지불한다. 예를 들면, 증권중개인은 컴파니 X에 의해 제공된 새로운 제품에 관한 월스트리트 저널의 기사를 읽을 수 있을 것이다. 페이퍼 문서로부터 컴파니 X의 명칭을 스캐닝하고, 필요한 요금의 지불에 동의함으로써, 증권 중개인은 애널리스트의 보고서와 같은 상기 회사에 관한 프리미엄 정보를 취득하기 위해 특별한 또는 독점적인 데이터베이스를 검색하기위해 시스템을 이용한다. 시스템은 또한, 예를 들면 특정한 날에 발간된 모든 신문들이 인덱싱되고 그것들이 기사가 게재될 때까지 사용가능한 것을 보장함으로써, 페이퍼 형태로 가장 많이 읽혀지는 문서가 인덱싱의 우선순위를 가지도록 배치한다.In some embodiments, a subscriber to a system service pays a fee for a search originating in the scanning of a paper document. For example, a stockbroker could read an article in the Wall Street Journal about a new product offered by Company X. By scanning the name of Company X from the paper document and agreeing to pay the necessary fees, securities brokers use the system to search a special or proprietary database to obtain premium information about the company, such as analyst reports. . The system also arranges the documents that are read most in paper form to prioritize indexing, for example by ensuring that all newspapers published on a particular day are indexed and available until the article is published.

컨텐트 공급자는 페이퍼 문서로부터 제시된 검색 문의에서 특정한 용어에 연관된 요금을 지불한다. 예를 들면, 일 실시예에서, 시스템은 공급자에 관한 추가적인 컨텍스트에 기초한(이 경우, 컨텍스트는 컨텐트 공급자가 결과 목록을 위로 올리기 위한 요금을 지급한 것임) 가장 선호되는 컨텐트 공급자를 선택한다. 근본적으로, 검색 공급자는 컨텐트 공급자와의 사전의 재정 계약에 기초하여 페이퍼 문서 검색 결과를 조정한다. 섹션 5.2에서의 키워드 및 주요 어구의 상세를 보라.The content provider pays a fee associated with a particular term in the search query presented from the paper document. For example, in one embodiment, the system selects the most preferred content provider based on an additional context about the provider, in which case the context is paid by the content provider to raise the list of results. In essence, the search provider adjusts the paper document search results based on a prior financial agreement with the content provider. See details of keywords and key phrases in section 5.2.

특정한 컨텐트에 대한 액세스가 특정한 그룹의 사람(클라이언트 또는 피고용자 등)에 한정되는 경우, 이러한 컨텐트는 방화벽에 의해 보호될 수 있고, 그 결과 일반적으로 제 3자에 의해 인덱싱할 수 없게된다. 컨텐트 공급자는 그럼에도 불구하고 보호된 컨텐트에 대한 인덱스를 공급하기를 원할 수 있다. 그러한 경우, 컨텐트 공급자는 시스템 가입자들에게 컨텐트 공급자의 인덱스를 제공하도록 서비스 공급자에 요금을 지급할 수 있다. 예를 들면, 법률회사는 클라이언트의 문서 모두를 인덱스할 수 있다. 상기 문서들은 법률회사의 방화벽 뒤에 저장된다. 그러나 법률회사는 자신의 피고용인과 클라이언트가 휴대가능한 스캐너를 통해 상기 문서들에 액세스하여 서비스 공급자에게 인덱스(또는 인덱스에 대한 포인터)를 제공하고, 법률회사의 피고용자 또는 클라이언트가 그들의 휴대가능한 스캐너를 통해 페이퍼-스캔된 검색 용어를 제시할 때, 서비스 공급자가 차례로 법률회사의 인덱스를 검색하기를 원한다. 법률회사는 상기 기능을 가능하게 하기 위해 서비스 공급자의 시스템에 피고용인 및/또는 클라이언트의 목록을 제공할 수 있거나, 또는 시스템이 상기 법률회사의 인덱스를 검색하기 전에 법률회사에 문의함으로써 액세스 권한을 검증할 수 있다. 상술한 예에서, 법률회사에 의해 제공된 인덱스는 그 클라이언트의 문서에 한정되는 것이지, 법률회사의 모든 문서의 인덱스가 아님에 유의하라. 따라서, 서비스 공급자는 법률회사가 그 클라이언트를 위해 인덱싱한 문서에 대해서만 법률회사의 클라이언트가 액세스하도록 승인할 수 있다.If access to certain content is limited to a specific group of people (such as clients or employees), such content may be protected by a firewall, and as a result, generally not indexable by third parties. The content provider may nevertheless want to supply an index for protected content. In such case, the content provider may pay a fee to the service provider to provide indexes of the content provider to system subscribers. For example, a law firm can index all of a client's documents. The documents are stored behind the law firm's firewall. However, law firms may have their employees and clients access these documents through portable scanners to provide service providers with an index (or a pointer to an index), and law firm employees or clients may use paper through their portable scanners. When presenting the scanned search term, the service provider wants to search the law firm's index in turn. The law firm may provide a list of employees and / or clients to the service provider's system to enable the function, or verify the access rights by contacting the law firm before the system retrieves the law firm's index. Can be. In the above example, note that the index provided by the law firm is limited to that client's document, not the index of all the law firm's documents. Thus, a service provider can authorize a law firm's client access only to documents that the law firm has indexed for that client.

페이퍼 문서에 기원한 검색으로부터 야기될 수 있는 적어도 2 가지의 개별 수입흐름이 있는데: 하나는 검색 기능으로부터의 수입 흐름이고, 다른 하나는 컨텐트 전달 기능으로부터의 수입흐름이다. 상기 검색 기능의 수입은 스캐너 유저의 유료 가입으로부터 생성되지만, 또한 검색당 요금에 따라 생성될 수도 있다. 컨텐 트 전달 수입은 컨텐트 공급자 또는 저작권 소유자와 공유될 수 있지만(서비스 공급자는 각 전달에 대해 일정한 매매의 퍼센트 또는 최소지불액과 같은 고정된 요금을 취할 수 있다.), 또한 서비스 공급자가 거래를 중개하는 지의 여부를 고려하지 않고, 가입자가 온라인 카탈로그로부터 주문을 하고, 시스템에 그에 배송 또는 제공하는 모든 아이템에 대해 시스템이 요금 또는 퍼센트를 취하는 "위탁" 모델에 의해 생성될 수도 있다. 일부 실시예에서, 시스템 서비스 공급자는 소정의 시간 동안 또는 식별된 제품의 구매가 이루어진 임의의 후속하는 시간에 가입자가 컨텐트 공급자로부터 행한 모든 구매에 대한 수입을 받는다.There are at least two separate revenue streams that may result from a search originating in the paper document: one is the revenue stream from the search function, and the other is the import flow from the content delivery function. The revenue of the search function is generated from the paid subscription of the scanner user, but may also be generated according to the fee per search. Content delivery revenue can be shared with content providers or copyright owners (service providers can take a fixed fee, such as a percentage or minimum payment of a certain sale for each delivery), but also if the service provider brokers the transaction. Regardless of whether or not a subscription is made, it may be generated by a “consignment” model in which the subscriber places an order from an online catalog and the system takes a fee or a percentage for every item that is delivered or provided to the system. In some embodiments, the system service provider receives revenue for all purchases made by the subscriber from the content provider for a given time or any subsequent time a purchase of the identified product has been made.

18.2 카탈로그 18.2 Catalog

소비자는 페이퍼 카탈로그로부터 구매를 행하기 위해 휴대가능한 스캐너를 이용할 수 있다. 가입자는 상기 카탈로그를 식별하는 정보를 카탈로그로부터 스캔한다. 이러한 정보는 카탈로그로부터의 텍스트, 바코드, 또는 카탈로그의 또다른 식별자이다. 가입자는 그녀/그가 구매하기를 원하는 제품을 식별하는 정보를 스캔한다. 카탈로그 메일링 라벨은 카탈로그 벤더에 대해 소비자를 식별하는 소비자 식별 번호를 포함할 수 있다. 만약 그렇다면, 가입자는 이러한 소비자 식별 번호도 스캔할 수 있다. 시스템은 소비자의 선택과 소비자 식별 번호를 벤더에게 제공함으로써 카탈로그 구매를 돕기위해 가입자와 벤더 사이에서 중개자로서의 역할을 한다.A consumer can use a portable scanner to make a purchase from a paper catalog. The subscriber scans from the catalog information that identifies the catalog. This information is text from a catalog, a barcode, or another identifier of the catalog. The subscriber scans the information identifying the product she / he wants to purchase. The catalog mailing label may include a consumer identification number that identifies the consumer to the catalog vendor. If so, the subscriber can also scan this consumer identification number. The system acts as an intermediary between the subscriber and the vendor to assist in catalog purchase by providing the consumer's choice and consumer identification number to the vendor.

18.3 쿠폰 18.3 Coupon

소비자는 페이퍼 쿠폰을 스캔하여, 추후의 복구와 사용을 위해 스캐너, 또는 컴퓨터와 같은 원격 디바이스에 쿠폰의 전자 복사본을 저장한다. 전자적 저장의 장점은 소비자가 페이퍼 쿠폰을 가지고 다니는 번거로움으로부터 자유롭다는 것이다. 추가적인 장점은 전자 쿠폰은 어떠한 위치에서건 가져올 수 있다는 것이다. 일부 실시예에서, 시스템은 쿠폰의 만료 기일을 추적하고, 곧 만료되는 쿠폰에 관해 소비자에게 경고하고, 및/또는 기한이 만료된 쿠폰을 저장에서 삭제할 수 있다. 쿠폰 발급자에 대한 장점은 누가 쿠폰을 사용하고 있는지, 그리고 언제 어디서 그것들이 캡쳐되고 사용되는지에 관해 보다 많은 피드백을 받을 가능성이 있다는 것이다.The consumer scans the paper coupon and stores an electronic copy of the coupon on a remote device such as a scanner or a computer for later recovery and use. The advantage of electronic storage is that the consumer is free from the hassle of carrying paper coupons. An additional advantage is that the electronic coupon can be taken from any location. In some embodiments, the system may track the expiration date of the coupon, warn the consumer about a coupon that is about to expire, and / or delete the expired coupon from storage. An advantage for coupon issuers is that they are likely to receive more feedback about who is using the coupon and when and where they are captured and used.

19. 일반적인 응용 19. Typical Applications

19.1 폼 19.1 Form

시스템은 페이퍼 폼에 상응하는 전자 문서를 자동으로 채우는(auto-populate) 데에 사용될 수 있다. 유저는 페이퍼 폼을 고유하게 식별하는 일련의 텍스트 또는 바코드를 스캔한다. 스캐너는 인접한 컴퓨터에 유저를 식별하는 폼과 정보의 신원을 통신한다. 인접한 컴퓨터는 인터넷 연결을 가진다. 인접한 컴퓨터는 폼의 제 1 데이터베이스와 스캐너의 유저에 관한 정보(서비스 공급자의 가입자 정보 데이터베이스와 같은)를 구비한 제 2 데이터베이스에 액세스할 수 있다. 인 접한 컴퓨터는 제 1 데이터베이스로부터의 페이퍼 폼의 전자적 버전에 액세스하고, 제 2 데이터베이스로부터 취득된 유저 정보로부터 폼의 필드를 자동으로 채운다. 그런다음 인접한 컴퓨터는 완료된 폼을 의도한 수취인에게 이메일로 전송한다. 대안으로, 상기 컴퓨터는 인접한 프린터에서 완료된 폼을 인쇄할 수 있다.The system can be used to auto-populate an electronic document corresponding to a paper form. The user scans a series of text or bar codes that uniquely identify the paper form. The scanner communicates the identity of the information with the form identifying the user to an adjacent computer. Adjacent computers have an Internet connection. The adjacent computer can access a first database of forms and a second database having information about the user of the scanner (such as a service provider's subscriber information database). The adjacent computer accesses the electronic version of the paper form from the first database and automatically fills in the fields of the form from the user information obtained from the second database. The adjacent computer then e-mails the completed form to the intended recipient. Alternatively, the computer can print the completed form on an adjacent printer.

외부 데이터베이스에 액세스하는 것이 아닌, 일부 실시예에서, 시스템은 신원 모듈, SIM, 또는 보안 카드에서와 같이, 유저 정보를 포함하고 있는 휴대가능한 스캐너를 구비한다. 상기 스캐너는 인접한 PC로 폼을 식별하는 정보를 제공한다. 인접한 PC는 전자 폼에 액세스하여, 상기 폼을 채우기 위해 필요한 정보를 스캐너에게 문의한다.In some embodiments, rather than accessing an external database, the system includes a portable scanner containing user information, such as in an identity module, SIM, or security card. The scanner provides information identifying the form to an adjacent PC. The adjacent PC accesses the electronic form and asks the scanner for the information needed to fill the form.

19.2 업무용 명함 19.2 Business Cards

시스템은 페이퍼 문서로부터 전자 주소록 또는 다른 컨택트 목록을 자동으로 채우는 데에 사용될 수 있다. 예를 들면, 새롭게 만난 사람의 업무용 명함을 받았을 때, 유저는 그/그녀의 휴대폰으로 상기 명함의 이미지를 캡쳐할 수 있다. 시스템은 상기 명함의 전자 복사본을 위치지정하고, 이것은 새롭게 만난 사람의 컨택트 정보를 가진 휴대폰의 온보드 주소록을 업데이트하는 데에 사용될 수 있다. 상기 전자 복사본은 업무용 명함에 압축될 수 있는 새롭게 만난 사람에 관한 보다 많은 정보를 담을 수 있다. 또한, 온보드 주소록은 전자 복사본에 대한 임의의 변경사항이 휴대폰의 주소록에서 자동으로 업데이트되도록 전자 복사본에 대한 링크도 저장할 수 있다. 본 예에서, 업무용 명함은 전자 복사본이 있음을 가리키는 심볼 또 는 텍스트를 선택적으로 포함한다. 전자 복사본이 없다면, 휴대폰은 새롭게 만난 사람에 대해 주소록에 기입사항을 채워 놓기 위해 OCR 및 표준 업무용 명함 포맷의 지식을 이용할 수 있다. 심볼은 또한 상기 이미지로부터 직접 정보를 추출하는 프로세스를 도울 수도 있을 것이다. 예를 들면, 업무용 명함 상의 전화번호 다음에 있는 전화 아이콘은 전화 번호의 위치를 판정하기 위해 인식될 수 있다.The system can be used to automatically populate an electronic address book or other contact list from a paper document. For example, when receiving a business card of a newly met person, the user can capture an image of the card with his / her mobile phone. The system locates an electronic copy of the business card, which can be used to update the onboard address book of the mobile phone with the contact information of the newly met person. The electronic copy may contain more information about the newly met person, which may be compressed into a business card. The onboard address book can also store a link to the electronic copy so that any changes made to the electronic copy are automatically updated in the phone's address book. In this example, the business card optionally includes a symbol or text indicating that there is an electronic copy. Without an electronic copy, cell phones can use knowledge of OCR and standard business card formats to fill in entries in the address book for new contacts. The symbol may also help the process of extracting information directly from the image. For example, a telephone icon next to a telephone number on a business card may be recognized to determine the location of the telephone number.

19.3 교정/편집 19.3 Proofreading / Editing

시스템은 교정 및 편집 프로세스를 개선시킬수 있다. 시스템이 편집 프로세스를 개선시킬 수 있는 한가지 방식은 페이퍼 문서와 편집자의 상호작용을 그의 전자 사본에 링크하는 것에 의한 것이다. 편집자가 페이퍼 문서를 읽고, 그 문서의 여러 부분을 스캔할 때, 시스템은 상기 페이퍼 문서의 전자 사본에 대한 적절한 주석과 편집을 할 것이다. 예를 들면, 편집자가 텍스트 부분을 스캔하고 스캐너로 "새로운 문단"의 제어 제스쳐를 하면, 스캐너와 통신하는 컴퓨터는 상기 문서의 전자 복사본에서의 스캔된 텍스트의 위치에 "새로운 문단"의 브레이크를 삽입할 것이다.The system can improve the calibration and editing process. One way the system can improve the editing process is by linking the editor's interaction with the paper document to its electronic copy. When an editor reads a paper document and scans various parts of the document, the system will make appropriate comments and edits to the electronic copy of the paper document. For example, when an editor scans a piece of text and makes a gesture control of a "new paragraph" with the scanner, the computer communicating with the scanner inserts a break of "new paragraph" at the position of the scanned text in the electronic copy of the document. something to do.

19.4 음성 주석 19.4 Voice Annotation

유저는 상기 문서로부터 텍스트 부분을 스캐닝하고, 스캔된 텍스트에 연관된 음성 녹음을 함으로써 문서에 대한 음성 주석을 달 수 있다. 일부 실시예에서, 스캐너는 유저의 구두 주석을 녹음하기 위한 마이크로폰을 구비한다. 구두 주석이 녹음된 후에, 시스템은 텍스트가 스캔된 문서를 식별하고, 상기 문서내에 스캔된 텍스트를 위치시키고, 그 지점에 음성 주석을 첨부한다. 일부 실시예에서, 시스템은 스피치를 텍스트로 변환하고, 텍스트의 코멘트로서 주석을 첨부한다.A user can annotate a document by scanning a text portion from the document and making a voice recording associated with the scanned text. In some embodiments, the scanner has a microphone for recording the user's verbal annotations. After the oral comment is recorded, the system identifies the document from which the text was scanned, locates the scanned text within the document, and attaches a voice comment at that point. In some embodiments, the system converts speech to text and annotates as comments in the text.

일부 실시예에서, 시스템은 상기 문서에 포함된 주석에 대한 참조로서만 상기 주석을 상기 문서와 분리시켜 유지한다. 그런 다음 주석은 지정된 가입자 또는 유저 그룹을 위한 문서에 대한 주석 마크업 층이 된다.In some embodiments, the system keeps the annotation separate from the document only as a reference to the annotation included in the document. The annotation then becomes the annotation markup layer for the document for the specified subscriber or user group.

일부 실시예에서, 각각의 캡쳐 및 연관된 주석에 대해, 시스템은 문서를 식별하고, 소프트웨어패키지를 이용하여 그것을 열고, 스캔 위치로 스크롤하고, 음성 주석을 재생한다. 그런 다음, 유저는 음성 주석, 그들 자신 또는 다른 사람에 의해 제시된 변경 또는 기타 기록된 코멘트를 참조하면서 문서와 상호작용한다.In some embodiments, for each capture and associated annotation, the system identifies the document, opens it using a software package, scrolls to the scan location, and plays back the voice annotation. The user then interacts with the document, referring to voice annotations, changes made by themselves or others, or other recorded comments.

19.5 텍스트로 된 도움말 19.5 Textual Help

상술한 시스템은 전자 도움말 메뉴로 페이퍼 문서를 개선시키는 데에 사용될 수 있다. 일부 실시예에서, 페이퍼 문서에 연관된 마크업 층은 상기 문서에 대한 도움말 메뉴 정보를 포함한다. 예를 들면, 유저가 문서의 특정한 부분으로부터 텍스트를 스캔할 때, 시스템은 상기 문서에 연관된 마크업을 체크하고, 도움말 메뉴를 유저에게 제공한다. 도움말 메뉴는 스캐너의 디스플레이 또는 연관된 인접한 디스플레이 상에 표시된다.The system described above can be used to enhance the paper document with an electronic help menu. In some embodiments, the markup layer associated with the paper document includes help menu information for the document. For example, when a user scans text from a particular portion of a document, the system checks the markup associated with the document and provides a help menu to the user. The help menu is displayed on the display of the scanner or on an associated adjacent display.

19.6 디스플레이 사용 19.6 Using the Display

일부 상황에서, 텔레비전, 컴퓨터 모니터, 또는 다른 유사한 디스플레이로부터 정보를 스캔할 수 있는 것이 유리하다. 일부 실시예에서, 휴대가능한 스캐너가 컴퓨터 모니터와 텔레비전으로부터 정보를 스캔하는 데에 사용된다. 일부 실시예에서, 휴대가능한 광학 스캐너는 래스터라이징, 스크린 브랭킹 등과 같은 전형적인 음극선 튜브(CRT) 디스플레이 기술로 작업하기에 최적화된 조명 센서를 가진다.In some situations, it is advantageous to be able to scan information from a television, computer monitor, or other similar display. In some embodiments, a portable scanner is used to scan information from computer monitors and televisions. In some embodiments, the portable optical scanner has an illumination sensor optimized for working with typical cathode ray tube (CRT) display technologies such as rasterizing, screen blanking, and the like.

문서로부터 텍스트를 읽는 유저의 오디오를 캡쳐함으로써 동작하는 음성 캡쳐 디바이스는 일반적으로 문서가 페이퍼 상에 있는지, 디스플레이 상인지, 또는 기타 다른 매체 상인지의 여부에 상관없이 작동한다.Voice capture devices that operate by capturing audio of a user reading text from a document generally operate whether the document is on paper, on a display, or on some other medium.

19.6.1. 공공 키오스크 및 동적 세션 IDs 19.6.1. Public Kiosk and Dynamic Session IDs

디스플레이의 직접 스캐닝의 일 사용은 섹션 15.6에 기술된 것과 같은 디바이스에 연관된 것이다. 예를 들면, 일부 실시예에서, 공공 키오스크는 동적 세션 ID를 자신의 모니터 상에 표시한다. 상기 키오스크는 인터넷 또는 회사 인트라넷과 같은 통신 네트워크에 연결된다. 세션 ID는 주기적으로, 하지만, 적어도 새로운 세션 ID가 모든 유저에게 표시되도록 하기 위해 키오스크가 사용될 때는 항상 변화한다. 상기 키오스크를 사용하기 위해, 가입자는 키오스크 상에 표시된 세션 ID를 스캔한다; 상기 세션 ID를 스캐닝함으로써 유저는, 자신이 인쇄된 문서의 스캔 또는 키오스크 스크린 자체로부터 발생한 컨텐트의 전달을 위해 상기 키오스크를 자신의 스캐너에 임시로 연결시키기를 원한다는 것을 시스템에 표현한다. 상기 스캐너는 세션 ID와 스캐너를 인증하는 다른 정보(시리얼 번호, 계정 번호, 또는 기타 식별 정보)를 직접 시스템에 통신할 수 있다. 예를 들면, 스캐너는 유저의 휴대폰(블루투스™를 통해 유저의 스캐너와 쌍을 이루는)을 통해 세션 초기화 메시지를 전송함으로써 시스템과 직접(여기서 "직접"은 키오스크를 통해 메시지를 통과시키지 않는 것을 의미함) 통신할 수 있다. 대안으로, 스캐너는 키오스크와 무선 링크를 구축하고, 세션 초기화 정보를 키오스크에 전송함으로써(아마도 블루투스™, 등과 같은 짧은 범위의 RF를 통해) 키오스크의 통신 링크를 이용할 수 있다; 응답하여, 키오스크는 자신의 인터넷 연결을 통해 시스템에 세션 초기화 정보를 전송한다.One use of direct scanning of the display is associated with a device as described in section 15.6. For example, in some embodiments, a public kiosk displays a dynamic session ID on its monitor. The kiosk is connected to a communication network such as the Internet or a corporate intranet. The session ID changes periodically, but at all times when the kiosk is used to ensure that at least the new session ID is displayed to all users. To use the kiosk, the subscriber scans the session ID displayed on the kiosk; By scanning the session ID, the user expresses to the system that he wishes to temporarily connect the kiosk to his scanner for the scanning of the printed document or the delivery of content from the kiosk screen itself. The scanner may communicate the session ID and other information (serial number, account number, or other identifying information) that authenticates the scanner directly to the system. For example, the scanner sends a session initiation message through the user's mobile phone (paired with the user's scanner via Bluetooth ™), directly with the system (where "direct" means not passing the message through the kiosk). Can communicate. Alternatively, the scanner may use the kiosk's communication link by establishing a wireless link with the kiosk and sending session initialization information to the kiosk (perhaps via a short range of RF such as Bluetooth ™, etc.); In response, the kiosk sends session initialization information to the system via its internet connection.

시스템은 디바이스가 스캐너와 연결되어 있는 기간(또는 세션) 동안 스캐너에 이미 연결된 디바이스를 다른 사람이 이용하는 것을 방지할 수 있다. 이러한 특징은 또다른 사람의 세션이 끝나기 전에 다른 사람이 공공 키오스크를 이용하는 것을 방지하는 데에 유용하다. 인터넷 카페에서 컴퓨터를 이용하는 것에 관련된 이러한 개념의 예로서, 유저는 그녀/그가 사용하기를 원하는 PC의 모니터 상의 바코드를 스캔하고; 응답하여, 시스템이 그것이 디스플레이되는 모니터로 세션 ID를 전송하고; 유저는 모니터로부터 세션 ID를 스캔함으로써(또는 그것을 휴대가능한 스캐너 상의 키패드 또는 터치 스크린 또는 마이크로폰을 통해 입력함으로써) 세션을 초기화하고; 다른 스캐너가 세션 ID를 스캔하고 그/그녀의 세션 동안 모니터를 이용할 수 없도록 시스템이 자신의 데이터베이스에서 세션 ID를 그/그녀의 스캐너의 시리얼 번호(또는 유저 스캐너를 고유하게 식별하는 다른 식별자)와 연결시킨다. 스캐너는 모니터와 연결된 PC와 통신하거나(블루투스™와 같은 무선 링크, 또 는 도킹 스테이션과 같은 배선 연결을 통해), 또는 휴대폰 등과 같은 다른 수단을 통해 시스템과 직접(즉, PC를 통하지 않고) 통신한다.The system can prevent others from using a device already connected to the scanner during the period (or session) with which the device is connected with the scanner. This feature is useful to prevent others from using a public kiosk before another person's session is over. As an example of this concept related to using a computer in an internet cafe, a user scans a barcode on the monitor of a PC she / he wants to use; In response, the system sends the session ID to the monitor on which it is displayed; The user initiates the session by scanning the session ID from the monitor (or entering it through a keypad or touch screen or microphone on a portable scanner); The system associates the session ID with its serial number (or other identifier that uniquely identifies the user scanner) in its database so that other scanners can scan the session ID and the monitor is unavailable during his / her session. Let's do it. The scanner communicates with the PC connected to the monitor (via a wireless link, such as Bluetooth ™, or via a wired connection, such as a docking station), or directly (ie, not through a PC) to the system via another means, such as a mobile phone. .

제 4 4th 파트part - 시스템 및 휴대가능한 System and portable 디바이스의Device 상세 Detail

도 4는 휴대가능한 스캐닝 디바이스의 일반적인 사용을 도시한 사시도이다. 도시된 예에서, 유저는 스캐닝 능력을 가진 휴대가능한 디바이스(500)를 통해 신문(410)에서 텍스트를 스캔한다. 유저는 휴대가능한 스캐너(500)로 텍스트(420)의 라인의 일부를 스캔하였다. 텍스트(420)의 라인의 일부의 이미지는 스캐너(500)에 의해 저장되고, 원격 저장을 위해 다른 디바이스로 전송되고, 압축되고, 또는 다양한 방식으로 처리될 수 있다. 일부 실시예에서, 휴대가능한 스캐너(500)는 신문 기사를 고유하게 식별하기 위해서 언제 충분한 정보가 스캔되었는지를 가리킨다.4 is a perspective view illustrating a general use of a portable scanning device. In the example shown, the user scans text in newspaper 410 via portable device 500 with scanning capability. The user scanned a portion of the line of text 420 with a portable scanner 500. An image of a portion of the line of text 420 may be stored by the scanner 500, sent to another device for remote storage, compressed, or processed in various ways. In some embodiments, portable scanner 500 indicates when enough information has been scanned to uniquely identify a newspaper article.

도 4에 도시된 예에서, 휴대가능한 디바이스(500)는 펜 폼-팩터의 스캐너이다. 그러나, 디지털 카메라와 같은 임의의 이미지 캡쳐링 능력을 가진 휴대가능한 디바이스가 도 4에 도시된 휴대가능한 디바이스(400)와 적절한 등가물이다.In the example shown in FIG. 4, the portable device 500 is a pen form-factor scanner. However, a portable device with any image capturing capability, such as a digital camera, is a suitable equivalent to the portable device 400 shown in FIG.

도 5는 일반적인 휴대가능한 스캐닝 디바이스(500)의 실시예의 기능 블록도이다. 휴대가능한 스캐닝 디바이스(500)는 스캔될 그래픽 또는 텍스트 등의 대상물에 조광하는 광원(505)을 갖는다. 스캔된 대상물로부터 반사된 광이 렌즈(510)를 통과해서 지나고, 그의 특성(컬러, 강도 등)이 전하결합소자(Charge-Coupled Device)(CCD) 어레이(515)와 같은 적절한 디바이스에 의해 등록된다. CCD 어레이(515)에 저장된 아날로그 데이터는 아날로그-디지털(A/D) 컨버터(520)에 의해 디 지털 형태로 변환된다.5 is a functional block diagram of an embodiment of a general portable scanning device 500. The portable scanning device 500 has a light source 505 that illuminates an object such as a graphic or text to be scanned. Light reflected from the scanned object passes through the lens 510 and its characteristics (color, intensity, etc.) are registered by a suitable device such as a charge-coupled device (CCD) array 515. . Analog data stored in the CCD array 515 is converted into a digital form by the analog-to-digital (A / D) converter 520.

도 5에 도시된 실시예에서, DSP(575)는 전원(540)에 의해 전력이 공급되고, 시스템 클록(570), A/D 컨버터(520), 이미지 압축 로직(525), 메모리(530), 빌링/가입/디바이스 식별자 메모리(580), 전력 관리 로직(535), 위치 모듈(545), 통신 인터페이스(550) 및 유저 인터페이스(560)와 동작가능하게 연결된다. 이미지 데이터가 A/D 컨버터(520)에 의해 디지털화된 후, 디지털 신호 처리기(DSP)(575)는 메모리(530)에 저장된 프로그램에 따라 이미지 데이터에 대한 다양한 동작을 수행할 수 있다.In the embodiment shown in FIG. 5, DSP 575 is powered by power source 540, system clock 570, A / D converter 520, image compression logic 525, memory 530. And operatively coupled with billing / subscription / device identifier memory 580, power management logic 535, location module 545, communication interface 550, and user interface 560. After the image data is digitized by the A / D converter 520, the digital signal processor (DSP) 575 may perform various operations on the image data according to a program stored in the memory 530.

디지털 신호 처리기(575)는 메모리(530)에 디지털 이미지 데이터를 저장할 수 있다. 메모리(530) 공간을 절약하기 위해, DSP(575)는 저장 이전에 디지털 이미지 데이터를 압축하기 위해 이미지 압축 로직(525)에 액세스함으로써 이미지 압축 스킴을 구현할 수 있다. 잘알려진 JPEG 또는 JBIG 압축 스킴과 같은 많은 종류의 이미지 압축 스킴이 사용될 수 있다. 일부 경우에, DSP(575)는 광학 문자 인식(OCR)을 대안으로 사용하여, 메모리(530)에 저장하기 전에 스캔된 이미지 데이터를 텍스트로 변환한다.The digital signal processor 575 may store digital image data in the memory 530. To save memory 530 space, DSP 575 may implement an image compression scheme by accessing image compression logic 525 to compress digital image data prior to storage. Many kinds of image compression schemes can be used, such as the well-known JPEG or JBIG compression schemes. In some cases, DSP 575 alternatively uses optical character recognition (OCR) to convert the scanned image data into text before storing in memory 530.

전력 관리 로직(535)은 전원(540) 상태와 휴대가능한 스캐닝 디바이스(500)의 여러 컴포넌트에 의한 전력소비 속도를 모니터링한다. 전원(540)이 배터리와 같은 내부 전원이라면, 배터리 수명을 연장시키기 위해 전력관리 로직(535)은 특정한 컴포넌트를 휴지시키거나 또는 저전력 모드로 들어가게한다. 추가하여, 전력관리 로직(535)은 유저 인터페이스(560)로 하여금, 적색 LED를 조광하고, 가청 알람 을 소리나게하는 등의 "로우 배터리" 경고를 통신하거나, 또는 LCD 상에 "로우 배터리" 아이콘을 디스플레이하도록 할 수 있다.The power management logic 535 monitors the power 540 status and the rate of power consumption by the various components of the portable scanning device 500. If power source 540 is an internal power source such as a battery, power management logic 535 may suspend certain components or enter a low power mode to extend battery life. In addition, the power management logic 535 may cause the user interface 560 to communicate a "low battery" alert, such as dimming a red LED, sounding an audible alarm, or "low battery" icon on the LCD. Can be displayed.

메모리(530)는 DSP(575)에 대한 프로그램 명령어를 포함할 수 있다. 그것은 또한 압축 또는 비압축된 포맷의 텍스트 및/또는 이미지 데이터를 저장하는 데에 사용될 수도 있다. 추가로, 상기 이미지 데이터에 연관된 타임스탬프와 위치스탬프가 메모리(530)에 저장될 수 있다.The memory 530 may include program instructions for the DSP 575. It may also be used to store text and / or image data in a compressed or uncompressed format. In addition, timestamps and location stamps associated with the image data may be stored in the memory 530.

클록(570)은 휴대가능한 스캐너(500)의 다양한 컴포넌트의 동작을 동기화시키기 위한 클록 신호를 제공한다. 클록(570)은 또한 타임 스탬핑 이미지 데이터에 대한 참조시간을 제공할 수도 있다. 예를 들면, 유저가 텍스트의 일부를 스캔할 때, DSP(575)는 스캔 데이터에 대해 OCR을 실행하고, 그 결과인 텍스트를 클록(570)으로부터 습득된 타임 스탬프와 함께 메모리(530)에 저장한다. 대안으로, 특히 GPS 수신기가 위치 모듈(545)에 포함되어 있다면, 타임스탬프가 위치 모듈(545)로부터 습득될 수 있다.The clock 570 provides a clock signal for synchronizing the operation of various components of the portable scanner 500. Clock 570 may also provide a reference time for the time stamped image data. For example, when a user scans a portion of text, DSP 575 performs OCR on the scan data and stores the resulting text in memory 530 with the time stamp learned from clock 570. do. Alternatively, timestamps may be obtained from the location module 545, especially if a GPS receiver is included in the location module 545.

위치 모듈(545)은 휴대가능한 디바이스(500)에 위치 판정 기능을 제공한다. 위치 모듈(545)은, 인공위성과 그라운드 기반의 전송기로 구성된 GPS 네트워크에 의한 신호 브로드캐스트를 모니터링함으로써 위치 및 시간 정보를 제공하는 GPS 수신기를 포함할 수 있다. 이러한 위치 정보는 특정한 스캔이 어디서 발생했는지를 가리키는 위치 스탬프를 제공하는 데에 사용될 수 있다. 예를 들면, 유저가 텍스트의 일부를 스캔할 때, DSP(575)는 스캔 데이터에 대한 OCR을 수행하고, 그결과인 텍스트를 위치 모듈(545)로부터 습득된 위치 스탬프와 함께 메모리(530)에 저장할 수 있다. 위치 스탬프는 국가, 주, 지역, 도시, 제공하는 네트워크 액세스 지점, 100미터 이내의 위치, 정확한 위치 등과 같이 한정하는 다양한 레벨에 있을 수 있다.The location module 545 provides location determination functionality to the portable device 500. The location module 545 may include a GPS receiver that provides location and time information by monitoring signal broadcasts by a GPS network comprised of satellites and ground based transmitters. This location information can be used to provide a location stamp indicating where a particular scan occurred. For example, when a user scans a portion of text, the DSP 575 performs an OCR on the scan data and places the resulting text in memory 530 along with the location stamp obtained from the location module 545. Can be stored. Location stamps can be at various levels, such as country, state, region, city, providing network access point, location within 100 meters, exact location, and the like.

통신 인터페이스(550)는 휴대가능한 디바이스(500)가 다른 디바이스와 통신하도록 인가하는 송수신기를 포함한다. 통신 인터페이스(505)는 짧은 범위 RF(블루투스, IEEE 802.11 등), 휴대폰, 또는 광학(적외선 등)과 같은 무선 인터페이스가 될 수 있다. 통신 인터페이스(550)가 무선 능력을 포함하는 경우, 휴대가능한 스캐닝 디바이스는 또한 무선 기능을 구현하기위해 필요한 안테나 또는 렌즈를 포함한다.The communication interface 550 includes a transceiver that authorizes the portable device 500 to communicate with other devices. The communication interface 505 can be a short range RF (Bluetooth, IEEE 802.11, etc.), a mobile phone, or a wireless interface such as optical (infrared, etc.). When the communication interface 550 includes wireless capability, the portable scanning device also includes an antenna or lens needed to implement the wireless function.

통신 인터페이스(550)는 또한 USB 및 그와 유사한 스킴 등과 같은 유선 인터페이스를 포함할 수 있다. 통신 인터페이스(550)가 USB와 같은 유선 인터페이스인 경우, 통신 인터페이스(550)는 내부 전원(540)을 충전하거나 휴대가능한 스캐닝 디바이스(500)를 가동하기 위해 전력을 공급할 수 있다.The communication interface 550 can also include a wired interface, such as USB and similar schemes. If the communication interface 550 is a wired interface such as USB, the communication interface 550 can supply power to charge the internal power source 540 or to power the portable scanning device 500.

유저 인터페이스(560)는 스피커 및 마이크로폰과 같은 오디오 능력, LCD 디스플레이 또는 LEDs와 같은 시각 능력, 및/또는 버저 및 변환기와 같은 촉각(접촉) 능력을 포함할 수 있다.User interface 560 may include audio capabilities such as speakers and microphones, visual capabilities such as LCD displays or LEDs, and / or tactile (contact) capabilities such as buzzers and transducers.

도 6은 시스템에 의해 일반적으로 사용되는 데이터 레코드(600)용 포맷을 도시한 데이터 구조도이다. 데이터 레코드(600)는 스캔된 데이터(630)를 포함한다. 상기 스캔된 데이터(630)는 텍스트, 이미지, 심볼 또는 임의의 적절한 데이터 타입이 될 수 있다. 데이터 레코드(600)는 또한 스캔된 데이터(630)에 연관된 타임 스 탬프(610)를 포함한다. 일부 실시예에서, 타임 스탬프(610)는 스캔된 테이터(630)가 디바이스(500)에 의해 획득된 시간을 가리킨다. 데이터 레코드(600)는 스캔된 데이터(630)에 연관된 위치 스탬프(620)를 포함한다. 일부 실시예에서, 위치 스탬프(620)는 스캔된 데이터(630)가 획득된 위치를 가리킨다. 일부 실시예에서, 타임 스탬프(610)와 위치 스탬프(620)는 각각 시간과 위치에 의해 스캔된 데이터(630)를 인덱싱하고, 그에 의해 스캔 시간 및/또는 위치에 의해 저장된 데이터에 대한 검색을 가능하게 한다.6 is a data structure diagram illustrating the format for a data record 600 generally used by the system. Data record 600 includes scanned data 630. The scanned data 630 may be text, image, symbol or any suitable data type. Data record 600 also includes a time stamp 610 associated with the scanned data 630. In some embodiments, time stamp 610 indicates the time at which scanned data 630 was obtained by device 500. Data record 600 includes a location stamp 620 associated with the scanned data 630. In some embodiments, location stamp 620 indicates the location from which scanned data 630 was obtained. In some embodiments, time stamp 610 and location stamp 620 respectively index the scanned data 630 by time and location, thereby enabling retrieval of data stored by scan time and / or location. Let's do it.

도 7은 휴대가능한 디바이스(500)를 이용하여 문서가 스캔된 위치 및/또는 시간에 관한 정보를 검출하고 저장하기 위해 시스템에 의해 일반적으로 수행되는 단계들을 도시하는 흐름도이다. 단계(710)에서, 휴대가능한 디바이스(500)는 스캔된 이미지 또는 텍스트와 같은 데이터(630)를 획득한다. 휴대가능한 디바이스(500)는 데이터 레코드(600)에 타임스탬프(610) 또는 위치 스탬프(620)를 포함할지 여부에 관한 미리정해진 명령어를 포함할 수 있다. 단계(715)에서 휴대가능한 디바이스(500)는 타임스탬프(610)가 데이터 레코드(600)에서 필요한지 여부를 판정한다. 타임스탬프가 데이터 레코드(600)에서 필요하다면, 단계(720)에서 휴대가능한 디바이스(500)는 클록(570)으로부터(GPS가 가능하다면, 위치 모듈(545)로부터 가능함) 타임스탬프 정보(610)를 습득하고, 단계(725)로 진행한다. 단계(715)에서 타임스탬프가 필요하지 않는다면, 휴대가능한 디바이스(500)는 단계(725)로 진행한다. 단계(725)에서, 휴대가능한 디바이스(500)는 위치 스템프(620)가 데이터 레코드(600)에서 필요한지 여부를 판정한다. 위치 스탬프가 데이터 레코드(600)에서 필요하다면, 휴대가능한 디바이스(500)는 위치 모듈(255)로부터 위치 스탬프 정보(620)를 습득하고, 단계(735)로 진행한다. 위치 스탬프가 단계(725)에서 필요하지않다면, 휴대가능한 디바이스(500)는 단계(735)로 진행한다. 단계(735)에서, 휴대가능한 디바이스(500)가 데이터(630)를 연관된 타임스탬프(610) 또는 위치 스탬프(620)와 함께 메모리(530)에 저장한다.FIG. 7 is a flow diagram illustrating steps generally performed by a system to detect and store information regarding the location and / or time at which a document was scanned using portable device 500. In step 710, portable device 500 obtains data 630, such as a scanned image or text. The portable device 500 may include predetermined instructions as to whether to include a time stamp 610 or a location stamp 620 in the data record 600. In step 715 the portable device 500 determines whether a time stamp 610 is needed in the data record 600. If a timestamp is needed in the data record 600, the portable device 500 at step 720 obtains the timestamp information 610 from the clock 570 (or from the location module 545, if GPS is available). Acquisition, and flow proceeds to step 725. If no time stamp is needed at step 715, the portable device 500 proceeds to step 725. In step 725, portable device 500 determines whether location stamp 620 is needed in data record 600. If a location stamp is needed in the data record 600, the portable device 500 obtains the location stamp information 620 from the location module 255 and proceeds to step 735. If no location stamp is needed at step 725, the portable device 500 proceeds to step 735. In step 735, portable device 500 stores data 630 in memory 530 along with an associated time stamp 610 or location stamp 620.

문서를 식별하기 위해 충분한 정보가 스캔되었음을 유저에게 표시함.Indicate to the user that sufficient information has been scanned to identify the document.

일부 실시예에서, 휴대가능한 스캐너(500)는 유저에게 문서를 식별하기에 충분한 정보가 스캔되었음을 가리킬 수 있다. 예를 들면, 휴대가능한 스캐너(500)는 문서를 고유하게 식별하는 특정한 스캔을 가리키는 미리정해진 임계값을 가질 수 있다. 임계값에 일치하거나, 초과할 때, 휴대가능한 스캐너(500)는 유저 인터페이스(560)를 통해 유저에게 문서를 식별하기에 충분한 정보가 스캔되었음을 가리킬 수 있다. 이러한 미리정해진 임계값들은 발견법(heuristics)(즉, 경험적 방법(rule of thumb)), 통계분석 또는 기타 적절한 방법에 기초하여 결정될 수 있다.In some embodiments, the portable scanner 500 may indicate to the user that enough information has been scanned to identify the document. For example, the portable scanner 500 may have a predetermined threshold that indicates a particular scan that uniquely identifies the document. When meeting or exceeding the threshold, the portable scanner 500 may indicate via the user interface 560 that the user has scanned enough information to identify the document. These predetermined thresholds can be determined based on heuristics (ie, rule of thumb), statistical analysis, or other appropriate method.

일부 실시예에서, 임계값을 결정하기 위해 시스템에 의해 사용된 하나의 발견방법은 문어적 표현의 고유한 문자의 관찰에 기초한다. 대부분의 문서는 4에서 10단어 사이(영어에서는 대략 20-50 문자 또는 심볼)의 스캔으로 매우 많은 집대성된 자료- 예를 들면, 1 백만개의 문서를 포함하는 자료-내에서 고유하게 식별될 수 있다. 이러한 발견방법은 그것이 테스트된 모든 언어를 수용한다. 4-10 단어의 범위에서의 스캔이 복제 문서를 만드는 경우, 유저는 그 결과를 보다 좁히기 위해 추가적인 단어들을 스캔하도록 프롬프트될 수 있다.In some embodiments, one discovery method used by the system to determine the threshold is based on the observation of the unique characters of the written expression. Most documents can be uniquely identified within a very large collection of data-for example, a document containing 1 million documents-with a scan between 4 and 10 words (approximately 20-50 characters or symbols in English). . This discovery method accepts all languages tested. If a scan in the range of 4-10 words creates a duplicate document, the user may be prompted to scan additional words to further narrow the result.

휴대가능한 디바이스(500)는 유저에게 유저 인터페이스(560)의 시각, 오디오, 또는 촉각 능력에 의해 충분한 정보가 스캔되었음을 가리킬 수 있다. 스캔된 정보가 미리정해진 임계값에 일치하거나 초과하는 지를 판정한 때, DSP(575)는, 유저에게 정보가 스캔된 문서를 식별하기에 충분한 정보가 스캔되었음을 통신하도록 유저 인터페이스(560)에 지시한다.The portable device 500 may indicate to the user that sufficient information has been scanned by the visual, audio, or tactile capabilities of the user interface 560. Upon determining whether the scanned information meets or exceeds a predetermined threshold, the DSP 575 instructs the user interface 560 to communicate to the user that enough information has been scanned to identify the document from which the information was scanned. .

타임스탬프와 위치스탬프도 문서를 식별하기위해 사용될 수 있다. 예를 들면, AP 통신사의 기사가 많은 신문에 게재될 수 있지만, 정확한 신문이 위치 스탬프에 의해 판정될 수 있다. 위치스탬프가 스캔이 시애틀에서 실시되었음을 가리킨다면, 시애틀 신문이 스캔된 AP 기사의 소스일 가능성이 가장 높다. 유사하게 일부 실시예에서, 시스템은 타임스탬프 이전에 간행된 문서에 대한 후보자 문서의 범위를 줄이는 데에 타임스탬프를 사용한다.Time stamps and location stamps can also be used to identify documents. For example, an article of an AP news agency may be published in many newspapers, but an accurate newspaper may be determined by a location stamp. If the location stamp indicates that the scan was conducted in Seattle, then the Seattle newspaper is most likely the source of the scanned AP article. Similarly, in some embodiments, the system uses timestamps to reduce the range of candidate documents for documents published prior to timestamps.

유저에게 문서 또는 문서 그룹내의 위치를 식별하기에 충분한 정보가 스캔되었음을 가리킴.Indicates that the user has scanned enough information to identify the document or its location within the document group.

일부실시예에서, 휴대가능한 스캐너(500)는 유저에게 문서 또는 문서 그룹내의 위치를 식별하기에 충분한 정보가 스캔되었음을 가리킬 수 있다. 예를 들면, 휴대가능한 스캐너(500)는 문서 또는 문서 그룹 내의 위치를 고유하게 식별하는 특정한 스캔을 가리키는 임계값을 가질 수 있다. 임계값에 일치하거나, 초과할 때, 휴대가능한 스캐너(500)는 유저 인터페이스(560)를 통해 유저에게 문서 또는 문서 그룹내의 위치를 식별하기에 충분한 정보가 스캔되었음을 가리킬 수 있다. 이러한 임계값들은 발견법(즉, 경험적 방법), 통계분석, 특정한 문서 또는 문서 그룹에 관한 정보(예를 들면 인덱스), 또는 기타 적절한 방법에 기초하여 결정될 수 있다.In some embodiments, the portable scanner 500 may indicate to the user that enough information has been scanned to identify the document or location within the document group. For example, the portable scanner 500 may have a threshold that indicates a particular scan that uniquely identifies a location within a document or group of documents. When matching or exceeding the threshold, the portable scanner 500 may indicate via the user interface 560 that the user has scanned enough information to identify a document or location within a group of documents. These thresholds can be determined based on heuristics (ie, empirical methods), statistical analysis, information about a particular document or group of documents (eg, indexes), or other appropriate method.

상기 설비에 의해 사용되는 위치를 판정하는 한가지 접근방식은 문서 또는 문서 그룹을 나타내는 인덱스를 참고하는 것을 포함하고, 캡쳐가 상기 인덱스 내에서 고유한 것이 아니라면, 현재 캡쳐의 위치에 관한 추론을 수정하고, 개량하기 위해 추가적인 컨텍스트 정보(예를 들면, 최종 캡쳐 위치, 최종 캡쳐이후의 경과된 시간 등)를 이용하는 것이다.One approach to determining the location used by the facility includes referencing an index representing a document or group of documents, modifying the inferences regarding the location of the current capture if the capture is not unique within the index, Additional contextual information (eg, last capture location, elapsed time since last capture, etc.) is used to refine.

일부 실시예에서, 시스템의 위치 결정은 확률적인 것이다. 특정한 캡쳐가 문서 또는 문서 그룹에서의 여러 위치에서 일치하는 경우, 시스템은 유저의 가장 최근의 캡쳐에 근접한 위치에 보다 높은 확률을 부여할 수 있다. 임계값은 캡쳐위치가 알려진 것에 연관된다. 이러한 임계값은 잠재적인 매칭 위치와 연관될 가능성을 포함할 수 있다. 예를 들면, 일부 실시예에서, 하나의 위치가 적어도 80%의 유저의 위치일 확률을 가진다면, 시스템은 그 위치를 선택한다.In some embodiments, positioning of the system is probabilistic. If a particular capture matches at multiple locations in a document or group of documents, the system can give a higher probability to the location closest to the user's most recent capture. The threshold is associated with what the capture position is known. Such a threshold may include the possibility of being associated with a potential matching location. For example, in some embodiments, if one location has a probability of at least 80% of the user's location, the system selects that location.

휴대가능한 디바이스(500)는 유저에게 유저 인터페이스(560)의 시각, 오디오, 또는 촉각 능력을 통해 알려진 위치를 가리킬 수 있다. 스캔된 정보가 미리정해진 임계값과 일치하거나 초과하는 지를 판정한 때, DSP(575)는, 유저에게 정보가 스캔된 위치를 식별하기에 충분한 정보가 스캔되었음을 통신하도록 유저 인터페이스(560)에 지시한다.The portable device 500 may point a user to a known location through the visual, audio, or tactile capabilities of the user interface 560. Upon determining whether the scanned information matches or exceeds a predetermined threshold, the DSP 575 instructs the user interface 560 to communicate to the user that enough information has been scanned to identify the location where the information was scanned. .

스캔된 이미지 및/또는 제스쳐를 통한 스캐너 동작 제어Control scanner behavior via scanned image and / or gesture

일부 실시예에서, 휴대가능한 스캐너(500)는 유저 인터페이스(560)를 통해 유저 입력에 의해 제어된다. 예를 들면, 유저 인터페이스(560)가 유저에게 메뉴가 표시될 수 있는 디스플레이를 포함하는 경우, 유저는 휴대가능한 스캐너(500)의 동작을 제어하기 위해 메뉴 선택을 선택할 수 있다.In some embodiments, the portable scanner 500 is controlled by user input via the user interface 560. For example, if the user interface 560 includes a display from which a menu can be presented to the user, the user can select a menu selection to control the operation of the portable scanner 500.

일부 실시예에서, 휴대가능한 스캐너(500)는 유저에 의해 실시되는 제스쳐에 의해 제어된다. 예를 들면, 전방으로의 텍스트를 스캐닝하는 것은 유저가 메모리에 텍스트를 저장하기를 원한다는 것을 가리킬 수 있다. 동일한 텍스트를 반대방향으로 스캐닝하는 것은 유저가 텍스트를 메모리에서 삭제하기를 원한다는 것을 가리킬 수 있다. 문서의 텍스트 상에서 앞뒤로 러빙하는 것은 유저가 상기 문서의 전자 사본에서의 텍스트를 하이라이팅하기를 원한다는 것을 가리킬 수 있다. 시스템은 원 형상동작, 흔드는 동작 등과 같은 많은 제스쳐들이 휴대가능한 디바이스(500)의 동작을 제어하는 데에 사용될 수 있도록 한다. 가능한 동작들은 스캔 프로세스를 개시하고, 유저가 특정한 기사 또는 문서로부터(그리고, 그 결과 후속하는 스캔된 데이터가 새로운 기사 또는 문서로부터 나올것이라는) 스캐닝했다는 것을 신호하는 것 등을 포함한다.In some embodiments, portable scanner 500 is controlled by a gesture performed by a user. For example, scanning the text forward may indicate that the user wants to store the text in memory. Scanning the same text in the opposite direction may indicate that the user wants to delete the text from memory. Rubbing back and forth on the text of the document may indicate that the user wants to highlight the text in the electronic copy of the document. The system allows many gestures, such as circular motion, rocking motion, and the like, to be used to control the operation of the portable device 500. Possible actions include initiating a scanning process, signaling that the user has scanned from a particular article or document (and, as a result, subsequent scanned data will come from a new article or document), and the like.

일부 실시예에서, 휴대가능한 스캐너(500)는 속도 또는 방향 변화를 감지하여, 그에 의해 제어 제스쳐를 판정하기 위해 가속도계와 같은 가속 센서(간략화를 위해 도 5에 도시되지 않음)를 포함한다.In some embodiments, the portable scanner 500 includes an acceleration sensor (not shown in FIG. 5 for simplicity), such as an accelerometer, to sense a speed or direction change and thereby determine a control gesture.

휴대가능한 스캐너(500)는 또한 이미지 캡쳐 메커니즘을 통해 DSP(575)로의 입력된 제어 코맨드들에 응답하기 위해 프로그래밍될 수도 있다.(도 5에 도시된 실시예에서, 이미지 캡쳐 메커니즘은 광원(505), 렌즈(510), CCD 어레이(515), 및 A/D 컨버터(520)를 포함한다.) 이들 코맨드들은 생체 정보(지문 등), 또는 통상의 텍스트를 스캐닝한 패턴(상술한 데이터 저장을 제어하기위해 역방향으로 텍스트를 스캐닝하는 것과 같은) 등의, 스캐너에 의해 인식된 특별한 심볼들이 될 수 있다. 예를 들면, 카탈로그와 같은 문서는 휴대가능한 디바이스(500)에 대해 특별한 으미를 가지는 명령어 심볼의 메뉴를 포함할 수 있다. 제어 프로그램을 실행시키기 위해, 유저는 상기 심볼 중에 하나를 스캔한다. 응답하여, DSP(575)는 특별한 제어 신호에 연관된 제어 프로그램에 액세스하고 그를 실행한다. 카탈로그의 예에서, 특별한 심볼 중에 하나는 스캐너를 통해 카탈로그로부터 제품을 주문하는 데에 사용될 수 있는 구매 프로그램을 초기화할 수 있다. 유저는 주문된 제품에 관한 정보를 스캔하고, 휴대가능한 스캐너는 인터넷과 통신 인터페이스(550) 사이의 연결을 통해 상기 제품과 판매를 완료하기 위해 필요한 다른 정보(빌링 및 배송 정보 등)를 카탈로그 벤더에게 통신한다.The portable scanner 500 may also be programmed to respond to control commands input to the DSP 575 via the image capture mechanism. (In the embodiment shown in FIG. 5, the image capture mechanism is a light source 505. , Lenses 510, CCD array 515, and A / D converter 520.) These commands control biometric information (fingerprints, etc.), or patterns scanned normal text (data storage described above). May be special symbols recognized by the scanner, such as scanning text in the reverse direction. For example, a document such as a catalog may include a menu of command symbols with a special trail for portable device 500. To execute the control program, the user scans one of the symbols. In response, DSP 575 accesses and executes the control program associated with the particular control signal. In the example of a catalog, one of the special symbols can initiate a purchasing program that can be used to order products from the catalog via a scanner. The user scans information about the ordered product, and the portable scanner provides the catalog vendor with other information (such as billing and shipping information) needed to complete the product and sale via a connection between the Internet and the communication interface 550. Communicate

빌링/가입/디바이스 식별자를 가진 스캐너Scanner with billing / subscription / device identifier

휴대가능한 디바이스(500)는 빌링, 가입, 및/또는 디바이스 식별자에 관한 정보를 저장하기 위한 메모리(580)를 포함할 수 있다. 이러한 메모리(580)는 가입자 식별 모듈(SIM) 또는 스마트 카드에서와 같이 착탈가능하거나, 또는 프로그래밍 가능 읽기용 메모리(PROM)과 같이 착탈불가능할 수 있다. 문서의 전자 사본이 스캔된 데이터에 기초하여 위치되는 경우, 가입 정보는 유저가 전자 사본에 액세스하 도록 인가되어야 하는지 여부를 검증하는 데에 사용될 수 있다. 예를 들면, 신문은 자신의 온라인 버전에 대한 액세스를 위해 추가 요금을 지불할 수 있다. 유저의 가입 정보는 유저가 온라인 버전에 가입했는 지를 가리키는 계정 번호를 포함할 수 있다.The portable device 500 may include a memory 580 for storing information regarding billing, subscriptions, and / or device identifiers. This memory 580 may be removable, such as in a subscriber identity module (SIM) or smart card, or may be removable, such as a programmable read memory (PROM). If an electronic copy of the document is located based on the scanned data, the subscription information can be used to verify whether the user should be authorized to access the electronic copy. For example, a newspaper may pay an additional fee for access to its online version. The user's subscription information may include an account number indicating whether the user has subscribed to the online version.

유사하게, 빌링 정보는 휴대가능한 스캐너(500)로 구매를 수행하는 데에 사용될 수 있다. 일부 실시예에서, 메모리(580)는 유저의 신용 카드 또는 다른 금융 정보를 포함한다. 예를 들면, 유저가 문서로부터 텍스트를 스캔하고, 자신이 상기 문서의 전자 사본에 대한 액세스를 구매하기를 원한다는 것을 나타낼 때(상술한 유저 인터페이스(560) 또는 제스쳐 제어를 통해), 빌링 정보는 저작권 소유자 또는 컨텐트 공급자에 결제를 제공하기 위해 사용될 수 있다.Similarly, billing information can be used to make a purchase with the portable scanner 500. In some embodiments, memory 580 includes a user's credit card or other financial information. For example, when a user scans text from a document and indicates that he or she wishes to purchase access to an electronic copy of the document (via user interface 560 or gesture control described above), the billing information is copyrighted. Can be used to provide a payment to the owner or content provider.

휴대가능한 디바이스(500)는 메모리(580)에 시리얼 번호와 같은 디바이스 식별자를 저장할 수 있다. 이러한 디바이스 식별자들은 휴대가능한 디바이스(500)를 고유하게 식별하는 역할을 하고, 그것들이 소거될 수 없도록 일반적으로 PROM에 저장된다. 거래를 위한 추가적인 보안은 휴대가능한 디바이스의 시리얼 번호를 네트워크 데이터베이스에서의 유저 계정 또는 가입에 연관시킴으로써 휴대가능한 디바이스를 하나의 유저에게만 관련시키는 것에 의해 습득될 수 있다. 일부 실시예에서, 추가적인 보안은 스캐너를 스마트 카드에 대해 락을 걸기 위해 스마트 카드에 디바이스 식별자를 저장함으로써(또는, 스마트 카드 식별자를 휴대가능한 스캐너(500)에 저장함으로써) 달성된다. 이러한 실시예에서, DSP(575)는 휴대가능한 스캐너(500)가 작동하기 전에 정확한 스마트 카드가 삽입되었는지를 검증한다.The portable device 500 may store a device identifier, such as a serial number, in the memory 580. These device identifiers serve to uniquely identify the portable device 500 and are generally stored in a PROM such that they cannot be erased. Additional security for the transaction can be learned by associating the portable device to only one user by associating the serial number of the portable device to a user account or subscription in the network database. In some embodiments, additional security is achieved by storing the device identifier on the smart card (or by storing the smart card identifier in the portable scanner 500) to lock the scanner to the smart card. In this embodiment, the DSP 575 verifies that the correct smart card has been inserted before the portable scanner 500 operates.

등가 위치 기술Equivalent location technology

온-보드 GPS 수신기를 참조하여 위치 모듈(545)이 우선해서 논의되었지만, 많은 다른 위치 기술들이 사용될 수 있다. 이러한 기술들은 개선된 시간차 측위(Enhanced Observed Time Difference)(EOTD), GPS 지원 측위(Assisted GPS)(A-GPS), 차분 GPS(Differential GPS)(DGPS), 도착 시간 차이(Time Difference of Arrival)(TDOA), 도착각도, 국부 송수신기 파일럿 신호의 트라이앵귤레이션 및 모니터링을 포함한다. EOTD, TDOA, 및 도착각도는, 네트워크에서의 로직을 휴대가능한 디바이스의 위치를 측정하기 위해 각 베이스 스테이션에서 수신된 신호에 관한 데이터에 관련시키도록 상기 휴대가능한 디바이스가 네트워크된 베이스 스테이션으로 신호를 전송시킬 때 가장 적합한 것이다. 트라이앵귤레이션은 내부 또는 외부 중에 어느 하나가 될 수 있다. 휴대가능한 디바이스는 그것이 적어도 3개의 외부 송신기(IEEE 802.11 베이스 스테이션 등의)로부터 신호를 수신할 때 내부 트라이앵귤레이션을 수행하고, 상기 수신된 신호의 특성에 기초하여 대략적인 위치를 연산한다. 외부 트라이앵귤레이션은 휴대가능한 디바이스 외부의 네트워크된 수신기가 상기 휴대가능한 디바이스로부터 수신된 신호의 특성에 기초한 상기 휴대가능한 디바이스의 위치를 추정하기 위해 사용될 때 발생한다. 일부 실시예에서, 시스템은 외부 수신기로부터 휴대가능한 디바이스의 거리를 측정하기 위해 하나 이상의 외부 수신기에서의 수신된 신호의 강도를 이용한다. 모바일 수신기가 원하는 송신기로부터의 신호에 "락온" 시킬 수 있도록, 고정된 송신기가 자주 특정한 송신기를 식 별하는 파일럿 신호를 브로드캐스팅한다. 고정된 송신기의 위치와 대략적인 커버리지 영역이 알려지는 경우, 휴대가능한 디바이스의 위치는 송신기가 그것을 "분간하는" 것에 기초하여 추정될 수 있다. 예를 들면, 휴대가능한 디바이스가 IEEE 802.11 무선 액세스 포인트로부터 신호를 수신한다면, 휴대가능한 디바이스가 무선 액세스 포인트의 300피트 이내(현재 IEEE 802.11g 송신기의 대략적인 옥외 범위)에 있다는 것을 추정할 수 있다.Although the location module 545 has been discussed first with reference to an on-board GPS receiver, many other location techniques can be used. These technologies include Enhanced Observed Time Difference (EOTD), Assisted GPS (A-GPS), Differential GPS (DGPS), and Time Difference of Arrival ( TDOA), arrival angle, triangulation and monitoring of the local transceiver pilot signal. EOTD, TDOA, and Arrival Angles transmit signals to the networked base station where the portable device associates logic in the network with data about signals received at each base station to measure the location of the portable device. It is the most suitable for the Triangulation can be either internal or external. The portable device performs internal triangulation when it receives signals from at least three external transmitters (such as an IEEE 802.11 base station) and calculates an approximate location based on the characteristics of the received signal. External triangulation occurs when a networked receiver external to the portable device is used to estimate the location of the portable device based on the characteristics of the signal received from the portable device. In some embodiments, the system uses the strength of the received signal at one or more external receivers to measure the distance of the portable device from the external receiver. A fixed transmitter often broadcasts a pilot signal that identifies the particular transmitter so that the mobile receiver can "lock on" the signal from the desired transmitter. If the location of the fixed transmitter and the approximate coverage area are known, the location of the portable device can be estimated based on the transmitter "dividing" it. For example, if the portable device receives a signal from an IEEE 802.11 wireless access point, one can assume that the portable device is within 300 feet of the wireless access point (approximately the outdoor range of the current IEEE 802.11g transmitter).

핸드헬드 문서 데이터 캡쳐 디바이스Handheld Document Data Capture Device

여러 실시예에서 시스템과 사용되는 휴대가능 데이터 캡쳐 디바이스가 본문 전체에 걸쳐 다양한 관점에서 기술된다. 이러한 점에서 진행시켜, 다양한 타입의 휴대가능한 캡쳐 디바이스의 능력과 기능을 상세히 설명하기 위해 추가적인 논의가 제공된다.In various embodiments, a portable data capture device for use with the system is described in various respects throughout the text. Proceeding in this regard, further discussion is provided to detail the capabilities and functions of various types of portable capture devices.

일부 실시예에서, 데이터 캡쳐 능력을 가진 휴대가능한 디바이스가, 유저가 문서를 고유하게 식별하기 위해 충분한 텍스트 또는 기타 정보를 캡쳐했음을 유저에게 가리킬 수 있다. 휴대가능한 디바이스는 스캔된 정보의 양을 충분한 정보가 스캔되었는지를 판정하기 위해 미리정해진 임계 레벨과 비교할 수 있다(이 임계 방법은 상기 스캐너가 컴퓨터와 통신하지 않을 때 특히 유용하다.). 휴대가능한 디바이스가 원격 컴퓨터와 통신할 때, 상기 원격 컴퓨터는 텍스트가 스캔된 문서를 상기 컴퓨터가 식별했다는 것을 가리키는 메시지를 상기 디바이스에 전송할 수 있다. 상기 휴대가능 디바이스는 이미지를 획득하기 위한 이미지 캡쳐 디바이스, 상 기 이미지를 처리하기 위한 프로세서, 데이터를 저장하기 위한 메모리, 및/또는 로직(소프트웨어 프로그램), 다른 디바이스와 통신하기 위한 입/출력 통신 인터페이스, 전원, 스캔되는 정보에 조광하기 위한 조광원 및 위치 모듈을 포함할 수 있다.In some embodiments, a portable device with data capture capability may indicate to the user that the user has captured enough text or other information to uniquely identify the document. The portable device can compare the amount of information scanned with a predetermined threshold level to determine if enough information has been scanned (this threshold method is particularly useful when the scanner is not in communication with a computer). When the portable device communicates with a remote computer, the remote computer can send a message to the device indicating that the computer has identified a document whose text has been scanned. The portable device may be an image capture device for acquiring an image, a processor for processing the image, a memory for storing data, and / or logic (software program), an input / output communication interface for communicating with another device. And a light source and a location module for dimming the power, the scanned information.

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스에 의해 캡쳐된 텍스트 또는 심볼은 스캐너가 소프트웨어 프로그램을 실행시키거나 또는 특정한 미리 정해진 동작(메모리로부터 데이터를 소거, 턴온/오프, 금융 거래를 시작하고 및/또는 완료하는 등)을 실시하도록 상기 디바이스의 제어 로직 또는 제어 소프트웨어에 의한 제어 코맨드로서 사용되고 번역될 수 있다.In some embodiments, the text or symbols captured by the portable data capture device may cause the scanner to execute a software program or to execute certain predetermined operations (erase data on memory, turn on / off, initiate financial transactions, and / or Can be used and translated as control commands by the control logic or control software of the device.

일부 실시예에서, 페이퍼 문서로부터 데이터를 캡쳐한 후에, 휴대가능한 데이터 캡쳐 디바이스는 유저에게 상기 페이퍼 문서의 하나 이상의 전자 사본이 인식되거나 위치되었음을 가리킨다. 휴대가능한 디바이스가 원격 컴퓨터와 통신할 때, 원격 컴퓨터는 텍스트가 스캔된 문서의 전자 사본을 상기 컴퓨터가 배치시켰음을 가리키는 메시지를 상기 휴대가능한 디바이스로 전송할 수 있다. 상기 메시지 수신에 응답하여, 상기 휴대가능한 스캐너는 유저에게 상기 전자 사본이 배치되고, 유저가 스캐닝을 정지시킬 수 있음을 가리킨다. 많은 가능한 것들 중에, 상기 지시자는 시각(예를 들면, LED, 디스플레이 등), 청각(예를 들면, 스피커, 비퍼 등) 또는 촉각(터치 감지를 자극함)이 될 수 있다.In some embodiments, after capturing data from the paper document, the portable data capture device indicates to the user that one or more electronic copies of the paper document have been recognized or located. When the portable device communicates with the remote computer, the remote computer may send a message to the portable device indicating that the computer has placed an electronic copy of the document whose text has been scanned. In response to receiving the message, the portable scanner indicates to the user that the electronic copy is placed and that the user can stop scanning. Among many possible ones, the indicator can be visual (eg, LEDs, displays, etc.), hearing (eg, speakers, beepers, etc.) or tactile (stimulating touch sensing).

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 위치 및/또는 시간 판정 능력을 가지고, 어디서 및/또는 언제 데이터 캡쳐가 발생하는지에 관한 위치 및/또는 시간 정보를 상기 캡쳐된 데이터와 함께 저장할 수 있다. 상기 시간 정보는 특정한 데이터 캡쳐 이벤트에 연관된 타임 스탬프가 될 수 있다. 위치 정보는 특정한 데이터 캡쳐 이벤트에 연관된 위치 스탬프가 될 수 있다.In some embodiments, the portable data capture device has location and / or time determination capability and may store location and / or time information with the captured data regarding where and / or when data capture occurs. The time information can be a time stamp associated with a particular data capture event. The location information can be a location stamp associated with a particular data capture event.

일부 실시예에서, 휴대가능한 스캐너와 같은 휴대가능한 데이터 캡쳐 디바이스는 속도, 반복, 방향 등과 같은 스캔의 특성에 의해 제어된다. 추가로, 스캐너의 제어 프로그램 또는 로직은 상표 심볼과 같은 특별한 심볼에 반응할 수 있다. 이러한 특별한 심볼들은 휴대가능한 디바이스에 의해 실시될 특정 동작 또는 실행될 프로그램에 연관될 수 있다.In some embodiments, a portable data capture device, such as a portable scanner, is controlled by the nature of the scan, such as speed, repetition, direction, and the like. In addition, the control program or logic of the scanner may respond to special symbols such as trademark symbols. These special symbols may be associated with a particular operation to be performed by the portable device or the program to be executed.

일부 실시예에서, 스캐너와 같은 휴대가능한 데이터 캡쳐 디바이스는 빌링/가입/디바이스 식별자 정보를 메모리에 저장 시킨다. 가입 정보는 예를 들면 선지급된 계좌와 같은 식별된 문서의 전자 사본에 액세스할 유저의 권한을 검증하는 데에 사용될 수 있다. 빌링 정보는 식별된 문서의 전자 사본에 대한 액세스를 위해 지급하는 데에 사용될 수 있다. 디바이스 식별자는 유저의 신원을 검증하는 것을 보조하는 보안 특징으로서 사용될 수 있다.In some embodiments, a portable data capture device, such as a scanner, stores billing / subscription / device identifier information in memory. Subscription information may be used to verify a user's authority to access an electronic copy of an identified document, such as, for example, a prepaid account. Billing information may be used to pay for access to an electronic copy of the identified document. The device identifier can be used as a security feature to assist in verifying the identity of the user.

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 잉크 펜 및/또는 물리적 하이라이터와 조합된다. 이러한 조합은 유저가 동시에 페이퍼 문서와 전자 문서 상에 주석을 달고 하이라이팅할 수 있도록 한다. 추가로, 일부 실시예는 페이퍼 문서에 대한 디지털 서명을 추가하기 위해 잉크젯 프린터와 같은 인쇄 메커니즘에 통합된다.In some embodiments, the portable data capture device is combined with an ink pen and / or a physical highlighter. This combination allows users to annotate and highlight on paper and electronic documents at the same time. In addition, some embodiments are integrated into a printing mechanism, such as an inkjet printer, to add a digital signature to the paper document.

입/출력Input / output

휴대가능한 데이터 캡쳐 디바이스는 정보 및 명령어를 입력 및 출력하기 위한 다양한 수단을 포함한다. 유저, 통신 서비스 공급자, 원격 네트워크 디바이스, 및 캡쳐된 정보는 정보 및 명령어(동작 명령어와 같은)의 잠재적인 소스 중 일부이다.The portable data capture device includes various means for inputting and outputting information and instructions. The user, communication service provider, remote network device, and the captured information are some of the potential sources of information and instructions (such as operational instructions).

유저 인터페이스User interface

일부 실시예에서, 유저 인터페이스(UI)는 유저가 휴대가능한 데이터 캡쳐 디바이스와 상호작용하는 주된 수단이다. 정보 및 제어 명령어가 유저-인터페이스를 통해 휴대가능한 데이터 캡쳐 디바이스로 들어간다. 유저는 유저-인터페이스를 통해 휴대가능한 데이터 캡쳐 디바이스와 상호작용한다. 유저는 UI를 통해 휴대가능한 데이터 캡쳐 디바이스로 제어 코맨드 및 정보를 제공한다. 유사한 방식으로, 유저는 UI를 통해 휴대가능한 데이터 캡쳐 디바이스로부터 정보를 수신한다. 예를 들면, 유저는 디바이스 상의 키패드를 통해 텍스트를 입력하고, 디바이스의 디스플레이 상에서 키패드 기입의 시각적 확인을 수신한다.In some embodiments, the user interface (UI) is the primary means by which the user interacts with the portable data capture device. Information and control instructions enter the portable data capture device via the user-interface. The user interacts with a portable data capture device through a user-interface. The user provides control commands and information to the portable data capture device via the UI. In a similar manner, a user receives information from a portable data capture device via a UI. For example, a user enters text through a keypad on the device and receives a visual confirmation of keypad entry on the device's display.

입력input

데이터 입력을 위한 UI 수단은: 청각, 촉각, 제스쳐, 및 광학의 4 가지의 넓은 카테고리로 기술될 수 있다. 다양한 실시에에서, 휴대가능한 데이터 캡쳐 디바이스는 이러한 카테고리의 일부 또는 모두로부터 다양한 UI 수단의 조합을 갖는다.UI means for data entry can be described in four broad categories: auditory, tactile, gesture, and optical. In various embodiments, the portable data capture device has a combination of various UI means from some or all of these categories.

청각ear

청각 UI는 스피치와 같은 음성 신호를 휴대가능한 데이터 캡쳐 디바이스에 입력하기 위한 수단으로 구성된다. 음성 신호를 전기 에너지로 변환하는 것은 마이크로폰과 같은 오디오-전기 변환기를 필요로 한다. 휴대가능한 디바이스는 디지털화된 파형으로 저장하고, 전송하거나, 또는 텍스트로 변환하고 디지털화된 텍스트로 저장하는 것을 포함하는 음성 신호에 대한 다수의 동작을 수행할 수 있다.The auditory UI consists of means for inputting a voice signal, such as speech, into a portable data capture device. Converting voice signals into electrical energy requires an audio-to-electric converter such as a microphone. The portable device may perform a number of operations on voice signals including storing, transmitting as digitized waveforms, or converting to text and storing as digitized text.

마이크로폰microphone

일부 실시예에서, 휴대가능한 디바이스는 스피치를 캡쳐하기 위한 마이크로폰을 포함한다. 이러한 특징은 음성 주석을 문서에 집어넣고, 메시지를 기록하고, 다른 유저와 대화하는 데에 유용하다(예를 들면, 데이터 캡쳐 디바이스가 모바일 폰 기능을 가진다면).In some embodiments, the portable device includes a microphone for capturing speech. This feature is useful for embedding voice annotations in documents, recording messages, and communicating with other users (eg, if the data capture device has mobile phone capabilities).

촉각(접촉)Tactile (contact)

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 기계적 또는 촉각(접촉) 입력을 수용한다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 유저가 데이터 캡쳐 프로세스를 시작하기 위해 누를 수 있는 스위치를 포함한다. 팁스위치를 가진 실시예에서, 유저는 스캐닝 프로세스를 시작하기 위해 페이퍼에 대해 디바이스를 누른다. 다른 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝 타겟에 근접한 곳을 검출하기 위해 센서를 채용한다.In some embodiments, the portable data capture device accepts a mechanical or tactile (contact) input. In some embodiments, the portable data capture device includes a switch that the user can press to start the data capture process. In an embodiment with a tip switch, the user presses the device against the paper to begin the scanning process. In another embodiment, the portable data capture device employs a sensor to detect where proximity to the scanning target.

조그(엄지) 휠Jog wheel

컴퓨터 마우스 상의 휠과 유사한 조그 휠은 컴퓨터 전자기기 또는 메뉴와 상호작용하기에 유용하다. 예를 들면, 일부 실시예에서, 시스템은 스캐너 유저에게 인접한 디스플레이 상에 메뉴 선택을 제시한다. 문서로부터 휴대가능한 스캐너를 들어올리고 메뉴 아이템 중에 하나를 스캔하는 것이 아니라, 유저는 메뉴 선택을 스크롤 다운되도록 엄지 휠을 움직일 수 있다.Jog wheels, similar to the wheels on a computer mouse, are useful for interacting with computer electronics or menus. For example, in some embodiments, the system presents the menu selection on the display adjacent to the scanner user. Rather than lifting a portable scanner from a document and scanning one of the menu items, the user can move the thumb wheel to scroll down the menu selections.

키패드Keypad

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 데이터를 디바이스에 입력시키기 위한 키패드 및/또는 버튼을 갖는다. 일부 실시예에서, 캡쳐 디바이스는 선택을 되돌리거나 소거하기 위한 취소 버튼과 선택을 확인하기 위한 (예를 들면 구매 확인) 확인 버튼을 갖는다. 일부 실시예에서, 캡쳐 디바이스는 컨텍스트 스위치를 가리키거나 컨텍스트를 설정하기를 원하는 버튼을 가진다. 예를 들면, 제 1 문서에서 텍스트를 스캐닝 한 후에, 유저는 컨텍스트 버튼을 누름으로써 자신이 제 1 문서로부터 스캐닝을 하였고 그 다음에 제 2 문서로부터 텍스트를 스캔할 것이라는 것을 가리킬 수 있고, 컨텍스트 버튼을 누름으로써 유저는 시스템에게 자신의 컨텍스트 스캐닝이 변하였음을 알린다.In some embodiments, the portable data capture device has a keypad and / or a button for entering data into the device. In some embodiments, the capture device has a cancel button to reverse or erase the selection and a confirmation button to confirm the selection (eg purchase confirmation). In some embodiments, the capture device has a button that points to the context switch or wants to set the context. For example, after scanning text in the first document, the user can indicate that he has scanned from the first document and then will scan the text from the second document by pressing the context button, then press the context button. By pressing, the user informs the system that his context scanning has changed.

팁 스위치/근접 센서Tip switch / proximity sensor

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 팁 스위치 또는 근접 센서를 갖는다. 펜-형상 스캐너에서, 팁 스위치는 상기 스캐너의 팁이 문서에 대해 눌려질 때 활성화되는 스위치이다. 상기 팁 스위치는, 유저가 페이퍼에 대해 스캐너를 얼마나 세게 누르느냐에 따라 스캐너가 자신의 행동을 수정할 수 있도록, 압력 감지 능력을 포함할 수 있다. 예를 들면, 스캐너는 (컴퓨터 및 단어 처리 소프트웨어와 조합하여) 그것이 페이퍼에 대해 심하게 눌려지면 하이라이팅 기능을 실시할 것이다. 다른 예로서, 스캐너는 스캐너 팁을 페이퍼에 대해 누름으로써 스위치를 온/오프하고 팁 스위치를 활성화시킬 수 있다.In some embodiments, the portable data capture device has a tip switch or proximity sensor. In a pen-shaped scanner, the tip switch is a switch that is activated when the tip of the scanner is pressed against the document. The tip switch may include pressure sensing capabilities such that the scanner can modify its behavior depending on how hard the user presses the scanner against the paper. For example, the scanner will perform a highlighting function (in combination with computer and word processing software) if it is pressed hard against the paper. As another example, the scanner may turn the switch on / off and activate the tip switch by pressing the scanner tip against the paper.

미세(granule)/표면 텍스쳐 센서Granule / surface texture sensor

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 문서 상의 표면의 변형을 검출하기 위해 2 개의 병렬 미세/표면 텍스쳐 센서를 포함한다. 이러한 종류의 입력 센서의 일반적인 사용은 점자 텍스트를 캡쳐하는 것이다. 병렬 표면 텍스쳐 센서는 상기 병렬 센서에 의해 이동된 상대적인 속도/거리로부터 센서의 각도를 판정할 수 있다. 비접촉 광학 센서는 휴대가능한 데이터 캡쳐 디바이스의 실시예의 사용에 잘 들어맞는 일반적인 타입의 표면 텍스쳐 센서이다.In some embodiments, the portable data capture device includes two parallel micro / surface texture sensors to detect deformation of the surface on the document. A common use of this kind of input sensor is to capture braille text. The parallel surface texture sensor can determine the angle of the sensor from the relative speed / distance moved by the parallel sensor. Non-contact optical sensors are a general type of surface texture sensor that fits well with the use of embodiments of portable data capture devices.

제스쳐(모션 기반)Gesture (motion based)

유저는 그것을 가지고 제스쳐를 취함으로서 휴대가능한 데이터 캡쳐 디바이스에 데이터와 코맨드를 입력할 수 있다. 상기 디바이스는 스캐닝 헤드하에서 통과하는 데이터를 관찰함으로써, 모션 센서에서의 변화를 모니터링함으로써, 또는 기계적 모션-감지 수단에 의해 제스쳐를 검출할 수 있다.The user can enter data and commands into the portable data capture device by making a gesture with it. The device can detect the gesture by observing the data passing under the scanning head, by monitoring the change in the motion sensor, or by mechanical motion-sensing means.

광학 데이터의 관찰Observation of Optical Data

광학 엘리먼트 하에서 지나가는 표면 또는 데이터를 관찰함으로써, 휴대가능한 데이터 캡쳐 디바이스는 광학 컴퓨터 마우스가 하는 것과 매우 동일한 방식으로 상대적인 모션을 연산할 수 있다. 디바이스의 상대적인 동작을 분석함으로써, 상기 디바이스는 유저가 상기 디바이스로 어떠한 제스쳐를 하는 지를 판정할 수 있다. 일부 실시예에서, 광학 시스템은 또한 상기 렌더링된 문서의 표면 상의 패턴을 관찰함으로써 상대적인 모션을 검출할 수 있다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 상기 렌더링된 문서의 표면상의 절대위치 코드를 체크함으로써 절대 위치를 검출할 수 있다.By observing the surface or data passing under the optical element, the portable data capture device can calculate relative motion in much the same way that an optical computer mouse does. By analyzing the relative behavior of the device, the device can determine what user gestures to the device. In some embodiments, the optical system can also detect relative motion by observing a pattern on the surface of the rendered document. In some embodiments, the portable data capture device can detect the absolute position by checking the absolute position code on the surface of the rendered document.

자이로/가속도계 모션 센서Gyro / Accelerometer Motion Sensor

일부 실시예에서, 휴대가능한 캡쳐 디바이스는 속도나 방향의 변화를 감지하기 위해 가속도계와 같은 가속도 센서를 포함하고, 그에 의해 제어 제스쳐를 판정한다. 일부 실시예에서, 휴대가능한 캡쳐 디바이스는 모션 및 제스쳐를 검출하기 위해 자이로스코프를 사용한다. 단일 칩 링-레이저 자이로스코프는 특히 이러한 태스크에 적합하다.In some embodiments, the portable capture device includes an acceleration sensor, such as an accelerometer, to detect a change in speed or direction, thereby determining a control gesture. In some embodiments, the portable capture device uses a gyroscope to detect motion and gestures. Single chip ring-laser gyroscopes are particularly suitable for this task.

기계(볼 포인트, 롤러 등)Machine (ball point, roller, etc.)

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 광학적으로 인코딩되는 잉크 펜의 볼포인트와 유사한 롤링 엘리먼트나 볼을 포함한다. 볼이 페이퍼 표면을 따라 이동하면서, 광학 센서는 광학적으로 인코딩된 엘리먼트의 모션을 검출한다.In some embodiments, the portable data capture device includes a rolling element or ball similar to the ball point of an optically encoded ink pen. As the ball moves along the paper surface, the optical sensor detects the motion of the optically encoded element.

일부 실시예에서, 휴대가능한 캡쳐 디바이스는 상대적인 모션을 기록하기 위해 볼포인트 전체에 전기적으로 대전된 잉크의 흐름을 측정한다. 상기 볼에 흐르는 잉크에는 이러한 프로브에 의해 검출되는 전하가 주어진다. 볼포인트 하우징에 내장된 전류-감지 프로브는 볼 상의 잉크의 흐름을 검출한다. 복수의 프로브가 있다면, 잉크 흐름의 방향은, 그 결과의 볼의 모션, 및 그에 따른 표면을 지나는 실질적인 모션이 추측될 수 있다. 이 전기적으로 대전된 잉크 기술은 범용목적의 입력 기록 디바이스로서 역할을 할 수 있고, 여기서 유저는 종래 잉크로 기록하고, 반면에 모션이 기입되고 기록될 수 있다. 수신기에서 흘러나오는 대전된 잉크만 센서에 의해 감지되도록 볼이 자신의 하우징에서 드러나는 경계 또는 그 근방의 가드 링이 상기 대전된 잉크를 방전시키는 데에 사용될 수 있다.In some embodiments, the portable capture device measures the flow of electrically charged ink throughout the ballpoint to record the relative motion. The ink flowing in the ball is given the charge detected by this probe. A current-sensitive probe embedded in the ballpoint housing detects the flow of ink on the ball. If there are a plurality of probes, the direction of the ink flow can be inferred from the motion of the resulting ball, and hence the actual motion across the surface. This electrically charged ink technology can serve as a general purpose input recording device, where a user writes in conventional ink, while motion can be written and recorded. A guard ring at or near the boundary where the ball emerges from its housing may be used to discharge the charged ink so that only charged ink flowing out of the receiver is sensed by the sensor.

광학optics

유저는 광학 감지 시스템에 의해 휴대가능한 데이터 캡쳐 디바이스로 데이터 및 코맨드를 입력할 수 있다.The user can enter data and commands into a data capture device that is portable by the optical sensing system.

스캐너/이미징 시스템Scanner / imaging system

키워드 또는 심볼을 스캐닝함으로써, 유저는 코맨드와 데이터를 상기 디바이 스에 입력할 수 있다. 상기 휴대가능한 디바이스는 특정한 그래픽 심볼을 코맨드로 인식하도록 프로그래밍될 수 있다. 예를 들면, 유저가 "$"라는 심볼을 스캔할 때, 휴대가능한 디바이스는 그것을 구매와 같은 금융거래를 시작하라는 코맨드로 인식한다.By scanning keywords or symbols, the user can enter commands and data into the device. The portable device can be programmed to recognize a particular graphic symbol as a command. For example, when a user scans the symbol "$", the portable device recognizes it as a command to initiate a financial transaction such as a purchase.

출력Print

휴대가능한 데이터 캡쳐 디바이스의 UI는 또한 정보를 유저에게 보여줄 수도 있다. 이러한 정보는 대개 디바이스의 동작 상태에 관한 것이다. 유저에게 정보를 보여주기위한 UI 출력 수단은: 청각, 촉각, 및 옵테컬의 3가지 넓은 카테고리로 분류될 수 있다. 휴대가능한 데이터 캡쳐 디바이스의 실시예는 이러한 카테고리의 일부 또는 모두로부터 UI 출력 수단의 다양한 조합을 가질 것이다.The UI of the portable data capture device may also show information to the user. This information is usually about the operating state of the device. UI output means for showing information to the user can be classified into three broad categories: auditory, tactile, and optical. Embodiments of a portable data capture device will have various combinations of UI output means from some or all of these categories.

일부 실시예에서, 휴대가능한 캡쳐 디바이스는 호스트 컴퓨터에 스캔 결과를 전송하고, 디스플레이의 방식으로 유저에게 동작 상태 또는 모드를 통신할 수 있다. 상기 디스플레이는 온보드 휴대가능한 캡쳐 디바이스이거나, 또는 호스트 컴퓨터에 연관될 수 있다. 일부 실시예에서, 휴대가능한 디바이스는 유선 또는 무선 통신 매체를 사용한다. 일부 실시예에서, 유저는 정보를 보여주기위해 호스트 컴퓨터에 연결된 모니터를 사용한다. 적절한 유선 연결의 예로는: RS-232; PS/2; 시리얼; USB; 이더넷; 토큰링; 프린터 연결(예를 들면 IEEE 1284); 방화벽; RJ45(전화선); 홈플러그 및 광섬유를 포함한다. 적절한 무선 연결의 예로는: 무선 이더넷(예를 들면, IEEE 802.11a,b,g); 블루투스™; 적외선(텔레비전 원격제어에서와 같은, IrDA); 및 초광대역을 포함한다. 휴대가능한 디바이스는 청각(예를 들면, 압전 스피커), 촉각(휴대폰의 진동을 포함), 또는 시각적 경고를 이용하여 유저에게 통신한다.In some embodiments, the portable capture device can send the scan results to the host computer and communicate the operating state or mode to the user in the manner of display. The display may be an onboard portable capture device or associated with a host computer. In some embodiments, the portable device uses a wired or wireless communication medium. In some embodiments, the user uses a monitor connected to the host computer to display the information. Examples of suitable wired connections are: RS-232; PS / 2; Serial; USB; Ethernet; Token ring; Printer connection (eg IEEE 1284); firewall; RJ45 (telephone line); Home plug and optical fiber. Examples of suitable wireless connections include: wireless Ethernet (eg, IEEE 802.11a, b, g); Bluetooth ™; Infrared (such as in television remote control, IrDA); And ultra-wideband. The portable device communicates to the user using hearing (eg, piezoelectric speaker), tactile (including vibration of the mobile phone), or visual alert.

다양한 실시예에서, UI는, 오류가 발생하여 유저가 다시 스캔해야하는 것; 다른 디바이스로의 통신 링크가 개방되고 활성화된 것; 휴대가능한 디바이스가 턴온된 것; 제스쳐가 검출된 것; 또는 스캐너가 현재 어떠한 모드인지를 가리킬 수 있다. 예를 들면, 일부 실시예에서, 스캔이 반복될 필요가 있으면 스캐너는 단순히 진동을 한다.In various embodiments, the UI may indicate that an error has occurred and the user has to scan again; The communication link to another device is opened and activated; The portable device is turned on; Gesture is detected; Or it can indicate which mode the scanner is currently in. For example, in some embodiments, the scanner simply vibrates if the scan needs to be repeated.

청각ear

다수의 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 가청 경고음을 유저에게 제공할 수 있다. 이러한 가청 경고는 전기 신호를 사운드로 변환하는 스피커와 같은 전기-음향 변환기를 필요로한다.In many embodiments, the portable data capture device may provide an audible warning sound to the user. Such audible alerts require an electro-acoustic transducer, such as a speaker, that converts electrical signals into sound.

스피커speaker

일부 실시예에서, 휴대가능한 캡쳐 디바이스는 사운드를 생성하기 위해 스피커 또는 압전 엘리먼트를 갖는다. 이들 스피커는 유저에게 텍스트를 읽어주거나 또는 유저에게 디바이스의 상태 변화를 경고하기 위해 사용될 수 있다. 예를 들면, 일부 실시예에서, 상기 디바이스는 페이퍼 문서가 식별되었고, 상기 페이퍼 문서의 전자 사본이 배치되었음을 유저에게 경고하기 위해 스캐닝 동안 발신음을 낸 다. 다른 예로서, 텍스트가 페이퍼 문서로부터 스캔되면서, 캡쳐 디바이스는 텍스트-투-스피치 프로세스를 스캔된 텍스트에 적용하고, 그 결과물인 오디오를 재생한다.In some embodiments, the portable capture device has a speaker or piezoelectric element to produce sound. These speakers can be used to read text to the user or to alert the user to changes in the device's state. For example, in some embodiments, the device emits a dial tone during scanning to alert the user that a paper document has been identified and an electronic copy of the paper document has been placed. As another example, as text is scanned from a paper document, the capture device applies a text-to-speech process to the scanned text and reproduces the resulting audio.

접촉contact

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 진동에 의해 유저와 통신한다. 상기 접촉 UI는 가청 경고를 주변의 노이즈 레벨 때문에 들을 수 없는 환경, 또는 가청 경고가 사회적으로 수용되지 못하는 장소(예를 들면, 극장)에서 특히 유용하다.In some embodiments, the portable data capture device communicates with the user by vibration. The contact UI is particularly useful in environments where audible alerts are not audible due to ambient noise levels, or where audible alerts are not socially acceptable (eg, theaters).

진동vibration

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 디바이스 상태의 변화를 유저에게 알리는 바이브레이트 엘리먼트를 갖추고 있다. 일부 모바일 폰의 실시예에서, 이러한 바이브레이트 엘리먼트는 폰의 배터리 팩에 포함되어 있다. 일부 실시예에서, 드라이버가 그들의 경로 외부로 드리프팅하고 있음을 드라이버에게 알리는 "럼블 스트립(rumble strip)"과 유사하게, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝 헤드가 라인에서 벗어나면 바이브레이팅한다.In some embodiments, the portable data capture device is equipped with a vibrate element that notifies the user of a change in device state. In some mobile phone embodiments, this vibrating element is included in the phone's battery pack. In some embodiments, similar to a “rumble strip” that informs the driver that the driver is drifting out of their path, the portable data capture device vibrates when the scanning head is off the line.

옵티컬Optical

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 시각적 수단에 의해 UI를 통하여 유저와 통신한다. 일부 실시예에서, 이 디바이스는 스캐닝되는 페이퍼상에 메뉴 또는 다른 정보를 투영한다. 예를 들어, 캡쳐 디바이스가 컴퓨터상의 워드프로세싱 프로그램으로 작업하고 있을 때, 이 디바이스는 시스템이 그 워드프로세싱 프로그램내의 오픈 문서내의 노란부분에 스캐닝된 텍스트를 하일라이트할 것을 나타내기 위해 페이퍼상에 노란 빛을 투영할 수 있다. In some embodiments, the portable data capture device communicates with the user via the UI by visual means. In some embodiments, the device projects menu or other information onto the scanned paper. For example, when the capture device is working with a word processing program on a computer, the device may display yellow light on the paper to indicate that the system will highlight the scanned text in the yellow portion of the open document in that word processing program. Can be projected.

디스플레이display

휴대가능한 데이터 캡쳐 디바이스는 디스플레이를 포함할 수 있다. 일부 실시예에서, 니얼바이 디스플레이는 휴대가능한 데이터 캡쳐 디바이스에 대한 정보를 디스플레이에 라우팅하고, 디스플레이상에 정보를 나타내도록 휴대가능한 데이터 캡쳐 디바이스와 연관될 수 있다. 컴퓨터 모니터와 같은 니얼바이 디스플레이를 사용하는 것은 휴대가능한 데이터 캡쳐 디바이스가 디스플레이가 없거나, 정보가 휴대가능한 데이터 캡쳐 디바이스의 작은 디스플레이상에 나타내기 적합하지 않을 때, 특히 유용하다. The portable data capture device can include a display. In some embodiments, the Nialby display may be associated with the portable data capture device to route information about the portable data capture device to the display and present the information on the display. Using a Nialby display, such as a computer monitor, is particularly useful when the portable data capture device has no display or information is not suitable for presentation on a small display of the portable data capture device.

LEDsLEDs

또한, 발광 다이오드(LED)가 유저와 시각적으로 통신하기 위해 사용될 수 있다. 예를 들어, 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 휴대가능한 데이터 캡쳐 디바이스가 켜져 있고 데이터를 캡쳐할 준비가 되었음을 나타내기 위해 녹색 LED를 가동한다.In addition, light emitting diodes (LEDs) may be used to visually communicate with the user. For example, in some embodiments, the portable data capture device activates a green LED to indicate that the portable data capture device is on and ready to capture data.

다른 디바이스와의 통신Communicate with other devices

휴대가능한 데이터 캡쳐 디바이스의 통신 인터페이스는 휴대가능한 데이터 캡쳐 디바이스가 다른 디바이스들과 통신하게 하는 송수신기를 포함한다. 휴대가능한 데이터 캡쳐 디바이스는 컴퓨터, 모바일 폰, 무선 송수신기와 같은 다른 호환가능한 전자 디바이스와 통신할 수 있다. The communication interface of the portable data capture device includes a transceiver that allows the portable data capture device to communicate with other devices. The portable data capture device can communicate with other compatible electronic devices such as computers, mobile phones, wireless transceivers.

유선cable

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 다른 전자 디바이스들과 통신하기 위해 유선 커넥션을 사용한다. 임의의 적합한 프로토콜이 컴퓨터와 연결될 때, 통신을 위해 사용될 수 있다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 연결된 커넥션을 통해 호스트 컴퓨터와 통신하기 위해 유니버설 시리얼 버스(USB) 프로토콜을 사용한다. In some embodiments, the portable data capture device uses a wired connection to communicate with other electronic devices. When any suitable protocol is connected to the computer, it can be used for communication. In some embodiments, the portable data capture device uses the Universal Serial Bus (USB) protocol to communicate with the host computer over a connected connection.

USBUSB

유니버설 시리얼 버스(USB)는 일부 실시예의 휴대가능한 데이터 캡쳐 디바이스에 의해 사용되는 프로토콜이다. 일부 실시예에서, 컴퓨터와 휴대가능한 데이터 캡쳐 디바이스 사이에 통신 채널을 제공하는 것과 더불어, USB는 휴대가능한 데이터 캡쳐 디바이스의 배터리를 재충전하는 전력을 제공한다. 일부 실시예에서, USB 인터페이스는 유저가 USB 메모리 디바이스를 휴대가능한 데이터 캡쳐 디바이스에 부착할 수 있게 한다. Universal Serial Bus (USB) is a protocol used by the portable data capture device of some embodiments. In some embodiments, in addition to providing a communication channel between the computer and the portable data capture device, the USB provides power to recharge the battery of the portable data capture device. In some embodiments, the USB interface allows a user to attach a USB memory device to a portable data capture device.

광섬유Fiber optic

또한 광섬유 통신 채널이 휴대가능한 데이터 캡쳐 디바이스의 일부 실시예에 의해 사용될 수 있다. 다양한 실시예에 대하여 적합한 광섬유 타입은 단일모드 및 멀티모드이다. 멀티모드 광섬유의 일 장점은 값싼 LED 광원을 사용할 수 있다는 것이다. 또한, 커넥터 연결과 배열이 멀티모드 광섬유일 때 덜 중요하다.Fiber optic communication channels may also be used by some embodiments of portable data capture devices. Suitable fiber types for various embodiments are monomode and multimode. One advantage of multimode fiber is the use of cheap LED light sources. It is also less important when the connector connection and arrangement is a multimode fiber.

무선wireless

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스의 통신 인터페이스는 무선 인터페이스이다. 적합한 무선 테크놀로지는 근거리 RF(블루투스, IEEE 802.11, 등), 셀룰러, 또는 광(적외선 등) 등이다. 통신 인터페이스가 무선 능력을 포함하는 경우에, 무선 능력을 구현하기 위해 필수적인 안테나 또는 렌즈를 포함하는 것이 전형적이다. In some embodiments, the communication interface of the portable data capture device is a wireless interface. Suitable wireless technologies are near field RF (Bluetooth, IEEE 802.11, etc.), cellular, or light (such as infrared). If the communication interface includes a wireless capability, it typically includes an antenna or lens that is necessary to implement the wireless capability.

WLAN, 셀룰러, BT, 등WLAN, cellular, BT, etc

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 IEEE 802.11 표준의 송수신기를 이용하여 무선 근거리 통신망(WLAN) 능력을 구현한다. 휴대가능한 데이터 캡쳐 디바이스는 원격 컴퓨터와 통신하기 위해 전형적으로 WLAN "핫스팟"을 사용한다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 모바일폰 또는 개인용 컴퓨터와 같은 니얼바이 디바이스와 쌍을 이루고 통신하기 위해 블루투스(BT) 근거리 RF 방법을 사용한다. 또한, 휴대용 데이터 캡쳐 기능을 구현하는 모바일 폰은 원격 컴퓨터에 캡쳐된 데이터를 전송하기 위해 셀룰러 통신 네트워크를 사용할 수 있다. In some embodiments, the portable data capture device implements wireless local area network (WLAN) capability using a transceiver of the IEEE 802.11 standard. Portable data capture devices typically use WLAN "hotspots" to communicate with remote computers. In some embodiments, the portable data capture device uses a Bluetooth (BT) near field RF method to pair and communicate with a Nialby device, such as a mobile phone or a personal computer. In addition, a mobile phone implementing a portable data capture function may use a cellular communication network to send captured data to a remote computer.

데이터 캡쳐 서브시스템Data capture subsystem

휴대가능한 데이터 캡쳐 디바이스는 데이터 캡쳐 서브시스템을 갖는다. 이 데이터 서브시스템은 일반적으로 보이스, 옵티컬, 및/또는 자성 스트립 데이터를 캡쳐하는 능력을 가진다. 캡쳐된 데이터는 후속 프로세싱 및 전송을 위해 메모리 내에 저장된다. 일부 실시예에서, 캡쳐된 정보는 압축될 수 있고, 그리고/또는 메모리 공간 및 통신 채널 대역폭을 절약하기 위해 자동으로 삭제될 수 있다. 자동 삭의제 일 예는 OCR 프로세스에 의해 텍스트로 전환된 후 스캐닝된 이미지를 삭제하는 것이다. 캡쳐된 정보의 모두를 포함하지 않는 이미지를 저장하는 것이 메모리를 절약할 수 있다. 예는 GIF 또는 JPG와 같은 압축 포맷을 포함한다. 다른 접근은 불필요한 색상 정보를 저장하지 않는 것이다. 예를 들어, 전형적인 CCD 이미지 센서는 각 픽셀당 24레벨의 색상 정보를 캡쳐할 수 있다(즉, 1600만 이상의 상이한 색상을 구별한다). 표준 OCR의 목적을 위해, 휴대가능한 데이터 캡쳐 디바이스는 단지 흰색, 거의 흰색, 거의 검은색, 검은색(2비트)을 구분할 수 있으면 된다. 이러한 24에서 2비트로의 감소는 대략 92%의 저장 공간을 절약하게 한다. The portable data capture device has a data capture subsystem. This data subsystem generally has the ability to capture voice, optical, and / or magnetic strip data. The captured data is stored in memory for subsequent processing and transmission. In some embodiments, the captured information may be compressed and / or automatically deleted to save memory space and communication channel bandwidth. An automatic deletion example is to delete the scanned image after it has been converted to text by the OCR process. Storing images that do not contain all of the captured information can save memory. Examples include compressed formats such as GIF or JPG. Another approach is to not store unnecessary color information. For example, a typical CCD image sensor can capture 24 levels of color information per pixel (ie, distinguish more than 16 million different colors). For the purpose of a standard OCR, a portable data capture device only needs to be able to distinguish between white, almost white, almost black, and black (2 bits). This 24 to 2 bit reduction saves approximately 92% of storage space.

보이스voice

휴대가능한 데이터 캡쳐 디바이스가 광학 데이터 캡쳐 시스템을 가질 때, 일부 실시예에서, 보이스 캡쳐 서브시스템이 많은 환경에서 유용하다. 보이스 캡쳐 서브시스템은 전형적으로 유저가 렌더링된 문서로부터 텍스트를 읽을 수 있게 한다. 온보드 마이크로폰은 말한 단어를 캡쳐한다. 그 다음, 스피치-투-텍스트 어플리케이션은 스피치를 텍스트 형식으로 변환한다. 그 다음 텍스트는, 예를 들어, 렌더링된 문서의 전자 대응물을 로케이팅 하기 위해 사용된다. 일부 실시예에서, 이러한 데이터 캡쳐는 아래에 서술되는 모바일 폰 또는 스캐노테이터(scannotator)이다. When the portable data capture device has an optical data capture system, in some embodiments, the voice capture subsystem is useful in many environments. The voice capture subsystem typically enables the user to read text from the rendered document. The onboard microphone captures the spoken word. The speech-to-text application then converts the speech to text format. The text is then used, for example, to locate the electronic counterpart of the rendered document. In some embodiments, such data capture is a mobile phone or scannotator, described below.

옵티컬 데이터 캡쳐 서브시스템Optical data capture subsystem

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 옵티컬 데이터 캡쳐 서브시스템을 포함한다. 옵티컬 데이터 캡쳐 시스템은 전형적으로 이미지 센서 및 옵티컬 경로를 포함한다. 옵티컬 경로는 휴대가능한 디바이스의 하우징 내에 애퍼쳐(aperture)를 통과한다. 일부 실시예에서, 옵티컬 엘리먼트는 디바이스의 하우징의 일부를 포함한다. 옵티컬 경로는 렌즈 또는 빛을 포커싱하기 위한 애퍼쳐, 및/또는 옵티컬 경로를 보호하기 위한 투명 커버를 포함할 수 있다. 일부 실시예에서, 이미지 콘딧은 렌더링된 문서로부터 이미지 센서로 빛을 가이드하는 옵티컬 경로의 일부이다.In some embodiments, the portable data capture device includes an optical data capture subsystem. Optical data capture systems typically include an image sensor and an optical path. The optical path passes through an aperture in the housing of the portable device. In some embodiments, the optical element includes a portion of a housing of the device. The optical path may include an aperture for focusing the lens or light, and / or a transparent cover for protecting the optical path. In some embodiments, the image conduit is part of an optical path that guides light from the rendered document to the image sensor.

일부 실시예에서, 휴대용 스캐너는 애퍼쳐 뒤에 이미지 센서를 가진다. 일부 실시예에서, 애퍼쳐는 이미지 센서 및 디바이스의 내부 옵티컬 경로를 먼지와 손상으로부터 보호하는 투명 커버를 가진다. 일부 실시예에서, 이 커버는 플라스틱 또는 유리이다. 휴대용 스캐너가 렌즈를 갖춘 경우, 렌즈는 전형적으로 애퍼쳐로부터 페이퍼까지의 거리가 애퍼쳐로부터 이미지 센서까지의 거리에 따라 변화가능하도록 포커싱할 수 있다. 이러한 관계는 1/f=1/u+1/v로 형성될 수 있다(f는 렌즈의 초첨거리, u는 애퍼쳐에서 문서까지의 거리, v는 센서에서 애퍼쳐까지의 거리이다). 일부 실시예에서, 휴대용 스캐너는 하나 이상의 포커싱 렌즈를 사용한다. In some embodiments, the portable scanner has an image sensor behind the aperture. In some embodiments, the aperture has a transparent cover that protects the internal optical path of the image sensor and device from dust and damage. In some embodiments, the cover is plastic or glass. When a portable scanner is equipped with a lens, the lens can typically focus such that the distance from the aperture to the paper is variable depending on the distance from the aperture to the image sensor. This relationship can be formed as 1 / f = 1 / u + 1 / v (f is the focal length of the lens, u is the distance from the aperture to the document, v is the distance from the sensor to the aperture). In some embodiments, the portable scanner uses one or more focusing lenses.

일부 실시예에서, 유저가 휴대가능한 데이터 캡쳐 디바이스를 렌더링된 문서로 움직일 때, 옵티컬 시스템은 데이터를 캡쳐할 수 있다. 휴대가능한 데이터 캡쳐 디바이스가 렌더링된 문서에 접근할 때 데이터를 캡쳐하는 것은 휴대가능한 데이터 캡쳐 디바이스에 넓은 시야를 제공할 수 있고, 그러므로 이 캡쳐의 시각적 컨 텍스트에 대한 부가적인 정보를 제공한다. 이러한 타입의 옵티컬 시스템을 가진 휴대용 스캐너에서, 스캐너가 문서의 표면과 접촉하기 전에도, 스캐너가 문서로부터 데이터를 캡쳐한다. 일부 경우에서, 스캐너가 문서에 접근할 때 데이터를 캡쳐하는 것은 유저가 텍스트의 라인을 따라 러빙하지 않고 터칭 또는 탭핑에 의해 페이퍼와 상호작용할 수 있게 한다. 유저 경험은 유저가 텍스트의 라인을 따라 스캐닝하기보다 ("터칭") 텍스트로 포인팅한다.In some embodiments, the optical system may capture data when the user moves the portable data capture device to the rendered document. Capturing data when the portable data capture device accesses the rendered document can provide a wide field of view to the portable data capture device and therefore provide additional information about the visual context of this capture. In portable scanners with this type of optical system, the scanner captures data from the document even before the scanner contacts the surface of the document. In some cases, capturing data as the scanner accesses the document allows the user to interact with the paper by touching or tapping without rubbing along a line of text. The user experience points to text ("touching") rather than the user scanning along lines of text.

옵티컬 캡쳐 서브시스템 구성Optical capture subsystem configuration

옵티컬 캡쳐 서브시스템은 각각 특정 어플리케이션에 대한 특정 장점을 가진 다양한 구성으로 구현될 수 있다.The optical capture subsystem can be implemented in a variety of configurations, each with specific advantages for a particular application.

일차원 센서 어레이One-dimensional sensor array

일부 실시예에서, 광센싱 엘리먼트는 일차원 선형 센서 어레이이다. 일차원 어레이는 옵티컬 정보를 캡쳐하는 센서의 열(row)로 이루어진다. 일차원 어레이는 몇몇 생체 어플리케이션, 특히 지문 스캐닝에 매우 적합하다. 일부 실시예에서, 센서는 전하 결합 소자(CCD), 또는 상보성 금속 산화물 반도체(CMOS) 디바이스이다. 그러나, 임의의 적합한 광센싱 디바이스로 대체될 수 있다. In some embodiments, the light sensing element is a one-dimensional linear sensor array. One-dimensional arrays consist of rows of sensors that capture optical information. One-dimensional arrays are well suited for some biometric applications, especially fingerprint scanning. In some embodiments, the sensor is a charge coupled device (CCD), or a complementary metal oxide semiconductor (CMOS) device. However, it can be replaced with any suitable light sensing device.

2차원 센서 어레이2-D sensor array

2차원 센서 어레이는 일차원 어레이와 유사하지만, 센서 엘리먼트의 열이 서로로부터 2차원의 동일 평면 오프셋을 가진다. 2차원 어레이는 거리, 스캔 각, 스큐(skew)의 정보를 산출할 수 있다는 장점이 있다. 일부 실시예에서, 2차원 어레이는 센서 엘리먼트의 적어도 2개의 평행 열, 또는 행으로 이루어진다. 그러나, 2 차원 센서 어레이의 많은 토폴로지 변형이 가능하다.Two-dimensional sensor arrays are similar to one-dimensional arrays, but the rows of sensor elements have two-dimensional coplanar offsets from each other. The two-dimensional array has an advantage of calculating distance, scan angle, and skew information. In some embodiments, the two-dimensional array consists of at least two parallel columns or rows of sensor elements. However, many topological variations of two dimensional sensor arrays are possible.

옵티컬 센서 엘리먼트의 이차원 어레이는 문자 구조(팁, 어센더(ascender)/디센더(descender) 수직 엘리먼트), 타이밍, 및 위치의 코릴레이션에 의해 동일 시간에 동작 및 디-스큐(de-skew)를 검출할 수 있다. 어센더/디센더는 평균 텍스트 문자보다 텍스트의 열의 중간선의 위/아래로 더 뻗은 텍스트 문자이다. 어센더의 예는 문자 "t"이다. 디센더의 예는 문자 "p"이다. Two-dimensional arrays of optical sensor elements allow motion and de-skew at the same time by correlation of character structure (tips, ascender / descender vertical elements), timing, and position. Can be detected. An ascender / descender is a text character that extends above and below the midline of a column of text than the average text character. An example of an ascender is the letter "t". An example of a descender is the letter "p".

로직 프로세싱은 옵티컬로 캡쳐된 데이터의 이미지 스큐를 결정할 수 있다. 예를 들어, 헤드 각은 옵티컬로 캡쳐된 데이터로 프린트된 텍스트의 스트롱 수직 엘리먼트를 코릴레이팅함으로써 결정된다. 폰트에 따라서, 스트롱 수직 엘리먼트는 "abcdefghijklmnopqrstuvwxyz"로 이루어진 알파벳 중에서 문자 "bdhiklmnpqrtu" 내에 있다. 또한, "y"는 수직 스트로크가 없는 유일한 어센더/디센더이다. 또한 디-스큐잉 프로세스에 사용될 수 있는 나머지 알파벳 문자의 왼쪽 및 오른쪽 에지와 연관된 수직 정보가 있다. Logic processing may determine image skew of the optically captured data. For example, the head angle is determined by correlating strong vertical elements of text printed with optically captured data. Depending on the font, the strong vertical element is in the letter "bdhiklmnpqrtu" in the alphabet consisting of "abcdefghijklmnopqrstuvwxyz". Also, "y" is the only ascender / descender without vertical strokes. There is also vertical information associated with the left and right edges of the remaining alphabetic characters that can be used in the de-skew process.

렌즈lens

일부 실시예에서, 옵티컬 데이터 캡쳐 서브시스템은 광센싱 엘리먼트에 빛을 포커싱하기 위해 렌즈를 사용한다. 렌즈 서브시스템은 2차원 어레이 광센서에 매우 유용한 추가가 될 수 있다. In some embodiments, the optical data capture subsystem uses a lens to focus light on the light sensing element. The lens subsystem can be a very useful addition to two-dimensional array optical sensors.

광섬유 이미지 콘딧Fiber optic image conduit

일부 실시예에서, 이미지 콘딧은 옵티컬 캡쳐 시스템의 일부를 형성한다. 일부 실시예에서, 광섬유 이미지 콘딧은 캡쳐 정보가 있는 표면에 접촉한다. 일부 실시예에서, 광섬유 이미지 콘딧은 스캔 영역 내에 더 많은 환경광(ambient light)을 허용하도록 스캐닝되는 표면 위에 위치된다. 이러한 구성에서, 개별 광섬유의 제한된 수용각은 이미지 콘딧의 팁이 문서의 표면과 약간 떨어져 있더라도, 이미지가 매우 좋은 품질임을 보증한다. 데이터 캡쳐 끝부(스캐닝되는 표면과 가장 가까운 끝부)에 투명 플라스틱 분리자 또는 캡을 가진 이미지 콘딧은 광섬유 이미지 콘딧이 스캐닝되는 표면에 접촉하지 않고, 그 표면에 더 많은 환경광을 조명하게 하고, 유저에게 스캐닝된 재료의 더 좋은 화면(view)을 제공하는 일 실시예이다. 이미지 콘딧의 팁과 표면 사이의 간격은 전형적으로 0.001인치 내지 0.1인치 범위내이다. 이미지 콘딧은 렌더링된 문서를 수직으로 유지하지 않을 때도, 이미지 콘딧이 데이터 캡쳐가 가능하도록 조각(sculpt)될 수 있다. 일부 실시예에서, 이미지 콘딧은 쐐기형 팁을 가지도록 조각될 수 있다. 일부 실시예에서, 유저가 번들을 통해 렌더링된 문서를 볼 수 있도록, 광 경로를 역으로 보았을 때, 광섬유 이미지 콘딧은 투명하거나, 반투명일 수 있다. 그러므로, 이미지 콘딧은 스캐닝된 이미지를 옵티컬 센서로 전달하는 수단으로써 역할함과 더불어 뷰파인더(viewfinder)로서 역할할 수 있다. In some embodiments, the image conduit forms part of the optical capture system. In some embodiments, the fiber optic image conduit contacts the surface with the capture information. In some embodiments, the fiber optic image conduit is positioned over the surface being scanned to allow more ambient light within the scan area. In this configuration, the limited acceptance angle of the individual optical fiber ensures that the image is of very good quality, even if the tip of the image conduit is slightly away from the surface of the document. An image conduit with a transparent plastic separator or cap at the data capture end (closest to the surface being scanned) allows the fiber optic image conduit to illuminate more ambient light on the surface without contacting the surface being scanned, One embodiment provides a better view of the scanned material. The spacing between the tip and the surface of the image conduit is typically in the range of 0.001 inch to 0.1 inch. Image conduits can be sculpted to allow data capture even when the rendered document is not held vertically. In some embodiments, the image conduit may be carved to have a wedge tip. In some embodiments, the fiber optic image conduits may be transparent or translucent when viewing the light path in reverse, so that the user can view the rendered document through the bundle. Therefore, the image conduit can serve as a viewfinder as well as a means of delivering the scanned image to the optical sensor.

일반적으로, 광섬유의 그룹이 이미지를 전달하는데 사용될 수 있다. 이것은 일차원 어레이에서와 같이, 광섬유의 단일 열(row); 광섬유의 다수의 열; 또는 엄격한 배열이 아닌 광섬유의 그룹 또는 다발일 수 있다. 또한, 많은 광섬유의 플렉시블 블러시가 사용될 수 있다. 광섬유의 고정된 배열이 없는 경우에, 몇몇 이미지를 캡쳐하는 개별 광섬유의 끝부와 센서 엘리먼트와 연결된 다른 끝부 사이의 관 계는 제조시에 또는 사용 동안에 실험적으로 결정될 수 있다.In general, a group of optical fibers may be used to convey the image. This may include a single row of optical fibers, such as in a one-dimensional array; Multiple rows of optical fibers; Or a group or bundle of optical fibers that are not stringent arrangements. In addition, a flexible blush of many optical fibers can be used. In the absence of a fixed arrangement of optical fibers, the relationship between the ends of the individual optical fibers capturing some images and the other ends connected to the sensor elements can be determined experimentally during manufacture or during use.

외부 광섬유 조명External fiber optic lighting

일반적으로, 광섬유 이미지 콘딧은 환경광을 통해 데이터를 캡쳐할 수 있으나, 일부 실시예에서, 이미지 콘딧 광섬유의 서브셋과 같은, 옵션의 엘리먼트가 광원으로부터 문서의 표면으로 빛을 운반할 수 있다. 이러한 광섬유는 필수적으로 렌더링된 문서의 표면을 조명하기 위한 소형의 플래시라이트로써 역할한다. 나머지 광섬유는 조명된 데이터를 캡쳐하고, 이미지 센서로 재전송한다. 전형적으로, 광섬유 이미지 콘딧이 문서의 표면을 따라 쉽게 드러깅되도록 조각된 실시예에서 필수적으로, 이미지 콘딧의 외부 광섬유는 문서로 빛을 전송하기 위해 사용된다. In general, optical fiber image conduits may capture data through ambient light, but in some embodiments, optional elements, such as a subset of image conduit optical fibers, may carry light from the light source to the surface of the document. This optical fiber essentially serves as a compact flashlight for illuminating the surface of the rendered document. The remaining fiber captures the illuminated data and sends it back to the image sensor. Typically, in embodiments where the optical fiber image conduit is easily engraved along the surface of the document, essentially the external optical fiber of the image conduit is used to transmit light to the document.

CCD/CMOS 옵티컬 센서CCD / CMOS optical sensor

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 이미지 센서를 포함한다. 솔리드 스테이트 옵티컬 이미지 센서는 컴퓨터 디스플레이로부터 정보를 캡쳐하며, 현대의 디지털 카메라의 주요 컴포넌트이다. 적합한 이미지 센서의 일 예는 CMOS 이미지 센서이다. 다른 예는 CCD 이미지 센서이다. 이러한 기술 모두는 전형적으로 센서의 그리드를 따라서 전기적 신호로써 컴퓨터 칩이 빛을 측정하게 한다. 다른 예는 광 센시티브 포토 트렌지스터의 선형 어레이이다. In some embodiments, the portable data capture device includes an image sensor. Solid state optical image sensors capture information from computer displays and are a major component of modern digital cameras. One example of a suitable image sensor is a CMOS image sensor. Another example is a CCD image sensor. All of these techniques typically allow a computer chip to measure light as an electrical signal along a grid of sensors. Another example is a linear array of light sensitive photo transistors.

비가시 스펙트럼Invisible Spectrum

일부 실시예에서, 옵티컬 스캐닝 서브시스템은 비가시 스펙트럼에서 동작한다. 비가시 스펙트럼에서 빛을 검출하기 위한 능력으로, 휴대용 스캐닝 디바이스는 UV 또는 IR 특성을 가진 잉크로 프린트된 숨겨진 컨트롤 심벌을 캡쳐할 수 있 다. 적합한 "쓰기가능한" 영역을 포함한 문서에 대하여, 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 이 영역의 (화학적, 온도적, 광학적) 상태를 읽고 변화하고, 그러므로 숨겨진 정보를 남긴다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 문서 또는 문서의 일부를 스캔했음을 나타내는 스캐너에서 인식가능한 특수 잉크(예컨대, IR)를 사용한다.In some embodiments, the optical scanning subsystem operates in the invisible spectrum. With the ability to detect light in the invisible spectrum, portable scanning devices can capture hidden control symbols printed with ink with UV or IR properties. For documents that include a suitable "writable" area, in some embodiments, the portable data capture device reads and changes the state (chemical, thermal, optical) of this area, thus leaving hidden information. In some embodiments, the portable data capture device uses special ink (eg, IR) recognizable in the scanner indicating that the document or part of the document has been scanned.

인간/머신 판독가능성Human / machine readability

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 인간 및 머신이 판독가능한 데이터를 캡쳐한다. 인간이 판독가능한 데이터의 예는 텍스트이다. 머신이 판독가능한 데이터의 예는 바코드, 아이콘, 및 (그래픽에 내장된, 또는 비가시 스펙트럼 특성을 가진 잉크로 쓰여진 것과 같은) 숨겨진 데이터이다. In some embodiments, the portable data capture device captures human and machine readable data. An example of human readable data is text. Examples of machine readable data are barcodes, icons, and hidden data (such as written in ink embedded in graphics or with invisible spectral properties).

디스플레이로부터의 데이터 캡쳐Capture data from display

일부 실시예에서, 또한, 휴대가능한 데이터 캡쳐 디바이스는 디스플레이 디바이스로부터 직접적으로 읽을 수 있고, 그러므로 디스플레이 스크린에 직접적으로 포인팅, 하일라이팅, 발췌, 언더라인, 복사, 붙이기, 삭제, 등을 위해 사용될 수 있다. 이러한 능력은 강력한 문서 편집 시스템을 이끌 수 있고, 이 시스템은 유저가 문서를 프린트하고, 휴대용 스캐너로 프린트된 버전상에서 직접 작업하고, (그리고 또한 다이나믹 디스플레이와 상호작용 가능하고), 그러므로 (수정된) 새로운 버전으로 프린트할 수 있다. 이러한 방법은 페이퍼와 디지털 세상의 최고의 특성의 일부를 결합한다. In some embodiments, the portable data capture device can also read directly from the display device and thus be used for pointing, highlighting, extracting, underlining, copying, pasting, deleting, etc. directly on the display screen. . This ability can lead to a powerful document editing system, which allows the user to print documents, work directly on the printed version with a handheld scanner (and also interact with the dynamic display), and thus (modified) You can print to the new version. This method combines some of the best features of the paper and digital world.

스크린으로부터의 데이터 캡쳐는 디스플레이 상에 도시된 이미지를 광학적으 로 캡쳐함으로써, 또는 데이터 캡쳐를 시도하는 휴대가능한 디바이스를 위에 디스플레이 상에 위치를 결정함으로써 이루어질 수 있다. 이 위치 방법은 디스플레이와 연관된 메모리, 통상적으로 비디오 메모리로부터 이미지를 검색한다. 그 다음 컴퓨터는 그것의 비디오 메모리로부터의 스크린 위치에 디스플레이된 정보를 검색한다. 비디오 메모리내의 정보는 OCR 어플리케이션에 의해, 휴대가능한 디바이스에 의해 직접적으로 캡쳐된 이미지를 처리하는 것과 유사한 프로세싱될 수 있다. Data capture from the screen can be made by optically capturing the image shown on the display, or by positioning the portable device attempting to capture the data on the display. This location method retrieves images from memory associated with the display, typically video memory. The computer then retrieves the information displayed at the screen location from its video memory. The information in the video memory can be processed by the OCR application, similar to processing an image captured directly by the portable device.

생체 검출Biometric detection

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스 보안 및 인증을 위한 생체(보이스, 지문, 망막, DNA) 정보를 캡쳐하는 능력을 가진다. 앞서 언급한 바와 같이, 일차원 선형 옵티컬 어레이는 어레이에 대해 유저의 손가락을 스위핑하여 지문 스캐너로써 기능할 수 있다. In some embodiments, the portable data capture device has the ability to capture biometric (voice, fingerprint, retina, DNA) information for security and authentication. As mentioned above, a one-dimensional linear optical array can function as a fingerprint scanner by sweeping a user's finger against the array.

자성 스크립(신용카드)Magnetic script (credit card)

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 P-커머스(구매) 어플리케이션에 대해 특히 유용할 수 있는, 신용카드에 통상적으로 사용되는 자성 스크립으로부터 데이터를 캡쳐할 수 있다.In some embodiments, the portable data capture device may capture data from magnetic scripts commonly used in credit cards, which may be particularly useful for P-commerce (purchase) applications.

기능/동작 행위Function / Operation Behavior

프로세서 또는 다른 컨트롤 로직은 휴대가능한 데이터 캡쳐 디바이스의 전체 동작을 조화시킨다. 통상적으로, 프로세서는 메모리에 저장된 프로그램으로부터 동작한다. 디바이스의 기능 및 동작 행위에 특별한 관계의, 메모리는 획득, 저장 및 옵티컬 센서에 의하여 획득된 데이터의 프로세싱에 관한 프로그램 명령어를 저 장한다. 프로세서는 렌더링된 문서로부터 데이터의 획득, 저장, 및 프로세싱을 위해 메모리로부터 명령어를 검색할 수 있다. The processor or other control logic coordinates the overall operation of the portable data capture device. Typically, a processor operates from a program stored in memory. Of special interest to the functionality and operational behavior of the device, the memory stores program instructions relating to the acquisition, storage and processing of data acquired by the optical sensor. The processor may retrieve instructions from memory for obtaining, storing, and processing data from the rendered document.

휴대가능한 데이터 캡쳐 디바이스의 다양한 실시예에서 프로세싱 능력은 데이터 캡쳐; 데이터, 특히 이미지 데이터 프로세싱; 데이터 압축 및 다른 이미지 처리; 메모리와 연관된 알고리즘 및 다른 기능 캐싱; 통신; 및 암호화/복호화 알고리즘과 같은 보안 어플리케이션을 위해 사용될 수 있다.In various embodiments of a portable data capture device, the processing capability may include data capture; Data, in particular image data processing; Data compression and other image processing; Algorithm and other function caching associated with memory; Communication; And security applications such as encryption / decryption algorithms.

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 네트워크 및 연관된 컴퓨터와 상호작용을 위해 다양한 모드 및 상태를 가진다. 예를 들어, 일부 실시예에서, 컴퓨터 및 워드프로세싱 소프트웨어로 작업할 때, 휴대용 스캐너는 페이퍼 문서로부터 스캐닝된 텍스트가 전자문서내에서 하일라이팅되도록 하는 하일라이팅 모드; 페이퍼 문서로부터 스캐닝된 텍스트가 전자문서내에서 언더라인되도록 하는 언더라인 모드; 페이퍼 문서로부터 스캐닝된 텍스트가 전자문서내의 커서 위치에 삽입되게 하는 복사 모드 등을 가진다. In some embodiments, the portable data capture device has various modes and states for interacting with the network and associated computer. For example, in some embodiments, when working with a computer and word processing software, the portable scanner may include a highlighting mode that allows text scanned from a paper document to be highlighted in an electronic document; An underline mode for causing the text scanned from the paper document to be underlined in the electronic document; And a copy mode for allowing text scanned from the paper document to be inserted at the cursor position in the electronic document.

유저는 휴대가능한 데이터 캡쳐 디바이스를 유저 인터페이스를 통해 컨트롤할 수 있다. 예를 들어, 유저 인터페이스는 유저에게 메뉴를 디스플레이할 수 있는 디스플레이를 포함할 수 있다. 유저는 휴대용 스캐너의 작동을 컨트롤하기 위해 메뉴 옵션 중에서 선택한다.The user can control the portable data capture device through the user interface. For example, the user interface may include a display capable of displaying a menu to the user. The user selects from menu options to control the operation of the handheld scanner.

휴대가능한 데이터 캡쳐 디바이스의 일부 주요 태스크는 렌더링된 문서로부터의 데이터 캡쳐; 다른 전자 디바이스의 컨트롤; 상태 표시; 데이터 보안 및 유저 프라이버시; 네트워크 데이터의 로컬 캐싱; 키워드 프로세싱; 검색; 및 OCR을 포함 한다. Some major tasks of the portable data capture device include capturing data from the rendered document; Control of other electronic devices; Status display; Data security and user privacy; Local caching of network data; Keyword processing; Search; And OCR.

캡쳐/스캔Capture / Scan

일부 실시예에서, 프로세서는 옵티컬 센서에 의해 캡쳐된 이미지를 검색할 수 있고, 이 이미지 내에 나타난다면, 문자를 결정하기 위해 전통적인 OCR(옵티컬 문자 인식) 기술을 수행한다. In some embodiments, the processor may retrieve an image captured by the optical sensor and, if present within this image, performs traditional OCR (Optical Character Recognition) techniques to determine characters.

시간/위치 스탬프Time / Location Stamp

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 특정 동작을 수행한 시간과 위치를 기록하기 위해 사용되는 시간 및/또는 위치 스탬프를 생성한다. 예를 들어, 유저가 문서로부터 텍스트를 스캔할 때, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝된 문서와 연관된 시간 스탬프, 및/또는 위치 스탬프를 생성한다. 휴대가능한 데이터 캡쳐 디바이스는 스캔을 위한 컨텍스트를 만들기 위해 서비스 제공자의 네트워크 또는 호스트 컴퓨터에 스캐닝된 텍스트와 함께 이 시간/위치 스탬프를 전송한다. 휴대가능한 데이터 캡쳐 디바이스는 시간 데이터를 위한 내부 클록, 또는 네트워크로부터의 시간 신호가 사용가능하다면 네트워크 시간을 사용할 수 있다. GPS 및 많은 다른 방법이 휴대가능한 데이터 캡쳐 디바이스의 위치를 결정하기 위해 사용가능하다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 시간/위치 데이터를 위해 내부 클록 및 GPS 기술을 사용한다. In some embodiments, the portable data capture device generates a time and / or location stamp that is used to record the time and location of performing the particular operation. For example, when a user scans text from a document, the portable data capture device generates a time stamp and / or a location stamp associated with the scanned document. The portable data capture device sends this time / location stamp along with the scanned text to the service provider's network or host computer to create a context for the scan. The portable data capture device may use an internal clock for time data, or network time if a time signal from the network is available. GPS and many other methods are available for determining the location of a portable data capture device. In some embodiments, the portable data capture device uses internal clock and GPS technology for time / location data.

위치 능력이 GPS 수신기의 컨텍스트에서 주로 서술되었지만, 많은 다른 위치 기술이 사용될 수 있다. 이러한 기술의 일부는 EOTD(Enhanced Observed Time Difference), A-GPS(Assisted GPS, DGPS(Differential GPS), TDOA(Time Difference of Arrival), 도착 각도, 트리앵귤레이션, 및 로컬 송수신기 파일럿 신호의 모니터링 등이다. 네트워크 내의 로직이 휴대가능한 데이터 캡쳐 디바이스의 위치를 추정하기 위해 각 베이스 스테이션에서 수신된 신호에 대한 데이터를 코릴레이팅하는 것과 같이, 휴대가능한 데이터 캡쳐 디바이스가 베이스 스테이션과 네트워킹된 신호를 전송할 때, EOTD, TDOA, 및 도각 각도가 가장 적합하다. 트리앵귤레이션은 내부적 또는 외부적일 수 있다. 일부 실시예에서, 적어도 3개의 (IEEE 802.11 베이스 스테이션과 같은) 외부 송수신기에서부터 신호를 수신하고, 수신된 신호의 특성을 기초로 대략적인 위치를 계산할 때, 휴대가능한 데이터 캡쳐 디바이스는 내부적 트리앵귤레이션을 수행한다. 휴대가능한 데이터 캡쳐 디바이스의 외부의 네트워킹된 수신기가 휴대가능한 데이터 캡쳐 디바이스로부터 수신된 신호의 특성을 기초로 휴대가능한 데이터 캡쳐 디바이스의 위치를 추정하기 위해 사용될 때, 외부적 트리앵귤레이션이 발생한다. 외부적 트리앵귤레이션의 일 예는 외부의 수신기로부터 휴대가능한 데이터 캡쳐 디바이스의 거리를 추정하기 위해 하나 이상의 외부의 수신기에서의 수신된 신호 강도를 사용하는 것이다. 고정된 송신기는 모바일 수신기가 원하는 송신기로부터의 신호를 "락 온" 하도록 특정 송신기를 식별하는 파일럿 신호를 방출한다. 이 고정된 송신기의 위치 및 대략적인 커버리지 영역을 알고 있으므로, 휴대가능한 데이터 캡쳐 디바이스의 위치는 송신기가 수신한 것을 기초로 추정될 수 있다. 예를 들어, 휴대가능한 데이터 캡쳐 디바이스가 IEEE802.11 무선 접근점으로부터 신호를 수신하고 있다면, 이 휴대가능한 데이터 캡쳐 디바이스는 무선 접근점의 300피트(현재 IEEE802.11 송신기의 대략적인 외부범위)이내에 있는 것으로 가정할 수 있다. Although location capabilities are primarily described in the context of a GPS receiver, many other location techniques can be used. Some of these technologies include Enhanced Observed Time Difference (EOTD), Assisted GPS, Differential GPS (DGPS), Time Difference of Arrival (TDOA), Arrival Angle, Triangulation, and Monitoring of Local Transceiver Pilot Signals. When the portable data capture device sends a networked signal with the base station, such as logic in the network correlating data for signals received at each base station to estimate the location of the portable data capture device, the EOTD (TDOA, and angle of view are best suited.) Triangulation may be internal or external In some embodiments, the signal is received from at least three external transceivers (such as an IEEE 802.11 base station) and the characteristics of the received signal. When calculating the approximate location based on the Performs re-angulation External triangulation when a networked receiver external to the portable data capture device is used to estimate the position of the portable data capture device based on the characteristics of the signal received from the portable data capture device. An example of external triangulation is to use the received signal strength at one or more external receivers to estimate the distance of the portable data capture device from an external receiver. It emits a pilot signal that identifies the particular transmitter to “lock on” the signal from the desired transmitter, since the location of this fixed transmitter and the approximate coverage area are known, the position of the portable data capture device is determined by Can be estimated on the basis For example, if a portable data capture device is receiving a signal from an IEEE802.11 wireless access point, the portable data capture device will be 300 feet of the wireless access point (approximately out of range of the current IEEE802.11 transmitter). Can be assumed to be within.

캡쳐된 데이터를 통한 컨트롤Control over captured data

캡쳐된 데이터는 앞서 언급된 문서 디스엠비규에이션(disambiguation) 및 전자 대응물 로케이션과 더불어 다양한 사용을 위해 놓여질 수 있다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝된 데이터를 통해 컨트롤되고 프로그래밍된다. 유저는 맵 키와 유사한 커맨드의 출력된 메뉴로부터 또는 간단한 텍스트로부터 커맨드를 스캔할 수 있다. 예를 들어, 유저는 다음의 캡쳐된 데이터가 컨트롤 커맨드로써 처리되어야 함을 휴대가능한 데이터 캡쳐 디바이스에 알리는 특수 아이콘을 스캔할 수 있다. 유저가 휴대가능한 데이터 캡쳐 디바이스가 커맨드와 미리 연관된 동작(이 예에서는 프레드로 호출을 발생시킴)을 수행하도록 하는 "프레드를 호출하라"와 같은 커맨드를 스캔한다. 이와 유사하게, 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 유저가 P-커머스 구매 거래를 개시하기 원함을 나타내는 (스트링의 일부가 아닌) 디바이스 자체에 의해 스캐닝될 때,"구매하라"라는 단어를 인식하도록 프로그램될 수 있다. The captured data can be placed for various uses in conjunction with the document disambiguation and electronic counterpart locations mentioned above. In some embodiments, the portable data capture device is controlled and programmed through the scanned data. The user can scan the command from an output menu of commands similar to the map key or from simple text. For example, the user can scan a special icon informing the portable data capture device that the next captured data should be processed as a control command. The user scans a command such as "Call Fred" that causes the portable data capture device to perform an action previously associated with the command (in this example, making a call to Fred). Similarly, in some embodiments, the portable data capture device uses the word "buy" when scanned by the device itself (not part of the string) indicating that the user wants to initiate a P-commerce purchase transaction. Can be programmed to recognize.

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 유저에 의해 그려진 컨트롤 심벌을 인식한다. 그리하여, 유저는 원하는 커맨드 아이콘 또는 단어를 그림으로써 페이퍼의 임의의 부분상에 커맨드 메뉴를 간편하게 생성할 수 있다. 일부 실시예에서의 시스템에 의해 인식되는 컨트롤 아이콘은 (P-커머스 구매를 개시하기 위한) "$"; (하일라이트 모드로 들어가기 위한) "!"; 및 (다음 번호가 주소록내에 저장되거나 다이얼링될 수 있는 전화 번호임을 나타내기 위한) 폰 아이콘 등 이 있다. In some embodiments, the portable data capture device recognizes the control symbols drawn by the user. Thus, the user can easily create a command menu on any part of the paper by drawing the desired command icon or word. The control icon recognized by the system in some embodiments may be "$" (to initiate a P-commerce purchase); "!" (To enter highlight mode); And a phone icon (to indicate that the next number is a phone number that can be stored or dialed in the address book).

휴대가능한 데이터 캡쳐 디바이스가 본 명세서에 언급된 것 이외의 키워드를 스캔할 때, 휴대가능한 데이터 캡쳐 디바이스의 행위는 캡쳐된 컨트롤 데이터를 사용함으로써 사용가능한 행위의 서브셋이다. When the portable data capture device scans keywords other than those mentioned herein, the behavior of the portable data capture device is a subset of the behaviors available by using the captured control data.

제스처를 통한 컨트롤Gesture Control

휴대가능한 데이터 캡쳐 디바이스와 상호작용하는 유저에 대한 직관적인 방법이 휴대가능한 데이터 캡쳐 디바이스와의 제스처에 의한 것이다. 유저 경험은 특정 제스처에 소정의 동작 및 행동을 연관지음으로써 크게 강화된다. 이러한 제스처의 일부는 제스처를 검출하는 방법과 함께 아래에 서술된다. An intuitive way for the user to interact with the portable data capture device is by gesture with the portable data capture device. User experience is greatly enhanced by associating certain actions and actions with particular gestures. Some of these gestures are described below along with methods of detecting gestures.

발명자는 충분한 길이의 텍스트 스트링이 문서의 풀(pool), 또는 "코퍼스(corpus)"로부터 문서를 디스엠비규에이팅하기 위해 사용될 수 있음을 발견했다. 휴대가능한 데이터 캡쳐 디바이스는 렌더링된 문서에서 피처(텍스트, 아이콘, 등)의 이미지를 캡쳐한다. 이 이미지는 온보드 휴대용 문서 이미지 디바이스에 의해 (예컨대, 피처 추출 테크닉을 적용함으로써) 프로세싱될 수 있고, 휴대용 이미지 디바이스와 통신하는 컴퓨터에 의해 프로세싱될 수 있다. 일반적으로, 캡쳐된 이미지는 알파벳과 숫자의 문자, 예컨대, 텍스트 프래그먼트의 연속 스트링과 대응한다. 이 시스템은 페이퍼 문서의 전자 대응물을 만들고, 페이퍼 문서를 식별하기 위해 텍스트 프래그먼트를 사용한다. 전형적으로, 이것은 적어도 최초의 소정의 길이의 텍스트 프래그먼트를 요구한다. 렌더링된 문서가 디스엠비규에이팅되고 난 후, 페이퍼 문서의 전자 대응물은 상호작용 가능하다. 상호작용의 범위는 유저에 게 전달되는 전자 문서의 대응물을 가지는 것부터, 소스 문서에 관한 부가적인 내용(subject matter)의 전송, 문서 맵(마크업) 정보의 전송, 전자 대응물 문서를 내비게이팅하기 위해 소스 문서를 사용하는 것, 전자 대응물의 편집, 복잡한 재정 처리를 수행하는 것이다. 바람직하게는, 이러한 상호작용은 커맨드 입력 디바이스와 같은 휴대용 문서 이미지 디바이스를 사용하여 이루어진다. 다수의 커맨드 입력을 제공하고, 컴팩트 사이즈를 유지하면서도 직관적으로 사용되는 유저 인터페이스를 가진 휴대용 문서 이미지 디바이스를 갖추는 것이 바람직하다. The inventors have discovered that a text string of sufficient length can be used to disassemble a document from a pool of documents, or "corpus". The portable data capture device captures an image of the feature (text, icon, etc.) in the rendered document. This image can be processed by the onboard portable document image device (eg, by applying feature extraction techniques) and can be processed by a computer in communication with the portable image device. In general, the captured image corresponds to a sequence of letters of the alphabet and numbers, for example a text fragment. The system creates an electronic counterpart of the paper document and uses text fragments to identify the paper document. Typically, this requires a text fragment of at least the first predetermined length. After the rendered document is disembedded, the electronic counterpart of the paper document is interactable. The scope of the interaction ranges from having an electronic document correspondence delivered to the user, from sending additional subject matter to source documents, sending document map (markup) information, and navigating electronic counterpart documents. To use source documents to do this, to edit the electronic counterparts, and to perform complex financial processes. Preferably, this interaction is done using a portable document image device such as a command input device. It is desirable to have a portable document image device that provides a large number of command inputs and has an intuitive user interface while maintaining a compact size.

일부 실시예에서, 휴대가능한 디바이스는 유저의 제스처에 의해 컨트롤된다. 예를 들어, 포워드 방향으로 텍스트를 스캐닝하는 것은 유저가 텍스트를 메모리에 저장하고자 함을 나타낼 수 있다. 동일한 텍스트를 역방향으로 스캐닝하는 것은 유저가 메모리로부터 텍스트를 지우고자 함을 나타낼 수 있다. 문서내의 텍스트를 앞뒤로 러빙하는 것은 유저가 문서의 전자 대응물내의 텍스트를 하일라이팅하기를 원함을 나타낼 수 있다. 이 시스템은 원형 동작, 스네이크 동작, 등과 같은 많은 제스처를 휴대가능한 디바이스의 작동을 컨트롤하기 위해 사용할 수 있다. 많은 행동이 스캐닝 프로세스 시작; 유저가 특정 기사 또는 문서로부터 스캐닝을 완료했다는(그러므로, 후속의 스캐닝된 데이터는 새로운 기사 또는 문서로부터 스캐닝될 것이라는) 신호; 하일라이팅; 이전 엔트리의 삭제; 등과 같은 소정의 제스처와 연관될 수 있다. In some embodiments, the portable device is controlled by the user's gesture. For example, scanning the text in the forward direction may indicate that the user wishes to store the text in memory. Scanning the same text backwards may indicate that the user wishes to erase the text from memory. Rubbing the text back and forth in the document may indicate that the user wants to highlight the text in the electronic counterpart of the document. This system can use many gestures, such as circular motion, snake motion, and the like, to control the operation of the portable device. Many actions begin the scanning process; A signal that the user has completed scanning from a particular article or document (and therefore subsequent scanned data will be scanned from a new article or document); Highlighting; Deletion of the previous entry; May be associated with a predetermined gesture, such as the like.

또한, 발명자는 문서가 디스엠비규에이팅된 (그러므로, 문서 내에서 후속 스캔을 위한 컨텍스트를 설정한) 후 , 더 짧은 텍스트 프래그먼트를 사용하는 문서 내의 위치를 식별이 가능함을 발견했다. 그 다음, 이러한 식별된 위치는, 예를 들어, 문서 주석달기, 문서 편집하기, 문서로부터의 텍스트 및/또는 이미지 추출하기와 같은 문서와의 상호작용을 위한 앵커 포인트로써 사용될 수 있다. In addition, the inventors have found that after a document has been disembedded (and therefore setting the context for subsequent scanning within the document), it is possible to identify a location within the document using shorter text fragments. This identified location can then be used as an anchor point for interaction with the document, such as, for example, annotating the document, editing the document, extracting text and / or images from the document.

일부 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스가 소스 문서의 표면을 따라 움직여질 때, 휴대용 문서 데이터 캡쳐 디바이스가 일련의 이미지 데이터의 프레임을 캡쳐하도록 구성될 수 있다. 일부 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스는 소정의 제한된 속도내에서 사용될 때, 일련의 최소한 부분적으로 오버래핑된 이미지를 캡쳐하도록 구성될 수 있다. 오버랩의 크기는 전형적으로 프레임 사이의 X-Y 동작이 계산되기에 충분해야 한다. 이 휴대용 문서 데이터 캡쳐 디바이스는 캡쳐된 이미지 데이터의 프레임으로부터 피처를 추출하도록 구성되어 있다. 이 휴대용 문서 이미징 디바이스는 캡쳐된 이미지를 텍스트로 변환하기 위해 옵티컬 문자 인식 스킴을 사용하고, 그 다음 캡쳐된 이미지 데이터로부터 텍스트 스트링을 구성하기 위해 스티칭(stitching) 알고리즘을 사용할 수 있고, 또는 텍스트 스트링의 표현을 향상하기 위해서 이미지 데이터의 스티칭된 모든 프레임상의 문자 오프셋을 사용하거나, 또는 상대 위치를 계산하기 위해 이미지 데이터의 후속 프레임의 픽셀레이션 내의 차이를 사용할 수 있다. 옵티컬 문자 인식은 텍스트 스트링을 발생시키기 위해 사용되고, 텍스트 스트링은 그 페이지에 관하여 수평이거나 수직일 수 있다. 휴대용 문서 데이터 캡쳐 디바이스 커맨드 입력과 소스 문서에 관한 휴대용 문서 데이터 캡쳐 디바이스의 소정의 동작(제스처)을 연관시키는 데이터베이스에 동작적으로 연결된다. 휴대용 문서 데이터 캡쳐 디바이스는 제스처/커맨 드 입력 페어링 라이브러리와 함께 미리-구성되어 있을 수 있고, 유저에 의해 트레이닝이 가능할 수 있다. 부가적으로, 제스처는 동일한 제스처의 수행이 그 제스처의 컨텍스트, 예컨대, 문서 내의 시간 프레임 또는 위치에 의존하는 상이한 커맨드 입력의 실행을 일으키는 "오버로딩"될 수 있다. In some embodiments, when the portable document data capture device is moved along the surface of the source document, the portable document data capture device can be configured to capture a frame of a series of image data. In some embodiments, the portable document data capture device may be configured to capture a series of at least partially overlapped images when used within certain limited speeds. The size of the overlap should typically be sufficient for X-Y operation between frames to be calculated. This portable document data capture device is configured to extract a feature from a frame of captured image data. This portable document imaging device can use an optical character recognition scheme to convert the captured image into text, and then use a stitching algorithm to construct a text string from the captured image data, or The character offset on every stitched frame of the image data can be used to enhance the representation, or the difference in pixelation of subsequent frames of the image data can be used to calculate the relative position. Optical character recognition is used to generate a text string, which may be horizontal or vertical with respect to the page. The portable document data capture device is operatively connected to a database that associates a command input with a predetermined action (gesture) of the portable document data capture device with respect to the source document. The portable document data capture device can be pre-configured with a gesture / command input pairing library and can be trained by the user. Additionally, the gesture may be "overloaded" causing performance of the same gesture to cause the execution of different command inputs that depend on the context of the gesture, eg, a time frame or location in the document.

일부 실시예에서, 유저는 소스 문서의 텍스트 라인을 따라 왼쪽에서 오른쪽으로 휴대용 문서 이미지 디바이스를 움직임으로써, 소스 문서의 일 섹션의 이미지를 캡쳐한다. 먼저, 휴대용 문서 이미지 디바이스는 소스 문서가 디스엠비규에이팅된 것인지 판정한다. 소스 문서가 디스엠비규에이팅되지 않았으면, 캡쳐된 이미지 데이터의 프레임으로부터 문서 피처를 추출하고, 소스 문서를 디스엠비규에이팅하기 위해 추출된 피처를 차례로 사용하는 컴퓨터로 추출된 피처를 통신한다. 소스 문서가 이미 디스엠비규에이팅되었으면, 휴대용 문서 이미지 디바이스는 문서 피처를 추출하고, 문서 내부에 위치를 정하기 위해 추출된 피처를 사용하고, 전자 문서내의 상응한 영역/텍스트를 선택한다. In some embodiments, the user captures an image of one section of the source document by moving the portable document image device from left to right along the text line of the source document. First, the portable document image device determines whether the source document has been disassembled. If the source document has not been disassembled, extract the document feature from the frame of captured image data and communicate the extracted feature to a computer that in turn uses the extracted feature to disassemble the source document. . If the source document has already been disembedded, the portable document image device extracts the document feature, uses the extracted feature to locate within the document, and selects the corresponding area / text in the electronic document.

제스처가 문서 내의 위치에 관하여 오버로딩되는 방법의 예로, 실질적으로 소스 문서의 동일한 영역 상에서 왼쪽에서 오른쪽으로 휴대용 문서 이미지 디바이스를 두 번 움직이는 것은 선택된 영역내의 텍스트가 언더라인되도록 한다. 동일한 제스처가 문서 내의 위치에 관하여 오버로딩되는 방법의 다른 예로, 실질적으로 소스 문서의 동일한 영역 상에서 왼쪽에서 오른쪽으로 휴대용 문서 이미지 디바이스를 세 번 움직이는 것은 선택된 영역 내의 텍스트를 굵게 한다. As an example of how a gesture is overloaded with respect to a location in a document, moving the portable document image device twice from left to right on substantially the same area of the source document causes the text in the selected area to be underlined. As another example of how the same gesture is overloaded with respect to a location within a document, moving the portable document image device three times from left to right on substantially the same area of the source document makes the text in the selected area bold.

일부 실시예에서, 실질적으로 소스 문서의 모든 이전에 선택된 영역 상에서 오른쪽에서 왼쪽으로 휴대용 문서 이미지 디바이스를 움직이는 것은 이전 커맨드 입력이 수행되지 않도록 한다. 예를 들어, 오버로딩된 왼쪽에서 오른쪽으로의 선형 제스처의 앞선 서술에 따라서, 유저가 굵게 된 선택된 영역을 가지고 선택된 영역 상에서 오른쪽에서 왼쪽으로 휴대용 문서 이미지 디바이스를 한 번 움직였다면, 선택된 영역 내의 텍스트는 굵은 상태에서 언더라인으로 변하게 될 것이다. 선택된 영역 상에서 오른쪽에서 왼쪽으로 휴대용 문서 이미지 디바이스를 한 번 움직이는 것은 선택된 영역내의 텍스트가 초기의 포맷으로 변하게 하고, 오른쪽에서 왼쪽으로의 동작을 3번 반복하는 것은 선택된 영역을 완전히 선택해제 한다.In some embodiments, moving the portable document image device from right to left on substantially all previously selected areas of the source document prevents previous command input from being performed. For example, according to the preceding description of the overloaded left-to-right linear gesture, if the user has moved the portable document image device once from right to left on the selected area with the selected area thickened, the text in the selected area is bold. Will change from underline to underline. Moving the portable document image device once from right to left on the selected area causes the text in the selected area to change to the initial format, and repeating the right to left operation three times completely deselects the selected area.

오버로딩의 이점을 설명하기 위해, 일부 실시예에서, 소스 문서의 이전에 선택된 영역의 일부 위에서 오른쪽에서 왼쪽으로 휴대용 문서 이미지 디바이스를 움직이는 것은 이전에 선택된 영역/텍스트의 일부가 삭제되게 한다. To illustrate the benefits of overloading, in some embodiments, moving the portable document image device from right to left over a portion of a previously selected area of the source document causes a portion of the previously selected area / text to be deleted.

일부 예에서, 유저는, 예를 들어, 하나 또는 다수의 단락을 복사하거나 삭제하는 것과 같이 텍스트의 상대적으로 큰 블록과 상호작용하고자 할 수 있다. 일부 실시예에서, 유저는 시작 위치를 정하기 위해 왼쪽에서 오른쪽으로 휴대용 문서 이미지 디바이스를 움직이고, 끝 위치를 정하기 위해 왼쪽에서 오른쪽으로 휴대용 문서 이미지 디바이스를 움직인다. 휴대용 문서 이미지 디바이스는 선택된 영역의 시작과 끝을 정하기 위해 텍스트 스트링(또는 그것의 심볼적 표현)을 사용한다. 그 다음 후속 커맨드 입력이 선택된 영역에서 기능한다. 예를 들어, 일부 실시예에서, 유저가 휴대용 문서 이미지 디바이스를 시작과 끝 위치 사이에 "X"자 패턴으로 움직이면, 시작과 끝 위치 사이의 소스 문서의 영역은 삭제된다. 이와 유사하 게, 일부 실시예에서, 유저가 선택된 영역 내에서, 예컨대, 지그재그식과 같이 휴대용 문서 이미지 디바이스를 페이지 아래로 움직이면서 앞뒤로 움직인다면, 선택된 영역은 삭제된다. 유저가 선택된 영역 내에서 휴대용 문서 이미지 디바이스를 아래로 움직이면, 선택된 영역은 하일라이팅될 것이다. 일부 실시예에서, 유저가 선택된 영역에서 원형의 방식으로 휴대용 문서 이미지 디바이스를 움직이면, 선택된 영역은 복사된다.In some examples, a user may wish to interact with a relatively large block of text, such as copying or deleting one or multiple paragraphs. In some embodiments, the user moves the portable document image device from left to right to establish a start position, and moves the portable document image device from left to right to establish an end position. The portable document image device uses a text string (or a symbolic representation thereof) to determine the start and end of the selected area. Subsequent command inputs then function in the selected area. For example, in some embodiments, if the user moves the portable document image device in a "X" pattern between the start and end positions, the area of the source document between the start and end positions is deleted. Similarly, in some embodiments, if the user moves back and forth while moving the portable document image device down the page, such as in a zigzag pattern, the selected area is deleted. If the user moves the portable document image device down within the selected area, the selected area will be highlighted. In some embodiments, if the user moves the portable document image device in a circular manner in the selected area, the selected area is copied.

일부 실시예에서, 유저는 소스 문서의 영역을 선택하기 위해 원형 동작을 사용한다. 이 시스템은 전형적으로 전자 문서 내에 있어야 하는 소스문서의 포맷을 실질적으로 유지하거나, 마크업 문서내의 번역 정보를 사용하여 전자 대응물에 페이퍼 문서의 레이아웃을 매핑한다. 휴대용 문서 이미지 디바이스는 휴대용 문서 이미지 디바이스가 원형 패턴으로 움직이는지를 판정하도록 일련의 이미지 데이터의 프레임으로부터 추출된 피처를 사용하고, 문자 매핑 스킴을 사용하는 전자 문서 내의 선택된 영역의 위치를 판정하기 위해 추출된 피처를 사용한다. 일부 실시예에서, 상이한 커맨드 입력은 시계방향의 원형동작 및 시계반대방향의 원형 동작과 각각 연관된다. In some embodiments, the user uses a circular action to select an area of the source document. The system typically substantially maintains the format of the source document that should be in the electronic document, or uses the translation information in the markup document to map the layout of the paper document to the electronic counterpart. The portable document image device uses features extracted from a series of frames of image data to determine whether the portable document image device moves in a circular pattern, and to determine the location of a selected area within an electronic document using a character mapping scheme. Use the feature. In some embodiments, different command inputs are associated with clockwise circular motion and counterclockwise circular motion, respectively.

일부 실시예에서, 휴대용 스캐너는 제스처를 기초로 컨트롤을 인식하고 작동한다. 예를 들어, 일부 실시예에서, 휴대용 스캐너는 텍스트 위에서 옵티컬 센서를 패싱함으로써 텍스트를 스캔하고, 그 후 스캐너 내의 메모리에 저장한다. 반대 방향으로 텍스트 위에서 옵티컬 센서를 패싱함으로써, 텍스트는 메모리에서 삭제된다. 일부 실시예에서, 원형으로 스캐닝함으로써, 유저는 스캐너의 호스트 컴퓨터 상의 워드프로세싱 프로그램내의 전자 문서내의 원형 텍스트를 하일라이팅할 수 있다. In some embodiments, the portable scanner recognizes and operates the control based on the gesture. For example, in some embodiments, a portable scanner scans text by passing an optical sensor over the text and then stores it in memory in the scanner. By passing the optical sensor over the text in the opposite direction, the text is deleted from memory. In some embodiments, by scanning in a circle, a user can highlight the circular text in an electronic document in a word processing program on the scanner's host computer.

동작 검출Motion detection

제스처 기반 커맨드를 검출하고 작동하기 위해, 휴대가능한 데이터 캡쳐 디바이스는 동작을 검출하고 해석할 수 있어야 한다. 동작을 검출하고 제스처 커맨드에 동작을 매핑하는 다양한 방법이 아래에 서술된다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 제스처를 식별하기 위해 후속 이미지 캡쳐들 사이에 동작 벡터를 계산한다.In order to detect and operate gesture based commands, a portable data capture device must be able to detect and interpret motion. Various methods of detecting an action and mapping the action to a gesture command are described below. In some embodiments, the portable data capture device calculates a motion vector between subsequent image captures to identify the gesture.

일부 실시예에서, 휴대용 스캐너는 동작이 검출될 때마다 제스처 해석 어플리케이션을 개시한다. 동작을 검출하는 일 방법은 옵티컬 마우스와 같이, 연속적으로 캡쳐된 이미지를 비교하는 것이다. 일부 실시예에서, 제1이미지가 패턴을 위해 분석된다. 프로세서는 이 이미지를 메모리로 가져오기 위해 소프트웨어 명령어를 사용하고, 그 다음 배경과 다른 이미지의 부분을 찾는다(예컨대, 흰 배경상에 검은 텍스트를 식별하는 것). 이 프로세서는 이 패턴의 위치와 무슨 패턴인지 메모리에 기록한다. 그 다음, 이 프로세서는 제2이미지를 로딩하고 오리지널 패턴의 검출을 시도한다. 그 다음, 그 패턴의 위치가 제1이미지와 어떻게 변했는지를 비교한다. 이 차이가 벡터로서 인코딩된다. 이러한 프로세스를 반복함으로써, 일련의 벡터가 형성된다. "점의 연결"과 유사하게, 이러한 라인 세그멘트, 또는 벡터는 동작 시퀀스를 알아낼 수 있다. In some embodiments, the portable scanner launches a gesture interpretation application each time an action is detected. One way of detecting motion is to compare successively captured images, such as an optical mouse. In some embodiments, the first image is analyzed for a pattern. The processor uses software instructions to bring this image into memory, and then finds a portion of the image that is different from the background (eg, identifying black text on a white background). The processor records the location of this pattern and what pattern it is in memory. The processor then loads the second image and attempts to detect the original pattern. Next, compare how the position of the pattern has changed with the first image. This difference is encoded as a vector. By repeating this process, a series of vectors is formed. Similar to "connection of points", these line segments, or vectors, can determine the sequence of motions.

일부 실시예에서, 이 프로세서는 제1 및 제2이미지내의 픽셀(또는 몇몇 대표 픽셀) 사이에 벡터를 찾을 수 있다. 이미지 사이에 벡터를 계산하기 위해서, 프로세서는 제1 및 제2이미지의 캡쳐 사이의 기간 동안에 이 디바이스의 이동 경로를 판정하기 위해, 먼저 수평 축을 따라 이미지를 비교한 다음 수직 축을 따라 이미지를 비교한다. 그 다음 프로세서는 제1이미지 내의 모든 픽셀을 오른쪽으로 한 픽셀씩(일부 픽셀은 더 이상 이 이미지의 부분이 아닌 채로) 움직인다. 그 다음 프로세서는 이 벡터를 다시 계산한다. 이 벡터가 더 짧아졌다면, 프로세서는 픽셀 사이의 수평 거리가 없을 때까지, 픽셀을 이동시킨다. 이 벡터가 더 길어졌다면, 프로세서는 픽셀을 왼쪽으로 이동시킨다. 이동 벡터의 수평 컴포넌트가 결정되고 나면, 프로세서는 이동 벡터의 수직 컴포넌트를 결정하기 위해 수직축을 따라 비교를 반복한다. 프로세서가 이동 벡터의 수직 및 수평 컴포넌트를 계산했을 때, 제1 및 제2 이미지 사이의 상대적인 선형 동작을 알게 된다. In some embodiments, the processor may find a vector between pixels (or some representative pixels) in the first and second images. To calculate the vector between the images, the processor first compares the images along the horizontal axis and then the images along the vertical axis to determine the path of travel of the device during the period between the capture of the first and second images. The processor then moves every pixel in the first image one pixel to the right (with some pixels no longer part of this image). The processor then recalculates this vector. If this vector is shorter, the processor moves the pixels until there is no horizontal distance between the pixels. If this vector is longer, the processor moves the pixel to the left. After the horizontal component of the motion vector is determined, the processor repeats the comparison along the vertical axis to determine the vertical component of the motion vector. When the processor calculates the vertical and horizontal components of the motion vector, it knows the relative linear motion between the first and second images.

의도된 제스처의 판정Determination of the intended gesture

일부 실시예에서, 제스처가 의도된 것인지를 판정하기 위한 노력으로 동작 벡터의 계산이 따른다. 이 단계의 복잡도는 제스처 분류 존재에 의존할 수 있다. 예를 들어, 스캐너가 오직, 예컨대, 백워드의 한 제스처만 인식한다면, 분류는 동작의 임의의 수직 컴포넌트를 고려할 필요가 없을 것이다. 스캐너가 백워드의 제스처만 인식하는 실시예와 같은, 일부 실시예에서, 본 명세서에 서술된 바와 같이, 다수의 벡터가 단일 대표 벡터로 대체될 수 있다. 예를 들어, 유저가 완벽하게 수평으로 스캔하려 했지만 수직으로 약간 흔들렸다면, 이 스캐너는 이 유저가 수평 라인을 의도했음을 판정하고, 많은 벡터를 하나의 수평 벡터로 대체할 수 있다. In some embodiments, the computation of the motion vector follows with an effort to determine if the gesture is intended. The complexity of this step may depend on the presence of the gesture classification. For example, if the scanner recognizes only one gesture of, for example, a backward, the classification will not need to consider any vertical component of the operation. In some embodiments, such as embodiments in which the scanner only recognizes gestures of backwards, as described herein, multiple vectors may be replaced with a single representative vector. For example, if a user attempts to scan perfectly horizontal but slightly shakes vertically, the scanner can determine that the user intended the horizontal line, and replace many vectors with one horizontal vector.

백워드 및 포워드Backward and forward

휴대가능한 데이터 캡쳐 디바이스와의 제스처의 직관적이고 기본적인 방법은 텍스트의 라인을 따라 포워드 및 백워드로 스캔하는 것이다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 저장된 문자 탬플릿과 스캐닝된 텍스트 이미지를 비교함으로써, 텍스트의 라인을 따른 포워드 및 백워드의 움직임을 인식한다. 예로써, 영어 알파벳을 사용하여, 스캐닝된 문자가 탬플릿과 매칭한다면, 스캐너는 왼쪽에서 오른쪽(포워드)으로 움직인 것이다. 스캐닝된 문자가 탬플릿의 미러 이미지이면, 스캐너는 오른쪽에서 왼쪽(리버스)으로 움직인 것이다. 일부 실시예에서, 백워드 및 포워드 이동은 앞서 언급된 벡터 방법에 의해 판정된다. An intuitive and basic method of gesture with a portable data capture device is to scan forward and backward along a line of text. In some embodiments, the portable data capture device recognizes the movement of the forward and backward along a line of text by comparing the scanned text template with the scanned text image. By way of example, using the English alphabet, if the scanned character matches the template, the scanner has moved from left to right (forward). If the scanned text is a mirror image of the template, the scanner has moved from right to left (reverse). In some embodiments, the backward and forward movements are determined by the aforementioned vector method.

원(Circle)Circle

휴대가능한 데이터 캡쳐 디바이스와의 제스처의 직관적이고 기본적인 다른 방법은 텍스트의 영역에 원을 그리는 것이다. 일부 실시예에서, 원형상 제스처는 앞서 언급된 벡터 방법에 의해 식별된다. 일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 원형 이동을 검출하기 위해 절대위치 정보를 사용한다. 절대위치 정보를 얻는 일 방법은, 예컨대, 문서에 프린트된 인코딩된 그리드를 가진 문서로부터 정보를 얻는 것이다. 이 디바이스에 의해 취해진 각각의 이미지는 문서 표면에 관한 이 디바이스의 이동을 판정하기 위해 사용될 수 있는 절대위치 정보를 담고 있다. Another intuitive and basic method of gesture with a portable data capture device is to circle a region of text. In some embodiments, the circular gesture is identified by the aforementioned vector method. In some embodiments, the portable data capture device uses absolute position information to detect circular movement. One way of obtaining absolute position information is to obtain information, for example, from a document having an encoded grid printed on the document. Each image taken by this device contains absolute position information that can be used to determine the movement of this device relative to the document surface.

벡터 접근을 사용할 때, 이 프로세서는 벡터가 다른 벡터, 특히, 제1벡터의 시작점을 교차하는지 판정하기 위해 동작 벡터를 모두 더한다. 원형상 제스처 검 출 테크닉과 결합하여, 교차가 있는지 판정하기 위해 벡터가 사용되고, 그 다음 그러한 교차가 실제로 일어났었는지 판정하기 위해 절대위치 분석이 사용된다. When using a vector approach, the processor adds all of the motion vectors to determine if the vector intersects the start of another vector, in particular the first vector. In combination with the circular gesture detection technique, a vector is used to determine if there is an intersection, and then absolute position analysis is used to determine if such an intersection actually occurred.

도 8은 유저가 원형상 제스처를 만들었음을 검출하기 위해 시스템에 의해 전형적으로 수행되는 단계를 보여준다. 단계(800)에서, 이 시스템이 새로운 기본 제스처를 검출했을 때, 이 단계가 시작한다. 단계(810)에서, 이 시스템은 이 제스처 그 자체에 교차가 있는지 판정한다. 도 9는 원형상 제스처를 수행하려는 유저의 시도의 몇몇 예를 도시한다. 제1제스처(900)는 (910)에서 교차한다. 이 예에서, 이 동작의 시작과 끝은 실제로 서로 교차할 때이고, 그러므로 교차로써 검출된다. 제2제스처(920)는 원으로 판정될 수도 있는 제스처를 도시한다. 시작과 끝이 (930)에서 서로 가장 가깝다. 일부 실시예에서, 이러한 거리는 교차로 간주될 수 있는 오차 이내일 수 있다. 제3제스처(940)는 원으로 간주될 수 없는 제스처를 도시한다. 일부 실시예에서, 두 가까운 점(950, 960) 사이의 거리는 교차로 간주되기에 너무 멀다(그러나, 일부 실시예는 이것 조차도 인정하도록 프로그램될 수도 있다). 이러한 제스처가 그 자체와 교차되었다면, 원 검출의 프로세서는 시스템의 수직 컴포넌트를 고려하는 도 8의 단계(820)로 계속된다. 제스처가 자체와 교차되지 않았다면, 새로운 제스처를 기다리기 위해 이러한 반복으로 복귀한다. 일부 실시예에서, 수직 컴포넌트는 그 제스처가 유저가 원으로 해석되기 원하지 않는 러빙 제스처가 아닌지 확정하기 위해 고려될 수 있다. 일부 실시예에서, 수직 컴포넌트는 그 제스처 동안에 도달한 가장 높은 점과 낮은 점 사이의 차이일 수 있다. 일부 실시예에서, 이 스테이지는 이 차이와 임계값을 비교함으로써 판정될 수 있고, 이 프로세서는 그 제스처가 원이 아닌지 판정한다. 이 컴포넌트가 만족되면, 이 시스템은 수평 컴포넌트를 고려하는 단계(830)로 계속된다. 수평 평가는 수직 평가와 유사하게 수행된다. 제스처가 이러한 표준(교차, 수직 및 수평)을 모두 만족하면, 이 시스템은 단계(840)에서 그것을 원으로 분류한다. 이러한 표준 중 하나가 만족하지 않으면, 프로세스는 새로운 제스처를 기다리기 위해 단계(800)로 돌아간다.8 shows the steps typically performed by the system to detect that the user made a circular gesture. In step 800, when the system detects a new basic gesture, this step begins. In step 810, the system determines if there is an intersection in this gesture itself. 9 shows some examples of a user's attempt to perform a circular gesture. The first gesture 900 intersects at 910. In this example, the beginning and the end of this operation are actually when they cross each other and are therefore detected as crossings. The second gesture 920 shows a gesture that may be determined to be a circle. Start and end are closest to each other at 930. In some embodiments, this distance may be within an error that can be considered to be an intersection. The third gesture 940 shows a gesture that cannot be considered a circle. In some embodiments, the distance between two close points 950 and 960 is too far to be considered as an intersection (but some embodiments may be programmed to accept even this). If this gesture was crossed with itself, then the processor of original detection continues to step 820 of FIG. 8 considering the vertical component of the system. If the gesture did not intersect itself, it returns to this iteration to wait for a new gesture. In some embodiments, the vertical component may be considered to confirm that the gesture is not a rubbing gesture that the user does not want to interpret as a circle. In some embodiments, the vertical component may be the difference between the highest point and the lowest point reached during the gesture. In some embodiments, this stage can be determined by comparing this difference with a threshold, and the processor determines if the gesture is not a circle. If this component is satisfied, the system continues to step 830 considering the horizontal component. Horizontal evaluation is performed similarly to vertical evaluation. If the gesture meets all of these standards (cross, vertical and horizontal), the system classifies it as a circle in step 840. If one of these criteria is not satisfied, the process returns to step 800 to wait for a new gesture.

러빙(Rubbing)Rubbing

텍스트의 스트링을 앞뒤로 러빙하는 것은 휴대용 이미지 데이터 캡쳐 디바이스를 컨트롤하기 위해 사용될 수 있는 직관적이고 기본적인 또 다른 제스처이다. 일부 실시예에서, 앞뒤로 러빙 제스처는 하일라이팅 커맨드로 해석될 수 있다. 예를 들어, 유저가 포워드 제스처로 일련의 스캔을 하고, 러빙 제스처로 하나의 스캔 목표를 지정할 수 있다. 이에 응답하여, 이 스캐너는 후속의 검색시에 그 부분이 하일라이팅되도록 러빙 동작에 의해 식별된 텍스트를 플래그(flag)할 수 있다(예컨대, 이 "러빙된" 텍스트가 밝게 된 부분 위에 있다). 다른 실시예에서, 포워드 제스처로 표시된 텍스트는 언더라인될 수 있다. Rubbing a string of text back and forth is another intuitive and basic gesture that can be used to control a portable image data capture device. In some embodiments, rubbing gestures back and forth may be interpreted as highlighting commands. For example, a user may perform a series of scans with a forward gesture and a scan target with a rubbing gesture. In response, the scanner may flag the text identified by the rubbing operation so that the portion is highlighted in subsequent retrieval (eg, this “rubbed” text is above the lightened portion). In another embodiment, the text indicated by the forward gesture may be underlined.

도 10은 러빙 제스처를 검출하기 위해 시스템에 의해 전형적으로 수행되는 단계를 도시하는 플로우 다이어그램이다. 본 명세서에 서술된 바와 같이, 러빙 제스처는 수직으로 아래 위로의 동작; 일부 경우에는, 유저가 텍스트의 스트링을 수평으로 앞뒤로의 러빙 동작이다. 도 10에 도시된 프로세스에서, 새로운 기본 제스처가 단계(1000)에서 시작된다. 단계(1010)에서, 이 시스템은 서술된 이외의 방향 을 검출하고, 단계(1020)에서, 이 시스템은 방향의 변화를 검출한다. 단계(1030)에서, 이 시스템은 그 방향이 이전 이동 방향의 역인지를 보기 위해 방향 변화를 평가한다. 일부 실시예에서, 역(reverse)은 이전 벡터의 끝점에서부터 170과 190도 사이의 포인트가 새로운 벡터일 때로 정의된다(정확히 반대방향은 180도이다).10 is a flow diagram illustrating steps typically performed by a system to detect a rubbing gesture. As described herein, the rubbing gesture is vertically up and down; In some cases, the user rubs the string of text horizontally back and forth. In the process shown in FIG. 10, a new basic gesture begins at step 1000. In step 1010, the system detects a direction other than that described, and in step 1020, the system detects a change in direction. In step 1030, the system evaluates the change in direction to see if the direction is inverse of the previous direction of travel. In some embodiments, the reverse is defined when the point between 170 and 190 degrees from the end of the previous vector is the new vector (exactly the opposite direction is 180 degrees).

새로운 방향이 역이 아니면(일부 실시예에서, 스캐닝의 종료를 포함), 이 시스템은 단계(1000)로 새로운 제스처를 기다리기 위해 계속된다. 이 새로운 방향이 역이면, 그 후 이 시스템은 또 다른 방향 변화를 검출하기 위해 단계(1040)로 계속된다. 단계(1040)로부터, 이 시스템은 이 새로운 방향이 두번째 역인지 결정하는 단계(1050)로 계속된다. 동작의 세번째 방향이 두번째 방향과 역이면, 이 시스템은 러빙 제스처와 연관된 소정의 행위를 수행하기 위해 단계(1060)로 계속된다.If the new direction is not reverse (in some embodiments, including the termination of scanning), the system continues to wait for a new gesture to step 1000. If this new direction is inverse, then the system continues to step 1040 to detect another direction change. From step 1040, the system continues with step 1050 to determine if this new direction is the second reverse. If the third direction of operation is inverse to the second direction, the system continues to step 1060 to perform the predetermined action associated with the rubbing gesture.

삭제를 위한 백워드Backward for deletion

일부 실시예에서, 포워드 스캔은 스캐너가 메모리에 스캐닝된 정보를 저장하도록 한다. 이 스캔, 또는 그것의 색션이 후속의 백워드 제스처로 스캐닝되면, 백워드 방향으로 스캐닝된 이 부분은 메모리로부터 삭제된다. 예로써, 도 11은 문서(1120)에서 백워드 방향(1110)으로 움직이는 스캐너(1100)를 도시한다. 더 이전의 포워드 스캔에 의해 메모리에 캡쳐되고 저장된 텍스트가 박스(1130)로 도시되어 있다. 박스(1140)는 백워드 스캔에 의해 캡쳐된 "처음" 문자인 최우측 문자로부터, 백워드 방향으로 스캐닝된 텍스트를 보여준다. 박스(1140)에 있는 텍스트가 백워드 스캔에 의해 캡쳐된 것일 때, 각 문자는 이전에 스캐닝된 스트링과 비교된다. 이 스캐너(1110)는 백워드 스캔에 의한 처음(최우측)문자와 포워드 스캔의 마 지막(최우측) 문자를 비교하고, 이와 유사한 방법으로, 백워드 스캔에서부터의 문자와 스캐닝된 스트링과의 매칭이 끝날 때까지 계속된다. 이 스캐너는 포워드 스캔에서의 상응하는 위치의 문자와 매치하지 않는 백워드 스캔에서의 문자를 만날 때, 두 스트링의 비교를 멈춘다. 이 비교가 멈춘 후, 이 스캐너는 매칭된 문자를 메모리에서 삭제한다. In some embodiments, the forward scan causes the scanner to store the scanned information in memory. If this scan, or its section, is scanned with a subsequent backward gesture, this portion scanned in the backward direction is deleted from the memory. By way of example, FIG. 11 shows the scanner 1100 moving in the backward direction 1110 in the document 1120. Text captured and stored in memory by a previous forward scan is shown by box 1130. Box 1140 shows the text scanned in the backward direction from the rightmost character, which is the "first" character captured by the backward scan. When the text in box 1140 is captured by a backward scan, each character is compared with a previously scanned string. The scanner 1110 compares the first (rightmost) character by the backward scan with the last (rightmost) character of the forward scan and, in a similar manner, matching the characters from the backward scan with the scanned string. Continue until it's over. The scanner stops comparing two strings when it encounters a character in a backwards scan that does not match the character of the corresponding position in the forward scan. After this comparison stops, the scanner deletes the matched characters from memory.

스캐닝 센서가 각 문자의 이미지를 함께 스티치하는 방향을 관찰함으로써, 이 스캐너는 팔린드로메(palindrome)를 검출하고, 삭제 제스처로 그것을 해석하지 않게 한다. 이 시스템은 그 스캔이 일어나는 방향을 관찰함으로써, 팔린드로메를 검출한다. 포워드(왼쪽에서 오른쪽) 방향으로 스캐닝된 팔린드로메는 후속의 왼쪽에서 오른쪽으로의 이미지 스티칭에 의해 구성될 것이다. 오른쪽에서 왼쪽(백워드)으로의 스캔은 문자의 온른쪽에서 시작하여 왼쪽으로 움직이는 문자 이미지를 캡쳐할 것이다. 영어에 있어서, 이 오른쪽에서 왼쪽으로의 이동은 초기의 왼쪽에서 오른쪽으로의 스캔의 미러 이미지인 이미지를 야기한다. 팔린드로메 문자는 미러 이미지가 아닐 것이고, 그러므로 역 스캔과 구별될 수 있다.By observing the direction in which the scanning sensor stitches the image of each character together, the scanner detects the palindrome and does not interpret it as a delete gesture. The system detects palindrome by observing the direction in which the scan occurs. Palindrome scanned in the forward (left to right) direction will be constructed by subsequent left to right image stitching. Scanning from right to left (backward) will capture a text image starting from the right side of the text and moving to the left. In English, this right-to-left shift results in an image that is a mirror image of the initial left-to-right scan. The Palindrome character will not be a mirror image and can therefore be distinguished from a reverse scan.

제스처 및 컴퓨터 모니터Gesture and computer monitor

일부 실시예에서, 제스처 커맨드는 컴퓨터 디스플레이상에 렌더링된 문서로 사용될 수 있다. 예를 들어, 유저가 텍스트가 삽입된 위치를 식별하기 위해 컴퓨터 모니터상에 삽입기호("^") 스캔을 제스처할 수 있다. 이 예에서, 이 스캐너는 컴퓨터와 통신하고, 나타난 위치에 텍스트를 삽입하기 위한 커맨드로써 삽입기호 제스처를 인식한다. 이에 응답하여, 컴퓨터는 마지막 포워드 스캔으로부터 텍스트 를 삽입한다. In some embodiments, gesture commands can be used as documents rendered on a computer display. For example, a user may gesture an insertion sign (“^”) scan on a computer monitor to identify where the text was inserted. In this example, the scanner communicates with the computer and recognizes the insert sign gesture as a command to insert text at the location indicated. In response, the computer inserts text from the last forward scan.

일부 실시예에서, 스캐너는 마우스, 조이스틱, 또는 다른 포인팅 디바이스와 유사한 방법으로 컴퓨터와 상호작용하기 위해 사용될 수 있다. 예를 들어, 스캐너는 수직적으로 포인팅 다운을 유지함으로써 조이스틱과 같은 기능을 할 수 있다. 유저가 이 조이스틱을 기울이거나, 또는 이동에 의해 주어진 방향으로 움직일 때, 이 움직임은 스캐너의 이미지의 변화로 반사될 수 있다. 예를 들어, 스캐너가 포워드로 기울여지면, 이미지 센서는 각각 더 스큐되어, 반대 방향으로 움직이는 일련의 이미지로 기록될 수 있다. 얼마나 이 이미지의 부분의 움직였는지 또는 스큐되었는지 매핑함으로써, 이 스캐너는 그것이 움직여진 정도를 판정할 수 있다. 다른 예로써, 유저는 컴퓨터 포인팅 악세사리로써 휴대용 스캐너를 사용함으로써, 컴퓨터 모니터 상에 문서를 통해 스크롤할 수 있다. In some embodiments, the scanner may be used to interact with the computer in a similar manner as a mouse, joystick, or other pointing device. For example, the scanner can function like a joystick by keeping the pointing down vertically. When the user tilts this joystick or moves it in the direction given by the movement, this movement can be reflected as a change in the image of the scanner. For example, if the scanner is tilted forward, the image sensors may each be further skewed and recorded as a series of images moving in opposite directions. By mapping how much of the part of this image has moved or skewed, the scanner can determine how far it has been moved. As another example, a user may scroll through a document on a computer monitor by using a handheld scanner as the computer pointing accessory.

다른 디바이스의 컨트롤 및 다른 디바이스와의 어소시에이션Control of other devices and association with other devices

일부 실시예에서, 이 휴대가능한 데이터 캡쳐 디바이스는 다른 전자 디바이스를 컨트롤할 수 있고, (예컨대, 컴퓨터 디스플레이를 사용함으로써) 그 자신의 유저 인터페이스를 강화하기 위해 다른 전자 디바이스를 사용할 수 있고, 그리고 예를 들어, 스캐닝된 데이터 트리를 사용함으로써, 다른 전자 디바이스의 유저 인터페이스를 강화할 수 있다. In some embodiments, this portable data capture device can control other electronic devices, use other electronic devices to enhance its own user interface (eg, by using a computer display), and For example, by using the scanned data tree, it is possible to enhance the user interface of other electronic devices.

(필요하다면) 렌더링된 문서로부터 다른 식별자 또는 타이틀을 스캐닝함으로써 컨텍스트를 설정한 후에, 휴대가능한 디바이스는 스캐닝에 의해 원하는 동작을 나타내기 위해 사용될 수 있다. 예를 들어, 유저는 VCR+ 코드의 스캔이 뒤따르는, 텔레비젼 가이드와 같은 문서를 식별하는 코드를 스캐닝함으로써 그의 비디오 레코더(VCR)를 프로그래밍할 수 있다. 그 VCR+ 코드는 VCR이 그 코드와 연관된 소정의 동작을 수행하도록 IR 통신에 의해 VCR과 통신된다. After setting the context by scanning another identifier or title (if necessary) from the rendered document, the portable device can be used to indicate the desired action by scanning. For example, a user can program his video recorder (VCR) by scanning code that identifies a document, such as a television guide, followed by a scan of the VCR + code. The VCR + code is communicated with the VCR by IR communication so that the VCR performs certain operations associated with the code.

특히, 블루투스, USB, 또는 IEEE 802.11 커넥션을 가진, 디바이스 주변의 스캐너는 프로그래밍 행위를 정의하기 위해 사용될 수 있다. 전자 레인지(microwave) 주변의 얼려진 음식 패키지의 스캐닝은 적절한 요리 시간을 설정할 수 있다. 자동차의 컨텍스트에서, 주소 스캐닝은 그 스캐너가 그 자동차의 온보드 내비게이션 시스템을 그 주소로 프로그래밍하도록 할 수 있다. In particular, scanners around the device with Bluetooth, USB, or IEEE 802.11 connections can be used to define programming behavior. Scanning of frozen food packages around the microwave can set an appropriate cooking time. In the context of a car, address scanning can cause the scanner to program the car's onboard navigation system to that address.

다른 디바이스의 유저 컨트롤 인터페이스는 휴대가능한 데이터 캡쳐 디바이스의 능력에 의해 강화될 수 있다. 본질적으로, 휴대가능한 데이터 캡쳐 디바이스는 페이퍼로부터 정보를 스캐닝함으로써 다른 디바이스를 컨트롤한다. 전형적인 시스템에서, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝된 정보를 블루투스 페어링을 가진 다른 디바이스에 대한 명령어로 변환한다. The user control interface of the other device can be enhanced by the capabilities of the portable data capture device. In essence, the portable data capture device controls the other device by scanning information from the paper. In a typical system, the portable data capture device converts the scanned information into instructions for another device with Bluetooth pairing.

니얼바이 디바이스와의 어소시에이션Association with Nialby devices

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 호스트 머신과 쌍을 이룬다. 호스트 머신은 컴퓨터, PDA 디바이스, 또는 모바일 폰과 같은 모바일 통신 디바이스 또는 Blackberry™ 텍스트 메세징 디바이스가 바람직하다. 인증 및 보안 정보의 교환은 휴대가능한 데이터 캡쳐 디바이스 및 호스트 디바이스 사이의 페어링 프로세스의 일부이다. 휴대가능한 데이터 캡쳐 디바이스는 인증 및 보안 프로시저를 현재 연결되지 않은 호스트 디바이스와 상호작용를 하기 전에 수행한 다. 보안 프로시저는 생체 식별과 같은, 유저 식별 프로시저를 선택적으로 포함할 수 있다. In some embodiments, the portable data capture device is paired with a host machine. The host machine is preferably a computer, PDA device, or mobile communication device such as a mobile phone or Blackberry ™ text messaging device. The exchange of authentication and security information is part of the pairing process between the portable data capture device and the host device. Portable data capture devices perform authentication and security procedures before interacting with a host device that is not currently connected. The security procedure may optionally include a user identification procedure, such as biometric identification.

도 12는 휴대가능한 스캐너와 니얼바이 디바이스를 연관하기 위한 시스템 구성의 블럭 다이어그램을 도시한다. 모바일 디바이스(1218)는 휴대가능한 스캐너 기능(1210)과 상호작용하거나 통합할 수 있다. 사람에 의해 편리하게 전달되도록 디자인된 스캐너인 휴대가능한 스캐너(1210)는 펜-타입 디바이스, 마우스, 원격 컨트롤, 또는 휴대가능한 폰일 수 있다. 휴대가능한 스캐너(1210)는 근거리 통신 능력(예컨대, 블루투스™와 같은 근거리 RF, USB와 같은 근거리 유선, 등)을 포함할 수 있고, 근거리 통신 능력은 모바일 디바이스(1218)와 통신하기 위해 사용될 수 있다. 이 스캐너는 시스템에 다른 그러한 스캐너들 사이에서 그 스캐너를 고유하게 식별하는 유저 ID 코드(1222)를 포함한다.12 shows a block diagram of a system configuration for associating a portable scanner with a Nialby device. Mobile device 1218 can interact or integrate with portable scanner function 1210. Portable scanner 1210, a scanner designed to be conveniently delivered by a person, may be a pen-type device, mouse, remote control, or portable phone. The portable scanner 1210 may include near field communication capability (eg, near field RF such as Bluetooth ™, near field wire such as USB, etc.), and the near field communication capability may be used to communicate with the mobile device 1218. . This scanner includes a user ID code 1222 that uniquely identifies the scanner among other such scanners in the system.

모바일 디바이스(1218)의 예는 랩탑, 노트북, 또는 서브 노트북 컴퓨터; PDA와 같은 휴대용 컴퓨터; 또는 다른 무선 전화 등이다. 일부 실시예에서, 스캐너 기능(1210) 및 모바일 디바이스(1218)는 동일한 디바이스이다. Examples of mobile device 1218 include a laptop, notebook, or sub notebook computer; Portable computers such as PDAs; Or other cordless phone. In some embodiments, scanner function 1210 and mobile device 1218 are the same device.

가능하다면 다른 정보와 함께, 하나 이상의 스캔에서 휴대용 스캐너(1210)에 의해 캡쳐된 정보는 컨텐트 로케이션 및 검색 서비스(1206)와 통신하는 네트워크(1202)와 통신한다. 일부 실시예에서, 이 정보는 컨텐트 요청/로케이션/검색 동작을 개시할 수 있다. 적어도 하나의 스캔으로부터의 정보는 프린트된 소스, 예컨대, 신문, 잡지, 전단지, 책, 메뉴얼, 팸플릿, 라벨, 또는 광고로부터 얻을 수 있다. 또한, 하나 이상의 스캔으로부터의 정보는 전기적, 또는 디지털로 디스플레이 된 정보, 예컨대, 텍스트, 바코드, 아이콘, 그림, 또는 전자 디스플레이로부터의 다른 정보로부터 얻을 수 있다.If possible, along with other information, the information captured by handheld scanner 1210 in one or more scans is in communication with network 1202 in communication with content location and retrieval service 1206. In some embodiments, this information may initiate a content request / location / search operation. Information from at least one scan can be obtained from a printed source, such as a newspaper, magazine, leaflet, book, manual, pamphlet, label, or advertisement. In addition, information from one or more scans may be obtained from electrical or digitally displayed information, such as text, barcodes, icons, pictures, or other information from an electronic display.

모바일 디바이스(1218)는 더 넓은 범위의 통신 능력을 네트워크(1202)에 제공한다. 그러한 통신의 예는 (예컨대, 다이얼 업 모뎀을 사용하는) 표준 공중 교환 전화망, 디지털 가입자 라인, 비동기식 디지털 가입자 라인, 케이블 모뎀, 이더넷, 광대역 LAN 기술, IEEE 802.11과 같은 무선 LAN 기술, 및 무선 셀 폰 기술등이다. Mobile device 1218 provides a wider range of communication capabilities to network 1202. Examples of such communications include standard public switched telephone networks (eg, using dial-up modems), digital subscriber lines, asynchronous digital subscriber lines, cable modems, Ethernet, broadband LAN technologies, wireless LAN technologies such as IEEE 802.11, and wireless cell phones. Technology.

네트워크(1202)는 통신 스위칭, 라우팅, 및 데이터 저장 능력을 포함한다. 특히, 네트워크(1202)는 시스템의 컴포넌트 사이에 정보를 라우팅하고 전송한다. 네트워크(1202)는 인터넷, 인트라넷 또는 인트라넷들, 유선 및/또는 무선 네트워크 부분을 포함할 수 있다. Network 1202 includes communications switching, routing, and data storage capabilities. In particular, network 1202 routes and transmits information between components of the system. Network 1202 may include the Internet, intranet or intranets, wired and / or wireless network portion.

디바이스 데이터베이스(1204)는 휴대용 스캐너(1210), 일부 실시예 및/또는 일부 조건하에서, 모바일 디바이스(1218)와 연관될 수 있는 디바이스에 대한 정보를 포함한다. 일부 실시예에서, 디바이스 데이터베이스(1204)는 디바이스 식별자의 어소시에이션에 디바이스 어드레스와 제공한다. 또한, 디바이스 데이터베이스(1204)는 디바이스 식별자의 연관에 지원되는 컨텐트 타입을 제공한다. 일부 실시예에서, 디바이스 데이터베이스(1204)는 관련 데이터베이스, 인덱스, 매핑 테이블, 및 강화된 도메인 네임 서비스 중 하나 이상을 포함한다. Device database 1204 includes information about handheld scanner 1210, a device that may be associated with mobile device 1218 under some embodiments and / or some conditions. In some embodiments, device database 1204 provides a device address and association to the device identifier. The device database 1204 also provides the content types supported for the association of device identifiers. In some embodiments, device database 1204 includes one or more of an associated database, indexes, mapping tables, and enhanced domain name services.

디바이스 어소시에이션(1208)은 휴대용 스캐너와 입/출력(I/O), 저장부, 또는 프로세싱 디바이스 사이에 어소시에이션을 구성한다. 일부 실시예에서, 디바이 스 데이터베이스(1204) 및 디바이스 어소시에이션(1208)은, 예를 들어, 컨텐트 검색부(1206)에 의한 다른 기능에 의해 분리되어 접근될 수 있는 다른 기능이다. 일부 실시예에서, 디바이스 어소시에이션(1208) 및 디바이스 데이터베이스(1204)는 공통 기능의 컴포넌트로 통합될 수 있다. Device association 1208 configures an association between a handheld scanner and an input / output (I / O), storage, or processing device. In some embodiments, device database 1204 and device association 1208 are other functions that can be accessed separately, for example, by other functions by content retrieval unit 1206. In some embodiments, device association 1208 and device database 1204 may be integrated into components of common functionality.

컨텐트 검색부(1206)는, 특히, 디바이스 정보 및 디바이스 관련 정보를 얻기 위해 디바이스 데이터베이스(1204) 및 디바이스 어소시에이션(1208)과 통신한다. 일부 실시예에서, 디바이스 테이터베이스(1204) 및/또는 디바이스 어소시에이션(1208)은 네트워트(1202)와 같은 네트워크를 사용하여 컨텐트 검색부(1206)와 통신한다. The content retrieval unit 1206, in particular, communicates with the device database 1204 and the device association 1208 to obtain device information and device related information. In some embodiments, device database 1204 and / or device association 1208 communicate with content retrieval unit 1206 using a network, such as network 1202.

디바이스 데이터베이스(1204), 이 디바이스 어소시에이션(1208), 및 컨텐트 검색부(1206)는 "서비스 제공자"를 구성할 수 있다. 서비스 제공자는 클라이언트 요청을 충족하는 정보 및/또는 서비스의 네트워크 접근가능한 제공자이다. 서비스 제공자는 예약-기반, 광고 지원, 사용-당-페이, 및 컨텐트 및/또는 통신 서비스 처리-당-페이로 제공할 수 있다. The device database 1204, this device association 1208, and the content retrieval unit 1206 may constitute a “service provider”. Service providers are network accessible providers of information and / or services that meet client requests. The service provider may provide reservation-based, ad support, per-use-pay, and content and / or communication service processing-per-pay.

컨텐트 검색부(1206)는 컨텐트 로케이션 및 검색 기능을 포함한다. 컨텐트는 텍스트, 디지털 사운드 또는 음악, 또는 다나 이상의 디지털 이미지 또는 비디오 중 적어도 하나이다. 이 컨텐트 검색(1206)은 휴대용 스캐너(1210)에 의해 스캐닝된 정보에 의해 식별되고, 관계되고, 그리고/또는 대응하는 컨텐트를 가진다. The content retrieval unit 1206 includes a content location and retrieval function. The content is at least one of text, digital sound or music, or one or more digital images or videos. This content search 1206 has identified, related, and / or corresponding content by the information scanned by the portable scanner 1210.

컨텐트 검색부(1206)는 네트워크(1202)와 통신하고, 휴대용 스캐너(1210)와 연관된 I/O, 저장부, 또는 프로세싱 디바이스에 로케이팅된 컨텐트를 제공한다. The content retrieval unit 1206 communicates with the network 1202 and provides the located content to I / O, storage, or processing devices associated with the portable scanner 1210.

연관 디바이스는, 특히 이미지/비디오 렌더링 시스템(1212) 또는 오디오 렌더링 시스템(1214)이다. 몇몇의 디바이스(예를 들어 통합 디바이스(1216))는 오디오 및 이미지/비디오 시스템(1212,1214)을 포함한다. 그런 통합 디바이스(1216)의 예로서 랩탑 컴퓨터, 데스크탑 컴퓨터, 텔레비젼, 다수 사용자 컴퓨터 시스템 또는 키오스크를 포함한다.The associating device is in particular an image / video rendering system 1212 or an audio rendering system 1214. Some devices (eg, integrated device 1216) include audio and image / video systems 1212, 1214. Examples of such integrated device 1216 include laptop computers, desktop computers, televisions, multi-user computer systems or kiosks.

휴대 스캐너(1210)와 연관될 수 있는 다른 디바이스는 데이터 저장 디바이스(1220) 또는 프린터를 포함한다. 데이터 저장 디바이스(1220)의 예는 컴퓨터 하드 드라이브, 휴대 플래시 저장 디바이스, 휴대 음악 및/또는 비디오 및/또는 전자책 플레이어(예를 들어, 휴대 콘텐츠 플레이어), 및 광 저장 매체를 포함한다. 랩탑, 데스크탑, 및 네트워크 기반 컴퓨터와 같은 연산 자원은 스캐너(1210)와 연관된 프로세싱 능력을 향상시키기 위하여 휴대 스캐너(1210)와 연관될 수 잇다.Other devices that may be associated with the handheld scanner 1210 include a data storage device 1220 or a printer. Examples of data storage device 1220 include computer hard drives, portable flash storage devices, portable music and / or video and / or ebook players (eg, portable content players), and optical storage media. Computational resources such as laptops, desktops, and network-based computers may be associated with the handheld scanner 1210 to enhance the processing power associated with the scanner 1210.

콘텐츠가 전달되어야 할 디바이스를 식별하는 것은 연관된 디바이스에 대한 디바이슬 식별자를 수신하는 것을 포함할 수 있다. 디바이스 식별자는 스캐너와 연관된 휴대 디바이스(1218) 또는 스캐너(1210)에 의해 제공되어 질 수 있다. 디바이스 식별자의 예는 바코드, 유일한 디바이스 시리얼 넘버, 인터넷 프로토콜 어드레스와 같은 네트워크 어드레스, 영숫자 코드, 또는 유일한 디바이스 이름일 수 있다.Identifying the device to which content is to be delivered may include receiving a device identifier for the associated device. The device identifier may be provided by the portable device 1218 or scanner 1210 associated with the scanner. Examples of device identifiers may be bar codes, unique device serial numbers, network addresses such as Internet protocol addresses, alphanumeric codes, or unique device names.

몇 실시예에서, 연관된 디바이스의 네트워크 어드레스는 필수적이지만 연관되 디바이스의 식별자는 그렇지 않다. 시스템은 어떤 경우에 연관된 디바이스의 능력의 완전한 이해 없이도 작동할 수 있다. 다른 경우, 그 능력은 추론될 수 있 다. 예를 들어, 디바이스가 웹 브라우저를 통해 통신 세션 식별자를 요청하고, 스캐너가 이어서 통신 세션 식별자를 시스템에 제출하면, 디바이스는 세션 식별자가 스캔되는 디스플레이를 가지고 있을 것이다.In some embodiments, the network address of the associated device is required but the identifier of the associated device is not. In some cases, the system can operate without a full understanding of the capabilities of the associated device. In other cases, the ability can be deduced. For example, if the device requests a communication session identifier via a web browser, and the scanner subsequently submits the communication session identifier to the system, the device will have a display from which the session identifier is scanned.

몇 실시예에서, 스캐너를 식별하는 유일한 사용자(또는 디바이스) ID와 연관되도록 하기 위해 하나 이상의 디바이스가 스캐너의 사용자에 의해 "등록"된다. 예를 들어, 스캐너의 사용자에 의해 소유된 랩탑 컴퓨너는 스캐너의 유일한 사용자 및/또는 디바이스 ID와 연관된 "디바이스 ＃1"로 등록될 수 잇다. (따라서, 연관 디바이스 식별자는 단일 사용자에 의해 등록된 제한된 수의 디바이스라면 매우 간단할 것이다) 연관 디비이스는 현재의 네트워크 어드레스를 서비스 프로바이더에게 자동저으로 등록하는 로직을 포함할 수 있다(네트워크 어드레스는 예를 들어, 랩탑 이 새로운 위치로 이동되고, 인터넷과 새로운 연결을 설립할 때마다 자주 바뀌기 때문이다). 이것은 서비스 프로바이더와 새로운 세션을 개시할 때 사용자의 작업을 간단하게 한다. 왜냐하면, 사용자는 단지 연관 디바이스의 식별자를 스캔만하고, 자동적으로 서비스 프로바이더에게 연관 디바이스를 참조하고, 현재의 네트워크 어드레스를 검색하고 이후의 시스템 응답을 지정된 디바이스로 전달하도록 하는 명령을 전달하기 때문이다. 게다가, 서비스 프로바이더의 시스템의 관리가 서비스 프로바이더에게 알려진 모든 디바이스에서 유일한 식별자의 세트를 생성하고 유지할 필요가 없기 때문에 간단하다. 서비스의 각각의 사용자는 단순히 사용될 디바이스를 등록하면 되고, 긴 시리얼 넘버와 같은 복잡한 디바이스 식별자를 적용(및 이후에 적용)할 필요가 없다.In some embodiments, one or more devices are "registered" by a user of the scanner to be associated with a unique user (or device) ID that identifies the scanner. For example, a laptop computer owned by a user of a scanner may be registered as a "device # 1" associated with a unique user and / or device ID of the scanner. (The associated device identifier will therefore be very simple if there is a limited number of devices registered by a single user.) The associated device may include logic to automatically register the current network address with the service provider (network address). For example, because laptops move to new locations and change frequently each time they establish new connections with the Internet). This simplifies the user's work when initiating a new session with the service provider. This is because the user only scans the identifier of the associated device and automatically passes the command to the service provider to refer to the associated device, retrieve the current network address and forward the subsequent system response to the designated device. . In addition, the management of the service provider's system is straightforward because there is no need to create and maintain a unique set of identifiers on every device known to the service provider. Each user of the service simply needs to register the device to be used, and does not need to apply (and later apply) a complex device identifier such as a long serial number.

게다가, 주어진 스캐너(및/또는 사용자)와 연관된 좁은 영역의 디바이스로부터 디바이스를 선택하는 것은 바람직한 디바이스를 식별하기 위해 다양한 방법을 사용할 수 있도록 해준다. 예를 들어, 디바이스는 선택된 아이콘을 스캔하거나, 스캐너로 특정 제스춰를 수행함으로써 식별될 수 있다.In addition, selecting a device from a narrow area of the device associated with a given scanner (and / or user) allows the use of various methods to identify the desired device. For example, the device may be identified by scanning the selected icon or by performing a particular gesture with the scanner.

디바이스 식별자는 그것을 스캔하고 그것을 콘텐츠 위치 및 저장(1206, 서비스 프로바이더) 시스템으로 전달함으로써 제공될 수 있다. 몇 실시예에서, 시스템은 디바이스 식별자가 휴대 스캐너(1210)에 의해 스캔될 수 있도록 디바이스의 시각적인 디스플레이상에 나타나도록 한다. 사용자는 상기 디바이스에 첨부된 시리얼 넘버로부터 디바이스 식별자를 스캔하거나, 디바이스에 첨부된 바코드로부터 스캔할 수 있다. 연관 디바이스의 식별자는 스캐너(1210)에 의한 콘텐츠 요청/위치 지정/검색 동작과 함께 또는 전에 제공될 수 있다. The device identifier may be provided by scanning it and delivering it to a content location and storage 1206 (service provider) system. In some embodiments, the system allows the device identifier to appear on the visual display of the device so that it can be scanned by the handheld scanner 1210. The user can scan the device identifier from the serial number attached to the device or from the barcode attached to the device. The identifier of the associated device may be provided with or before the content request / location / search operation by scanner 1210.

몇 실시예에서, 시스템은 하나 이상의 디바이스를 적어도 부분적으로 휴대 스캐너(1210)의 위치와 가까이 있기 때문에 휴대 스캐너(1210)와 연관시키기 위해 선택한다. 몇 실시예에서, 시스템은 GPS 위성 위치 정보, 다수 RF 송수신기를 이용하여 삼각측량된 정보 및/또는 휴대 스캐너(1210) 근처에 사용중인 Wi-Fi 또는 다른 무선 액세스 포인트 위치를 이용하여 휴대 스캐너(1210)의 위치를 식별한다. In some embodiments, the system selects one or more devices to associate with the handheld scanner 1210 because it is at least partially close to the location of the handheld scanner 1210. In some embodiments, the system may use GPS satellite location information, triangulated information using multiple RF transceivers, and / or handheld scanner 1210 using Wi-Fi or other wireless access point location in use near the handheld scanner 1210. ) Location.

몇 실시예에서, 시스템은 위치지정된 콘텐츠의 타입(예를 들어 텍스트, 비디오, 아니면 오디오인지)의 특징을 검사하고, 근처의 후보 디바이스가 상기 콘텐츠 타입의 렌더링을 지원하는 지를 결정함으로써 휴대 스캐너와 연관시키기 위한 하나 이상의 디바이스를 선택한다.In some embodiments, the system associates with a handheld scanner by examining a characteristic of the type of positioned content (eg, whether it is text, video, or audio) and determining if a nearby candidate device supports rendering of the content type. Select one or more devices to make.

몇 실시예에서, 연관 디바이스에 대한 디바이스 식별자는 연관 디바이스의 네트워크 어드레스를 식별하기 위해 사용된다. 디바이스 식별자는 시스템에 알려진 모든 다른 디바이스로부터 상기 디바이스를 구별하는 유일한 ID이거나, 스캐너(1210)와 연관된 유일한 사용자 및/또는 디바이스 ID(1222)와 함깨, 상기 시스템에 대해 상기 디바이스를 유일하게 식별하도록 작용하는 식별자일 수 있다. 네트워크 어드레스는 여러 가능성 중에서도 IP 어드레스, MAC 어드레스, URL, 또는 정보가 전달된느 특정 디바이스인 것으로 네트워크(1202)에 의해 인식되어 지는 디바이스 이름 또는 식별자를 포함한다.In some embodiments, the device identifier for the associated device is used to identify the network address of the associated device. The device identifier is a unique ID that distinguishes the device from all other devices known to the system, or acts to uniquely identify the device to the system, with a unique user and / or device ID 1222 associated with the scanner 1210. It may be an identifier. The network address includes, among other possibilities, an IP address, a MAC address, a URL, or a device name or identifier that is recognized by the network 1202 as being a specific device to which information is conveyed.

몇 실시예에서, 시스템은 휴대 스캐너(1210)가 상기 디바이스와 연관되어 있는 한 휴대 스캐너(1210)를 사용하는 사람에 의해 배타적 사용되도록 연관 디바이스를 구성하는 것에 의해 위치지정된 콘텐츠를 연관 디바이스에 전달한다. 연관 디바이스를 스캐너(1210)를 사용하는 사람에 의해 배타적인 액세스가 되도록 구성하는 것은 공중 또는 반공중 환경에서 특히 중요하다.In some embodiments, the system delivers the positioned content to the associating device by configuring the associating device for exclusive use by the person using the handheld scanner 1210 as long as the handheld scanner 1210 is associated with the device. . Configuring the associated device for exclusive access by the person using scanner 1210 is particularly important in an air or semi-air environment.

몇 실시예에서, 시스템은 휴대 스캐너(1210) 및/또는 연관 휴대 디바이스(1218)에 그 액세스가 연관된 I/O 또는 저장 디바이스에 의해 제어되는 정보에 액세스를 제공한다. 그런 정보의 예는 스캐너(1210)의기능을 가능 및/또는 용이하게 하는 정보이고, 키워드 정의, 문서 인덱스, OCR /또는 음성 인식을 용히하게 하는 테이블 및 파라미터를 포함할 수 있다.In some embodiments, the system provides portable scanner 1210 and / or associated portable device 1218 with access to information controlled by the I / O or storage device with which the access is associated. Examples of such information are information that enables and / or facilitates the functionality of the scanner 1210 and may include tables and parameters that facilitate keyword definitions, document indexes, OCR / or speech recognition.

도 13 은 스캔 디바이스 및 서비스 프로바이더에 연관되는 전형적인 쿼리 세션을 도시하는 블록 다이어그램이다. 이 실시예에서, 세션 지향 애플리케이션은 웹 브라우저이다.FIG. 13 is a block diagram illustrating an exemplary query session associated with a scan device and a service provider. In this embodiment, the session oriented application is a web browser.

휴대 스캐너(1210)는 디스플레이(1302)를 포함하는 컴퓨터 시스템과 상호작용하고 상기 컴퓨터 시스템으로부터 정보를 캡쳐한다. 컴퓨터 시스템의 예는 데스크탑, 랩탑, 또는 휴대용 컴퓨터, PDA, 또는 셀룰러 또는 다른 무선폰을 포함한다. 컴퓨터 시스템은 웹 브라우저(1304) 로직을 포함한다. 웹 브라우저(1304)는 서버를 가진 네트워크를 통해 일반적으로 통신한다. 서버는 인터 알리아, 웹 서버, CGI 스크립터 서버, 사설 네트워크(인트라넷) 서버, 또는 유선 또는 무선 전화 지원 네트워크의 서버를 포함한다.The handheld scanner 1210 interacts with and captures information from the computer system including the display 1302. Examples of computer systems include desktops, laptops, or portable computers, PDAs, or cellular or other wireless phones. The computer system includes web browser 1304 logic. Web browser 1304 generally communicates over a network with a server. Servers include interalias, web servers, CGI scripter servers, private network (intranet) servers, or servers in wired or wireless telephone support networks.

웹 브라우징 세션은 세션 식별자(세션 ID 1306)에 의해 특정지워진다. 세션 ID(1306)는 브라우저 통신 세션을 유일하게 식별하는 코드이다. 세션 ID(1306)의 예로서는 HTTP 세션 ID 뿐만 아니라 다른 프로토콜 세션 ID이다. 몇 실시예에서는, 웹 브라우저(1304)가 서비스 프로바이더(1308)에 속하는 웹 사이트를 지정하는 URL로부터 웹 페이지를 로드하도록 되어 있을 때, 서비스 프로바이더(1308)는 웹 브라우저(1304)로부터의 요청과 연관된 네트워크 어드레스를 기록하고, 유일한 세션 ID 코트(1306)가 디스플레이되는 웹 페이지를 돌려준다. 서비스 프로바이더(1308)는 유일한 세션 ID 코트(1306) 및 웹 브라우저(1304) 애플리케이션을 제공하는 디바이스의 네트워크 어드레스의 연관을 (예를 들어, 디바이스 연관 데이터베이스(1208)에)기록한다.The web browsing session is specified by the session identifier (session ID 1306). Session ID 1306 is a code that uniquely identifies a browser communication session. Examples of session IDs 1306 are HTTP protocol IDs as well as other protocol session IDs. In some embodiments, when the web browser 1304 is configured to load a web page from a URL that specifies a website belonging to the service provider 1308, the service provider 1308 requests the request from the web browser 1304. Record the network address associated with the < RTI ID = 0.0 > and < / RTI > The service provider 1308 records the association (eg, in the device association database 1208) of the unique session ID coat 1306 and the network address of the device providing the web browser 1304 application.

세션 식별자(1306)는 브라우저(1304)의 사용자에게 디스플레이될 수 있다. 세션 식별자(1306)가 디스플레이될 수 있도록 특정 기능이 웹 브라우저(1304)에 제 공될 수 있다. 휴대 스캐너(1210)는 유일한 스캐너 및/또는 사용자 ID(1222)와 함께 스캔된 유일 세션 ID 코드(1306)를 서비스 프로바이드(1308)에게 스캐너(1210)가 서비스 프로바이더(1308)와 통신하게 해주는 하나 이상의 네트워크 통신 채널을 이용하여 전달한다. 이것은 쿼리 세션을 개시하는 서비스 프로바이더(1308)에의 요청을 포함한다. 이후의 스캔(예를 들어, 이후의 쿼리)에 대한 응답은 세션 ID(1306)에 이전에 연관된 네트워크 어드레스에 있는 웹 브라우저(1304)에 전달되어 진다. 몇 실시예에서, 시스템은, 상기 시스템이 연관된 디바이스(1302)를 통해 쿼리 세션을 개시하도록 하는 목적 및 사용자를 정확하게 식별하였다는 것을 사용자에게 확인해주는, 웹 브라우저(1304)상에 디스플레이될 수 있는 쿼리 세션 개시 요청 긍정응답에 응답한다. 사용자가 쿼리 세션을 종료하면, 예를 들어 "엔드 세션(end session)"아이콘 또는 명령이 연관 디바이스(1302)의 디스플레이로부터 스캔되고, 현재의 세션을 종료하도록 서비스 프로바이더(1308)에 전달될 수 있다. 서비스 프로바이더(1308)는 그 후 웹 브라우저(1304)에 디스플레이를 클리어하도록(세션에서 이전에 디스플레이된 어느 잠재적으로 민감한 정보를 제거하고) 하고, 새로운 커리 세션을 개시하기 위해 스캐되는 새로운 유일 세션 ID코드(1306)를 디스플레이한다. 유사하게, 서비스 프로바이더(1308)에 의해 스캐너(1210)로부터 어떠한 통신이 수신되지 않는 동안의 사전 결정된 시간 간격이후에, 세션은 자동적으로 시간 종료되고, 유사하게 종료될 것이다. Session identifier 1306 may be displayed to a user of browser 1304. Specific functionality may be provided to web browser 1304 such that session identifier 1306 may be displayed. The handheld scanner 1210 causes the service provider 1308 to communicate with the service provider 1308 to the service provider 1308 with a unique session ID code 1306 scanned along with a unique scanner and / or user ID 1222. Deliver using one or more network communication channels. This includes a request to the service provider 1308 to initiate a query session. The response to a later scan (eg, a later query) is passed to the web browser 1304 at the network address previously associated with the session ID 1306. In some embodiments, the system may display a query that may be displayed on a web browser 1304 that assures the user that the system has correctly identified the user and the purpose of initiating a query session via the associated device 1302. Respond to the Session Initiation Request Acknowledgment. When a user ends a query session, for example, an "end session" icon or command may be scanned from the display of the associating device 1302 and passed to the service provider 1308 to end the current session. have. The service provider 1308 then clears the display in the web browser 1304 (removing any potentially sensitive information previously displayed in the session) and scans for a new unique session ID to initiate a new curry session. Display code 1306. Similarly, after a predetermined time interval while no communication is received from the scanner 1210 by the service provider 1308, the session will automatically time out and similarly terminate.

쿼리 세션 개시 요청을 전달한 이후, 휴대 스캐너(1210)는 프린트된 소스로부터 정보를 스캔한다. 스캔된 정보는 텍스트, 바코드, 그림표지, 및/또는 다른 프린트된 소스의 식별자를 포함한다. 스캔된 정보는 제품 이름, 바코드, 회사 이름, 로고, 상표, 또는 제품의 다른 식별자를 포함한다. 스캔된 정보는 노래 제목, 아티스트 이름, 시집 이름, 및/또는 음악 콘텐츠의 다름 식별자를 포함한다. 스캔된 정보는 이미지 이름, 캡션, 헤딩, 및/또는 이미지 콘텐츠의 다른 식별자, 또는 영화 제목, 배우 이름, 제작자 이름, 감독 이름, 스튜디오 이름, 제품 이름, 또는 비디오 콘텐츠의 다른 식별자를 포함한다.After passing the query session initiation request, the handheld scanner 1210 scans the information from the printed source. The scanned information includes text, barcodes, pictograms, and / or identifiers of other printed sources. The scanned information includes the product name, barcode, company name, logo, trademark, or other identifier of the product. The scanned information includes song titles, artist names, poem names, and / or different identifiers of music content. The scanned information may include image names, captions, headings, and / or other identifiers of image content, or movie titles, actor names, producer names, director names, studio names, product names, or other identifiers of video content.

다른 가능한 부가적인 정보와 함께, 적어도 하나의 스캔에 의해 캡쳐된 정보(스캔된 세션 ID(1306)를 포함하여)는 콘텐츠 요청내에 합체될 것이다. 스캔된 정보는 하나 이상의 통신하에서 서비스 프로바이더(1308)에게 전달될 것이다. 서비스 프로바이더(1308)는 세션 ID 코드(1306)를 적어도 부분적으로 콘텐츠를 웹 브라우저(1304)로 다시 인도하는데 적용할 것이다. 이것은 웹 브라우저(1304)가 휴대 스캐너(1210)의 활동의 결과로 전달된 콘텐츠를 수신하도록 할 것이다.Along with other possible additional information, the information captured by the at least one scan (including the scanned session ID 1306) will be incorporated into the content request. The scanned information will be passed to the service provider 1308 under one or more communications. The service provider 1308 will apply the session ID code 1306 to deliver the content back to the web browser 1304 at least partially. This will allow the web browser 1304 to receive the content delivered as a result of the activity of the handheld scanner 1210.

전달된 콘텐츠는 정보가 스캔되는 프린트된 문서의 전자적 버전, 스캔의 정보와 연관된 디비털 음악, 디지털 음석 기록, 오디오 뉴스 또는 논평, 오디오 제품 정보, 또는 다른 기록된 또는 합셩된 사운드, 적어도 하나의 디지털 이미지, 디지털 사진, 제품 이미지 도는 디디오, 뉴스 리포트 또는 만평의 비디오, 또는 다른 디지털 이미지 또는 비디오를 포함한다.The delivered content may include an electronic version of the printed document from which the information is scanned, digital music associated with the information in the scan, digital phonogram, audio news or commentary, audio product information, or other recorded or combined sound, at least one digital Images, digital photographs, product images or videos, news reports or cartoons of video, or other digital images or videos.

도 14 는 스캐너 연관 디바이스에 콘텐츠를 제공하기 위해 시스템에 으해 디바이스 사이에 수행되는 상호작용을 보여주는 작용 흐름도이다. 14 is an operational flow diagram illustrating interactions performed between devices by a system to provide content to a scanner associated device.

상호작용(1402)에서, 웹 브라우저 로직을 포함하는 디스플레이 디바이스는 서비스 프로바이더(예를 들어, 디바이스 연관 및/또는 디바이스 데이터베이스를 포함하는 시스템)에 요청을 전달하여 상기 브라우저와 연관된 네트워크 어드레스와 함께 디바이스 연관 데이터베이스에 기록되는 유일한 세션 ID 코드를 생성하도록 한다. 상호 작용(1404)에서, 유일한 세션 ID가 생성되어 연관된 네트워크 어드레스에 있는 브라우저로 다시 전달된다. 상호 작용(1406)에서, 유일한 세션 ID는 디스플레이된 위치로부터 스캔된다. 상호 작용(1408)에서, 쿼리 세션 개시 요청이 유일한 사용자 및/또는 스캐너 ID 및 유일한 세션 ID 코드를 포함하는 서비스 프로바이더에 전달된다. 서비스 프로바이더는 상호 작용(1408)에서 발행된 요청에 포함되는 있는 유일한 세션 ID 코드를 디바이스 연관 데이터베이스에 기록된 네트워크 어드레스를 식별하기 위해 적용되고, 쿼리 세션 긍정응답은 상호 작용(1410)에서 식별된 네트워크 어드레스에 있는 디바이스로 전달된다. 브라우저는 쿼리 세션 요청 긍정 응답을 스캐너의 사용자에게 디스플레이한다. 서비스 프로바이더는 또한, 디바이스 연관 데이터베이스에서, 유일한 세션 ID 가 지금 스캐너의 사용자에 의해 "소유"되고 있다는, 즉 다른 휴대 스캐닝 디바이스가 이 세션 ID와 연관될 수 없다는 것을 기록한다. 서비스 프로바이더는 유일한 사용자 및/또는 스캐너 ID를 현재 확성화된 세션 ID 코드 및 연관된 네트워크 어드레스와 연관시킨다.In interaction 1402, a display device including web browser logic forwards a request to a service provider (eg, a system that includes a device association and / or device database) to device with the network address associated with the browser. Generate a unique session ID code that is recorded in the association database. At interaction 1404, a unique session ID is generated and passed back to the browser at the associated network address. At interaction 1406, a unique session ID is scanned from the displayed location. At interaction 1408, a query session initiation request is passed to the service provider that includes a unique user and / or scanner ID and a unique session ID code. The service provider applies the unique session ID code that is included in the request issued at interaction 1408 to identify the network address recorded in the device association database, and the query session acknowledgment is identified at interaction 1410. It is delivered to the device at the network address. The browser displays the query session request acknowledgment to the user of the scanner. The service provider also records in the device association database that the unique session ID is now "owned" by the user of the scanner, ie no other portable scanning device can be associated with this session ID. The service provider associates a unique user and / or scanner ID with the currently established session ID code and associated network address.

상호 작용(1412)에서, 스캐너는 스캔된 정보(REQ)를 콘텐츠 검색 기능으로 전달한다. 콘텐츠 검색은 스캔된 정보에 응답하여 제공하여야 할 콘텐츠를 결정한다.In interaction 1412, the scanner passes the scanned information REQ to the content retrieval function. The content search determines the content to be provided in response to the scanned information.

몇 실시예에서, 콘텐츠 타입은 상호 작용(1414)에서 디바이스 데이터베이스 로 전달된다. 콘텐츠 타입은, 적어도 부분적으로, 하나 이상의 디바이스가 스캐너와 현재 활성화되어 연관되어 있는 경우에 어떤 연관 디바이스가 가장 콘텐츠를 랜더링하기에 적당한지를 결정하는 데 사용될 수 있다. 콘텐츠에 대한 적당한 디바이스가 이용가능하지 않다고 식별되면, 적당한 렌더링 디바이스가 이용가능할 때 나중에 그런 콘텐츠가 액세스될 수 있도록, 그런 콘텐츠에 대한 링크 또는 콘테츠 자체는 데이터베이스에 저장되고, 사용자에 대해 미리 결정된 어드레스로 이메일 전송되거나, 또는 달리 유지될 것이다. In some embodiments, the content type is passed to the device database at interaction 1414. The content type may be used, at least in part, to determine which associated device is most suitable for rendering the content if one or more devices are currently active and associated with the scanner. If a suitable device for the content is identified as not available, the link or content itself to that content is stored in a database and a predetermined address for the user so that such content can be accessed later when the appropriate rendering device is available. Will be emailed or otherwise maintained.

상호 작용(1416)에서, 디바이스 데이터베이스는 연관 디바이스 어드레스 또는 네트워크 어드레스를 콘텐츠 검색으로 전달한다. 상호 작용(1418)에서, 콘텐츠 검색은 콘텐츠를 연관 디바이스에 제공한다.At interaction 1416, the device database passes the associated device address or network address to the content search. At interaction 1418, content search provides the content to the associated device.

몇 실시예에서, 상기 시스템은 저장 디바이스를 프린트된 문서의 스캔에 응답하여 상기 시스템에 의해 전달된 전자적 콘텐츠(오디오, 비디오, 디지털 문서 등등)를 저장할 목적으로 사용자의 스캐너에 연관시킨다. 예를 들어, 저장 능력을 가지는 디바이스(하드 드라이버, 기록가능한 DVD, CD-ROM 등등을 가지는 컴퓨터와 같이)을 유일하게 식별하는 식별자를 스캐닝함으로써, 상기 시스템은 프린트된 문서의 스캔(휴대 스캐너로부터 발생하는)에 응답하는 콘텐츠의 미래의 전달이 대응하는 저장 디바이스에 전달되고, 이후의 검색을 위해 아카이브에 수록되도록 그 데이터베이스를 수정할 것이다.In some embodiments, the system associates a storage device with a user's scanner for the purpose of storing electronic content (audio, video, digital documents, etc.) delivered by the system in response to a scan of a printed document. For example, by scanning an identifier that uniquely identifies a device having a storage capability (such as a computer with a hard drive, a recordable DVD, a CD-ROM, etc.), the system can scan a printed document (from a mobile scanner). Future delivery of content responsive to the corresponding storage device will be delivered to the corresponding storage device and modified in its database for later retrieval.

몇 실시예에서, 상기 시스템은 사용자의 위치 및 상기 사용자의 휴대 전자 디바이스와 연관될 수 있는 근처의 디바이스를 결정할 것이다. 상기 시스템은 사 용자의 위치를 휴대 디바이스의 보드 장착 GPS의 방법이나, 무선 신호의 삼각측량, 상기 디바이스를 지원하는 통신 네트워크 송수신기의 위치를 결정하는 방법, 사용자에게 쿼리하는 방법, 또는 다른 적당한 방법으로 결정할 것이다. In some embodiments, the system will determine a user's location and nearby devices that may be associated with the user's portable electronic device. The system may use the user's location as a method of board-mounted GPS in a portable device, triangulation of wireless signals, to determine the location of a communication network transceiver supporting the device, to query a user, or other suitable method. Will decide.

몇 실시예에서, 상기 시스템은 휴대 스캐닝 디바이스로 연동하여 사용되는 I/O 디바이스에 대한 위치 정보를 가지는 디바이스 데이터베이스를 유지한다. 시스템이 I/O 디바이스와 연관을 위한 휴대 스캐너로부터의 요청을 수신하면, 상기 시스템은 휴대 스캐너의 위치를 결정하고, 그 후 상기 디바이스 데이터베이스를 참조함으로써 적당한 후보군을 식별한다. In some embodiments, the system maintains a device database having location information for I / O devices used in conjunction with a portable scanning device. When the system receives a request from the portable scanner for association with an I / O device, the system determines the location of the portable scanner and then identifies the appropriate candidate group by referring to the device database.

몇 실시예에서, 상기 시스템은 사용자가 휴대 스캐너와 디바이스의 연관을 미리 설정할 수 있도록 해준다. 하나의 실시예로서, 사용자는 자신의 집 컴퓨터가 그의 스캐너로부터의 콘텐츠 요청의 수신자로서 지정되기를 원할 것이다. 이것을 하기 위해, 사용자는 서비스 프로바이더의 웹사이트에 액세스하고, 스캔된 쿼리에 대한 응답을 수신하도록 되어 있는 데이터 저장소(예를 들어, 집 컴퓨터) 및 디바이스의 식별자를 수동적으로 입력한다. 대안적으로, 상기 시스템은 수신 디바이스를 자동적으로 식별하도록 명세서 전반에 걸쳐 논의된 다양한 스캐닝 방법을 이용한다.In some embodiments, the system allows a user to preset the association of the device with the portable scanner. In one embodiment, the user will want his home computer to be designated as the recipient of content requests from his scanner. To do this, the user manually enters the identifier of the device and the data store (eg, home computer) that is supposed to access the service provider's website and receive a response to the scanned query. Alternatively, the system uses various scanning methods discussed throughout the specification to automatically identify the receiving device.

몇 실시예에서, 공중 키오스크는 동적 세션 ID를 디스플레이한다. 키오스크는 인터넷 또는 기업 인트라넷과 같은 통신 네트워크에 연결되어 있다. 연결은 케이블 모뎀, 전화 시스템(PSTN, ADSL, DSL, 셀룰러 등), 무선 로컬 영역 네트워크(WLAN, IEEE802.11 등등) 또는 다른 적당한 액세스 방법에 의할 수 있다. 세션 ID는 주기적으로 변하나, 새로운 세션 ID가 새로운 사용자마다에게 디스플레이될 수 있도록 적어도 키오스크가 사용될 때마다 변한다. 키오스크를 사용하기 위해서는, 사용자는 키오스크에 의해 디스플레이된 세션 ID내에서 스캔한다; 세션 ID를 스캔함으로써, 사용자는 상기 시스템에 프린트된 문서의 스캔으로부터 기인하는 콘텐츠의 전달을 위해 그 스캐너와 키오스크를 일시적으로 연관시키기를 원한다는 것을 알려준다. 스캐너는 세션 ID 및 스캐너를 인증하기 위한 정보(시리얼 넘버, 어카운트 넘버, 또는 다른 식별 정보)를 직접적으로 상기 시스템에 전달(셀룰러 단문 메시지 서비스와 같은 무선 통신을 통하거나, 상기 통신 네트워크에 키오스크의 링크를 사용함으로써)한다. 예를 들어, 스캐너는 세션 개시 정보를 키오스크에 전달(블루투스와 같은 근거리 RF를 통해)함으로써 키오스크 통신 링크를 적용할 것이다. 키오스크는 그 후 세션 개시 정보를 인터넷 연결을 통해 서비스 프로바이더의 시스템에 전달한다. 스캐너는 세션 개시 메시지를 사용자의 셀룰러폰(블루투스를 통해 사용자의 스캐너와 쌍으로 되는)을 통해 전달함으로써 서비스 프로바이더의 시스템과 직접적으로(여기에서, "직접적"이라는 의미는 키오스크를 통해 메시지의 전달이 없다는 것이다) 통신한다.In some embodiments, the public kiosk displays a dynamic session ID. Kiosk is connected to a communications network such as the Internet or a corporate intranet. The connection may be by cable modem, telephone system (PSTN, ADSL, DSL, cellular, etc.), wireless local area network (WLAN, IEEE802.11, etc.) or other suitable access method. The session ID changes periodically, but at least each time a kiosk is used so that a new session ID can be displayed to every new user. To use a kiosk, the user scans within the session ID displayed by the kiosk; By scanning the session ID, the user is informed that the system wants to temporarily associate the scanner with the kiosk for delivery of content resulting from the scanning of the printed document. The scanner passes the session ID and information for authenticating the scanner (serial number, account number, or other identifying information) directly to the system (via wireless communication, such as a cellular short message service, or a link of a kiosk to the communication network). By using). For example, the scanner will apply the kiosk communication link by conveying session initiation information to the kiosk (via a near field RF such as Bluetooth). The kiosk then forwards the session initiation information to the service provider's system via an internet connection. The scanner delivers the session initiation message through the user's cellular phone (paired with the user's scanner via Bluetooth) directly to the service provider's system (here, "direct" means delivery of the message through the kiosk). There is no communication).

몇 실시예에서, 시스템은 디바이스가 스캐너와 연과되어 있는 주기(세션)동안 스캐너와 연관된 디바이스를 다른 것이 사용하지 못하도록 한다. 이러한 특질은 다른 것이 이전 세션이 종료하기 전에 공중 키오스클 사용하지 못하도록 하는 데 매우 유용하다. 인터넷 카페에서 컴퓨터의 사용과 관련된 이러한 개념의 한 예로서, 사용자는 키오스크 디스플레이로부터 세션 ID를 스캐닝함으로써(또는 휴대 스캐너상의 키패드 또는 터치스크린을 통해 기입하거나) 세션을 개시하고; 시스템은 그 데이테베이스내에 상기 세션 ID를 다른 스캐너가 세션 ID를 스캔하고 그의 세션동안 키오스크를 사용하지 못하도록 그의 스캐너의 시리얼 넘버(또는 사용자 및/또는 사용자의 스캐너를 유일하게 식별하는 다른 식별자)를 상기 세션 ID와 연관시킨다. 스캐너는 상기 디스플레이와연관된 컴퓨너와 통신(블루투스와같은 무선 링크, 독킹 스테이션과 같은 하드와이어드 링크를 통해)하거나, 셀루라폰과 같은 다른 수단에 의해 서비스 프로바이더 시스템과 직접적(컴퓨터를 통하지 않고) 통신할 수 있다.In some embodiments, the system prevents others from using the device associated with the scanner during the period (session) in which the device is associated with the scanner. This feature is very useful to prevent others from using a public kiosk before the previous session ends. As an example of this concept related to the use of a computer in an internet cafe, a user initiates a session by scanning a session ID from a kiosk display (or filling in via a keypad or touchscreen on a handheld scanner); The system uses the session ID in its database to determine the serial number of his scanner (or other identifier that uniquely identifies the user and / or the user's scanner) to prevent other scanners from scanning the session ID and using the kiosk during his session. Associate with the session ID. The scanner can communicate with the computer associated with the display (via a wireless link such as Bluetooth, a hardwired link such as a docking station), or directly (without a computer) to the service provider system by other means, such as a cell phone. Can be.

몇 실시예에서, 휴대 스캐너의 기능은 연관 디바이스에 따라 변화될 수 있다. 예를 들어, 휴대 스캐너가 OCR 능력을 가진 근처의 컴퓨터와 연관되어 있다면, 스캐너는 스캔된 이미지를 데이터를 컴퓨터로 전달할 것이고, 연관된 컴퓨터가 OCR 능력을 가지지 않았다면, 휴대 스캐너는 스캔된 이미지를 텍스트로 변환하기 위해 온보드 OCR 기능을 적용하고 이 텍스트를 서비스 프로바이더에게 전달할 것이다.In some embodiments, the functionality of the handheld scanner may vary depending on the associated device. For example, if the handheld scanner is associated with a nearby computer that has OCR capability, the scanner will deliver the scanned image data to the computer, and if the associated computer does not have OCR capability, the handheld scanner will translate the scanned image into text. We will apply the onboard OCR feature to the conversion and pass this text to the service provider.

몇 실시예에서, 스캐너는 스캐닝보다는 무선 통신(예를 들어 블루투스 링크)에 의해 컴퓨터로부터 통신 세션 식별자를 획득한다. 예를 들어, 휴대 스캐너가 컴퓨터와 블루투스 연결을 만든 후에, 사용자가 휴대 스캐너로 스캔하도록 컴퓨터 디스플레이상에 그것을 디스플레이하기 보다는, 컴퓨터는 블루투스 연결을 통신 세션 식별자를 스캐너에게 전달하기 위해 사용한다. In some embodiments, the scanner obtains a communication session identifier from the computer by wireless communication (eg, a Bluetooth link) rather than scanning. For example, after the handheld scanner makes a Bluetooth connection with the computer, rather than displaying it on the computer display for the user to scan with the handheld scanner, the computer uses the Bluetooth connection to convey the communication session identifier to the scanner.

몇 실시예에서, 시스템은 휴대 전자 디바이스보다 더 나은 비디오 또는 오디 오 능력을 가지는 다른 디바이스를 연관시킴으로써 휴대 전자 디바이스에 대한 사용자 인터페이스를 향상시킨다. 예를 들어, 공항에서 비행기를 기다리는 가입자는 텔레비젼 가이드를 브라우저하고 그가 보기를 원하는 쇼를 알게 된다. 서비스 프로바이더의 웹사이트를 브라우저하도록 그의 컴퓨터상의 웹 브라우저를 사용하면, 가입자는 그의 랩탑 컴퓨너로 전달되는 통신 세션 식별자를 얻을 수 있다. 통신 세션 식별자 및 텔레비젼 가이드로부터 쇼를 식별하는 정보를 스캐닝하면, 가입자는 랩탑 컴퓨터를 그가 가지기를 원하는 비디오 콘텐츠(텔레비젼 쇼)가 전달되어야 할 위치로 식별시킨다. 시스템은 그것을 랩탑 컴퓨터로 전송하기 전에 가입자가 상기 콘텐츠를 액세스할 적당한 권한(에를 들어, 그가 '케이블 텔레비젼' 서비스 계약을 가지는자; 광대역 인터넷 액세스가 상기 비디오 전달을 위해 필요하다면 그가 인터넷 서비스 프로바이더와 광대역 서비스 계약을 가지는 지 등)을 가지는 지를 체크할 것이다.In some embodiments, the system enhances the user interface for the portable electronic device by associating another device with better video or audio capabilities than the portable electronic device. For example, a subscriber waiting for a flight at an airport browsers a television guide and finds out which show he wants to watch. Using a web browser on his computer to browser the service provider's website, the subscriber can obtain a communication session identifier that is passed to his laptop computer. Scanning the information identifying the show from the communication session identifier and the television guide, the subscriber identifies the laptop computer as the location where the video content (TV show) he wants to have is delivered. The system is responsible for providing the subscriber with appropriate rights to access the content (e.g., he has a 'cable TV' service contract; if broadband internet access is required for the video delivery before he transfers it to the laptop computer; Check whether you have a broadband service contract, etc.).

퍼스널 컴퓨터Personal computer

몇 실시예에서, 휴대 문서 데이터 갭쳐 디바이스는 퍼스널 컴퓨터의 동작을 제어한다. 휴대 디바이스는 PC가 소프트웨어 애플리케이션을 실행하거나 및/또는 다른 동작을 실행하도록 하는 데이터 및 명령을 PC로 전달한다. 예를 들어, 컴퓨터와 LCD 프로젝터로 파워포인트 프리젠테이션을 하고자 할 때, 사용자는 파워포인트 슬라이드의 문서 카피를 스캐닝함으로써 컴퓨터의 동작을 제어할 수 있다. 사용자는 슬라이드로부터 정보를 스캔하여 컴퓨터가 상기 슬라이드를 진행시키도록 한다. 휴대 디바이스는 랜더링된 문서로부터 워드 프로세싱 소프트웨어, 웹 브라 우저, 및 다른 소프트웨어 애플리케이션을 제어하도록 이용되어 질 수 있다. 사용자는 휴대 디바이스로 퍼스널 컴퓨터를 제어함으로써 전자 문서를 편집하고, 인터넷을 통해 물건 구입을 하고, 메시지를 전달할 수 있다. In some embodiments, the portable document data gap device controls the operation of the personal computer. The portable device delivers data and instructions to the PC that cause the PC to run a software application and / or perform other operations. For example, when making a PowerPoint presentation with a computer and an LCD projector, the user can control the operation of the computer by scanning a copy of the document of the PowerPoint slide. The user scans the information from the slides and causes the computer to advance the slides. The portable device can be used to control word processing software, web browsers, and other software applications from the rendered document. The user can edit the electronic document, make a purchase through the Internet, and deliver a message by controlling the personal computer with the portable device.

편집edit

몇 실시예에서, 휴대 데이터 캡쳐 디바이스는 호스트 컴퓨터에 대한 데이터 입력 디바이스로 작용할 수 있다. 휴대 디바이스 및 호스트 컴퓨터는 워드 프로세싱 소프트웨어와 함께 작동하여, 강력한 문서 편집 시스템을 구성한다.In some embodiments, the portable data capture device can act as a data input device for the host computer. The portable device and host computer work with word processing software to form a powerful document editing system.

문서 편집 시스템은 컴퓨터의 워드 프로세싱 애플리케이션에서 문서에 대한 편집 명령으로서 프린트된 표면상에 사용자의 움직임을 반영하고 및/또는 해석한다. 휴대 디바이스의 사용에 의해, 사용자는 워드 프로세싱 소프트웨어가 북마크, 강조/밑줄/볼드체/이탤릭체 텍스트, 자름, 복사, 붙여넣기, 검색, 저장 및 인쇄와 같은 다양한 기능을 수행하도록 한다. The document editing system reflects and / or interprets the user's movements on the printed surface as editing instructions for the document in a word processing application on the computer. By the use of a portable device, a user allows word processing software to perform various functions such as bookmarks, highlighting / underlined / bold / italic text, cutting, copying, pasting, searching, saving and printing.

몇 실시예에서, 휴대 디바이스상의 강조 지시자의 색깔은 디지털 카피에서 나타날 강조의 색깔을 지시한다. 몇 실시예에서, 색깔있는 광은 사용자에게 디지털 카피에서 나타나는 강조의 색깔, 캡쳐 디바이스의 상태 등등을 지시하기 위해 종이로 반사될 수 있다. In some embodiments, the color of the highlight indicator on the portable device indicates the color of the highlight that will appear in the digital copy. In some embodiments, the colored light may be reflected off the paper to indicate to the user the color of the accent that appears in the digital copy, the state of the capture device, and so forth.

VCRVCR

몇 실시예에서, 휴대 데이터 갭쳐 디바이스는 비디오 기록 디바이스를 제어할 수 있다. 예를 들어, 텔레비젼 가이드로부터의 데이터를 캡쳐함으로써, 휴대 디바이스는 비디오 기록 디바이스가 소정의 텔레비젼 프로그램을 기록하도록 프로 그램하는 명령을 전달할 수 있다. 몇 실시예에서, 휴대 디바이스는 적외선(IR) 통신에 의해 비디오 기록 디바이스에 명령을 전달할 수 있다.In some embodiments, the portable data gap device can control the video recording device. For example, by capturing data from a television guide, the portable device can deliver a command to program the video recording device to record a given television program. In some embodiments, the portable device can communicate commands to the video recording device by infrared (IR) communication.

상태 지시자Status indicator

휴대 데이터 캡쳐 디바이스의 사용자 인터페이스는 사용자에게 디바이스의 현재 상태에 대해 알려줄 수 있다. 디바이스는 시각적, 청각적, 촉감적 지시자에 의해 사용자에게 경보를 발할 수 있다. 몇몇의 더 유용한 사용자 인터페이스 상태 지시가가 아래에 기술될 것이지만, 그러나 이것이 가능한 모든 리스트는 아니다.The user interface of the portable data capture device can inform the user about the current state of the device. The device may alert the user by visual, audio, and tactile indicators. Some more useful user interface status indicators will be described below, but this is not all possible listings.

충분한 스캔 지시자Enough scan indicator

몇 실시예에서, 휴대 캡쳐 디바이스는 사용자에게 문서를 식별하기에 충분한 정보가 캡쳐되었다는 것을 지시한다. 예를 들어, 휴대 스캐너는 특정 스캔이 유일하게 문서를 식별하였다는 것을 지시하는 소정의 임계값을 저장할 수 있다. 임계값이 충족되거나 초과하는 경우, 휴대 스캐너는 사용자에게 사용자 인터페이스를 통해 충분한 정보가 상기 문서를 식별하기 위해 스캔되었다는 것을 지시한다. 이러한 소정의 임계값은 발견적 방법(즉, 비형식적인 방법), 통계 분석, 또는 다른 적당한 방법에 기초하여 결정될 수 있다.In some embodiments, the portable capture device instructs the user that sufficient information has been captured to identify the document. For example, the handheld scanner may store a predetermined threshold indicating that a particular scan uniquely identified a document. If the threshold is met or exceeded, the handheld scanner instructs the user that sufficient information has been scanned to identify the document via the user interface. This predetermined threshold may be determined based on a heuristic (ie, informal method), statistical analysis, or other suitable method.

휴대 데이터 캡쳐 디바이스는 충분한 정보가 사용자 인터페이스의 시각적, 청각적, 촉감적 능력에 의해 스캔되었다는 것을 사용자에게 지시한다. 스캔된 정보가 소정의 임계값을 충족하거나 초과한다고 결정되면, 디바이스의 프로세서는 사용자 인터페이스에 정보가 스캔되는 문서를 식별하기에 충분한 정보가 스캔되었다는 것을 사용자에게 전달하도록 지시할 것이다.The portable data capture device instructs the user that sufficient information has been scanned by the visual, audio and tactile capabilities of the user interface. If it is determined that the scanned information meets or exceeds a predetermined threshold, the processor of the device will instruct the user interface to inform the user that enough information has been scanned to identify the document whose information is being scanned.

몇 실시예에서, 충분한 스캔 지시자가 스캔의 "충분성"의 다양한 레벨의 확신을 지시한다. 예를 들어, 빨간 광은 충분치 않은 텍스트가 캡쳐되었다는 것을 지시하고, 노란 광은 50%정도의 충분성을 가진 텍스트가 캡쳐되었다는 것을 지시하고, 녹색 광은 거의 완전히 충분한 텍스트가 캡쳐되었다는 것을 지시할 수 있다.In some embodiments, sufficient scan indicators indicate various levels of confidence in the "sufficiency" of the scan. For example, red light can indicate that insufficient text has been captured, yellow light can indicate that text with as much as 50% of fullness has been captured, and green light can indicate that almost enough text has been captured. have.

충분성 결정 방법How to Determine Sufficiency

몇 실시예에서, 상기 시스템은 충분성 임계값을 결정하기 위해, 쓰여진 표현의 독특한 캐릭터의 관찰에 기초한 발견적 방법을 이용한다. 대부분의 문서는 10 단어(20-50 정도의 캐릭터나 기호)보다 적은 스캔으로 유일하게 식별될 수 있다. 이러한 발견적 방법은 테스트되는 모든 언어에 적용할 수 있다. 4-10 단어 범위의 스캔이 복제된 문서의 결과로 나타나면, 사용자는 더 좁은 결과를 위하여 추가의 단어를 스캔하도록 프롬프트될 수 있다. In some embodiments, the system uses a heuristic method based on observation of the unique character of the written representation to determine the sufficiency threshold. Most documents can be uniquely identified with fewer than 10 words (20-50 characters or symbols). This heuristic can be applied to any language tested. If a scan of the 4-10 word range results in a duplicated document, the user may be prompted to scan additional words for narrower results.

휴대 디바이스에서 로직을 프로세싱하는 것은 스캔이 소스 문서를 유일하게 식별할 수 있는지 없는지를 결정할 수 있다. 몇 실시예에서, 충분성 임계값이 이전의 스캔의 관찰에 기초한 매개변수화된 비형식적인 방법이다. 예를 들어, 스캐너는 8 단어가 유일하도록(여기서, "단어"는 뛰움사이의 일련의 캐릭터이다) 프로그램될 수 있다. 대안적으로, 스캐너는 캡쳐된 텍스트가 모두 각각 3개의 캐릭터보다 긴 적어도 6 단어를 가지기를 요구하는 충분성 임계값으로 프로그램될 수 있다. 다른 접근방법은 어떤 물리적인 거리(예를 들어, 텍스트의 4 인치의 스캔은 표준 폭 페이지의 하나의 라인의 절반이상이 될 것이다)의 스캔이후에 스캔이 유일하다고 결정하는 것이다. 다른 접근 방법은 단어보다는 스캔되는 캐릭터에 기초하 여(예를 들어, 40 캐릭터 이후에 스캔이 유일하다) 임계값을 설정하는 것이다. 다른 대안으로, 스캔 충분성은 스캔된 텍스터를 검색 엔진으로 송부하고, 검색 결과를 수신함으로써 결정될 수 있다. 검색 엔진이 유일한 매치를 전송하면, 스캔은 충분하다. 스캔이 유일하다고 결정하는 다른 방법은 2차원의 바코드와 같은 임베이드된 데이타가 발견되면, 유일하게 식별되었다는 정보를 전달하도록 디자인하는 것이다. Processing logic in the portable device can determine whether the scan can uniquely identify the source document. In some embodiments, the sufficiency threshold is a parameterized informal method based on observations of previous scans. For example, the scanner can be programmed so that 8 words are unique (where "word" is a series of characters between runs). Alternatively, the scanner may be programmed with a sufficiency threshold that requires that the captured text all have at least six words longer than three characters each. Another approach is to determine that the scan is unique after a scan of some physical distance (e.g. a 4 inch scan of text will be more than half of one line of a standard width page). Another approach is to set thresholds based on the character being scanned rather than words (eg, scanning is only unique after 40 characters). Alternatively, scan sufficiency may be determined by sending the scanned text to a search engine and receiving the search results. If the search engine sends a unique match, the scan is enough. Another way to determine that scans are unique is to design to convey information that is uniquely identified when embedded data such as two-dimensional barcodes are found.

콘텍스트는 문서나 문서내의 특정 위치를 식별하기 위해 필요한 캡쳐되는 정보의 양에 영향을 미친다. 시스템이 특정 스캔에 대해 더 많은 콘텍스트를 안다면, 유일성을 위해 더 짧은 구문이 요구된다. 공지의 문서내에서, 상기 시스템은 문서내에서 유일하다고 하기 위해 필요한 것을 지시하는 서브인덱스를 계산한다. 즉, 시스템이 문서내의 모든 텍스트를 알고 있기 때문에, 시스템은 캐릭터 및/또는 단어의 조합이 모호한지 아닌지를 결정할 수 있다. 특정 문서에서 명화하기 위해서 얼마나 많은 텍스트가 캡쳐되어야 하는 지는 문서내의 전체 캐릭터 수, 단어 길이, 문서내에서 단어가 몇 번 사용되었는지의 함수이다. 수학적으로 표시하자면, 이러한 관계의 일 예는: 충분성=f(캐릭터의 전체수, 반복되는 캐릭터의 수, 단어 길이)이다. The context affects the amount of information captured to identify the document or a specific location within the document. If the system knows more context for a particular scan, shorter syntax is required for uniqueness. In known documents, the system calculates a subindex indicating what is needed to be unique in the document. That is, because the system knows all the text in the document, the system can determine whether the combination of characters and / or words is ambiguous. How much text should be captured to clarify in a particular document is a function of the total number of characters in the document, word length, and how many times the word is used in the document. Mathematically, one example of such a relationship is: sufficiency = f (total number of characters, number of repeated characters, word length).

임계값 방법은 휴대 데이터 캡쳐 디바이스가 캡쳐된 텍스트가 유일한지 아닌지를 실시간으로 사용자에게 지시할 수 있도록 해준다.The threshold method allows the portable data capture device to instruct the user in real time whether the captured text is unique or not.

검색을 검색엔진에 전달하는 방법을 사용할 때, 유일성은 하나 또는 제로 히트-즉, 다른 인덱스된 문서가 검색 쿼리에 매칭하는 콘텐츠를 가지고 있지 않다, 에 의해 결정된다. When using a method of passing a search to a search engine, uniqueness is determined by one or zero hits, that is, the other indexed documents do not have content that matches the search query.

스캔의 충분성을 결정하기 위해 단어 카운트 임계값을 이용하는 경우, 스캔된 단어의 길이는 문서 또는 영역을 충분히 식별하기 위해 필요한 단어의 수에 영향을 미친다. 긴 단어는 일반적으로 짧은 단어보다 더 많은 명확성을 가진다. 따라서, 단어 "amalgamation"은 단어 "the" 보다 더 명확한 값을 가니다. 단어 길이는 OCR 이전에 있어서도, 캐릭터 스트링에 흰 여백이 있는지를 관찰함으로써 결정될 수 있다. 중간에 흰 여백을 가지지 않은 많은 수의 캐릭터는 높은 명확한 값을 가지는 긴 단어임을 지시한다. 적은 수의 캐릭터가 많은 수의 흰 여백에 의해 분리되어 있다는 것은 더 낮은 명확한 값을 가지는 짧은 단어를 의미한다. When using the word count threshold to determine the sufficiency of the scan, the length of the scanned word affects the number of words needed to fully identify the document or area. Long words generally have more clarity than short words. Thus, the word "amalgamation" has a clearer value than the word "the". The word length can be determined by observing whether there is a white margin in the character string, even before OCR. A large number of characters with no white space in the middle indicate long words with high clear values. A small number of characters separated by a large number of white spaces means a short word with a lower clear value.

폰트 크기, 색채 및 폰트 타입에 관한 정보는 명확화에 매우 유용하다. 이러한 특질을 안다는 것은 문서 또는 영역을 식별하기에 필요한 텍스트의 양을 줄일 수 있다. Information about font size, color and font type is very useful for clarity. Knowing these features can reduce the amount of text needed to identify a document or area.

매입된 제어 데이타 지시자Embedded control data indicator

몇 실시예에서, 휴대 데이타 캡쳐 디바이스는 사용자가 문서내에 매입된 제어 데이타를 접하는 때에 사용자에게 경고한다. 예를 들어, 알려진 키워드를 접할 때 사용자에게 경고할 수 있다. 다른 예로서, 휴대 디바이스는 사용자에게 마크업 레이어로 문서내에 설정된 활성 영역으로 경고할 수 있다. 또 다른 예로서, 디바이스는 비가시적인 특질을 가진 잉크(예를 들어, UV/IR 잉크)로 매입된 제어 데이터나 2차원 바코드를 접할 때 사용자에게 경고할 수 있다.In some embodiments, the portable data capture device alerts the user when the user encounters control data embedded in the document. For example, you can warn users when they encounter a known keyword. As another example, the portable device may alert the user to an active area set in the document with a markup layer. As another example, the device may warn the user when encountering control data or two-dimensional barcodes embedded in ink having invisible properties (eg, UV / IR inks).

콘텍스트 지시자Context indicator

몇 실시예에서, 휴대 디바이스는 사용자에게 디바이스가 현재의 콘텍스트를 인식하거나 인식하지 않는지를(예를 들어, 디바이스가 사용자가 현재 작업하고 있는 문서나 알려진 문서내의 영역의 아이덴터티를 알고 있는지 등) 지시한다. 콘텍스트 "락"은 피-코머스(p-commerce) 애플리케이션에 특히 유용하다. 예를 들어, 콘텍스트 지시지가 시스템이 시스템의 카탈로그로부터 아이템을 스캐닝하고 있다는 것을 알고 있다는 것을 사용자에게 알려준다. 따라서, 정확한 아이템이 정확한 벤더로부터 구매되었다는 것을 확신한다. 몇 실시예에서, 콘텍스트 지시자는 렌더링된 문서의 이름이나 다른 식별 정보를 디스플레이한다.In some embodiments, the portable device indicates to the user whether the device recognizes or does not recognize the current context (eg, if the device knows the document the user is currently working on or the identity of an area within a known document, etc.). . The context "lock" is particularly useful for p-commerce applications. For example, the context indicator informs the user that the system knows that it is scanning an item from the system's catalog. Thus, it is assured that the correct item was purchased from the correct vendor. In some embodiments, the context indicator displays the name or other identifying information of the rendered document.

온라인/오프라인 지시자Online / offline indicator

몇 실시예에서, 휴대 디바이스는 사용자에게 온라인 또는 오프라인 모드에서 작동하는 지를 경고한다. 디바이스가 온라인이면, 호스트 컴퓨터나 서비스 프로바이더의 네트워크로의 활성화된 연결을 가진다. 디바이스가 오프라인이면, 다른 시스템 디바이스와 통신하고 있지 않다.In some embodiments, the portable device alerts the user whether it is operating in online or offline mode. If the device is online, it has an active connection to the network of the host computer or service provider. If the device is offline, it is not communicating with another system device.

데이타 캡쳐 지시자Data capture indicator

몇 실시예에서, 디바이스는 사용자에게 데이타를 캡쳐링하고 있거나 그렇지 않다면 정확히 작동하고 있다는 것을 지시한다.In some embodiments, the device indicates to the user that it is capturing data or is otherwise operating correctly.

에러 지시자Error indicator

몇 실시예에서, 디바이스는 사용자에게 에러를 경고한다. 예를 들어, 디바이스는 사용자에게 종이 문서가 식별되었으나 사용자가 상기 종이 문서의 전자적 대응물에 액세스할 권한이 없다는 것을 경고하기 위해 비프음을 발할 수 있다. 다 른 예로서, 디바이스는 사용자에게 마지막 스캔이 반복되어야 한다는 것; 호스트 컴퓨터나 서비스 프로바이더 네트워크에 액세스가 거부되었다는 것; 문서 전달이 일어나자 않았다(예를들어, 사용자의 라이버러리 아카이브가 문서를 수신/억셉트하지 않았다)는 것; 휴대 디바이스가 메모리 초과; 배터리 방전 등을 경고할 수 있다.In some embodiments, the device warns the user of the error. For example, the device may emit a beep to warn the user that a paper document has been identified but the user is not authorized to access the electronic counterpart of the paper document. As another example, the device may indicate to the user that the last scan should be repeated; Access is denied to the host computer or service provider network; Document delivery did not occur (eg, the user's library archive did not receive / accept documents); The portable device is out of memory; The battery can be discharged.

강조 색깔 지시자Highlighting color indicator

몇 실시예에서, 휴대 디바이스는 사용자에게 강조 기능이 어떤 색깔로 나타나는지를 보여줄 수 있다. 몇 실시예에서, 호스트 컴퓨터는 디스플레이상에서 현재 강조 모드의 색깔을 보여준다(예를 들어, 노랑은 워드 프로세싱 소크트웨어가 노랑으로 강조되고 있다는 것을 의미한다).In some embodiments, the portable device can show the user what color the highlighting feature appears. In some embodiments, the host computer shows the color of the current highlight mode on the display (eg, yellow means that the word processing software is highlighted in yellow).

보안/프라이버시Security / Privacy

몇 실시예에서, 휴대 데이타 캡쳐 디바이스는 인증받지 않은 사용자가 디바이스를 사용하지 못하고, 데이터 전송이 기밀적이고, 사용자의 아이덴터티가 상업 거래를 위해 검증되어야 한다는 보안 및 프라이버시 과정을 가진다.In some embodiments, the portable data capture device has a security and privacy process in which an unauthorized user cannot use the device, data transmission is confidential, and the user's identity must be verified for commercial transactions.

몇 실시예에서, 휴대 디바이스는 사용자 데이터의 프라이버시 및 보안을 확보하기 위해 암호화 과정을 이용한다. 디바이스의 메모리에 저장된 데이타 및 다른 디바이스로 전송된 데이타는 암호화될 수 있다. 부가적으로, 사용자는 디바이스 프로파일을 다른 디바이스와 공유된 정보의 양과 종류를 제한하도록 설정할 수 있다. 몇 실시예에서, 시스템은 모든 검색 결과가 휴대 디바이스로 재전송되고, 시스템이나 서비스 프로바이더 네트워크에 저장되지 않도록 사용자가 지정할 수 있 도록 해준다. In some embodiments, the portable device uses an encryption process to ensure the privacy and security of the user data. Data stored in the memory of the device and data sent to other devices can be encrypted. Additionally, the user can set the device profile to limit the amount and type of information shared with other devices. In some embodiments, the system allows the user to specify that all search results are re-sent to the portable device and not stored in the system or service provider network.

몇 실시예에서, 스캐너는 컴퓨터, PDA, 또는 휴대폰과 같은 호스트 머신과 쌍으로 형성된다. 시스템은 휴대 데이터 캡쳐 디바이스를 호스트 머신의 식별자(예를 들어 시리얼 번호 등)를 휴대 디바이스의 메모리의 소정의 위치에 프로그래밍함으로써 특정 호스트 머신과만 작동하도록 락킹할 수 있다. 다른 디바이스와 통신하기 전에, 휴대 디바이스는 어떤 머신이 그의 할당된 호스트인지를 알아보기 위해 소정의 메모리 영역을 체크한다. 스캐너를 다른 디바이스와 사용하고자 한다면, 시스템(또는 스캐너 자체)은 새로운 통신 쌍이 작동하기 전에 사용자가 그의 아이덴터티를 검증/인증하도록 요구할 것이다. In some embodiments, the scanner is paired with a host machine such as a computer, PDA, or cell phone. The system can lock the portable data capture device to work only with a particular host machine by programming the host machine's identifier (eg, serial number, etc.) to a predetermined location in the portable device's memory. Before communicating with another device, the portable device checks a certain memory area to see which machine is its assigned host. If the scanner is to be used with another device, the system (or the scanner itself) will require the user to verify / authenticate his identity before the new communication pair can work.

생체측정 사용Use biometrics

몇 실시예에서, 휴대 데이터 캡쳐 디바이스 및 그의 연관된 시스템은 보안 및 프라이버시를 위해 생체 측정을 사용한다. 예를 들어, 사용자는 휴대 디바이스에 지문을 스캐닝함으로써 그의 아이덴터티를 검증할 수 있다. 다른 예로서, 몇 실시예에서, 디바이스는 생체 정보를 프라이버시를 위한 데이터 암호화에 사용할 수 있다. 예를 들어 타원 곡선 암호화를 위해 지문 스캔을 이용한다. 몇 실시예에서, 휴대 디바이스는 텍스트 및 생체측정을 스캐닝하기 위해 동일한 광 경로를 이용한다.In some embodiments, the portable data capture device and its associated system use biometrics for security and privacy. For example, a user can verify his or her identity by scanning a fingerprint on the portable device. As another example, in some embodiments, the device may use biometric information for data encryption for privacy. For example, fingerprint scans are used for elliptic curve encryption. In some embodiments, the portable device uses the same light path to scan text and biometrics.

온라인/오프라인 거동Online / offline behavior

몇 실시예에서, 휴대 문서 데이터 캡쳐 디바이스는 디바이스가 온라인인지 오프라인인지에 따라 다른 거동을 보인다. 디바이스가 호스트 컴퓨터, 통신 네트 워크, 데이터 캡쳐 서비스 프로바이더 네트워크와 같은 다른 디바이스와 통신하고 있지 않으면 오프라인이다. 스캔너 서비스 프로바이더 네트워크로서 참조되는 데이터 캡쳐 서비스 프로바이더 네트워크는 라이프 라이버러리 아카이브 프로바이더와 같은, 휴대 문서 데이타 갭쳐 디바이스의 애플리케이션을 지원하는 서비스 프로바이더이다. In some embodiments, the portable document data capture device exhibits different behavior depending on whether the device is online or offline. Offline if the device is not communicating with other devices, such as a host computer, communication network, or data capture service provider network. A data capture service provider network, referred to as a scanner service provider network, is a service provider that supports applications of portable document data gap device, such as a life library archive provider.

몇 실시예에서, 휴대 디바이스는 오프라인 상태인 경우에도 작동할 수 잇다. 사용자는 여전히 렌더링된 문서로부터 데이터를 스캔하고, 음성 주석을 달고, 문서를 검색하고, p-commerce 거래를 개시할 수 있다. 이러한 기능의 일부(거래, 주석, 및 검색)은 네트워크 연결이 복구될 때까지 완료되지 않을 것이다.In some embodiments, the portable device can operate even when offline. The user can still scan data from the rendered document, annotate voice, retrieve the document, and initiate a p-commerce transaction. Some of these features (transactions, annotations, and searches) will not complete until network connectivity is restored.

오프라인 거동의 하나의 형태는 문서가 전자적 형태로 현재 이용가능하지 않을 때 발생한다. 따라서, 문서로부터 캡쳐된 데이터에 기초한 검색은 매치없음으로 재전송되어 올 것이다. 이것이 발생하면, 시스템은 검색 쿼리를 저장할 수 있고, 미래의 일정 시점에 문서가 이용가능할 때가지 그것을 주기적으로 다시 제출할 수 있다. 시스템은 또한 전자적 대응물이 현재 이용가능하지 않다는 것을 사용자에게 알려줄 것이다.One form of offline behavior occurs when a document is not currently available in electronic form. Thus, searches based on data captured from the document will be resent with no match. If this happens, the system can save the search query and periodically resubmit it until the document is available at some point in the future. The system will also inform the user that the electronic counterpart is not currently available.

몇 실시예에서, 휴대 데이터 캡쳐 디바이스는 캡쳐된 원래 데이터(이미지 또는 음성)를 다음의 검색을 위해 메모리에 유지한다. 이러한 능력은 시스템이 다음의 프로세싱을 위해 "캡쳐된 대로의" 데이터를 복원할 수 있도록 해준다. 예를 들어, 사용자가 휴대 스캐너로 텍스트를 스캔할 때, 스캔된 이미지는 메모리로 저장되고, OCR 프로세스가 상기 스캔된 이미지에 수행된다. 이미지가 OCR 프로세스에 의해 인식되지 않는다면, 원래 이미지 데이터는 호스트 컴퓨터나 서비스 프로바이더에 다음 프로세싱을 위해 전달될 수 있다. 몇 실시예에서, 스캔된 이미지 데이타는 새로운 데이타로 겹쳐 쓰여지기전까지 메모리에 유지된다. 예를 들어, 디바이스는 원래 이미지 및 프로세싱된 이미지(예를 들어, OCR된 텍스트)가 메모리가 찰될까지 저장할 수 있다. 몇 실시예에서, 디바이스는 프로세싱된 이미지를 저장만하고, 프로세싱된 이미지보다 더 많은 메모리 공간을 사용하는 원래 이미지를 겹쳐 쓰도록 한다. In some embodiments, the portable data capture device maintains the captured original data (image or voice) in memory for subsequent retrieval. This capability allows the system to restore data "as captured" for further processing. For example, when a user scans text with a handheld scanner, the scanned image is stored in memory and an OCR process is performed on the scanned image. If the image is not recognized by the OCR process, the original image data can be delivered to the host computer or service provider for further processing. In some embodiments, the scanned image data is held in memory until overwritten with new data. For example, the device may store the original image and the processed image (eg, OCR text) until the memory is full. In some embodiments, the device only saves the processed image and allows to overwrite the original image using more memory space than the processed image.

휴대 디바이스는 오프라인 모드인 경우에 국부 캐시된 데이터를 액세스할 수 있다. 또한, 몇 실시예에서, 휴대 데이터 캡쳐 디바이스가 호스트 컴퓨터 및/또는 네트워크로의 연결이 가능할 때를 감지하고, 자동적으로 이에 따라 거동을 변경한다. 예를 들어, 온라인/오프라인 감지를 가진 휴대 디바이스는 연결이 안되는 경우 자동적으로 캡쳐된 데이터를 캐싱하는 것을 시작할 수 있다. The portable device can access the locally cached data when in the offline mode. Further, in some embodiments, the portable data capture device detects when a connection to the host computer and / or network is possible and automatically changes its behavior accordingly. For example, a portable device with online / offline sensing can begin caching captured data automatically when disconnected.

국부 캐싱Local caching

사용자가 필요할 것 같은 정보를 국부적으로 캐싱함으로써, 시스템은 대기 시간을 줄이고, 네트워크 대역폭을 보존할 수 있다. 국부적으로 캐시된 검색 인덱스, 키워드 라이버러리, 마크업 정보, 및 폰트 라이버러리는 사용자 경험이나 네트워크 작동을 향상시킨다. 폰트 라이버러리의 국부 캐싱은 휴대 디바이스가 오프라인 모드인 상태인 때에도, 템플렛 기반 OCR을 수행할 수 있도록 해준다.By locally caching information that a user may need, the system can reduce latency and conserve network bandwidth. Locally cached search indexes, keyword libraries, markup information, and font libraries improve user experience or network operation. Local caching of the font library enables template-based OCR even when the portable device is in offline mode.

몇 실시예에서, 문서 데이터 캡쳐 디바이스는 네트워크 트랙픽을 감소하기 위해 최근 스캔의 결과를 국부적으로 캐시할 것이다. 왜냐하면, 네트워크 트래픽 의 거의 50%는 동일한 재료, 특허 최근에 발표된 재료에 대한 히트를 반복할 것이기 때문이다. In some embodiments, the document data capture device will locally cache the results of recent scans to reduce network traffic. This is because nearly 50% of network traffic will repeat hits on the same material, a recently published patent.

사용자의 라이프 라이버러리는 휴대 디바이스와 연관된 호스트 컴퓨터내에 캐시될 수 있다. 사용자의 라이프 라이버러리내의 문서를 나타내는 토큰은 휴대 디바이스내에 국부적으로 캐시될 수 있다. 사용자의 라이프 라이버러리의 국부적인 캐시는 사용자가 오프라인 모드에서도 그의 라이프 라이버러리를 액세스할 수 있도록 해준다.The user's life library can be cached in a host computer associated with the portable device. Tokens representing documents in the user's life library may be cached locally within the portable device. The user's local library's local cache allows the user to access his life library even in offline mode.

몇 실시예에서, 시스템은 휴대 디바이스상에 종이 문서를 충분히 식별하고, 전자적인 대응물을 위치시키도록 스캔에 대해 요구되는 텍스트의 양을 표시를 캐시한다. 이런 국부 캐시는 보통의 텍스트가 가지는 것보다 더 적은 명확화 값을 가지는 진부한 표현 및 일반적인 표현의 리스트를 포함한다. 캡쳐된 데이터가 일반적인 표현이나 진부한 표현을 포함한다면, 최소한의 명확화 임계값은 증가하고, 추가의 텍스트가 문서를 충분히 식별하도록 요구된다. 따라서, 일반적인 문구는 문서를 식별하기 위해 스캔되어야 할 텍스트의 양을 증가시킨다. 이러한 일반적인 문구를 국부적으로 캐시함으로써, 휴대 디바이스는 사용자에게 문서를 식별하기에 충분한 텍스트가 캡쳐되어야 한다는 것을 지시할 수 있는 능력을 향상시킨다.In some embodiments, the system caches an indication of the amount of text required for the scan to sufficiently identify the paper document on the portable device and locate the electronic counterpart. This local cache contains a list of conventional and general expressions with less disambiguation than normal text has. If the captured data contains a generic representation or a banal representation, the minimum disambiguation threshold is increased and additional text is required to fully identify the document. Thus, generic phrases increase the amount of text that must be scanned to identify the document. By locally caching this generic phrase, the portable device improves the ability to instruct the user that enough text should be captured to identify the document.

시스템에 의해 인덱스되도록 알려진 문서(예를 들어, 신문, 잡지 등)의 리스트를 국부적으로 캐시함으로써, 스캐너는 오프라인 모드에서도 콘텍스틀 알고 있다고 지시할 수 있다. By locally caching a list of documents (eg, newspapers, magazines, etc.) known to be indexed by the system, the scanner can indicate that the context is known even in offline mode.

몇 실시예에서, 휴대 디바이스가 국부적으로 캐시되지 않은 폰트를 접했을 때, 그들의 호스트 컴퓨터나 서비스 프로바이더로부터 적당한 폰트 라이버러리를 다운로드 한다. In some embodiments, when portable devices encounter fonts that are not locally cached, they download appropriate font libraries from their host computer or service provider.

문서가 식별되면, 상기 문서와 연관된 마크업 문서가 스캐너로 다운로딩될 수 있다. 마크업 문서의 국부 캐시는 상기 문서에 대한 스캐너의 거동의 국부 결정을 가능하게 해준다.Once a document is identified, markup documents associated with the document can be downloaded to the scanner. The local cache of markup documents enables local determination of the scanner's behavior on the document.

몇 실시예에서, 시스템은 접하게 될 문서에 대한 인덱스나 다른 데이터를 미리 캐시한다. 예를 들어, 몇 실시예에서, 시스템은 지역 뉴스에 대한 인덱스 및 마크업 문서를 상기 신문으로부터 사용자의 스캐닝 데이터를 예상하여 매일 아침 사용자의 휴대 스캐너로 다운하도록 한다. In some embodiments, the system pre-caches the index or other data for the document to be encountered. For example, in some embodiments, the system allows indexing and markup documents for local news to be downloaded to the user's handheld scanner every morning in anticipation of the user's scanning data from the newspaper.

폰트 템플렛Font template

몇 실시예에서, 휴대 문서 데이터 캡쳐 디바이스는 폰트 라이버러리 및 폰트 템플렛을 국부적으로 캐시한다. 폰트 템플렛은 폰트가 인식된 이후에 디바이스로 다운로드될 수 있다. 휴대 디바이스가 알파벳의 모든 캐릭터의 하나의 예를 캡쳐할 때까지 기다릴 필요가 없다. 시스템이 몇몇의 캡쳐된 캐릭터의 폰트를 인식한 이후에, 폰트 라이버러리는 국부 캐시 능력을 가진 데이터 캡쳐 디바이스내로 다운로드될 수 있다. 시스템은 휴대 데이터 캡쳐 디바이스내에 폰트 템플렛을 국부적으로 캐시함으로써 OCR 대기시간을 감소할 수 있다.In some embodiments, the portable document data capture device locally caches font libraries and font templates. The font template may be downloaded to the device after the font is recognized. There is no need to wait until the portable device has captured one example of every character of the alphabet. After the system recognizes the fonts of some captured characters, the font library can be downloaded into a data capture device with local cache capabilities. The system can reduce OCR latency by locally caching font templates in the portable data capture device.

인덱스index

몇 실시예에서, 시스템은 검색 인덱스를 휴대 데이터 캡쳐 디바이스상에 캐시한다. 몇 실시에에서, 시스템은 사용자가 필요로 할 것 같은 인덱스를 미리 캐 시할 수 있다. 예를 들어, 시스템은 지역 신문에 대한 최근의 인덱스로 매일 휴대 디바이스를 미리 인덱스할 수 있다.In some embodiments, the system caches the search index on the portable data capture device. In some embodiments, the system may pre-cache an index that the user may need. For example, the system may pre-index the portable device every day with the latest index for the local newspaper.

키워드 라이버러리Keyword Library

키워드 라이버러리는 휴대 데이터 캡쳐 디바이스의 적당한 예에서 국부적으로 캐시될 수 있다. 키워드를 국부적으로 캐시하는 것은 휴대 디바이스가 키워드 캡쳐에 응답하여 그 거동을 국부적으로 결정할 수 있도록 해준다. 거동의 국부적인 결정은 휴대 디바이스가 호스트 머신이나 서비스 프로바이더의 네트워크에 연결되어 있지 않을 때 특히 유용하다. The keyword library may be cached locally in a suitable example of a portable data capture device. Locally caching keywords allows the portable device to locally determine its behavior in response to keyword capture. Local determination of behavior is particularly useful when the portable device is not connected to the host machine or service provider's network.

마크업 정보Markup Information

몇 실시예에서, 휴대 데이터 캡쳐 디바이스는 문서에 대한 마크업 데이터를 다운로드한다. 이러한 능력은 휴대 디바이스가 문서로부터 캡쳐된 데이터에 응답하여 그 거동의 적어도 일부를 국부적으로 결정하도록 해준다. In some embodiments, the portable data capture device downloads markup data for the document. This capability allows the portable device to locally determine at least a portion of its behavior in response to data captured from the document.

키워드 프로세싱Keyword processing

몇 실시예에서, 휴대 스캐너 캡쳐된 데이터내의 키워드를 인식하고, 키워드 애플리케이션을 지원한다. 키워드에 응답하여 취해진 액션은 시스템 및 키워드가 캡쳐되는 렌더링된 문서와 연관된 마크업 문서에 의해 미리 결정된다. 일반적으로, 전체적인 키워드 정의는 시스템 레벨에서 되고, 국부적인 키워드 정의는 마크업 문서에서 이루어진다. 마크업 문서에서 달리 규정되지 않았다면, 국부적인 정의는 전체적인 정의에 우선한다. In some embodiments, portable scanners recognize keywords in captured data and support keyword applications. The action taken in response to the keyword is predetermined by the markup document associated with the system and the rendered document in which the keyword is captured. In general, global keyword definitions are made at the system level, and local keyword definitions are made in markup documents. Unless otherwise specified in the markup document, local definitions take precedence over the entire definition.

키워드는 스캐너에 의해 인식되는 특수한 기호(에를 들어 애플 컴퓨터™의 상표 기호로서 사용되는 사과 아이콘과 같은)이거나 정규의 텍스트일 수 있다. 예를 들어, 캐탈로그와 같은 문서는 휴대 디바이스에 매우 중요한 명령 기호의 메뉴를 포함할 수 있다. 키워드와 연관된 제어 프로그램을 실행하기 위해, 사용자는 특수 기호중의 하나를 스캔할 것이다. 응답으로 디바이스의 프로세서는 키워드와 연관된 제어 프로그램에 액세스하고 이를 실행할 것이다. 카탈로그 예에서, 특수 기호중의 하나는 스캐너를 통해 카탈로그로부터 제품을 주문하는데 사용될 수 있는 구매 프로그램을 개시할 수 있다. 사용자는 주문되어야 할 제품에 대한 정보를 스캔하고, 휴대 스캐너는 이러한 제품과 판매를 완료하는 데 필요한 다른 정보(빌링 및 배송 정보와 같은)를 카탈로그 벤더에게 인터넷과 통신 인터페이스사이의 연결을 통해 전달할 것이다.The keyword may be a special symbol recognized by the scanner (such as an apple icon used as a trademark symbol on Apple Computer ™) or regular text. For example, a document such as a catalog may include a menu of command symbols that are very important to a portable device. To run the control program associated with the keyword, the user will scan one of the special symbols. In response, the processor of the device will access and execute the control program associated with the keyword. In the catalog example, one of the special symbols may initiate a purchase program that can be used to order products from the catalog via a scanner. The user scans information about the products to be ordered, and the handheld scanner will pass these products and other information (such as billing and shipping information) needed to complete the sale to the catalog vendor via the connection between the Internet and the communication interface. .

검색 거동Search behavior

몇 실시예에서, 휴대 문서 데이터 캡쳐 디바이스는 검색 애플리케이션을 지원한다. 검색 쿼리에 대한 입력은 렌더링된 문서로부터 캡쳐되고, 특히 종이 문서로부터 광 스캐닝에 의해 캡쳐된다.In some embodiments, the portable document data capture device supports a search application. Input to the search query is captured from the rendered document, in particular by light scanning from the paper document.

몇 실시예에서, 시스템은 휴대 데이타 캡쳐 디바이스로부터 발하는 검색 쿼리에 검색 조건이 종이 문서로부터 비롯되었다는 것을 지시하기 위해 택을 붙인다.In some embodiments, the system attaches a tag to the search query issued from the portable data capture device to indicate that the search condition originated from the paper document.

데이터 캡쳐를 통한 문서 ID/위치 Document ID / location via data capture

본 시스템은 렌더링된 문서를 식별하고 렌더링된 문서의 전자 부본의 위치를 찾기 위해, 렌더링된 문서로부터 캡쳐된 데이터를 사용할 수 있다. 본 시스템은 문서의 전집(corpus)의 인덱스를 검색함으로써 문서를 식별하고 위치를 찾는다. 본 시스템은 검색 문의를 검색엔진이나 검색 애플리케이션 소프트웨어에 전송함으로써 검색을 실행한다. The system can use the data captured from the rendered document to identify the rendered document and locate the electronic copy of the rendered document. The system identifies and locates documents by searching the index of the document's corpus. The system executes a search by sending a search query to a search engine or search application software.

검색 쿼리Search query

검색 문의는 휴대용 데이터 캡쳐 디바이스 또는 네트워크 내에서 구성될 수 있다. 일부 실시예에서, 검색 쿼리메시지는 휴대가능한 디바이스 식별자를 포함할 것이다. The search query can be configured within a portable data capture device or network. In some embodiments, the search query message will include a portable device identifier.

컨텍스트를 갖는 검색 쿼리Search query with context

검색에 관한 컨텍스트는 검색 결과의 정확성을 높일 수 있다. 일부 실시예에서, 핸드헬드 문서 데이터 캡쳐 디바이스는 컨텍스트 정보를 포함하고 있는 검색 문의를 제공한다. 컨텍스트는, 사용자의 이력으로부터, 사용자 인구의 전체 이력 행동(the aggregate historical behavior)으로부터, 문서의 속성으로부터, 또는 검색 환경으로부터 유래될 수 있다. The context of the search can increase the accuracy of the search results. In some embodiments, the handheld document data capture device provides a search query that includes contextual information. The context can be derived from the user's history, from the aggregate historical behavior of the user population, from the attributes of the document, or from the search environment.

시간 time

검색 조건(term)이 문서로부터 캡쳐된 시간은 명확화(disambiguation)에 유용한 컨텍스트 이다. 예컨대, 만약 검색엔진이 검색 쿼리가 문서로부터 캡쳐된 날짜를 알고 있다면, 캡쳐 날짜 이후에 공표된 임의의 문서는 캡쳐된 데이터의 소스가 될 수 없기 때문에, 검색엔진은 이러한 문서를 무시할 수 있다. 일부 실시예에서, 검색 쿼리는 언제 검색 스트링이 렌더링된 문서로부터 캡쳐되었는지를 가리키는 타임 스탬프를 포함한다. The time when the search term is captured from the document is a useful context for disambiguation. For example, if the search engine knows the date when the search query was captured from the document, the search engine can ignore this document because any document published after the capture date cannot be the source of the captured data. In some embodiments, the search query includes a time stamp indicating when the search string was captured from the rendered document.

위치location

검색 조건이 문서로부터 캡쳐된 위치는 명확화에 유용한 컨텍스트 이다. 예컨대, 만약 검색엔진이 검색 문의가 문서로부터 캡쳐된 지리적 위치를 알고 있다면, 그 위치에서 배포되거나 공표되지 않은 임의의 문서는 캡쳐된 데이터의 소스가 아닐 가능성이 있기 때문에, 검색엔진은 이러한 문서를 무시할 수 있다. 일부 실시예에서, 검색 문의는 어떤 지리적 위치에서 검색 스트링이 렌더링된 문서로부터 캡쳐되었는지를 가리키는 위치 스탬프를 포함한다. The location where the search condition is captured from the document is a useful context for clarity. For example, if a search engine knows the geographic location where the search query was captured from a document, the search engine may ignore this document because any document that is not distributed or published at that location may not be the source of the captured data. Can be. In some embodiments, the search query includes a location stamp indicating at which geographical location the search string was captured from the rendered document.

(가입자 계정으로부터의) 사용자 이력User history (from subscriber account)

사용자의 이력은 문서를 식별하고 위치를 찾는데에 유용한 컨텍스트 이다. 예컨대, 만일 사용자가 매일 아침 시애틀 타임즈 신문에서 그리고 저녁에는 이코노미스트 매거진에서 텍스트를 스캐닝하는 패턴을 가진다면, 아침에 제공된 검색 문의는 이코노미스트 보다는 시애틀 타임즈로부터 올 가능성이 더 크다. 일부 실시에에서, 본 시스템은 사용자 이력에 근거한 검색 쿼리결과에 등급을 정할 것이다. The user's history is a useful context for identifying and locating documents. For example, if a user has a pattern of scanning text every morning in the Seattle Times newspaper and in the evening Economist magazine, the search query provided in the morning is more likely to come from the Seattle Times than the Economist. In some embodiments, the system will rank search results based on user history.

전체의 사용자 인구 메타데이터(aggregate user population metadata)Aggregate user population metadata

휴대용 문서 데이터 캡쳐 디바이스의 모든 사용에 대한 전체 행동은 명확화에 유용한 컨텍스트를 또한 제공한다. 사용자는 유사한 문서로부터 유사한 정보를 스캔하는 경우가 그렇지 않은 경우보다 더 많은데, 예컨대 사용자 인구가 최신 해리포터 소설로부터 캡쳐된 다수의 검색 쿼리를 최근에 제공하여 왔고 숀 해니티(Sean Hannity)의 최신 서적으로부터는 아무것도 제공하지 않는 경우이다. 따라서, 만약 검색 문의가 몇가지 매치된 결과를 제공한다면, 그 소스 문서는 숀 해니티의 책이 아니라 최신 해리 포터 소설일 가능성이 더 크다. 그러므로, 일부 실시 예에서 본 시스템은 전체의 사용자 인구 행동에 근거하여 검색 쿼리결과에 등급을 정한다. The overall behavior for all use of the portable document data capture device also provides a context useful for clarification. Users are more likely to scan similar information from similar documents than otherwise, for example, the user population has recently provided a number of search queries captured from the latest Harry Potter novels and from Sean Hannity's latest books. Is the case when nothing is provided. Thus, if the search query provides some matched results, the source document is more likely to be the latest Harry Potter novel, not Sean Haniti's book. Thus, in some embodiments, the system ranks search query results based on overall user population behavior.

검색 쿼리의 구성 Organization of Search Queries

일부 실시예에서, 휴대가능한 디바이스는 페이퍼 문서로부터 연속 텍스트를 캡쳐링하고 그 텍스트에 근거하여 검색 쿼리를 구성한다. 그러면 검색 문의는 검색엔진 또는 다른 검색 소프트웨어에 제공된다. 페이퍼 문서를 식별하고 페이퍼 문서의 전자 부본을 찾기 위해, 검색엔진은 그 데이터 인덱스의 검색을 수행한다. 일부 실시예에서, 비록 더 많은 정보가 페이퍼 문서로부터 캡쳐링되었지만, 휴대용 캡쳐 디바이스는 전자 부본을 식별하기에 충분한 정도의 정보를 제공함으로써 통신 대역폭을 절약한다. 무선 대역폭이 제한적이기 때문에, 무선 시스템에서는 필요한 정보만을 전송하는 것이 장점이 된다. 일부 실시예에서 본 시스템은 사용자가 디바이스의 키패드로부터 더 많은 텍스트를 입력함으로써 검색 쿼리를 변경하거나 강화하는 것을 가능하게 한다. In some embodiments, the portable device captures continuous text from the paper document and constructs a search query based on the text. The search query is then provided to a search engine or other search software. To identify the paper document and find the electronic copy of the paper document, the search engine performs a search of its data index. In some embodiments, although more information has been captured from the paper document, the portable capture device saves communication bandwidth by providing enough information to identify the electronic copy. Since wireless bandwidth is limited, it is advantageous to transmit only necessary information in a wireless system. In some embodiments, the system enables a user to modify or enhance a search query by entering more text from the device's keypad.

부분 단어 명확화 Partial word disambiguation

일부 실시예에서 핸드헬드 디바이스 및 시스템은 부분 단어 명확화를 지원한다. 텍스트가 렌더링된 문서로부터 캡쳐되었을 때, 사용자가 단어 영역 상에서 캡쳐를 개시하고 종료하는 것이 어렵다. 캡쳐된 스트링의 시작과 끝에 있는 단어는 일반적으로 절단된다. 전통적으로 검색 인덱스는 전체 단어로 이루어져 있어서, 부분적인 단어는 전통적인 검색 애플리케이션에는 거의 혹은 아무런 가치가 없다. 그러나 이러한 절단된, 또는 "부분적인" 단어는 명확화 가치를 여전히 크게 가질 수 있다. 일부 실시예에서, 검색엔진은 복수의 검색 결과들 중에서 선택하기 위해 부분 단어를 사용한다. 예컨대, 검색엔진은 결과 중에서 선택하기 위한 검색 스트링의 끝에서 검색 인덱스 및 부분 단어를 검색하기 위해 전체 단어를 사용한다. 따라서, 이러한 엔진의 검색 문의는 바람직하게는 부분 단어 정보를 포함한다. In some embodiments handheld devices and systems support partial word disambiguation. When text is captured from the rendered document, it is difficult for the user to start and end capture on the word area. Words at the beginning and end of the captured string are usually truncated. Traditionally, search indexes consist of whole words, so partial words have little or no value in traditional search applications. However, such truncated or "partial" words can still have great clarity value. In some embodiments, the search engine uses partial words to select among a plurality of search results. For example, the search engine uses the entire word to search the search index and partial words at the end of the search string to select among the results. Thus, the search query of such an engine preferably contains partial word information.

멀티-라인 스캔Multi-line scan

일부 실시예에서, 휴대용 데이터 캡쳐 디바이스는 한 동작동안 다수개의 텍스트 라인을 캡쳐링할 수 있다. 페이지의 일부를 촬영하기 위해 내장 카메라를 사용하는 모바일 전화, 두개의 텍스트 라인을 캡쳐하는 광학 헤드를 갖는 펜-타입 스캐너 등은 한 라인 이상의 텍스트를 캡쳐할 수 있는 광학 스캐너의 일 예이다. 도15는 문서(1520)의 두 라인으로부터 텍스트를 캡쳐하는 휴대용 스캐너(1510)를 도시한다. 박스(1500)는 캡쳐링된 텍스트를 나타낸다. 텍스트의 한라인 이상이 캡쳐될 때, 각각의 라인은 "line1 text" AND "line2 text" 포맷의 검색 쿼리로 전송될 수 있다. 대안적으로, 만약 근사적인 열 폭이 알려져 있다면, 검색 쿼리는 "line1 text" WITHIN X WORDS "line2 text" 로서 구성될 수 있고, 여기서 X는 일반적으로 근사적인 열 폭보다 적다. 도15에 도시된 일례에서, 명확화 검색 쿼리는 "study of law committed to" AND "and public service the." 로서 구성될 수도 있다. In some embodiments, the portable data capture device can capture multiple text lines during one operation. A mobile phone using a built-in camera to take a portion of a page, a pen-type scanner with an optical head that captures two lines of text, and the like are examples of optical scanners that can capture more than one line of text. 15 shows a portable scanner 1510 that captures text from two lines of document 1520. Box 1500 represents the captured text. When more than one line of text is captured, each line can be sent to a search query in the format "line1 text" AND "line2 text". Alternatively, if an approximate column width is known, the search query can be constructed as "line1 text" WITHIN X WORDS "line2 text", where X is generally less than the approximate column width. In the example shown in Figure 15, the disambiguation search query is "study of law committed to" AND "and public service the." It may be configured as.

스탬프(컨텍스트, 스캐너 ID, 사용자 ID) Stamp (Context, Scanner ID, User ID)

일부 실시예에서, 본 시스템은 문서를 식별하기 위해 시간 스탬프 및 위치 스탬프를 사용한다. 예컨대 AP통신 기사가 많은 신문들에 나타날 수 있지만, 정확 한 신문은 위치 스탬프에 의해 결정될 수 있다. 만약 위치 스탬프가 스캔이 시애틀에서 수행된 것을 가리킨다면, 시애틀 신문이 그 스캐닝된 AP통신 기사의 소스일 가능성이 가장 크다. 유사하게, 타임 스탬프는 후보 문서의 범위를 타임 스탬프 이전에 공표된 것들로 한정하는데 사용될 수 있다. 일부 실시예에서 핸드헬드 디바이스는 타임 및/또는 위치 스탬프를 포함하는 검색 문의를 구성할 수 있다. In some embodiments, the system uses a time stamp and a location stamp to identify the document. For example, an Associated Press article may appear in many newspapers, but the exact newspaper can be determined by location stamps. If the location stamp indicates that the scan was performed in Seattle, the Seattle newspaper is most likely the source of the scanned Associated Press article. Similarly, time stamps can be used to limit the range of candidate documents to those published prior to the time stamp. In some embodiments, the handheld device may construct a search query that includes a time and / or location stamp.

단어길이/ 컨벌루션 문의Word Length / Convolution Inquiries

일부 실시예에서 본 시스템은 단어 길이에 의해 문서를 인덱싱하고 검색한다. 가장 간단한 경우로, 세가지의 단어 길이-긴,짧은,불명료한-가 사용된다. 길고 짧은 단어의 연속 패턴은 충분한 길이의 각각의 문서에 대한 유일한 식별자를 형성한다. 따라서, 텍스트에 근거한 전통적인 검색 쿼리보다는 단어 길이에 근거한 검색 문의를 제공함으로써 문서의 위치가 찾아질 수 있다. 단어 길이 검색 문의의 일 예는 다음과 같다: 11001110?010??10110, 여기서 1은 짧은 단어, 0은 긴 단어, ?는 불명료 이다. 불명료한 임의의 대상은 검색엔진에 의해 반드시 와일드 카드로서 처리된다. 단어 길이 검색은 캡쳐된 이미지에서 개개의 문자를 구별할 수 없는 이미징 디바이스에 특히 유용하다. 예컨대, 렌더링된 문서의 사진을 찍기 위해 저해상도의 모바일 폰 카메라가 사용되지만, 이 카메라는 문자 수준까지 이미지 분해를 할 수 없다. 그러나 길고 짧은 단어의 매칭 연속 패턴(matching sequential pattern)을 검색함으로써 문서가 식별될 수 있다. 유사하게, 이 문서의 어딘가에서 논의된 것처럼, 문자의 반복 빈도수를 가리키는 컨벌루션-기반 문의에 의해 문서가 인덱싱되고 검색될 수 있다.In some embodiments, the system indexes and retrieves documents by word length. In the simplest case, three word lengths are used: long, short and obscure. The continuous pattern of long and short words forms a unique identifier for each document of sufficient length. Thus, the location of a document can be found by providing a search query based on word length rather than a traditional search query based on text. An example of a word length search query is as follows: 11001110? 010 ?? 10110, where 1 is a short word, 0 is a long word and? Is ambiguity. Any obscure object is always treated as a wildcard by the search engine. Word length search is particularly useful for imaging devices that are unable to distinguish individual characters in the captured image. For example, a low resolution mobile phone camera is used to take a picture of a rendered document, but the camera is not capable of image resolution down to the character level. However, a document can be identified by searching for a matching sequential pattern of long and short words. Similarly, as discussed elsewhere in this document, documents can be indexed and retrieved by convolution-based queries that indicate the frequency of repetition of characters.

텍스트 특징Text features

스캐닝된 텍스트로부터 유래된 검색 문의는 프런트 타입, 사이즈 및 컬러와 같은 텍스트에 관한 정보를 포함할 수 있다. 이들 텍스트 특성은 텍스트가 캡쳐되는 문서를 명확하게 하는데 사용될 수 있다. 하지만, 전통적인 검색 문의는 이러한 정보를 허비한다.Search queries derived from the scanned text may include information about the text, such as front type, size, and color. These text features can be used to clarify the document in which the text is captured. However, traditional search queries waste this information.

서류 식별기Document identifier

유저가 알려진 문서내에서 검색한다면, 검색 문의는 문서 식별기를 포함할 수 있다. 검색 엔진은 문서 식별기를 사용할 수 있어서 검색 결과를 의도된 서류로 한정할 수 있다. 전통적인 검색 문의는 문서 식별기를 포함하지 않는다.If the user searches within a known document, the search query may include a document identifier. The search engine may use a document identifier to limit the search results to the intended document. Traditional search queries do not include document identifiers.

병렬 검색Parallel search

유저 경험을 풍부하게 하기 위해, 검색은 로컬 디바이스 및 네트워크에서 병렬로 발생할 수 있다. 하나의 검색이 하나의 결과를 리턴할 때, 다른 것은 종료할 수 있다.To enrich the user experience, searches can occur in parallel on local devices and networks. When one search returns one result, the other can end.

네트워크 동작Network behavior

일부의 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스 스캐너와 스캐닝 서비스 제공자 네트워크 사이의 메시지는 고유 트랜잭션 코드를 포함한다. 트랜잭션 코드는 시스템이 각각의 트랜잭션을 식별한다. 일부의 실시예에서, 트랜잭션 코드는 해시의 스캐너 ID, 스캐닝된 정보, 문서 정보, 및 타입/로케이션 정보로부터 만들어진다.In some embodiments, the message between the portable document data capture device scanner and the scanning service provider network includes a unique transaction code. The transaction code identifies the system to each transaction. In some embodiments, the transaction code is generated from a scanner ID, scanned information, document information, and type / location information of the hash.

일부의 실시예에서, 휴대용의 문서 데이터 캡쳐 디바이스는 전자 시리얼 넘 버(ESN) 또는 네트워크 어드레스와 같은 고유 식별기를 가지고 있어서 스캐닝 서비스 제공자는 디바이스를 식별할 수 있다. 일부의 실시예에서, 휴대가능한 디바이스는 암호화된 빌링 및 어커운트 정보를 가지고 있는 가입자 식별 모듈(SIM)을 포함한다. 일부의 실시예에서, 제거가능한 식별 모듈은 다른 유저가 데이터 캡쳐 디바이스를 빌릴 수있게 하고 그리고 이것을 식별 모듈을 삽입하므로서, 이것을 그들의 어카운트에 임시로 연관시킨다.In some embodiments, the portable document data capture device has a unique identifier such as an electronic serial number (ESN) or network address so that the scanning service provider can identify the device. In some embodiments, the portable device includes a subscriber identity module (SIM) having encrypted billing and account information. In some embodiments, the removable identification module enables other users to borrow the data capture device and inserts it into the account, thereby temporarily associating it with their account.

각각의 스캐닝 서비스 제공자의 가입자는 서비스 제공자의 네트워크에서 데이터베이스에 저장된 가입자 어카운트를 가지고 있다. 가입자 어카운트 데이터 레코드는 빌링/가입 정보, 가입자 이름 및 주소, 가입자가 액세스할 수 있는 전자 문서, 종이 문서의 가입에 관한 정보, 유저 히스토리 정보, 가입자의 휴대가능한 데이터 캡쳐 디바이스의 식별자(ESN, 등), 보안/암호 키, 그리고 유저의 라이프 라이브러리 및/또는 개인적인 웹페이지(블로그)의 위치를 포함할 수 있다. 예를 들면, 유저는 휴대가능한 디바이스로 문서로부터 데이터를 캡쳐할 수 있고 그리고 UI를 통해서 "blog this document" 커맨드를 입력할 수 있다. 시스템은 문서를 명확히 하고 그리고 유저의 어카운트에 사전 명시된 유저의 블로그 페이지에 문서를 링크를 발행한다.The subscriber of each scanning service provider has a subscriber account stored in a database in the service provider's network. Subscriber account data records include billing / subscription information, subscriber name and address, electronic documents accessible by the subscriber, information about the subscription of paper documents, user history information, identifiers of the portable data capture device of the subscriber (ESN, etc.) , Security / password keys, and the location of the user's life library and / or personal web pages (blogs). For example, a user can capture data from a document with a portable device and enter a "blog this document" command through the UI. The system clarifies the document and publishes a link to the document on the user's blog page pre-specified in the user's account.

일부의 실시예에서, 네트워크는 가입자의 휴대용 문서 데이터 캡쳐 디바이스의 생방송 활동(OAA)과 프로그래밍(OAP)과 같은 원격 활동과 프로그래밍을 수행한다. 데이터 캡쳐 디바이스를 켜면, 그것은 서비스 제공자의 네트워크에 등록될 것이다. 한번 등록하면, 서비스 제공자는 디바이스내로 활성화 데이터를 다운로드할 수 있다. 활성화 데이터는 서비스 제공자가 디바이스에 메시지를 루팅하는데 사용할 수 있는 네트워크 어드레스 또는 다른 고유 식별기를 포함할 수 있다. 디바이스가 활성화된 후, 서비스 제공자는 원격 프로그래밍을 사용할 수 있어서 디바이스를 임의의 업데이트로 업데이트할 수 있다(예를 들면, 국지적으로 캐싱된 마크업 데이터).In some embodiments, the network performs remote activities and programming such as live broadcast activity (OAA) and programming (OAP) of the subscriber's portable document data capture device. When you turn on the data capture device, it will register with the service provider's network. Once registered, the service provider can download activation data into the device. The activation data may include a network address or other unique identifier that the service provider may use to route a message to the device. After the device is activated, the service provider can use remote programming to update the device with any updates (eg, locally cached markup data).

디바이스가 서비스 제공자의 시스템에 등록된 때, 서비스 제공자는 가입자 어카운트에 대하여 서비스 식별기를 체크하므로서 디바이스가 가입자에 속한다는 것을 증명할 수 있다.When the device is registered with the service provider's system, the service provider may prove that the device belongs to the subscriber by checking the service identifier against the subscriber account.

일부의 실시예에서, 시스템은 휴대가능한 디바이스 근처(물리적으로 또는 연결속도 관점에서) 네트워크 엘리먼트에 인덱스 또는 다른 데이터를 이동시킬 수 있어서 잠복(latency)을 줄일 수 있고 그리고 네트워크 리소스와 교신하게 한다. 빈번하게 액세스하는 데이터를 휴대가능한 디바이스에 이동시키는 것은 네트워크 엔터티의 수를 줄이는 것인데 이것은 그 정보를 휴대가능한 디바이스로 고유의 방식으로 취급하여야 한다.In some embodiments, the system can move indexes or other data to network elements near the portable device (physically or in terms of connection speed) to reduce latency and to communicate with network resources. Moving frequently accessed data to a portable device reduces the number of network entities, which must treat the information in a unique way as a portable device.

네트워크 강화된 비모호성Network Enhanced Unambiguity

일부의 실시예에서, 네트워크 및 휴대용 문서 데이터 캡쳐 디바이스는 비모호성 프로세스를 반복한다. 예를 들면, 유저는 종이 문서로부터 캡쳐된 데이터로 이루어지는 검색 문의를 제출할 수 있다. 서비스 제공자는 검색 문의를 검색 엔진에 제출할 수 있지만 문서를 명확하게 할 수는 없다. 응답에 있어서, 네트워크는 유저가 제출된 문서로부터 추가적인 정보를 신속하게 캡쳐하게 한다. 유저는 추가 적인 정보를 제출하고 그리고 서비스 제공자는 이전에 제출된 정보와 결함된 새로운 정보를 사용하여 제출된 문서를 명확하게 한다. 서비스 제공자와 유저는 이러한 프로세스를 필요한 만큼 반복하여 문서를 명확하게 한다. 서비스 제공자는 전형적으로 이전에 제출된 정보를 유지하고 그리고 새롭게 제출된 정보를 축적하여 문서를 명확하게 한다.In some embodiments, network and portable document data capture devices repeat the non-ambiguous process. For example, a user can submit a search query made up of data captured from a paper document. The service provider may submit a search query to the search engine but cannot clarify the document. In response, the network allows the user to quickly capture additional information from the submitted document. The user submits additional information, and the service provider clarifies the submitted documents using previously submitted information and new information that is defective. Service providers and users repeat this process as necessary to clarify the document. Service providers typically retain previously submitted information and accumulate newly submitted information to clarify the document.

가입자 어카운트/레코드Subscriber account / record

일부의 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스는 빌링, 가입 및/또는 디바이스 식별기에 관련된 정보를 저장하기 위한 메모리를 포함한다. 이러한 메모리는 Subscriber Identity Module(SIM) 또는 스마트 카드에서 제거가능하고 또는, Programmable Read Only Memory(PROM)와 같이 제거 불가능하다. 일부의 실시예에서, SIM 메모리는 유저의 모바일 폰 서비스 어카운트와 관련되어 있다. 문서의 전자 카피가 캡쳐된 데이터에 근거해서 위치되는 경우에는, 가입 정보는 유저가 전자 카피에 액세스하여야 하는지를 증명하는데 사용될 수 있다. 예를 들면, 신문은 온라인 버젼에 액세스하기 위한 추가적인 요금을 지불해야 한다. 서비스 제공자와의 유저 어카운트는 신문과 같은 종이 문서를 위한 가입 정보를 포함할 수 있는데, 이것은 유저가 종이 문서의 온라인 버젼에 가입되었는지를 표시한다.In some embodiments, the portable document data capture device includes a memory for storing information related to billing, subscriptions, and / or device identifiers. Such memory may be removable from a Subscriber Identity Module (SIM) or a smart card, or may not be removable, such as Programmable Read Only Memory (PROM). In some embodiments, the SIM memory is associated with the user's mobile phone service account. If an electronic copy of the document is located based on the captured data, the subscription information can be used to verify that the user should access the electronic copy. For example, newspapers must pay an additional fee to access the online version. The user account with the service provider may include subscription information for a paper document, such as a newspaper, which indicates whether the user has subscribed to an online version of the paper document.

일부의 실시예에서, 시스템은 유저의 가입자 어카운트에서 빌링 정보를 사용하여 휴대용 스캐너로 구입을 할 수 있다. 메모리는 유저의 암호화된 크레디트 카드 또는 다른 금융 정보를 포함하고 있다. 예를 들면, 유저는 문서로부터 텍스트를 스캐닝하고 그리고 문서의 전자 카피에 액세스 구매하려고 한다는 것을 표시할 때(유저 인터페이스 또는 상기한 제스쳐 제어를 통해서), 빌링 정보는 카피라이트 홀더 또는 컨텐츠 제공자에 지불을 제공하는데 사용될 수 있다.In some embodiments, the system may make purchases with a handheld scanner using billing information in the user's subscriber account. The memory contains the user's encrypted credit card or other financial information. For example, when a user scans text from a document and indicates that he or she intends to access and purchase an electronic copy of the document (via the user interface or gesture control described above), the billing information may be paid to the copyright holder or content provider. Can be used to provide

일부의 실시예에서, 휴대가능한 디바이스는 메모리의 시리얼 넘버와 같은 디바이스 식별기를 포함한다. 이러한 디바이스 식별기는 휴대가능한 디바이스를 고유하게 식별하고 그리고 PROM에 전형적으로 저장하여 이들은 삭제할 수 없다. 트랜잭션을 위한 추가적인 보안이 디바이스의 시리얼 넘버를 네트워크 데이터 베이스에서 유저의 어카운트 또는 가입을 관련시키는 것 같이, 단지 한명의 유저와 휴대가능한 디바이스를 관련시키므로서 얻어질 수 있다. 다른 방식으로는, 시스템은 스마트 카드에 디바이스 식별기를 저장하여(또는 휴대용 스캐너에 스마트 카드 식별기를 저장) 스캐너를 스마트 카드에 잠근다. 디바이스의 프로세서는 휴대용 스캐너(200)가 작동이 허용되기 전에 올바른 스마트 카드가 삽입되었다는 것을 식별한다. 내부 프로세서를 갖춘 스마트 카드는 또한 이들이 휴대가능한 디바이스에 삽입되었다는 것을 식별할 수 있는데 이들은 스마트 카드의 임의의 정보에 액세스 허용되기 전에 잠긴다.In some embodiments, the portable device includes a device identifier such as a serial number of memory. Such device identifiers uniquely identify portable devices and typically store in a PROM so they cannot be deleted. Additional security for transactions can be obtained by associating a portable device with only one user, such as associating a user's account or subscription with the serial number of the device in the network database. Alternatively, the system locks the scanner to the smart card by storing the device identifier on the smart card (or storing the smart card identifier on the portable scanner). The processor of the device identifies that the correct smart card has been inserted before the portable scanner 200 is allowed to operate. Smart cards with internal processors can also identify that they have been inserted into a portable device, which are locked before any information on the smart card is allowed to access.

주석Remark

일부 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스는 주석 기능을 포함하고 있다. 주석 소프트웨어는 디바이스가 문서에서 위치, 마크 또는 텍스트에 보이스 또는 텍스트 주석을 부착하게 할 수 있다. 주석은 문서 내에서 절대적인 위치 또는 텍스트 스트링과 관련될 수 있다. 주석이 텍스트 스트링과 관련된다면, 이들 시스템은 주석이 부착된 텍스트를 편집 또는 삭제하면 가입자에게 통지할 수 있다. 편집 또는 삭제로 진행되기 전에, 시스템은 유저에게 더 진행시키고자 하는지 확인을 받는다.In some embodiments, the portable document data capture device includes an annotation function. Annotation software may allow a device to attach voice or text annotations to a location, mark or text in a document. A comment can be associated with an absolute position or text string within a document. If the comment is associated with a text string, these systems can notify the subscriber if they edit or delete the annotated text. Before proceeding with editing or deleting, the system is asked if the user wants to proceed further.

텍스트text

텍스트 주석은 휴대가능한 디바이스의 키패드를 통해서, 또는 보이스 데이터의 텍스트로의 전환에 의해 렌더링된 문서로부터 텍스트를 스캐닝하므로서 입력될 수 있다.Text annotations can be entered through the keypad of a portable device or by scanning text from a rendered document by conversion of voice data to text.

보이스voice

일부 실시예에서, 휴대용 데이터 캡쳐 디바이스는 보이스를 캡쳐하기 위해서 마이크로폰을 포함하고 있다. 캡쳐 속도는 아래의 스캐노테이터 섹션에서 더 상세히 설명하는 바와 같이, 유저에 의해 특정된 위치에서 오디오 파일로서 문서에 주석을 붙일 수 있다.In some embodiments, the portable data capture device includes a microphone to capture voice. The capture rate can be annotated to the document as an audio file at a location specified by the user, as described in more detail in the Scavenator section below.

OCROCR

일부 실시예에서, 휴대용 문서 데이터 캡쳐 디바이스는 탑재된 OCR 능력을 가지고 있다. 일부 실시예에서, 시스템은 호스트 컴퓨터 또는 서비스 제공자가 OCR을 실행할 수 있다. OCR은 템플릿 매칭, 컨벌루션, 및 단어 길이 OCR을 포함하는 많은 적절한 방법에 의해서 달성될 수 있다.In some embodiments, the portable document data capture device has onboard OCR capabilities. In some embodiments, the system may be hosted by the host computer or service provider. OCR can be accomplished by many suitable methods including template matching, convolution, and word length OCR.

트레이드 마크 심볼을 코드로 전환Convert Trademark Symbols to Code

일부 실시예에서, 휴대용 데이터 캡쳐 디바이스는 트레이드 마크 심볼을 코드 또는 간단한 텍스트로 전환한다. 휴대가능한 디바이스는 인식할 수 있는 트레이드 마크 심볼의 데이터베이스를 가지고 있다. 트레이드 마크를 스캔해서 인식할 때, 휴대가능한 디바이스는 트레이드 마크 이미지를 위한 코드를 대체할 수 있다. 그리고 휴대용 디바이스는 코드를 서비스 제공자 네트워크에 보낸다. 서비스 제공자는 트레이드 마크 코드와 관련된 소정의 액션을 취한다. 예를 들면, 유저는 Mitsubishi^TM "three diamond" 트레이드 마크의 이미지를 스캐닝할 수 있다. 휴대용 스캐너는 이미지를 탑재된 트레이드 마크 심볼의 라이브러리와 비교하고 그리고 three diamond 심볼을 Mitsubishi 트레이드 마크라고 식별한다. 라이브러리는 트레이드 마크를 시스템 서비스 제공자로 고유하게 식별하는 각각의 트레이드 마크와 관련된 고유 코드를 가지고 있다. 전체적인 이미지 파일을 서비스 제공자에게 전송하기 보다는, 휴대용 스캐너는 코드를 대체하고 그리고 이 코드를 전송한다. 이미지를 위한 코드를 대체하는 것은 네트워크로의 메시지의 크기를 줄이는데, 이것은 무선통신의 중요한 장점이다. 일부 실시예에서, 시스템은 이미지 파일을 허용하지 않는 일부의 통신채널(셀룰라 SMS 채널같은)에 문자숫자식 코드를 보낸다. 믈론, 트레이드 마크 이미지는 또한 ASCII 텍스트로 전환할 수 있다. 예를 들면, Mitsubishi^TM three diamond 로고는 텍스트 스트링 "Mitsubishi trademark"로 전환될 수 있다.In some embodiments, the portable data capture device converts the trademark symbol into code or simple text. The portable device has a database of recognizable trademark symbols. When scanning and recognizing a trademark, the portable device can replace the code for the trademark image. The portable device then sends the code to the service provider network. The service provider takes some action related to the trademark code. For example, a user may scan an image of the Mitsubishi ^™ “three diamond” trademark. The handheld scanner compares the image with a library of onboard trademark symbols and identifies the three diamond symbol as a Mitsubishi trademark. The library has a unique code associated with each trademark that uniquely identifies the trademark as a system service provider. Rather than sending the entire image file to the service provider, the portable scanner replaces the code and sends this code. Replacing the code for the image reduces the size of the message to the network, which is an important advantage of wireless communication. In some embodiments, the system sends alphanumeric codes to some communication channels (such as cellular SMS channels) that do not allow image files. Of course, trademark images can also be converted to ASCII text. For example, the Mitsubishi ^™ three diamond logo can be converted to the text string “Mitsubishi trademark”.

단어 길이Word length

일부 실시예에서, 선택적인 스캐닝 서브시스템은 단어 길이를 개별적인 문자가 무엇인지 결정할 수 없을지라도, 단어 길이를 합리적인 접근으로 분류한다. 다행히도, 단어 길이 패턴은 또한 문서를 식별하는데도 사용할 수 있다. 문서의 이 미지에서 단어를 카테고리로 분류하므로서, 휴대용 스캐너는 문서를 식별한느데 사용할 수 있는 코드를 구성할 수 있다. 가장 간단한 경우에, 단어는 긴, 짧은 그리고 알 수 없는 의 3개의 카테고리로 분류된다. 짧은 단어는 소정의 수의 문자보다 적고 그리고 긴 단어는 소정의 문자 수보다 더 많은 문자를 가지고 있다. 다시 말해서, 짧은 단어＜X＜긴 단어 인데, 여기에서 X는 짧은 단어를 긴 단어와 구별하는 소정 수의 문자이다. 문서가 단어 길이에 의해 표시되는 특정 인덱스를 검색하면 렌더링된 문서를 식별할 것이다. 전자 카운터파트가 위치된 후, 전자 카운트 파트는 전자 카운터파트에서 긴/짧은/알 수 없는 단어의 매칭 인접 스트링을 찾고 그리고 단어 길이 패턴을 단어의 개별적인 문자로 전환하므로서 스캐닝된 스트링에서 OCR을 실행하는데 사용할 수 있다.In some embodiments, the optional scanning subsystem classifies the word length as a reasonable approach, although the word length cannot determine what individual letters are. Fortunately, word length patterns can also be used to identify documents. By categorizing words in the image of a document, a handheld scanner can construct code that can be used to identify a document. In the simplest case, words are classified into three categories: long, short and unknown. Short words have fewer than a certain number of characters and long words have more than a certain number of characters. In other words, short words <X <long words, where X is a predetermined number of characters that distinguish a short word from a long word. Searching for a specific index, indicated by the word length, will identify the rendered document. After the electronic counterpart is located, the electronic count part finds a matching adjacent string of long / short / unknown words in the electronic counterpart and executes OCR on the scanned string by converting the word length pattern into individual letters of the word. Can be used.

템플릿 매칭Template matching

템플릿 매칭 OCR은 캡쳐된 이미지를 저장된 문자 이미지와 비교한다. 매치가 발견되면, 문자는 식별된다. 템플릿 매칭 OCR은 프론트 스타일, 사이즈, 이탤릭체, 등에서 변화에 민감하다. 중요하게, 캡쳐된 문자가 저장된 템플릿과 다르게 보이면 템플릿 매칭 시스템을 참조하여야 한다. 템플릿 매칭은 트레이드 마크 및 그래픽 아이콘의 이미지를 인식하는데 매우 유용하다.Template matching OCR compares the captured image with the stored text image. If a match is found, the character is identified. Template matching OCR is sensitive to changes in front style, size, italics, and so on. Importantly, if the captured characters look different from the stored template, then you should refer to the template matching system. Template matching is very useful for recognizing images of trademarks and graphical icons.

컨볼루션Convolution

도 16은 문자 오프셋을 결정하기 위한 컨볼루션의 하나의 실시예를 도시하고 있다. 대체적으로, 본 실시예는 텍스트의 이미지를 자신을 가로질러 슬라이딩하므로서 계획할 수 있다. 텍스트의 스트링을 위한 전환 패턴이 결정되면, OCR은 통계 적인 분석에 의해서 또는 소스 문서를 전환-강화 이미지의 거색을 통해서 식별하므로서 실행될 수 있다. 이러한 실시예는 센서 이미지가 캡쳐된 이미지을 가지고 난 후에 1610을 시작한다. 이러한 스캐너에서 프로세서는 픽셀의 큐를 만들어서 원래의 이미지와 비교할 수 있다. 이러한 큐는 이러한 원래의 이미지의 카피의 수직 슬라이스가 될 수 있다. 또 다른 접근법은 어드레스 포인터를 사용하여 수직 슬라이스의 트랙을 계속 비교하고, 그리고 이들 슬라이스들의 카피를 프로세서에 임시적으로 만든다. 다음 단계(1620)는 하나의 길이를 비교한다. 길이는 수직 슬라이스의 수평 폭으로 한다. 이것은 하나의 픽셀, 또는 다중 픽셀이 될 수 있다. 이것은 여백에 근거하여 발견적으로 결정될 수 있다. 이러한 슬라이스는 전체적인 이미지가 될 수 있다. 이러한 슬라이스는 원래의 이미지로부터의 슬라이스와 비교된다. 이것은 이러한 슬라이스가 원래의 이미지로부터의 슬라이스에 연속적으로 비교하므로서 이루어진다. 일부 실시예에서, 이러한 비교 슬라이스는 다음 스텝의 대응하는 슬라이스와 계속하여 비교된다. 하나의 스텝은 수평 폭과 동일한 거리가 될 수 있다. 하나의 스텝은 하나의 픽셀 또는 다중 픽셀이 될 수 있다. 이러한 프로세스의 다음 단계(1630)에서, 이러한 프로세서는 이러한 이미지가 자신과 매칭되는 메모리에 기록된다. 이러한 데이터는 수직 슬라이스가 이러한 원래의 이미지의 다른 수직 슬라이스와 매치된다는 것을 포함하고 있다. 매칭 섹션은 문자일 수도 있고 문자가 아닐 수도 있다(즉, 이것은 단지 순서적으로 나타나는 2개의 문자일 수 있다). 다음 스텝(1640)은 이러한 비교가 완성되었는지를 결정한다. 비교는 필수적으로 하나의 슬라이스를 비교하는 것이 아니고, 이렇게 더욱 큰 프로세스 를 말하는 것이다. 전환 프로세스가 완성되었는지 결정하는 하나의 방법은 더 이상 비교할 슬라이스가 없는지를 확인하는 것이다.16 illustrates one embodiment of a convolution for determining a character offset. Alternatively, this embodiment can be planned by sliding an image of text across itself. Once the conversion pattern for the string of text is determined, OCR can be performed by statistical analysis or by identifying the source document through the coarse color of the conversion-enhanced image. This embodiment begins 1610 after the sensor image has the captured image. In such a scanner, the processor can create a queue of pixels and compare them with the original image. This cue can be a vertical slice of a copy of this original image. Another approach uses address pointers to continuously compare tracks of vertical slices, and temporarily make copies of these slices to the processor. The next step 1620 compares one length. The length is the horizontal width of the vertical slice. This can be one pixel or multiple pixels. This can be determined heuristically based on margins. This slice can be an overall image. This slice is compared with the slice from the original image. This is done by successively comparing these slices to slices from the original image. In some embodiments, this comparison slice is continuously compared with the corresponding slice of the next step. One step may be the same distance as the horizontal width. One step may be one pixel or multiple pixels. In a next step 1630 of this process, such a processor is written to a memory where this image matches itself. This data includes that the vertical slice matches another vertical slice of this original image. The matching section may or may not be a letter (ie, it may be only two letters appearing in sequence). The next step 1640 determines whether this comparison is complete. Comparisons do not necessarily compare one slice, but rather a larger process. One way to determine if the conversion process is complete is to ensure that there are no more slices to compare.

도 17은 전환 프로세스를 개념화하기 위한 하나의 방식을 예시하고 있다. 이것은 문자 오프셋을 찾기위해 단일의 슬라이스를 사용하는 단계적인 분석을 도시하고 있다. 예시적인 스텝은 1700으로 도시된 1과 같은 숫자이다. 라인(1710)은 스텝들을 분리하는데 사용된다. determinative라는 단어의 이미지가 비교된다. 좌측이 슬라이스(1720)이고 그리고 우측이 메모리에서의 카피(1730)이다. 오버랩이 발견되면, 삼각형(1740)으로 표시된다.17 illustrates one way to conceptualize the conversion process. This shows a stepwise analysis using a single slice to find the character offset. An exemplary step is a number, such as 1, shown at 1700. Line 1710 is used to separate the steps. Images of the word determinative are compared. Left is slice 1720 and right is copy 1730 in memory. If an overlap is found, it is indicated by triangle 1740.

도 18은 다른 예를 도시하고 있다. 여기에서, 슬라이스 카피(1820)는 메모리에서의 카피(1830) 위에 도시되어 있어서 어떻게 매치가 발견되었는지 더욱 명화가게 되어 있다(1840).18 shows another example. Here, slice copy 1820 is shown over copy 1830 in memory to further clarify how a match was found (1840).

도 19는 시스템에 의해서 전형적으로 실행되는 스텝들을 도시하는 플로우 다이어그램인데 전환 프로세스가 이미지에서 실행된다. 이미지의 어떤 부분이 문자인지 결정하기 어려운 경우가 있다. 하나의 접근책은 문자들의 분리된 수를 가진 섹션으로 이러한 이미지를 세분하는 것이다. 일부 실시예에서, 이러한 프로세스는 매치로서 반복적으로 완성되거나, 또는 모든 매치가 발견된 후에 시작될 수 있다. 스텝(1910)에서, 이러한 이미지는 하나의 세그멘트인데, 즉 문자들의 분리된 수의 이미지이다(이러한 이미지는 여백에 둘러싸인 섹션에 생길 수 있다). 스텝(1920)에서, 프로세스에 더 많은 매칭 서브섹션이 있다면, 시스템은 스텝(1930)으로 계속진행되고, 시스템은 스텝(1970)에서 종료한다. 스텝(1930)에서, 이들 섹션은 기록 된다. 일차원 위치 측정은 메모리에 보내질 수 있다. 이들 세그멘트를 매칭 카운터파트와 연관시키는 하나의 방법은 식별기를 사용하는 것이다. 다른 접근책은 이들을 메모리에 저장하여 상대적인 위치가 이들이 어떻게 매치되는가와 같은 정보를 제공한다(즉, 각각의 매칭 쌍은 연속적으로 저장되고, 그리고 홀수의 매치는 하나의 반복된 구역을 가지고 있어서 짝수가 있다). 스텝(1940)에서, 시스템은 임의의 이들 매칭 세그멘트가 임의의 세그멘트와 오버랩되는지 판정한다. 이러한 오버랩은 하나의 세그멘트가 다른 세그멘트를 전적으로 포함하는 곳에서, 또는 각각의 오버랩의 단지 하나의 섹션에서 발생한다. 스텝(1950)에서, 시스템은 이들 세그멘트로 세분화된다. 이러한 세분화 스텝은 제 1 세그멘트가 다중 문자를 가지는 곳에서 그리고 제 2 세그멘트가 더 작은 수의 이들 문자를 가지는 곳에서 일어난다. 예를 들면, 제 1 매치 세그멘트는 "ing"를 포함하고 있고 그리고 제 2 세그멘트는 "in."을 포함하고 있다. 이러한 프로세스는 이들을 "in"(즉, 매치된 것)과 "g"(나머지)를 포함하는 세그멘트로 세분화할 수 있다. 모든 세그멘트가 분리된 수의 문자로 시작하면, 분리된 수의 문자를 제거하는 것은 또한 분리된 수의 문자를 남길 것이다. 스텝(1960)에서, 시스템은 이들 세그멘트의 각각을 완전히 오버랩되거나 또는 전적으로 오버랩되지않는 가장 큰 세그멘트로서 저장한다. 이러한 프로세스는 위치가 1930에 저장될 때와 유사하다. 일부 실시예에서, 상호 관련되는 매칭 세그멘트의 동일한 시스템이 사용된다. 이러한 프로세스 후에, 원래의 이미지는 식별된 다수의 매칭 세그멘트를 가질 것이다. 시스템은 이들 세그멘트 사이(또는 이들 세그멘트와 이러한 이미지의 적어도 하나의 에지 사이)의 스페이스를 임의의 다른 세그멘트와 매치되지 않는 새로운 세그멘트로서 취급한다. 일부 실시예에서, 시스템은 블러브(blob) 분석 또는 결합 분석과 같은 문자 분석 기술을 사용하여 세그멘트를 더 세분화한다. 이들 세그멘트는 이들이 기초로 하는 텍스트를 결정하기 위해서 사용될 수 있다. 일부 실시예에서, 시스템은 이들 세그멘트를 오프셋으로서 나타내고 그리고 이들 오프셋을 사용하여 어떤 텍스트가 이들 오프셋을 만들었는지 찾는다. 일부 실시예에서, 이들 정보를 담고 있는 저장은 다중 문자를 포함하고 있는 세그멘트를 고려할 수 있는 데이터로 채워진다.19 is a flow diagram showing steps typically executed by a system in which a conversion process is performed on an image. Sometimes it is difficult to determine which part of an image is a letter. One approach is to subdivide this image into sections with separate numbers of characters. In some embodiments, this process may be iteratively completed as a match, or may start after all matches have been found. In step 1910 this image is one segment, that is an image of a separate number of characters (such an image may occur in a section surrounded by a margin). At step 1920, if there are more matching subsections in the process, the system continues to step 1930 and the system ends at step 1970. At step 1930, these sections are recorded. One-dimensional position measurements can be sent to memory. One way to associate these segments with matching counterparts is to use an identifier. Another approach is to store them in memory to provide information such as how their relative positions match (ie, each matching pair is stored consecutively, and odd matches have one repeated region so that even numbers have). At step 1940, the system determines if any of these matching segments overlap with any segment. This overlap occurs where one segment entirely contains another segment, or in only one section of each overlap. At step 1950, the system is subdivided into these segments. This segmentation step occurs where the first segment has multiple characters and where the second segment has a smaller number of these characters. For example, the first match segment contains "ing" and the second segment contains "in." This process can subdivide them into segments that include "in" (ie, matched) and "g" (rest). If all segments begin with a separated number of characters, removing the separated number of characters will also leave a separate number of characters. In step 1960, the system stores each of these segments as the largest segment that either fully overlaps or does not fully overlap. This process is similar to when the location is stored in 1930. In some embodiments, the same system of interrelated matching segments is used. After this process, the original image will have a number of matching segments identified. The system treats the space between these segments (or between these segments and at least one edge of this image) as new segments that do not match any other segment. In some embodiments, the system further refines segments using character analysis techniques such as bubble analysis or binding analysis. These segments can be used to determine the text on which they are based. In some embodiments, the system represents these segments as offsets and uses these offsets to find out what text made these offsets. In some embodiments, the storage containing these information is filled with data that can take into account segments containing multiple characters.

디바이스에서 웹 서버Web server on the device

Microsoft^TM Internet Explorer와 같은 컴퓨터 구동 웹 브라우저 소프트웨어는 휴대용 데이터 캡쳐 디바이스의 일부 실시예에 포함된 내부 웹페이지를 액세스할 수 있다. 그래서 컴퓨터는 휴대용 디바이스의 내부 웹페이지에 액세스할 수 있고, 휴대용 디바이스는 USB 케이블과 같은 통신 채널에 의해 컴퓨터에 연결될 수 있다.Computer-driven web browser software, such as Microsoft ^™ Internet Explorer, can access internal web pages included in some embodiments of a portable data capture device. Thus, the computer can access the internal webpage of the portable device and the portable device can be connected to the computer by a communication channel such as a USB cable.

대표적인 실시예Representative Example

다음은 휴대용 문서 데이터 캡쳐 디바이스의 대표적인 실시예를 설명한다. 이들 실시예는 모든 가능한 실시예를 설명할 수 없지만 가능한 개략적인 것을 설명한다는 의미이다. The following describes a representative embodiment of a portable document data capture device. These embodiments are not meant to describe all possible embodiments but are meant to be as schematic as possible.

모바일폰Mobile phone

휴대용 문서 데이터 캡쳐 디바이스의 모바일폰 실시예는 폰과 스캐너의 기능 을 포함한다. 모바일폰은 전용 스캐닝 서브시스템으로 또는 집적된 카메라로 이미지 데이터를 받아들일 수 있다. 보이스 주석은 모바일폰의 마이크로폰으로 받아들일 수 있다. 유저는 폰의 스캐너를 통해서, 또는 마이크로폰을 통해서 폰의 키패드상의 검색 문의 텍스트를 입력할 수 있다.Mobile phone embodiments of a portable document data capture device include the functionality of a phone and a scanner. The mobile phone can accept image data with a dedicated scanning subsystem or with an integrated camera. Voice annotations can be accepted as the microphone of a mobile phone. The user can enter the search query text on the phone's keypad through the phone's scanner or through the microphone.

일부 실시예에서, 종이 문서가 폰의 카메라로 이미지로 되고 그리고 폰의 디스플레이에 나타날 때, 모바일폰은 종이 서류 이미지에 붙여진 마크업 레이어를 나타낼 소프트웨어를 가질 수 있다. 종이 문서가 폰의 카메라를 통해 보이면, 이미지는 마크업 서류 데이터에 의해 화질이 높아진다.In some embodiments, when a paper document is imaged with the phone's camera and appears on the phone's display, the mobile phone may have software to present a markup layer pasted onto the paper document image. When a paper document is viewed through the phone's camera, the image is enhanced in quality by markup document data.

페이지와 물리적으로 접촉하지 않는 스캐너에 의해 스캐닝될 페이지에서 텍스트를 식별하는 방법How to identify text on a page to be scanned by a scanner that is not in physical contact with the page

모바일폰 카메라를 스캐닝 디바이스로서 사용하는 어려움 중의 하나는 텍스트가 스캐닝될 유저를 나타내는 것이다. 일부 실시예에서, 모바일폰은 근접 스캔 구역을 밝게 할 스캐닝될 표면에 빔을 투사한다. 일부 실시예에서, 모바일폰은 폰의 디스플레이에 스캐닝될 구역을 디스플레이한다. 디스플레이는 여러가지 방식으로 화질이 높아져서 유저에게 이미지의 어떤 서브셋이 스캐닝되거나 또는 OCR되는지 보여준다. 예를 들면, 디스플레이는 텍스트가 캡쳐될 구역 주위에 박스를 그릴 수 있다. 다른 방법으로는, 폰은 카메라로부터 또는 문서 소스로부터 문서의 이미지와 겹쳐지는 디스플레이에서 스캔의 경계를 나타낼 수 있는데, 즉 디스플레이 스크린상에 빨강 선으로 또는 그늘진 배경 등으로 나타낼 수 있다.One of the difficulties of using a mobile phone camera as a scanning device is to indicate the user whose text will be scanned. In some embodiments, the mobile phone projects a beam on the surface to be scanned that will brighten the proximity scan area. In some embodiments, the mobile phone displays the area to be scanned on the display of the phone. The display is enhanced in several ways to show the user which subset of the image is scanned or OCR. For example, the display can draw a box around the area where text is to be captured. Alternatively, the phone may indicate the boundaries of the scan in the display overlapping the image of the document from the camera or from the document source, i.e. as a red line or shaded background or the like on the display screen.

스캐닝된 텍스트를 근거로 액션을 취하고 유저에게 선택하게 하는 방법How to take an action based on the scanned text and let the user make a selection

일부 실시예에서, 시스템은In some embodiments, the system is

-- 종이 문서의 일부분의 이미지를 캡쳐하고,-Capture an image of a portion of a paper document,

-- ocr 이미지를 선택하고, 오프셋을 찾고, 압축하고,-select an ocr image, find the offset, compress it,

-- 이미지 또는 텍스트 데이터를 셀룰라 네트워크를 통해서 서버에 전송하고,-Send image or text data to a server over a cellular network,

-- 문서 및 (있다면)관련된 마크업을 위치시키고,-Locate the document and (if any) associated markup,

-- 동작/프레젠테이션 데이터를 무선 핸드셋에 전송하고,-Transmit motion / presentation data to the wireless handset,

-- 데이터를 유저에게 나타내고,-Present the data to the user,

-- 유저로부터 지시를 선택적으로 수신하고,-Optionally receive instructions from the user,

-- 유저로부터 지시를 저장 또는 전송하므로서,-By storing or sending instructions from the user,

모바일폰 컨텍스트에서 종이 문서를 명확하게 한다.Clarify paper documents in the context of mobile phones.

일부 실시예에서, 메뉴를 위한 데이터(즉, '마크업 데이터")의 적어도 일부분 그리고 문서의 인덱스는 모바일폰에 다운로드되고 남겨진다. 메뉴/마크업 정보는 모바일폰의 디스플레이에서 유저에게 보여진다. 선택적으로, 메뉴는 유저가 들을 수 있게 나타낼 수 있다.In some embodiments, at least a portion of the data for the menu (ie, 'markup data') and the index of the document are downloaded and left on the mobile phone The menu / markup information is shown to the user on the display of the mobile phone. Optionally, the menu can be presented for the user to hear.

일부 실시예에서, 모바일폰 데이터 캡쳐 디바이스는 폰의 오디오 기능을 사용하여 문서를 명확하게 한다. 유저는 무선 핸드셋 또는 랜드라인 폰을 사용할 수 있어서 보이스, 스캔, dtmf 톤, 등을 받아들이는 서버로 다이얼할 수 있고, 그리고 현존하는 전화통신의 오디오 채널을 사용하여 종이-대-디지털 문서 시스템의 장점을 얻는다.In some embodiments, the mobile phone data capture device uses the phone's audio function to clarify the document. Users can use a wireless handset or landline phone to dial a server that accepts voice, scan, dtmf tones, etc., and the advantages of a paper-to-digital document system using existing telephony audio channels. Get

예를 들어, 사용자는 문서의 대표적인 부분, 예를 들어, 문서 식별자, 타이틀등을 읽는다. 이 시스템은 일부 실시예에서 - 스크린에 대해 선택적인- 모호성에 대한 피드백을 제공하고, 사용자는 디지털 문서의 매칭이 발견되었는지 여부의 인식의 여부를 수신한다. 사용자는 관심의 추가 아이템을 스캐닝하는 것과 같은 모호성을 해결하기 위한 선택적 액션을 취할 수 있다. 사용자는 컨텍스트를 설정하기 위해 크게 읽을 수 있고 시스템에 모호하지 않은 데이터를 제시할 수 있다. 예를 들어, 사용자는 "...said we need this war to achieve peace..." 비모호셩 데이터로 이어지는 "NY Times, yesterday"를 말함으로써 컨텍스트를 설정할 수도 있다. 그다음 시스템은 비모호성 데이터에 매칭하는 텍스트를 위해 뉴욕타임지의 어제 에디션을 검색한다. For example, a user reads a representative portion of a document, such as a document identifier, a title, and the like. The system provides feedback on ambiguity—selective to the screen—in some embodiments, and the user receives whether to recognize whether a match of the digital document has been found. The user may take optional actions to resolve ambiguity, such as scanning for additional items of interest. The user can read aloud and present unambiguous data to the system to set the context. For example, a user may set a context by saying "NY Times, yesterday" followed by "... said we need this war to achieve peace ..." unambiguous data. The system then searches for yesterday's edition of The New York Times for text that matches unambiguous data.

모바일 폰 카메라 시스템의 사용에 의한 스캐닝 방법Scanning method by using mobile phone camera system

단순한 스캐닝은 모든 실시예에서 폰의 카메라의 전체 해상도를 필요로 하지 않을 수 있다. 카메라 센서 에어리어의 선택된 부분만을 사용하는 것은 보다 높은 데이터속도 및 보다 낮은 전력소비량의 장점을 갖게 된다. Simple scanning may not require the full resolution of the phone's camera in all embodiments. Using only selected portions of the camera sensor area has the advantage of higher data rates and lower power consumption.

일부 실시예에서, 광섬유 이미지 콘딧은 스캐닝 서브시스템의 일부이다. 스캐닝 서브시스템은 기존의 카메라 이미지 센서와 공경합될 수 있다. 일부 실시예에서, 카메라 이미지 센서의 일부는 스캐닝 서브시스템의 배타적 사용을 위해 예비된다. In some embodiments, the fiber optic image conduit is part of the scanning subsystem. The scanning subsystem can co-exist with existing camera image sensors. In some embodiments, some of the camera image sensors are reserved for exclusive use of the scanning subsystem.

액션, 전자 트랜잭션 또는 검색을 위한 For action, electronic transaction, or search 컨텍스트Context 설정 방법 How to set up

셀룰러 서비스 프로바이더를 갖는 모바일 폰 가입자의 어카운트는 p-커머스 구매 트랜잭션에 대한 정보를 계산/빌링하기 위해 사용될 수 있다. The account of the mobile phone subscriber with the cellular service provider can be used to calculate / bill information about the p-commerce purchase transaction.

모바일 폰 기능의 다른 태양은 컨텍스트를 설정하기 위해 사용될 수 있다. 예를 들어, 폰 콜 액티비티 및 히스토리는 컨텍스트를 달성하고, 입력을 우선순위화하고 그리고 검색 문의를 다루기 위해 사용될 수 있다. 또한, 웹/WAP/이메일/IM 액티비티 및 그 히스토리에 대한 폰 네트워크의 사용; 그 폰의 지리학적 위치 및 히스토리와 같은 폰 네트워크의 다른 사용은 컨텍스트를 설정하는데 사용될 수 있다. Another aspect of mobile phone functionality can be used to establish a context. For example, phone call activities and history can be used to achieve context, prioritize input, and handle search queries. In addition, the use of the phone network for Web / WAP / email / IM activities and their history; Other uses of the phone network, such as the phone's geographic location and history, can be used to establish the context.

폰의 텍스트 메시징 사전은 OCR 프로세스를 강화하는데 사용될 수 있다. 많은 모바일 폰상의 텍스트 메시징에 대한 T9 예측 텍스트 소프트웨어는 또한 OCR 및 비모호성 프로세스를 강화하는데 사용될 수 있다. 예를 들어, T9 예측 텍스트 소프트웨어는 OCR 에러를 수정하는데 사용될 수 있다. 텍스트 입력 및 SMS 메시징 히스토리는 OCR에 대한 어휘로서 사용될 수 있다. The phone's text messaging dictionary can be used to enhance the OCR process. T9 predictive text software for text messaging on many mobile phones can also be used to enhance OCR and non-ambiguity processes. For example, T9 predictive text software can be used to correct OCR errors. Text entry and SMS messaging history can be used as a vocabulary for OCR.

일부 실시예에서, 모바일 폰은 아이콘을 명령어로서 인식하여 아이콘 다음에 인쇄된 넘버를 다이얼하게 된다. 일부 실시예에서, 모바일 폰은 폰넘버를 스캔시에 폰 넘버를 인식하고 사전결정된 액션을 취하게 된다. 가능한 사전결정된 액션의 일부는 폰 콜을 행하고, 이 폰 넘버를 폰 주소록에 저장하는 것이다. 일부 실시예에서, 폰은 폰의 지리학적 위치를 콘텍스트로서 사용하여 폰 넘버를 인식하는 것을 돕게 된다. 예를 들어, 북아메리카 시스템하의 폰 넘버는 10개의 디짓으로 구성된다. 폰은 그 넘버가 폰 넘버인지를 결정하기 위해 넘버를 폰이 스캐닝할 때 이들의 위치 콘텍스트를 사용할 수 있다. 폰이 북아메리카에 있을 때, 11개의 디 짓 넘버가 폰 넘버로서 자동적으로 저장된다. 폰이 유럽에 있을 때, 동일한 11개의 디짓 넘버가 폰넘버로서 자동적으로 저장될 수도 있다. In some embodiments, the mobile phone recognizes the icon as a command to dial the number printed after the icon. In some embodiments, the mobile phone will recognize the phone number and take a predetermined action when scanning the phone number. Part of the possible predetermined action is to make a phone call and store this phone number in the phone address book. In some embodiments, the phone will use the phone's geographic location as the context to help recognize the phone number. For example, the phone number under the North American system consists of ten digits. The phone can use their location context when the phone scans the number to determine if the number is the phone number. When the pawn is in North America, eleven digit numbers are automatically stored as the pawn number. When the phone is in Europe, the same eleven digit numbers may be automatically stored as the phone number.

렌더링 문서의 스캔에 의해 또는 그것을 사용하여 개시되는 모바일 폰 상거래Mobile phone commerce initiated by or using a scan of a rendered document

모바일 폰은 렌더링된 문서로부터 정보를 캡쳐링함으로써 상거래를 개시할 수 있다. 예를 들어, 사용자는 그의 모바일 폰 카메라로 문서로부터의 복수의 라인의 텍스트의 이미지를 캡쳐링하고; 폰은 캡쳐된 데이터내의 키워드를 인식하고; 키워드는 그 키워드와 연관된 프로덕트에 대한 구매 주문을 폰이 전달하도록 하는 폰내의 소프트웨어 애플리케이션을 트리거링한다. 사용자는 그가 상거래를 완료하기를 원하는지 여부를 (폰 키패드등으로) 나타낼 수 있다. 만약 그가 상거래를 완료하기 원한다면, 구매 가격은 사용자의모바일 폰 어카운트에 청구된다. The mobile phone can initiate commerce by capturing information from the rendered document. For example, a user may capture an image of a plurality of lines of text from a document with his mobile phone camera; The phone recognizes the keywords in the captured data; The keyword triggers a software application within the phone that allows the phone to deliver a purchase order for the product associated with that keyword. The user may indicate whether he wants to complete the commerce (eg with a phone keypad). If he wants to complete the commerce, the purchase price is charged to the user's mobile phone account.

모바일 폰 (또는 서비스 프로바이더로 가입을 필요로 하는 임의의 무선 통신 디바이스)은 렌더링된 문서에 기초하여 상거래를 인증하고 완료하기 위해 사용될 수 있다. 예를 들어, 가입자는 웹페이지 구매 폼을 채워 제출할 수 있다. 이에 응답하여, 웹 판매자는 가입자의 컴퓨터에 코드를 다시 전송하고, 이 컴퓨터는 모니터상에 그것을 디스플레이한다. 그다음, 가입자는 그의 모바일 폰으로서 컴퓨터 모니터를 포토그래핑하고 이 이미지를 셀룰러 네트워크를 통해 판매자에게 전송한다. 판매자가 모바일 폰 메시지를 수신할 때, 판매자는 모바일 폰 어카운트가 웹페이지상에 제출된 정보와 매칭하는지를 검사하여 사용자를 인증하고 상거래를 완료할 수 있다. The mobile phone (or any wireless communication device requiring subscription with a service provider) can be used to authenticate and complete commerce based on the rendered document. For example, a subscriber can fill out and submit a webpage purchase form. In response, the web seller sends the code back to the subscriber's computer, which displays it on the monitor. The subscriber then photographs the computer monitor as his mobile phone and sends this image to the seller over the cellular network. When the seller receives the mobile phone message, the seller may check that the mobile phone account matches the information submitted on the webpage to authenticate the user and complete the commerce.

일부 실시예에서, 가입자는 DTMF 또는 보이스 입력에 의해 p-상거래가 완료될 수 있는 서버에 가입자를 접속시키는 다이얼업 넘버를 호출할 수 있다. In some embodiments, the subscriber may call a dialup number that connects the subscriber to a server where p-commerce can be completed by DTMF or voice input.

모바일 mobile 폰과Pawn and 조합하여 스캐너 기능을 사용하는 방법 How to use the scanner features in combination

일부 실시예에서, 모바일 폰은 스캐닝을 위한 중간 플랫폼(호스트 디바이스)이다. 예를 들어, 이러한 상황은 BlueTooth™ 스캐너가 모바일 폰에 접속될 때 발생할 수 있다. 모바일 폰이 중간 플랫폼으로서 기능하고 있을 때, 폰은 문서 인덱스, 마크업 문서 및, 폰과 함께 사용되고 있는 사용자/스캐너 전용 사용자 어카운트 데이터를 저장할 수 있다. In some embodiments, the mobile phone is an intermediate platform (host device) for scanning. For example, this situation can occur when a BlueTooth ™ scanner is connected to a mobile phone. When the mobile phone is functioning as an intermediate platform, the phone can store document indexes, markup documents, and user / scanner specific user account data being used with the phone.

일부 실시예에서, 셀룰러 폰 시스템은 임의의 음성 콜로 데이터 채널을 개방한다. 호출자는 그의 셀폰으로 문서의 픽쳐를 취하거나 (또는 문서를 식별하기 위해 충분한 데이터를 캡쳐링하고, 예를 들어, 텍스트의 스트링을 문서로부터 스캐닝하고), 이 캡쳐링된 데이터는 음성 채널로 펑쳐링되고, 수신 폰내의 소프트웨어는 이 스트링을 복구하고 이것을 사용하여, 호출자에 의해 스트링이 스캐닝되었던 문서의 전자 카피를 위치시킨다. In some embodiments, the cellular phone system opens the data channel with any voice call. The caller takes a picture of the document (or captures enough data to identify the document, for example, scans a string of text from the document) with his cell phone, and the captured data is punctured into the voice channel. The software in the receiving phone then recovers this string and uses it to locate an electronic copy of the document that the string was scanned by the caller.

일부 실시예에서, 모바일 폰이 폰 넘버 및 이름을 캡쳐링할 때, 이 둘 모두를 폰의 주소록에 저장하도록 사전프로그래밍된다. 단지 하나의 폰 넘버만이 캡쳐링되었을 때, 폰은 자동으로 이 넘버를 다이얼하도록 사전프로그래밍되어 있다. In some embodiments, when the mobile phone captures the phone number and name, it is preprogrammed to store both in the phone's address book. When only one phone number has been captured, the phone is preprogrammed to automatically dial this number.

SMS, MMSSMS, MMS

검색 문의는 모바일 폰에 대한 셀룰러 네트워크의 쇼트 메시지 서비스(SMS) 텍스트 메시징 시스템의 사용에 의해 서비스 프로바이더 또는 네트워크에 효율적으 로 전송될 수 있다. 이미지 및 오디오 파일은 멀티미디어 메시지를 모바일 폰이 송수신하도록 셀룰러 네트워크의 멀티미디어 메시징 서비스(MMS)에 의해 송신될 수 있다. The search query can be efficiently sent to the service provider or network by the use of the Short Message Service (SMS) text messaging system of the cellular network for the mobile phone. The image and audio files may be sent by the multimedia messaging service (MMS) of the cellular network for the mobile phone to send and receive multimedia messages.

모바일 폰 실시예의 관심 태양은 특히, 프래그먼트가 콘텍스트에 의해 더 퀄리파이되는 경우에, 위치를 식별하는데 오직 데이터의 작은 프래그먼트만이 필요하다는 것이 관찰된다는 것이다. 그다음, 이러한 쇼트 프래그먼트 접근은 이전에 문서 이미지를 전송할 수 없었던 제한된 배역폭 채널을 통해 문서 스캔 데이터를 송신하는 놀라운 능력에 갖게 되는데까지 이른다. An aspect of interest of the mobile phone embodiment is that it is observed that only a small fragment of data is needed to identify the location, especially if the fragment is further qualified by the context. This short fragment approach then leads to the incredible ability to send document scan data over a limited bandwidth channel that could not previously transmit a document image.

컴퓨터 마우스Computer mouse

휴대가능한 데이터 캡쳐 디바이스의 일실시예는 스캐너 능력을 갖는 광마우스이다. 일부 실시예에서, 이 광마우스는 스캐닝 및 모션감지를 위해 동일한 광경로를 사용한다. 일부 실시예에서, 광마우스는 스캐닝되고 있는 텍스트를 관찰하기 위한 뷰파인더를 갖고 있다. One embodiment of a portable data capture device is an optical mouse with scanner capabilities. In some embodiments, the optical mouse uses the same optical path for scanning and motion sensing. In some embodiments, the optical mouse has a viewfinder for observing the text being scanned.

이 뷰파인터로 인해 사용자는 스캐너가 어디에 타켓팅되고 있는지를 알 수 있다. 이 뷰파인더를 구현하는데 사용될 수 있는 기술들은, 마우스 밑의 문서를 도시하는 클리어 플라스틱 윈도우; 페리스코프와 유사한 일련의 미러; 스캐너의 실시간 출력을 도시하는 디스플레이; 또는 광섬유 이미지 콘딧이다. This viewfinder lets the user know where the scanner is being targeted. Techniques that can be used to implement this viewfinder include: a clear plastic window showing a document under the mouse; A series of mirrors similar to a periscope; A display showing the real time output of the scanner; Or fiber optic image conduits.

도 20은 마우스 밑의 표면을 드러내는 뷰잉 윈도우(2104)를 구비한 스캐너/마우스(2100)를 도시한다. 이 스캐너/마우스(2100)는 뷰잉 윈도우(2104)가 존재하는 하우징(2102)을 갖고 있다. 이 뷰잉 윈도우는 스캐너/마우스(2100)가 어느 텍 스트를 캡쳐링하고 있는지를 지시하기 위한 타겟(2106)을 가질 수 있다. 이 뷰잉 윈도우는 도 24내에 도시된 미러 어레인지먼트와 함께 사용될 수 있다. 20 shows a scanner / mouse 2100 with a viewing window 2104 revealing the surface under the mouse. This scanner / mouse 2100 has a housing 2102 in which a viewing window 2104 is present. This viewing window may have a target 2106 to indicate which text the scanner / mouse 2100 is capturing. This viewing window can be used with the mirror arrangement shown in FIG. 24.

도 21은 사용자가 무엇이 스캐닝되고 있는지를 볼 수 있도록 하우징(2104)의 상부상에 장착된 디스플레이(LCD, LED등)을 구비한 스캐너/마우스(2100)를 도시하고 있다. 디스플레이(2102)는 실시간으로 광스캐닝 서브시스템의 출력을 도시할 수도 있다. 일부 실시예에서, 프로세서(호스트 컴퓨터의 프로세서 또는 온보드 프로세서)는 광메커니즘의 출력이 디스플레이(2102)에 송신되기 전에 상기 출력을 조작할 수 있다. 도 25 역시 참조하라. FIG. 21 shows a scanner / mouse 2100 with a display (LCD, LED, etc.) mounted on top of housing 2104 so that a user can see what is being scanned. Display 2102 may show the output of the light scanning subsystem in real time. In some embodiments, the processor (processor of the host computer or onboard processor) may manipulate the output before the output of the optical mechanism is sent to the display 2102. See also FIG. 25.

도 22는 전통적인 기계적 x/y 메커니즘 및 광스캐너를 구비한 마우스와 같은, 별개의 포지션-감지부(2210) 및 스캐닝 메커니즘(2220)을 구비한 마우스의 블록도를 도시하고 있다. 제어 로직(2240)은 포지션-감지 메커니즘(2210), 스캐닝 메커니즘(2220), 디스플레이(2230), I/O 서브시스템(2250), 및 메모리(2260)와 동작가능하게 접속되어 있다. 옵셔널 디스플레이(2230)는 사용자에게 스캐닝된 데이터를 보여줄 수 있다. 메모리(2260)는 스캐닝된 데이터 및 명령어를 저장할 수 있다. I/O 서브시스템(2250)은 블루투스 송수신기 또는 USB 포트와같은 무선 또는 유선 통신 수단에 의해 호스트 컴퓨터와 통신한다. 일부 실시예에서, I/O 서브시스템(2250)은 또한 스위치, 키패드 또는 버트과 같은 사용자 입력 디바이스를 포함한다. FIG. 22 shows a block diagram of a mouse with separate position-sensing unit 2210 and scanning mechanism 2220, such as a mouse with a traditional mechanical x / y mechanism and a light scanner. Control logic 2240 is operatively connected with position-sensing mechanism 2210, scanning mechanism 2220, display 2230, I / O subsystem 2250, and memory 2260. The optional display 2230 can show the scanned data to the user. The memory 2260 may store scanned data and instructions. I / O subsystem 2250 communicates with a host computer by wireless or wired communication means such as a Bluetooth transceiver or a USB port. In some embodiments, I / O subsystem 2250 also includes a user input device such as a switch, keypad or butt.

도 23은 x/y 모션을 검출하고 렌더링된 문서로부터 데이터를 스캐닝하는데 사용될 수 있는 광센서 어셈블리(2310)를 구비한 마우스의 블록도를 도시하고 있 다. 제어 로직(2320)은 광어셈블리(2310), I/O 서브시스템(2330), 디스플레이(2350), 및 메모리(2340)와 동작가능하게 접속되어 있다. 제어 로직/프로세서(2320)는 어느 기능(스캐닝 또는 모션 감지)이 필요한지를 결정할 수 있다. 대안으로, I/O 서브시스템(2330)은 x/y 모션 및 스캐닝 기능 사이를 전환하는 사용자 선택가능한 스위치를 포함할 수도 있다. 메모리(2340)는 데이터 및 명령어를 저장할 수 있다. 디스플레이(2350)는 사용자에게 스캐닝된 데이터 및/또는 디바이스 상태(예를 들어, 디바이스가 현재 스캐너 모드인지 또는 마우스 모드인지, 등)를 보여줄 수 있다. FIG. 23 shows a block diagram of a mouse with an optical sensor assembly 2310 that can be used to detect x / y motion and scan data from a rendered document. The control logic 2320 is operatively connected to the optical assembly 2310, the I / O subsystem 2330, the display 2350, and the memory 2340. The control logic / processor 2320 can determine which function (scanning or motion sensing) is required. Alternatively, I / O subsystem 2330 may include a user selectable switch to switch between x / y motion and scanning functions. The memory 2340 may store data and instructions. The display 2350 can show the user scanned data and / or device status (eg, whether the device is currently in scanner mode or mouse mode, etc.).

도 24는 스캐너 헤드 아래에 있는 것의 뷰파인터에 이르는 이미지를 반사하기 위해 일련의 미러(2410)를 사용하는 마우스/스캐너(2400)의 측면도를 도시한다. 광원(2420)은 사용자에 의해 스캐닝되고 있는 렌더링된 문서(2430)의 일부를 조명한다. 광원(2420)로부터의 광의 적어도 일부는 문서(2430)로부터 반사하고 광경로(2440)를 따라 이동하여 사용자가 볼 수 있는 뷰파인더 윈도우(2450)에 이른다. 이러한 대안의 실시예에서, 뷰파인더(2450)는 스캐너 헤드/광원(2420)의 일측상에 놓일 수 있다(도 26 참조).FIG. 24 shows a side view of mouse / scanner 2400 using a series of mirrors 2410 to reflect an image up to the viewfinder of what is under the scanner head. The light source 2420 illuminates a portion of the rendered document 2430 that is being scanned by the user. At least a portion of the light from light source 2420 reflects off document 2430 and travels along light path 2440 to a viewfinder window 2450 that the user can see. In this alternative embodiment, the viewfinder 2450 may be placed on one side of the scanner head / light source 2420 (see FIG. 26).

도 25는 광감지 반도체 칩(2520; CMOS, CCD등)에 동작가능하게 접속된 이미지 콘딧(2510)를 사용하는 마우스/스캐너(2500)의 일예를 도시한다. FIG. 25 shows an example of a mouse / scanner 2500 using an image conduit 2510 operatively connected to a photosensitive semiconductor chip 2520 (CMOS, CCD, etc.).

CCD(2520)의 출력은 직접 디스플레이(2530) 및 프로세서(2540)으로 인가될 수도 있다(대안으로, 상기 출력은 디스플레이(2530)에 루팅되기 전에 처리될 수도 있다). 프로세서(2540)는 CCD(2520), 디스플레이(2530), 메모리(2550) 및 I/O 서 브시스템(2560)과 동작가능하게 접속되어 있다. The output of CCD 2520 may be directly applied to display 2530 and processor 2540 (alternatively, the output may be processed before being routed to display 2530). The processor 2540 is operatively connected to the CCD 2520, the display 2530, the memory 2550, and the I / O subsystem 2560.

도 26은 본질상, 사용자가 스캐닝 헤드 아래를 통과하는 텍스트를 볼 수 있도록 하는 스캐닝 메커니즘(2620)의 일측상의 윈도우(2610)인 뷰파인터를 구비한 마우스/스캐너(2600)의 평면도를 도시하고 있다. 프로세서(2630)는 스캐닝 메커니즘(2620), 메모리(2640), I/O 서브시스템(2650), 및 전원(2660)과 동작가능하게 접속되어 있다. 전원(2660)은 보통 무선으로 통신하는 스캐너내에 포함되지만 옵션으로 유선 마우스에 포함도리 수 있다. FIG. 26 shows a plan view of a mouse / scanner 2600 with a view finder, which is a window 2610 on one side of the scanning mechanism 2620 that allows a user to see text passing under the scanning head in nature. . The processor 2630 is operatively connected with the scanning mechanism 2620, the memory 2640, the I / O subsystem 2650, and the power supply 2660. The power supply 2660 is usually included in a scanner that communicates wirelessly but may optionally be included in a wired mouse.

USB 포트를 구비한 스캔 헤드 Scan head with USB port 액세사리Accessories

어댑터 포트를 구비한 스캐닝 액세사리는 휴대가능한 데이터 캡쳐 디바이스의 또 다른 예이다. 스캐닝 액세사리는 모바일 폰 또는 PDA와 같은 다른 디바이스상의 적합한 커넥터에 플러그인되어 상기 디바이스에 스캐닝 능력을 업그레이드할 수 있다. 일부 실시예에서, 액세사리는 단지 광 캡쳐 서브시스템 및 어댑터(이 어댑터를 통해 전력을 전달한다)를 갖고 있다. 일부 실시예에서, 액세사리는 제어 로직, 메모리, 및 전원을 포함하고 있다. Scanning accessories with adapter ports are another example of a portable data capture device. The scanning accessory can be plugged into a suitable connector on another device, such as a mobile phone or PDA, to upgrade the scanning capability to that device. In some embodiments, the accessory only has an optical capture subsystem and an adapter (which delivers power through this adapter). In some embodiments, the accessory includes control logic, memory, and a power supply.

스캐노테이터Scanotator

일부 실시예에서, 시스템은 렌더링된 문서의 전자 카운터파트내의 선택된 포지션으로 오디오 주석을 타겟팅한다("시스템"). 페이퍼 문서내의 선택된 포인트로 구술된 주석을 타겟팅하기 위해, 사용자는 선택된 포인트에서 텍스트의 일부를 스캐닝하기 우해 핸드헬드 광스캐너를 사용한다. 그다음, 사용자는 광스캐너내의 마이크로폰에 의해 캡쳐링되고 텍스트의 스캐닝된 부분과 연관되어 저장되는 주석을 구술한다. In some embodiments, the system targets the audio annotation to the selected position within the electronic counterpart of the rendered document (“system”). To target the dictated annotation to a selected point in the paper document, the user uses a handheld optical scanner to scan a portion of the text at the selected point. The user then dictates the annotation captured by the microphone in the optical scanner and stored in association with the scanned portion of the text.

스캐너는 다양한 타입의 유선 또는 무선 커넥션을 통해 컴퓨터 시스템 또는 유사한 디바이스에 접속되거나 통신할 수 있다. 일단 접속되면, 저장된 연관이 예를 들어, 문서의 전자 버전으로 선택된 포인트를 디스플레이하면서 주석을 재생하고, 문서의 전자 버전으로 선택된 포인트과 관련하여 보이스 인식을 통해 취득된 주석의 텍스츄얼 버전을 디스플레이하고, 이 주석에 따라 선택된 포인트에서 문서의 전자 버전을 자동 리바이싱하고, 선택된 포인트에서 전자 문서내의 오디오 파일로서 상기 주석을 임베딩하고, 오디오 주석을 포함하는 연관된 오디오 파일에 포인터(예를 들어, 하이퍼링크등)를 삽입하는 등에 사용될 수 있다. 일부 실시예에서, 스캐닝된 텍스트가 보다 방대한 전자 문서사이로부터 문서를 식별 및/또는 위치시키는데 사용될 수 있다. 대안으로, 다른 접근이 상기 문서를 식별하는데 사용될 수 있다. The scanner may be connected to or communicate with a computer system or similar device through various types of wired or wireless connections. Once connected, the stored association plays a annotation, for example displaying a point selected with the electronic version of the document, displays a textual version of the annotation acquired through voice recognition with respect to the point selected with the electronic version of the document, According to this annotation, the system automatically re-elects the electronic version of the document at the selected point, embeds the annotation as an audio file within the electronic document at the selected point, and points to an associated audio file containing the audio annotation (e.g. ) Can be used for insertion. In some embodiments, the scanned text can be used to identify and / or locate documents from a wider electronic document. Alternatively, other approaches can be used to identify the document.

임의의 실시예에서, 스캐너는 주석 사이를 항해하는 컨트롤과 같은 컨트롤을 포함한다. 스캐너가 컴퓨터 시스템에 접속될 때, 내비게이션 컨트롤은 컴퓨터 시스템상에 디스플레이된 문서내의 주석 사이를 항해할 수 있다. 스캐너가 컴퓨터 시스템에 접속되어 있지 않을 때, 내비게이션 컨트롤은 스캐너에 저장된 주석 사이를 항해하거나, 스캐너의 메모리내의 그러한 주석을 리뷰, 리바이스, 또는 삭제할 수 있다. In some embodiments, the scanner includes controls such as controls to navigate between annotations. When the scanner is connected to a computer system, the navigation control can navigate between the annotations in the document displayed on the computer system. When the scanner is not connected to a computer system, the navigation control can navigate between the annotations stored in the scanner, or review, recall, or delete such annotations in the scanner's memory.

상술된 기능의 일부 또는 모두를 제공함으로써, 시스템은 전자 문서의 렌더링된 카피를 사용하여 전자 문서에 편리하고 정확하게 주석을 사용자가 달 수 있도 록 한다. By providing some or all of the functions described above, the system allows a user to conveniently and accurately annotate an electronic document using a rendered copy of the electronic document.

도 27은 샘플 핸드헬드 문서 데이터 캡쳐 디바이스의 모습의 사시도이다. 페이퍼 문서를 판독하는 동안, 사용자는 전자 오리지널의 편집 또는 다른 인터랙션을 필요로 하는 문서내의 타이핑 또는 스펠링 에러, 사실에 근거한 부정확도, 또는 다른 이슈를 인지할 수 있다. 사용자는 주석 디바이스상의 SCAN 버튼(2701)을 누르고 광센서(2711)를 사용하여 컨텍스트를 캡쳐링하기 위해 문서의 몇 단어를 스캔한다. 임의의 실시예에서, 시각 지시기(2721)가 스캐닝된 텍스트가 인식되었는지, 및/또는 스캐닝된 텍스트가 렌더링된 문서에 상응하는 전자 문서 및/또는 이러한 문서내의 단일 위치를 식별하는데 충분한지 또는 충분할 가능성이 높은지 여부를 지시한다. 그다음, 사용자는 빌트인 마이크로폰을 사용하여 음성 주석을 기록하기 위해 REC 버튼(2702)을 누른다. 버튼(2703)을 누름으로써 빌트인 스피터(2731)를 사용하여 주석을 사용자가 리뷰할 수 있고, REC 버튼(2702)을 다시 누름으로써 그 주석을 겹쳐쓰기할 수 있다. 27 is a perspective view of a state of a sample handheld document data capture device. While reading a paper document, a user may be aware of typing or spelling errors, factual inaccuracies, or other issues in the document that require editing or other interaction of the electronic original. The user presses the SCAN button 2701 on the annotation device and scans a few words of the document to capture context using an optical sensor 2711. In some embodiments, the visual indicator 2721 is likely or sufficient to identify whether the scanned text was recognized and / or to identify the electronic document and / or a single location within such document that the scanned text corresponds to the rendered document. Indicates whether this is high. The user then presses the REC button 2702 to record the voice annotation using the built-in microphone. The user can review the annotation using the built-in speaker 2731 by pressing the button 2703 and overwrite the annotation by pressing the REC button 2702 again.

사용자가 인쇄된 문서의 리부를 마쳤을 때, 사용자 (또는 어시스턴트)는 직접 또는 USB 포트가 불편하게 위치되어 있다면 익스텐션 케이블을 통하여 (주석 디바이스의 내부 배터리를 역시 재충전시킬 수 있는) 컴퓨터상의 USB 포트내로 주석 디바이스의 USB 커넥터(2741)를 플러깅한다. 그것을 단순히 플러깅함으로써 문서를 편집하기 위한 적합한 소프트웨어 패키지가 적합한 문서를 론칭하고, 로딩하고, 제1 주석의 포인트에서 편집 커서를 배치할 수 있고, 심지어 스캐닝되었던 단어를 선택할 수도 있다. 그다음, 사용자는 기록된 주석을 듣기 위하여 PLAY 버튼(103) 을 누르고 정상적으로 방법으로 텍스트에 임의의 필요한 편집을 행할 수 있다. 사용자는 NEXT 버튼(2705)를 눌러 다음 주석으로 스킵하고, 그다음, 다시 PLAY 등을 누를 수 있다. When the user finishes reprinting the printed document, the user (or assistant) can be annotated directly into the USB port on the computer (which can also recharge the internal battery of the tin device) via an extension cable, if the USB port is inconveniently located Plug the USB connector 2741 of the device. By simply plugging it in, a suitable software package for editing the document can launch the appropriate document, load it, place the edit cursor at the point of the first annotation, and even select the scanned word. Then, the user can press the PLAY button 103 to hear the recorded comment and make any necessary edits to the text in the normal way. The user can press the NEXT button 2705 to skip to the next comment and then press PLAY again.

REC 버튼(2702)은 예를 들어, 어시스턴트가 오리지널 주석 또는 이들의 편집된 버전의 적합성에 관한 질문을 갖고 있다면, 동일한 위치에 추가 주석을 더하는데 사용될 수 있다. REC button 2702 can be used to add additional annotations to the same location, for example, if the assistant has questions regarding the suitability of the original annotations or their edited version.

SCAN 버튼(2701)은 PC에 접속될 때, 주석 달렸고 더이상 필요하지 않다는 것을 지시하기 위해 'DONE' 버튼으로서 사용될 수 있다. 임의의 실시예에서, 동일한 버튼이 디바이스가 페이퍼와 접촉하고 있을 때 스캐닝을 트리거링하고 그렇지 않을 때 오디오 기록을 트리거링한다. 임의의 실시예에서, 디바이스는 광센서(2711)가 페이퍼와 접촉하고 있을 때를 검출하기 위해 광센서(2711) 근방에 센서 또는 버튼(명료성을 위해 도시되지 않았다)을 가질 수 있다. SCAN button 2701 can be used as a 'DONE' button to indicate that when connected to a PC, it is annotated and no longer needed. In some embodiments, the same button triggers scanning when the device is in contact with the paper and triggers audio recording when it is not. In some embodiments, the device may have a sensor or button (not shown for clarity) near the light sensor 2711 to detect when the light sensor 2711 is in contact with the paper.

임의의 실시예에서, 주석 디바이스는 후방부에 클립을 가지고 있어, 오디오 Post-It® 노트의 세트로서 기능하기 위해 주석된 문서에 클립핑될 수 있다. In some embodiments, the annotation device has a clip at the back so that it can be clipped to the annotated document to function as a set of audio Post-It® notes.

도 28은 주석 디바이스(2800)의 일실시예의 블록도를 도시하고 있다. 이 디바이스는 페이퍼 문서로부터 텍스트의 이미지를 캡쳐링하기 위한 광스캐닝 헤드(2816) 및 텍스트와 연관된 보이스 주석을 캡쳐링하기 위한 마이크로폰(2802)을 포함하고 있다. 이러한 입력 디바이스로부터 캡쳐링된 데이터는 중앙 컨트롤 디바이스(2810)에 의해 처리되는 것이 가능하고, 메모리(2814)내에 저장된다. 하나 이상의 버튼(2812)가 이러한 프로세스를 사용자가 제어하기 위해 제공되고, 여기에 LED로 도시된 일부 시각 지시기(2804)는 사용자에게 피드백을 준다. 물론, 시각 지시기는 예를 들어, 액정 디스플레이(LCD)와 같은 임의의 적합한 유저 인터페이스일 수도 있다. 28 shows a block diagram of one embodiment of an annotation device 2800. The device includes a light scanning head 2816 for capturing an image of text from a paper document and a microphone 2802 for capturing voice annotations associated with the text. Data captured from such an input device can be processed by the central control device 2810 and stored in memory 2814. One or more buttons 2812 are provided for the user to control this process, and some visual indicators 2804 shown here as LEDs provide feedback to the user. Of course, the visual indicator may be any suitable user interface such as, for example, a liquid crystal display (LCD).

선택적으로, 상기 디바이스는 사용자에게 음성 주석이 재싱되고 다른 오디오 피드백이 주어질 수 있도록 하는 라우드스피커(2806)을 포함한다. Optionally, the device includes a loudspeaker 2806 that allows the user to be re-voiced and to be given other audio feedback.

인터페이스(2808)는 PC 도는 다른 처리 디바이스에 전송될 수 있도록 하는 인터페이스(2808)는 여기에 USB로서 도시되어 있지만, 방화벽, Bluetooth™, 802.11, 적외선, 이더넷 또는 다른 유선 또는 무선 통신 기술일 수 있다. USB와 같은 유선 기반 통신 기술은 또한 배터리와 같은 내부 전력 소스를 충전을 위해 또는 즉시 연산을 위해 디바이스에 전력을 제공할 수 있다. Interface 2808 is shown here as USB, which allows it to be transmitted to a PC or other processing device, but may be a firewall, Bluetooth ™, 802.11, infrared, Ethernet or other wired or wireless communication technology. Wired-based communication technologies such as USB can also provide power to the device for charging or for immediate computation of an internal power source such as a battery.

도 29는 통신(2902), 보통 USB 포트를 통해 PC(2900)와같은 처리 디바이스에 접속된 디바이스(2800)를 도시하고 있다. 모니터링 시스템(2904)은 상기 디바이스가 접속될 때를 검출하여 상기 디바이스와 통신하고 최종 연산을 코디네이팅하는 책임을 갖고 있다. 보통 이것은 심볼 또는 텍스트 형태로 분석 및 전환을 위해 상기 디바이스로부터 서브시스템(2906)으로 상기 캡쳐링된 이미지를 검색하는 단계, 적합한 문서를 위치시키는 검색 서브시스템(2908)으로 최종 텍스트를 통과시키는 단계, 및 사용자가 이들을 보거나, 편집하거나 또는 상화작용할 수 있도록 하는 애플리케이션(2910)으로 통과되는 문서의 디테일을 처리하는 단게를 수반한다. 또한, 모니터링 시스템(2904)은 애플리케이션을 제어하여, 예를 들어, 모니터링 시스템(2904)가 문서를 이전의 스캠의 위치로 스크롤링하도록 할 수 있다. 캡쳐링된 오디오 주석은 사용자에게 재생되기 위해 PC의 오디오 시스템(2912)으로 전달될 수 있다. 이 오디오 시스템(2912)은 아날로그 오디오를 디지털 형태로 또는 그 반대로 변환시키기 위해 디지털-아날로그 및/또는 아날로그-디지털 변환 능력을 가질 수 있다. FIG. 29 shows device 2800 connected to a processing device, such as PC 2900, via a communication 2902, usually a USB port. The monitoring system 2904 is responsible for detecting when the device is connected, communicating with the device and coordinating the final operation. Normally this involves retrieving the captured image from the device to subsystem 2906 for analysis and conversion in symbol or text form, passing the final text to retrieval subsystem 2908 to locate a suitable document, And processing the details of the documents passed to the application 2910 that allows the user to view, edit or interact with them. In addition, the monitoring system 2904 can control the application, such that, for example, the monitoring system 2904 can scroll the document to the location of the previous scam. The captured audio annotation can be delivered to the audio system 2912 of the PC for playback to the user. The audio system 2912 may have digital-to-analog and / or analog-to-digital conversion capability to convert analog audio into digital form or vice versa.

이러한 프로세스의 많은 구성요소가 디바이스(2800)가 기본 레벨의 정교함보다 많은 것을 가지고 있다면 디바이스(2800)상에서 일어나는 것이 가능하다. PC상에 2906으로 도시된, 임의의 통합된 텍스트의 인식 및 이미지의 해석은 디바이스(2800)상에서 그 PC(2900)로의 접속 전에 또는 그 접속 동안 디바이스(2800)에서 완료되거나 부분적으로 완료될 수 있어서, 예를 들어, 이미지 자체 대신에 또는 이미지 자체는 물론 PC(2900)에 전달되는 것은 텍스트 또는 일부 다른 유도 데이터이다. 이와 마찬가지로, 오디오 주석은 디바이스내에 내장된 오디오 팩실리티(2906)를 통해 사용자에게 재생될 수 있고, 프로세스로의 유저 인터페이스는 디바이스(2800)상의 버튼을 통해 부분적으로 또는 전체적으로 동작될 수 있다. It is possible for many components of this process to occur on device 2800 if device 2800 has more than a base level of sophistication. Recognition of any integrated text and interpretation of the image, shown at 2906 on the PC, may be completed or partially completed at the device 2800 prior to or during the connection to the device 2800 on the device 2800. For example, it is text or some other derived data that is passed to the PC 2900 as well as instead of the image itself. Similarly, the audio annotation can be played back to the user via the audio facility 2906 embedded in the device, and the user interface to the process can be operated in part or in full through a button on the device 2800.

다시, 도 28에서, 임의의 실시예에서, 디바이스의 스캐닝 헤드(2816)는 페이로부터는 물론 컴퓨터 디스플레이와 같은 디스플레이 디바이스로부터의 이미지를 캡쳐링할 수 있다. Again, in FIG. 28, in any embodiment, the scanning head 2816 of the device may capture an image from a display device, such as a computer display, as well as from the pay.

주석을 달 텍스트가 페이퍼상의 단어의 이미지를 스캐닝하고 해석하는 대신에 디바이스의 마이크로폰(2802)내에 사용자에의해 크게 읽혀진 스피치 프래그먼트를 캡쳐링하고 인식함으로써 식별될 수 있다. Annotated text can be identified by capturing and recognizing speech fragments read aloud by the user in the device's microphone 2802 instead of scanning and interpreting the image of the word on the paper.

대안의 실시예에서, 마이크로폰(2802)이 텍스트 및 주석 모두를 캡쳐링하기 위해 사용되어, 스캐닝 헤드(28160가 생략될 수 있고, 상술된 이미지-압축 및 OCR 스테이지가 오디오-처리 및 스피치 인식 스테이지로 대체될 수 있다. 이러한 실시에에서, 사용자는 주석을 위한 요구되는 위치에서 텍스트 및 주석을 마이크로폰내에 크게 읽는다. 임의의 실시예에서, 사용자는 어느 오디오가 위치를 마킹하고 있고 어느 것이 주석인지를 지시하기 위해 디바이스의 유저 인터페이스(사용자에게 정보를 제시하고 사용자로부터의 입력을 수신하기 위한 버튼, 디스프레이, 키패드, 마이크로폰등)을 조작할 수 있다. 나중에 PC(9200)는 렌더링된 문서와 연관된 전자 문서를 식별하기 위해 적합한 오디오를 텍스트로 전환시키고 이러한 텍스트를 사용할 수 있다. 전자 문서가 주석 위치를 마킹하는 텍스트열을 통해 식별된 후에, PC(2900)는 주석이 적합한 삽입 포인트에서 전자 문서내로 삽입될 수 있도록 할 수 있다. In an alternative embodiment, the microphone 2802 is used to capture both text and annotations so that the scanning head 28160 can be omitted, and the image-compression and OCR stage described above is replaced by an audio-processing and speech recognition stage. In this embodiment, the user reads text and annotations loudly into the microphone at the required location for the annotation In some embodiments, the user indicates which audio is marking the position and which is the annotation To manipulate the device's user interface (buttons, displays, keypads, microphones, etc. to present information to and receive input from the user). You can convert the appropriate audio into text and use this text to identify it. After is identified through the text string marking the annotation location, PC 2900 may enable the annotation to be inserted into the electronic document at the appropriate insertion point.

도 30은 컴퓨터 시스템 및 이 시스템이 실행되는 다른 디바이스의 적어도 일부에 보통 통합된 컴포넌트의 일부를 도시하는 블록도이다. 이러한 컴퓨터 시스템 및 디바이스(300)는 컴퓨터 프로그램을 실행하기 위한 하나 이상의 중앙 처리 장치("CPU"; 3001); 데이터 구조를 포함하는 데이터 및 프로그램이 사용되고 있는 동안에 이들을 저장하기 위한 컴퓨터 메모리(3002); 프로그램 및 데이터를 영구 저장하기 위한 하드 드라이브와같은 영구 저장 디바이스(3003); 컴퓨터 판독가능 매체에 저장된 프로그램 및 데이터를 판독하기 위한 CD-ROM 드라이브와 같은 컴퓨터 판독가능 매체 드라이브(3004); 데이터 구조를 포함하는 데이터 및/또는 프로그램을 교환하기 위해 인터넷과 같은 다른 컴퓨터 시스템에 상기 컴퓨터 시스템을 접속시 키기 위한 네트워크 커넥션(3005); 및 USB 커넥터 또는 다른 적합한 버스 커넥터와 같은 데스크톱 버스 커넥터(3006)를 포함할 수 있다. CPU에 의해 실행되는 프로그램은 광학식 문자 인식("OCR") 소프트웨어와 같은 스캐닝된 이미지를 인식하기 위한 소프트웨어 및/또는 음성 인식 소프트웨어와 같은 구술된 오디오를 인식하기 위한 소프트웨어는 물론, 시스템과 연관되고 여기에 어딘가에 기술된 프로그래을 포함할 수 있다. 상술된 바와 같이 구성된 컴퓨터 시스템이 보통 시스템의 동작을 지원하기 위해 사용되었지만, 당업자는 이 시스템이 다양한 타입 및 구성, 그리고 다양한 컴포넌트를 가진 디바이스를 사용하여 구현될 수 있음을 이해할 것이다. 30 is a block diagram illustrating a portion of a component that is typically integrated into at least a portion of a computer system and other devices on which the system is run. Such computer system and device 300 may include one or more central processing units (“CPUs”) 3001 for executing computer programs; Computer memory 3002 for storing data and programs including data structures while they are in use; Permanent storage device 3003, such as a hard drive, for permanent storage of programs and data; A computer readable medium drive 3004 such as a CD-ROM drive for reading programs and data stored on the computer readable medium; A network connection 3005 for connecting the computer system to another computer system, such as the Internet, for exchanging data and / or programs comprising a data structure; And a desktop bus connector 3006, such as a USB connector or other suitable bus connector. Programs executed by the CPU may be associated with and associated with the system, as well as software for recognizing scanned images, such as optical character recognition (“OCR”) software, and / or software for recognizing spoken audio, such as speech recognition software. It may include the program described elsewhere. Although a computer system configured as described above is commonly used to support the operation of the system, those skilled in the art will understand that the system can be implemented using devices having various types and configurations, and various components.

도 31은 전자 문서에 주석을 달기 위해 시스템에 의해 사용된 전형적인 프로세스를 도시하는 순서도이다. 단계(3101)에서, 시스템은 단어의 작은, 이어지는 시퀀스와같은, 렌더링된 문서의 일부를 스캐닝한다. 단계(3102)에서, 시스템은 렌더링된 문서의 스캐닝된 부분에 관한 오디오 주석을 입력한다. 단계(3103)에서, 보다 많은 주석이 존재한다면, 시스템은 또 다른 주석을 구성하기 위해 단계(3101)에서 계속하고, 그렇지 않으면, 시스템은 단계(3104)로 진행한다. 단계(3104)에서, 시스템은 단계(3101)에서 스캐닝된 문서 부분 및 이들의, 컴퓨터 시스템으로 단계(3102)에서 입력된 오디오 주석을 업로딩한다. 단계(3105)에서, 시스템은 렌더링된 문서에 상응하는 디지털 문서를 식별한다. 임의의 실시예에서, 시스템은 렌더링된 문서로부터 전자 문서의 유니버스의 컨텐츠로 스캐닝된 하나 이상의 문서 부분내의 텍스트를 비교함으로써, 렌더링된 문서에 상응하는 디지털 문서를 식별한다. 31 is a flow chart illustrating an exemplary process used by the system to annotate an electronic document. In step 3101, the system scans a portion of the rendered document, such as a small, subsequent sequence of words. In step 3102, the system enters audio annotations regarding the scanned portion of the rendered document. In step 3103, if more annotations are present, the system continues at step 3101 to construct another annotation, otherwise the system proceeds to step 3104. In step 3104, the system uploads the portion of the document scanned in step 3101 and the audio annotation input in step 3102 to the computer system. In step 3105, the system identifies a digital document corresponding to the rendered document. In some embodiments, the system identifies the digital document corresponding to the rendered document by comparing text in the one or more document portions scanned from the rendered document into the content of the universe of the electronic document.

단계(3106)에서, 시스템은 업로딩된 주석에 따라 단계(3105)에서 식별된 디디털 문서를 수정한다. 일부 실시예에서, 단계(3106)는 상응하는 스캐닝된 부분 근방에 디지털 문서내의 포인트로 오디오 클릭으로서 각 주석을 첨부시키는 단계를 수반한다. 임의의 실시예에서, 단계(3106)는 그러한 포인트에서 디지털 문서로 상기 주석의 음성 인식된 텍스츄얼 버전을 첨부하는 단계를 수반한다. 임의의 실시예에서, 단계(3106)은 주석의 음성 인식된 컨텐츠에 기초하여, 식별된 디지털 문서의 스캐닝된 부분에 대해 편집을 자동으로 행하는 단계를 수반한다. 음성 인식이 사용되는 경우에, 주석이 업로딩되기 전 또는 후에 실행될 수 있다. 단계(3106)후에, 이러한 단계는 종료한다. In step 3106, the system modifies the digital document identified in step 3105 according to the uploaded annotation. In some embodiments, step 3106 involves attaching each annotation as an audio click to a point in the digital document near the corresponding scanned portion. In some embodiments, step 3106 involves attaching a speech recognized textual version of the annotation to the digital document at such a point. In some embodiments, step 3106 involves automatically editing the scanned portion of the identified digital document based on the voice recognized content of the annotation. If speech recognition is used, it may be executed before or after the annotation is uploaded. After step 3106, this step ends.

물론, 단계(3106)는 모든 실시예에서 나타나지 않고, 상술된 것과 다른 실시예와 상이할 수 있다. 예를 들어, 임의의 실시예에서, 미래 사용을 위해 주석을 업로딩하고 저장하는 단계는 충분할 수 있고, 주석은 디지털 오리지널로부터 별도로 저장될 수 있다. 특히, 이 오리지널을 수정하는 것은, 예를 들어, 주석이 충분한 특권을 가지지 않거나 CD와 같은 기록불능 매체인 이유로 인해, 불가능하다. Of course, step 3106 does not appear in all embodiments and may be different from the embodiments described above. For example, in some embodiments, uploading and storing the annotation for future use may be sufficient, and the annotation may be stored separately from the digital original. In particular, modifying this original is not possible, for example, because the comment does not have sufficient privileges or is a non-writable medium such as a CD.

당업자는 도 31에 도시된 단계가 다양한 방법으로 대체될 수 있음을 이해할 것이다. 예를 들어, 단계의 순서는 재배열될 수 있거나, 서브스텝이 병렬로 실행될 수 있거나, 도시된 단계가 생략될 수 있거나, 또는 다른 단계가 포함될 수도 있다. Those skilled in the art will appreciate that the steps shown in FIG. 31 may be substituted in various ways. For example, the order of the steps may be rearranged, the substeps may be executed in parallel, the illustrated steps may be omitted, or other steps may be included.

도 32는 사용자에 의해 입력된 주석을 표시하기 위해 시스템에 의해 사용된 샘플 주석 테이블(3200)을 도시하는 테이블 도면이다. 임의의 실시예에서, 시스템 은 주석 테이블(3200)의 버전을 주석 디바이스(2800) 및/또는 컴퓨터 시스템(2900)내에 저장한다. 임의의 실시예에서, 시스템은 주석 디바이스(2800)로부터 컴퓨터 시스템(2900)으로 주석 테이블(3200)의 버전을 업로딩한다. 32 is a table diagram illustrating a sample annotation table 3200 used by the system to display annotations entered by a user. In some embodiments, the system stores a version of the annotation table 3200 in the annotation device 2800 and / or computer system 2900. In some embodiments, the system uploads a version of the annotation table 3200 from the annotation device 2800 to the computer system 2900.

주석 테이블(3200)은 행(3201-3203)과 같은, 각 생성된 주석에 대한 행을 포함한다. 주석 테이블(3200)의 행은, 상이한 렌더링된 문서와 관련된 시퀀스 넘버 구별 주석을 포함하는 문서 시퀀스 넘버 열(3211); 스캐닝 동안 캡쳐링된 생 또는 처리된 이미지 데이터 또는 주석에 대하여 스캐닝된 텍스트의 인식된 텍스츄얼 버전을 포함하는 스캐닝된 텍스트 열(3212); 및 주석에 대하여 캡쳐링된 생 또는 처리된 오디오 데이터 또는 주석의 음성 인식된 텍스츄얼 버전을 포함하는 주석 열(3213)을 포함하는 열과 교차한다. 예를 들어, 행(3201)은 주석이 달린 제1 문서에서, 사용자가 이러한 렌더링된 문서에서 텍스트 "idealized husbanary practices"를 스캐닝하였고, 오디오 주석 "audio citation to Huff reference"을 첨부하였음을 지시한다. Annotation table 3200 includes a row for each generated annotation, such as rows 3201-3203. A row of the annotation table 3200 may include a document sequence number column 3211 that includes sequence number distinct annotations associated with different rendered documents; A scanned text string 3212 containing a recognized textual version of the scanned text for raw or processed image data or annotations captured during scanning; And a comment column 3213 that includes a captured or processed audio data or a speech-recognized textual version of the comment captured for the comment. For example, line 3201 indicates that in the annotated first document, the user scanned the text “idealized husbanary practices” in this rendered document and attached the audio annotation “audio citation to Huff reference”.

도 32가 컨텐츠 및 오거니제이션이 인간 리더기에 의해 보다 인식가능하도록 설계된 컨텐츠 및 오거니제이션을 갖는 테이블을 도시하고 있지만, 당업자는 이러한 정보를 저장하기 위해 시스템에 의해 사용된 실제 데이터 구조가 예를 들어, 상이한 방식으로 구성될 수 있고; 도시된 것보다 많거나 적은 정보를 포함할 수 있고; 압축되고 및/또는 암호화될 수 있는 등, 도시된 테이블과 상이할 수 있다는 것을 이해할 것이다. 예를 들어, 임의의 실시예에서, 시스템은 각각의 스캔에 대한 타임스탬프 및/또는 위치 스탬프를 포함할 수 있다. 필요한 타임 및 위치 정보는 온보드 글로벌 전지구 측위 시스템(GPS) 능력으로부터 얻어질 수도 있고, 무선 통신 능력을 갖고 있는 디바이스(2800)의 경우에 무선 통신망으로부터 얻어질 수도 있다. Although FIG. 32 illustrates a table having content and organization designed so that content and organization are more recognizable by a human reader, those skilled in the art will appreciate, for example, that the actual data structure used by the system to store this information is, for example, Can be configured in different ways; May contain more or less information than shown; It will be appreciated that it may differ from the table shown, such as being compressed and / or encrypted. For example, in some embodiments, the system may include a time stamp and / or location stamp for each scan. The necessary time and location information may be obtained from onboard global global positioning system (GPS) capability, or from a wireless communication network in the case of a device 2800 having wireless communication capability.

상기 시스템은 아래에 설명되는 임의의 샘플 모드를 포함하는 하나 이상의 다양한 모드로 사용될 수 있다. 하나의 샘플 모드에서, 주석 디바이스(2800)는 그 주석 및 스캐닝된 정보 모두를 식별된 문서내로 자동적으로 다운로딩한다. 주석은 노트, 멀티미디어(오디오) 노트 파일등과같은 문서의 이루가 된다. 예를 들어, 사용자는 (사용자의 컴퓨터상의 상응하는 전자 문서를 갖고 있는) 페이퍼 문서로부터 임의의 텍스트를 스캐닝하고, 그래서, 문서에서 주석이 어디에 속해있는지를 지시하고, 이러한 포인트에서 포함되어야 하는 임의의 엑스트라 정보에 대한 음성 주석을 만들게 된다. 나중에, 사용자는 주석 디바이스를 컴퓨터의 USB 포트로 플러깅한다. 임의의 실시예에서, 사용자는 (마이크로소프트 워드와 같은) 문서 편집 애플리케이션내의 문서를 열고 매크로를 론칭하여 주석을 다운로딩하고 이것을 텍스트 또는 내장된 오디오 파일로서, 워드 문서에 첨부한다. The system can be used in one or more of various modes, including any of the sample modes described below. In one sample mode, the annotation device 2800 automatically downloads both the annotation and the scanned information into the identified document. Comments are made up of documents such as notes, multimedia (audio) note files, and so on. For example, a user scans any text from a paper document (having a corresponding electronic document on the user's computer), so that it indicates where the annotation belongs in the document, and which points should be included at this point. A voice comment will be made for the extra information. Later, the user plugs the annotation device into the USB port of the computer. In some embodiments, the user opens a document in a document editing application (such as Microsoft Word), launches a macro to download the annotation, and attaches it as a text or embedded audio file to the word document.

다른 샘플 모드에서, 컴퓨터는 검색 인덱스에 액세스하고 검색 인덱스에 스캐닝된 텍스트를 비교하고, 그래서, 스캐닝된 텍스트를 사용하여 추가 사용자 간섭 없이 정확한 문서를 식별함으로써, 주석이 속해있는 문서를 자동적으로 식별한다. 이러한 자동적인 다운로드 및 "코멘트"로서 상기 문서내에 통합하는 단계는 작업 그룹에 의해 문서의 공동 편집을 위해 사용될 수 있다. 이러한 방식으로 사용될 때, 컴퓨터는 상기 문서내의 각 그룹 멤버의 개별적인 에디트 및 코멘트를 저장한 다. 대안으로, 컴퓨터는 이들을 개별적으로 저장하고 처리하고, 이들을 적합한 형태로 조합한다. In another sample mode, the computer automatically identifies the document to which the annotation belongs, by accessing the search index and comparing the scanned text to the search index, and thus using the scanned text to identify the correct document without further user intervention. . This automatic downloading and incorporation into the document as a "comment" can be used for collaborative editing of the document by the workgroup. When used in this way, the computer saves the individual edits and comments of each group member in the document. Alternatively, the computer stores and processes them separately and combines them in a suitable form.

사용예Example

다음은 핸드헬드 문서 데이터 캡쳐 디바이스의 일부 사용예이다. 이러한 예는 모든 가능한 실시예를 개시할 수 있지만 일부 사용의 대략적인 개관을 제시하고자 의도되었다. The following are some examples of use of the handheld document data capture device. This example may disclose all possible embodiments but is intended to give an overview of some uses.

P-P- 커머스Commerce

임의의 실시예에서, 핸드헬드 문서 데이터 캡쳐 디바이스는 p-커머스 능력 및 애플리케이션을 갖고 있다. 예를 들어, 일부 실시예는 p-커머스 활동과 관련된 아이콘 및 키워드를 인식할 수 있다. 이러한 키워드 및 아이콘은 데이터를 페이퍼 문서로부터 캡쳐링함으로써 물건 서비스를 구매하기 위한 p-커머스 트랜잭션 소프트웨어를 론칭할 수 있다. 일부 실시예에서, 이 디바이스는 $(구매) 마크를 만날 때 구매 상태 도는 모드에 놓이게 된다. 구매 프로세스가 자동화되어 있기 때문에, 일부 실시예는 카탈로그 또는 광고로부터 단일 스캔으로 구매 트랜잭션을 가입자가 완료할 수 있도록 한다. 일부 실시예에서, 디바이스는 크레디트 카드 프로세서와 직접 인터랙팅할 수 있도록 온보드 메모리내에 금융 정보를 저장할 수 있다. In some embodiments, the handheld document data capture device has p-commerce capabilities and applications. For example, some embodiments may recognize icons and keywords related to p-commerce activity. These keywords and icons can launch p-commerce transactional software for purchasing goods services by capturing data from paper documents. In some embodiments, the device is placed in a purchase state or mode when it encounters a $ (purchase) mark. Because the purchase process is automated, some embodiments allow a subscriber to complete a purchase transaction in a single scan from a catalog or advertisement. In some embodiments, the device may store financial information in onboard memory for direct interaction with the credit card processor.

키워드keyword

일부 실시예에서, 핸드헬드 문서 데이터 캡쳐 디바이스는 렌더링된 문서와 결합하여 마크업 문서 및 캐워드의 사용을 지원한다. 일부 실시예는 특정 스캔이 특별히 처리되어야 함을 지시하는 텍스트내의 다양한 보충 마킹(예를 들어, 언더라 인, 폰트, 텍스트의 컬러, 토큰, 아이콘)을 인식한다. 이러한 보충 마킹중 하나와 만났을 때, 디바이스는 검출된 마킹과 연관된 애플리케이션을 실행한다. 다양한 실시예에 의해 지원된 키워드는 회사명 및 상표를 포함한다. 일부 상표 및 아이콘은 전화내에서 코드 또는 텍스트로 전환되어 SMS 또는 다른 텍스트 기반 메시징을 통해 서비스 프로바이더에게 전송된다. 전화내의 캐워드 리스트의 로컬 캐싱은 그래픽스의 텍스트로의 로컬 전환에 대해 유용하다. 대안으로, 그래피컬 상표 및 아이콘은 멀티미디어 메시징을 통해 이미지로서 전송될 수 있다. In some embodiments, the handheld document data capture device supports the use of markup documents and words in combination with the rendered document. Some embodiments recognize various supplemental markings (eg, underline, font, color of text, tokens, icons) in the text indicating that a particular scan should be specially processed. Upon encountering one of these supplemental markings, the device executes an application associated with the detected marking. Keywords supported by various embodiments include company names and trademarks. Some trademarks and icons are converted into codes or text within the phone and sent to the service provider via SMS or other text-based messaging. Local caching of the reward list in the phone is useful for local conversion of graphics to text. Alternatively, graphical trademarks and icons may be transmitted as images via multimedia messaging.

선불prepayment

일부 실시예에서, 휴대가능한 데이터 캡쳐 디바이스는 스캐닝 서비스 프로바이더로부터 서비스에 대해 선불된 가입을 갖는다. 선불된 어카운트는 2개의 주요 장점을 갖고 있다. 첫번째는 선불된 어카운트가 시스템의 익명의 사용을 가능하게 하여, 가입자의 프라이버시를 지켜줄 수 있다는 것이다. 두번째는 선불된 어카운트가 불량 또는 아무런 신용 히스토리를 갖지 못한 사람에게까지 잠재적인 가입자를 확충시킬 수 있다는 것이다. 미리 지불함으로써, 가입자는 그의 신용 히스토리에 관계없이 시스템 서비스를 사용할 수 있다. In some embodiments, the portable data capture device has a prepaid subscription for the service from the scanning service provider. Prepaid accounts have two main advantages. The first is that prepaid accounts can enable anonymous use of the system to protect subscriber privacy. The second is that prepaid accounts can expand potential subscribers to people with bad or no credit history. By paying in advance, the subscriber can use the system service regardless of his credit history.

예를 들어, 고객은 스토어에서 휴대가능한 데이터 캡쳐 디바이스를 구매할 수도 있다. 디바이스는 특정 수의 선불된 트랜잭션에 사용될 수 있다. 디바이스로 실행되는 전형적인 트랜잭션은 전자 문서에 액세스하는 것이다. 그래서, 고객은 선불 디바이스로 로컬 신문으로부터 텍스트를 스캐닝하고 보충 전자물에 즉시 익명으로 액세스할 수도 있다. 스캐닝 서비스 프로바이더는 각 트랜잭션이 차변에 기입되는 선불 디바이스와 연관된 어카운트 파일을 갖고 있다. 고객이 모든 선불 트랜잭션을 사용하였을 때, 그는 스토어를 돌아가거나 (뱅크 ATM 머신등을 통해) 전자 지불을 행함으로써 보다 많은 선불 트랜잭션을 선택적으로 구매할 수 있다. 서비스 프로바이더는 새로 구매한 트랜잭션을 휴대가능한 디바이스의 선불 어카운트 파일에 대변기입한다. For example, a customer may purchase a data capture device that is portable at the store. The device can be used for a certain number of prepaid transactions. A typical transaction executed by a device is to access an electronic document. Thus, a customer may scan text from a local newspaper with a prepaid device and instantly and anonymously access supplemental electronics. The scanning service provider has an account file associated with the prepaid device to which each transaction is debited. When a customer has used all prepaid transactions, he can optionally purchase more prepaid transactions by returning to the store or making electronic payments (via a bank ATM machine, etc.). The service provider credits the newly purchased transaction into the prepaid account file of the portable device.

결론conclusion

상술된 시스템이 다양한 방법으로 채용되거나 확장될 수 있다는 것을 당업자는 이해할 것이다. 상술된 설명이 특정 실시예에 대해 언급하였지만, 본 발명의 범위는 이어지는 청구범위 및 거기에 기재된 요소에 의해서만 한정된다.Those skilled in the art will appreciate that the system described above may be employed or extended in various ways. Although the foregoing description refers to specific embodiments, the scope of the present invention is limited only by the claims that follow and the elements described therein.

Claims

As a portable data capture device,

Control logic;

Memory coupled with control logic for storing instructions and data executable by the control logic;

A data capture element for capturing data from the paper document;

Instructions stored in memory that, when executed by the control logic, cause the data capture element to capture text from a paper document;

Portable data capture device, when executed by the control logic, instructions stored in memory to cause the portable data capture device to automatically submit at least a portion of the captured text to a search engine .

A portable device having scanning capability,

An image capture device for obtaining an image;

A processor for processing an image captured by the image capture device;

A memory for storing an image obtained by the image capture device;

A communication interface for transmitting information to and receiving information from an external device; And

And a user interface operable to instruct a user that sufficient information has been obtained to identify a source document of the acquired image.

3. The portable device of claim 2, further comprising image compression logic for compressing an image captured by the image capture device.

The portable device of claim 2, further comprising an internal power source.

5. The portable device of claim 4, further comprising power management logic for monitoring the extended operational life of the internal power supply and the power consumption of the portable device.

The portable device of claim 2 further comprising an illumination source.

3. The portable device of claim 2, further comprising a location module.

8. The system of claim 7, wherein the location module comprises at least one location capability selected from the group of GPS, A-GPS, DGPS, triangulation, monitoring of local transceiver pilot signals, TDOA, EOTD and angle of arrival. Portable device.

A portable device having a scanning capability, the portable device control method comprising:

Scanning the control data;

Processing the control data;

Identifying an application corresponding to the control data;

Accessing the application in response to the control data; And

And executing the application.

10. The method of claim 9, wherein the application causes the portable device to store data in a memory.

10. The method of claim 9 wherein the application causes the portable device to erase data from memory.

10. The method of claim 9, wherein the application causes the text in the electronic document to be highlighted.

As a portable scanner,

A processor;

A first memory operatively connected with the processor;

And a second memory operatively connected with the process for storing at least one selected from the group of information, content subscription information, service subscription information, and a device identifier.

14. The portable scanner of claim 13, wherein the second memory is a subscriber identity module.

The portable scanner of claim 13, wherein the second memory is a smart card.

A portable device control application in a computer readable medium,

A control identifier identifying a command associated with a short scanned image; And

And a processor that operates on commands associated with the short scanned image.

In a portable device having scanning capability, a method for notifying a user that sufficient data has been scanned from the document to identify the document,

Scanning a symbol from the document;

Estimating how many symbols have been scanned from the document;

Comparing the estimate to a threshold number to identify a document; And

Indicating that enough symbols have been scanned to identify the document.

18. The method of claim 17, wherein the indicating step is a visual signal recognizable by a user of the portable scanning device.

18. The method of claim 17, wherein the threshold number for identifying the document is in the range of 20 to 40 symbols.

18. The method of claim 17, wherein the threshold number is less than 41 symbols.

18. The method of claim 17, wherein said instructing is an auditory signal recognizable by a user of said portable scanning device.

A method in a computing system for processing text capture operations,

Determining that the user used the handheld text capture device to perform a text capture operation from the rendered document that yields a text sequence; And

Acting on the text capture operation.

23. The method of claim 22, wherein the rendered document includes at least one line of text and the text sequence calculated by the text capture operation is a suitable subset of the text contained within a single line of the rendered document. How to.

23. The method of claim 22, wherein the rendered document includes text of at least one page, and wherein the text sequence calculated by the text capture operation is a suitable subset of text contained within a single page of the rendered document. How to.

23. The method of claim 22, wherein the calculated text sequence consists of words, and wherein the text capture operation involves a specific user interaction with each of the words of the calculated text sequence.

27. The method of claim 25, wherein the user speaks each of the words of the calculated text sequence.

27. The method of claim 25, wherein the user directs a selection sensor to each of the words of the calculated text sequence.

23. The method of claim 22, wherein the calculated text sequence consists of ordered words, and the text capture operation involves a capturing physics corresponding to each of the words of the calculated text sequence in the order of the calculated text sequence. The method characterized by the above.

23. The method of claim 22, wherein the calculated text sequence consists of ordered words, and the text capture operation involves a capturing physics corresponding to each of the words of the calculated text sequence in reverse order of the calculated text sequence. The method characterized by the above.

23. The method of claim 22, wherein the text capture operation involves manually moving a selection sensor across the rendered document.

The method of claim 22, wherein the text capture operation involves capturing image data from a non-rectangular region of the rendered document.

23. The method of claim 22, wherein the text sequence produced by the text capture operation includes less than ten words.

23. The method of claim 22, wherein the text capture operation involves capturing an image of less than ten words.