WO2022190900A1 - Image processing apparatus, program, and system - Google Patents

Image processing apparatus, program, and system

Info

Publication number
WO2022190900A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
processor
character
string
information processing
Prior art date
Application number
PCT/JP2022/007871
Other languages
French (fr)
Japanese (ja)
Inventor
杜朗 鳥居
泰弘 大川
一隆 朝日
和也 藤井
翔太 永渕
和久 吉田
裕之 堺
崇 青木
琢磨 赤木
賢太郎 瀬崎
昌孝 佐藤
瑛央 高田
Original Assignee
株式会社 東芝
東芝インフラシステムズ株式会社
Application filed by 株式会社 東芝 and 東芝インフラシステムズ株式会社
Publication of WO2022190900A1 publication Critical patent/WO2022190900A1/en


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/24 Character recognition characterised by the processing or recognition method
    • G06V30/242 Division of the character sequences into groups prior to recognition; Selection of dictionaries
    • G06V30/244 Division of the character sequences into groups prior to recognition; Selection of dictionaries using graphical properties, e.g. alphabet type or font
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/42 Document-oriented image-based pattern recognition based on the type of document
    • G06V30/424 Postal images, e.g. labels or addresses on parcels or postal envelopes

Definitions

  • the embodiments of the present invention relate to information processing devices, programs and systems.
  • a system uses a cloud server to perform character recognition on images acquired by terminals.
  • the terminal sends the acquired image to the cloud server and acquires the result of character recognition from the cloud server.
  • such a system transfers a large amount of data because it must send images from the terminal to the cloud server.
  • in order to solve the above problem, an information processing device, a program, and a system that can reduce the amount of data transferred to the device that performs character recognition are provided.
  • an information processing device includes an image interface, a communication interface, and a processor.
  • An image interface acquires a captured image containing a character string.
  • the communication interface connects to external devices.
  • the processor generates intermediate information composed of information generated in the course of character recognition processing from the captured image, and transmits the intermediate information to the external device through the communication interface.
  • FIG. 1 is a block diagram showing a configuration example of a recognition system according to an embodiment.
  • FIG. 2 is a block diagram showing a configuration example of the OCR device according to the embodiment.
  • FIG. 3 is a block diagram showing a configuration example of a server according to the embodiment.
  • FIG. 4 is a diagram illustrating an operation example of the OCR device according to the embodiment.
  • FIG. 5 is a diagram illustrating a configuration example of intermediate information according to the embodiment.
  • FIG. 6 is a diagram illustrating an operation example of the server according to the embodiment.
  • FIG. 7 is a flowchart illustrating an operation example of the OCR device according to the embodiment.
  • FIG. 8 is a flowchart illustrating an operation example of the server according to the embodiment.
  • FIG. 9 is a diagram showing another operation example of the OCR device according to the embodiment.
  • the recognition system recognizes a character string from an image using character recognition processing (OCR (Optical Character Recognition) processing).
  • the recognition system recognizes the destination of the parcel from an image such as a slip attached to the parcel.
  • a recognition system sorts the packages based on the recognized destination.
  • FIG. 1 shows a configuration example of a recognition system 1 according to an embodiment.
  • the recognition system 1 includes a sorting device 2, a camera 3, a network 6, an OCR device 10, a server 20, and the like.
  • the OCR device 10 connects to the sorting device 2 and camera 3 . Also, the OCR device 10 and the server 20 are connected to the network 6 .
  • the recognition system 1 may further include a configuration according to need, or a specific configuration may be excluded from the recognition system 1.
  • the sorting device 2 sorts the packages thrown in by an operator, a conveyor belt, a robot, or the like.
  • the sorting device 2 receives destination information (character string information) related to the destination (character string) of the parcel from the OCR device 10 .
  • the sorting device 2 sorts packages based on the destination information. For example, the sorting device 2 sorts packages into chutes, pockets, carts, trays, or the like as sorting destinations.
  • the sorting device 2 is composed of a sorter, a conveyor belt, a robot, or the like.
  • the camera 3 shoots the packages that are put into the sorting device 2.
  • the camera 3 photographs the surface on which the destination is displayed.
  • the camera 3 takes an image of the side to which the slip is attached.
  • the camera 3 supplies the captured image (captured image) to the OCR device 10 .
  • the camera 3 is a CCD (Charge Coupled Device) camera.
  • the camera 3 may be provided with a light source for illuminating the baggage.
  • the OCR device 10 (information processing device, first information processing device, external device) acquires the captured image from the camera 3 .
  • the OCR device 10 generates intermediate information related to OCR processing from the captured image.
  • the OCR device 10 transmits the intermediate information to the server 20 and receives from the server 20 destination information related to the destination of the parcel shown in the captured image.
  • the OCR device 10 inputs the received destination information to the sorting device 2 .
  • the OCR device 10 and intermediate information will be detailed later.
  • the network 6 relays communication between the OCR device 10 and the server 20.
  • network 6 is the Internet.
  • the server 20 (information processing device, second information processing device, external device) receives intermediate information from the OCR device 10 .
  • the server 20 generates destination information based on the received intermediate information.
  • the server 20 supplies the generated destination information to the OCR device 10 .
  • the server 20 will be detailed later.
  • FIG. 2 is a block diagram showing a configuration example of the OCR device 10 according to the embodiment.
  • the OCR device 10 includes a processor 11, a ROM 12, a RAM 13, an NVM 14, a communication section 15, an operation section 16, a display section 17, a sorting device interface 18, a camera interface 19, and the like.
  • the processor 11, ROM 12, RAM 13, NVM 14, communication section 15, operation section 16, display section 17, sorting device interface 18 and camera interface 19 are connected to each other via a data bus or the like. It should be noted that the OCR device 10 may have a configuration other than the configuration shown in FIG. 2 as necessary, or a specific configuration may be excluded from it.
  • the processor 11 (first processor) has a function of controlling the operation of the OCR device 10 as a whole.
  • Processor 11 may include an internal cache, various interfaces, and the like.
  • the processor 11 implements various processes by executing programs pre-stored in the internal memory, ROM 12 or NVM 14 .
  • processor 11 controls the functions performed by the hardware circuits.
  • the ROM 12 is a non-volatile memory in which control programs, control data, etc. are stored in advance.
  • the control program and control data stored in the ROM 12 are preinstalled according to the specifications of the OCR device 10 .
  • the RAM 13 is a volatile memory.
  • the RAM 13 temporarily stores data being processed by the processor 11 .
  • RAM 13 stores various application programs based on instructions from processor 11 .
  • the RAM 13 may store data necessary for executing the application program, execution results of the application program, and the like.
  • the NVM 14 is a non-volatile memory in which data can be written and rewritten.
  • the NVM 14 is composed of, for example, a HDD (Hard Disk Drive), SSD (Solid State Drive), flash memory, or the like.
  • the NVM 14 stores control programs, applications, various data, etc. according to the operational use of the OCR device 10 .
  • the communication unit 15 (communication interface, first communication interface) is an interface for connecting to the network 6 . That is, the communication unit 15 is an interface for transmitting/receiving data to/from the server 20 or the like through the network 6 .
  • the communication unit 15 is an interface that supports wired or wireless LAN (Local Area Network) connection.
  • the operation unit 16 receives inputs for various operations from the operator.
  • the operation unit 16 transmits a signal indicating the input operation to the processor 11 .
  • the operation unit 16 may be composed of a touch panel.
  • the display unit 17 displays image data from the processor 11 .
  • the display unit 17 is composed of a liquid crystal monitor.
  • the display section 17 may be formed integrally with the operating section 16 .
  • the sorting device interface 18 is an interface for connecting to the sorting device 2.
  • the sorting device interface 18 transmits signals (eg, destination information) from the processor 11 to the sorting device 2 .
  • the sorting device interface 18 also transmits signals from the sorting device 2 to the processor 11 .
  • the camera interface 19 (image interface) is an interface for connecting to the camera 3.
  • Camera interface 19 transmits signals from processor 11 to camera 3 .
  • the camera interface 19 also transmits signals (such as captured images) from the camera 3 to the processor 11 .
  • FIG. 3 is a block diagram showing a configuration example of the server 20 according to the embodiment.
  • the server 20 includes a processor 21, a ROM 22, a RAM 23, an NVM 24, a communication section 25, an operation section 26, a display section 27, and the like.
  • the processor 21, ROM 22, RAM 23, NVM 24, communication section 25, operation section 26 and display section 27 are connected to each other via a data bus or the like.
  • the server 20 may have a configuration other than the configuration shown in FIG. 3 as necessary, or may exclude a specific configuration from the server 20 .
  • the processor 21 (second processor) has a function of controlling the operation of the server 20 as a whole.
  • Processor 21 may include an internal cache, various interfaces, and the like.
  • the processor 21 implements various processes by executing programs pre-stored in the internal memory, ROM 22 or NVM 24 .
  • processor 21 controls the functions performed by the hardware circuits.
  • the ROM 22 is a non-volatile memory in which control programs, control data, etc. are stored in advance.
  • the control programs and control data stored in the ROM 22 are installed in advance according to the specifications of the server 20 .
  • the RAM 23 is a volatile memory.
  • the RAM 23 temporarily stores data being processed by the processor 21 .
  • RAM 23 stores various application programs based on instructions from processor 21 .
  • the RAM 23 may store data necessary for executing the application program, execution results of the application program, and the like.
  • the NVM 24 is a non-volatile memory in which data can be written and rewritten.
  • the NVM 24 is composed of, for example, an HDD, SSD, flash memory, or the like.
  • the NVM 24 stores control programs, applications, various data, etc. according to the operational use of the server 20 .
  • the communication unit 25 (communication interface, second communication interface) is an interface for connecting to the network 6. That is, the communication unit 25 is an interface for transmitting/receiving data to/from the OCR device 10 or the like via the network 6 .
  • the communication unit 25 is an interface that supports wired or wireless LAN connection.
  • the operation unit 26 receives inputs for various operations from the operator.
  • the operation unit 26 transmits a signal indicating the input operation to the processor 21 .
  • the operation unit 26 may be composed of a touch panel.
  • the display unit 27 displays image data from the processor 21 .
  • the display unit 27 is composed of a liquid crystal monitor.
  • the display section 27 may be formed integrally with the operating section 26 .
  • the functions realized by the OCR device 10 are realized by the processor 11 executing a program stored in the internal memory, the ROM 12, the NVM 14, or the like.
  • FIG. 4 is a diagram for explaining the functions realized by the OCR device 10.
  • the processor 11 has a function of acquiring a captured image (first step).
  • the processor 11 causes the camera 3 to photograph the package through the camera interface 19.
  • the processor 11 acquires the captured image 103 of the package from the camera 3 through the camera interface 19.
  • the processor 11 acquires parameters of the acquired captured image 103.
  • for example, the processor 11 acquires the size of the captured image 103.
  • the processor 11 has a function (second step) of extracting a package image, in which the package is shown, from the captured image.
  • the processor 11 extracts the package image 104 from the captured image 103 using predetermined image processing. For example, the processor 11 extracts the package image 104 by edge detection. The processor 11 may also extract the package image 104 using artificial intelligence such as a neural network.
  • the method by which the processor 11 extracts the package image 104 from the captured image 103 is not limited to a specific method.
  • the processor 11 acquires parameters of the extracted package image 104.
  • the processor 11 acquires the coordinates, size, angle (inclination), and color of the package image 104.
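The extraction step above can be sketched in miniature. The toy function below stands in for the edge detection or neural network the embodiment describes and simply bounds the non-background pixels of a small image grid; the background-value rule and all names are illustrative assumptions, not the patent's method.

```python
def extract_package_region(image, background=0):
    """Return the bounding box (x, y, width, height) of all
    non-background pixels, or None if none are present."""
    coords = [(x, y)
              for y, row in enumerate(image)
              for x, value in enumerate(row)
              if value != background]
    if not coords:
        return None
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return (min(xs), min(ys),
            max(xs) - min(xs) + 1,
            max(ys) - min(ys) + 1)

# A 6x4 toy frame: the "package" occupies columns 2-4 of rows 1-2.
frame = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
box = extract_package_region(frame)  # (2, 1, 3, 2)
```

The returned box corresponds to the "package coordinates" and "package size" parameters mentioned above; angle and color would need further processing not sketched here.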
  • the processor 11 also has a function of extracting a character string image in which the character string appears from the package image 104.
  • the parcel image 104 includes the destination character string, barcode, and label.
  • the processor 11 extracts a character string image showing the character string of the destination from the package image 104 using predetermined image processing. For example, the processor 11 detects and extracts character string images by pattern recognition. Also, the processor 11 may extract the character string image using artificial intelligence such as a neural network. The method by which processor 11 extracts the character string image from parcel image 104 is not limited to a specific method.
  • the processor 11 acquires parameters of the extracted character string image.
  • the processor 11 acquires the coordinates and size of the character string image.
  • the processor 11 may acquire a flag indicating whether the character string in the character string image is handwritten or printed.
  • the processor 11 uses predetermined image processing to determine whether the character string in the character string image is handwritten or printed.
  • the processor 11 may also read the barcode from the parcel image 104. For example, processor 11 obtains the coordinates and size of the barcode. The processor 11 also decodes the barcode to obtain information indicated by the barcode.
  • the processor 11 may also read the label from the package image 104. For example, processor 11 obtains the coordinates and size of the label. In addition, the processor 11 performs OCR processing on the label to acquire information (precautionary notes, etc.) written on the label.
  • the processor 11 also has a function (third step) of extracting character candidates (images), which are candidates for areas containing one character, from the character string image.
  • the processor 11 extracts character candidates 105, which may overlap one another, from the character string image.
  • the character candidate 105 is a pattern surrounded by a rectangle.
  • the processor 11 extracts the connection pattern of the character candidates 105 based on the coordinates of the character candidates 105 and the like.
  • a connection pattern indicates one way of forming a continuous character string from the character candidates 105.
  • the processor 11 extracts multiple connection patterns.
  • lines 106 between character candidates 105 indicate connections between the character candidates 105. That is, a connection pattern is the sequence of character candidates 105 obtained by following the lines 106 from the start point 107 to the end point 108.
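One hedged reading of these connection patterns, as a Python sketch: each candidate is an (x, width) box, a candidate is connected to the next when the horizontal gap between them is small, and every chain from a candidate with no predecessor (a start point) to one with no successor (an end point) is one pattern. The gap rule, the box representation, and all names are assumptions for illustration only.

```python
def connection_patterns(candidates, max_gap=5):
    """Enumerate connection patterns over character candidates.

    candidates: list of (x, width) boxes along the string.
    Candidate j follows candidate i when the horizontal gap
    between them is between 0 and max_gap pixels; every chain
    from a start point to an end point is one pattern."""
    n = len(candidates)
    succ = [[j for j in range(n)
             if 0 <= candidates[j][0] - (candidates[i][0] + candidates[i][1]) <= max_gap]
            for i in range(n)]
    has_pred = [any(i in s for s in succ) for i in range(n)]
    patterns = []

    def walk(i, path):
        if not succ[i]:          # end point reached
            patterns.append(path)
        for j in succ[i]:
            walk(j, path + [j])

    for i in range(n):
        if not has_pred[i]:      # start point
            walk(i, [i])
    return patterns

# Candidate 3 spans the same area as candidates 1 and 2 together
# (one glyph readable as either one or two characters), so two
# patterns come out: [0, 1, 2] and [0, 3].
boxes = [(0, 4), (5, 4), (10, 4), (5, 9)]
patterns = connection_patterns(boxes)
```

Keeping several patterns alive, rather than committing to one segmentation early, is what lets the server later choose the segmentation whose candidate string best matches an address.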
  • the processor 11 also has a function (fifth step) of calculating, by OCR processing, a score (likelihood) indicating the possibility that the character candidate 105 is a predetermined character.
  • the processor 11 matches the character candidate 105 with the dictionary information by OCR processing.
  • Processor 11 calculates the score of character candidate 105 by matching.
  • the score indicates the possibility that the image of character candidate 105 is a predetermined character.
  • processor 11 calculates a score for a plurality of predetermined characters. That is, the processor 11 calculates a score indicating the possibility that the character candidate 105 is the predetermined character for each of the plurality of predetermined characters. Processor 11 similarly calculates a score for each character candidate 105 .
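The matching against dictionary information might be pictured as template comparison. The 3x3 bitmaps and pixel-agreement score below are purely illustrative stand-ins for whatever matching the OCR engine actually uses; only the shape of the output, one score per dictionary character per candidate, follows the description above.

```python
# Tiny 3x3 bitmaps standing in for the dictionary information;
# real dictionary entries would of course be far richer.
TEMPLATES = {
    "I": ((0, 1, 0), (0, 1, 0), (0, 1, 0)),
    "L": ((1, 0, 0), (1, 0, 0), (1, 1, 1)),
}

def char_scores(candidate):
    """Score one character-candidate bitmap against every
    dictionary character as the fraction of agreeing pixels."""
    scores = {}
    for ch, template in TEMPLATES.items():
        total = len(template) * len(template[0])
        agree = sum(c == t
                    for crow, trow in zip(candidate, template)
                    for c, t in zip(crow, trow))
        scores[ch] = agree / total
    return scores

scores = char_scores(((0, 1, 0), (0, 1, 0), (0, 1, 0)))  # looks like "I"
```

These per-character scores are exactly what the intermediate information carries to the server in place of the image itself.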
  • the processor 11 also has a function of generating intermediate information based on the information obtained in the first to fifth steps.
  • the intermediate information consists of information generated during the OCR process. That is, the intermediate information is information for recognizing character strings.
  • the intermediate information does not include the captured image 103, package image 104, and recognition result. Also, the intermediate information is binary data.
  • FIG. 5 shows a configuration example of intermediate information.
  • the intermediate information includes "image size", "package coordinates", "package size", "package angle", "package color", "character string coordinates", "character string cutout size", "barcode", "label", "handwritten/printed determination", "character candidate coordinates", "character candidate cutout size", "connection of character candidates", "character candidate score", and the like.
  • the intermediate information may have a configuration as necessary in addition to the configuration shown in FIG. 5, or a specific configuration may be excluded from the intermediate information.
  • “Image size” is obtained in the first step. “Image size” indicates the size of the captured image 103 .
  • “Package coordinates” indicates the coordinates of the package image 104 .
  • “Package size” indicates the size of the package image 104 .
  • “Angle of Package” indicates the inclination of the package image 104 .
  • “Package color” indicates the color of the package image 104 .
  • Coordinates of character string indicates the coordinates of the character string image.
  • Character string cutout size indicates the size of the character string image.
  • Barcode is information related to the barcode of package image 104 . For example, “barcode” indicates the coordinates of the barcode, the size of the barcode, and the information indicated by the barcode.
  • Label is information related to the label of package image 104 .
  • label indicates the coordinates of the label, the size of the label, and the information written on the label.
  • “Handwritten/printed determination” indicates whether the character string in the character string image is handwritten or printed.
  • “Coordinates of character candidates”, “Cut-out size of character candidates” and “Connection of character candidates” are acquired in the fourth step. “Coordinates of character candidate” indicates the coordinates of the character candidate 105 . “Cut-out size of character candidate” indicates the size of the character candidate 105 . “Connection of character candidates” indicates each connection pattern of the character candidates 105 .
  • the "character candidate score” is obtained by the fifth step. “Character Candidate Score” indicates the score of each character candidate 105 .
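The Fig. 5 record could be modeled as a small data structure. The field names below paraphrase the quoted labels, and the JSON-based `pack()` is only an illustrative serialization of what the description calls, without further detail, binary data. Packing a plausible record also shows the point of the embodiment: the payload is far smaller than an uncompressed captured frame.

```python
import json
from dataclasses import dataclass, asdict

@dataclass
class IntermediateInfo:
    """Sketch of the Fig. 5 record; field names are paraphrases."""
    image_size: tuple
    package_coords: tuple
    package_size: tuple
    package_angle: float
    package_color: str
    string_coords: tuple
    string_size: tuple
    barcode: dict
    label: dict
    handwritten: bool
    candidate_coords: list
    candidate_sizes: list
    candidate_links: list
    candidate_scores: list

    def pack(self) -> bytes:
        # JSON-encoded UTF-8 is just an illustrative serialization.
        return json.dumps(asdict(self)).encode("utf-8")

info = IntermediateInfo(
    image_size=(1920, 1080),
    package_coords=(400, 200),
    package_size=(600, 400),
    package_angle=2.5,
    package_color="brown",
    string_coords=(450, 260),
    string_size=(320, 40),
    barcode={"coords": (500, 500), "size": (200, 60), "value": "4912345"},
    label={"coords": (900, 300), "size": (150, 80), "text": "FRAGILE"},
    handwritten=False,
    candidate_coords=[(450, 260), (470, 260)],
    candidate_sizes=[(18, 30), (18, 30)],
    candidate_links=[[0, 1]],
    candidate_scores=[{"A": 0.91}, {"B": 0.88}],
)
payload = info.pack()
```

A few hundred bytes of intermediate information against roughly six megabytes for a raw 1920x1080 RGB frame: that gap is the transfer reduction the embodiment claims.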
  • the processor 11 also has a function of transmitting destination information related to the destination to the sorting device 2 . After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15 .
  • the server 20 transmits destination information to the OCR device 10 for the intermediate information.
  • the processor 11 receives destination information from the server 20 through the communication unit 15 . Upon receiving the destination information, processor 11 transmits the received destination information to sorting device 2 through sorting device interface 18 .
  • the functions realized by the server 20 are realized by the processor 21 executing a program stored in the internal memory, the ROM 22, the NVM 24, or the like.
  • FIG. 6 is a diagram for explaining the functions realized by the server 20.
  • FIG. 6 is a diagram for explaining the functions realized by the server 20.
  • the processor 21 has a function of recognizing a character string written in a character string image based on intermediate information. As described above, the processor 21 of the OCR device 10 transmits intermediate information to the server 20 through the communication section 15 .
  • the processor 21 of the server 20 receives the intermediate information from the OCR device 10 through the communication section 25.
  • upon receiving the intermediate information, the processor 21 acquires one connection pattern from the intermediate information.
  • after acquiring one connection pattern, the processor 21 matches the connection pattern with predetermined candidates (character strings). Here, the processor 21 calculates an evaluation value indicating the possibility that the character string indicated by the connection pattern is a predetermined candidate. The processor 21 calculates an evaluation value for each of the plurality of candidates.
  • the NVM 24 stores an address database showing multiple candidates (here, address candidates).
  • the processor 21 inputs each candidate indicated by the address database and each information indicated by the intermediate information into a predetermined evaluation function to calculate an evaluation value for each candidate.
  • the processor 21 similarly calculates the evaluation value of each candidate for each connection pattern indicated by the intermediate information.
  • after calculating the evaluation value of each candidate for each connection pattern, the processor 21 identifies the largest evaluation value. After identifying the largest evaluation value, the processor 21 acquires the candidate corresponding to that evaluation value. The processor 21 acquires the candidate as the character string (here, the destination) described in the character string image.
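The matching and evaluation described above can be sketched as follows. The patent does not specify the evaluation function, so this sketch simply sums per-position character scores over each address candidate and keeps the maximum; the scores and candidate strings are made up for illustration.

```python
def evaluate_candidates(pattern_scores, candidates):
    """Pick the address candidate best supported by one
    connection pattern's per-position character scores.

    pattern_scores: one dict per character position, mapping
    each dictionary character to its OCR score.  The evaluation
    value here is the sum of per-position scores; candidates of
    the wrong length are ruled out."""
    def value(cand):
        if len(cand) != len(pattern_scores):
            return float("-inf")
        return sum(pos.get(ch, 0.0)
                   for pos, ch in zip(pattern_scores, cand))

    best = max(candidates, key=value)
    return best, value(best)

# Position 0 strongly resembles "T", position 1 is ambiguous
# between the letter "O" and the digit "0".
scores = [{"T": 0.9, "I": 0.2}, {"O": 0.8, "0": 0.7}]
best, val = evaluate_candidates(scores, ["TO", "IO", "T0"])
```

Because the address database constrains the answer, the server can resolve ambiguities (letter "O" versus digit "0") that per-character scores alone cannot.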
  • the processor 21 also has a function of transmitting destination information related to the recognized character string to the OCR device 10 .
  • the processor 21 recognizes the character string written in the character string image.
  • here, it is assumed that the processor 21 has recognized the destination as the character string.
  • upon recognizing the destination, the processor 21 generates destination information associated with the recognized destination.
  • the destination information includes the recognized destination (the destination itself).
  • the destination information may indicate the sorting destination to which the parcels of the recognized destination are sorted.
  • the destination information may indicate a chute, a pocket, a cart, a tray, or the like into which articles are sorted in the sorting device 2 .
  • the configuration of the destination information is not limited to a specific configuration.
  • after generating the destination information, the processor 21 transmits the generated destination information to the OCR device 10 through the communication unit 25.
  • FIG. 7 is a flowchart for explaining an operation example of the OCR device 10.
  • the processor 11 of the OCR device 10 acquires the captured image 103 from the camera 3 (S11). After acquiring the captured image 103, the processor 11 extracts the package image 104 from the captured image 103 (S12).
  • after extracting the package image 104, the processor 11 extracts a character string image from the package image 104 (S13). After extracting the character string image, the processor 11 determines whether the character string described in the character string image is handwritten or printed (S14).
  • after determining whether the character string described in the character string image is handwritten or printed, the processor 11 extracts character candidates 105 from the character string image (S15). After extracting the character candidates 105, the processor 11 calculates the score of each character candidate 105 (S16).
  • after calculating the score of each character candidate 105, the processor 11 generates intermediate information (S17). After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15 (S18).
  • the processor 11 determines whether the destination information has been received through the communication unit 15 (S19). When determining that the destination information has not been received (S19, NO), the processor 11 returns to S19.
  • when determining that the destination information has been received through the communication unit 15 (S19, YES), the processor 11 transmits the received destination information to the sorting device 2 through the sorting device interface 18 (S20). After sending the destination information to the sorting device 2, the processor 11 ends the operation.
  • FIG. 8 is a flowchart for explaining an operation example of the server 20.
  • the processor 21 of the server 20 receives intermediate information from the OCR device 10 through the communication unit 25 (S21). Upon receiving the intermediate information, the processor 21 acquires one connection pattern from the intermediate information (S22).
  • after acquiring one connection pattern, the processor 21 matches the connection pattern with each candidate (S23). The processor 21 matches the connection pattern with each candidate and calculates the evaluation value of each candidate (S24).
  • the processor 21 determines whether there is another connection pattern (S25). When determining that there is another connection pattern (S25, YES), the processor 21 returns to S22.
  • when determining that there is no other connection pattern (S25, NO), the processor 21 recognizes the character string written in the character string image based on each evaluation value (S26). Upon recognizing the character string, the processor 21 transmits destination information related to the recognized character string to the OCR device 10 through the communication unit 25 (S27). After transmitting the destination information to the OCR device 10, the processor 21 ends its operation.
  • in a modified example, the processor 11 of the OCR device 10 generates a score map.
  • FIG. 9 is a diagram for explaining functions realized by the OCR device 10 in the modified example.
  • the processor 11 generates a score map in the fourth step.
  • the score map can be obtained by applying machine learning, pattern recognition, CNN (Convolutional Neural Network), etc. to the character string image.
  • the width W of the score map varies with (for example, is proportional or equal to) the width of the character string image.
  • the height H of the score map is the number of character types to be recognized plus 1.
  • the additional row corresponds to a special character representing "nothing".
  • each row (ordinate) of the score map corresponds to one character to be recognized (for example, the "nothing" symbol followed by "ABCDEF...").
  • Each column (abscissa) of the score map corresponds to each column of the character string image.
  • the score map has the characteristic that the score (value) of the corresponding column in the row corresponding to the "character" written in the character string image is large.
  • the server 20 can recognize the characters written in the character string image by obtaining each character corresponding to the maximum score in each column of the score map.
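The per-column reading of the score map might look like the following sketch. Treating columns whose best row is the "nothing" symbol as contributing no character is an assumption layered on the description, which only says that the per-column maximum identifies each character; all names and scores are illustrative.

```python
def read_score_map(score_map, alphabet):
    """Decode a score map column by column.

    score_map: list of columns, each holding one score per row.
    Row 0 is the special "nothing" symbol; rows 1.. follow
    `alphabet`.  A column whose best-scoring row is "nothing"
    contributes no character."""
    chars = []
    for column in score_map:
        best_row = max(range(len(column)), key=column.__getitem__)
        if best_row > 0:
            chars.append(alphabet[best_row - 1])
    return "".join(chars)

# Rows: ["nothing", "A", "B"]; three columns reading "A",
# then a gap, then "B".
columns = [
    [0.1, 0.8, 0.1],
    [0.9, 0.05, 0.05],
    [0.2, 0.1, 0.7],
]
text = read_score_map(columns, "AB")  # "AB"
```

In this modified example the score map replaces the character candidates and their scores in the intermediate information, while still being much smaller than the character string image itself.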
  • Processor 11 generates intermediate information including a score map. Note that the intermediate information may include information obtained from the first to third steps. After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15 .
  • the processor 11 of the OCR device 10 may generate information indicating the sorting destination based on the destination information. For example, if the destination information includes a destination, the processor 11 may generate information indicating a sorting destination for sorting parcels at the destination. Processor 11 transmits the generated information to sorting device 2 .
  • the sorting device 2 and the camera 3 may be integrally formed. Also, the sorting device 2, the camera 3, and the OCR device 10 may be integrally formed.
  • the recognition system configured as described above generates intermediate information from the captured image in the OCR device.
  • the recognition system sends intermediate information to the server.
  • a recognition system recognizes a character string based on the intermediate information at the server.
  • the recognition system can reduce the amount of data transferred to the server compared to when images are sent to the server. Therefore, the recognition system can reduce the transfer time to the server and can quickly recognize the character string.
  • the recognition system since the recognition system does not send the image to the server, it is possible to prevent the image from being read. As a result, the recognition system can reduce risks such as leakage of personal information.
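As an illustration, the per-column maximum decoding described in the bullets above can be sketched as follows. This is a minimal sketch, not the embodiment's implementation: the score-map layout, the character inventory, and the CTC-style collapsing of repeated columns are all assumptions.

```python
# Hypothetical sketch: decode a score map by taking the row with the
# maximum score in each column. Row 0 is assumed to be the special
# character representing "nothing"; consecutive repeated rows are
# collapsed (a CTC-style convention, assumed here, not stated above).
def decode_score_map(score_map, chars):
    # score_map: one row per character type (H = len(chars) rows),
    # each row holding one score per column of the string image.
    n_cols = len(score_map[0])
    out, prev = [], None
    for c in range(n_cols):
        best = max(range(len(score_map)), key=lambda r: score_map[r][c])
        if best != 0 and best != prev:  # skip "nothing" and repeats
            out.append(chars[best])
        prev = best
    return "".join(out)

chars = ["", "A", "B"]          # "" is the "nothing" character
score_map = [
    [0.9, 0.1, 0.1, 0.8, 0.1],  # "nothing"
    [0.1, 0.8, 0.7, 0.1, 0.2],  # A
    [0.0, 0.1, 0.2, 0.1, 0.7],  # B
]
print(decode_score_map(score_map, chars))  # prints "AB"
```

The columns score highest for "nothing", A, A, "nothing", B, so the decoded string is "AB".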

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Character Discrimination (AREA)
  • Character Input (AREA)

Abstract

Provided are an information processing apparatus, a program, and a system that can suppress the amount of data transferred to an apparatus that performs character recognition. According to an embodiment, an information processing apparatus includes an image interface, a communication interface, and a processor. The image interface acquires a captured image containing a character string. The communication interface connects to an external apparatus. The processor generates intermediate information composed of information generated in the course of character recognition processing on the captured image, and transmits the intermediate information to the external apparatus via the communication interface.

Description

Information processing apparatus, program, and system
The embodiments of the present invention relate to an information processing apparatus, a program, and a system.
A system is provided in which character recognition of an image acquired by a terminal is performed on a cloud server. In such a system, the terminal transmits the acquired image to the cloud server and obtains the character recognition result from the cloud server.
Conventionally, such a system has the problem of a large transfer volume, because the image must be transmitted from the terminal to the cloud server.
Japanese Patent Application Laid-Open No. 2015-90623
In order to solve the above problem, an information processing apparatus, a program, and a system are provided that can suppress the amount of data transferred to a device that performs character recognition.
According to an embodiment, an information processing apparatus includes an image interface, a communication interface, and a processor. The image interface acquires a captured image containing a character string. The communication interface connects to an external device. The processor generates intermediate information composed of information generated in the course of character recognition processing on the captured image, and transmits the intermediate information to the external device through the communication interface.
FIG. 1 is a block diagram showing a configuration example of a recognition system according to an embodiment.
FIG. 2 is a block diagram showing a configuration example of an OCR device according to the embodiment.
FIG. 3 is a block diagram showing a configuration example of a server according to the embodiment.
FIG. 4 is a diagram showing an operation example of the OCR device according to the embodiment.
FIG. 5 is a diagram showing a configuration example of intermediate information according to the embodiment.
FIG. 6 is a diagram showing an operation example of the server according to the embodiment.
FIG. 7 is a flowchart showing an operation example of the OCR device according to the embodiment.
FIG. 8 is a flowchart showing an operation example of the server according to the embodiment.
FIG. 9 is a diagram showing another operation example of the OCR device according to the embodiment.
Embodiment
Embodiments will be described below with reference to the drawings.
The recognition system according to the embodiment recognizes a character string in an image using character recognition (OCR, Optical Character Recognition) processing. Here, the recognition system recognizes the destination of a parcel from an image of, for example, a slip attached to the parcel. The recognition system sorts the parcel based on the recognized destination.
FIG. 1 shows a configuration example of a recognition system 1 according to the embodiment. As shown in FIG. 1, the recognition system 1 includes a sorting device 2, a camera 3, a network 6, an OCR device 10, a server 20, and the like.
The OCR device 10 connects to the sorting device 2 and the camera 3. The OCR device 10 and the server 20 are connected to the network 6.
In addition to the configuration shown in FIG. 1, the recognition system 1 may further include components as necessary, and specific components may be excluded from it.
The sorting device 2 sorts parcels fed in by an operator, a conveyor belt, a robot, or the like. The sorting device 2 receives destination information (character string information) related to the destination (character string) of a parcel from the OCR device 10, and sorts the parcel based on the destination information. For example, the sorting device 2 sorts parcels into chutes, pockets, carts, or trays as sorting destinations. For example, the sorting device 2 is composed of a sorter, a conveyor belt, a robot, or the like.
The camera 3 photographs the parcels fed into the sorting device 2. The camera 3 photographs the surface on which the destination is displayed, for example, the surface to which the slip is attached. The camera 3 supplies the captured image to the OCR device 10.
For example, the camera 3 is a CCD (Charge Coupled Device) camera. The camera 3 may also include a light source that illuminates the parcel.
The OCR device 10 (information processing device, first information processing device, external device) acquires the captured image from the camera 3. The OCR device 10 generates intermediate information related to OCR processing from the captured image. The OCR device 10 transmits the intermediate information to the server 20 and receives from the server 20 destination information related to the destination of the parcel shown in the captured image. The OCR device 10 inputs the received destination information to the sorting device 2. The OCR device 10 and the intermediate information are described in detail later.
The network 6 relays communication between the OCR device 10 and the server 20. For example, the network 6 is the Internet.
The server 20 (information processing device, second information processing device, external device) receives the intermediate information from the OCR device 10. The server 20 generates destination information based on the received intermediate information and supplies the generated destination information to the OCR device 10. The server 20 is described in detail later.
Next, the OCR device 10 will be described.
FIG. 2 is a block diagram showing a configuration example of the OCR device 10 according to the embodiment. As shown in FIG. 2, the OCR device 10 includes a processor 11, a ROM 12, a RAM 13, an NVM 14, a communication unit 15, an operation unit 16, a display unit 17, a sorting device interface 18, a camera interface 19, and the like.
The processor 11 is connected to the ROM 12, the RAM 13, the NVM 14, the communication unit 15, the operation unit 16, the display unit 17, the sorting device interface 18, and the camera interface 19 via a data bus or the like.
In addition to the configuration shown in FIG. 2, the OCR device 10 may include components as necessary, and specific components may be excluded from it.
The processor 11 (first processor) has a function of controlling the operation of the OCR device 10 as a whole. The processor 11 may include an internal cache, various interfaces, and the like. The processor 11 implements various processes by executing programs stored in advance in its internal memory, the ROM 12, or the NVM 14.
Some of the functions realized by the processor 11 executing programs may instead be realized by hardware circuits. In that case, the processor 11 controls the functions performed by the hardware circuits.
The ROM 12 is a non-volatile memory in which a control program, control data, and the like are stored in advance. The control program and control data stored in the ROM 12 are installed in advance according to the specifications of the OCR device 10.
The RAM 13 is a volatile memory. The RAM 13 temporarily stores data being processed by the processor 11, and stores various application programs based on instructions from the processor 11. The RAM 13 may also store data necessary for executing an application program, the execution results of the application program, and the like.
The NVM 14 is a non-volatile memory in which data can be written and rewritten. The NVM 14 is composed of, for example, an HDD (Hard Disk Drive), an SSD (Solid State Drive), or a flash memory. The NVM 14 stores a control program, applications, various data, and the like according to the operational use of the OCR device 10.
The communication unit 15 (communication interface, first communication interface) is an interface for connecting to the network 6, that is, for exchanging data with the server 20 and the like through the network 6. For example, the communication unit 15 is an interface that supports a wired or wireless LAN (Local Area Network) connection.
The operation unit 16 receives inputs of various operations from the operator and transmits signals indicating the input operations to the processor 11. The operation unit 16 may be composed of a touch panel.
The display unit 17 displays image data from the processor 11. For example, the display unit 17 is composed of a liquid crystal monitor. When the operation unit 16 is composed of a touch panel, the display unit 17 may be formed integrally with the operation unit 16.
The sorting device interface 18 is an interface for connecting to the sorting device 2. The sorting device interface 18 transmits signals from the processor 11 (for example, destination information) to the sorting device 2, and transmits signals from the sorting device 2 to the processor 11.
The camera interface 19 (image interface) is an interface for connecting to the camera 3. The camera interface 19 transmits signals from the processor 11 to the camera 3, and transmits signals from the camera 3 (such as captured images) to the processor 11.
Next, the server 20 will be described.
FIG. 3 is a block diagram showing a configuration example of the server 20 according to the embodiment. As shown in FIG. 3, the server 20 includes a processor 21, a ROM 22, a RAM 23, an NVM 24, a communication unit 25, an operation unit 26, a display unit 27, and the like.
The processor 21 is connected to the ROM 22, the RAM 23, the NVM 24, the communication unit 25, the operation unit 26, and the display unit 27 via a data bus or the like.
In addition to the configuration shown in FIG. 3, the server 20 may include components as necessary, and specific components may be excluded from it.
The processor 21 (second processor) has a function of controlling the operation of the server 20 as a whole. The processor 21 may include an internal cache, various interfaces, and the like. The processor 21 implements various processes by executing programs stored in advance in its internal memory, the ROM 22, or the NVM 24.
Some of the functions realized by the processor 21 executing programs may instead be realized by hardware circuits. In that case, the processor 21 controls the functions performed by the hardware circuits.
The ROM 22 is a non-volatile memory in which a control program, control data, and the like are stored in advance. The control program and control data stored in the ROM 22 are installed in advance according to the specifications of the server 20.
The RAM 23 is a volatile memory. The RAM 23 temporarily stores data being processed by the processor 21, and stores various application programs based on instructions from the processor 21. The RAM 23 may also store data necessary for executing an application program, the execution results of the application program, and the like.
The NVM 24 is a non-volatile memory in which data can be written and rewritten. The NVM 24 is composed of, for example, an HDD, an SSD, or a flash memory. The NVM 24 stores a control program, applications, various data, and the like according to the operational use of the server 20.
The communication unit 25 (communication interface, second communication interface) is an interface for connecting to the network 6, that is, for exchanging data with the OCR device 10 and the like through the network 6. For example, the communication unit 25 is an interface that supports a wired or wireless LAN connection.
The operation unit 26 receives inputs of various operations from the operator and transmits signals indicating the input operations to the processor 21. The operation unit 26 may be composed of a touch panel.
The display unit 27 displays image data from the processor 21. For example, the display unit 27 is composed of a liquid crystal monitor. When the operation unit 26 is composed of a touch panel, the display unit 27 may be formed integrally with the operation unit 26.
Next, the functions realized by the OCR device 10 will be described. These functions are realized by the processor 11 executing programs stored in its internal memory, the ROM 12, the NVM 14, or the like.
FIG. 4 is a diagram for explaining the functions realized by the OCR device 10.
First, the processor 11 has a function of acquiring a captured image (first step).
Here, it is assumed that a parcel to be fed into the sorting device 2 is present at a position that the camera 3 can photograph.
The processor 11 causes the camera 3 to photograph the parcel through the camera interface 19, and acquires a captured image 103 in which the parcel appears from the camera 3 through the camera interface 19.
The processor 11 also acquires parameters of the acquired captured image 103. Here, the processor 11 acquires the size of the captured image 103.
The processor 11 also has a function of extracting, from the captured image, a parcel image in which the parcel appears (second step).
The processor 11 extracts the parcel image 104 from the captured image 103 using predetermined image processing. For example, the processor 11 extracts the parcel image 104 by edge detection. The processor 11 may also extract the parcel image 104 using artificial intelligence such as a neural network. The method by which the processor 11 extracts the parcel image 104 from the captured image 103 is not limited to any specific method.
The processor 11 also acquires parameters of the extracted parcel image 104. Here, the processor 11 acquires the coordinates, size, angle (inclination of the parcel image 104), and color of the parcel image 104.
The processor 11 also has a function of extracting, from the parcel image 104, a character string image in which a character string appears.
Here, it is assumed that the destination character string, a barcode, and a label appear in the parcel image 104.
The processor 11 extracts a character string image showing the destination character string from the parcel image 104 using predetermined image processing. For example, the processor 11 detects and extracts the character string image by pattern recognition. The processor 11 may also extract the character string image using artificial intelligence such as a neural network. The method by which the processor 11 extracts the character string image from the parcel image 104 is not limited to any specific method.
The processor 11 also acquires parameters of the extracted character string image. Here, the processor 11 acquires the coordinates and size of the character string image. The processor 11 may also acquire a flag indicating whether the character string in the character string image is handwritten or printed. For example, the processor 11 determines whether the character string is handwritten or printed using predetermined image processing.
The processor 11 may also read a barcode from the parcel image 104. For example, the processor 11 acquires the coordinates and size of the barcode, decodes the barcode, and acquires the information the barcode encodes.
The processor 11 may also read a label from the parcel image 104. For example, the processor 11 acquires the coordinates and size of the label, and performs OCR processing on the label to acquire the information written on it (precautionary notes, etc.).
The processor 11 also has a function of extracting, from the character string image, character candidates (images), which are candidates for regions each containing one character (fourth step).
The processor 11 extracts character candidates 105 that overlap the drawn lines from the character string image. Here, a character candidate 105 is a pattern enclosed in a rectangle. After extracting the character candidates 105, the processor 11 extracts connection patterns of the character candidates 105 based on their coordinates and the like. A connection pattern indicates a pattern for a series of character strings formed from the character candidates 105.
As shown in FIG. 4, the processor 11 extracts a plurality of connection patterns. In FIG. 4, the lines 106 between the character candidates 105 indicate connections between them. That is, a connection pattern indicates a chain of character candidates 105 formed by connecting them with lines 106 from a start point 107 to an end point 108.
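For illustration, enumerating the connection patterns from candidate-to-candidate links might look like the following sketch. All names here are hypothetical, and the link structure (which candidate may follow which) is assumed to have been derived beforehand from the candidate coordinates.

```python
def connection_patterns(candidates, links):
    # candidates: character-candidate ids; links: maps a candidate to the
    # candidates that may follow it (assumed to be built from the box
    # coordinates, e.g. left-to-right adjacency).
    followers = {n for nxt in links.values() for n in nxt}
    starts = [c for c in candidates if c not in followers]  # start points
    patterns = []

    def walk(node, path):
        nxt = links.get(node, [])
        if not nxt:                  # reached an end point
            patterns.append(path)
            return
        for n in nxt:
            walk(n, path + [n])

    for s in starts:
        walk(s, [s])
    return patterns

# Two chains from start point 1 to end point 4: 1-2-4 and 1-3-4.
print(connection_patterns([1, 2, 3, 4], {1: [2, 3], 2: [4], 3: [4]}))
```

Each returned list is one connection pattern, i.e., one possible reading order of the character candidates.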
The processor 11 also has a function of calculating, by OCR processing, a score (likelihood) indicating the possibility that a character candidate 105 is a predetermined character (fifth step).
The processor 11 matches each character candidate 105 against dictionary information by OCR processing, and calculates the score of the character candidate 105 from the matching. Here, the score indicates the possibility that the image of the character candidate 105 is a predetermined character.
Here, the processor 11 calculates scores for a plurality of predetermined characters. That is, for each of the plurality of predetermined characters, the processor 11 calculates a score indicating the possibility that the character candidate 105 is that character.
The processor 11 calculates scores in the same way for every character candidate 105.
The processor 11 also has a function of generating intermediate information based on the information obtained in the first to fifth steps.
The intermediate information is composed of information generated in the course of the OCR process; that is, it is information for recognizing the character string. Here, the intermediate information does not include the captured image 103, the parcel image 104, or a recognition result. The intermediate information is binary data.
FIG. 5 shows a configuration example of the intermediate information. As shown in FIG. 5, the intermediate information is composed of "image size", "parcel coordinates", "parcel size", "parcel angle", "parcel color", "character string coordinates", "character string cutout size", "barcode", "label", "handwriting/printing determination", "character candidate coordinates", "character candidate cutout size", "character candidate connections", "character candidate scores", and the like.
In addition to the configuration shown in FIG. 5, the intermediate information may include items as necessary, and specific items may be excluded from it.
"Image size" is obtained in the first step. It indicates the size of the captured image 103.
"Parcel coordinates", "parcel size", "parcel angle", and "parcel color" are obtained in the second step.
"Parcel coordinates" indicates the coordinates of the parcel image 104, "parcel size" its size, "parcel angle" its inclination, and "parcel color" its color.
"Character string coordinates", "character string cutout size", "barcode", "label", and "handwriting/printing determination" are obtained in the third step.
"Character string coordinates" indicates the coordinates of the character string image, and "character string cutout size" indicates its size.
"Barcode" is information related to the barcode in the parcel image 104. For example, it indicates the coordinates of the barcode, the size of the barcode, and the information the barcode encodes.
"Label" is information related to the label in the parcel image 104. For example, it indicates the coordinates of the label, the size of the label, and the information written on the label.
"Handwriting/printing determination" indicates whether the character string in the character string image is handwritten or printed.
"Character candidate coordinates", "character candidate cutout size", and "character candidate connections" are obtained in the fourth step.
"Character candidate coordinates" indicates the coordinates of the character candidates 105, "character candidate cutout size" their sizes, and "character candidate connections" the connection patterns of the character candidates 105.
"Character candidate scores" is obtained in the fifth step. It indicates the scores of each character candidate 105.
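Gathered together, the items listed above could be represented by a record such as the following. This is only a sketch: the field names and types are assumptions, since the embodiment enumerates the items of the intermediate information but does not fix a concrete schema (it only states that the intermediate information is binary data).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class IntermediateInfo:
    # Illustrative schema; all field names and types are assumptions.
    image_size: Tuple[int, int]             # step 1: captured-image size
    parcel_coords: Tuple[int, int]          # step 2: parcel image
    parcel_size: Tuple[int, int]
    parcel_angle: float
    parcel_color: str
    string_coords: Tuple[int, int]          # step 3: character string image
    string_cutout_size: Tuple[int, int]
    barcode: Optional[dict]                 # coords, size, decoded value
    label: Optional[dict]                   # coords, size, read text
    handwritten: bool                       # handwriting/printing flag
    candidate_coords: List[Tuple[int, int]]       # step 4: candidates
    candidate_cutout_sizes: List[Tuple[int, int]]
    candidate_links: List[List[int]]        # connection patterns
    candidate_scores: List[dict]            # step 5: per-character scores

info = IntermediateInfo(
    image_size=(1920, 1080), parcel_coords=(100, 200), parcel_size=(400, 300),
    parcel_angle=2.5, parcel_color="brown", string_coords=(120, 240),
    string_cutout_size=(200, 40), barcode=None, label=None, handwritten=False,
    candidate_coords=[(0, 0)], candidate_cutout_sizes=[(20, 40)],
    candidate_links=[[0]], candidate_scores=[{"A": 0.9}],
)
print(info.handwritten)  # prints False
```

In practice such a record would then be serialized to binary (for example with a compact binary encoding) before transmission, which is what keeps the transfer volume below that of sending the image itself.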
The processor 11 also has a function of transmitting destination information related to the destination to the sorting device 2.
After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15.
As described later, the server 20 transmits destination information for the intermediate information to the OCR device 10.
The processor 11 receives the destination information from the server 20 through the communication unit 15. Upon receiving the destination information, the processor 11 transmits it to the sorting device 2 through the sorting device interface 18.
 次に、サーバ20が実現する機能について説明する。サーバ20が実現する機能は、プロセッサ21が内部メモリ、ROM22又はNVM24などに格納されるプログラムを実行することで実現される。 Next, functions realized by the server 20 will be described. The functions realized by the server 20 are realized by the processor 21 executing a program stored in the internal memory, the ROM 22, the NVM 24, or the like.
 図6は、サーバ20が実現する機能について説明するための図である。 FIG. 6 is a diagram for explaining the functions realized by the server 20. FIG.
First, the processor 21 has a function of recognizing a character string written in a character string image based on the intermediate information.
As described above, the processor 11 of the OCR device 10 transmits the intermediate information to the server 20 through the communication unit 15.
The processor 21 of the server 20 receives the intermediate information from the OCR device 10 through the communication unit 25.
Upon receiving the intermediate information, the processor 21 acquires one connection pattern from the intermediate information.
After acquiring one connection pattern, the processor 21 matches the connection pattern against predetermined candidates (character strings). Here, the processor 21 calculates an evaluation value indicating the possibility that the character string indicated by the connection pattern is a given candidate, and does so for each of the plurality of candidates.
For example, the NVM 24 stores an address database indicating a plurality of candidates (here, address candidates). The processor 21 inputs each candidate indicated by the address database and each piece of information indicated by the intermediate information into a predetermined evaluation function to calculate an evaluation value for each candidate.
The processor 21 similarly calculates the evaluation value of each candidate for each connection pattern indicated by the intermediate information.
After calculating the evaluation value of each candidate for each connection pattern, the processor 21 identifies the largest evaluation value and acquires the candidate corresponding to the identified evaluation value. The processor 21 acquires that candidate as the character string (here, the destination) written in the character string image.
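The matching step above can be sketched as follows. The representation of a connection pattern (per-position score dictionaries), the evaluation function (a position-wise average), and the candidate database are illustrative assumptions, not the evaluation function the patent actually uses.

```python
# Hypothetical sketch of matching connection patterns against candidate
# strings and selecting the candidate with the largest evaluation value.

def evaluate(pattern, candidate):
    # pattern: list of dicts, one per character position, mapping a
    # character to its recognition score. Characters the pattern does
    # not support contribute 0 to the evaluation value.
    length = max(len(pattern), len(candidate))
    if length == 0:
        return 0.0
    matched = sum(pattern[i].get(candidate[i], 0.0)
                  for i in range(min(len(pattern), len(candidate))))
    return matched / length

def recognize(patterns, candidates):
    # Evaluate every (connection pattern, candidate) pair and return the
    # candidate with the largest evaluation value.
    best = max(((evaluate(p, c), c) for p in patterns for c in candidates),
               key=lambda vc: vc[0])
    return best[1], best[0]

patterns = [[{"T": 0.9}, {"O": 0.8}, {"K": 0.7}, {"Y": 0.9}, {"O": 0.8}]]
database = ["TOKYO", "KYOTO", "OSAKA"]
candidate, value = recognize(patterns, database)
print(candidate)  # → TOKYO
```

The pattern above rewards "TOKYO" at every position, so it wins over "KYOTO" (which shares the same characters but at the wrong positions) and "OSAKA".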
The processor 21 also has a function of transmitting destination information related to the recognized character string to the OCR device 10.
As described above, the processor 21 recognizes the character string written in the character string image; here, it is assumed that the processor 21 has recognized a destination as the character string. Upon recognizing the destination, the processor 21 generates destination information associated with the recognized destination.
For example, the destination information includes the recognized destination (the destination itself). The destination information may instead indicate the sorting destination to which a parcel addressed to the recognized destination is sorted; for example, it may indicate a chute, pocket, cart, tray, or the like into which articles are sorted in the sorting device 2. The configuration of the destination information is not limited to a specific configuration.
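As a concrete illustration of the two variants just described, the destination information might be shaped as follows; all field names and values here are assumptions for illustration, not the patent's actual data format.

```python
# Two hypothetical shapes of the destination information.
destination_only = {
    "destination": "1-1 Example-cho, Minato-ku, Tokyo",  # the recognized destination itself
}
with_sort_target = {
    "destination": "1-1 Example-cho, Minato-ku, Tokyo",
    "sort_target": {"kind": "chute", "id": 12},  # chute/pocket/cart/tray in the sorting device 2
}
print(with_sort_target["sort_target"]["kind"])  # → chute
```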
After generating the destination information, the processor 21 transmits the generated destination information to the OCR device 10 through the communication unit 25.
Next, an operation example of the recognition system 1 will be described.
First, an operation example of the OCR device 10 will be described. FIG. 7 is a flowchart for explaining the operation example of the OCR device 10.
First, the processor 11 of the OCR device 10 acquires the captured image 103 from the camera 3 (S11). After acquiring the captured image 103, the processor 11 extracts the package image 104 from the captured image 103 (S12).
After extracting the package image 104, the processor 11 extracts a character string image from the package image 104 (S13). After extracting the character string image, the processor 11 determines whether the character string written in the character string image is handwritten or printed (S14).
After determining whether the character string written in the character string image is handwritten or printed, the processor 11 extracts the character candidates 105 from the character string image (S15). After extracting the character candidates 105, the processor 11 calculates the score of each character candidate 105 (S16).
After calculating the score of each character candidate 105, the processor 11 generates the intermediate information (S17). After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15 (S18).
After transmitting the intermediate information to the server 20, the processor 11 determines whether destination information has been received through the communication unit 15 (S19). When determining that the destination information has not been received (S19, NO), the processor 11 returns to S19.
When determining that the destination information has been received through the communication unit 15 (S19, YES), the processor 11 transmits the received destination information to the sorting device 2 through the sorting device interface 18 (S20).
After transmitting the destination information to the sorting device 2, the processor 11 ends the operation.
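The flow of FIG. 7 can be sketched end to end as follows. Every helper function below is a stub standing in for the image processing described in the text; the function names and the layout of the intermediate information are assumptions, not the patent's implementation.

```python
# Minimal sketch of the OCR-device flow (S11-S20) with stub helpers.

def extract_package(frame):               # S12: crop the package image 104
    return frame

def extract_text_region(package):         # S13: crop the character string image
    return package

def is_handwritten(text_image):           # S14: handwritten vs. printed
    return False

def extract_char_candidates(text_image):  # S15: character candidates 105
    return list(text_image)

def score(candidate):                     # S16: score of each character candidate
    return 1.0

def run_ocr_device(frame, send_to_server, send_to_sorter):
    package = extract_package(frame)               # S11-S12: acquire and crop
    text_image = extract_text_region(package)      # S13
    candidates = extract_char_candidates(text_image)
    intermediate = {                               # S17: intermediate information
        "handwritten": is_handwritten(text_image), # S14
        "candidates": candidates,                  # S15
        "scores": [score(c) for c in candidates],  # S16
    }
    destination = send_to_server(intermediate)     # S18-S19: send, wait for reply
    send_to_sorter(destination)                    # S20

received = []
run_ocr_device("TOKYO", lambda info: "chute-12", received.append)
print(received)  # → ['chute-12']
```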
Next, an operation example of the server 20 will be described. FIG. 8 is a flowchart for explaining the operation example of the server 20.
First, the processor 21 of the server 20 receives the intermediate information from the OCR device 10 through the communication unit 25 (S21). Upon receiving the intermediate information, the processor 21 acquires one connection pattern from it (S22).
After acquiring one connection pattern, the processor 21 matches the connection pattern against each candidate (S23) and calculates the evaluation value of each candidate (S24).
After calculating the evaluation value of each candidate, the processor 21 determines whether another connection pattern exists (S25). When determining that another connection pattern exists (S25, YES), the processor 21 returns to S22.
When determining that no other connection pattern exists (S25, NO), the processor 21 recognizes the character string written in the character string image based on the evaluation values (S26). Upon recognizing the character string, the processor 21 transmits destination information related to the recognized character string to the OCR device 10 through the communication unit 25 (S27).
After transmitting the destination information to the OCR device 10, the processor 21 ends the operation.
Next, a modified example of the OCR device 10 will be described.
Here, the processor 11 of the OCR device 10 generates a score map.
FIG. 9 is a diagram for explaining functions realized by the OCR device 10 in the modified example.
Description of the first to third steps is omitted because they are as described above.
In the fourth step, the processor 11 generates a score map.
The score map can be obtained by applying machine learning, pattern recognition, a CNN (Convolutional Neural Network), or the like to the character string image. The width W of the score map varies with the width of the character string image (proportional to, or equal to, it). The height H of the score map is the number of character classes to be recognized plus one.
In the example of FIG. 9, the character set is "ΦABCDEF", so H = 7 (in actual OCR, H can be several thousand because numbers, kanji, and the like are included).
Here, "Φ" is a special character representing "nothing".
Each row (ordinate) of the score map corresponds to one of the characters to be recognized (ΦABCDEF).
Each column (abscissa) of the score map corresponds to a column of the character string image.
The score map has the characteristic that, in the row corresponding to a "character" written in the character string image, the score (value) of the corresponding column is large.
In the example of FIG. 9, the second and third columns of the fourth row, corresponding to the "C" of "CAFFEE", have larger values than the corresponding columns of the other rows (in the lower part of FIG. 9, the largest value in each column is shown in bold).
The same applies to "A" and "E".
The server 20 can recognize the characters written in the character string image by obtaining, for each column of the score map, the character corresponding to the maximum score in that column.
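The per-column decoding just described can be sketched as follows. The collapsing rule (merging repeated rows and then dropping Φ, as in CTC-style greedy decoding) is an assumption; the text only states that the maximum-score character of each column is obtained, and the toy score map below is illustrative.

```python
# Greedy decoding of a score map. Row 0 is the special character Φ
# ("nothing"); repeated rows are merged so that a character spanning
# several columns, like the "C" in the FIG. 9 example, is emitted once.

CHARS = "ΦABCDEF"  # H = 7 rows

def decode(score_map):
    # score_map[r][c] is the score of character CHARS[r] at column c.
    out, prev = [], None
    for c in range(len(score_map[0])):
        r = max(range(len(CHARS)), key=lambda row: score_map[row][c])
        if r != prev and r != 0:  # merge repeated rows, drop Φ
            out.append(CHARS[r])
        prev = r
    return "".join(out)

# Toy map whose per-column maxima are C, C, A, F, Φ, F, E, Φ, E.
seq = [3, 3, 1, 6, 0, 6, 5, 0, 5]
score_map = [[0.9 if row == seq[col] else 0.1 for col in range(len(seq))]
             for row in range(len(CHARS))]
print(decode(score_map))  # → CAFFEE
```

Note that the Φ columns between the two "F"s and the two "E"s are what allow repeated letters to survive the merging step.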
The processor 11 generates intermediate information including the score map. Note that the intermediate information may also include the information obtained in the first to third steps.
After generating the intermediate information, the processor 11 transmits the generated intermediate information to the server 20 through the communication unit 15.
Note that the processor 11 of the OCR device 10 may generate information indicating the sorting destination based on the destination information. For example, if the destination information includes a destination, the processor 11 may generate information indicating a sorting destination for parcels addressed to that destination. The processor 11 transmits the generated information to the sorting device 2.
The sorting device 2 and the camera 3 may be formed integrally. The sorting device 2, the camera 3, and the OCR device 10 may also be formed integrally.
In the recognition system configured as described above, the OCR device generates intermediate information from the captured image and transmits the intermediate information to the server, and the server recognizes the character string based on the intermediate information. As a result, the recognition system can reduce the amount of data transferred to the server compared with transmitting the image itself. The recognition system can therefore reduce the transfer time to the server and quickly recognize the character string.
Also, since the recognition system does not transmit the image to the server, it is possible to prevent the image from being read. As a result, the recognition system can reduce risks such as leakage of personal information.
While several embodiments of the invention have been described, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, substitutions, and modifications can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and in the invention described in the claims and its equivalents.

Claims (13)

  1.  An information processing apparatus comprising:
     an image interface for acquiring a captured image containing a character string;
     a communication interface that connects to an external device; and
     a processor configured to:
      generate intermediate information composed of information generated in the course of character recognition processing from the captured image, and
      transmit the intermediate information to the external device through the communication interface.
  2.  The information processing apparatus according to claim 1, wherein
     the processor is configured to:
      extract character candidates from the captured image,
      extract a connection pattern of the character candidates, and
      calculate a score indicating the possibility that a character written in each character candidate is a predetermined character string; and
     the intermediate information includes the connection pattern of the character candidates and the scores.
  3.  The information processing apparatus according to claim 2, wherein the intermediate information includes coordinates and sizes of the character candidates.
  4.  The information processing apparatus according to any one of claims 1 to 3, wherein the intermediate information indicates whether the character string is handwritten or printed.
  5.  The information processing apparatus according to any one of claims 1 to 4, wherein the processor receives character string information related to the character string from the external device through the communication interface.
  6.  The information processing apparatus according to claim 5, wherein the character string is a destination.
  7.  The information processing apparatus according to claim 6, further comprising a sorting device interface for connecting to a sorting device for sorting articles, wherein the processor transmits the character string information to the sorting device through the sorting device interface.
  8.  A program executed by a processor, the program causing the processor to realize:
     a function of generating intermediate information composed of information generated in the course of character recognition processing from a captured image containing a character string; and
     a function of transmitting the intermediate information to an external device.
  9.  An information processing apparatus comprising:
     a communication interface for transmitting and receiving data to and from an external device; and
     a processor configured to:
      receive, from the external device through the communication interface, intermediate information composed of information generated in the course of character recognition processing from a captured image containing a character string,
      recognize the character string based on the intermediate information, and
      generate character string information related to the recognized character string.
  10.  The information processing apparatus according to claim 9, wherein
     the intermediate information includes a connection pattern of character candidates extracted from the captured image and a score indicating the possibility that a character written in each character candidate is a predetermined character, and
     the processor is configured to:
      calculate, based on the connection pattern and the score, an evaluation value indicating the possibility that the character string is a predetermined candidate, and
      recognize the character string based on the evaluation value.
  11.  The information processing apparatus according to claim 9 or 10, wherein the processor transmits character string information related to the character string to the external device through the communication interface.
  12.  A program executed by a processor, the program causing the processor to realize:
     a function of receiving, from an external device, intermediate information composed of information generated in the course of character recognition processing from a captured image containing a character string;
     a function of recognizing the character string based on the intermediate information; and
     a function of generating character string information related to the recognized character string.
  13.  A system comprising a first information processing apparatus and a second information processing apparatus, wherein
     the first information processing apparatus comprises:
      an image interface for acquiring a captured image containing a character string;
      a first communication interface connected to the second information processing apparatus; and
      a first processor configured to:
       generate intermediate information composed of information generated in the course of character recognition processing from the captured image, and
       transmit the intermediate information to the second information processing apparatus through the first communication interface; and
     the second information processing apparatus comprises:
      a second communication interface that transmits and receives data to and from the first information processing apparatus; and
      a second processor configured to:
       receive the intermediate information from the first information processing apparatus through the second communication interface,
       recognize the character string based on the intermediate information, and
       generate character string information related to the recognized character string.
PCT/JP2022/007871 2021-03-08 2022-02-25 Image processing apparatus, program, and system WO2022190900A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2021036368A JP2022136656A (en) 2021-03-08 2021-03-08 Information processing device, program, and system
JP2021-036368 2021-03-08

Publications (1)

Publication Number Publication Date
WO2022190900A1

Family

ID=83226780

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2022/007871 WO2022190900A1 (en) 2021-03-08 2022-02-25 Image processing apparatus, program, and system

Country Status (2)

Country Link
JP (1) JP2022136656A (en)
WO (1) WO2022190900A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03230287A (en) * 1990-02-06 1991-10-14 Ricoh Co Ltd Information transmitting/receiving system
JP2006092027A (en) * 2004-09-21 2006-04-06 Fuji Xerox Co Ltd Capital letter recognizing device, capital letter recognizing method and capital letter recognizing program
JP2017216497A (en) * 2016-05-30 2017-12-07 株式会社東芝 Image processing apparatus, image processing system, image processing method, and program
JP2020173669A (en) * 2019-04-11 2020-10-22 ソフトバンク株式会社 Image recognition device, image recognition method, image recognition program, and image recognition system

Also Published As

Publication number Publication date
JP2022136656A (en) 2022-09-21


Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22766859

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22766859

Country of ref document: EP

Kind code of ref document: A1