WO2022131733A1

WO2022131733A1 - Method, system, and non-transitory computer-readable recording medium for estimating information regarding object on basis of images in low power wide area network (lpwan) environment

Info

Publication number: WO2022131733A1
Application number: PCT/KR2021/018927
Authority: WO
Inventors: 정종수; 박수민; 송보근
Original assignee: 주식회사 콕스랩
Priority date: 2020-12-16
Filing date: 2021-12-14
Publication date: 2022-06-23
Also published as: KR20220086403A; KR102597045B1

Abstract

According to an aspect of the present invention, provided is a method for estimating information regarding an object on the basis of images in a lower power wide area network (LPWAN) environment, the method comprising the steps of: obtaining, by using an object detection model trained to estimate upper category information about an object to be detected from a captured image, a result of detecting the object from the captured image of the object and a partial image of the captured image, which is generated on the basis of the detection result; and estimating, by using an object classification model trained to estimate lower category information about the object from the partial image of the captured image, the lower category information about the object on the basis of the obtained detection result and the partial image, wherein the detection result includes the upper category information about the object.

Description

A method, system, and non-transitory computer-readable recording medium for estimating information about an object based on an image in an LPWAN (LOW POWER WIDE AREA NETWORK) environment

The present invention relates to a method, a system, and a non-transitory computer-readable recording medium for estimating information about an object based on an image in an LPWAN (Low Power Wide Area Network) environment.

In recent years, with the rapid development of artificial intelligence-related technologies, various technologies for a method of detecting an object in an image using artificial intelligence have been introduced.

As an example of the related art, a technique for detecting a specific object in the above captured image by analyzing a captured image obtained by a plurality of network cameras using an artificial neural network model can be exemplified. In order to detect an object from an image using

And, when it is necessary to acquire an image from a plurality of network cameras in real time and to respond in real time to a result of detecting a specific object in the image, not only the construction cost of such a computer system increases, but also a plurality of Since the amount of data exchanged between the network cameras and the above computer system increases, the communication cost becomes very high.

Recently, as an alternative to cost reduction, industrial edge computing solutions that lighten GPUs have emerged. .

In addition, recently, instead of using a high-end GPU, a microprocessor or FPGA (Field Programmable Gate Array) equipped with a function to accelerate only matrix multiplication is embedded in the embedded system, so that artificial neural network models can be modeled at low cost without consuming large amounts of power. There are also attempts to support object detection based on it. However, in this case, there is a problem that the artificial neural network model, which is software, needs to be deformed and lightened appropriately for the above special hardware, and the performance of the artificial neural network model is deteriorated due to such deformation and weight reduction.

On the other hand, concepts such as edge computing and on-device AI, which have been in the spotlight recently, reduce the load on the central server by distributing the functions of the central server, and reduce the amount of data exchanged with the central server through the communication network. The goal is to increase the response speed of the model. In particular, reducing the communication load, that is, the amount of data exchanged with the central server, is a factor that directly affects the reduction of system construction cost in addition to increasing the response speed of AI models.

However, in constructing an object detection system using an artificial neural network model, a processor with limited computing power is used instead of a high-spec GPU for the purpose of reducing the system construction cost, and a broadband network such as Ethernet, Wi-Fi, LTE, etc. Instead, if a bandwidth-limited communication network such as LPWAN (Low Power Wide Area Network) is used (that is, the amount of data exchanged with the server is minimized), the artificial neural network model will be better than when using a high-end GPU and broadband network. performance is likely to deteriorate. This is because, as described above, many calculations are required for object detection based on an artificial neural network model.

Accordingly, the present inventor(s) supports the construction of an object detection system using an artificial neural network model at a low cost by using a network camera having only limited computing power and an LPWAN having a limited bandwidth, while supporting the performance of the artificial neural network model (ie, detection speed and accuracy) is also proposed to support a high level of support.

An object of the present invention is to solve all of the problems of the prior art described above.

In addition, the present invention provides an object detection model that is learned to estimate upper category information of an object to be detected in a captured image, based on the detection result of the object in the captured image of the object and the detection result Obtaining the generated partial image of the captured image, and using the object classification model trained to estimate sub-category information of the object from the partial image regarding the captured image, the obtained detection result and the above Another purpose is to estimate sub-category information of the above object based on a partial image of , and to include upper-category information of the above object in the detection result.

In addition, the present invention detects the above object in the captured image of the above object using an object detection model that is learned to estimate upper category information of the object to be detected in the captured image, and the detection result and the above detection Another purpose is to transmit a partial image related to the above captured image generated based on the result to the server, and to include upper category information of the above object in the above detection result.

Another object of the present invention is to support the construction of an object detection system using an artificial neural network model at a low cost while maintaining high performance of the artificial neural network model.

A representative configuration of the present invention for achieving the above object is as follows.

According to an aspect of the present invention, an object detection model is trained to estimate upper category information of an object to be detected in the captured image, and the object is detected in the captured image of the object and generated based on the detection result obtaining a partial image related to the captured image, and using an object classification model trained to estimate sub-category information of the object from the partial image regarding the captured image and estimating lower category information of the object based on the method, wherein the detection result includes upper category information of the object.

According to another aspect of the present invention, detecting the object in a captured image of the object using an object detection model trained to estimate upper category information of the object to be detected in the captured image, and the detection result and the detection There is provided a method comprising transmitting a partial image related to the captured image generated based on a result to a server, wherein the detection result includes upper category information of the object.

According to another aspect of the present invention, based on a result of detecting the object in a captured image of the object and the detection result using an object detection model that is learned to estimate upper category information of an object to be detected in the captured image The obtained detection result and There is provided a system comprising an object classifier for estimating lower category information of the object based on the partial image, and the detection result includes upper category information of the object.

According to another aspect of the present invention, an object detection unit for detecting the object from a captured image of the object by using an object detection model trained to estimate upper category information of an object to be detected from the captured image, and the detection result and A system is provided, comprising: a detection result management unit configured to transmit a partial image related to the captured image generated based on the detection result to a server, wherein the detection result includes upper category information of the object.

In addition to this, another method for implementing the present invention, another system, and a non-transitory computer-readable recording medium for recording a computer program for executing the method are further provided.

According to the present invention, it is possible to support the construction of an object detection system using an artificial neural network model at a low cost while maintaining the performance of the artificial neural network model high.

1 is a diagram illustrating a schematic configuration of an entire system for estimating information about an object based on an image in an LPWAN environment according to an embodiment of the present invention.

2 is a diagram illustrating in detail an internal configuration of a server according to an embodiment of the present invention.

3 is a diagram illustrating in detail an internal configuration of a network camera according to an embodiment of the present invention.

4 is a diagram exemplarily illustrating a process of estimating information about an object according to an embodiment of the present invention.

100: communication network

200: server

210: detection result acquisition unit

220: object classification unit

230: model management unit

300: network camera

310: object detection unit

320: detection result management unit

240, 330: communication department

250, 340: control unit

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [0012] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [0010] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [0010] Reference is made to the accompanying drawings, which show by way of illustration specific embodiments in which the present invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the present invention. It should be understood that the various embodiments of the present invention are different but need not be mutually exclusive. For example, certain shapes, structures, and characteristics described herein may be implemented with changes from one embodiment to another without departing from the spirit and scope of the present invention. In addition, it should be understood that the location or arrangement of individual components within each embodiment may be changed without departing from the spirit and scope of the present invention. Accordingly, the following detailed description is not to be taken in a limiting sense, and the scope of the present invention should be taken as encompassing the scope of the claims and all equivalents thereto. In the drawings, like reference numerals refer to the same or similar elements throughout the various aspects.

Hereinafter, various preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings in order to enable those of ordinary skill in the art to easily practice the present invention.

Whole system configuration

As shown in FIG. 1 , the entire system according to an embodiment of the present invention may include an LPWAN 100 , a server 200 , and a network camera 300 .

First, LPWAN (Low Power Wide Area Network; 100) according to an embodiment of the present invention means a low-power wireless wide area network that has a very wide service range of 10 km or more and provides a communication speed of several hundred kilobits per second (kbps) do. Such LPWAN 100 includes LoRaWAN, SIGFOX, LTE-MTC, Narrowband Internet of Things (NB-IoT), and the like, and in particular, LoRaWAN Although the communication speed is slower than the existing short-distance wireless communication such as silver, Wi-Fi, Bluetooth, and Zigbee, long-distance communication of about 30 km in open areas and about 1 km in the city center is possible.

Next, the server 200 according to an embodiment of the present invention detects the above object in the captured image of the above object using an object detection model that is learned to estimate upper category information of the object to be detected in the captured image. Using an object classification model that is trained to acquire a partial image of the captured image generated based on the obtained result and the detection result above, and estimate sub-category information of the object from the partial image regarding the captured image Accordingly, it is possible to perform a function of estimating sub-category information of the above object based on the obtained detection result and the above partial image. Here, according to an embodiment of the present invention, the above detection result may include upper category information of the above object.

Next, the network camera 300 according to an embodiment of the present invention detects the above object in the captured image of the above object using an object detection model that is learned to estimate upper category information of the object to be detected in the captured image. Detect and transmit the detection result and the partial image related to the captured image generated based on the detection result to the server. Here, according to an embodiment of the present invention, the above detection result may include upper category information of the above object.

The configuration and functions of the server 200 and the network camera 300 according to the present invention will be described in detail through the following detailed description.

On the other hand, the network camera 300 according to an embodiment of the present invention is a digital device including a function capable of communicating with the server 200 and an image capturing function in an LPWAN environment, and is provided with a memory means and equipped with a microprocessor. Any digital device with computing capability may be employed as the network camera 300 according to the present invention. Here, the network camera 300 according to an embodiment of the present invention may refer to the network camera itself (eg, a commercial security camera), but is connected (or combined) with the network camera by wire and/or wirelessly. It may also refer to a hardware device that can be used inclusively.

Meanwhile, the server 200 and the network camera 300 may include an application (not shown) supporting a function of estimating information about an object based on an image in an LPWAN environment according to the present invention. Such an application may be downloaded from an external application distribution server (not shown). Here, at least a part of the application may be replaced with a hardware device or a firmware device capable of performing substantially the same or equivalent function as the application, if necessary.

server configuration

Hereinafter, the internal configuration of the server 200 that performs an important function for the implementation of the present invention and the function of each component will be described.

2 is a diagram illustrating in detail the internal configuration of the server 200 according to an embodiment of the present invention.

As shown in FIG. 2 , the server 200 according to an embodiment of the present invention includes a detection result acquisition unit 210 , an object classification unit 220 , a model management unit 230 , a communication unit 240 , and a control unit ( 250) may be included. According to an embodiment of the present invention, the detection result acquisition unit 210, the object classification unit 220, the model management unit 230, the communication unit 240, and the control unit 250, at least some of them are external systems (not shown) and may be a program module in communication with. Such a program module may be included in the server 200 in the form of an operating system, an application program module, or other program modules, and may be physically stored in various known storage devices. Also, such a program module may be stored in a remote storage device capable of communicating with the server 200 . Meanwhile, such a program module includes, but is not limited to, routines, subroutines, programs, objects, components, data structures, etc. that perform specific tasks or execute specific abstract data types according to the present invention.

On the other hand, although described above with respect to the server 200, this description is exemplary, and at least some of the components or functions of the server 200 may be realized or included in an external system (not shown) as needed. is apparent to those skilled in the art.

First, the detection result acquisition unit 210 according to an embodiment of the present invention uses an object detection model that is learned to estimate upper category information of an object to be detected in a captured image, and is configured to It is possible to perform a function of acquiring a result of detecting an object and a partial image related to the above captured image generated based on the above detection result.

Specifically, the object detection unit 310 according to an embodiment of the present invention may detect the above object from a captured image of the object using the above object detection model, and based on the detection result of the above object Thus, it is possible to generate a partial image related to the above captured image. In addition, the detection result acquisition unit 210 according to an embodiment of the present invention may acquire the detection result and the partial image in the LPWAN environment.

More specifically, according to an embodiment of the present invention, the above detection result may include upper category information of an object to be detected in a captured image and identification information of the network camera 300 that has performed the detection. That is, the detection result obtained by the detection result obtaining unit 210 according to an embodiment of the present invention may mean a result of detecting an object to be detected in the captured image by the network camera 300 at a higher category level. . And, according to an embodiment of the present invention, the upper category information may be associated with two or more lower category information of an object selected by a user, that is, an object to be detected in a captured image.

For example, according to an embodiment of the present invention, it may be assumed that 'bus' and 'truck' are selected by a user as sub-category information of an object to be detected in a captured image. In this case, the detection result acquisition unit 210 according to an embodiment of the present invention determines the upper category information of the above object as 'vehicle', or the user above determines the upper category information associated with the lower category information. As the information, appropriate information may be provided to the above user so that they can select 'vehicle', 'large vehicle', and the like.

For another example, according to an embodiment of the present invention, it may be assumed that 'person' is selected by a user as upper category information of an object to be detected in a captured image. In this case, the detection result acquisition unit 210 according to an embodiment of the present invention determines the sub-category information of the object as 'adult' and 'child', or the user As related sub-category information, appropriate information may be provided to the above user so that 'adult', 'child', 'man', 'female', etc. can be selected.

However, the types of upper category information and lower category information of an object and a method of determining corresponding information according to an embodiment of the present invention are not limited to those described above, and may vary within the scope capable of achieving the object of the present invention. can be changed to

Continuing, the partial image related to the captured image obtained by the detection result obtaining unit 210 according to an embodiment of the present invention is an image related to a region in which an object is detected in the captured image (eg, the above object). may mean an image cut along the boundary of a detected bounding box). And, when a plurality of objects are detected in the captured image, the object detection unit 310 according to an embodiment of the present invention may generate a partial image for each of the plurality of detected objects. In addition, the detection result acquisition unit 210 according to an embodiment of the present invention may acquire the respective generated partial images. Meanwhile, according to an embodiment of the present invention, the partial image above may mean only the partial image itself, but may also include information (eg, coordinates) about the position occupied by the partial image in the captured image. . Since the detection result acquisition unit 210 according to an embodiment of the present invention acquires only a partial image of the captured image, not the entire captured image of the object captured by the network camera 300 as described above, the LPWAN and Even in a communication environment where bandwidth is limited, the object detection system can be operated stably.

On the other hand, the object detection model used in the object detection unit 310 according to an embodiment of the present invention may be distributed by reducing the weight of the object detection model generated by the server 200 .

Specifically, the model manager 230 according to an embodiment of the present invention may generate an object detection model that is learned to estimate upper category information of an object to be detected from a captured image. According to an embodiment of the present invention, the learning data on the object to be detected in the photographed image used in generating such an object detection model is the photographed image about the object and the labeling data about the photographed image. It may include information about an area in which the above object is located in the image and information on the upper category of the above object. That is, the model manager 230 according to an embodiment of the present invention may train the object detection model so that the object detection model estimates upper category information instead of lower category information of the object. Also, in the object detection model, the types of objects to be detected are reduced, and thus, an object can be detected with higher accuracy while processing a smaller amount of calculations compared to the case of estimating sub-category information.

Continuing, the model manager 230 according to an embodiment of the present invention may reduce the weight of the object detection model generated as described above and distribute it to the network camera 300 .

More specifically, the model manager 230 according to an embodiment of the present invention generates an object detection model that is learned to estimate upper category information of an object to be detected from a captured image, and performs pruning and quantization. ) and knowledge distillation, it is possible to reduce the weight of the generated model by using an artificial neural network model lightweight algorithm. And, the model management unit 230 according to an embodiment of the present invention, in order to enable smooth use even in the network camera 300, which has lower computing power compared to the server 200, a lightweight model as described above. As an object detection model used by the object detection unit 310 according to an embodiment, it may be distributed to the network camera 300 . However, the weight reduction algorithm according to an embodiment of the present invention is not limited to the ones listed above, and may be variously changed within a range that can achieve the object of the present invention.

Meanwhile, the functions not described above with respect to the function of the object detection unit 310 according to an embodiment of the present invention will be described later.

Next, the object classifying unit 220 according to an embodiment of the present invention provides sub-category information of an object from a partial image related to a captured image obtained by the detection result acquiring unit 210 according to an embodiment of the present invention. Using an object classification model trained to estimate function can be performed.

Specifically, when a partial image related to a captured image is obtained, the object classifier 220 according to an embodiment of the present invention uses an object classification model that is learned to estimate sub-category information of an object from the partial image. , the above object based on the result of detecting the object in the above captured image by the object detection unit 310 according to an embodiment of the present invention (specifically, upper category information of the above object) and the above partial image of subcategory information can be estimated.

More specifically, the model manager 230 according to an embodiment of the present invention may generate an object classification model that is learned to estimate sub-category information of an object from the partial image above. According to an embodiment of the present invention, the learning data on the object used in generating such an object classification model is an image about a region in which the object is detected in a captured image of the object, that is, a partial image and As labeling data regarding the partial image, sub-category information of the above object may be included. That is, the model manager 230 according to an embodiment of the present invention may train the object detection model to estimate information on two or more subcategories of the object selected by the user. A function of specifically classifying an object to be detected in a captured image requires processing a large amount of calculation. According to an embodiment of the present invention, as described above, this function has a lower computing power than the network camera 300 as described above. It is possible to reduce the computational burden of the network camera 300 by performing the operation in the high server 200 .

Next, the communication unit 240 according to an embodiment of the present invention performs a function of enabling data transmission/reception to/from the detection result acquisition unit 210 , the object classification unit 220 , and the model management unit 230 . can

Finally, the control unit 250 according to an embodiment of the present invention functions to control the flow of data between the detection result acquisition unit 210 , the object classification unit 220 , the model management unit 230 , and the communication unit 240 . can be done That is, the control unit 250 according to the present invention controls the data flow to/from the outside of the server 200 or the data flow between each component of the server 200, so that the detection result obtaining unit 210, the object classifying unit 220 , the model management unit 230 , and the communication unit 240 may be controlled to perform their own functions, respectively.

Network camera configuration

Hereinafter, the internal configuration of the network camera 300 that performs an important function for the implementation of the present invention and the function of each component will be described.

3 is a diagram illustrating in detail an internal configuration of a network camera 300 according to an embodiment of the present invention.

As shown in FIG. 3 , a network camera 300 according to an embodiment of the present invention may be configured to include an object detection unit 310 , a detection result management unit 320 , a communication unit 330 , and a control unit 340 . can According to an embodiment of the present invention, the object detection unit 310, the detection result management unit 320, the communication unit 330, and the control unit 340 are program modules in which at least some of them communicate with an external system (not shown). can be Such a program module may be included in the network camera 300 in the form of an operating system, an application program module, or other program module, and may be physically stored in various known storage devices. Also, such a program module may be stored in a remote storage device capable of communicating with the network camera 300 . Meanwhile, such a program module includes, but is not limited to, routines, subroutines, programs, objects, components, data structures, etc. that perform specific tasks or execute specific abstract data types according to the present invention.

On the other hand, although described above with respect to the network camera 300, this description is exemplary, and at least some of the components or functions of the network camera 300 may be realized or included in an external system (not shown) as needed. It will be apparent to those skilled in the art that this may be the case.

First, the object detection unit 310 according to an embodiment of the present invention detects the object in the captured image of the object by using the object detection model that is learned to estimate upper category information of the object to be detected in the captured image. The detection function can be performed.

Specifically, the object detection unit 310 according to an embodiment of the present invention may detect the above object from a photographed image of the object by using the above object detection model, and the above detection result is The upper category information of the object and identification information of the network camera 300 that has performed the detection may be included.

Continuing, the object detection model used by the object detection unit 310 according to an embodiment of the present invention to detect an object at a higher category level in the captured image as described above is R-CNN (Region-based Convolutional Neural Networks), YOLO (You Only Look Once) or SSD (Single Shot Multibox Detector) may be generated based on an object recognition model based on an artificial neural network. However, the object recognition model based on the artificial neural network according to an embodiment of the present invention is not limited to the above-listed ones, and may be variously changed within the scope that can achieve the object of the present invention.

And, according to an embodiment of the present invention, the upper category information of the object to be detected in the captured image may be associated with information on at least two lower categories of the object selected by the user, that is, the object to be detected in the captured image. Meanwhile, since the upper category information and the lower category information have been described in detail above, a description of the corresponding content will be omitted here.

Specifically, the model manager 230 according to an embodiment of the present invention may generate an object detection model that is learned to estimate upper category information of an object to be detected from a captured image. In addition, the model manager 230 according to an embodiment of the present invention may reduce the weight of the object detection model generated as above and distribute it to the network camera 300 . Meanwhile, since the generation of the object detection model and the weight reduction of the model have been described in detail above, a description of the corresponding content will be omitted herein.

Next, the detection result management unit 320 according to an embodiment of the present invention may include a result of detecting the object in the captured image regarding the object as described above, and a portion related to the captured image generated based on the detection result. A function of transmitting an image to the server 200 may be performed.

Specifically, the object detection unit 310 according to an embodiment of the present invention may include an image (eg, a bounding box in which the above object is detected) about an area in which an object is detected at a higher category level in the captured image. image cut along the boundary of , that is, a partial image may be generated. And, when a plurality of objects are detected in the captured image, the object detection unit 310 according to an embodiment of the present invention may generate a partial image for each of the plurality of detected objects. In addition, the detection result management unit 320 according to an embodiment of the present invention may transmit the partial image generated as described above to the server. Meanwhile, according to an embodiment of the present invention, the partial image above may mean only the partial image itself, but may also include information (eg, coordinates) about the position occupied by the partial image in the captured image. .

Referring to FIG. 4 , according to an embodiment of the present invention, the network camera 300 (specifically, the object detection unit 310 ) collects upper category information (eg, vehicle) of an object to be detected in a captured image. The object may be detected from the captured image 410 of the object by using the object detection model learned to estimate ( 411 and 412 ).

Continuing to refer to FIG. 4 , the object detection unit 310 according to an embodiment of the present invention may generate

partial images

420 and 430 of the captured image 410 based on a result of detecting a corresponding object. . In addition, the detection result management unit 320 according to an embodiment of the present invention may transmit the detection result and the

partial images

420 and 430 above to the server 200 .

Continuing to refer to FIG. 4 , when the detection result acquisition unit 210 according to an embodiment of the present invention acquires the detection result and the

partial images

420 and 430 above, according to an embodiment of the present invention The object classifier 220 may estimate sub-category information of the above object by using an object classification model that is trained to estimate sub-category information (eg, sedan or truck) of the object from the partial image above. There are (420 and 450).

The embodiments according to the present invention described above may be implemented in the form of program instructions that can be executed through various computer components and recorded in a computer-readable recording medium. The computer-readable recording medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded on the computer-readable recording medium may be specially designed and configured for the present invention or may be known and used by those skilled in the computer software field. Examples of the computer-readable recording medium include hard disks, magnetic media such as floppy disks and magnetic tapes, optical recording media such as CD-ROMs and DVDs, and magneto-optical media such as floppy disks. medium), and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine language codes such as those generated by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like. A hardware device may be converted into one or more software modules to perform processing in accordance with the present invention, and vice versa.

In the above, the present invention has been described with reference to specific matters such as specific components and limited embodiments and drawings, but these are provided to help a more general understanding of the present invention, and the present invention is not limited to the above embodiments, and the present invention is not limited to the above embodiments. Those of ordinary skill in the art to which the invention pertains can make various modifications and changes from these descriptions.

Therefore, the spirit of the present invention should not be limited to the above-described embodiments, and the scope of the spirit of the present invention is not limited to the scope of the scope of the present invention. will be said to belong to

Claims

A method for estimating information about an object based on an image in an LPWAN (Low Power Wide Area Network) environment, the method comprising:

A result of detecting the object in the captured image of the object using an object detection model trained to estimate upper category information of an object to be detected from the captured image, and a partial image of the captured image generated based on the detection result obtaining, and

estimating sub-category information of the object based on the obtained detection result and the partial image using an object classification model trained to estimate sub-category information of the object from the partial image of the captured image do,

The detection result includes upper category information of the object.

Way.
According to claim 1,

The upper-level category information is associated with two or more lower-level category information of the object selected by the user.

Way.
3. The method of claim 1 or 2,

The object detection model is that the object detection model generated in the server is lightened and distributed

Way.
A method for estimating information about an object based on an image in an LPWAN (Low Power Wide Area Network) environment, the method comprising:

detecting the object in the captured image of the object by using an object detection model that is trained to estimate upper category information of the object to be detected in the captured image; and

Transmitting the detection result and the partial image related to the captured image generated based on the detection result to a server,

The detection result includes upper category information of the object.

Way.
5. The method of claim 4,

The upper-level category information is associated with two or more lower-level category information of the object selected by the user.

Way.
6. The method according to claim 4 or 5,

The object detection model is that the object detection model generated in the server is lightweight and distributed

Way.
A non-transitory computer-readable recording medium storing a computer program for executing the method according to any one of claims 1 to 6.
A system for estimating information about an object based on an image in an LPWAN (Low Power Wide Area Network) environment,

A result of detecting the object in the captured image of the object using an object detection model trained to estimate upper category information of an object to be detected from the captured image, and a partial image of the captured image generated based on the detection result A detection result obtaining unit to obtain

An object classification unit for estimating sub-category information of the object based on the obtained detection result and the partial image using an object classification model trained to estimate sub-category information of the object from the partial image of the captured image including,

The detection result includes upper category information of the object.

system.
9. The method of claim 8,

The upper-level category information is associated with two or more lower-level category information of the object selected by the user.

system.
10. The method according to claim 8 or 9,

The object detection model is that the object detection model generated in the server is lightweight and distributed

system.
As a system for estimating information about an object based on an image in a Low Power Wide Area Network (LPWAN) environment,

an object detection unit that detects the object from the captured image of the object using an object detection model trained to estimate upper category information of the object to be detected from the captured image; and

and a detection result management unit that transmits the detection result and a partial image related to the captured image generated based on the detection result to a server,

The detection result includes upper category information of the object.

system.
12. The method of claim 11,

The upper category information is associated with two or more lower category information of the object selected by the user

system.
13. The method of claim 11 or 12,

The object detection model is that the object detection model generated in the server is distributed in light weight

system.