CN112990187A - Target position information generation method based on handheld terminal image - Google Patents

Target position information generation method based on handheld terminal image

Info

Publication number
CN112990187A
CN112990187A (application CN202110436206.9A)
Authority
CN
China
Prior art keywords
object obj
camera
vehicle
image
height
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110436206.9A
Other languages
Chinese (zh)
Other versions
CN112990187B (en)
Inventor
孙敏
黄翔
楼夏寅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN202110436206.9A priority Critical patent/CN112990187B/en
Publication of CN112990187A publication Critical patent/CN112990187A/en
Application granted granted Critical
Publication of CN112990187B publication Critical patent/CN112990187B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/10 Image acquisition
    • G06V10/17 Image acquisition using hand-held instruments
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/50 Context or environment of the image
    • G06V20/52 Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07 Target detection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/08 Detecting or categorising vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention provides a target position information generation method based on a handheld terminal image, which comprises the following steps: when a suspicious target is found, the camera of the handheld terminal captures the target scene to obtain a monitoring image; the server recognizes the person objects and/or vehicle objects in the monitoring image, estimates the orientation of each person object and its actual distance from the camera center point, and estimates the orientation of each vehicle object and its actual distance from the camera center point; the server then generates informative text. Because targets are divided into person objects and vehicle objects, and a different distance-estimation algorithm is applied to each, the accuracy of target distance estimation is effectively improved.

Description

Target position information generation method based on handheld terminal image
Technical Field
The invention belongs to the technical field of target identification, and particularly relates to a target position information generation method based on a handheld terminal image.
Background
In the fields of public safety, military operations, emergency rescue, and outdoor exploration, the continuing spread of handheld terminals (such as mobile phones) means that discovering specific outdoor targets and collecting information about them can be accomplished entirely with these convenient, ubiquitous devices. In particular, for tasks involving civil information collection and analysis, such as discovering, recording, and reporting criminal suspects or misconduct, comprehensive analysis that combines images with geographic information yields reliable intelligence for decision-making and analysis by the relevant institutions or a team command center, while keeping collection simple and transmission fast.
Existing information collection systems suffer from low accuracy when acquiring a target's geographic position, which limits their popularization and application.
Disclosure of Invention
To address these defects in the prior art, the invention provides a target position information generation method based on a handheld terminal image that effectively solves these problems.
The technical scheme adopted by the invention is as follows:
the invention provides a target position information generation method based on a handheld terminal image, which comprises the following steps:
step 1, when a suspicious target is found, the camera of the handheld terminal captures the target scene to obtain a monitoring image tu(A); at the same time, the handheld terminal records the camera position and attitude at the moment of capture, including: the position coordinates O(x_0, y_0) of the camera center point O, the azimuth β of the camera main optical axis, and the pitch angle k of the camera main optical axis; the azimuth β of the camera main optical axis is the angle between the main optical axis and due north;
step 2, the handheld terminal uploads the monitoring image tu(A) and the camera position and attitude information to a server through a wireless communication module;
step 3, the server performs object recognition on the monitoring image tu(A) and detects whether a person object obj(r) and/or a vehicle object obj(c) is present in tu(A); if not, no suspicious target exists in tu(A) and the process ends; if yes, step 4 is executed;
step 4, the server identifies the person object obj(r) and/or the vehicle object obj(c) in the monitoring image tu(A); for a person object obj(r), steps 5-6 are used to estimate the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O;
for a vehicle object obj(c), steps 7-8 are used to estimate the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O;
Step 5, estimating the orientation α_r of the person object obj(r), as follows:
step 5.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_r, on tu(A), from the imaging point of the person object obj(r) to the image center of tu(A);
step 5.2, the orientation α_r of the person object obj(r) is obtained by the following formula:
α_r = arctan(x_r / f) - β
wherein:
the orientation α_r of the person object obj(r) is the angle between the line connecting obj(r) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(r) relative to due north;
step 6, estimating the actual distance S_r between the person object obj(r) and the camera center point O, as follows: the server reads the pitch angle k of the camera main optical axis; if k is smaller than the person-object pitch-angle threshold k_max, step 6.1 is executed; otherwise, step 6.2 is executed;
step 6.1, the server analyzes the monitoring image tu(A), recognizes the head pixel height m of the person object obj(r) on tu(A), and, according to the head pixel height m and the head-pixel threshold m_min, obtains the distance D_r from the projection point of obj(r) in the camera main-optical-axis direction to the camera center point O by the following formula, then executes step 6.3:
D_r = H_2 f^2 sin(arctan(x_r / f)) / (x_r m), if m > m_min
D_r = H_1 f^2 sin(arctan(x_r / f)) / (x_r M), if m ≤ m_min
wherein:
f is the focal length of the camera;
M is the whole-person pixel height, namely: the pixel value, on tu(A), of the height of the minimum bounding rectangle of the person object obj(r), obtained by analyzing tu(A);
H_1 is the typical actual height of a person, a preset fixed value;
H_2 is the typical actual head height of a person, a preset fixed value;
step 6.2, the distance D_r from the projection point of the person object obj(r) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 6.3 is executed:
D_r = cos k · H_2 f^2 sin(arctan(x_r / f)) / (x_r m), if m > m_min
D_r = cos k · H_1 f^2 sin(arctan(x_r / f)) / (x_r M), if m ≤ m_min
step 6.3, the actual distance S_r between the person object obj(r) and the camera center point O is obtained by the following formula:
S_r = D_r / cos δ_r
wherein: δ_r is the angle between the line connecting the person object obj(r) with the camera center point O and the camera main optical axis; δ_r = α_r + β;
Step 7, estimating the orientation α_c of the vehicle object obj(c), as follows:
step 7.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_c, on tu(A), from the imaging point of the vehicle object obj(c) to the image center of tu(A);
step 7.2, the orientation α_c of the vehicle object obj(c) is obtained by the following formula:
α_c = arctan(x_c / f) - β
wherein:
the orientation α_c of the vehicle object obj(c) is the angle between the line connecting obj(c) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(c) relative to due north;
step 8, estimating the actual distance S_c between the vehicle object obj(c) and the camera center point O, as follows:
step 8.1, the server analyzes the monitoring image tu(A) and identifies the minimum circumscribed rectangle of the vehicle object obj(c) on tu(A); the height of this rectangle is the vehicle pixel height h;
if the vehicle pixel height h > λf, where λ is a scaling factor with a known fixed value, step 8.2 is executed; otherwise, step 8.3 is executed;
step 8.2, this case indicates that the actual distance S_c between the vehicle object obj(c) and the camera center point O is very small, i.e., S_c ≈ 0; that is, the position of obj(c) is approximately the position of the camera center point O; then step 9 is executed;
step 8.3, the distance D_c from the projection point of the vehicle object obj(c) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 8.4 is executed:
D_c = l_1 f^2 sin(arctan(x_c / f)) / (x_c h), if k ≤ k_min
D_c = cos k · l_2 f^2 sin(arctan(x_c / f)) / (x_c h), if k > k_min
wherein:
l_2 is the typical actual width of a vehicle, a preset fixed value;
k_min is the vehicle-object pitch-angle threshold;
h is the pixel height of the vehicle object obj(c), namely: the pixel value, on tu(A), of the height of the minimum circumscribed rectangle of obj(c), obtained by analyzing tu(A);
l_1 is the typical actual height of a vehicle, a preset fixed value;
step 8.4, the actual distance S_c between the vehicle object obj(c) and the camera center point O is obtained by the following formula:
S_c = D_c / cos δ_c
wherein: δ_c is the angle between the line connecting the vehicle object obj(c) with the camera center point O and the camera main optical axis; δ_c = α_c + β;
then step 9 is executed;
step 9, for a person object obj(r), the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(r);
for a vehicle object obj(c), the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(c);
step 10, the server generates informative text, which includes the position coordinates of the identified person object obj(r) and/or the position coordinates of the identified vehicle object obj(c).
Preferably, in step 3, the server performs object recognition on the monitoring image tu(A) as follows:
the server performs object recognition on tu(A) using a trained machine learning network.
Preferably, the server performs object recognition on the monitoring image tu(A) using a trained machine learning network as follows:
if the server recognizes that a person object obj(r) exists in the monitoring image tu(A), it further recognizes the person's age and determines from the age the typical actual height H_1 and the typical actual head height H_2 of the person;
if the server recognizes that a vehicle object obj(c) exists in the monitoring image tu(A), it further recognizes the vehicle type and determines from the type the typical actual width l_2 and the typical actual height l_1 of the vehicle.
Preferably, after step 10, the method further comprises:
step 11, after the position coordinates of the person object obj(r) and/or the vehicle object obj(c) are obtained, the server, using a lightweight map service program and a buffer-analysis module, performs a buffer analysis around the target position to obtain the geographic information of the scene where obj(r) and/or obj(c) is located, and fuses obj(r) and/or obj(c) with the geographic information of the scene to generate the informative text.
The target position information generation method based on the handheld terminal image has the following advantages:
the object is divided into the character object and the vehicle object, and different distance recognition algorithms are adopted for the character object and the vehicle object respectively, so that the accuracy of target object distance recognition is effectively improved.
Drawings
FIG. 1 is a schematic flow chart of a method for generating target location information based on a handheld terminal image according to the present invention;
FIG. 2 is a schematic diagram of an implementation of a method for generating target location information based on a handheld terminal image according to the present invention;
FIG. 3 is a horizontal projection view of the estimation of the geographic coordinates of a human object;
FIG. 4 is a graph of the relationship between the actual height of a person's head and the height of its imaged head pixels in the vertical direction;
FIG. 5 is a schematic diagram of the relationship between the actual height of the target and its imaging height in the vertical direction;
fig. 6 is a schematic diagram of distance calculation considering the pitch angle k of the main optical axis of the camera.
Detailed Description
In order to make the technical problems, technical solutions and advantageous effects solved by the present invention more clearly apparent, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention lies at the intersection of mobile terminal technology, digital map technology, automatic recognition technology, public security, military applications, and related fields. It is aimed mainly at public-safety and military reconnaissance: a handheld terminal or other portable device records the geographic position of a sensitive ground target or person and quickly uploads it through a wireless transmission module to a remote server, which automatically generates intelligence for the relevant personnel to analyze and judge.
Referring to fig. 1 and 2, the present invention provides a method for generating target location information based on a handheld terminal image, comprising the following steps:
step 1, when a suspicious target is found, the camera of the handheld terminal captures the target scene to obtain a monitoring image tu(A); at the same time, the handheld terminal records the camera position and attitude at the moment of capture, including: the position coordinates O(x_0, y_0) of the camera center point O, the azimuth β of the camera main optical axis, and the pitch angle k of the camera main optical axis; the azimuth β of the camera main optical axis is the angle between the main optical axis and due north;
the handheld terminal includes but is not limited to a mobile phone with a camera, a tablet computer and other terminal devices; the hand-held terminal is provided with a position and posture measuring sensor module used for obtaining the position and posture information of the camera when the monitoring image tu (A) is collected. Specifically, the handheld terminal can obtain the current position coordinates of the shooting point through a built-in or external satellite positioning module thereof, namely: position coordinate O (x) of camera center point O0,y0) (ii) a Through its built-in or external gesture and gyrosensor, when can obtain current shooting, orientation and the attitude information of camera, promptly: the azimuth beta of a main optical axis of the camera and the pitch angle k of the main optical axis of the camera are calculated;
step 2, the handheld terminal uploads the monitoring image tu(A) and the camera position and attitude information to a server through a wireless communication module;
step 3, the server performs object recognition on the monitoring image tu(A) and detects whether a person object obj(r) and/or a vehicle object obj(c) is present in tu(A); if not, no suspicious target exists in tu(A) and the process ends; if yes, step 4 is executed;
Specifically, the invention mainly recognizes these two types of target because personnel and vehicles are the primary targets of public-safety and military reconnaissance; the targets recognized by the invention are, however, not limited to personnel and vehicles.
In this step, the server performs object recognition on the monitoring image tu(A), specifically:
the server performs object recognition on tu(A) using a trained machine learning network. For example, a machine learning algorithm first trains a target-recognition neural network on large datasets of people and vehicles; once a satisfactory target-recognition network is obtained, it is used to recognize the targets in the scene photographed by the handheld terminal. A well-trained target-recognition network can recognize targets in the scene reliably; for example, it can recognize whether a person in the scene is an adult or a child, and whether a vehicle in the scene is a car or a truck.
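As an illustration of this step only, the sketch below substitutes a stock pretrained detector (torchvision's Faster R-CNN trained on COCO) for the patent's trained network; the image file name, the 0.5 score threshold, and the person/car/truck label mapping are assumptions, and the finer recognition of age and vehicle type is not shown:

    import torch
    from torchvision.io import read_image
    from torchvision.models.detection import fasterrcnn_resnet50_fpn
    from torchvision.transforms.functional import convert_image_dtype

    PERSON, CAR, TRUCK = 1, 3, 8  # COCO category ids

    model = fasterrcnn_resnet50_fpn(weights="DEFAULT").eval()
    img = convert_image_dtype(read_image("tu_A.jpg"), torch.float)  # monitoring image tu(A)

    with torch.no_grad():
        det = model([img])[0]  # dict with "boxes", "labels" and "scores"

    for box, label, score in zip(det["boxes"], det["labels"], det["scores"]):
        if score < 0.5:
            continue
        x1, y1, x2, y2 = box.tolist()  # minimum bounding rectangle on tu(A)
        if label.item() == PERSON:
            print("person object obj(r), pixel height:", y2 - y1)
        elif label.item() in (CAR, TRUCK):
            print("vehicle object obj(c), pixel height:", y2 - y1)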
Specifically, if the server recognizes that a person object obj(r) exists in the monitoring image tu(A), it further recognizes the person's age and determines from the age the typical actual height H_1 and the typical actual head height H_2 of the person;
if the server recognizes that a vehicle object obj(c) exists in the monitoring image tu(A), it further recognizes the vehicle type and determines from the type the typical actual width l_2 and the typical actual height l_1 of the vehicle.
Step 4, the server identifies the person object obj(r) and/or the vehicle object obj(c) in the monitoring image tu(A); for a person object obj(r), steps 5-6 are used to estimate the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O;
for a vehicle object obj(c), steps 7-8 are used to estimate the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O;
In the invention, when the actual distances of the person object obj(r) and the vehicle object obj(c) from the camera center point O are estimated, two effects must be accounted for: objects at different actual distances from the camera center point O image at different pixel sizes in the captured monitoring image, and the pixel size of an object in the scene also changes with the attitude and orientation of the handheld terminal during shooting. The invention therefore proposes the following precise estimation method for the geographic positions of the person object obj(r) and the vehicle object obj(c).
Step 5, estimating the orientation α_r of the person object obj(r), as follows:
step 5.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_r, on tu(A), from the imaging point of the person object obj(r) to the image center of tu(A);
the imaging point of the person object obj(r) on the monitoring image tu(A) is denoted G in fig. 3;
step 5.2, the orientation α_r of the person object obj(r) is obtained by the following formula:
α_r = arctan(x_r / f) - β
wherein:
the orientation α_r of the person object obj(r) is the angle between the line connecting obj(r) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(r) relative to due north;
as shown in fig. 3, a horizontal projection of the geographic-coordinate estimation for the person object: the person object obj(r) is denoted A and the camera center point is denoted O; the image plane of obj(r) after imaging by the camera is denoted B; the Z axis is the direction of the camera main optical axis, the Y axis is the vertical direction, and the X axis is determined from the Z and Y axes by the right-hand rule;
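The orientation formula of step 5.2 (and, identically, step 7.2 for vehicles) is a one-liner; a minimal sketch, assuming x_r is signed and positive when the imaging point lies to the right of the image center:

    import math

    def target_orientation(x_t: float, f: float, beta: float) -> float:
        """Orientation of a target relative to due north (step 5.2 / step 7.2).

        x_t  : signed pixel distance from the image center to the target's
               imaging point (positive to the right of center, by assumption)
        f    : focal length in pixels
        beta : azimuth of the camera main optical axis, radians from due north
        """
        # arctan(x_t / f) is the angle between the line of sight to the target
        # and the main optical axis; the patent then refers it to due north.
        return math.atan2(x_t, f) - beta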
step 6, estimating the actual distance S_r between the person object obj(r) and the camera center point O, i.e., the distance from point A to point O in fig. 3; the method is: the server reads the pitch angle k of the camera main optical axis; if k is smaller than the person-object pitch-angle threshold k_max, step 6.1 is executed; otherwise, step 6.2 is executed;
The main idea in estimating the actual distance S_r between the person object obj(r) and the camera center point O is as follows:
first, consider whether the pitch angle k of the camera main optical axis is smaller than the person-object pitch-angle threshold k_max. If so, the target and the observer can be considered to lie approximately in the same horizontal plane, and step 6.1 is performed. Otherwise, there is a large height difference between the observer position and the target position, for example when the observer photographs the target looking down from a high vantage point; in that case, for accurate estimation, the influence of the main-optical-axis pitch angle k on the distance estimate must be considered, and step 6.2 is performed.
Step 6.1, the server analyzes the monitoring image tu (A), recognizes the head pixel height m of the person object obj (r) on the monitoring image tu (A), and sets a threshold value m according to the head pixel height m and the head pixelminThe distance D from the projection point of the character object obj (r) in the main optical axis direction of the camera to the camera center point O is obtained according to the following formularThen step 6.3 is executed;
Figure BDA0003033184750000101
wherein:
f is the focal length of the camera;
m is the pixel value of the height of the human body, namely: on the monitor image tu (a), the pixel value in the height direction of the minimum bounding rectangle of the human object obj (r) is obtained by analyzing the monitor image tu (a);
H1the general actual height value of the person is a preset fixed value; for example, take 1.7 meters;
H2the height value of the general actual head of a person is a preset fixed value; for example, take0.56 m;
the implementation concept of the step 6.1 is as follows:
when the object is a human object obj (r), a threshold value m is set according to the height m of the head pixel and the head pixelminThereby determining the distance between the human object obj (r) and the camera;
specifically, the method comprises the following steps:
if m > mminThen the representative character object obj (r) is closer to the camera, at this time, the general actual head height value H of the person is adopted2And head pixel height m, estimating the distance between the head pixel height m and a photographer, and adopting the following principle: the general actual head height values H of different character objects when they are closer to the camera2The difference is not large, and the actual head height values of different character objects can be ignored, so that H2The preset fixed value meets the precision requirement; meanwhile, the height m of the head pixel is large, and the requirement of accurate measurement on an image is met.
And when m is less than or equal to mminAt this time, the representative person object obj (r) is far from the camera, and at this time, the general actual height value H of the person is adopted1And the distance between the pixel value M of the height of the person and the photographer is estimated according to the principle: when the person object is far away from the camera, the imaging size of the person image on the image is small, so that the value of the head pixel height M with a small size cannot be accurately measured, and therefore, the pixel value M of the person height with a large pixel is taken as a calculation target to ensure the accuracy requirement.
In practical application, the head-pixel threshold m_min is determined as follows:
referring to fig. 4, which shows the relationship between the actual height of a person's head and its imaged head pixel height in the vertical direction, the head pixel height is denoted m and the actual head height is denoted H_t. The actual head height of an adult is generally 54-58 cm, with a median of 56 cm. If the head images too small, recognition becomes difficult; empirically, the minimum head height on the image is therefore taken as 5 pixels, i.e., when the head height is below 5 pixels the head is not processed separately and the whole-person height is used directly.
The formulas D_r = H_1 f^2 sin(arctan(x_r / f)) / (x_r M) and D_r = H_2 f^2 sin(arctan(x_r / f)) / (x_r m) are derived on the same principle; only the former is derived below:
1) It should be emphasized that fig. 3 is a schematic diagram of the target position and its image in the horizontal direction, i.e., in the X-Z plane, while fig. 5 is a schematic diagram of the relationship between the actual height of the target and its imaging height in the vertical direction.
When the target moves from its original position parallel to the main-optical-axis direction, the ratio of change in the horizontal direction is the same as the ratio of change in the vertical direction.
If the actual height of the target is known, assume the typical person height H_1; its height referred to the main-optical-axis direction is H'_1, giving the following formula (1):
H'_1 = D_r H_1 / S_r (1)
namely: the height referred to the main-optical-axis direction equals the original height multiplied by the ratio D_r / S_r.
2) From the horizontal-direction schematic of fig. 3, it can be seen that:
D_r / S_r = f / (x_r / sin δ_r) (2)
3) Combining formula (1) and formula (2) gives the following formula (3):
H'_1 = D_r H_1 / S_r = f H_1 sin δ_r / x_r (3)
4) Regardless of the camera attitude, the following geometric relationship holds:
M / H'_1 = f / D_r (4)
thus the following formula (5):
D_r = f H'_1 / M (5)
5) Combining formula (5) and formula (3) gives formula (6):
D_r = f^2 H_1 sin δ_r / (x_r M) (6)
also, since in fig. 3 δ_r = arctan(x_r / f), the following relationship is obtained:
D_r = H_1 f^2 sin(arctan(x_r / f)) / (x_r M) (7)
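A quick numerical check of formula (7), with assumed values for the focal length and the pixel measurements:

    import math

    # Assumed values: H_1 = 1.7 m, f = 3000 px, x_r = 400 px, M = 120 px.
    H1, f, x_r, M = 1.7, 3000.0, 400.0, 120.0

    delta_r = math.atan(x_r / f)                     # ~0.1326 rad
    D_r = H1 * f**2 * math.sin(delta_r) / (x_r * M)  # ~42.1 m along the optical axis
    S_r = D_r / math.cos(delta_r)                    # ~42.5 m to the target (step 6.3)
    print(round(D_r, 1), round(S_r, 1))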
step 6.2, the distance D_r from the projection point of the person object obj(r) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 6.3 is executed:
D_r = cos k · H_2 f^2 sin(arctan(x_r / f)) / (x_r m), if m > m_min
D_r = cos k · H_1 f^2 sin(arctan(x_r / f)) / (x_r M), if m ≤ m_min
as shown in fig. 6, a schematic diagram of the distance calculation taking the pitch angle k of the camera main optical axis into account; in fig. 6 the distance is calculated using the person's height as the reference. As fig. 6 shows, when the pitch angle k is taken into account, D_r equals the value of D_r computed without considering k, multiplied by cos k.
Step 6.3, obtaining the actual distance S between the character object obj (r) and the camera center point O according to the following formular
Sr=Dr/cosδr
Wherein: deltarThe included angle between the connecting line of the character object obj (r) and the central point O of the camera and the main optical axis of the camera; deltar=αr+β;
Step 7, estimating the orientation α_c of the vehicle object obj(c), as follows:
step 7.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_c, on tu(A), from the imaging point of the vehicle object obj(c) to the image center of tu(A);
step 7.2, the orientation α_c of the vehicle object obj(c) is obtained by the following formula:
α_c = arctan(x_c / f) - β
wherein:
the orientation α_c of the vehicle object obj(c) is the angle between the line connecting obj(c) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(c) relative to due north;
step 8, estimating the actual distance S_c between the vehicle object obj(c) and the camera center point O, as follows:
step 8.1, the server analyzes the monitoring image tu(A) and identifies the minimum circumscribed rectangle of the vehicle object obj(c) on tu(A); the height of this rectangle is the vehicle pixel height h;
if the vehicle pixel height h > λf, where λ is a scaling factor with a known fixed value, step 8.2 is executed; otherwise, step 8.3 is executed;
When the invention estimates the distance of the vehicle object obj(c), the main idea is as follows:
the λ value determines how the distance between the vehicle object obj(c) and the camera is computed.
Specifically:
if the vehicle pixel height h > λf, the actual distance S_c between the vehicle object obj(c) and the camera center point O is very small; at such short range, the limitations of the shooting angle and state make the vehicle height difficult to recognize, so, as an approximation, the position of obj(c) is taken to be the position of the camera center point O.
If the vehicle pixel height h ≤ λf, the actual distance S_c between obj(c) and the camera center point O is larger; this case is further divided into two:
in the first case, the pitch angle k of the camera main optical axis is small, i.e., k ≤ k_min, indicating that the target and the observer can be considered to lie approximately in the same horizontal plane; in this case the typical actual vehicle height l_1 is used as the reference in the distance calculation;
in the second case, the pitch angle k of the camera main optical axis is large, i.e., k > k_min, indicating a large height difference between the observer position and the target position, for example when the observer photographs the target looking down from a high position; in this case the whole vehicle body is captured, so the typical actual vehicle width l_2 is used as the reference in the distance calculation.
λ may be taken as 0.15 and can be determined as follows:
1) Assume the distance D_c from the projection point of the vehicle object obj(c) in the camera main-optical-axis direction to the camera center point O is less than 10 meters, i.e., when D_c < 10 the distance is considered too close: the limitations of the shooting angle and state make the vehicle height difficult to recognize, and the following approximation is made:
since h / H'_c = f / D_c
wherein:
h is the vehicle pixel height;
H'_c is the vehicle height referred to the main-optical-axis direction, taken as 1.5 meters for a conventional vehicle;
2) it follows that h > 0.15 f, i.e., λ = 0.15.
step 8.2, this case indicates that the actual distance S_c between the vehicle object obj(c) and the camera center point O is very small, i.e., S_c ≈ 0; that is, the position of obj(c) is approximately the position of the camera center point O; then step 9 is executed;
step 8.3, the distance D_c from the projection point of the vehicle object obj(c) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 8.4 is executed:
D_c = l_1 f^2 sin(arctan(x_c / f)) / (x_c h), if k ≤ k_min
D_c = cos k · l_2 f^2 sin(arctan(x_c / f)) / (x_c h), if k > k_min
wherein:
l_2 is the typical actual width of a vehicle, a preset fixed value; for example, 1.7 meters;
k_min is the vehicle-object pitch-angle threshold; in practical applications it is taken as 15 degrees;
h is the pixel height of the vehicle object obj(c), namely: the pixel value, on tu(A), of the height of the minimum circumscribed rectangle of obj(c), obtained by analyzing tu(A);
l_1 is the typical actual height of a vehicle, a preset fixed value;
step 8.4, the actual distance S_c between the vehicle object obj(c) and the camera center point O is obtained by the following formula:
S_c = D_c / cos δ_c
wherein: δ_c is the angle between the line connecting the vehicle object obj(c) with the camera center point O and the camera main optical axis; δ_c = α_c + β;
Then step 9 is executed;
step 9, for a person object obj(r), the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(r);
for a vehicle object obj(c), the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(c);
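Step 9 reduces to a polar-to-Cartesian conversion around the camera position. A minimal sketch, assuming a local metric grid with x as easting and y as northing, so a planar approximation is adequate at these short ranges:

    import math

    def target_position(x0, y0, alpha, S):
        """Position of a target from camera position O(x0, y0), target
        orientation alpha (radians from due north), and actual distance S."""
        return (x0 + S * math.sin(alpha),  # eastward offset
                y0 + S * math.cos(alpha))  # northward offset

    # Example: camera at (500000.0, 4300000.0), bearing 30 degrees, 42.5 m away.
    print(target_position(500000.0, 4300000.0, math.radians(30), 42.5))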
step 10, the server generates informative text, which includes the position coordinates of the identified person object obj(r) and/or the position coordinates of the identified vehicle object obj(c).
Step 11, after obtaining the position coordinates of the character object obj (r) and/or the vehicle object obj (c), the server obtains the geographic information of the scene where the character object obj (r) and/or the vehicle object obj (c) are located through buffer analysis by using a small map service program and a buffer analysis module based on a target position, and fuses the character object obj (r) and/or the vehicle object obj (c) with the geographic information of the scene to generate the informative text information.
It should be emphasized that the above information-generation process runs on the server only because of the limited computing capability of current handheld terminals; if a handheld terminal has sufficiently strong data-processing capability, the information can be generated directly on the terminal and then uploaded to the server together with the monitoring image. The invention is not limited in this regard.
In practical application, if the information is to be generated on the server, the user may be required to designate the sensitive target on the image before the handheld terminal uploads the photograph; this can be done with a simple interaction. When the server runs the target-recognition algorithm, it then extracts only the targets associated with the designated positions on the image and ignores the others, so that the generated information is more focused.
The invention provides a target position information generation method based on handheld terminal images. It integrates the capabilities of the handheld terminal, including photographing, positioning, orientation, and attitude measurement, with the massive existing geographic information and map-analysis functions of geographic information systems, and with functions such as target recognition and text fusion from existing machine learning methods, forming an easy-to-use intelligent information generation system that makes it convenient for the relevant departments to collect information on sensitive targets or points of interest through the mobile terminals of information collectors and even ordinary users.
The target position information generation method based on the handheld terminal image has the following advantages:
the object is divided into the character object and the vehicle object, and different distance recognition algorithms are adopted for the character object and the vehicle object respectively, so that the accuracy of target object distance recognition is effectively improved.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and improvements can be made without departing from the principle of the present invention, and such modifications and improvements should also be considered within the scope of the present invention.

Claims (4)

1. A target position information generating method based on a handheld terminal image is characterized by comprising the following steps:
step 1, when a suspicious target is found, the camera of the handheld terminal captures the target scene to obtain a monitoring image tu(A); at the same time, the handheld terminal records the camera position and attitude at the moment of capture, including: the position coordinates O(x_0, y_0) of the camera center point O, the azimuth β of the camera main optical axis, and the pitch angle k of the camera main optical axis; the azimuth β of the camera main optical axis is the angle between the main optical axis and due north;
step 2, the handheld terminal uploads the monitoring image tu(A) and the camera position and attitude information to a server through a wireless communication module;
step 3, the server performs object recognition on the monitoring image tu(A) and detects whether a person object obj(r) and/or a vehicle object obj(c) is present in tu(A); if not, no suspicious target exists in tu(A) and the process ends; if yes, step 4 is executed;
step 4, the server identifies the person object obj(r) and/or the vehicle object obj(c) in the monitoring image tu(A); for a person object obj(r), steps 5-6 are used to estimate the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O;
for a vehicle object obj(c), steps 7-8 are used to estimate the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O;
Step 5, estimating the orientation α_r of the person object obj(r), as follows:
step 5.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_r, on tu(A), from the imaging point of the person object obj(r) to the image center of tu(A);
step 5.2, the orientation α_r of the person object obj(r) is obtained by the following formula:
α_r = arctan(x_r / f) - β
wherein:
the orientation α_r of the person object obj(r) is the angle between the line connecting obj(r) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(r) relative to due north;
step 6, estimating the actual distance S_r between the person object obj(r) and the camera center point O, as follows: the server reads the pitch angle k of the camera main optical axis; if k is smaller than the person-object pitch-angle threshold k_max, step 6.1 is executed; otherwise, step 6.2 is executed;
step 6.1, the server analyzes the monitoring image tu(A), recognizes the head pixel height m of the person object obj(r) on tu(A), and, according to the head pixel height m and the head-pixel threshold m_min, obtains the distance D_r from the projection point of obj(r) in the camera main-optical-axis direction to the camera center point O by the following formula, then executes step 6.3:
D_r = H_2 f^2 sin(arctan(x_r / f)) / (x_r m), if m > m_min
D_r = H_1 f^2 sin(arctan(x_r / f)) / (x_r M), if m ≤ m_min
wherein:
f is the focal length of the camera;
M is the whole-person pixel height, namely: the pixel value, on tu(A), of the height of the minimum bounding rectangle of the person object obj(r), obtained by analyzing tu(A);
H_1 is the typical actual height of a person, a preset fixed value;
H_2 is the typical actual head height of a person, a preset fixed value;
step 6.2, the distance D_r from the projection point of the person object obj(r) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 6.3 is executed:
D_r = cos k · H_2 f^2 sin(arctan(x_r / f)) / (x_r m), if m > m_min
D_r = cos k · H_1 f^2 sin(arctan(x_r / f)) / (x_r M), if m ≤ m_min
step 6.3, the actual distance S_r between the person object obj(r) and the camera center point O is obtained by the following formula:
S_r = D_r / cos δ_r
wherein: δ_r is the angle between the line connecting the person object obj(r) with the camera center point O and the camera main optical axis; δ_r = α_r + β;
Step 7, estimating the orientation α_c of the vehicle object obj(c), as follows:
step 7.1, the server analyzes the monitoring image tu(A) and obtains the pixel distance x_c, on tu(A), from the imaging point of the vehicle object obj(c) to the image center of tu(A);
step 7.2, the orientation α_c of the vehicle object obj(c) is obtained by the following formula:
α_c = arctan(x_c / f) - β
wherein:
the orientation α_c of the vehicle object obj(c) is the angle between the line connecting obj(c) with the camera center point O and due north; that is, taking the camera center point O as reference, the deviation angle of obj(c) relative to due north;
step 8, estimating the actual distance S_c between the vehicle object obj(c) and the camera center point O, as follows:
step 8.1, the server analyzes the monitoring image tu(A) and identifies the minimum circumscribed rectangle of the vehicle object obj(c) on tu(A); the height of this rectangle is the vehicle pixel height h;
if the vehicle pixel height h > λf, where λ is a scaling factor with a known fixed value, step 8.2 is executed; otherwise, step 8.3 is executed;
step 8.2, this case indicates that the actual distance S_c between the vehicle object obj(c) and the camera center point O is very small, i.e., S_c ≈ 0; that is, the position of obj(c) is approximately the position of the camera center point O; then step 9 is executed;
step 8.3, the distance D_c from the projection point of the vehicle object obj(c) in the camera main-optical-axis direction to the camera center point O is obtained by the following formula, then step 8.4 is executed:
D_c = l_1 f^2 sin(arctan(x_c / f)) / (x_c h), if k ≤ k_min
D_c = cos k · l_2 f^2 sin(arctan(x_c / f)) / (x_c h), if k > k_min
wherein:
l_2 is the typical actual width of a vehicle, a preset fixed value;
k_min is the vehicle-object pitch-angle threshold;
h is the pixel height of the vehicle object obj(c), namely: the pixel value, on tu(A), of the height of the minimum circumscribed rectangle of obj(c), obtained by analyzing tu(A);
l_1 is the typical actual height of a vehicle, a preset fixed value;
step 8.4, the actual distance S_c between the vehicle object obj(c) and the camera center point O is obtained by the following formula:
S_c = D_c / cos δ_c
wherein: δ_c is the angle between the line connecting the vehicle object obj(c) with the camera center point O and the camera main optical axis; δ_c = α_c + β;
then step 9 is executed;
step 9, for a person object obj(r), the orientation α_r of obj(r) and the actual distance S_r between obj(r) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(r);
for a vehicle object obj(c), the orientation α_c of obj(c) and the actual distance S_c between obj(c) and the camera center point O are combined with the position coordinates O(x_0, y_0) of the camera center point O to obtain the position coordinates of obj(c);
step 10, the server generates informative text, which includes the position coordinates of the identified person object obj(r) and/or the position coordinates of the identified vehicle object obj(c).
2. The target position information generation method based on a handheld terminal image according to claim 1, wherein in step 3 the server performs object recognition on the monitoring image tu(A) as follows:
the server performs object recognition on tu(A) using a trained machine learning network.
3. The method according to claim 2, wherein the server performs object recognition on the monitoring image tu(A) using a trained machine learning network as follows:
if the server recognizes that a person object obj(r) exists in the monitoring image tu(A), it further recognizes the person's age and determines from the age the typical actual height H_1 and the typical actual head height H_2 of the person;
if the server recognizes that a vehicle object obj(c) exists in the monitoring image tu(A), it further recognizes the vehicle type and determines from the type the typical actual width l_2 and the typical actual height l_1 of the vehicle.
4. The target position information generation method based on a handheld terminal image according to claim 1, further comprising, after step 10:
step 11, after the position coordinates of the person object obj(r) and/or the vehicle object obj(c) are obtained, the server, using a lightweight map service program and a buffer-analysis module, performs a buffer analysis around the target position to obtain the geographic information of the scene where obj(r) and/or obj(c) is located, and fuses obj(r) and/or obj(c) with the geographic information of the scene to generate the informative text.
CN202110436206.9A 2021-04-22 2021-04-22 Target position information generation method based on handheld terminal image Active CN112990187B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110436206.9A CN112990187B (en) 2021-04-22 2021-04-22 Target position information generation method based on handheld terminal image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110436206.9A CN112990187B (en) 2021-04-22 2021-04-22 Target position information generation method based on handheld terminal image

Publications (2)

Publication Number Publication Date
CN112990187A true CN112990187A (en) 2021-06-18
CN112990187B CN112990187B (en) 2023-10-20

Family

ID=76341646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110436206.9A Active CN112990187B (en) 2021-04-22 2021-04-22 Target position information generation method based on handheld terminal image

Country Status (1)

Country Link
CN (1) CN112990187B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114061607A (en) * 2021-11-12 2022-02-18 浙江数智交院科技股份有限公司 Improved navigation system and navigation method


Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016062076A1 (en) * 2014-10-22 2016-04-28 中兴通讯股份有限公司 Camera-based positioning method, device, and positioning system
CN105588543A (en) * 2014-10-22 2016-05-18 中兴通讯股份有限公司 Camera-based positioning method, device and positioning system
WO2018130016A1 (en) * 2017-01-10 2018-07-19 哈尔滨工业大学深圳研究生院 Parking detection method and device based on monitoring video
CN111354046A (en) * 2020-03-30 2020-06-30 北京芯龙德大数据科技有限公司 Indoor camera positioning method and positioning system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张扬; 李守成; 陈华: "ACC target vehicle detection based on an intelligent forward-looking camera" (基于智能前视摄像头的ACC目标车辆探测), 制造业自动化 (Manufacturing Automation), vol. 39, no. 9


Also Published As

Publication number Publication date
CN112990187B (en) 2023-10-20

Similar Documents

Publication Publication Date Title
CN109887040B (en) Moving target active sensing method and system for video monitoring
US20210073573A1 (en) Ship identity recognition method based on fusion of ais data and video data
WO2017169491A1 (en) Imaging device and focus control method
US10019624B2 (en) Face recognition system and face recognition method
US11210796B2 (en) Imaging method and imaging control apparatus
CN110633629A (en) Power grid inspection method, device, equipment and storage medium based on artificial intelligence
WO2022000300A1 (en) Image processing method, image acquisition apparatus, unmanned aerial vehicle, unmanned aerial vehicle system, and storage medium
CN109905641B (en) Target monitoring method, device, equipment and system
WO2021168838A1 (en) Position information determining method, device, and storage medium
WO2018032860A1 (en) Measurement method and apparatus
CN106960027B (en) The UAV Video big data multidate association analysis method of spatial information auxiliary
CN112990187B (en) Target position information generation method based on handheld terminal image
CN112488022B (en) Method, device and system for monitoring panoramic view
KR20150031530A (en) Method and apparatus for video surveillance by using surveillance apparatus of unmanned aerial vehicle
US20230048952A1 (en) Image registration method and electronic device
JP4752319B2 (en) Image recognition apparatus and image recognition method
CN114638880B (en) Planar ranging method, monocular camera and computer readable storage medium
CN112883809B (en) Target detection method, device, equipment and medium
WO2021138856A1 (en) Camera control method, device, and computer readable storage medium
CN117897737A (en) Unmanned aerial vehicle monitoring method and device, unmanned aerial vehicle and monitoring equipment
CN110705533A (en) AI recognition and grabbing system for inspection report
KR101300166B1 (en) Apparatus and method for detecting iris
CN114600162A (en) Scene lock mode for capturing camera images
CN110726407A (en) Positioning monitoring method and device
CN115909387B (en) Engineering lofting method based on enhanced image processing technology

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant