CN107169401B

CN107169401B - Rail invader detection method based on rail visual feature spectrum

Info

Publication number: CN107169401B
Application number: CN201710074119.7A
Authority: CN
Inventors: 李晓峰; 管岭; 杨晗; 贾利民
Original assignee: Beijing Jiaotong University
Current assignee: Beijing Jiaotong University
Priority date: 2017-02-10
Filing date: 2017-02-10
Publication date: 2020-05-05
Anticipated expiration: 2037-02-10
Also published as: CN107169401A

Abstract

The invention provides a rail invader detection method based on a rail visual feature spectrum. Cruising and shooting images at a certain height, angle and speed along a track line, establishing a characteristic image library aiming at 6 types of image shooting and affine change by utilizing image data, and establishing a track visual characteristic spectrum based on the track characteristic image library; and performing characteristic matching by using the rail visual characteristic spectrum and the characteristics of the rail image to be detected, and judging whether a rail invader exists in the rail image to be detected according to a matching result. The orbit vision characteristic spectrum theory and the invader detection method based on the orbit vision characteristic spectrum theory can be applied to an invader detection system based on an unmanned aerial vehicle, and the problems of poor reliability and high cost of the traditional detection system are solved. The method has high feature extraction and matching speed, can effectively detect the invader in the track, and can be applied to an invader detection system based on the unmanned aerial vehicle.

Description

Rail invader detection method based on rail visual feature spectrum

Technical Field

The invention relates to the technical field of rail line environment safety detection, in particular to an unmanned aerial vehicle-mounted rail invader detection method based on a rail visual feature spectrum.

Background

In recent years, rail traffic mileage, particularly railway mileage under western complex geomorphic environments, is increasing, the construction of high-speed railways in asia-europe, and severe international safety situation influence, so that a rapid identification method and technology for rail line invaders, which can be used for an unmanned airborne invader identification device, are urgently needed.

In the existing track safety detection system in China, track line conditions are mainly checked in a mode of installing monitoring cameras at fixed points and observing operators on duty in real time. In the mode, cameras need to be installed on the track line in a point-type distribution mode, the installation and maintenance cost is high, and the method cannot be popularized and applied under the influence of severe and complex geomorphic environments such as western mountain areas, unmanned areas and the like; the safety state of the current track line is judged by staring and controlling the monitoring video for a long time by the staff, and potential safety hazards are easily caused by factors such as fatigue negligence and the like.

In the intelligent identification algorithm for the rail invaders based on the fixed camera, researched by researchers at home and abroad, the existence of the invaders is detected by mostly using image pixel characteristics as templates and adopting a background difference or background modeling method. The method is easily influenced by the surrounding environment, the template needs to be updated in real time, and the method cannot be applied to an invader detection system of an unmanned aerial vehicle-mounted mobile camera platform.

Disclosure of Invention

The embodiment of the invention provides a track invader detection method based on a track visual feature spectrum, which is used for effectively detecting the track line environment safety situation through an unmanned aerial vehicle-mounted camera.

In order to achieve the purpose, the invention adopts the following technical scheme.

A rail invader detection method based on rail visual feature spectrum specifically comprises the following steps:

cruising and shooting images at a certain height, angle and speed along a track line, establishing a track characteristic diagram aiming at 6 image shooting and affine changes of rotation, brightness, visual angle, definition, scale and compression ratio through actual acquisition methods such as different weather, different angles and the like or post-processing methods such as image transformation and the like, extracting characteristic description and compressing codes based on the track characteristic diagram, and fusing geographical position information to construct a track visual characteristic spectrum;

and performing feature matching on the to-be-detected track image and a track visual feature spectrum in the same geographical position, and judging whether a track invader exists in the to-be-detected track image according to a matching result.

Further, the method comprises the steps of cruising along the track line at a certain height, angle and speed, shooting images, establishing a track characteristic diagram aiming at 6 types of image shooting and affine change of rotation, brightness, visual angle, definition, scale and compression ratio, and constructing a track visual characteristic spectrum, wherein the specific process comprises the following steps:

(1) the unmanned aerial vehicle carrying the cloud deck camera is utilized to cruise and shoot images at a certain height, angle and speed along a track line, a track characteristic template interval is defined according to the specific visual angle of the camera and the track length in the images, and track image data and positioning information are collected. By using an unmanned aerial vehicle acquisition mode under different weather and different angles and by post-processing modes such as image transformation and the like, the track characteristic diagrams under different geographic positions are established aiming at 6 kinds of shooting and affine changes of rotation, brightness, visual angle, definition, scale and compression ratio. The specific algorithm comprises the following steps:

1) computing pixel gradients

Horizontal direction template ▽ using Sobel operator_xAnd a vertical direction form ▽_yCalculating a horizontal gradient value for each pixel in the orbit image data

And vertical gradient value

As shown in formula (1) and formula (2):

in the formula (2), f (x, y) is image data;

2) calculating gradient direction

Solving the horizontal square gradient from the horizontal gradient and the vertical gradient of each pixel point in the image

And square gradient in vertical direction

As shown in formula (3):

for each sub-block w × w of the image sliding window centered on pixel (i, j), the mean squared gradient value Δ is calculated_x(i, j) and Δ_y(i, j) as shown in formula (4):

calculating the average direction of the sub-block gradient

Is represented by formula (5):

3) rail positioning

And (3) extracting a gradient direction field from the track image according to the formulas (3), (4) and (5) to obtain a track gradient map, and performing thresholding treatment on the track gradient map, wherein the gradient direction screening range is 40-140 degrees. Finally, the method of Canny edge detection, connected region screening, image corrosion and expansion and the like is applied to obtain the position of the track in the image;

4) extraction of interest

Defining and extracting a track interested area in the track image according to the track position and the railway limit in the track image, wherein the track interested area comprises a railway limit area including a track and a roadbed and a buffer area extending outwards from the railway limit area, calculating the distance between each pixel in the track image and the nearest track pixel, and marking the area with the distance less than or equal to the track width as the railway interested area;

5) ALP feature extraction

Extracting ALP characteristics of each orbit interesting region and carrying out compression coding;

6) construction of orbit visual characteristic spectrum

And (3) taking the geographical position of each orbit interesting area as an interval index, and carrying out vector fusion on ALP features extracted from the orbit feature map between the orbits interesting areas to construct and finish the orbit visual feature spectrum.

Further, the extracting the ALP feature of each orbit roi includes:

①, constructing a Gaussian scale space to solve the extreme point of the scale space, wherein a Gaussian kernel is the only linear convolution kernel for realizing scale transformation, and for a scale parameter sigma, the two-dimensional Gaussian kernel is expressed as formula (6):

g(x,y,σ)＝1/(2πσ²)·exp(-(x²+y²)/(2σ²)) (6)

to construct the image pyramid, in each sub-octave, the original image is filtered using 4 increasing scale parameters σ of gaussian kernel and discrete laplacian kernel as shown in equation (7):

wherein g (x, y, σ) is a discrete Gaussian function,

for discrete laplacian, the symbol denotes convolution;

to find the extreme points in the pyramid, a third order polynomial filter result as shown in equation (8) is used;

p(x,y,σ)＝α₃(x,y)σ³+α₂(x,y)σ²+α₁(x,y)σ+α₀(x,y) (8)

wherein (A), (B), (Cx, y) are pixel coordinates, α₀(x,y):α₃(x, y) is calculated by weighted summation of images after Gaussian filtering, and p (x, y, sigma) is a response value of each pixel point;

within each sub-octave, calculating the response p (x, y, sigma) of each pixel of each image to obtain candidate extreme points (x, y) and corresponding scales sigma^*Remapping candidate extreme points to sub-pixel precision, eliminating repeated extreme points, extracting ALP feature points, and allocating a main direction to each feature point according to a histogram of regional gradient amplitude of each ALP feature point;

② local feature selection, according to the scale σ of ALP feature points^*Direction θ, distance d from the center of the image, peak D, Hessian matrix squared trace ratio ρ, second derivative of the scale space function p_σσ6 feature parameters are calculated, correlation calculation is carried out as shown in a formula (9), feature points with correlation larger than a set threshold value are screened out and used as finally extracted ALP feature points;

r(σ^*,θ,d,D,ρ,p_σσ)＝f₁(σ^*)·f₂(θ)·f₃(d)·f₄(D)·f₅(ρ)·f₆(p_σσ) (9)

③ local feature description, in a 4 x 4 region centered on the ALP feature point, 8 directional gradient histograms are calculated for each region, these gradient histograms are concatenated to form a 128-dimensional vector, and each ALP feature point is described as a 128-dimensional vector.

Further, the compression encoding of the ALP feature of the extracted orbit roi includes:

① local feature aggregation, normalizing the local feature description of each 128-dimensional vector, and extracting 32-dimensional principal vectors through principal component analysis, extracting ALP features from 6 case distributions in the orbit feature map, and finally forming 6 x 32-dimensional feature vectors;

② local feature compression, which is divided into local feature description compression and local feature position compression, wherein in the local feature description compression, a low-complexity transform coding scheme is adopted for feature description compression, and in the local feature position compression, a position histogram coding scheme is adopted for representing features into binary bit streams;

③ global feature description, adopts a scalable compression Fisher vector coding scheme to carry out generalized description on image features.

Further, the track visual characteristic spectrum is subjected to characteristic matching with a track image to be detected, and whether a track invader exists in the track image to be detected is judged according to a matching result, and the specific process comprises the following steps:

when the track line is inspected, the track image to be detected is shot in real time according to the same shooting conditions in the process of constructing the track visual characteristic spectrum, ALP characteristics in the region of interest of the track image are extracted in each track section, and the characteristics extracted in real time are matched with the track visual characteristic spectrum at the same geographic position;

after completing feature matching between ALP features of the image shot in real time and the orbit visual feature spectrum at the same position, eliminating mismatching feature points by using a RANSAC consistency detection method. Then, determining whether an invader exists in the real-time graph by adopting a threshold method, and defining the matching rate as an equation (10);

in the formula, TotalNum represents the total number of feature points, and matchNum represents the number of feature points successfully matched. If the ratio is larger than a preset threshold value, determining that an invader exists; if the ratio is less than the threshold, then it is considered normal;

and when the existence of the invader is detected, classifying the gathered positions of most of the feature points in the unmatched residual feature point set by using a K-means clustering algorithm, and taking the gathered positions of most of the feature points as the range of the invader.

Further, the ALP feature matching of the image features extracted in real time and the orbit visual feature spectrum at the same geographic position mainly comprises:

and global description matching, namely describing ALP feature matching degree between the orbit image shot in real time and the orbit visual feature image at the geographic position by adopting similarity measurement based on Hamming distance.

And local description matching, namely performing ALP feature matching on the track image shot in real time and the track visual feature image at the geographic position by using a nearest neighbor matching algorithm.

And a bidirectional matching strategy is adopted during matching, so that the matching accuracy is improved. And bidirectional matching, namely performing characteristic matching on the characteristic description R in the characteristic spectrum and the ALP characteristic descriptor Q extracted from the image to be matched from two directions of R → Q and Q → R respectively.

According to the technical scheme provided by the embodiment of the invention, the orbit vision characteristic spectrum theory and the intruder detection method based on the orbit vision characteristic spectrum theory provided by the embodiment of the invention can be applied to an intruder detection system based on an unmanned aerial vehicle, and the problems of poor reliability and high cost of the traditional detection system are solved. The method has high feature extraction and matching speed, and can autonomously and effectively detect the invader in the track.

Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive labor.

Fig. 1 is a processing flow chart of a rail invader detecting method based on rail visual feature spectrum according to an embodiment of the present invention;

fig. 2 is a schematic diagram illustrating a track feature pattern according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of a track positioning system according to an embodiment of the present invention;

FIG. 4 is a diagram of a region of interest of a track according to an embodiment of the present invention;

FIG. 5 is a diagram illustrating the result of ALP feature extraction according to an embodiment of the present invention;

fig. 6 is an image of rail intrusion according to an embodiment of the present invention;

fig. 7 is a schematic diagram of an intruder range extraction according to an embodiment of the present invention.

Detailed Description

Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.

As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or coupled. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.

It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.

For the convenience of understanding the embodiments of the present invention, the following description will be further explained by taking several specific embodiments as examples in conjunction with the drawings, and the embodiments are not to be construed as limiting the embodiments of the present invention.

Example one

The invention provides a track visual characteristic spectrum theory and a construction method thereof, and provides a characteristic matching intruder detection method based on the theory, aiming at the defects that the existing track intruder detection system can only carry out fixed-point detection, is greatly influenced by environmental change and has low detection speed, and the characteristics of small load of an unmanned aerial vehicle and high real-time performance of the detection system.

The processing flow of the rail invader detection method based on the rail visual feature spectrum provided by the embodiment of the invention is shown in fig. 1, and mainly comprises the following processing procedures:

(1) the method comprises the following steps of constructing an orbit vision characteristic spectrum, and defining the orbit vision characteristic spectrum as follows: and taking the geographic information as an index, respectively extracting image characteristics of image rotation, brightness, visual angle, definition, scale, compression ratio and the like and affine change at the same position of the track, and coding and compressing the image characteristics. The method mainly comprises the steps of carrying out various affine transformation processing on an orbit image, extracting an orbit region of interest, extracting ALP characteristics, compressing and generating a visual characteristic spectrum;

(2) the intrusion object detection part mainly adopts ALP feature matching, RANSAC-based mismatching point elimination and K-means clustering methods. Experimental results show that the method provided by the invention has high feature extraction and matching speed, and can autonomously and effectively extract the range of the rail invader.

The feature matching stage mainly comprises global description matching and local description matching, and the specific implementation process is as follows:

1) global description matching is achieved by quickly calculating the weighted correlation of the 2 image global description features. The method adopts similarity measurement based on Hamming distance to describe the matching degree of two images. The similarity definition is shown in (1).

In the formula

Is a feature binary vector. Ha (,) represents the hamming distance. w is a_HaAre coefficients learned from a database. Setting a threshold τ if s_Q,RIf the value is greater than tau, the features are matched, otherwise the features are not matched.

2) In the local description matching, a nearest neighbor matching algorithm is used for performing feature matching on the image to be matched and the reference image. After matching is completed, RANSAC consistency check is carried out, and wrong matching points are removed.

3) And a bidirectional matching strategy is adopted during matching, so that the matching accuracy is improved. And bidirectional matching, namely performing characteristic matching on the characteristic description R in the characteristic spectrum and the ALP characteristic descriptor Q extracted from the image to be matched from two directions of R → Q and Q → R respectively.

And after matching of the ALP characteristics of the real-time shot image and the ALP characteristics of the orbit visual characteristic image at the same position, determining whether an invader exists in the real-time image by adopting a threshold method. Defining the matching rate ratio as formula (2);

in the formula, TotalNum represents the total number of feature points, and matchNum represents the number of feature points successfully matched. If the ratio is larger than a preset threshold value, determining that an invader exists; if ratio is less than the threshold, then it is considered normal.

When the existence of the invader is detected, the unmatched feature point set comprises the feature points of the suspicious invader and the noise feature points existing in the surrounding environment irrelevant to the invader. The characteristic points on the invader are dense, the noise points are scattered, and the interference characteristic points are removed by utilizing a threshold value method through calculating the Euclidean distance between each characteristic point and the rest points.

And finally, classifying the positions of most feature point aggregates in the residual feature point sets by using a K-means clustering algorithm, namely extracting the approximate range of the suspicious invaders.

Example two

Fig. 2 is a schematic diagram of a track feature sample constructed by acquiring an image of a certain track section through an actual scene or performing various affine transformations, wherein a rotation feature image (a) is obtained by a camera at a fixed position through self-rotation shooting; the characteristic images (b) (d) of the luminance change and the sharpness change are obtained by taking images by changing the aperture and the focal length of the camera; the characteristic images (c) and (e) of the visual angle change and the scale change are respectively obtained by shooting images at different visual angles and different shooting distances in the same place; the compression ratio change characteristic image (f) is obtained by processing the JPEG2000 image compression standard at different compression ratios. Fig. 3 is an image of positioning a track, where (a) is an original track image captured by an unmanned aerial vehicle, (b) is a gradient image obtained by calculating a Sobel gradient direction field and performing threshold processing, and (c) is a track position image obtained by image erosion, Canny edge detection, and image area filling. Fig. 4 is an orbit image after region of interest extraction, and ALP descriptor extraction is performed on the basis of the image. ALP feature descriptor schematic the ALP feature descriptor is shown in fig. 5, where the center of each circle represents the feature point and the radius of the circle represents the feature point dimension, where only a portion of the ALP feature points are shown. Feature vector aggregation and compression are carried out on the ALP features extracted after various shooting and affine transformation, geographic position information is fused, and finally an orbit visual feature spectrum is constructed. Fig. 6 is an orbit image of an intruding object in the presence of the unmanned aerial vehicle. FIG. 7 is a schematic diagram of the range of an intruding object, which is detected after feature extraction, feature matching and K-means clustering. The experimental result shows that the method provided by the invention can effectively detect the rail invader. Under test experiment environments of Intel (R) core (TM) i5-3470 CPU, 8GB memory, Visual Studio 2013 and OpenCV 3.0, the time used by the experiment operation result is about 1.2s, and the requirement of unmanned aerial vehicle cruise detection real-time performance at the speed of 60km/h is met.

In summary, the orbit visual characteristic spectrum theory and the invader detection method based on the orbit visual characteristic spectrum theory provided by the embodiment of the invention take the ALP compression characteristic of the orbit image as the detection template, thereby greatly reducing the storage capacity and meeting the requirements of an embedded detection system; ALP feature matching speed is fast, satisfies unmanned aerial vehicle speed requirement of cruising, consequently can apply to in the invader detecting system based on unmanned aerial vehicle. Meanwhile, the ALP characteristics are stable, the influence of illumination, scale and the like is small, and the detection reliability is high. Only need an unmanned aerial vehicle can detect whole road section, need not fixed point installation surveillance camera along the line, greatly reduced the cost has very important meaning to track detection under adverse circumstances such as western mountain area, unmanned area.

Those of ordinary skill in the art will understand that: the figures are merely schematic representations of one embodiment, and the blocks or flow diagrams in the figures are not necessarily required to practice the present invention.

From the above description of the embodiments, it is clear to those skilled in the art that the present invention can be implemented by software plus necessary general hardware platform. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments or some parts of the embodiments.

The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for apparatus or system embodiments, since they are substantially similar to method embodiments, they are described in relative terms, as long as they are described in partial descriptions of method embodiments. The above-described embodiments of the apparatus and system are merely illustrative, and the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A rail invader detection method based on rail visual feature spectrum is characterized by comprising the following steps:

cruising along a track line at a certain height, angle and speed, shooting images, establishing a track characteristic image library aiming at 6 types of image shooting and affine change of rotation, brightness, visual angle, definition, scale and compression ratio by using image data, and establishing a track visual characteristic spectrum based on the track characteristic image library;

performing feature matching on the rail visual feature spectrum and a rail image to be detected, and judging whether a rail invader exists in the rail image to be detected according to a matching result;

the method comprises the following steps of cruising along a track line at a certain height, angle and speed, shooting images, establishing a track characteristic image library aiming at 6 types of image shooting and affine change of rotation, brightness, visual angle, definition, scale and compression ratio by utilizing image data, and establishing a track visual characteristic spectrum based on the track characteristic image library, wherein the track visual characteristic spectrum comprises the following steps:

the method comprises the following steps of cruising and shooting images at a certain height, angle and speed along a track line by using an unmanned aerial vehicle carrying a cloud platform camera, dividing a track characteristic template interval according to the length of a track in the images, establishing a track characteristic image library aiming at 6 shooting and affine changes of rotation, brightness, a visual angle, definition, a scale and a compression ratio according to image data, and establishing a track visual characteristic spectrum based on the track characteristic image library, wherein the processing process comprises the following 6 processing steps:

1) computing pixel gradients

Horizontal direction template applying Sobel operator

And a vertical direction template

Calculating a horizontal gradient value for each pixel in the orbit image data

And vertical gradient value

As shown in formula (1) and formula (2):

in the formula (2), f (x, y) is image data;

2) calculating the directional field

Solving horizontal square gradient by gradient operator of each pixel point in image data

And square gradient in vertical direction

As shown in formula (3):

for each one withThe image sliding window sub-block w x w, centered on pixel (i, j), calculates the mean squared gradient value Δ_x(i, j) and Δ_y(i, j) as shown in formula (4):

calculating the average direction of the sub-block gradient

Is represented by formula (5):

3) rail positioning

Extracting a direction field from the track image according to the formulas (3), (4) and (5), obtaining a gradient image and a gradient value of the track image according to the direction field, and performing thresholding on the track image by taking the gradient value of the track image as a threshold value;

removing background influence, and obtaining the position of the track in the track image by Canny edge detection;

4) boundary extraction

Extracting a rail bound area in the rail image according to the position of the rail in the rail image and the railway bound definition, wherein the rail bound area comprises the rail, a railway bed and a buffer area extending outwards from the rail bound area, calculating the distance between each pixel in the rail image and the nearest rail pixel, and marking the area with the distance smaller than or equal to the width of the rail as the rail bound area;

5) ALP feature extraction

Extracting an ALP feature of each track limit area, and carrying out compression coding on the extracted ALP feature of the track limit area;

6) construction of orbit visual characteristic spectrum

And taking the geographical position of each track limiting area as an interval index, fusing the geographical position of each track limiting area and the ALP characteristic of each track limiting interval, and integrating the fusion characteristics of all track limiting intervals of the track segment to construct a finished track visual characteristic spectrum.

2. The method of claim 1, wherein said extracting ALP features for each track bounding area comprises:

①, constructing a Gaussian scale space to solve the extreme point of the scale space, wherein a Gaussian kernel is the only linear convolution kernel for realizing scale transformation, and for a scale parameter delta, the two-dimensional Gaussian kernel is expressed as shown in formula (6):

g(x,y,σ)＝1/(2πσ²)·exp(-(x²+y²)/(2σ²)) (6)

to construct the image pyramid, the original image is filtered using 4 increasing scale parameters δ of gaussian kernel and discrete laplacian kernel in each sub-octave, as shown in equation (7):

wherein g (x, y, delta) is a discrete Gaussian function,

for discrete laplacian, the symbol denotes convolution;

p(x,y,δ)＝α₃(x,y)δ³+α₂(x,y)δ²+α₁(x,y)δ+α₀(x,y) (8)

where (x, y) is the pixel coordinate, δ is the image scale, α₀(x,y):α₃(x, y) is calculated by weighted summation of images after Gaussian filtering, and p (x, y, delta) is a response value of each pixel point;

within each sub octave, calculating the response p (x, y, delta) of each pixel of each image to obtain candidate extreme points (x, y) and corresponding scales delta^*Remapping candidate extreme points to sub-imagesPixel precision, eliminating repeated extreme points, extracting ALP feature points, and distributing a main direction to each feature point according to a histogram of regional gradient amplitude of each ALP feature point;

② local feature selection, based on the ALP feature point dimension delta^*Direction θ, distance d from the center of the image, peak D, Hessian matrix squared trace ratio ρ, second derivative of the scale space function p_δδ6 feature parameters are calculated, correlation calculation is carried out as shown in a formula (9), feature points with correlation larger than a set threshold value are screened out and used as finally extracted ALP feature points;

r(δ^*,θ,d,D,ρ,p_δδ)＝f₁(δ^*)·f₂(θ)·f₃(d)·f₄(D)·f₅(ρ)·f₆(p_δδ) (9)

wherein f is_nThe (—) represents a quantized correlation coefficient, and n is set in advance to 1 to 6;

3. The method of claim 2, wherein said compressively encoding the ALP characteristics of the extracted track bounding area comprises:

① local feature aggregation, normalizing the local feature description of each 128-dimensional vector, and extracting a 32-dimensional principal vector through principal component analysis;

4. The method according to any one of claims 1 to 3, wherein the step of performing feature matching on the rail visual feature spectrum and the rail image to be detected and judging whether a rail invader exists in the rail image to be detected according to a matching result comprises the steps of:

when the track line is inspected, the track image to be detected is shot in real time according to the same shooting conditions in the process of constructing the track visual characteristic spectrum, the ALP characteristics of the track image shot in real time in the track limit range are extracted in each track interval, and the ALP characteristics of the track image shot in real time and the track visual characteristic spectrum at the geographic position are matched;

after completing feature matching between ALP features of a real-time shot image and orbit visual feature spectrums at the same position in a template feature library, determining whether an invader exists in the real-time image by adopting a threshold method, and defining a matching rate as an equation (10):

in the formula, TotalNum represents the total number of feature points, and matchNum represents the number of feature points successfully matched;

if the ratio is larger than a preset threshold value, determining that an invader exists; if the ratio is less than the threshold, then it is considered normal;

5. The method of claim 4, wherein ALP feature matching the real-time captured orbit image with the orbit vision feature spectrum at the geographic location comprises:

global description matching, namely describing ALP feature matching degree between the orbit image shot in real time and the orbit visual feature image at the geographic position by adopting similarity measurement based on Hamming distance;