CN117611983B - Underwater target detection method and system based on hidden communication technology and deep learning - Google Patents

Underwater target detection method and system based on hidden communication technology and deep learning Download PDF

Info

Publication number
CN117611983B
CN117611983B CN202311537443.XA CN202311537443A CN117611983B CN 117611983 B CN117611983 B CN 117611983B CN 202311537443 A CN202311537443 A CN 202311537443A CN 117611983 B CN117611983 B CN 117611983B
Authority
CN
China
Prior art keywords
information
image
target
secret
secret information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311537443.XA
Other languages
Chinese (zh)
Other versions
CN117611983A (en
Inventor
张德华
于长成
王德臣
王宇辰
孟磊
张雷
梁琳琳
张妮娜
秦春斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan University
Original Assignee
Henan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan University filed Critical Henan University
Priority to CN202311537443.XA priority Critical patent/CN117611983B/en
Publication of CN117611983A publication Critical patent/CN117611983A/en
Application granted granted Critical
Publication of CN117611983B publication Critical patent/CN117611983B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/004Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0056Systems characterized by the type of code used
    • H04L1/0059Convolutional codes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition
    • G06V10/12Details of acquisition arrangements; Constructional details thereof
    • G06V10/14Optical characteristics of the device performing the acquisition or on the illumination arrangements
    • G06V10/141Control of illumination
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/05Underwater scenes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B11/00Transmission systems employing sonic, ultrasonic or infrasonic waves
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/004Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0041Arrangements at the transmitter end
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/004Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0045Arrangements at the receiver end
    • H04L1/0054Maximum-likelihood or sequential decoding, e.g. Viterbi, Fano, ZJ algorithms
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/004Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0056Systems characterized by the type of code used
    • H04L1/0071Use of interleaving
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an underwater target detection method and system based on a hidden communication technology and deep learning, and the method comprises the following steps: acquiring an image of a target object in real time; preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target; the information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target. The invention can effectively solve the problems of real-time image processing and underwater hidden communication of the underwater robot, and is more in line with the complex and changeable environment at present.

Description

Underwater target detection method and system based on hidden communication technology and deep learning
Technical Field
The invention belongs to the field of underwater robots, in particular to the field of underwater target detection and hidden communication, and particularly relates to an underwater target detection method and system based on a hidden communication technology and deep learning.
Background
With the development of technology, underwater target detection has become a hot spot technical problem, and due to the specificity of an underwater environment, the acquisition difficulty of an underwater image is much greater than that of a land image, and the acquisition of the underwater image is affected by various factors, such as an underwater complex illumination environment, water quality conditions, similarity of a target and a background, and the like, which are main factors affecting the accuracy of underwater target detection. In recent years, deep learning is being vigorously developed. The object detection technology has greatly progressed and is widely applied to various scenes. However, the network structure of the method in the present stage is complex and the parameter quantity is huge, which is unfavorable for real-time detection.
Underwater communication technology refers to technology for transmitting information in an underwater environment, and includes various forms of acoustic communication, electromagnetic wave communication, optical communication, and the like. In seawater, the absorption and attenuation of electromagnetic waves and light waves propagating in water are serious, and sound waves are currently the main mode of information propagation in water. In an underwater environment, the traditional communication modes such as electromagnetic waves, light waves and the like have many limitations and disadvantages, and cannot meet the actual application demands. The underwater sound hidden communication technology can effectively improve confidentiality and concealment of underwater communication by using means of special modulation and demodulation methods, encryption algorithms and the like, has stronger anti-interference capability, can cope with the influence factors such as ocean background noise, underwater biological activity interference, hostile interference and the like, and ensures accurate transmission of information.
The coding layer hidden communication technology transmits the secret information by embedding the secret information into the coded public information, so as to realize hidden communication. The embedding of secret information affects the error correction performance of channel coding, thus increasing the bit error rate of information transmitted in the original communication system at the receiving end and the risk of the embedded information being detected. In order to reduce the negative influence of secret information embedding on the original communication system, the binary Hamming matrix based embedding method is introduced into the hidden communication of the coding layer, and the embedding method can effectively reduce the influence of secret information on carrier information and improve the concealment and safety of the hidden communication. But the introduction of the binary hamming matrix embedding method increases the bit error rate of the secret information.
Therefore, the invention provides an underwater robot scheme combining a technology for detecting an underwater target in real time and a coding layer hidden communication technology based on binary Hamming matrix embedding improvement for underwater acoustic communication, which can well solve the problem.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides the underwater target detection method and the underwater target detection system based on the hidden communication technology and the deep learning, which can effectively solve the problems of real-time image processing of the underwater robot and underwater hidden communication and are more in line with the complex and changeable environment at present.
In order to achieve the above object, the present invention provides the following solutions:
The underwater target detection method based on the hidden communication technology and the deep learning comprises the following steps:
acquiring an image of a target object in real time;
Preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target;
The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target.
Preferably, the method for acquiring the image of the target object in real time comprises the following steps:
And acquiring an image of the region to be detected by using the high-definition camera module to obtain an image of the target object.
Preferably, the method for preprocessing the image and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, class labels and confidence scores of each target comprises the following steps:
Adjusting the size of the image of the target object and normalizing the pixel value;
based on YOLOv model, introducing a plurality of fine-grained feature layers on the basis of Darknet, extracting and fusing the feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, class labels and confidence scores of each target.
Preferably, the information transmission method based on the hidden communication, the method for realizing the safe transfer of the boundary frame coordinates, the category labels and the confidence scores of each target and the communication information, comprises the following steps:
Performing convolutional encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain modulated signals and transmitting the modulated signals through sonar; wherein, the secret information is information processed by the image processing module;
And carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction, and then using the result as the input of a Viterbi soft decision decoder to decode and recover the secret information.
The invention also discloses an underwater target detection system based on the hidden communication technology and the deep learning, which comprises: the device comprises an image processing module, an image acquisition module and a hidden communication module;
the image acquisition module is used for acquiring an image of a target object in real time;
The image processing module is used for preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target;
The hidden communication module is used for realizing the safe transfer of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target based on the information transmission method of the hidden communication.
Preferably, the image acquisition module includes: the high-definition camera module and the LED light supplementing device;
The high-definition camera module is used for collecting images of the region to be detected and obtaining images of the target object;
The LED light supplementing device is used for assisting the high-definition camera module.
Preferably, the image processing module includes: a preprocessing unit and a feature extraction unit;
The preprocessing unit is used for adjusting the size of the image of the target object and normalizing the pixel value;
The feature extraction unit is used for introducing a plurality of fine-grained feature layers on the basis of a YOLOv model and Darknet, extracting and fusing feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, category labels and confidence scores of each target.
Preferably, the covert communication module includes: a transmitter, a receiver, a sonar, a signal processing unit and a control unit;
The signal processing unit is used for performing convolution encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain a modulated signal; wherein, the secret information is information processed by the image processing module;
the transmitter is used for transmitting the secret carrying information;
The sonar is used for sending out the modulated signals;
The receiver is used for receiving the data signal from the transmitter;
The control unit is used for carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction, and then taking the result as the input of the Viterbi soft decision decoder, and decoding and recovering the secret information.
The invention also discloses an underwater target detection robot based on the hidden communication technology and the deep learning, and the robot applies the underwater target detection method based on the hidden communication technology and the deep learning.
Compared with the prior art, the invention has the beneficial effects that:
The image processing module disclosed by the invention is based on YOLOv algorithm, adopts DarkNet53 as a feature extractor, introduces a multi-scale prediction and feature fusion strategy, can better capture target information of different scales, can effectively process targets of different sizes by carrying out target detection under a plurality of different scales, and can detect on different levels of a network at the same time, and the precision and stability of a detection result are improved by feature fusion.
The communication module of the invention adopts a hidden communication technology based on a coding layer, and introduces a binary Hamming matrix embedding mode facing to the reduction of the modification quantity into a hidden communication system of the coding layer, the embedding mode can reduce the modification quantity of information embedding on carrier information, and simultaneously, an interleaving module is introduced to improve an extraction scheme, so that the negative effect of the embedding method on the reliability of secret information is weakened. The interleaving function is to convert the continuous errors brought by the embedding mode into random errors, thereby being beneficial to decoding of the convolution codes and increasing the reliability of secret information. The matrix quantization extraction is an extraction scheme which is proposed based on the embedding mode and can enable the extracted information to carry out soft decision decoding, so that the error rate of secret information can be further reduced.
Drawings
In order to more clearly illustrate the technical solutions of the present invention, the drawings that are needed in the embodiments are briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings can be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a diagram of an overall framework of an underwater target detection system based on a covert communication technique and deep learning in an embodiment of the invention;
FIG. 2 is a diagram of an overall framework of a network model of an image processing module in an embodiment of the invention;
FIG. 3 is a block diagram of an image acquisition module in an embodiment of the invention;
Fig. 4 is a system block diagram of a coding layer hidden communication scheme in an embodiment of the present invention.
Fig. 5 is a schematic view of an underwater target detection robot housing in an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
In order that the above-recited objects, features and advantages of the present invention will become more readily apparent, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description.
Example 1
The invention provides an underwater target detection method based on a hidden communication technology and deep learning, which comprises the following steps:
acquiring an image of a target object in real time;
Preprocessing an image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target;
The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target.
In the present embodiment, image data including a target object is collected according to the requirements of a target detection task. The data can be obtained from the public data set, or can be collected or purchased by itself. The data set is ensured to contain the diversity and representativeness of each class of targets.
The collected images are labeled using LabelImg or RectLabel labeling tools, labeling the location and class of each target. Common target labeling formats include Bounding Box (rectangular box) and Mask.
The marked data set is divided into a training set, a verification set and a test set. Typically 70% -80% of the data is used as training set, 10% -15% of the data is used as verification set, and the rest of the data is used as test set. Ensuring a relatively uniform distribution of the target categories for each dataset.
And a data enhancement technology is applied to the training set, so that the diversity and the number of the data sets are increased, and the generalization capability of the model is improved. Common data enhancement methods include random cropping, scaling, panning, rotation, brightness adjustment, color conversion, and the like.
The annotation and image data are converted into the format required for model training. For the YOLOv model, the common data format is Darknet format or COCO format. The data set may be converted to a specified format using a corresponding tool.
During training, the image data is pre-processed, e.g., normalized, scaled to a fixed size, etc. Thus, the training process can be accelerated, and the performance and stability of the model can be improved.
The prepared dataset is loaded into the model for training using the data loading tool DataLoader provided by the deep learning framework PyTorch.
The above is the data collection and preprocessing process followed by the training process for the specific object detection model.
A network structure of the YOLOv8 model is built using a deep learning framework PyTorch. The construction process includes a combination of a downsampling network, a feature extraction layer, and a detection head.
The model is weight initialized, and may be initialized randomly or with pre-trained weights.
A penalty function of the YOLOv model is defined. The penalty function of YOLOv typically consists of a location penalty, a classification penalty, and a confidence penalty. The method can be realized by using the existing loss functions, such as cross entropy and the like, and can be correspondingly adjusted according to task requirements.
The prepared training set and verification set data is loaded using DataLoader of data loading tool PyTorch.
An appropriate optimizer is selected, here an SGD optimizer is selected, and then the corresponding super parameters of learning rate, weight decay, momentum, etc. are set. These hyper-parameters can be debugged and optimized through experimentation.
In each training cycle, a batch of training data is input into the model for forward propagation calculations, and then for backward propagation to update the model parameters. Model weights are updated according to the value of the loss function.
The rate of model training may be controlled by way of learning rate decay, typically by gradually decreasing the learning rate during the training process.
Model weights during the training process are saved periodically for subsequent model evaluation and reasoning use.
And deploying the stored model into an operating system of the underwater robot, and detecting and using the underwater target.
The underwater target detection needs to use an image acquired by an underwater robot in real time. Firstly, preprocessing an image acquired by an underwater robot, wherein the preprocessing comprises adjustment of image size, normalization of pixel values and the like. These preprocessing operations help to improve the performance and accuracy of the model.
And performing feature extraction on the preprocessed image by using a Convolutional Neural Network (CNN) model. YOLOv8 employs Darknet53 as a feature extraction network, which consists of multiple convolutional layers, residual blocks, and convergence layers.
YOLOv8 a plurality of fine-grained feature layers (fine-grained features) are introduced on the basis of Darknet for extracting feature information of different scales. By fusing features of different levels, contextual information of objects of different sizes can be better captured.
YOLOv8 use multiple detection heads (detection heads) to predict target boxes of different scales. Each detection head is responsible for predicting a target of a particular scale and outputting a corresponding classification probability, bounding box position, and confidence score.
And decoding the prediction result output by the detection head to obtain the boundary frame coordinates, the category labels and the confidence scores of each target.
Since the same object may be detected by multiple bounding boxes, the prediction box is filtered using NMS algorithm in order to eliminate duplicate detection results. The NMS will select the most likely target box based on the confidence score and overlap of the predicted boxes.
And further processing the target frame processed by the NMS, such as setting a confidence threshold value, and filtering the target frame with low confidence.
Finally, a category label, location information, and confidence score including the detected target frame are output.
In this embodiment, the apparatus for capturing an image of a target object in real time mainly includes two major components: high definition digtal camera module and LED light filling device.
And (3) remotely controlling the underwater robot, acquiring images of the region to be detected, and temporarily storing the acquired images into a storage system of the underwater robot.
In this embodiment, the information transmission method based on covert communication realizes secure transmission of the boundary frame coordinates, class labels, confidence scores and communication information of each target, and mainly includes the following components: transmitter, receiver, sonar, signal processing unit and control unit. The data transmission and reception is mainly divided into two stages: a data transmission phase and a data reception phase.
In the data transmission phase, the transmitter acquires information processed by the image processing module to be transmitted, which is referred to herein as secret information, from the storage system of the underwater robot. Some of the information that can be publicly transmitted is referred to herein as public information as a carrier of such processed information.
The signal processing unit carries out convolution encoding operation on the public information to form carrier information; the secret information is also subjected to convolutional encoding operation, and then is subjected to interleaving processing through an interleaver. Finally, the secret information is embedded into the public information by using a matrix embedding mode based on binary Hamming codes, which is called as secret carrying information.
The secret information is encoded by the source into a digital code, a process also known as "data packing". In this process, data is added with a preamble and a guard interval to enhance reliability and robustness of the signal.
The encoded data is modulated by the signal source onto a carrier frequency having quadrature characteristics, a process also known as "signal modulation". The purpose of modulation is to allow data to be transmitted farther and more stably in the channel.
The modulated signal is sent out through sonar.
In the data receiving phase, a data signal from a transmitting end is received by a receiver. The receiver demodulates the received signal, demodulates the signal from the carrier frequency, and restores the signal to the original data code. The demodulated data code is decoded by the receiver, and the preamble and the synchronization code are removed to restore the original digital code or analog signal. The secret information of the transmitting end is obtained.
Firstly, matrix quantization extraction is carried out on secret information, so that soft decision Viterbi decoding can be carried out on the extracted information, and the error rate of the secret information is reduced.
And the result of matrix quantization extraction is used as the input of a Viterbi soft decision decoder after de-interleaving, and finally the secret information is decoded and recovered.
Example two
As shown in fig. 1, the invention also discloses an underwater target detection system based on the hidden communication technology and deep learning, comprising: the device comprises an image processing module, an image acquisition module and a hidden communication module;
the image acquisition module is used for acquiring an image of a target object in real time;
The image processing module is used for preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target;
The hidden communication module is used for realizing the safe transfer of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target based on the information transmission method of the hidden communication.
In this embodiment, as shown in fig. 2, for an image processing module (including a preprocessing unit and a feature extraction unit), image data including a target object is collected according to the requirements of a target detection task. The data can be obtained from the public data set, or can be collected or purchased by itself. The data set is ensured to contain the diversity and representativeness of each class of targets.
The collected images are labeled using LabelImg or RectLabel labeling tools, labeling the location and class of each target. Common target labeling formats include Bounding Box (rectangular box) and Mask.
The marked data set is divided into a training set, a verification set and a test set. Typically 70% -80% of the data is used as training set, 10% -15% of the data is used as verification set, and the rest of the data is used as test set. Ensuring a relatively uniform distribution of the target categories for each dataset.
And a data enhancement technology is applied to the training set, so that the diversity and the number of the data sets are increased, and the generalization capability of the model is improved. Common data enhancement methods include random cropping, scaling, panning, rotation, brightness adjustment, color conversion, and the like.
The annotation and image data are converted into the format required for model training. For the YOLOv model, the common data format is Darknet format or COCO format. The data set may be converted to a specified format using a corresponding tool.
During training, the image data is pre-processed, e.g., normalized, scaled to a fixed size, etc. Thus, the training process can be accelerated, and the performance and stability of the model can be improved.
The prepared dataset is loaded into the model for training using the data loading tool DataLoader provided by the deep learning framework PyTorch.
The above is the data collection and preprocessing process followed by the training process for the specific object detection model.
A network structure of the YOLOv8 model is built using a deep learning framework PyTorch. The construction process includes a combination of a downsampling network, a feature extraction layer, and a detection head.
The model is weight initialized, and may be initialized randomly or with pre-trained weights.
A penalty function of the YOLOv model is defined. The penalty function of YOLOv typically consists of a location penalty, a classification penalty, and a confidence penalty. The method can be realized by using the existing loss functions, such as cross entropy and the like, and can be correspondingly adjusted according to task requirements.
The prepared training set and verification set data is loaded using DataLoader of data loading tool PyTorch.
An appropriate optimizer is selected, here an SGD optimizer is selected, and then the corresponding super parameters of learning rate, weight decay, momentum, etc. are set. These hyper-parameters can be debugged and optimized through experimentation.
In each training cycle, a batch of training data is input into the model for forward propagation calculations, and then for backward propagation to update the model parameters. Model weights are updated according to the value of the loss function.
The rate of model training may be controlled by way of learning rate decay, typically by gradually decreasing the learning rate during the training process.
Model weights during the training process are saved periodically for subsequent model evaluation and reasoning use.
And deploying the stored model into an operating system of the underwater robot, and detecting and using the underwater target.
The underwater target detection needs to use an image acquired by an underwater robot in real time. Firstly, preprocessing an image acquired by an underwater robot, wherein the preprocessing comprises adjustment of image size, normalization of pixel values and the like. These preprocessing operations help to improve the performance and accuracy of the model.
And performing feature extraction on the preprocessed image by using a Convolutional Neural Network (CNN) model. YOLOv8 employs Darknet53 as a feature extraction network, which consists of multiple convolutional layers, residual blocks, and convergence layers.
YOLOv8 a plurality of fine-grained feature layers (fine-grained features) are introduced on the basis of Darknet for extracting feature information of different scales. By fusing features of different levels, contextual information of objects of different sizes can be better captured.
YOLOv8 use multiple detection heads (detection heads) to predict target boxes of different scales. Each detection head is responsible for predicting a target of a particular scale and outputting a corresponding classification probability, bounding box position, and confidence score.
And decoding the prediction result output by the detection head to obtain the boundary frame coordinates, the category labels and the confidence scores of each target.
Since the same object may be detected by multiple bounding boxes, the prediction box is filtered using NMS algorithm in order to eliminate duplicate detection results. The NMS will select the most likely target box based on the confidence score and overlap of the predicted boxes.
And further processing the target frame processed by the NMS, such as setting a confidence threshold value, and filtering the target frame with low confidence.
Finally, a category label, location information, and confidence score including the detected target frame are output.
In this embodiment, as shown in fig. 3, the apparatus for an image acquisition module mainly includes two major components: high definition digtal camera module and LED light filling device.
And (3) remotely controlling the underwater robot, acquiring images of the region to be detected, and temporarily storing the acquired images into a storage system of the underwater robot.
In the present embodiment, as shown in fig. 4, for the communication module, the following components are mainly included: transmitter, receiver, sonar, signal processing unit and control unit. The data transmission and reception is mainly divided into two stages: a data transmission phase and a data reception phase.
In the data transmission phase, the transmitter acquires information processed by the image processing module to be transmitted, which is referred to herein as secret information, from the storage system of the underwater robot. Some of the information that can be publicly transmitted is referred to herein as public information as a carrier of such processed information.
The signal processing unit carries out convolution encoding operation on the public information to form carrier information; the secret information is also subjected to convolutional encoding operation, and then is subjected to interleaving processing through an interleaver. Finally, the secret information is embedded into the public information by using a matrix embedding mode based on binary Hamming codes, which is called as secret carrying information.
The secret information is encoded by the source into a digital code, a process also known as "data packing". In this process, data is added with a preamble and a guard interval to enhance reliability and robustness of the signal.
The encoded data is modulated by the signal source onto a carrier frequency having quadrature characteristics, a process also known as "signal modulation". The purpose of modulation is to allow data to be transmitted farther and more stably in the channel.
The modulated signal is sent out through sonar.
In the data receiving phase, a data signal from a transmitting end is received by a receiver. The receiver demodulates the received signal, demodulates the signal from the carrier frequency, and restores the signal to the original data code. The demodulated data code is decoded by the receiver, and the preamble and the synchronization code are removed to restore the original digital code or analog signal. The secret information of the transmitting end is obtained.
Firstly, matrix quantization extraction is carried out on secret information, so that soft decision Viterbi decoding can be carried out on the extracted information, and the error rate of the secret information is reduced.
And the result of matrix quantization extraction is used as the input of a Viterbi soft decision decoder after de-interleaving, and finally the secret information is decoded and recovered.
The invention mainly combines the related technologies of the target detection field and the hidden communication field, and fuses the target detection field and the hidden communication field into the underwater robot. In the image processing module, a yolov-based object detection algorithm is adopted; in a communication module, an improved coding layer covert communication scheme is employed.
Compared with the prior art, the technical scheme of the invention has the following advantages:
the method can perform target detection at a higher frame rate and has higher real-time performance.
The method adopts a deeper network structure, introduces a strategy of multi-scale prediction and feature fusion, and has better detection precision.
Is insensitive to the change of the number of targets and can better detect multi-target images.
The interleaving is used in the hidden communication system of the coding layer, so that continuous errors in information extraction can be well dispersed.
The addition of matrix quantization extraction can further reduce the error rate of secret information
Example III
The invention also discloses an underwater target detection robot based on the hidden communication technology and the deep learning, and the robot applies the underwater target detection method based on the hidden communication technology and the deep learning. A schematic view of the underwater target detection robot housing is shown in fig. 5.
The above embodiments are merely illustrative of the preferred embodiments of the present invention, and the scope of the present invention is not limited thereto, but various modifications and improvements made by those skilled in the art to which the present invention pertains are made without departing from the spirit of the present invention, and all modifications and improvements fall within the scope of the present invention as defined in the appended claims.

Claims (8)

1. The underwater target detection method based on the hidden communication technology and the deep learning is characterized by comprising the following steps of:
acquiring an image of a target object in real time;
Preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target;
The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target;
The information transmission method based on the hidden communication, the method for realizing the safe transfer of the boundary frame coordinates, the category labels and the confidence scores of each target and the communication information comprises the following steps:
Performing convolutional encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain modulated signals and transmitting the modulated signals through sonar; wherein, the secret information is information processed by the image processing module;
Carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction as input of a Viterbi soft decision decoder, and decoding to recover the secret information;
The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels and the confidence scores of each target and the communication information, and mainly comprises the following components: a transmitter, a receiver, a sonar, a signal processing unit and a control unit; the data transmission and reception is mainly divided into two stages: a data transmission stage and a data reception stage;
In the data transmission stage, the transmitter acquires information which is to be transmitted and is processed by the image processing module from a storage system of the underwater robot, wherein the information is called secret information; the public transmission information is used as a carrier of the processed information and is called public information;
The signal processing unit carries out convolution encoding operation on the public information to form carrier information; the secret information is subjected to convolutional coding operation, and then is subjected to interleaving treatment through an interleaver; finally, embedding the secret information into the public information by using a matrix embedding mode based on binary Hamming codes, which is called as secret carrying information;
the secret information is encoded into digital codes by the signal source, and is called as data packing; the data is added with a preamble and a guard interval;
The coded data is modulated by a signal source onto a carrier frequency having orthogonal characteristics, referred to as "signal modulation";
the modulated signal is sent out through sonar;
In the data receiving stage, receiving a data signal from a transmitting end through a receiver; the receiver demodulates the received signal, demodulates the signal from the carrier frequency, and restores the signal into the original data code; the demodulated data code is decoded by a receiver, the lead code and the synchronous code are removed, and the original digital code or analog signal is recovered to obtain secret carrying information of a transmitting end;
Firstly, carrying out matrix quantization extraction on secret information, so that soft decision Viterbi decoding is carried out on the extracted information;
And the result of matrix quantization extraction is used as the input of a Viterbi soft decision decoder after de-interleaving, and finally the secret information is decoded and recovered.
2. The underwater target detection method based on the covert communication technology and the deep learning according to claim 1, wherein the method for acquiring the image of the target object in real time comprises the following steps:
And acquiring an image of the region to be detected by using the high-definition camera module to obtain an image of the target object.
3. The underwater target detection method based on the hidden communication technology and the deep learning according to claim 1, wherein the method for preprocessing the image and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target comprises the following steps:
Adjusting the size of the image of the target object and normalizing the pixel value;
based on YOLOv model, introducing a plurality of fine-grained feature layers on the basis of Darknet, extracting and fusing the feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, class labels and confidence scores of each target.
4. Underwater target detection system based on hidden communication technology and deep learning, which is characterized by comprising: the device comprises an image processing module, an image acquisition module and a hidden communication module;
the image acquisition module is used for acquiring an image of a target object in real time;
The image processing module is used for preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target;
The hidden communication module is used for realizing the safe transfer of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target based on the information transmission method of the hidden communication.
5. The underwater target detection system based on the covert communication technique and the deep learning of claim 4, wherein the image acquisition module comprises: the high-definition camera module and the LED light supplementing device;
The high-definition camera module is used for collecting images of the region to be detected and obtaining images of the target object;
The LED light supplementing device is used for assisting the high-definition camera module.
6. The underwater target detection system based on the covert communication technique and the deep learning of claim 4, wherein the image processing module comprises: a preprocessing unit and a feature extraction unit;
The preprocessing unit is used for adjusting the size of the image of the target object and normalizing the pixel value;
The feature extraction unit is used for introducing a plurality of fine-grained feature layers on the basis of a YOLOv model and Darknet, extracting and fusing feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, category labels and confidence scores of each target.
7. The underwater target detection system based on the covert communication technique and deep learning of claim 4, wherein the covert communication module comprises: a transmitter, a receiver, a sonar, a signal processing unit and a control unit;
The signal processing unit is used for performing convolution encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain a modulated signal; wherein, the secret information is information processed by the image processing module;
the transmitter is used for transmitting the secret carrying information;
The sonar is used for sending out the modulated signals;
The receiver is used for receiving the data signal from the transmitter;
The control unit is used for carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction, and then taking the result as the input of the Viterbi soft decision decoder, and decoding and recovering the secret information.
8. An underwater target detection robot based on a covert communication technology and deep learning, which is characterized in that the robot applies the underwater target detection method based on the covert communication technology and the deep learning as set forth in any one of claims 1 to 3.
CN202311537443.XA 2023-11-17 2023-11-17 Underwater target detection method and system based on hidden communication technology and deep learning Active CN117611983B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311537443.XA CN117611983B (en) 2023-11-17 2023-11-17 Underwater target detection method and system based on hidden communication technology and deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311537443.XA CN117611983B (en) 2023-11-17 2023-11-17 Underwater target detection method and system based on hidden communication technology and deep learning

Publications (2)

Publication Number Publication Date
CN117611983A CN117611983A (en) 2024-02-27
CN117611983B true CN117611983B (en) 2024-07-02

Family

ID=89948832

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311537443.XA Active CN117611983B (en) 2023-11-17 2023-11-17 Underwater target detection method and system based on hidden communication technology and deep learning

Country Status (1)

Country Link
CN (1) CN117611983B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106197543A (en) * 2016-07-13 2016-12-07 北方爆破科技有限公司 The wireless communication system of a kind of Underwater Acoustic Environment monitoring and method
CN114863263A (en) * 2022-07-07 2022-08-05 鲁东大学 Snakehead detection method for intra-class shielding based on cross-scale hierarchical feature fusion

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU3320700A (en) * 1999-03-26 2000-10-16 Wulich Wave Ltd. Underwater communication method, device, and system utilizing a doppler frequency shift
CN107465486B (en) * 2017-06-28 2020-07-28 中国船舶重工集团公司第七一五研究所 Cooperative coding communication method suitable for underwater acoustic network
CN109597032B (en) * 2019-01-31 2020-02-14 中国科学院深海科学与工程研究所 Underwater acoustic positioning communication method
CN110518935B (en) * 2019-09-18 2020-10-02 中国海洋大学 MC-CDMA-based underwater acoustic communication system and PAPR (Peak to average Power ratio) suppression method
RU2724145C1 (en) * 2019-10-16 2020-06-22 Федеральное государственное бюджетное учреждение науки Институт проблем морских технологий Дальневосточного отделения Российской академии наук (ИПМТ ДВО РАН) Hydroacoustic monitoring station of underwater situation
JP7178016B2 (en) * 2021-03-01 2022-11-25 アイテック株式会社 Image processing device and its image processing method
CN115471746A (en) * 2022-08-26 2022-12-13 中船航海科技有限责任公司 Ship target identification detection method based on deep learning
CN116707662A (en) * 2023-06-13 2023-09-05 曲阜师范大学 MIMO-FMT-TR underwater acoustic communication system and method
CN116895012A (en) * 2023-07-21 2023-10-17 广东电网有限责任公司 Underwater image abnormal target identification method, system and equipment

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106197543A (en) * 2016-07-13 2016-12-07 北方爆破科技有限公司 The wireless communication system of a kind of Underwater Acoustic Environment monitoring and method
CN114863263A (en) * 2022-07-07 2022-08-05 鲁东大学 Snakehead detection method for intra-class shielding based on cross-scale hierarchical feature fusion

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘伟伟 等.《 基于线性码的隐写编码研究进展》.江苏科技大学学报(自然科学版).2015,第362-370页. *

Also Published As

Publication number Publication date
CN117611983A (en) 2024-02-27

Similar Documents

Publication Publication Date Title
CN110225341B (en) Task-driven code stream structured image coding method
CN111079683B (en) Remote sensing image cloud and snow detection method based on convolutional neural network
CN112565777B (en) Deep learning model-based video data transmission method, system, medium and device
CN118279734B (en) Underwater particulate matter and biological image in-situ acquisition method, medium and system
CN117611983B (en) Underwater target detection method and system based on hidden communication technology and deep learning
CN116320538A (en) Semantic communication transmission method and system for substation inspection image
CN110837806B (en) Indoor positioning method based on deep convolutional memory network
CN113280820A (en) Orchard visual navigation path extraction method and system based on neural network
CN114743099A (en) Method and device for identifying faults of train key components
CN118196628A (en) Enhanced semantic-location feature fusion network method, device and application for extracting backbone based on different pre-training features
CN116503641A (en) Small target detection method based on multi-scale feature fusion
CN114332699B (en) Road condition prediction method, device, equipment and storage medium
CN115713695A (en) 6G satellite image landslide detection method based on multistage Transformer network
CN114998117A (en) Port ship intelligent identification method and system combining image defogging and denoising
CN114418821A (en) Blind watermark processing method based on image frequency domain
CN106548143A (en) It is a kind of based on facial recognition techniques and the face identification system and method for memory function
CN112803951A (en) Method for reducing noise of communication system received signal, receiving end and communication system
CN116109944B (en) Satellite image cloud target extraction method based on deep learning network architecture
CN117291787B (en) Traceability method and system based on data watermark
CN118784093A (en) Underwater unidirectional broadcast communication method and system
Liu et al. ForgeryTTT: Zero-Shot Image Manipulation Localization with Test-Time Training
CN118298834A (en) Task-driven image semantic communication method, system and training method thereof
Wang et al. Advanced scene text recognition and application for dynamic driving scenes
CN118840743A (en) Semantic communication method, device and equipment for binocular three-dimensional target recognition
Lin Visual prompt-based learning for aerial image dehazing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant