CN117611983B

CN117611983B - Underwater target detection method and system based on hidden communication technology and deep learning

Info

Publication number: CN117611983B
Application number: CN202311537443.XA
Authority: CN
Inventors: 张德华; 于长成; 王德臣; 王宇辰; 孟磊; 张雷; 梁琳琳; 张妮娜; 秦春斌
Original assignee: Henan University
Current assignee: Henan Mirror Instrument Technology Co ltd
Priority date: 2023-11-17
Filing date: 2023-11-17
Publication date: 2024-07-02
Anticipated expiration: 2043-11-17
Also published as: CN117611983A

Abstract

The invention discloses an underwater target detection method and system based on a hidden communication technology and deep learning, and the method comprises the following steps: acquiring an image of a target object in real time; preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target; the information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target. The invention can effectively solve the problems of real-time image processing and underwater hidden communication of the underwater robot, and is more in line with the complex and changeable environment at present.

Description

Underwater target detection method and system based on covert communication technology and deep learning

技术领域Technical Field

本发明属于水下机器人领域，尤其涉及水下目标检测领域和隐蔽通信领域，具体涉及基于隐蔽通信技术及深度学习的水下目标检测方法及系统。The present invention belongs to the field of underwater robots, and in particular to the fields of underwater target detection and covert communication, and specifically to an underwater target detection method and system based on covert communication technology and deep learning.

背景技术Background technique

随着科技发展，水下目标检测已成为热点技术问题，由于水下环境的特殊性，水下图像采集难度比陆地图像要大得多，水下图像的采集受到多种因素的影响，比如水下复杂的光照环境、水质条件、目标与背景的的相似度等等，这些都是影响水下目标检测准确度的主要因素。近年来，深度学习正在蓬勃发展。目标检测技术取得了很大的进展，被广泛运用于各种场景。但是，现阶段的方法网络结构复杂且参数量庞大，不利于实时检测。With the development of science and technology, underwater target detection has become a hot technical issue. Due to the particularity of the underwater environment, underwater image acquisition is much more difficult than land images. The acquisition of underwater images is affected by many factors, such as the complex underwater lighting environment, water quality conditions, the similarity between the target and the background, etc. These are the main factors affecting the accuracy of underwater target detection. In recent years, deep learning is booming. Target detection technology has made great progress and has been widely used in various scenarios. However, the current method has a complex network structure and a large number of parameters, which is not conducive to real-time detection.

水下通信技术是指在水下环境中传输信息的技术，包括声学通信、电磁波通信和光学通信等多种形式。在海水中由于电磁波和光波在水中传播的吸收和衰减十分严重，声波目前成为信息在水中传播的主要方式。在水下环境中，电磁波和光波等传统通信方式存在很多限制和缺点，无法满足实际应用需求。而水声隐蔽通信技术通过使用特殊的调制和解调方法、加密算法等手段，可以有效提高水下通信的保密性和隐蔽性，并且具有更强的抗干扰能力，可以应对海洋背景噪声、水下生物活动干扰、敌对干扰等影响因素，保证信息的准确传输。Underwater communication technology refers to the technology of transmitting information in an underwater environment, including acoustic communication, electromagnetic wave communication, optical communication and other forms. In seawater, due to the serious absorption and attenuation of electromagnetic waves and light waves in water, sound waves have become the main way for information to be transmitted in water. In underwater environments, traditional communication methods such as electromagnetic waves and light waves have many limitations and shortcomings and cannot meet the needs of actual applications. However, underwater acoustic covert communication technology can effectively improve the confidentiality and concealment of underwater communications by using special modulation and demodulation methods, encryption algorithms and other means, and has stronger anti-interference capabilities. It can cope with factors such as ocean background noise, interference from underwater biological activities, and hostile interference to ensure accurate transmission of information.

编码层隐蔽通信技术通过将秘密信息嵌入到编码后的公开信息中来传输秘密信息，实现隐蔽通信。秘密信息的嵌入会影响信道编码的纠错性能，因此增加了原来通信系统中传输的信息在接收端的误码率和嵌入信息被检测到的风险。为了降低秘密信息嵌入对原来通信系统的负面影响，基于二元汉明矩阵嵌入方法被引入到编码层的隐蔽通信中，这种嵌入方法能够有效的减少秘密信息对载体信息的影响，提高隐蔽通信的隐蔽性和安全性。但是，二元汉明矩阵嵌入方法的引入会增加秘密信息的误码率。The coding layer covert communication technology transmits secret information by embedding it into the coded public information to achieve covert communication. The embedding of secret information will affect the error correction performance of channel coding, thereby increasing the bit error rate of the information transmitted in the original communication system at the receiving end and the risk of the embedded information being detected. In order to reduce the negative impact of the embedding of secret information on the original communication system, a binary Hamming matrix embedding method is introduced into the covert communication of the coding layer. This embedding method can effectively reduce the impact of secret information on the carrier information and improve the concealment and security of covert communication. However, the introduction of the binary Hamming matrix embedding method will increase the bit error rate of secret information.

因此，本发明提出一种用于水下目标实时检测的技术和一种用于水声通信的基于二元汉明矩阵嵌入改进的编码层隐蔽通信技术相结合的水下机器人方案，可以很好的解决这个问题。Therefore, the present invention proposes an underwater robot solution combining a technology for real-time underwater target detection and an improved coding layer covert communication technology based on binary Hamming matrix embedding for underwater acoustic communication, which can solve this problem well.

发明内容Summary of the invention

针对现有技术的不足，本发明提出了基于隐蔽通信技术及深度学习的水下目标检测方法及系统，可以有效地解决水下机器人实时图像处理问题和水下隐蔽通信问题，更符合当下复杂多变的环境。In view of the shortcomings of the existing technology, the present invention proposes an underwater target detection method and system based on covert communication technology and deep learning, which can effectively solve the real-time image processing problems of underwater robots and underwater covert communication problems, and is more in line with the current complex and changeable environment.

为实现上述目的，本发明提供了如下方案：To achieve the above object, the present invention provides the following solutions:

基于隐蔽通信技术及深度学习的水下目标检测方法，包括以下步骤：The underwater target detection method based on covert communication technology and deep learning includes the following steps:

实时采集目标物体的图像；Collect images of target objects in real time;

对所述图像进行预处理，使用卷积神经网络模型对预处理后的所述图像进行特征提取，得到每个目标的边界框坐标、类别标签和置信度得分；Preprocessing the image, and extracting features from the preprocessed image using a convolutional neural network model to obtain bounding box coordinates, category labels, and confidence scores for each target;

基于隐蔽通信的信息传输方法，实现每个目标的边界框坐标、类别标签和置信度得分与通信信息的安全传递。The information transmission method based on covert communication realizes the secure transmission of bounding box coordinates, category label and confidence score of each target and communication information.

优选的，实时采集目标物体的图像的方法包括：Preferably, the method for collecting an image of a target object in real time includes:

使用高清摄像头模组对待检测区域进行图像采集，获得目标物体的图像。Use a high-definition camera module to capture images of the area to be inspected and obtain images of the target object.

优选的，对所述图像进行预处理，使用卷积神经网络模型对预处理后的所述图像进行特征提取，得到每个目标的边界框坐标、类别标签和置信度得分的方法包括：Preferably, the method of preprocessing the image and extracting features from the preprocessed image using a convolutional neural network model to obtain the bounding box coordinates, category label and confidence score of each target includes:

对目标物体的图像进行尺寸的调整、像素值的归一化；Adjust the size of the image of the target object and normalize the pixel values;

基于YOLOv8模型，在Darknet53的基础上引入了多个细粒度特征层，提取预处理后的所述图像不同尺度的特征信息并进行融合，使用多个检测头部，预测不同尺度的目标框，得到每个目标的边界框坐标、类别标签和置信度得分。Based on the YOLOv8 model, multiple fine-grained feature layers are introduced on the basis of Darknet53. The feature information of different scales of the preprocessed image is extracted and fused. Multiple detection heads are used to predict target boxes of different scales to obtain the bounding box coordinates, category label and confidence score of each target.

优选的，基于隐蔽通信的信息传输方法，实现每个目标的边界框坐标、类别标签和置信度得分与通信信息的安全传递的方法包括：Preferably, the method for realizing the secure transmission of the bounding box coordinates, category label and confidence score of each target and the communication information based on the covert communication information transmission method includes:

将公开信息进行卷积编码操作，形成载体信息；将秘密信息进行卷积编码操作，并经过交织器进行交织处理秘密信息；使用基于二元汉明码的矩阵嵌入方式将秘密信息嵌入到公开信息中，获得载秘信息；将载秘信息进行信号调制，获得调制后的信号并通过声纳发送出去；其中，所述秘密信息为经过图像处理模块处理的信息；Perform convolution coding on public information to form carrier information; perform convolution coding on secret information and interleave the secret information through an interleaver; embed the secret information into public information using a matrix embedding method based on binary Hamming code to obtain carrier information; perform signal modulation on the carrier information to obtain a modulated signal and send it out through sonar; wherein the secret information is information processed by an image processing module;

对载秘信息进行矩阵量化提取，将矩阵量化提取的结果进行解交织后作为维特比软判决译码器的输入，译码恢复秘密信息。The secret information is extracted by matrix quantization, and the result of the matrix quantization extraction is deinterleaved as the input of the Viterbi soft decision decoder to decode and recover the secret information.

本发明还公开了基于隐蔽通信技术及深度学习的水下目标检测系统，包括：图像处理模块、图像采集模块和隐蔽通信模块；The present invention also discloses an underwater target detection system based on covert communication technology and deep learning, comprising: an image processing module, an image acquisition module and a covert communication module;

所述图像采集模块用于实时采集目标物体的图像；The image acquisition module is used to acquire images of target objects in real time;

所述图像处理模块用于对所述图像进行预处理，使用卷积神经网络模型对预处理后的所述图像进行特征提取，得到每个目标的边界框坐标、类别标签和置信度得分；The image processing module is used to preprocess the image, and use a convolutional neural network model to extract features from the preprocessed image to obtain the bounding box coordinates, category label and confidence score of each target;

所述隐蔽通信模块用于基于隐蔽通信的信息传输方法，实现每个目标的边界框坐标、类别标签和置信度得分与通信信息的安全传递。The covert communication module is used to achieve the secure transmission of the bounding box coordinates, category label and confidence score of each target and the communication information based on the covert communication information transmission method.

优选的，所述图像采集模块包括：高清摄像头模组和LED补光装置；Preferably, the image acquisition module includes: a high-definition camera module and an LED fill light device;

所述高清摄像头模组用于对待检测区域进行图像采集，获得目标物体的图像；The high-definition camera module is used to collect images of the area to be detected and obtain images of the target object;

所述LED补光装置用于辅助所述高清摄像头模组。The LED fill light device is used to assist the high-definition camera module.

优选的，所述图像处理模块包括：预处理单元和特征提取单元；Preferably, the image processing module includes: a preprocessing unit and a feature extraction unit;

所述预处理单元用于对目标物体的图像进行尺寸的调整、像素值的归一化；The preprocessing unit is used to adjust the size of the image of the target object and normalize the pixel values;

所述特征提取单元用于基于YOLOv8模型，在Darknet53的基础上引入了多个细粒度特征层，提取预处理后的所述图像不同尺度的特征信息并进行融合，使用多个检测头部，预测不同尺度的目标框，得到每个目标的边界框坐标、类别标签和置信度得分。The feature extraction unit is used to introduce multiple fine-grained feature layers based on the YOLOv8 model on the basis of Darknet53, extract feature information of different scales of the preprocessed image and fuse them, use multiple detection heads to predict target frames of different scales, and obtain the bounding box coordinates, category label and confidence score of each target.

优选的，所述隐蔽通信模块包括：发射器、接收器、声纳、信号处理单元和控制单元；Preferably, the covert communication module comprises: a transmitter, a receiver, a sonar, a signal processing unit and a control unit;

所述信号处理单元用于将公开信息进行卷积编码操作，形成载体信息；将秘密信息进行卷积编码操作，并经过交织器进行交织处理秘密信息；使用基于二元汉明码的矩阵嵌入方式将秘密信息嵌入到公开信息中，获得载秘信息；将载秘信息进行信号调制，获得调制后的信号；其中，所述秘密信息为经过图像处理模块处理的信息；The signal processing unit is used to perform convolution coding operation on public information to form carrier information; perform convolution coding operation on secret information and interleave the secret information through an interleaver; embed the secret information into the public information using a matrix embedding method based on binary Hamming code to obtain carrier information; perform signal modulation on the carrier information to obtain a modulated signal; wherein the secret information is information processed by the image processing module;

所述发射器用于发送所述载秘信息；The transmitter is used to send the secret information;

所述声纳用于将调制后的信号发送出去；The sonar is used to send out the modulated signal;

所述接收器用于接收来自所述发射器的数据信号；The receiver is used to receive the data signal from the transmitter;

所述控制单元用于对载秘信息进行矩阵量化提取，将矩阵量化提取的结果进行解交织后作为维特比软判决译码器的输入，译码恢复秘密信息。The control unit is used to perform matrix quantization extraction on the secret information, de-interleave the result of the matrix quantization extraction as the input of the Viterbi soft decision decoder, and decode and restore the secret information.

本发明还公开了基于隐蔽通信技术及深度学习的水下目标检测机器人，所述机器人应用所述的基于隐蔽通信技术及深度学习的水下目标检测方法。The present invention also discloses an underwater target detection robot based on covert communication technology and deep learning, and the robot applies the underwater target detection method based on covert communication technology and deep learning.

与现有技术相比，本发明的有益效果为：Compared with the prior art, the present invention has the following beneficial effects:

本发明的图像处理模块基于YOLOv8算法，采用DarkNet53作为特征提取器，同时引入了多尺度预测和特征融合的策略，能够更好地捕捉不同尺度的目标信息，通过在多个不同尺度下进行目标检测，可以有效地处理不同大小的目标，同时在网络的不同层级上进行检测，并通过特征融合来提高检测结果的精度和稳定性。The image processing module of the present invention is based on the YOLOv8 algorithm, adopts DarkNet53 as the feature extractor, and introduces the strategies of multi-scale prediction and feature fusion, which can better capture the target information of different scales. By performing target detection at multiple different scales, it can effectively process targets of different sizes, perform detection at different levels of the network at the same time, and improve the accuracy and stability of the detection results through feature fusion.

本发明的通信模块采用基于编码层的隐蔽通信技术，将面向修改量降低的二元汉明矩阵嵌入方式引入到编码层的隐蔽通信系统中，此嵌入方式能够降低信息嵌入对于载体信息的修改量，同时通过引入交织模块，对提取方案进行改进，减弱此嵌入方法对秘密信息可靠性带来的消极作用。交织的作用是将此嵌入方式带来的连续错误转化为随机错误，从而有利于卷积码的译码，增加秘密信息的可靠性。而矩阵量化提取，则是基于此嵌入方式提出的一种能够使提取到的信息进行软判决译码的提取方案，可以进一步降低秘密信息的误码率。The communication module of the present invention adopts the covert communication technology based on the coding layer, and introduces the binary Hamming matrix embedding method for reducing the modification amount into the covert communication system of the coding layer. This embedding method can reduce the modification amount of information embedding to the carrier information. At the same time, by introducing the interleaving module, the extraction scheme is improved to reduce the negative effect of this embedding method on the reliability of secret information. The role of interleaving is to convert the continuous errors caused by this embedding method into random errors, which is beneficial to the decoding of convolutional codes and increases the reliability of secret information. The matrix quantization extraction is an extraction scheme based on this embedding method that enables the extracted information to be soft-decision decoded, which can further reduce the bit error rate of secret information.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本发明的技术方案，下面对实施例中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solution of the present invention, the following briefly introduces the drawings required for use in the embodiments. Obviously, the drawings described below are only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying creative labor.

图1为本发明实施例中的基于隐蔽通信技术及深度学习的水下目标检测系统整体框架图；FIG1 is an overall framework diagram of an underwater target detection system based on covert communication technology and deep learning in an embodiment of the present invention;

图2为本发明实施例中的图像处理模块网络模型整体框架图；FIG2 is an overall framework diagram of an image processing module network model in an embodiment of the present invention;

图3为本发明实施例中的图像采集模块结构图；FIG3 is a structural diagram of an image acquisition module in an embodiment of the present invention;

图4为本发明实施例中的编码层隐蔽通信方案系统框图。FIG4 is a system block diagram of a coding layer covert communication solution in an embodiment of the present invention.

图5为本发明实施例中的水下目标检测机器人机械外壳示意图。FIG. 5 is a schematic diagram of a mechanical housing of an underwater target detection robot in an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The following will be combined with the drawings in the embodiments of the present invention to clearly and completely describe the technical solutions in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of the present invention.

为使本发明的上述目的、特征和优点能够更加明显易懂，下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above-mentioned objects, features and advantages of the present invention more obvious and easy to understand, the present invention is further described in detail below with reference to the accompanying drawings and specific embodiments.

实施例一Embodiment 1

本发明提供了基于隐蔽通信技术及深度学习的水下目标检测方法，包括以下步骤：The present invention provides an underwater target detection method based on covert communication technology and deep learning, comprising the following steps:

对图像进行预处理，使用卷积神经网络模型对预处理后的图像进行特征提取，得到每个目标的边界框坐标、类别标签和置信度得分；Preprocess the image and use the convolutional neural network model to extract features from the preprocessed image to obtain the bounding box coordinates, category label, and confidence score of each object;

在本实施例中，根据目标检测任务的需求，收集包含目标物体的图像数据。可以从公开数据集中获取数据，也可以自行收集或购买数据。确保数据集中包含各类别目标的多样性和代表性。In this embodiment, according to the requirements of the target detection task, image data containing the target object is collected. The data can be obtained from a public data set, or collected or purchased by yourself. Ensure that the data set contains diversity and representativeness of each category of targets.

使用LabelImg或RectLabel标注工具对收集到的图像进行标注，标注每个目标的位置和类别。常用的目标标注格式包括Bounding Box(矩形框)和Mask(掩膜)。Use LabelImg or RectLabel to annotate the collected images and mark the location and category of each target. Common target annotation formats include Bounding Box and Mask.

将标注好的数据集划分为训练集、验证集和测试集。通常使用70％～80％的数据作为训练集，10％～15％的数据作为验证集，剩余的数据作为测试集。确保各个数据集的目标类别分布相对均匀。Divide the labeled data set into training set, validation set and test set. Usually use 70% to 80% of the data as training set, 10% to 15% of the data as validation set, and the remaining data as test set. Ensure that the target categories of each data set are relatively evenly distributed.

在训练集上应用数据增强技术，增加数据集的多样性和数量，提升模型的泛化能力。常用的数据增强方法包括随机裁剪、缩放、平移、旋转、亮度调整、颜色变换等。Apply data augmentation techniques to the training set to increase the diversity and quantity of the data set and improve the generalization ability of the model. Common data augmentation methods include random cropping, scaling, translation, rotation, brightness adjustment, color transformation, etc.

将标注和图像数据转换为模型训练所需的格式。对于YOLOv8模型，常用的数据格式是Darknet格式或COCO格式。可以使用相应的工具将数据集转换为指定的格式。Convert annotation and image data into the format required for model training. For the YOLOv8 model, the commonly used data format is Darknet format or COCO format. You can use the corresponding tools to convert the dataset to the specified format.

在训练过程中，对图像数据进行预处理，如归一化、缩放到固定尺寸等。这样可以加速训练过程，并提升模型的性能和稳定性。During the training process, the image data is preprocessed, such as normalization, scaling to a fixed size, etc. This can speed up the training process and improve the performance and stability of the model.

使用深度学习框架PyTorch提供的数据加载工具DataLoader，将准备好的数据集加载到模型中进行训练。Use the data loading tool DataLoader provided by the deep learning framework PyTorch to load the prepared dataset into the model for training.

以上是数据的收集和预处理过程，接下来是特定目标检测模型的训练过程。The above is the data collection and preprocessing process, followed by the training process of a specific target detection model.

使用深度学习框架PyTorch搭建YOLOv8模型的网络结构。构建过程包括下采样网络、特征提取层和检测头的组合。The network structure of the YOLOv8 model is built using the deep learning framework PyTorch. The construction process includes the combination of downsampling network, feature extraction layer and detection head.

对模型进行权重初始化，可以随机初始化或使用预训练的权重进行初始化。Initialize the model weights randomly or with pre-trained weights.

定义YOLOv8模型的损失函数。YOLOv8的损失函数通常由位置损失、分类损失和置信度损失组成。可以使用已有的损失函数实现，如交叉熵等，并根据任务需求进行相应的调整。Define the loss function of the YOLOv8 model. The loss function of YOLOv8 usually consists of position loss, classification loss, and confidence loss. You can use existing loss functions, such as cross entropy, and adjust them accordingly according to task requirements.

使用数据加载工具PyTorch的DataLoader加载准备好的训练集和验证集数据。Use the data loading tool PyTorch's DataLoader to load the prepared training set and validation set data.

选择适当的优化器，这里选择SGD优化器，然后设置相应的学习率、权重衰减、动量等超参数。这些超参数可以通过实验进行调试和优化。Select an appropriate optimizer, here we choose the SGD optimizer, and then set the corresponding hyperparameters such as learning rate, weight decay, momentum, etc. These hyperparameters can be debugged and optimized through experiments.

在每个训练循环中，将一批训练数据输入到模型中进行前向传播计算，然后进行反向传播更新模型参数。根据损失函数的值来更新模型权重。In each training cycle, a batch of training data is input into the model for forward propagation calculation, and then backpropagation is performed to update the model parameters. The model weights are updated according to the value of the loss function.

可以通过学习率衰减的方式来控制模型训练的速率，通常在训练过程中逐渐减小学习率。The rate of model training can be controlled by learning rate decay, which usually gradually reduces the learning rate during training.

定期保存训练过程中的模型权重，以便后续的模型评估和推理使用。Regularly save the model weights during the training process for subsequent model evaluation and inference.

将保存好的模型部署到水下机器人的操作系统中，供水下目标检测使用。Deploy the saved model to the operating system of the underwater robot for use in underwater target detection.

水下目标检测需要用到水下机器人实时采集的图像。首先，将水下机器人采集的图像进行预处理，包括图像尺寸的调整、像素值的归一化等。这些预处理操作有助于提高模型的性能和准确度。Underwater target detection requires images collected by underwater robots in real time. First, the images collected by underwater robots are preprocessed, including image size adjustment, pixel value normalization, etc. These preprocessing operations help improve the performance and accuracy of the model.

使用卷积神经网络(CNN)模型对预处理后的图像进行特征提取。YOLOv8采用了Darknet53作为特征提取网络，它由多个卷积层、残差块和汇集层组成。The convolutional neural network (CNN) model is used to extract features from the preprocessed images. YOLOv8 uses Darknet53 as the feature extraction network, which consists of multiple convolutional layers, residual blocks, and pooling layers.

YOLOv8在Darknet53的基础上引入了多个细粒度特征层(fine-grainedfeatures)，用于提取不同尺度的特征信息。通过将不同层级的特征进行融合，可以更好地捕捉不同大小目标的上下文信息。YOLOv8 introduces multiple fine-grained features based on Darknet53 to extract feature information of different scales. By fusing features at different levels, the contextual information of objects of different sizes can be better captured.

YOLOv8使用多个检测头部(detection heads)来预测不同尺度的目标框。每个检测头部负责预测特定尺度的目标，并输出相应的分类概率、边界框位置以及置信度得分。YOLOv8 uses multiple detection heads to predict object boxes of different scales. Each detection head is responsible for predicting objects of a specific scale and outputs the corresponding classification probability, bounding box location, and confidence score.

将检测头部输出的预测结果进行解码，得到每个目标的边界框坐标、类别标签和置信度得分。The predictions output by the detection head are decoded to obtain the bounding box coordinates, category label, and confidence score of each object.

由于同一目标可能被多个边界框检测到，为了消除重复的检测结果，使用NMS算法对预测框进行筛选。NMS会根据预测框的置信度得分和重叠度，选择最有可能的目标框。Since the same object may be detected by multiple bounding boxes, in order to eliminate duplicate detection results, the NMS algorithm is used to filter the predicted boxes. NMS selects the most likely target box based on the confidence score and overlap of the predicted boxes.

对经过NMS处理后的目标框进行进一步的处理，例如设置置信度阈值，滤除低置信度的目标框。The target frame after NMS processing is further processed, such as setting a confidence threshold to filter out low-confidence target frames.

最终，输出包含了检测到的目标框的类别标签、位置信息和置信度得分。Finally, the output includes the category label, location information, and confidence score of the detected target box.

在本实施例中，实时采集目标物体的图像的装置主要包括两大部件：高清摄像头模组和LED补光装置。In this embodiment, the device for collecting images of target objects in real time mainly includes two major components: a high-definition camera module and an LED fill light device.

远程遥控水下机器人，对待检测区域进行图像采集，将采集后的图像暂存至水下机器人的存储系统中。The underwater robot is remotely controlled to collect images of the inspection area and temporarily store the collected images in the storage system of the underwater robot.

在本实施例中，基于隐蔽通信的信息传输方法，实现每个目标的边界框坐标、类别标签和置信度得分与通信信息的安全传递主要包括以下部件：发射器、接收器、声纳、信号处理单元和控制单元。根据数据的收发主要分为两个阶段：数据发送阶段和数据接收阶段。In this embodiment, the information transmission method based on covert communication realizes the secure transmission of the bounding box coordinates, category label and confidence score of each target and the communication information, which mainly includes the following components: a transmitter, a receiver, a sonar, a signal processing unit and a control unit. According to the data transmission and reception, it is mainly divided into two stages: data transmission stage and data reception stage.

在数据发送阶段，发射器从水下机器人的存储系统中获取要发送的经过图像处理模块处理过的信息，这些信息在这里称为秘密信息。将一些可以公开发送信息作为这些处理过的信息的载体，这里称为公开信息。In the data transmission stage, the transmitter obtains the information to be transmitted from the storage system of the underwater robot after being processed by the image processing module, which is referred to as secret information. Some information that can be publicly transmitted is used as the carrier of these processed information, which is referred to as public information.

信号处理单元将公开信息进行卷积编码操作，形成载体信息；将秘密信息也进行卷积编码操作，然后经过交织器进行交织处理秘密信息。最后使用基于二元汉明码的矩阵嵌入方式将秘密信息嵌入到公开信息中，这里称为载秘信息。The signal processing unit performs convolution coding on the public information to form carrier information; the secret information is also convolutionally coded, and then the secret information is interleaved through the interleaver. Finally, the secret information is embedded into the public information using a matrix embedding method based on binary Hamming code, which is called carrier information.

载秘信息被信号源编码成数字码，这个过程也被称为“数据打包”。在这个过程中，数据被添加前导码和保护间隔，以增强信号的可靠性和鲁棒性。The secret information is encoded into a digital code by the signal source, a process also known as "data packing". In this process, a preamble and a guard interval are added to the data to enhance the reliability and robustness of the signal.

编码后的数据被信号源调制到具有正交特性的载波频率上，这个过程也被称为“信号调制”。调制的目的是为了让数据能够在信道中传输得更远、更稳定。The coded data is modulated by the signal source onto a carrier frequency with orthogonal characteristics, a process also known as "signal modulation". The purpose of modulation is to allow data to be transmitted farther and more stably in the channel.

调制后的信号通过声纳发送出去。The modulated signal is sent out via sonar.

在数据接收阶段，通过接收器接收来自发射端的数据信号。接收器对接收到的信号进行解调，将信号从载波频率上解调下来，还原成原始的数据码。解调后的数据码被接收器进行解码处理，将前导码和同步码去掉，恢复为原始的数字码或者模拟信号。这里得到的就是发送端的载秘信息。In the data receiving stage, the data signal from the transmitter is received by the receiver. The receiver demodulates the received signal, demodulates the signal from the carrier frequency, and restores it to the original data code. The demodulated data code is decoded by the receiver, and the preamble and synchronization code are removed to restore it to the original digital code or analog signal. What is obtained here is the secret information of the transmitter.

对载秘信息首先进行矩阵量化提取，使得提取到的信息能够进行软判决维特比译码，降低秘密信息的误码率。The secret information is first extracted by matrix quantization, so that the extracted information can be soft-decision Viterbi decoded to reduce the bit error rate of the secret information.

矩阵量化提取的结果进行解交织后作为维特比软判决译码器的输入，最后译码恢复秘密信息。The result of matrix quantization extraction is deinterleaved and used as the input of Viterbi soft decision decoder, and finally decoded to recover the secret information.

实施例二Embodiment 2

如图1所示，本发明还公开了基于隐蔽通信技术及深度学习的水下目标检测系统，包括：图像处理模块、图像采集模块和隐蔽通信模块；As shown in FIG1 , the present invention also discloses an underwater target detection system based on covert communication technology and deep learning, including: an image processing module, an image acquisition module and a covert communication module;

在本实施例中，如图2所示，对于图像处理模块(包括：预处理单元和特征提取单元)，根据目标检测任务的需求，收集包含目标物体的图像数据。可以从公开数据集中获取数据，也可以自行收集或购买数据。确保数据集中包含各类别目标的多样性和代表性。In this embodiment, as shown in FIG2 , for the image processing module (including: a preprocessing unit and a feature extraction unit), image data containing the target object is collected according to the requirements of the target detection task. The data can be obtained from a public data set, or it can be collected or purchased by itself. Ensure that the data set contains the diversity and representativeness of each category of targets.

在本实施例中，如图3所示，对于图像采集模块的装置主要包括两大部件：高清摄像头模组和LED补光装置。In this embodiment, as shown in FIG. 3 , the device for the image acquisition module mainly includes two major components: a high-definition camera module and an LED fill light device.

在本实施例中，如图4所示，对于通信模块，主要包括以下部件：发射器、接收器、声纳、信号处理单元和控制单元。根据数据的收发主要分为两个阶段：数据发送阶段和数据接收阶段。In this embodiment, as shown in Figure 4, the communication module mainly includes the following components: a transmitter, a receiver, a sonar, a signal processing unit and a control unit. The data transmission and reception are mainly divided into two stages: a data transmission stage and a data reception stage.

本发明中主要结合了目标检测领域和隐蔽通信领域的相关技术，并将两者融合到水下机器人中。在图像处理模块，采用基于一种基于yolov8的目标检测算法；在通信模块，采用一种改进的编码层隐蔽通信方案。The present invention mainly combines the relevant technologies in the field of target detection and covert communication, and integrates the two into the underwater robot. In the image processing module, a target detection algorithm based on yolov8 is adopted; in the communication module, an improved coding layer covert communication scheme is adopted.

本发明的技术方案和现有技术相比，具有以下优点：Compared with the prior art, the technical solution of the present invention has the following advantages:

能够在较高的帧率下进行目标检测，具有较高的实时性。It can detect targets at a higher frame rate and has high real-time performance.

采用更深的网络结构，同时引入了多尺度预测和特征融合的策略，具有更好的检测精度。It adopts a deeper network structure and introduces multi-scale prediction and feature fusion strategies, which has better detection accuracy.

对目标数量变化不敏感，能够更好检测多目标图像。It is insensitive to changes in the number of targets and can better detect multi-target images.

在编码层的隐蔽通信系统中使用交织，可以很好的分散在信息提取时的连续的错误。Using interleaving in the covert communication system at the coding layer can effectively disperse the continuous errors in information extraction.

矩阵量化提取的加入，可以进一步降低秘密信息的误码率The addition of matrix quantization extraction can further reduce the bit error rate of secret information

实施例三Embodiment 3

本发明还公开了基于隐蔽通信技术及深度学习的水下目标检测机器人，所述机器人应用所述的基于隐蔽通信技术及深度学习的水下目标检测方法。水下目标检测机器人机械外壳示意图如图5所示。The present invention also discloses an underwater target detection robot based on covert communication technology and deep learning, and the robot applies the underwater target detection method based on covert communication technology and deep learning. A schematic diagram of the mechanical shell of the underwater target detection robot is shown in FIG5 .

以上所述的实施例仅是对本发明优选方式进行的描述，并非对本发明的范围进行限定，在不脱离本发明设计精神的前提下，本领域普通技术人员对本发明的技术方案做出的各种变形和改进，均应落入本发明权利要求书确定的保护范围内。The embodiments described above are only descriptions of the preferred embodiments of the present invention and are not intended to limit the scope of the present invention. Without departing from the design spirit of the present invention, various modifications and improvements made to the technical solutions of the present invention by ordinary technicians in this field should fall within the protection scope determined by the claims of the present invention.

Claims

1. The underwater target detection method based on the hidden communication technology and the deep learning is characterized by comprising the following steps of:

acquiring an image of a target object in real time;

Preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain boundary frame coordinates, category labels and confidence scores of each target;

The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target;

The information transmission method based on the hidden communication, the method for realizing the safe transfer of the boundary frame coordinates, the category labels and the confidence scores of each target and the communication information comprises the following steps:

Performing convolutional encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain modulated signals and transmitting the modulated signals through sonar; wherein, the secret information is information processed by the image processing module;

Carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction as input of a Viterbi soft decision decoder, and decoding to recover the secret information;

The information transmission method based on the hidden communication realizes the safe transmission of the boundary frame coordinates, the category labels and the confidence scores of each target and the communication information, and mainly comprises the following components: a transmitter, a receiver, a sonar, a signal processing unit and a control unit; the data transmission and reception is mainly divided into two stages: a data transmission stage and a data reception stage;

In the data transmission stage, the transmitter acquires information which is to be transmitted and is processed by the image processing module from a storage system of the underwater robot, wherein the information is called secret information; the public transmission information is used as a carrier of the processed information and is called public information;

The signal processing unit carries out convolution encoding operation on the public information to form carrier information; the secret information is subjected to convolutional coding operation, and then is subjected to interleaving treatment through an interleaver; finally, embedding the secret information into the public information by using a matrix embedding mode based on binary Hamming codes, which is called as secret carrying information;

the secret information is encoded into digital codes by the signal source, and is called as data packing; the data is added with a preamble and a guard interval;

The coded data is modulated by a signal source onto a carrier frequency having orthogonal characteristics, referred to as "signal modulation";

the modulated signal is sent out through sonar;

In the data receiving stage, receiving a data signal from a transmitting end through a receiver; the receiver demodulates the received signal, demodulates the signal from the carrier frequency, and restores the signal into the original data code; the demodulated data code is decoded by a receiver, the lead code and the synchronous code are removed, and the original digital code or analog signal is recovered to obtain secret carrying information of a transmitting end;

Firstly, carrying out matrix quantization extraction on secret information, so that soft decision Viterbi decoding is carried out on the extracted information;

And the result of matrix quantization extraction is used as the input of a Viterbi soft decision decoder after de-interleaving, and finally the secret information is decoded and recovered.

2. The underwater target detection method based on the covert communication technology and the deep learning according to claim 1, wherein the method for acquiring the image of the target object in real time comprises the following steps:

And acquiring an image of the region to be detected by using the high-definition camera module to obtain an image of the target object.

3. The underwater target detection method based on the hidden communication technology and the deep learning according to claim 1, wherein the method for preprocessing the image and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target comprises the following steps:

Adjusting the size of the image of the target object and normalizing the pixel value;

based on YOLOv model, introducing a plurality of fine-grained feature layers on the basis of Darknet, extracting and fusing the feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, class labels and confidence scores of each target.

4. Underwater target detection system based on hidden communication technology and deep learning, which is characterized by comprising: the device comprises an image processing module, an image acquisition module and a hidden communication module;

the image acquisition module is used for acquiring an image of a target object in real time;

The image processing module is used for preprocessing the image, and extracting features of the preprocessed image by using a convolutional neural network model to obtain the boundary frame coordinates, the category labels and the confidence scores of each target;

The hidden communication module is used for realizing the safe transfer of the boundary frame coordinates, the category labels, the confidence scores and the communication information of each target based on the information transmission method of the hidden communication.

5. The underwater target detection system based on the covert communication technique and the deep learning of claim 4, wherein the image acquisition module comprises: the high-definition camera module and the LED light supplementing device;

The high-definition camera module is used for collecting images of the region to be detected and obtaining images of the target object;

The LED light supplementing device is used for assisting the high-definition camera module.

6. The underwater target detection system based on the covert communication technique and the deep learning of claim 4, wherein the image processing module comprises: a preprocessing unit and a feature extraction unit;

The preprocessing unit is used for adjusting the size of the image of the target object and normalizing the pixel value;

The feature extraction unit is used for introducing a plurality of fine-grained feature layers on the basis of a YOLOv model and Darknet, extracting and fusing feature information of different scales of the preprocessed image, and predicting target frames of different scales by using a plurality of detection heads to obtain the boundary frame coordinates, category labels and confidence scores of each target.

7. The underwater target detection system based on the covert communication technique and deep learning of claim 4, wherein the covert communication module comprises: a transmitter, a receiver, a sonar, a signal processing unit and a control unit;

The signal processing unit is used for performing convolution encoding operation on the public information to form carrier information; carrying out convolutional encoding operation on the secret information, and carrying out interleaving treatment on the secret information through an interleaver; embedding secret information into public information by using a matrix embedding mode based on binary Hamming codes to obtain secret carrying information; modulating the secret information to obtain a modulated signal; wherein, the secret information is information processed by the image processing module;

the transmitter is used for transmitting the secret carrying information;

The sonar is used for sending out the modulated signals;

The receiver is used for receiving the data signal from the transmitter;

The control unit is used for carrying out matrix quantization extraction on the secret information, de-interleaving the result of matrix quantization extraction, and then taking the result as the input of the Viterbi soft decision decoder, and decoding and recovering the secret information.

8. An underwater target detection robot based on a covert communication technology and deep learning, which is characterized in that the robot applies the underwater target detection method based on the covert communication technology and the deep learning as set forth in any one of claims 1 to 3.