WO2021027026A1 - Seismic wave vibration nature recognition method based on machine learning - Google Patents

Seismic wave vibration nature recognition method based on machine learning Download PDF

Info

Publication number
WO2021027026A1
WO2021027026A1 PCT/CN2019/107486 CN2019107486W WO2021027026A1 WO 2021027026 A1 WO2021027026 A1 WO 2021027026A1 CN 2019107486 W CN2019107486 W CN 2019107486W WO 2021027026 A1 WO2021027026 A1 WO 2021027026A1
Authority
WO
WIPO (PCT)
Prior art keywords
waveform
seismic
model
training
data
Prior art date
Application number
PCT/CN2019/107486
Other languages
French (fr)
Chinese (zh)
Inventor
刘昕靓
任涛
王柳婷
杨丹丹
商冰冰
Original Assignee
东北大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 东北大学 filed Critical 东北大学
Publication of WO2021027026A1 publication Critical patent/WO2021027026A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01VGEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS; TAGS
    • G01V1/00Seismology; Seismic or acoustic prospecting or detecting
    • G01V1/28Processing seismic data, e.g. for interpretation or for event detection
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01VGEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS; TAGS
    • G01V1/00Seismology; Seismic or acoustic prospecting or detecting
    • G01V1/28Processing seismic data, e.g. for interpretation or for event detection
    • G01V1/30Analysis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01VGEOPHYSICS; GRAVITATIONAL MEASUREMENTS; DETECTING MASSES OR OBJECTS; TAGS
    • G01V1/00Seismology; Seismic or acoustic prospecting or detecting
    • G01V1/28Processing seismic data, e.g. for interpretation or for event detection
    • G01V1/30Analysis
    • G01V1/307Analysis for determining seismic attributes, e.g. amplitude, instantaneous phase or frequency, reflection strength or polarity
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology

Definitions

  • the present invention relates to the technical field of machine learning, in particular to a method for identifying seismic wave vibration properties based on machine learning.
  • Natural earthquakes are squeezing and collision between the earth's plates and plates, causing dislocations and ruptures on the edges and inside of the plates.
  • Unnatural earthquakes also known as induced earthquakes, refer to abnormal seismic activities in a local area caused by human activities, such as artificial nuclear explosion tests or collapses. Observing the seismic wave signals of natural and non-natural earthquakes recorded at each station of the Seismic Network Center, it is found that the waveforms of the two are very similar. If they are not distinguished and screened, the natural and artificial seismic waveforms will be mixed to study the earthquake. The scholars mislead and influence the research work of seismology.
  • Machine learning theory and application research began in 1986. With the rapid development of artificial intelligence, it has a wide range of applications in the fields of image recognition, signal processing, predictive evaluation, combinatorial optimization, and knowledge engineering. In recent years, machine learning methods have begun to be used to analyze and process seismic waveform data. How to apply the ever-changing computer frontier technology to the classification of seismic events so as to improve the accuracy and stability of recognition is still one of the topics that need to be studied. . Domestic experts and scholars combined genetic algorithm (GA) with BP neural network when identifying natural earthquakes and unnatural earthquakes, and established a genetic BP network (GA-BP network). The genetic algorithm is used to optimize the neural network globally, and then Use the BP backpropagation algorithm to accurately train the neural network. Dowla et al.
  • MLP multilayer perceptron
  • HHT Hilbert Huang Transform
  • the present invention provides a method for identifying seismic wave vibration properties based on machine learning.
  • the present invention classifies waveforms based on waveform data to identify natural earthquakes and non-natural earthquakes.
  • the Python programming language is mainly used to process the waveform data
  • the machine learning method-artificial neural network is used to classify and recognize seismic waves.
  • the invention adopts the supervised learning method to obtain the seismic property classifier, that is, the artificial neural network model is trained with data of known categories.
  • the input of the model is a feature vector, including features that can reflect the nature of the earthquake.
  • the present invention considers both the time domain and the frequency domain to ensure that comprehensive features of the waveform are obtained.
  • the technical solution adopted by the present invention is: a method for identifying the nature of seismic wave vibration based on machine learning; including the following steps:
  • Step 1 Read the original seismic waveform data and determine the epicenter distance of the seismic waveform that needs to be classified and identified;
  • step 1 The specific steps of step 1 are:
  • Step 1.1 Use the Python library obspy in Anaconda for the seismic field to read the original seismic waveform data, and select the waveform according to the spherical distance between the seismic source and the station;
  • Step 1.2 Use the STA/LTA algorithm to find the interval containing the earthquake starting point, and use the AIC algorithm to accurately locate the time of the first arrival of the P wave;
  • Step 1.3 After determining the first arrival of the P wave, intercept the seismic waveform to a uniform length to facilitate the extraction of waveform characteristics for subsequent analysis and processing;
  • Step 2 Perform time-frequency domain analysis on the natural earthquake and unnatural seismic waveforms respectively, the time-domain analysis obtains the waveform complexity characteristics, the frequency domain analysis obtains the spectral ratio characteristics, and the two characteristic values are calculated to obtain the time-frequency domain comprehensive characteristics Value waveform complexity/spectrum ratio;
  • the waveform complexity and spectral ratio characteristics are respectively the characteristic quantities that can characterize the waveform trend in the time domain and the frequency domain;
  • Step 3 Use the artificial neural network to train the model.
  • the model input is the feature vector of the waveform instance, that is, the feature vector composed of the three feature values obtained in step 2.
  • the output result is calculated by two hidden layers, and the output result is "1 "Indicates that the category of the waveform is natural earthquake, and the output result of "0" indicates that the category of the waveform is non-natural earthquake;
  • Step 4 Divide the seismic waveform data into two disjoint data sets, the training set and the test set, use the backpropagation algorithm to update the model parameters, continuously fit the training data, and reduce the value of the loss function; test must be used during model training Collect data to evaluate the effect of the model. After a specified number of iterations of training or the model's test accuracy reaches a certain standard, it means that the model training is completed;
  • Step 5 Save the model.
  • the output result of the saved model can be directly called for subsequent seismic waveform classification problems, without tedious training.
  • the beneficial effects of using the above technical solutions are: the intensity and impact of non-natural earthquakes are smaller than those of natural earthquakes. If the non-natural earthquakes cannot be identified and screened out in time, people will mistakenly believe that these events are strong earthquakes. The precursors of confuses the earthquake catalogue established based on this record, which will affect the future research work of seismology. Therefore, the classification and identification of natural earthquakes and non-natural earthquakes in seismic signals are helpful for monitoring and early warning of destructive tectonic earthquakes, for small-equivalent nuclear test reconnaissance, for the study of seismology, for protecting human property and safeguarding national interests. World peace has important meaning.
  • Fig. 1 is a flowchart of waveform complexity feature extraction according to the present invention
  • Figure 2 is a flow chart of the spectral ratio feature extraction of the present invention
  • Fig. 3 is a structural diagram of a model constructed by an artificial neural network used in the present invention.
  • the software environment of this embodiment is the WINDOWS 7 system, and the integrated development environment selects Pycharm IDE.
  • the identification method of seismic wave vibration properties based on machine learning includes the following steps:
  • Step 1 Read the original seismic waveform data and determine the epicenter distance of the seismic waveform that needs to be classified and identified;
  • the epicenter distance is the spherical distance between the station that recorded the waveform and the epicenter;
  • step 1 The specific steps of step 1 are:
  • Step 1.1 Use the Python library obspy in Anaconda for the seismic field to read the original seismic waveform data, and select the local seismic waveform according to the spherical distance between the source and the station;
  • Step 1.2 Use the STA/LTA algorithm to find the interval containing the earthquake starting point, and use the AIC algorithm to accurately locate the time of the first arrival of the P wave;
  • Step 1.3 After determining the first arrival of the P wave, intercept the seismic waveform to a uniform length to facilitate the extraction of waveform characteristics for subsequent analysis and processing;
  • the seismic wave transmission file ".mseed” is read through the obspy library and used to find the starting point by combining STA/LTA and AIC; the data is intercepted to a uniform length to facilitate the extraction of waveform features.
  • This embodiment determines that the seismic wave determines the P wave first arrival After the earthquake's starting point, the data of 3 seconds before the starting point and the data of 160 seconds after the starting point are intercepted, a total of 163s of data are processed for subsequent analysis;
  • Step 2 Perform time-frequency domain analysis on the natural earthquake and unnatural seismic waveforms respectively, the time-domain analysis obtains the waveform complexity characteristics, the frequency domain analysis obtains the spectral ratio characteristics, and the two characteristic values are calculated to obtain the time-frequency domain comprehensive characteristics Value waveform complexity/spectrum ratio;
  • the waveform complexity and spectral ratio characteristics are the characteristic quantities that can characterize the waveform trend at the time domain level and the frequency domain level, respectively, and the extraction process is shown in Figure 1 and Figure 2;
  • Step 3 Use the artificial neural network to train the model.
  • the model input is the feature vector of the waveform instance, that is, the feature vector composed of the three feature values obtained in step 2, and the output result is calculated through two hidden layers ,
  • the output result of "1" indicates that the category of the waveform is natural earthquake, and the output result of "0" indicates that the category of the waveform is non-natural earthquake;
  • Hidden layers 2 Number of hidden layer neurons 6 Prevent overfitting L2 regularization Learning rate Exponential decay algorithm update Loss function Binary cross entropy loss function Number of training iterations 500,000
  • Step 4 Divide the seismic waveform data into two disjoint data sets, the training set and the test set, use the backpropagation algorithm to update the model parameters, continuously fit the training data, and reduce the value of the loss function; test must be used during model training Collect data to evaluate the effect of the model. After a specified number of iterations of training or the model's test accuracy reaches a certain standard, it means that the model training is completed;
  • the setting standard for the number of training iterations in this embodiment is 500,000;
  • Step 5 Save the model.
  • the output result of the saved model can be directly called for subsequent seismic waveform classification problems, without tedious training.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Remote Sensing (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Geophysics (AREA)
  • Software Systems (AREA)
  • Acoustics & Sound (AREA)
  • Environmental & Geological Engineering (AREA)
  • Geology (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Geophysics And Detection Of Objects (AREA)

Abstract

A seismic wave vibration nature recognition method based on machine learning, wherein the method relates to the technical field of machine learning. The implementation of the method is divided into four stages: seismic waveform processing, feature value extraction, model training, and model application, involving: calculating the distance between a seismic source and a station, screening an epicentral distance, reading three-component seismic waveform data of the screened epicentral distance, using a long time-window algorithm/short time-window algorithm (STA/LTA) and an AIC method to accurately find the location of a first movement of seismic waves, and further intercepting a seismic data length; respectively performing time domain analysis and frequency domain analysis on a seismic waveform; extracting three characteristic waveform complexities, a spectrum ratio, and waveform complexity/spectrum ratio as an input of an artificial neural network model; and training an artificial neural network model with two hidden layers used to identify a two-classification problem, and outputting the probability of a category to which the waveform belongs. The model trained by the method can accurately and efficiently determine a category to which a waveform belongs.

Description

基于机器学习的地震波震动性质识别方法Recognition method of seismic wave vibration properties based on machine learning 技术领域Technical field
本发明涉及机器学习技术领域,尤其涉及一种基于机器学习的地震波震动性质识别方法。The present invention relates to the technical field of machine learning, in particular to a method for identifying seismic wave vibration properties based on machine learning.
背景技术Background technique
地震可按其震动性质分为天然地震和非天然地震两大类,天然地震为地球板块与板块之间相互挤压碰撞,造成板块边沿及板块内部产生错动和破裂。非天然地震又称诱发地震,指人类活动引起的局部地区异常地震活动,如人工核爆试验或塌陷等。观察地震台网中心各台站记录的天然地震与非天然地震的地震波信号时发现,二者的波形极为相似,若不对其进行区分筛选,天然地震波形与人工地震波形掺杂将会给研究地震的学者带来误导,影响地震学的研究工作。Earthquakes can be divided into two categories: natural earthquakes and non-natural earthquakes according to their vibration properties. Natural earthquakes are squeezing and collision between the earth's plates and plates, causing dislocations and ruptures on the edges and inside of the plates. Unnatural earthquakes, also known as induced earthquakes, refer to abnormal seismic activities in a local area caused by human activities, such as artificial nuclear explosion tests or collapses. Observing the seismic wave signals of natural and non-natural earthquakes recorded at each station of the Seismic Network Center, it is found that the waveforms of the two are very similar. If they are not distinguished and screened, the natural and artificial seismic waveforms will be mixed to study the earthquake. The scholars mislead and influence the research work of seismology.
机器学习理论和应用研究始于1986年,随着人工智能的突飞猛进,其在图像识别、信号处理、预测评估、组合优化及知识工程等领域有着广泛的应用。近年来机器学习方法开始被用来对地震波形数据进行分析和处理,如何把日新月异的计算机前沿技术应用于地震事件分类中,从而提高识别的准确率、稳定性仍是当前需要研究的课题之一。国内有专家学者在识别天然地震与非天然地震时将遗传算法(GA)与BP神经网络相结合,建立了遗传BP网络(GA-BP网络),利用遗传算法对神经网络进行全局寻优,然后利用BP反向传播算法对神经网络进行精确训练。Dowla等人利用多层感知机(MLP)神经网络识别天然地震与地下爆炸。21世纪初,有学者提出利用经验模态分解方法对原始数据波形进行处理,从中提取出特征值,对原始波形数据预处理后再利用支持向量机(SVM)完成地震波形分类识别任务,使模型具有更高的识别率。同时有专家从物理学角度提出利用小波变换方法对天然地震和人工地震提取到的震源尺度、震源深度等特征进行分类识别任务。毕明霞、黄汉明等人利用希尔伯特黄变换(HHT)提取波形特征,利用SVM进行识别分类;边银菊等利用小波变换提取地震波形特征进行研究;一些国外学者尝试使用P波、S波的信号到时识别分类地震波。Machine learning theory and application research began in 1986. With the rapid development of artificial intelligence, it has a wide range of applications in the fields of image recognition, signal processing, predictive evaluation, combinatorial optimization, and knowledge engineering. In recent years, machine learning methods have begun to be used to analyze and process seismic waveform data. How to apply the ever-changing computer frontier technology to the classification of seismic events so as to improve the accuracy and stability of recognition is still one of the topics that need to be studied. . Domestic experts and scholars combined genetic algorithm (GA) with BP neural network when identifying natural earthquakes and unnatural earthquakes, and established a genetic BP network (GA-BP network). The genetic algorithm is used to optimize the neural network globally, and then Use the BP backpropagation algorithm to accurately train the neural network. Dowla et al. used a multilayer perceptron (MLP) neural network to identify natural earthquakes and underground explosions. At the beginning of the 21st century, some scholars proposed to use the empirical mode decomposition method to process the original data waveform, extract the eigenvalues from it, preprocess the original waveform data, and then use the support vector machine (SVM) to complete the seismic waveform classification and recognition task, and make the model Has a higher recognition rate. At the same time, some experts proposed to use wavelet transform method to classify and recognize features such as focal scale and focal depth extracted from natural earthquakes and artificial earthquakes. Bi Mingxia, Huang Hanming and others used Hilbert Huang Transform (HHT) to extract waveform features, and SVM for recognition and classification; Bian Yinju and others used wavelet transform to extract seismic waveform features for research; some foreign scholars tried to use P wave and S wave Identify and classify seismic waves when the signal arrives.
发明概述Summary of the invention
技术问题technical problem
问题的解决方案The solution to the problem
技术解决方案Technical solutions
针对现有技术存在的问题,本发明提供一种基于机器学习的地震波震动性质识别方法,本发明根据波形数据对波形进行分类,从而识别天然地震与非天然地震。其中主要使用Python程序设计语言对波形数据进行处理,并运用机器学习方法——人工神经网络进行分类识别地震波。本发明采用监督学习方式获得地震性质分类器,即用已知类别的数据训练人工神经网络模型。模型的输入是一个特征向量,包括可以反映地震性质的特征。本发明提取特征时分别从时域和频域两方面考虑保证获得波形的综合特征。In view of the problems existing in the prior art, the present invention provides a method for identifying seismic wave vibration properties based on machine learning. The present invention classifies waveforms based on waveform data to identify natural earthquakes and non-natural earthquakes. Among them, the Python programming language is mainly used to process the waveform data, and the machine learning method-artificial neural network is used to classify and recognize seismic waves. The invention adopts the supervised learning method to obtain the seismic property classifier, that is, the artificial neural network model is trained with data of known categories. The input of the model is a feature vector, including features that can reflect the nature of the earthquake. When extracting features, the present invention considers both the time domain and the frequency domain to ensure that comprehensive features of the waveform are obtained.
为解决上述技术问题,本发明所采取的技术方案是:基于机器学习的地震波震动性质识别方法;包括以下步骤:In order to solve the above technical problems, the technical solution adopted by the present invention is: a method for identifying the nature of seismic wave vibration based on machine learning; including the following steps:
步骤1:对原始地震波形数据进行读取,确定需要进行分类识别的地震波形的震中距;Step 1: Read the original seismic waveform data and determine the epicenter distance of the seismic waveform that needs to be classified and identified;
所述步骤1的具体步骤为The specific steps of step 1 are
步骤1.1:使用Anaconda中针对地震领域的Python库obspy读取原始地震波形数据,依据震源与台站间的球面距离选取波形;Step 1.1: Use the Python library obspy in Anaconda for the seismic field to read the original seismic waveform data, and select the waveform according to the spherical distance between the seismic source and the station;
步骤1.2:利用长短时窗比STA/LTA的算法找出包含地震起点的区间,使用AIC算法精确定位P波初至的时刻;Step 1.2: Use the STA/LTA algorithm to find the interval containing the earthquake starting point, and use the AIC algorithm to accurately locate the time of the first arrival of the P wave;
步骤1.3:确定P波初至后将地震波形截取为统一长度便于提取波形特征进行后续分析处理;Step 1.3: After determining the first arrival of the P wave, intercept the seismic waveform to a uniform length to facilitate the extraction of waveform characteristics for subsequent analysis and processing;
步骤2:分别对天然地震和非天然地震波形进行时频域分析,时域分析得到波形复杂度特征,频域分析得到谱比值特征,将所述两个特征值进行计算得到时频域综合特征值波形复杂度/谱比值;Step 2: Perform time-frequency domain analysis on the natural earthquake and unnatural seismic waveforms respectively, the time-domain analysis obtains the waveform complexity characteristics, the frequency domain analysis obtains the spectral ratio characteristics, and the two characteristic values are calculated to obtain the time-frequency domain comprehensive characteristics Value waveform complexity/spectrum ratio;
所述波形复杂度和谱比值特征分别为时域层面和频域层面可以表征波形走向的特征量;The waveform complexity and spectral ratio characteristics are respectively the characteristic quantities that can characterize the waveform trend in the time domain and the frequency domain;
步骤3:使用人工神经网络训练模型,模型输入是波形实例的特征向量,即步 骤2中得到的三个特征值组成的特征向量,经过两个隐藏层的计算得到输出结果,输出结果为“1”表示波形所属类别为天然地震,输出结果为“0”表示波形所属类别为非天然地震;Step 3: Use the artificial neural network to train the model. The model input is the feature vector of the waveform instance, that is, the feature vector composed of the three feature values obtained in step 2. The output result is calculated by two hidden layers, and the output result is "1 "Indicates that the category of the waveform is natural earthquake, and the output result of "0" indicates that the category of the waveform is non-natural earthquake;
步骤4:将地震波形数据分为训练集和测试集两个不相交的数据集,使用反向传播算法更新模型参数,不断拟合训练数据,降低损失函数的值;模型训练过程中须使用测试集数据评估模型的效果,经过指定迭代次数的训练或模型的测试准确率达到一定标准后,意味着模型训练完成;Step 4: Divide the seismic waveform data into two disjoint data sets, the training set and the test set, use the backpropagation algorithm to update the model parameters, continuously fit the training data, and reduce the value of the loss function; test must be used during model training Collect data to evaluate the effect of the model. After a specified number of iterations of training or the model's test accuracy reaches a certain standard, it means that the model training is completed;
步骤5:保存模型,对后续的地震波形分类问题可直接调用该保存模型输出结果,无需再进行繁琐训练。Step 5: Save the model. The output result of the saved model can be directly called for subsequent seismic waveform classification problems, without tedious training.
发明的有益效果The beneficial effects of the invention
有益效果Beneficial effect
采用上述技术方案所产生的有益效果在于:非天然地震的强度及影响程度相比天然地震较小,如果不能及时的将非天然地震予以识别和筛选出来,会让人误以为这些事件是强震的前兆,混淆依此记录所建立的地震目录,影响日后地震学的研究工作。因此地震信号中的天然地震和非天然地震的分类识别有助于破坏性构造地震的监测预警,有助于小当量核试侦察,对地震学的研究,对保护人类财产和维护国家利益,保卫世界和平都有重要的意义。The beneficial effects of using the above technical solutions are: the intensity and impact of non-natural earthquakes are smaller than those of natural earthquakes. If the non-natural earthquakes cannot be identified and screened out in time, people will mistakenly believe that these events are strong earthquakes. The precursors of confuses the earthquake catalogue established based on this record, which will affect the future research work of seismology. Therefore, the classification and identification of natural earthquakes and non-natural earthquakes in seismic signals are helpful for monitoring and early warning of destructive tectonic earthquakes, for small-equivalent nuclear test reconnaissance, for the study of seismology, for protecting human property and safeguarding national interests. World peace has important meaning.
对附图的简要说明Brief description of the drawings
附图说明Description of the drawings
图1为本发明的波形复杂度特征提取流程图;Fig. 1 is a flowchart of waveform complexity feature extraction according to the present invention;
图2为本发明的谱比值特征提取流程图;Figure 2 is a flow chart of the spectral ratio feature extraction of the present invention;
图3为本发明使用的人工神经网络搭建的模型结构图。Fig. 3 is a structural diagram of a model constructed by an artificial neural network used in the present invention.
发明实施例Invention embodiment
本发明的实施方式Embodiments of the invention
下面结合附图和实施例,对本发明的具体实施方式作进一步详细描述。以下实施例用于说明本发明,但不用来限制本发明的范围。The specific embodiments of the present invention will be described in further detail below in conjunction with the drawings and embodiments. The following examples are used to illustrate the present invention, but not to limit the scope of the present invention.
本实施例的软件环境为WINDOWS 7系统,集成开发环境选择Pycharm IDE。The software environment of this embodiment is the WINDOWS 7 system, and the integrated development environment selects Pycharm IDE.
基于机器学习的地震波震动性质识别方法,包括以下步骤:The identification method of seismic wave vibration properties based on machine learning includes the following steps:
步骤1:对原始地震波形数据进行读取,确定需要进行分类识别的地震波形的震中距;Step 1: Read the original seismic waveform data and determine the epicenter distance of the seismic waveform that needs to be classified and identified;
所述震中距即记录该波形的台站与震源的球面距离;The epicenter distance is the spherical distance between the station that recorded the waveform and the epicenter;
所述步骤1的具体步骤为The specific steps of step 1 are
步骤1.1:使用Anaconda中针对地震领域的Python库obspy读取原始地震波形数据,依据震源与台站间的球面距离选取近震波形;Step 1.1: Use the Python library obspy in Anaconda for the seismic field to read the original seismic waveform data, and select the local seismic waveform according to the spherical distance between the source and the station;
由于地震波信号的能量在传播过程中会有损耗,震中距越大,记录到的波形所中掺杂的噪声越大,同时其代表的地震能量损耗越大,即信噪比低,不利于分析波形数据,故本方法选定震中距500km以内的数据进行分类识别;Since the energy of the seismic wave signal will be lost during the propagation process, the greater the epicentral distance, the greater the noise doped in the recorded waveform, and the greater the seismic energy loss it represents, that is, the low signal-to-noise ratio, which is not conducive to analysis Waveform data, so this method selects the data within 500km of the epicenter for classification and identification;
步骤1.2:利用长短时窗比STA/LTA的算法找出包含地震起点的区间,使用AIC算法精确定位P波初至的时刻;Step 1.2: Use the STA/LTA algorithm to find the interval containing the earthquake starting point, and use the AIC algorithm to accurately locate the time of the first arrival of the P wave;
本实施例使用的波形数据中存在大量噪声,故需要识别地震波初动的起点,为方便后续处理,选择截取定长的可以包含完整波形信息的数据;There is a lot of noise in the waveform data used in this embodiment, so it is necessary to identify the starting point of the initial motion of the seismic wave, and to facilitate subsequent processing, choose to intercept fixed-length data that can contain complete waveform information;
步骤1.3:确定P波初至后将地震波形截取为统一长度便于提取波形特征进行后续分析处理;Step 1.3: After determining the first arrival of the P wave, intercept the seismic waveform to a uniform length to facilitate the extraction of waveform characteristics for subsequent analysis and processing;
将地震波传输文件“.mseed”文件通过obspy库读取并使用STA/LTA与AIC相结合的方法寻找起点;将数据截取为统一长度便于提取波形特征,本实施例确定地震波确定P波初至为地震起点后截取起点前3秒的数据及起点后的160秒的数据,共163s数据进行后续分析处理;The seismic wave transmission file ".mseed" is read through the obspy library and used to find the starting point by combining STA/LTA and AIC; the data is intercepted to a uniform length to facilitate the extraction of waveform features. This embodiment determines that the seismic wave determines the P wave first arrival After the earthquake's starting point, the data of 3 seconds before the starting point and the data of 160 seconds after the starting point are intercepted, a total of 163s of data are processed for subsequent analysis;
步骤2:分别对天然地震和非天然地震波形进行时频域分析,时域分析得到波形复杂度特征,频域分析得到谱比值特征,将所述两个特征值进行计算得到时频域综合特征值波形复杂度/谱比值;Step 2: Perform time-frequency domain analysis on the natural earthquake and unnatural seismic waveforms respectively, the time-domain analysis obtains the waveform complexity characteristics, the frequency domain analysis obtains the spectral ratio characteristics, and the two characteristic values are calculated to obtain the time-frequency domain comprehensive characteristics Value waveform complexity/spectrum ratio;
所述波形复杂度和谱比值特征分别为时域层面和频域层面可以表征波形走向的特征量,提取过程如图1、图2所示;The waveform complexity and spectral ratio characteristics are the characteristic quantities that can characterize the waveform trend at the time domain level and the frequency domain level, respectively, and the extraction process is shown in Figure 1 and Figure 2;
步骤3:使用人工神经网络训练模型,如图3所示,模型输入是波形实例的特征向量,即步骤2中得到的三个特征值组成的特征向量,经过两个隐藏层的计算得到输出结果,输出结果为“1”表示波形所属类别为天然地震,输出结果为“0”表 示波形所属类别为非天然地震;Step 3: Use the artificial neural network to train the model. As shown in Figure 3, the model input is the feature vector of the waveform instance, that is, the feature vector composed of the three feature values obtained in step 2, and the output result is calculated through two hidden layers , The output result of "1" indicates that the category of the waveform is natural earthquake, and the output result of "0" indicates that the category of the waveform is non-natural earthquake;
将特征向量的三个分量输入人工神经网络模型中训练并保存模型。将单个波形的特征向量输入到模型中可以得到分类结果;或将一次事件的全部波形特征向量输入到模型中,结合一次事件中的多个地震波形数据得到的分类结果确定当前事件的震动性质(天然地震事件或非天然震动事件)。Input the three components of the feature vector into the artificial neural network model to train and save the model. Input the feature vector of a single waveform into the model to obtain the classification result; or input all the feature vectors of the waveform of an event into the model, and combine the classification results obtained from multiple seismic waveform data in one event to determine the vibration properties of the current event ( Natural seismic event or non-natural seismic event).
所述步骤3中人工神经网络的模型结构及使用参数信息如表1所示:The model structure and use parameter information of the artificial neural network in the step 3 are shown in Table 1:
表1人工神经网络的模型结构及使用参数Table 1 Model structure and usage parameters of artificial neural network
[Table 1][Table 1]
隐藏层数Hidden layers 22
隐藏层神经元个数Number of hidden layer neurons 66
防止过拟合Prevent overfitting L2正则化L2 regularization
学习率Learning rate 指数衰减算法更新Exponential decay algorithm update
损失函数Loss function 二元交叉熵损失函数Binary cross entropy loss function
训练迭代次数Number of training iterations 500,000500,000
步骤4:将地震波形数据分为训练集和测试集两个不相交的数据集,使用反向传播算法更新模型参数,不断拟合训练数据,降低损失函数的值;模型训练过程中须使用测试集数据评估模型的效果,经过指定迭代次数的训练或模型的测试准确率达到一定标准后,意味着模型训练完成;Step 4: Divide the seismic waveform data into two disjoint data sets, the training set and the test set, use the backpropagation algorithm to update the model parameters, continuously fit the training data, and reduce the value of the loss function; test must be used during model training Collect data to evaluate the effect of the model. After a specified number of iterations of training or the model's test accuracy reaches a certain standard, it means that the model training is completed;
本实施例中训练迭代次数设定标准为500,000次;The setting standard for the number of training iterations in this embodiment is 500,000;
步骤5:保存模型,对后续的地震波形分类问题可直接调用该保存模型输出结果,无需再进行繁琐训练。Step 5: Save the model. The output result of the saved model can be directly called for subsequent seismic waveform classification problems, without tedious training.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解;其依然可以对前述实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;因而这些修改或者替换,并不使相应技术方案的本质脱离本发明权利要求所限定的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that; The technical solutions recorded in the foregoing embodiments are modified, or some or all of the technical features are equivalently replaced; therefore, these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the scope defined by the claims of the present invention.

Claims (2)

  1. 一种基于机器学习的地震波震动性质识别方法,其特征在于:包括以下步骤:A method for identifying seismic properties of seismic waves based on machine learning, which is characterized in that it includes the following steps:
    步骤1:对原始地震波形数据进行读取,确定需要进行分类识别的地震波形的震中距;Step 1: Read the original seismic waveform data and determine the epicenter distance of the seismic waveform that needs to be classified and identified;
    步骤2:分别对天然地震和非天然地震波形进行时频域分析,时域分析得到波形复杂度特征,频域分析得到谱比值特征,将所述两个特征值进行计算得到时频域综合特征值波形复杂度/谱比值;Step 2: Perform time-frequency domain analysis on the natural earthquake and unnatural seismic waveforms respectively, the time-domain analysis obtains the waveform complexity characteristics, the frequency domain analysis obtains the spectral ratio characteristics, and the two characteristic values are calculated to obtain the time-frequency domain comprehensive characteristics Value waveform complexity/spectrum ratio;
    所述波形复杂度和谱比值特征分别为时域层面和频域层面可以表征波形走向的特征量;The waveform complexity and spectral ratio characteristics are respectively the characteristic quantities that can characterize the waveform trend in the time domain and the frequency domain;
    步骤3:使用人工神经网络训练模型,模型输入是地震波形实例的特征向量,即步骤2中得到的三个特征值组成的特征向量,经过两个隐藏层的计算得到输出结果,输出结果为“1”表示波形所属类别为天然地震,输出结果为“0”表示波形所属类别为非天然地震;Step 3: Use artificial neural network to train the model. The input of the model is the eigenvector of the seismic waveform instance, that is, the eigenvector composed of the three eigenvalues obtained in step 2. The output result is calculated by two hidden layers, and the output result is " 1" indicates that the category of the waveform is natural earthquake, and the output result of "0" indicates that the category of the waveform is non-natural earthquake;
    步骤4:将地震波形数据分为训练集和测试集两个不相交的数据集,使用反向传播算法更新模型参数,不断拟合训练数据,降低损失函数的值;模型训练过程中须使用测试集数据评估模型的效果,经过指定迭代次数的训练或模型的测试准确率达到一定标准后,意味着模型训练完成;Step 4: Divide the seismic waveform data into two disjoint data sets, the training set and the test set, use the backpropagation algorithm to update the model parameters, continuously fit the training data, and reduce the value of the loss function; test must be used during model training Collect data to evaluate the effect of the model. After a specified number of iterations of training or the model's test accuracy reaches a certain standard, it means that the model training is completed;
    步骤5:保存模型,对后续的地震波形分类问题可直接调用该保存模型输出结果,无需再进行繁琐训练。Step 5: Save the model. For subsequent seismic waveform classification problems, you can directly call the saved model to output the results, eliminating the need for tedious training.
  2. 根据权利要求1所述的基于机器学习的地震波震动性质识别方法,其特征在于:所述步骤1的具体步骤为:The method for identifying seismic wave vibration properties based on machine learning according to claim 1, characterized in that: the specific steps of step 1 are:
    步骤1.1:使用Anaconda中针对地震领域的Python库obspy读取原始地震波形数据,依据震源与台站间的球面距离选取地震波形;Step 1.1: Use the Python library obspy in Anaconda for the seismic field to read the original seismic waveform data, and select the seismic waveform according to the spherical distance between the seismic source and the station;
    步骤1.2:利用长短时窗比STA/LTA的算法找出包含地震起点的区间,使用AIC算法精确定位地震波P波初至的时刻;Step 1.2: Use the STA/LTA algorithm to find the interval containing the earthquake starting point, and use the AIC algorithm to accurately locate the time of the first arrival of the seismic wave P;
    步骤1.3:确定P波初至后将地震波形截取为统一长度便于提取波 形特征进行后续分析处理。Step 1.3: After determining the first arrival of the P wave, intercept the seismic waveform to a uniform length to facilitate the extraction of waveform characteristics for subsequent analysis and processing.
PCT/CN2019/107486 2019-08-15 2019-09-24 Seismic wave vibration nature recognition method based on machine learning WO2021027026A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910752213.2A CN110488351A (en) 2019-08-15 2019-08-15 Seismic wave based on machine learning shakes property recognition methods
CN201910752213.2 2019-08-15

Publications (1)

Publication Number Publication Date
WO2021027026A1 true WO2021027026A1 (en) 2021-02-18

Family

ID=68551095

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/107486 WO2021027026A1 (en) 2019-08-15 2019-09-24 Seismic wave vibration nature recognition method based on machine learning

Country Status (2)

Country Link
CN (1) CN110488351A (en)
WO (1) WO2021027026A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210117737A1 (en) * 2019-10-18 2021-04-22 Korea University Research And Business Foundation Earthquake event classification method using attention-based convolutional neural network, recording medium and device for performing the method
CN113671564A (en) * 2021-08-11 2021-11-19 西安石油大学 Automatic microseism effective event picking method based on NARX dynamic neural network
CN115963548A (en) * 2023-01-16 2023-04-14 中国矿业大学 Mine microseismic P-wave arrival time picking model construction method based on inverse deductive learning
CN116047604A (en) * 2023-02-23 2023-05-02 成都理工大学 Deep seismic phase rapid pickup method based on amplitude statistics and time-frequency analysis
CN116046307A (en) * 2022-12-12 2023-05-02 中铁西北科学研究院有限公司 Identification method suitable for earthquake damage mode of vibrating table with tunnel slope
CN116502154A (en) * 2023-04-25 2023-07-28 中国地震局地球物理研究所 Seismic classification method and system based on multidimensional feature extraction and XGBoost

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111257934B (en) * 2020-01-17 2022-03-11 哈尔滨工业大学 Seismic oscillation peak acceleration prediction method based on second-order neuron deep neural network
CN113960657A (en) * 2020-07-20 2022-01-21 中国石油天然气股份有限公司 Seismic data feature acquisition method and system
CN112230275B (en) * 2020-09-14 2024-01-26 河南省地震局 Method and device for identifying seismic waveform and electronic equipment
CN112686315A (en) * 2020-12-31 2021-04-20 山西三友和智慧信息技术股份有限公司 Deep learning-based unnatural earthquake classification method
CN112782762A (en) * 2021-01-29 2021-05-11 东北大学 Earthquake magnitude determination method based on deep learning
CN112799130A (en) * 2021-02-03 2021-05-14 东北大学 Seismic wave amplitude prediction method based on deep learning
CN112799128B (en) * 2021-02-03 2021-10-22 北京航空航天大学 Method for seismic signal detection and seismic phase extraction
CN113341459B (en) * 2021-05-12 2022-04-12 北京大学 Earthquake positioning method and device based on machine learning and dynamics calculation fusion
CN114509811B (en) * 2022-01-28 2022-12-02 中国地震局地球物理研究所 Single station rear azimuth estimation method and device based on deep learning
CN115421188B (en) * 2022-08-23 2024-02-20 宿州学院 Microseism event real-time identification system and method based on artificial intelligence

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5617513A (en) * 1992-03-06 1997-04-01 Schnitta; Bonnie S. Method for analyzing activity in a signal
CN102495425A (en) * 2011-11-14 2012-06-13 北京科技大学 Energy-based method for automatically locating earthquake focus of microearthquake
CN107300715A (en) * 2017-06-22 2017-10-27 禁核试北京国家数据中心 A kind of method for recognizing nuclear explosion seismic events
CN107818309A (en) * 2017-11-02 2018-03-20 中国地震局地壳应力研究所 A kind of on-line automatic extracting method of earthquake motion P ripples based on Analysis signal-to-noise ratio (SNR)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805269B (en) * 2018-06-28 2021-11-30 杭州叙简科技股份有限公司 Method for picking seismic facies arrival time based on LSTM recurrent neural network
CN110082822B (en) * 2019-04-09 2020-07-28 中国科学技术大学 Method for detecting earthquake by using convolution neural network
CN110032975B (en) * 2019-04-15 2021-09-07 禁核试北京国家数据中心 Seismic facies picking method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5617513A (en) * 1992-03-06 1997-04-01 Schnitta; Bonnie S. Method for analyzing activity in a signal
CN102495425A (en) * 2011-11-14 2012-06-13 北京科技大学 Energy-based method for automatically locating earthquake focus of microearthquake
CN107300715A (en) * 2017-06-22 2017-10-27 禁核试北京国家数据中心 A kind of method for recognizing nuclear explosion seismic events
CN107818309A (en) * 2017-11-02 2018-03-20 中国地震局地壳应力研究所 A kind of on-line automatic extracting method of earthquake motion P ripples based on Analysis signal-to-noise ratio (SNR)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210117737A1 (en) * 2019-10-18 2021-04-22 Korea University Research And Business Foundation Earthquake event classification method using attention-based convolutional neural network, recording medium and device for performing the method
US11947061B2 (en) * 2019-10-18 2024-04-02 Korea University Research And Business Foundation Earthquake event classification method using attention-based convolutional neural network, recording medium and device for performing the method
CN113671564A (en) * 2021-08-11 2021-11-19 西安石油大学 Automatic microseism effective event picking method based on NARX dynamic neural network
CN116046307A (en) * 2022-12-12 2023-05-02 中铁西北科学研究院有限公司 Identification method suitable for earthquake damage mode of vibrating table with tunnel slope
CN115963548A (en) * 2023-01-16 2023-04-14 中国矿业大学 Mine microseismic P-wave arrival time picking model construction method based on inverse deductive learning
CN115963548B (en) * 2023-01-16 2024-01-23 中国矿业大学 Mine microseismic P wave arrival time pickup model construction method based on deduction learning
CN116047604A (en) * 2023-02-23 2023-05-02 成都理工大学 Deep seismic phase rapid pickup method based on amplitude statistics and time-frequency analysis
CN116047604B (en) * 2023-02-23 2023-10-17 成都理工大学 Deep seismic phase rapid pickup method based on amplitude statistics and time-frequency analysis
CN116502154A (en) * 2023-04-25 2023-07-28 中国地震局地球物理研究所 Seismic classification method and system based on multidimensional feature extraction and XGBoost

Also Published As

Publication number Publication date
CN110488351A (en) 2019-11-22

Similar Documents

Publication Publication Date Title
WO2021027026A1 (en) Seismic wave vibration nature recognition method based on machine learning
CN109473120A (en) A kind of abnormal sound signal recognition method based on convolutional neural networks
Ruano et al. Seismic detection using support vector machines
Del Pezzo et al. Discrimination of earthquakes and underwater explosions using neural networks
Chin et al. Intelligent real-time earthquake detection by recurrent neural networks
Lin et al. Detection of gravitational waves using Bayesian neural networks
Riggelsen et al. A machine learning approach for improving the detection capabilities at 3C seismic stations
Cai et al. Anomaly detection of earthquake precursor data using long short-term memory networks
Liao et al. ARRU phase picker: Attention recurrent‐residual U‐Net for picking seismic P‐and S‐phase arrivals
Kim et al. Graph convolution networks for seismic events classification using raw waveform data from multiple stations
CN111814699A (en) Deep learning earthquake prediction method for SWARM electromagnetic satellite data
CN112799130A (en) Seismic wave amplitude prediction method based on deep learning
Zhu et al. An STFT-LSTM system for P-wave identification
Liu et al. Intrusion detection based on IDBM
CN112526606A (en) Seismic source type prediction method and system based on heterogeneous multi-classification model
Liu et al. Discrimination between earthquake P waves and microtremors via a generative adversarial network
Whitehill et al. Whosecough: In-the-wild cougher verification using multitask learning
Frost et al. TB or not TB? Acoustic cough analysis for tuberculosis classification
Madureira et al. A neural network seismic detector
León et al. Small-layered Feed-Forward and Convolutional neural networks for efficient P wave earthquake detection
CN113782051B (en) Broadcast effect classification method and system, electronic equipment and storage medium
Yue et al. Seismic event classification based on a two-step convolutional neural network
Garcıa et al. Automatic detection of long period events based on subband-envelope processing
Sya’bani et al. Implementation of automatic first arrival picking on P-Wave seismic signal using logistic regression method
Sun et al. Micro‐seismic Event Detection of Hot Dry Rock based on the Gated Recurrent Unit Model and a Support Vector Machine

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19941340

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19941340

Country of ref document: EP

Kind code of ref document: A1