CN116823541A

CN116823541A - Criminal investigation calculation method and system based on nonlinear model

Info

Publication number: CN116823541A
Application number: CN202311090895.8A
Authority: CN
Inventors: 王芳; 郭雷; 张蓝天
Original assignee: Shandong University; Academy of Mathematics and Systems Science of CAS
Current assignee: Shandong University; Academy of Mathematics and Systems Science of CAS
Priority date: 2023-08-29
Filing date: 2023-08-29
Publication date: 2023-09-29

Abstract

The invention relates to the technical field of legal text processing, in particular to a criminal calculation method and system based on a nonlinear model, comprising the following steps: text data in case description is obtained, and the text data are preprocessed to obtain sentencing factor characteristics; determining a criminal starting point based on the obtained criminal factor characteristics, and determining a range of a criminal range of the criminal rule adjustment reference according to the criminal factor characteristics to obtain the weight of the criminal rule; estimating noise of the nonlinear model, determining noise distribution, generating a plurality of samples obeying the noise distribution based on the sentencing factor characteristics, and determining error limits of parameter estimation values of the nonlinear model through a plurality of simulations; and in the error limit, according to the corresponding relation between the weight of the sentencing factors and the parameter estimation value, determining the estimation value and the confidence limit of the unknown parameter vector corresponding to the sentencing characteristic factors in the nonlinear model, and obtaining the prediction criminal period output by the model as a reference standard for judgment.

Description

A sentencing calculation method and system based on nonlinear models

技术领域Technical field

本发明涉及法律文本处理技术领域，具体为一种基于非线性模型的量刑计算方法及系统。The invention relates to the technical field of legal text processing, specifically a sentencing calculation method and system based on a nonlinear model.

背景技术Background technique

本部分的陈述仅仅是提供了与本发明相关的背景技术信息，不必然构成在先技术。The statements in this section merely provide background technical information related to the present invention and do not necessarily constitute prior art.

量刑系统是根据案卷中的文本信息，向司法工作者输出预测量刑结果的文本数据处理系统，这类系统通常依赖案卷中大量文本数据提取到的特征，根据不同案件中量刑因素对刑期结果的影响，在经过一定的处理和计算后进行结果的输出。目前，这类系统分为两类，一是遵循传统的线性回归统计模型或实证方法搭建的系统，二是基于机器学习和自然语言处理技术，挖掘案卷文本中的关键信息和文本中包含的决策逻辑。The sentencing system is a text data processing system that outputs predictions of sentencing results to judicial workers based on text information in case files. This type of system usually relies on features extracted from a large amount of text data in case files, and based on the impact of sentencing factors on sentencing outcomes in different cases , and the results are output after certain processing and calculations. Currently, this type of system is divided into two categories. One is a system built based on traditional linear regression statistical models or empirical methods. The other is a system based on machine learning and natural language processing technology to mine key information in the case file text and the decisions contained in the text. logic.

其中，线性回归统计模型运用的是统计学方法，根据大数定律和中心极限定理等结果处理案卷中的原始文本数据以得到量刑的计算结果，而模型本身并没有对刑期区间进行限制，不能够完全适应实际的量刑场景（实际的量刑场景具有非线性饱和特性），因此对文本数据中包含的量刑机理无法准确的分析和控制，从而使得到的量刑结果与实际案件描述中的量刑场景差异较大，难以帮助司法工作人员提高工作效率。此外，由于目前公开的案卷数量有限，统计学方法需要先验性地假设数据满足良好的统计性质(如独立同分布等)，导致模型中估计参数的值以及精度难以确定，从而使模型无法掌握不同案件中量刑情节对基准刑的调节比例。Among them, the linear regression statistical model uses statistical methods to process the original text data in the case file according to the results of the law of large numbers and the central limit theorem to obtain the sentencing calculation results. However, the model itself does not limit the sentence range and cannot It is fully adapted to the actual sentencing scenario (the actual sentencing scenario has non-linear saturation characteristics), so the sentencing mechanism contained in the text data cannot be accurately analyzed and controlled, making the sentencing results obtained more different from the sentencing scenario described in the actual case. It is difficult to help judicial staff improve their work efficiency. In addition, due to the limited number of currently disclosed case files, statistical methods need to assume a priori that the data satisfies good statistical properties (such as independent and identical distribution, etc.), making it difficult to determine the values and accuracy of the estimated parameters in the model, making the model impossible to master. The adjustment ratio of the sentencing circumstances to the base sentence in different cases.

而对于机器学习和自然语言处理的相关技术和方法，通常需要大量的法律文本信息来训练具有强大泛化能力的模型，而目前公开的案卷（判决书文本）的数量还无法支持模型获得可靠输出的能力，导致难以通过训练来判定模型中估计参数的精度。For related technologies and methods of machine learning and natural language processing, a large amount of legal text information is usually required to train a model with strong generalization capabilities, and the number of currently public case files (judgment text) cannot support the model to obtain reliable output. ability, making it difficult to determine the accuracy of estimated parameters in the model through training.

发明内容Contents of the invention

为了解决上述背景技术中存在的技术问题，本发明提供一种基于非线性模型的量刑计算方法及系统，根据案卷文本中提取的有效信息，应用非线性饱和模型，运用贝叶斯嵌入和随机模拟与多阶段计算方法，解决有限数据样本下，得到量刑结果的过程中，参数估计的精度判定问题，克服了现有方法需要充分大数据样本量的局限性，同时还呈现了各量刑特征的影响随时间的变化趋势。In order to solve the technical problems existing in the above background technology, the present invention provides a sentencing calculation method and system based on a nonlinear model. Based on the effective information extracted from the case file text, the nonlinear saturation model is applied, and Bayesian embedding and random simulation are used. With the multi-stage calculation method, it solves the problem of determining the accuracy of parameter estimation in the process of obtaining sentencing results under limited data samples. It overcomes the limitation of existing methods that require sufficient big data sample size, and also shows the impact of various sentencing characteristics. Trends over time.

为了实现上述目的，本发明采用如下技术方案：In order to achieve the above objects, the present invention adopts the following technical solutions:

本发明的第一个方面提供一种基于非线性模型的量刑计算方法，包括以下步骤：A first aspect of the present invention provides a sentencing calculation method based on a nonlinear model, which includes the following steps:

获取案件描述中的文本数据，并预处理得到量刑因素特征；Obtain the text data in the case description and pre-process it to obtain the sentencing factor characteristics;

基于得到的量刑因素特征确定量刑起点，根据量刑因素特征确定量刑情节调节基准刑幅度的范围，得到量刑因素权重的大小；Determine the starting point of sentencing based on the characteristics of the sentencing factors obtained, determine the range of the sentencing circumstances to adjust the range of the benchmark penalty based on the characteristics of the sentencing factors, and obtain the weight of the sentencing factors;

估计非线性模型的噪声并确定噪声分布，基于量刑因素特征生成服从噪声分布的若干样本，经若干次模拟确定非线性模型参数估计值的误差界限；Estimate the noise of the nonlinear model and determine the noise distribution, generate a number of samples that obey the noise distribution based on the characteristics of the sentencing factors, and determine the error limits of the nonlinear model parameter estimates through several simulations;

在误差界限内，根据量刑因素权重的大小和参数估计值之间的对应关系，确定非线性模型中，量刑特征因素所对应的未知参数向量的估计值以及置信界限，得到模型输出的预测刑期作为裁决的参考基准。Within the error bound, based on the correspondence between the weights of sentencing factors and parameter estimates, determine the estimated values and confidence limits of the unknown parameter vector corresponding to the sentencing characteristic factors in the nonlinear model, and obtain the predicted sentence output of the model as Reference base for adjudication.

预处理包括：提取文本数据中，与量刑相关的特征字段并合并，得到结构化后的案件文本数据。Preprocessing includes: extracting sentencing-related feature fields from the text data and merging them to obtain structured case text data.

非线性模型为饱和非线性回归模型，根据案件类型确定饱和非线性回归模型的浮动上界和下界。The nonlinear model is a saturated nonlinear regression model, and the floating upper and lower bounds of the saturated nonlinear regression model are determined according to the case type.

估计非线性模型的噪声，具体为：基于最小二乘法获取非线性模型的估计噪声，利用估计噪声得到经验分布曲线，确定噪声的正态密度函数及方差大小。Estimating the noise of the nonlinear model specifically includes: obtaining the estimated noise of the nonlinear model based on the least squares method, using the estimated noise to obtain the empirical distribution curve, and determining the normal density function and variance of the noise.

确定量刑起点，具体为：将量刑区间若干等分，分别计算每一等分的精度及偏置项估计值，在保证计算精度满足设定值的前提下，使偏置项最小，确定量刑起点在量刑区间的位置。Determine the starting point of sentencing, specifically: Divide the sentencing interval into several equal parts, calculate the accuracy and estimated value of the bias term for each decimal respectively, and on the premise of ensuring that the calculation accuracy meets the set value, minimize the bias term and determine the starting point of sentencing. position within the sentencing range.

得到量刑因素权重的大小，具体为：基于多阶段随机拟牛顿自适应学习算法确定量刑因素权重的大小。Obtain the weight of the sentencing factors, specifically: determine the weight of the sentencing factors based on a multi-stage stochastic quasi-Newton adaptive learning algorithm.

经若干次模拟确定非线性模型参数估计值的误差界限，具体为：After several simulations, the error bounds of the estimated values of the nonlinear model parameters are determined, specifically:

获取预处理后的若干个文本数据作为样本，基于非线性模型得到多维输出观测集；Obtain several preprocessed text data as samples, and obtain a multidimensional output observation set based on the nonlinear model;

多维输出观测集基于多阶段随机拟牛顿自适应学习算法，分别得到对应次数模拟的参数估计；The multi-dimensional output observation set is based on the multi-stage stochastic quasi-Newton adaptive learning algorithm to obtain parameter estimates for the corresponding number of simulations;

根据某一维分量的参数估计误差和经验分布函数，确定对应的该参数估计误差至少以某一概率属于误差界限所在的区间。According to the parameter estimation error of a certain dimensional component and the empirical distribution function, it is determined that the corresponding parameter estimation error belongs to the interval where the error limit is at least with a certain probability.

本发明的第二个方面提供实现上述方法所需的系统，包括：A second aspect of the present invention provides a system required to implement the above method, including:

文本预处理模块，被配置为：获取案件描述中的文本数据，并预处理得到量刑因素特征；The text preprocessing module is configured to: obtain the text data in the case description, and preprocess to obtain the sentencing factor characteristics;

第一参数估计模块，被配置为：基于得到的量刑因素特征确定量刑起点，根据量刑因素特征确定量刑情节调节基准刑幅度的范围，得到量刑因素权重的大小；The first parameter estimation module is configured to: determine the sentencing starting point based on the obtained characteristics of the sentencing factors, determine the range of the sentencing circumstances to adjust the range of the benchmark punishment based on the characteristics of the sentencing factors, and obtain the weight of the sentencing factors;

第二参数估计模块，被配置为：估计非线性模型的噪声并确定噪声分布，基于量刑因素特征生成服从噪声分布的若干样本，经若干次模拟确定非线性模型参数估计值的误差界限；The second parameter estimation module is configured to: estimate the noise of the nonlinear model and determine the noise distribution, generate a number of samples that obey the noise distribution based on the characteristics of the sentencing factors, and determine the error limit of the nonlinear model parameter estimate through several simulations;

结果输出模块，被配置为：在误差界限内，根据量刑因素权重的大小和参数估计值之间的对应关系，确定非线性模型中，量刑特征因素所对应的未知参数向量的估计值以及置信界限，得到模型输出的预测刑期作为裁决的参考基准。The result output module is configured to: within the error limit, determine the estimated value and confidence limit of the unknown parameter vector corresponding to the sentencing characteristic factor in the nonlinear model based on the correspondence between the weight of the sentencing factor and the parameter estimate. , the predicted sentence output from the model is obtained as a reference benchmark for judgment.

本发明的第三个方面提供一种计算机可读存储介质。A third aspect of the invention provides a computer-readable storage medium.

一种计算机可读存储介质，其上存储有计算机程序，该程序被处理器执行时实现如上述所述的一种基于非线性模型的量刑计算方法中的步骤。A computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the steps in the sentencing calculation method based on a nonlinear model are implemented as described above.

本发明的第四个方面提供一种计算机设备。A fourth aspect of the invention provides a computer device.

一种计算机设备，包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序，所述处理器执行所述程序时实现如上述所述的一种基于非线性模型的量刑计算方法中的步骤。A computer device, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the program, it implements a sentencing calculation based on a nonlinear model as described above. steps in the method.

与现有技术相比，以上一个或多个技术方案存在以下有益效果：Compared with the existing technology, one or more of the above technical solutions have the following beneficial effects:

1、根据真实案卷文本中提取的有效信息，确定量刑的起点和量刑因素权重的大小，再根据生成的样本数据经多次模拟确定模型参数估计值的误差界限，通过量刑因素权重的大小和参数估计值之间的对应关系，能够得到非线性模型中量刑特征因素所对应的未知参数向量的值，使得模型在计算刑期时，能够在有限数据样本下，更好的掌握不同案件中量刑情节对基准刑的调节比例，从而更加适应实际的量刑场景。1. Determine the starting point of sentencing and the weight of sentencing factors based on the effective information extracted from the text of the real case file, and then determine the error limits of the model parameter estimates through multiple simulations based on the generated sample data. Through the size and parameters of the weight of sentencing factors The correspondence between the estimated values can obtain the value of the unknown parameter vector corresponding to the sentencing characteristic factors in the nonlinear model, so that when calculating the sentence, the model can better grasp the relationship between sentencing circumstances in different cases under limited data samples. Adjust the proportion of the base sentence to better adapt to actual sentencing scenarios.

2、每一个案件描述中的量刑场景均为独立的，不同案件中量刑情节对基准刑的调节比例均存在不同，也难以找到完全相同的多个量刑场景对模型进行训练，而根据真实案件描述中的文本数据给出的“权重估计”，再利用随机生成的数据给出的“权重估计”，利用两部分数据之间的对应关系得到估计的高概率置信界，使得模型在计算刑期时，能够在有限数据样本下，更好的掌握不同案件中量刑情节对基准刑的调节比例。2. The sentencing scenarios in each case description are independent. The adjustment ratio of the sentencing circumstances to the base penalty in different cases is different. It is also difficult to find multiple identical sentencing scenarios to train the model. However, based on real case descriptions The "weight estimate" given by the text data in the text data is then used to give the "weight estimate" given by the randomly generated data, and the correspondence between the two parts of data is used to obtain the estimated high probability confidence bound, so that the model can calculate the sentence period. With limited data samples, we can better understand the adjustment ratio of sentencing circumstances to the base sentence in different cases.

3、采用非线性饱和模型能够很好的适应量刑场景，能够对超过或低于相应法定刑区间的案件，将宣告刑限定在所需的法定刑区间内，以弥补传统线性模型的适用性局限，并且能够适应对小数据样本分析需求。3. The nonlinear saturation model can be well adapted to sentencing scenarios and can limit the announced sentence within the required statutory sentencing range for cases exceeding or falling below the corresponding statutory sentencing range to make up for the applicability limitations of the traditional linear model. , and can adapt to the analysis needs of small data samples.

4、能够对量刑特征进行可靠的估计，并呈现各量刑特征的影响随时间的变化趋势，有利于帮助司法工作人员分析并确定与案件对应的量刑特征因素，从而确保不同类型案件在输出预测结果时的可靠性。4. It can reliably estimate sentencing characteristics and present the changing trend of the influence of each sentencing characteristic over time, which is helpful to help judicial staff analyze and determine the sentencing characteristic factors corresponding to the case, thereby ensuring that different types of cases can output prediction results time reliability.

附图说明Description of the drawings

构成本发明的一部分的说明书附图用来提供对本发明的进一步理解，本发明的示意性实施例及其说明用于解释本发明，并不构成对本发明的不当限定。The description and drawings that constitute a part of the present invention are used to provide a further understanding of the present invention. The illustrative embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute an improper limitation of the present invention.

图1是本发明一个或多个实施例提供的量刑计算方法整体架构示意图；Figure 1 is a schematic diagram of the overall architecture of the sentencing calculation method provided by one or more embodiments of the present invention;

图2是本发明一个或多个实施例提供的S-模型与L-模型对重伤案件量刑计算精度的对比示意图；Figure 2 is a schematic diagram comparing the sentencing calculation accuracy of serious injury cases between the S-model and the L-model provided by one or more embodiments of the present invention;

图3是本发明一个或多个实施例提供的S-模型部分关注变量变化趋势示意图。Figure 3 is a schematic diagram of the change trend of the S-model part of the variables of interest provided by one or more embodiments of the present invention.

具体实施方式Detailed ways

下面结合附图与实施例对本发明作进一步说明。The present invention will be further described below in conjunction with the accompanying drawings and examples.

应该指出，以下详细说明都是示例性的，旨在对本发明提供进一步的说明。除非另有指明，本文使用的所有技术和科学术语具有与本发明所属技术领域的普通技术人员通常理解的相同含义。It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the present invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

量刑的基本步骤包含确定刑期起点、确定基准刑和确定宣告刑，刑期起点是根据案情描述中的基本事实所确定的刑期，基准刑则需要根据案件描述的过程中每一个发生的事实作为实际情况进行调整，从而在刑期起点的基础上增加基准刑形成宣告刑，而案情描述中能够影响基准刑刑期确定的参数则为量刑因素。The basic steps of sentencing include determining the starting point of the sentencing period, determining the base sentence and determining the declaratory sentence. The starting point of the sentencing period is the sentencing period determined based on the basic facts in the description of the case, while the base sentence needs to be based on every fact that occurred during the description of the case as the actual situation. Adjustments are made so that the benchmark penalty is added to the starting point of the sentence to form a declaratory sentence, and the parameters in the description of the case that can affect the determination of the benchmark sentence are sentencing factors.

正如背景技术中所描述的，利用量刑系统处理案卷文本信息输出参考的量刑结果时，存在以下问题：As described in the background art, when using the sentencing system to process case file text information and output sentencing results for reference, there are the following problems:

第一、模型的适用性方面，传统线性模型缺乏刑期区间的限制，具有适用性局限，而深度学习模型则对数据量有较大需求。First, in terms of the applicability of the model, the traditional linear model lacks the restriction of the sentence interval and has limited applicability, while the deep learning model has a large demand for data volume.

第二，计算方法方面，既有计算方法的理论要求数据具有较强的统计性假设，对有限数据样本计算的可靠性缺乏保证。Second, in terms of calculation methods, the theory of existing calculation methods requires that the data have strong statistical assumptions, and there is a lack of guarantee for the reliability of calculations with limited data samples.

因此，以下实施例给出一种基于非线性模型的量刑计算方法及系统，根据案卷文本中提取的有效信息，应用非线性饱和模型，运用贝叶斯嵌入和随机模拟与多阶段计算方法，解决有限数据样本下，得到量刑结果的过程中，参数估计的精度判定问题，克服了单纯统计方法需要充分大数据样本量的局限性，同时还呈现了各量刑特征的影响随时间的变化趋势。Therefore, the following embodiments provide a sentencing calculation method and system based on a nonlinear model. Based on the effective information extracted from the case file text, the nonlinear saturation model is applied, and Bayesian embedding, stochastic simulation and multi-stage calculation methods are used to solve the problem. Under limited data samples, in the process of obtaining sentencing results, the problem of determining the accuracy of parameter estimation overcomes the limitation of simple statistical methods that require sufficient big data sample size. It also shows the changing trend of the impact of various sentencing characteristics over time.

实施例一：Example 1:

如图1-图3所示，一种基于非线性模型的量刑计算方法，包括以下步骤：As shown in Figures 1-3, a sentencing calculation method based on a nonlinear model includes the following steps:

具体的：specific:

1 文本预处理1 Text preprocessing

步骤1-1：对于司法判决文书信息和案情描述文本，在合理切分和人工标注的基础上运用自然语言处理技术提取量刑相关特征字段，得到的自然特征作为影响量刑的要素。Step 1-1: For judicial judgment document information and case description text, natural language processing technology is used to extract sentencing-related feature fields based on reasonable segmentation and manual annotation. The obtained natural features are used as factors affecting sentencing.

步骤1-2：在此基础上，根据不同的计算目的，分别进行进一步的特征选择，具体的流程如下：Step 1-2: On this basis, further feature selection is performed according to different calculation purposes. The specific process is as follows:

步骤1-2-1：按照这些字段的法律属性归类（比如将“前科”与“因**受过刑事处罚”合并），得到缓刑、提出附带民事诉讼等若干个“自然特征”用于刑期计算。Step 1-2-1: Classify according to the legal attributes of these fields (for example, merging "previous record" with "received criminal punishment for rape"), and several "natural characteristics" such as obtaining a suspended sentence and filing ancillary civil lawsuits are used for the sentence. calculate.

步骤1-2-2：根据相关性对自然特征进行合并；某些自然特征是具有法律上的相关或相近性质的，这类特征可以合并。例如根据《刑法》第26条，“主犯”包括“首要分子”“一般主犯”“雇佣他人”等类型，因此，上述类型的三个自然特征可以合并为“主犯”。例如根据《刑法》第67条，“自首”包括“准自首”“主动自首”“劝说自首”等类型，因此，上述类型的三个自然特征可以合并为“自首”。Step 1-2-2: Merge natural features based on relevance; some natural features are legally relevant or similar, and such features can be merged. For example, according to Article 26 of the Criminal Law, "principal offenders" include "chief criminals", "general principal offenders", and "hiring others". Therefore, the three natural characteristics of the above types can be combined into "principal offenders". For example, according to Article 67 of the Criminal Law, "surrender" includes "quasi-surrender", "active surrender", "persuasion to surrender" and other types. Therefore, the three natural characteristics of the above types can be combined into "surrender".

本实施例根据文本中提取到的自然特征将其合并，并根据具体特征的分布情况，删除了部分过分稀疏的特征(例如，频次少于200，加入后不足以产生影响)。为进一步排除这些特征的影响，将含有这些特征的案件也相应删除。最终在建模中确定主要的特征因素。This embodiment combines the natural features extracted from the text, and deletes some features that are too sparse (for example, the frequency is less than 200, which is not enough to have an impact after adding them) according to the distribution of specific features. In order to further eliminate the influence of these characteristics, cases containing these characteristics were also deleted accordingly. Finally, the main characteristic factors are determined in the modeling.

2 适用饱和非线性回归模型2 Apply saturated nonlinear regression model

步骤2-1：结合量刑场景的共同特征，确定饱和非线性回归模型的浮动上界和下界：Step 2-1: Combine the common characteristics of sentencing scenarios to determine the floating upper and lower bounds of the saturated nonlinear regression model:

； ;

其中，饱和函数的具体定义如下：Among them, the specific definition of the saturation function is as follows:

； ;

模型中，y _t为第t个案件的有期徒刑刑期(单位：月)；案件按照判决时间的先后进行排序，同一天发生的案件可以随机排序，对计算结果没有实质性影响。In the model, y _t is the fixed-term imprisonment term of the t- th case (unit: month); the cases are sorted according to the time of judgment. Cases occurring on the same day can be sorted randomly, which has no substantial impact on the calculation results.

表示量刑起点(单位：月)；/>在故意伤害罪中分别表示第t个案件中决定刑罚量的轻伤人数、重伤人数、死亡人数（在假冒注册商标罪中分别表示第t个案件中的非法经营数额、违法所得数额、所涉及商标种类数，在集资诈骗罪中分别表示第t个案件中的诈骗金额等）； Indicates the starting point of sentencing (unit: month);/> In the crime of intentional injury, they respectively represent the number of minor injuries, the number of serious injuries, and the number of deaths that determine the amount of punishment in the t-th case (in the crime of counterfeiting registered trademarks, they respectively represent the amount of illegal business operations, the amount of illegal income, and the amount of illegal income involved in the t-th case. The number of trademark types, in the case of fund-raising fraud, respectively represents the amount of fraud in the t- th case, etc.);

b，c，d在故意伤害罪中分别代表每多造成一个轻伤、重伤、死亡所增加的刑期（在假冒注册商标罪和集资诈骗罪中分别表示相应因素的危害程度）；In the crime of intentional injury , b, c, and d respectively represent the increased sentence for each additional minor injury, serious injury, or death (in the crime of counterfeiting registered trademarks and the crime of fund-raising fraud, respectively, they represent the degree of harm of the corresponding factors);

表示所选取的量刑特征因素构成的回归向量；/>表示量刑特征因素所对应的未知参数向量，每一个分量体现了对应特征因素作用所占的百分比；e是建模的偏置项，代表可能未考虑到的其他量刑特征因素的综合影响；/>是可能存在的随机噪声；U _t是相应法定刑区间的上限，L _t是相应法定刑区间的下限，它们因案件性质的不同而不同，并且根据法定量刑情节的性质而浮动。 Represents the regression vector composed of selected sentencing characteristic factors;/> Represents the unknown parameter vector corresponding to the sentencing characteristic factors. Each component reflects the percentage of the corresponding characteristic factor; e is the bias term of the modeling, which represents the comprehensive impact of other sentencing characteristic factors that may not be considered;/> is the possible random noise; U _t is the upper limit of the corresponding statutory sentencing interval, and L _t is the lower limit of the corresponding statutory sentencing interval. They vary depending on the nature of the case and fluctuate according to the nature of the statutory sentencing circumstances.

3 计算分析方法3 Calculation and analysis methods

步骤3-1: 引入新的高维回归向量如下：Step 3-1: Introduce new high-dimensional regression vectors as follows:

； ;

相应地，定义未知参数向量：Accordingly, define the unknown parameter vector:

； ;

相应地，步骤2-1中的非线性随机模型可以转化为如下饱和模型：Correspondingly, the nonlinear stochastic model in step 2-1 can be transformed into the following saturated model:

； ;

其中，是已知输入向量，y _t是输出，θ是要估计的未知参数。通常根据实际的案件场景，可以给出参数所在的先验紧致凸集D。in, is the known input vector, y _t is the output, and θ is the unknown parameter to be estimated. Usually according to the actual case scenario, the prior compact convex set D where the parameters are located can be given.

参数θ的实际含义是“量刑情节对基准刑的调节比例”，也就是量刑因素的权重，即参数θ的真实取值，投影区间是参数θ的先验所述范围；而先验集D的范围对应法条中关于量刑情节调节基准刑幅度的具体规定。The actual meaning of the parameter θ is "the adjustment ratio of the sentencing circumstances to the base sentence", which is the weight of the sentencing factors, that is, the true value of the parameter θ . The projection interval is the range stated a priori of the parameter θ ; and the prior set D The scope corresponds to the specific provisions in the law regarding the adjustment of the base penalty range based on sentencing circumstances.

步骤3-2：为了解决具体量刑计算中需要模型噪声的分布函数问题，首先基于司法判决的数据，用最小二乘法对具体量刑模型的噪声进行估计。Step 3-2: In order to solve the problem of the distribution function of model noise required in specific sentencing calculations, first use the least squares method to estimate the noise of the specific sentencing model based on judicial judgment data.

具体的，仅选取非饱和区间中的样本，当第t个样本到达时，利用迭代最小二乘算法更新参数θ的估计值/>，再利用参数的估计值/>给出样本/>中的噪声估计/>，具体计算公式如下：Specifically, only samples in the non-saturated interval are selected. When the t-th sample Upon arrival, the iterative least squares algorithm is used to update the estimated value of parameter θ /> , and then use the estimated values of the parameters/> Give sample/> Noise estimation in/> , the specific calculation formula is as follows:

； ;

利用噪声的估计给出噪声的经验分布曲线，确定噪声的正态密度函数及方差大小，应用在具体的量刑计算过程中。Estimation using noise The empirical distribution curve of noise is given, the normal density function and variance of noise are determined, and applied in the specific sentencing calculation process.

步骤3-3：为了克服量刑模型中数学方程求解的不适定性困难，利用均匀分割法与预测精度比较法，具体确定量刑起点大小。将量刑区间六等分，分别取量刑起点为量刑区间的1/6，1/3，1/2，2/3，5/6处以及区间的最低点和最高点进行计算，得到7组不同的计算精度及偏置项估计值。Step 3-3: In order to overcome the ill-posed difficulty in solving mathematical equations in the sentencing model, the uniform segmentation method and the prediction accuracy comparison method are used to specifically determine the starting point of sentencing. Divide the sentencing interval into six equal parts, take the starting point of sentencing as 1/6, 1/3, 1/2, 2/3, and 5/6 of the sentencing interval and the lowest and highest points of the interval for calculation, and obtain 7 different groups. The calculation accuracy and bias term estimate.

量刑起点的选取原则是在保证较高计算精度的前提下，尽可能减小偏置项。本实施例通过实验最终确定将量刑起点选在相应量刑区间的1/3处。The principle of selecting the sentencing starting point is to reduce the offset term as much as possible while ensuring high calculation accuracy. In this embodiment, it is finally determined through experiments that the sentencing starting point is selected at 1/3 of the corresponding sentencing interval.

步骤3-4：利用多阶段（multi-stage）随机拟牛顿自适应学习方法（MSQN）计算量刑因素的权重大小。Step 3-4: Use the multi-stage stochastic quasi-Newton adaptive learning method (MSQN) to calculate the weight of the sentencing factors.

针对参数的先验紧致凸集D，及正定矩阵，引入投影算子投影算子/>，定义为：A priori compact convex set D for parameters, and a positive definite matrix , introduce the projection operator projection operator/> ,defined as:

； ;

其中，范数定义为/>利用上述投影算子。Among them, norm Defined as/> Use the above projection operator.

对每个时刻t，算法输入回归向量；模型输出y _t，正则化因子μ _j,t（1≤j≤K），算法阶段数K，参数投影先验集D，噪声分布/>, 算法初始值P _j,0（1≤j≤K），/>（1≤j≤K）。For each time t , the algorithm inputs the regression vector ;Model output y _t , regularization factor μ _j,t (1≤j≤K), algorithm stage number K, parameter projection prior set D, noise distribution/> , algorithm initial value P _j,0 (1≤j≤K),/> (1≤j≤K).

t时刻的迭代估计公式基于MSQN自适应学习方法得到，MSQN方法（多阶段随机拟牛顿自适应学习方法）自身为现有技术，本实施例不做过多赘述。The iterative estimation formula at time t is obtained based on the MSQN adaptive learning method. The MSQN method (multi-stage stochastic quasi-Newton adaptive learning method) itself is an existing technology and will not be described in detail in this embodiment.

具体到本实施例的量刑数据分析中，取K=3；通过步骤3-1中对数据噪声的估计，取噪声分布为均值为0，标准差为5的正态分布；Specifically in the sentencing data analysis of this embodiment, K=3 is taken; through the estimation of data noise in step 3-1, the noise distribution is taken as a normal distribution with a mean value of 0 and a standard deviation of 5;

此外，函数G_t(·)可以具体表达为：In addition, the function G _t (·) can be specifically expressed as:

G_t(x)=U_t+(L_t-x)F(L_t-x)-(U_t-x)F(U_t-x)+25[f(L_t-x)-f(U_t-x)]；G _t (x)=U _t +(L _t -x)F(L _t -x)-(U _t -x)F(U _t -x)+25[f(L _t -x)-f(U _t -x)];

导数G´_t(·)的表达式为：The expression of the derivative G´ _t (·) is:

G´_t(x)=F(U_t-x)-(L_t-x)；G´ _t (x)=F(U _t -x)-(L _t -x);

其中，F(·)及f(·)分别是正态分布N(0，25)的分布函数及概率密度函数；Among them, F(·) and f(·) are the distribution function and probability density function of the normal distribution N(0, 25) respectively;

特征因素“重伤人数”对应参数的投影区间为[0,40]，特征因素“轻伤人数”对应参数的投影区间为[0,10]，偏置项投影区间为[-1,1]，其余特征中增刑特征因素(如：持械)对应参数的投影区间为[-0.1,1]，减刑特征因素(如：自首)对应参数的投影区间为[-1,0.1]；μ _j,t=25，1≤j≤3；对1≤j≤3初始值P_j,0=I，/>，n为样本量。The projection interval of the parameter corresponding to the characteristic factor "number of serious injuries" is [0,40], the projection interval of the parameter corresponding to the characteristic factor "number of minor injuries" is [0,10], and the projection interval of the offset term is [-1,1]. Among the remaining features, the projection interval of the corresponding parameters for the penalty-increasing characteristic factors (such as holding weapons) is [-0.1,1], and the projection interval of the corresponding parameters for the penalty-reducing characteristic factors (such as surrender) is [-1,0.1]; μ _{j, t} =25, 1≤j≤3; for 1≤j≤3 Initial value P _j,0 =I,/> , n is the sample size.

本实施例中的多阶段方法（MSQN）不同于传统的线性最小二乘（RLS）方法，是针对具有饱和性质的量刑系统所设计。该算法不需要数据组成的回归向量满足独立同分布等传统上难以满足的统计假设，更加适合司法判决这一类复杂文本信息的特性，为辅助司法工作人员判断各个量刑情节对刑期判决的实际影响提供计算支撑。The multi-stage method (MSQN) in this embodiment is different from the traditional linear least squares (RLS) method and is designed for sentencing systems with saturated properties. This algorithm does not require the regression vector composed of data to satisfy traditionally difficult statistical assumptions such as independent and identical distribution. It is more suitable for the characteristics of complex text information such as judicial decisions, and can assist judicial staff in judging the actual impact of various sentencing circumstances on sentencing decisions. Provide calculation support.

步骤3-5：综合利用贝叶斯嵌入、随机模拟与多阶段计算方法，给出有限量刑数据样本下的参数估计精度（高概率置信界），克服了单纯统计方法需要充分大数据样本量的局限。包括以下步骤：Step 3-5: Comprehensive use of Bayesian embedding, stochastic simulation and multi-stage calculation methods to provide parameter estimation accuracy (high probability confidence bound) under limited sentencing data samples, overcoming the problem that pure statistical methods require sufficient large data sample size limitations. Includes the following steps:

步骤3-5-1：独立同分布抽取N个样本，服从分布/>，其中U是参数先验集D上的均匀分布，F是服从正态分布的噪声（利用步骤3-2得到的噪声估计分布，这里的样本利用F这个噪声分布生成，不是从原本数据中抽取的）。Step 3-5-1: Extract N samples from independent and identical distributions , obey the distribution/> , where U is the uniform distribution on the parameter prior set D, and F is the noise obeying the normal distribution (using the noise estimation distribution obtained in step 3-2, the sample here is generated using the noise distribution F, not extracted from the original data of).

步骤3-5-2：利用N个样本{X₁,X₂,...X_N}，特征以及饱和非线性回归模型形式，生成n维输出观测集{Y₁,Y₂,...Y_N}。Step 3-5-2: Using N samples {X ₁ , X ₂ ,...X _N }, features And the saturated nonlinear regression model form generates an n-dimensional output observation set {Y ₁ , Y ₂ ,...Y _N }.

步骤3-5-3：利用MSQN方法分别给出N次模拟情形下的参数估计，并计算参数第j维分量（j=1，...m）的参数估计误差，再利用如下公式计算/>的经验分布函数：Step 3-5-3: Use the MSQN method to give parameter estimates for N simulations, and calculate the parameter estimation error of the j-th dimension component of the parameter (j=1,...m) , and then use the following formula to calculate/> The empirical distribution function of:

； ;

步骤3-5-4：对任意并满足/>利用MSQN计算出的第j维参数估计误差/>至少以概率/>属于如下区间：Step 3-5-4: For any and satisfy/> The j-th dimension parameter estimation error calculated using MSQN/> At least with probability/> Belongs to the following range:

； ;

其中，是经验分布/>的/>分位数。in, is the empirical distribution/> of/> Quantile.

步骤3-6：利用步骤3-1中与/>的对应关系，给出原模型中/>的估计及估计精度（高概率置信界）。Step 3-6: Use step 3-1 with/> The corresponding relationship is given in the original model/> The estimation and estimation accuracy (high probability confidence bound).

步骤3-4是根据真实案件描述中的文本数据给出的“权重估计”，这里是利用随机生成的数据给出的“权重估计”，用于得到估计的高概率置信界，使得模型在计算刑期时，能够在有限数据样本下，更好的掌握不同案件中量刑情节对基准刑的调节比例。Steps 3-4 are "weight estimates" given based on text data in real case descriptions. Here are "weight estimates" given using randomly generated data, which are used to obtain estimated high-probability confidence bounds so that the model can calculate When it comes to sentencing, it is possible to better understand the adjustment ratio of sentencing circumstances to the base sentence in different cases under limited data samples.

4 验证4 Verification

本实施例以故意伤害罪为例进行刑期预测和因素分析，样本取自2011年1月至2021年6月期间公开的故意伤害罪初审判决数据，共计19.959万。This example uses the crime of intentional injury as an example to conduct sentence prediction and factor analysis. The samples are taken from the first trial judgment data of the crime of intentional injury published between January 2011 and June 2021, totaling 199,590.

步骤4-1：基于步骤3-3中的非线性递推辨识算法，依据实际的判决文书信息所提取出的结构化数据可以得到具体的参数估计值及其误差界，以S-模型为本实施例提出非线性饱和模型，L-模型为传统的线性模型。Step 4-1: Based on the nonlinear recursive identification algorithm in step 3-3, specific parameter estimates and their error bounds can be obtained based on the structured data extracted from the actual judgment document information, based on the S-model. The embodiment proposes a nonlinear saturation model, and the L-model is a traditional linear model.

步骤4-2：利用本实施例提出的非线性饱和模型（MSQN algorithm，简称S-模型）以及步骤4-1中计算得到的参数估计值可以对刑期进行计算，并与传统的线性模型(RLSalgorithm，简称L-模型)的量刑计算精度进行比较，具体结果如表1所示：Step 4-2: Use the nonlinear saturation model (MSQN algorithm, referred to as S-model) proposed in this embodiment and the parameter estimates calculated in step 4-1 to calculate the sentence period, and compare it with the traditional linear model (RLS algorithm , referred to as L-model), the specific results are shown in Table 1:

表1：S-模型与L-模型计算精度对比Table 1: Comparison of calculation accuracy between S-model and L-model

其中的量刑计算精度定义为相对计算误差的平均值（Prediction accuracy预测精度）。可以看出，针对故意伤害致人重伤案件的刑期计算，S-模型与L-模型相比，精度得到了提高了。The sentencing calculation accuracy is defined as the average of the relative calculation errors (Prediction accuracy). It can be seen that the S-model has improved accuracy compared with the L-model in calculating the sentence for intentional injury causing serious injury.

图2展示了2011年到2021年重伤案件刑期计算精度的变化趋势。可以看出，S-模型的计算精度始终远高于L-模型的计算精度。Figure 2 shows the changing trend in the accuracy of sentencing calculations for serious injury cases from 2011 to 2021. It can be seen that the calculation accuracy of the S-model is always much higher than that of the L-model.

步骤4-3：此外，利用所提出的MSQN算法还可以呈现量刑特征要素的影响随时间的变化趋势，进而发现司法的变化规律及其背后的法治变迁轨迹。通过分析发现，建模中的大多数量刑特征因素随时间变化的表现平稳，但也存在个别特征变化明显，比如偏置项和认罪认罚两个特征出现了随时间的明显变化，具体变化如图3所示。Step 4-3: In addition, the proposed MSQN algorithm can also be used to present the changing trend of the influence of sentencing characteristic elements over time, and then discover the changing rules of justice and the trajectory of changes in the rule of law behind it. Through analysis, it was found that most of the sentencing characteristic factors in the modeling showed stable changes over time, but there were also individual features that changed significantly. For example, the two features of bias items and confession and punishment showed significant changes over time. The specific changes are as shown in the figure 3 shown.

本实施例依据法律文本中提取的量刑模式，依据可靠的文书信息使量刑系统能够辅助司法判决，向司法工作人员输出供参考的量刑结果。This embodiment enables the sentencing system to assist judicial decisions based on sentencing patterns extracted from legal texts and reliable document information, and output sentencing results to judicial staff for reference.

在模型的适用性方面，应用具有精准性和可解释性的非线性饱和模型。该模型能够很好的适应量刑场景，确保对超过或低于相应法定刑区间的案件的宣告刑限定在法定刑区间内，以弥补传统线性模型的适用性局限，并且能够适应对小数据样本分析需求。In terms of model applicability, a nonlinear saturated model with accuracy and interpretability is applied. This model can adapt well to sentencing scenarios, ensuring that the sentences for cases exceeding or falling below the corresponding legal sentencing range are limited to the statutory sentencing range to make up for the applicability limitations of the traditional linear model, and can be adapted to the analysis of small data samples need.

在计算方法方面，考虑到司法判决文本这一类复杂社会数据远不满足独立同分布等传统的数据假设，提出自适应方法可以在较弱的数据条件下建立理论保证。In terms of calculation methods, considering that complex social data such as judicial judgment texts are far from satisfying traditional data assumptions such as independent and identical distribution, adaptive methods are proposed to establish theoretical guarantees under weaker data conditions.

在精度保证方面，从理论上给出有限数据样本情形参数估计的可靠性保证，以准确界定量刑情节的实际作用大小。In terms of accuracy guarantee, the reliability guarantee of parameter estimation in limited data sample situations is theoretically provided to accurately define the actual effect of sentencing circumstances.

在计算效果方面。相较于传统线性回归模型，所建立的非线性模型和相应的新计算方法可以根据给定的案情描述给出具有可解释性的刑期结果，且刑期计算的准确度得到了提升（以故意伤害罪为例），可以为法官提供更有参考价值的量刑建议结果。In terms of computational effects. Compared with the traditional linear regression model, the established nonlinear model and the corresponding new calculation method can provide interpretable sentencing results based on the given case description, and the accuracy of the sentencing calculation has been improved (with intentional injury) crime as an example), which can provide the judge with more valuable sentencing recommendations.

实施例二：Example 2:

实现上述方法的系统，包括：Systems that implement the above methods include:

实施例三：Embodiment three:

本实施例提供了一种计算机可读存储介质，其上存储有计算机程序，该程序被处理器执行时实现如上述实施例一所述的一种基于非线性模型的量刑计算方法中的步骤。This embodiment provides a computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the steps of a sentencing calculation method based on a nonlinear model as described in the first embodiment are implemented.

实施例四：Embodiment 4:

本实施例提供了一种计算机设备，包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序，所述处理器执行所述程序时实现如上述实施例一所述的一种基于非线性模型的量刑计算方法中的步骤。This embodiment provides a computer device, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the program, it implements the method described in the first embodiment above. The steps in a sentencing calculation method based on a nonlinear model.

以上实施例二至四中涉及的各步骤或网络与实施例一相对应，具体实施方式可参见实施例一的相关说明部分。术语“计算机可读存储介质”应该理解为包括一个或多个指令集的单个介质或多个介质；还应当被理解为包括任何介质，所述任何介质能够存储、编码或承载用于由处理器执行的指令集并使处理器执行本发明中的任一方法。Each step or network involved in the above Embodiments 2 to 4 corresponds to Embodiment 1. For specific implementation details, please refer to the relevant description of Embodiment 1. The term "computer-readable storage medium" shall be understood to include a single medium or multiple media that includes one or more sets of instructions; and shall also be understood to include any medium capable of storing, encoding, or carrying instructions for use by a processor. The executed instruction set causes the processor to perform any method in the present invention.

以上所述仅为本发明的优选实施例而已，并不用于限制本发明，对于本领域的技术人员来说，本发明可以有各种更改和变化。凡在本发明的精神和原则之内，所作的任何修改、等同替换、改进等，均应包含在本发明的保护范围之内。The above descriptions are only preferred embodiments of the present invention and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection scope of the present invention.

Claims

1. A sentencing calculation method based on a nonlinear model, which is characterized by including the following steps:

Obtain the text data in the case description and pre-process it to obtain the sentencing factor characteristics;

Determine the starting point of sentencing based on the characteristics of the sentencing factors obtained, determine the range of the sentencing circumstances to adjust the range of the benchmark penalty based on the characteristics of the sentencing factors, and obtain the weight of the sentencing factors;

Estimate the noise of the nonlinear model and determine the noise distribution, generate a number of samples that obey the noise distribution based on the characteristics of the sentencing factors, and determine the error limits of the nonlinear model parameter estimates through several simulations;

Within the error bound, based on the correspondence between the weights of sentencing factors and parameter estimates, determine the estimated values and confidence limits of the unknown parameter vector corresponding to the sentencing characteristic factors in the nonlinear model, and obtain the predicted sentence output of the model as Reference base for adjudication.

2. A sentencing calculation method based on a nonlinear model according to claim 1, characterized in that the preprocessing includes: extracting and merging characteristic fields related to sentencing in the text data to obtain structured case text data. .

3. A sentencing calculation method based on a nonlinear model as claimed in claim 1, characterized in that the nonlinear model is a saturated nonlinear regression model, and the floating upper and lower bounds of the saturated nonlinear regression model are determined according to the case type.

4. A sentencing calculation method based on a nonlinear model as claimed in claim 1, characterized in that the noise of the nonlinear model is estimated, specifically: obtaining the estimated noise of the nonlinear model based on the least squares method, and using the estimated noise to obtain The empirical distribution curve determines the normal density function and variance of the noise.

5. A sentencing calculation method based on a nonlinear model as claimed in claim 1, characterized in that determining the starting point of sentencing is specifically: dividing the sentencing interval into several equal parts, and calculating the accuracy and offset terms of each equal part respectively. The estimated value, on the premise of ensuring that the calculation accuracy meets the set value, minimizes the offset term and determines the position of the sentencing starting point in the sentencing interval.

6. A sentencing calculation method based on a nonlinear model as claimed in claim 1, characterized in that the weight of the sentencing factors is obtained, specifically: determining the weight of the sentencing factors based on a multi-stage random quasi-Newton adaptive learning algorithm. .

7. A sentencing calculation method based on a nonlinear model as claimed in claim 1, characterized in that the error limits of the nonlinear model parameter estimates are determined through several simulations, including:

Several preprocessed text data are obtained as samples, and a multidimensional output observation set is obtained based on the nonlinear model.

8. A sentencing calculation method based on a nonlinear model as claimed in claim 7, characterized in that the error limit of the nonlinear model parameter estimate is determined through several simulations, and further includes:

The multi-dimensional output observation set is based on a multi-stage stochastic quasi-Newton adaptive learning algorithm to obtain parameter estimates for the corresponding number of simulations.

9. A sentencing calculation method based on a nonlinear model as claimed in claim 8, characterized in that the error limit of the nonlinear model parameter estimate is determined through several simulations, and further includes:

According to the parameter estimation error of a certain dimensional component and the empirical distribution function, it is determined that the corresponding parameter estimation error belongs to the interval where the error limit is at least with a certain probability.

10. A sentencing calculation system based on a nonlinear model, characterized by:

The text preprocessing module is configured to: obtain the text data in the case description, and preprocess to obtain the sentencing factor characteristics;

The first parameter estimation module is configured to: determine the starting point of sentencing based on the obtained characteristics of sentencing factors, determine the range of the sentencing circumstances to adjust the range of the benchmark punishment based on the characteristics of sentencing factors, and obtain the weight of sentencing factors;

The second parameter estimation module is configured to: estimate the noise of the nonlinear model and determine the noise distribution, generate a number of samples that obey the noise distribution based on the characteristics of the sentencing factors, and determine the error limit of the nonlinear model parameter estimate through several simulations;

The result output module is configured to: within the error limit, determine the estimated value and confidence limit of the unknown parameter vector corresponding to the sentencing characteristic factor in the nonlinear model based on the correspondence between the weight of the sentencing factor and the parameter estimate. , the predicted sentence output from the model is obtained as a reference benchmark for judgment.