CN112016314A - Medical text understanding method and system based on BERT model - Google Patents

Medical text understanding method and system based on BERT model

Info

Publication number
CN112016314A
CN112016314A (application CN202010977191.2A)
Authority
CN
China
Prior art keywords
medical text
medical
word
text
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202010977191.2A
Other languages
Chinese (zh)
Inventor
汪秀英 (Wang Xiuying)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN202010977191.2A
Publication of CN112016314A
Legal status: Withdrawn

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 40/00: Handling natural language data
    • G06F 40/20: Natural language analysis
    • G06F 40/279: Recognition of textual entities
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30: Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F 16/33: Querying
    • G06F 16/335: Filtering based on additional data, e.g. user or group profiles
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F 40/00: Handling natural language data
    • G06F 40/30: Semantic analysis
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00: Computing arrangements based on biological models
    • G06N 3/02: Neural networks
    • G06N 3/04: Architecture, e.g. interconnection topology
    • G06N 3/044: Recurrent networks, e.g. Hopfield networks
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00: Computing arrangements based on biological models
    • G06N 3/02: Neural networks
    • G06N 3/04: Architecture, e.g. interconnection topology
    • G06N 3/045: Combinations of networks
    • G: PHYSICS
    • G16: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H: HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H 50/00: ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H 50/70: ICT specially adapted for medical diagnosis, medical simulation or medical data mining, for mining of medical data, e.g. analysing previous cases of other patients

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Pathology (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention relates to the technical field of text processing and discloses a medical text understanding method based on a BERT model, which comprises the following steps: acquiring medical text data and filtering out invalid medical text data with a sentence filtering model; generating large-scale medical text data from the filtered data with a text-copy-based medical text generation model; training a medical text entity recognition model on the generated large-scale medical-field text data; performing entity recognition on the medical text to be processed with the trained medical text entity recognition model; extracting the semantics of the medical text entities with an attention-based information extraction method to obtain their semantic features; and, according to the semantic features of the medical text entities, understanding the medical text with a multilayer perceptron. The invention also provides a BERT model-based medical text understanding system, thereby realizing the understanding of medical texts.

Description

Medical text understanding method and system based on BERT model
Technical Field
The invention relates to the technical field of text processing, in particular to a method and a system for medical text understanding based on a BERT model.
Background
As living standards rise, people pay increasing attention to their health, and their expectations of medical services grow accordingly. Existing medical services, constrained by resources, management, and other factors, struggle to meet this ever-increasing demand. Intelligent healthcare is therefore becoming more and more important, and making full use of the knowledge in medical texts can accelerate its progress.
At present, research on text understanding in the medical field is limited. Traditional neural-network-based named entity recognition models need a large amount of labeled training data; however, the terminology in medical-field data is highly specialized and expensive to label, so accurately labeled data are scarce and no large-scale medical-field text data set is available. At the same time, because doctors' writing habits vary widely, current entity recognition models find it difficult to classify entities from context and to recognize medical entities.
In view of this, how to acquire a large-scale medical text data set and construct a medical entity recognition model that can be applied effectively in the medical field, so that the recognized medical entity information can be used for medical text understanding, is a problem that those skilled in the art need to solve.
Disclosure of Invention
The invention provides a BERT model-based medical text understanding method: large-scale medical-field text data are generated with a text-copy-based medical text generation technique, and a medical text entity recognition model is trained on the generated data, so that entity recognition can be performed on the medical text to be processed with the trained model; the semantics of the medical text entities are then extracted with a rule-based information extraction method, and the medical text is understood according to the extracted semantic information.
In order to achieve the above object, the present invention provides a medical text understanding method based on a BERT model, including:
acquiring medical text data, and filtering invalid medical text data by using a sentence filtering model;
according to the filtered medical text data, generating large-scale medical text data by using a medical text generation model based on text copy;
training a medical text entity recognition model by using the generated large-scale medical field text data;
performing entity recognition on the medical text to be processed by using the trained medical text entity recognition model;
semantic extraction is carried out on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity;
and, according to the semantic features of the medical text entities, understanding the medical text with a multilayer perceptron.
Optionally, the filtering out invalid medical text data by using a sentence filtering model includes:
the sentence filtering model is a BERT-based self-attention mechanism model; the process of filtering invalid medical text data by using the sentence filtering model comprises the following steps:
1) adding a [CLS] token before the input word sequence and a [SEP] token after it, converting the input word sequence into the corresponding Token Embeddings, and computing the Position Embedding of each word; the two embeddings of each word are summed to obtain the input embedding code;
2) the attention weight α of the input sequence vectors is derived with a global attention matrix:

α = softmax(W·T)

wherein:
W is the global attention matrix, used to help the model capture the information in the input-sequence representation that is most important for classification;
T is the matrix of BERT word vectors;
3) multiplying each attention weight by the BERT word vector produced by the word vector coding layer, and summing, to obtain the attention representation of the input sequence:

attention = Σ_i α_i·T_i

wherein:
T_i is the i-th BERT word vector;
α_i is the attention weight of the i-th BERT word vector;
4) outputting the sentence filtering result through the parameter matrix of the multilayer perceptron:

Output = sigmoid(W_0·attention)

wherein:
W_0 is the parameter matrix of the multilayer perceptron.
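The filtering steps above can be sketched in a few lines of pure Python (the function names, dimensions, and the 0.5 threshold are illustrative assumptions, not from the patent; a real implementation would obtain the word vectors T from a BERT encoder):

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def sentence_filter(word_vectors, w_attn, w_out, threshold=0.5):
    """alpha = softmax(W.T), attention = sum_i alpha_i * T_i,
    Output = sigmoid(W_0 . attention); sentences scoring below the
    threshold are filtered out."""
    # one attention score per word vector: W . T_i
    scores = [sum(w * t for w, t in zip(w_attn, tv)) for tv in word_vectors]
    alpha = softmax(scores)
    # attention representation: weighted sum of the word vectors
    dim = len(word_vectors[0])
    attention = [sum(alpha[i] * word_vectors[i][d] for i in range(len(word_vectors)))
                 for d in range(dim)]
    # sigmoid readout with the perceptron parameter vector W_0
    logit = sum(w * a for w, a in zip(w_out, attention))
    prob = 1.0 / (1.0 + math.exp(-logit))
    return prob >= threshold
```

The weighted-sum pooling is the only nonstandard piece; everything else is a plain logistic readout.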
Optionally, the generating of large-scale medical text data by using a medical text generation model based on text copy includes:
1) introducing a latent variable z_t to control whether, at each decoding step, the model generates a word from the vocabulary or copies the word to be generated from the text: z_t = 0 means the decoder generates a word from the vocabulary at the current step, and z_t = 1 means the decoder copies a word from the input text D at the current step;
2) generating the medical text with a decoder, where the probability of the decoder producing the t-th word is:

P(y_t | y_<t, D) = P(z_t = 0 | S)·P_gen(y_t | S) + P(z_t = 1 | S)·P_copy(y_t | S, D)

wherein:
D is the text of the sentence filtering result;
S is the text word vector;
y_t is the t-th generated word;
z_t is the latent variable that controls whether, at each decoding step, the model generates a word from the vocabulary or copies the word to be generated from the text: z_t = 0 means the decoder generates a word from the vocabulary at the current step, and z_t = 1 means the decoder copies a word from the input text D at the current step.
Optionally, the process of training the medical text entity recognition model on the generated large-scale medical-field text data includes:
1) with a bidirectional masked language model, randomly marking input tokens with [MASK] and predicting the masked tokens from their context;
2) randomly selecting two sentences from the medical text data and, if their [MASK] marks are labeled as context marks, treating one sentence as the next sentence of the other;
3) repeating the above steps until 30% of the medical text data is marked with [MASK].
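The random [MASK] marking in step 1) can be sketched as follows (a minimal illustration with the 30% masking ratio; how tokens are batched and predicted is not specified in the patent):

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, mask_ratio=0.3, seed=0):
    """Randomly replace mask_ratio of the tokens with [MASK]; return the
    masked sequence plus a map from masked positions to the original
    tokens (these originals are the prediction targets)."""
    rng = random.Random(seed)
    n_mask = max(1, round(len(tokens) * mask_ratio))
    positions = rng.sample(range(len(tokens)), n_mask)
    masked = list(tokens)
    targets = {}
    for p in positions:
        targets[p] = masked[p]   # remember what the model must predict
        masked[p] = MASK
    return masked, targets
```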
Optionally, performing entity recognition on the medical text with the trained medical text entity recognition model includes:
1) segmenting the large-scale medical-field text data with the jieba word segmentation tool;
2) computing the frequency of each word in the segmentation result, replacing high-frequency segmented words with shorter characters, and introducing a word boundary symbol that joins the divided word groups so that the original word order is kept unchanged;
3) checking whether the user-defined rules have been updated and, once the rules are confirmed to be the latest, extracting the specialized medical terms covered by the rules;
4) recognizing medical text entities with the BERT pre-trained semantic model;
5) combining 3) and 4) so as to retain as much semantic information as possible.
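Step 2) — replacing high-frequency segmented words with short placeholders and joining segments with the boundary symbol — can be sketched as below. This is a hypothetical illustration: real input would come from jieba, and the "#k" placeholder scheme (assumed not to collide with real words) is our own stand-in for the patent's "smaller characters":

```python
from collections import Counter
from itertools import count

def compress_segments(segments, min_freq=2):
    """Replace words occurring at least min_freq times with short
    placeholder tokens and join everything with the boundary symbol "_",
    so the original word order stays explicit and recoverable."""
    freq = Counter(segments)
    ids = count()
    placeholders = {w: f"#{next(ids)}"
                    for w, n in freq.most_common() if n >= min_freq}
    encoded = "_".join(placeholders.get(w, w) for w in segments)
    return encoded, placeholders

def restore(encoded, placeholders):
    """Invert the compression: map placeholders back, drop boundaries."""
    inverse = {v: k for k, v in placeholders.items()}
    return "".join(inverse.get(tok, tok) for tok in encoded.split("_"))
```

The round trip through `restore` demonstrates the "recover the original text without ambiguity" property claimed later in the description.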
Optionally, the semantic extraction of the medical text entity by using the attention-based information extraction method includes:
1) inputting the medical text entity into a CNN to obtain a word vector representation, and feeding that representation into a two-layer Highway Network to obtain the vector representation:

C = Highway(CNN(w_i))

wherein w_i is the i-th word in the medical text entity, w_i = {c_1, c_2, ..., c_n}, and c_k is the k-th character of w_i;
2) bidirectional context coding is carried out on the word vector representation in the medical text entity by utilizing a context coding layer:
H=BiLSTM(C)
U=BiLSTM(C)
wherein:
H and U are the two context coding results;
C is the word vector representation of the medical text entity;
3) computing the similarity matrix S of the context coding results:

S_tj = sim(H_:t, U_:j)

wherein:
H_:t is the t-th column vector of H;
U_:j is the j-th column vector of U;
sim is the cosine similarity measure;
4) calculating an attention weight vector G for the medical text entity:
G = softmax(S_:t)

wherein:
S_:t is the t-th column of the similarity matrix S;
5) outputting the semantic features of the medical text with a BiLSTM model:

M = BiLSTM(G)

wherein:
M is the semantic feature of the medical text.
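Steps 3) and 4) can be sketched as follows, treating H and U as lists of column vectors (a minimal pure-Python illustration; in the model, H and U would come from the BiLSTM context coding layers):

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors (0.0 for zero vectors)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def row_softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention_over_entities(H, U):
    """S[t][j] = sim(H_:t, U_:j) using cosine similarity; each row of S
    is then softmax-normalized to give the attention weights G."""
    S = [[cosine(h, u) for u in U] for h in H]
    G = [row_softmax(row) for row in S]
    return S, G
```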
Optionally, the understanding of the medical text by using the multi-layer perceptron comprises:
the medical text is understood with a multilayer perceptron, taking the understanding result y with the highest probability as the perceptron's output; the specific process is as follows:
P(y|M)=σ(MLP(M))
wherein:
M is the semantic feature of the medical text entity;
y is the medical text understanding result;
σ is the sigmoid function;
MLP is a perceptron consisting of two linear transformations with a nonlinear ReLU activation function;
cross entropy is used as the loss function for training the multilayer perceptron:

Loss = -(1/N) Σ_{i=1}^{N} [ y_i·log(p_i) + (1 - y_i)·log(1 - p_i) ]

wherein:
N is the total number of training samples;
p_i is the probability predicted for the i-th sample;
during training, the whole model is optimized with a stochastic gradient descent optimizer.
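A minimal sketch of the forward pass P(y|M) = σ(MLP(M)) and the cross-entropy loss (layer sizes and the parameter layout are illustrative assumptions; real training would also need the gradient updates that the SGD optimizer applies):

```python
import math

def relu(x):
    return max(0.0, x)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def mlp_predict(m, W1, b1, w2, b2):
    """Two linear layers with a ReLU in between; the final sigmoid turns
    the score into a probability P(y | M)."""
    hidden = [relu(sum(w * x for w, x in zip(row, m)) + b)
              for row, b in zip(W1, b1)]
    return sigmoid(sum(w * h for w, h in zip(w2, hidden)) + b2)

def cross_entropy(y_true, y_prob, eps=1e-12):
    """Binary cross-entropy averaged over the training samples."""
    n = len(y_true)
    return -sum(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
                for y, p in zip(y_true, y_prob)) / n
```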
In addition, to achieve the above object, the present invention also provides a medical text understanding system based on a BERT model, the system comprising:
medical text generation means for generating large-scale medical text data using a medical text generation model based on a text copy;
the medical text processor is used for training a medical text entity recognition model by utilizing the generated large-scale medical field text data and carrying out entity recognition on a medical text to be processed by utilizing the trained medical text entity recognition model; meanwhile, semantic extraction is carried out on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity;
and the medical text understanding device, which is used for understanding the medical text with a multilayer perceptron.
In addition, to achieve the above object, the present invention also provides a computer-readable storage medium storing medical text understanding program instructions executable by one or more processors to implement the steps of the BERT model-based medical text understanding method described above.
Compared with the prior art, the invention provides a BERT model-based medical text understanding method, which has the following advantages:
Firstly, because existing text understanding research in the medical field is limited, and the terminology of medical-field data is highly specialized and expensive to label, accurately labeled data are scarce and a large-scale medical-field text data set is lacking. The invention therefore proposes a text-copy-based medical text generation model for generating medical text data, introducing a latent variable z_t to control whether, at each decoding step, the model generates a word from the vocabulary or copies the word to be generated from the text. Usually z_t is set to 0, indicating that the decoder generates a word from the vocabulary at the current step; when a medical-field proper noun is encountered, z_t is set to 1, indicating that the decoder copies a word from the input text D at the current step. By introducing this copy mechanism, the model can, when generating large-scale medical text data, copy certain medical-field special words from the input text D directly into the generated data. This alleviates the difficulty of generating sparse words such as proper names and yields a large-scale medical text data set for subsequent medical text understanding.
Meanwhile, because traditional rule-based methods have poor robustness and portability, the invention proposes an information extraction method that fuses rules with named entity recognition semantics. First, building on traditional jieba-based word segmentation, the invention computes the frequency of each word in the segmentation result and replaces high-frequency segmented words with shorter characters; for example, if the word "doctor" has a high frequency in the jieba segmentation result, it is replaced with the character Y. This splits the segmentation result into smaller units and effectively reduces the size of the segmentation dictionary. A word boundary symbol "_" is then introduced to join the divided words while keeping the original word order unchanged, so that the algorithm can recover the original text without ambiguity. Second, for terms in the medical domain, problems such as non-standard expressions by some doctors or newly coined medical vocabulary can degrade the extraction of real information. To reduce this influence, the invention adds a rule-update operation to the information extraction method, where the rules are patch files such as regular expressions and mapping tables; simple custom rules for medical names can be added automatically, so that medical term information is extracted effectively.
Drawings
Fig. 1 is a schematic flowchart of a medical text understanding method based on a BERT model according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of a medical text understanding system based on a BERT model according to an embodiment of the present invention;
the implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Large-scale medical-field text data are generated with a text-copy-based medical text generation technique, and a medical text entity recognition model is trained on the generated data, so that entity recognition can be performed on the medical text to be processed with the trained model; the semantics of the medical text entities are then extracted with a rule-based information extraction method, and the medical text is understood according to the extracted semantic information. Referring to fig. 1, a schematic diagram of a medical text understanding method based on a BERT model according to an embodiment of the present invention is provided.
In this embodiment, the medical text understanding method based on the BERT model includes:
s1, medical text data are obtained, invalid medical text data are filtered out through a sentence filtering model, and large-scale medical text data are generated through a medical text generation model based on text copying.
Firstly, acquiring a large amount of medical text data, and filtering invalid medical text data by using a sentence filtering model;
the sentence filtering model is a BERT-based self-attention mechanism model; the process of filtering invalid medical text data by using the sentence filtering model comprises the following steps:
1) adding a [CLS] token before the input word sequence and a [SEP] token after it, converting the input word sequence into the corresponding Token Embeddings, and computing the Position Embedding of each word; the two embeddings of each word are summed to obtain the input embedding code. Formally, for an input sequence S = {s_1, s_2, ..., s_n}, the BERT input sequence is constructed as {[CLS], s_1, s_2, ..., s_n, [SEP]}, and the corresponding output is denoted

T = {T_[CLS], T_1, T_2, ..., T_n, T_[SEP]}

wherein s_i is the i-th word in a piece of medical text data, T_i is the corresponding BERT word vector, and T_[CLS], T_[SEP] are the BERT word vectors of the [CLS] and [SEP] tokens;
2) the attention weight α of the input sequence vectors is derived with a global attention matrix:

α = softmax(W·T)

wherein:
W is the global attention matrix, used to help the model capture the information in the input-sequence representation that is most important for classification;
T is the matrix of BERT word vectors;
3) multiplying each attention weight by the BERT word vector produced by the word vector coding layer, and summing, to obtain the attention representation of the input sequence:

attention = Σ_i α_i·T_i

wherein:
T_i is the i-th BERT word vector;
α_i is the attention weight of the i-th BERT word vector;
4) outputting the sentence filtering result through the parameter matrix of the multilayer perceptron:

Output = sigmoid(W_0·attention)

wherein:
W_0 is the parameter matrix of the multilayer perceptron;
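The Token Embedding + Position Embedding sum of step 1) can be sketched as follows (the embedding table and the scalar position code are toy stand-ins for BERT's learned embedding tables):

```python
def embed_input(words, token_emb, dim=4):
    """Wrap the word sequence as [CLS] w_1 ... w_n [SEP], then sum a
    token embedding and a position embedding for each token. Unknown
    words fall back to a zero vector; the 0.01*pos position code is a
    toy placeholder for a learned position embedding."""
    seq = ["[CLS]"] + list(words) + ["[SEP]"]
    out = []
    for pos, word in enumerate(seq):
        tok = token_emb.get(word, [0.0] * dim)
        pos_vec = [0.01 * pos] * dim
        out.append([t + p for t, p in zip(tok, pos_vec)])
    return out
```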
Furthermore, the sentence filtering result is encoded with a bidirectional LSTM model. In the decoding and medical text generation stages, medical nouns are sparse in the corpus, and the model has difficulty learning to generate these proper nouns. In a specific embodiment of the invention, a copy mechanism is therefore introduced that allows the model, when generating certain words, to copy them directly from the input text D into the generated output, alleviating the difficulty of generating sparse words such as proper names;
in detail, the medical text generation process of the medical text generation model based on text copy comprises the following steps:
1) introducing a latent variable z_t to control whether, at each decoding step, the model generates a word from the vocabulary or copies the word to be generated from the text: z_t = 0 means the decoder generates a word from the vocabulary at the current step, and z_t = 1 means the decoder copies a word from the input text D at the current step;
2) generating the medical text with a decoder, where the probability of the decoder producing the t-th word is:

P(y_t | y_<t, D) = P(z_t = 0 | S)·P_gen(y_t | S) + P(z_t = 1 | S)·P_copy(y_t | S, D)

wherein:
D is the text of the sentence filtering result;
S is the text word vector;
y_t is the t-th generated word;
z_t is the latent variable that controls whether, at each decoding step, the model generates a word from the vocabulary or copies the word to be generated from the text.
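One decoding step of the copy mechanism — marginalizing over the latent variable z_t — can be sketched as below (the gate probability and the two word distributions are illustrative inputs; in the model they would come from the decoder state):

```python
def mix_generate_copy(p_copy, gen_dist, copy_dist):
    """One decoding step, marginalizing over z_t:
    z_t = 0 with probability 1 - p_copy: draw y_t from the vocabulary
    distribution; z_t = 1 with probability p_copy: copy y_t from the
    distribution over words of the input text D."""
    mixed = {}
    for w, p in gen_dist.items():
        mixed[w] = mixed.get(w, 0.0) + (1.0 - p_copy) * p
    for w, p in copy_dist.items():
        mixed[w] = mixed.get(w, 0.0) + p_copy * p
    return mixed
```

With a high gate probability, a rare medical term present only in the input text dominates the mixed distribution, which is exactly the sparse-word situation the copy mechanism targets.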
And S2, training a medical text entity recognition model by using the generated large-scale medical field text data, and performing entity recognition on the medical text to be processed by using the trained medical text entity recognition model.
Further, the invention trains the medical text entity recognition model on the generated large-scale medical-field text data; the training is divided into the following steps:
1) with a bidirectional masked language model, randomly marking input tokens with [MASK] and predicting the masked tokens from their context;
2) randomly selecting two sentences from the medical text data and, if their [MASK] marks are labeled as context marks, treating one sentence as the next sentence of the other;
3) repeating the above steps until 30% of the medical text data is marked with [MASK].
The process of performing entity recognition on the medical text with the medical text entity recognition model comprises the following steps:
1) segmenting the large-scale medical-field text data with the jieba word segmentation tool;
2) computing the frequency of each word in the segmentation result, replacing high-frequency segmented words with shorter characters, and introducing a word boundary symbol that joins the divided word groups so that the original word order is kept unchanged;
3) checking whether the user-defined rules have been updated and, once the rules are confirmed to be the latest, extracting the specialized medical terms covered by the rules;
4) recognizing medical text entities with the BERT pre-trained semantic model;
5) combining 3) and 4) so as to retain as much semantic information as possible.
And S3, performing semantic extraction on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity.
Furthermore, the invention utilizes an attention-based information extraction method to perform semantic extraction on the medical text entity to obtain the semantic features of the medical text entity, and the extraction process of the semantic features of the medical text entity comprises the following steps:
1) inputting the medical text entity into a CNN to obtain a word vector representation, and feeding that representation into a two-layer Highway Network to obtain the vector representation:

C = Highway(CNN(w_i))

wherein w_i is the i-th word in the medical text entity, w_i = {c_1, c_2, ..., c_n}, and c_k is the k-th character of w_i;
2) bidirectional context coding is carried out on the word vector representation in the medical text entity by utilizing a context coding layer:
H=BiLSTM(C)
U=BiLSTM(C)
wherein:
H and U are the two context coding results;
C is the word vector representation of the medical text entity;
3) computing the similarity matrix S of the context coding results:

S_tj = sim(H_:t, U_:j)

wherein:
H_:t is the t-th column vector of H;
U_:j is the j-th column vector of U;
sim is the cosine similarity measure;
4) calculating an attention weight vector G for the medical text entity:
G = softmax(S_:t)

wherein:
S_:t is the t-th column of the similarity matrix S;
5) outputting the semantic features of the medical text with a BiLSTM model:

M = BiLSTM(G)

wherein:
M is the semantic feature of the medical text.
And S4, understanding the medical text by utilizing a multilayer perceptron according to the semantic features of the medical text entity.
Further, according to the semantic feature M of the medical text entity, the invention understands the medical text with a multilayer perceptron, taking the understanding result y with the highest probability as the perceptron's output; the specific process is as follows:
P(y|M)=σ(MLP(M))
wherein:
M is the semantic feature of the medical text entity;
y is the medical text understanding result;
σ is the sigmoid function;
MLP is a perceptron consisting of two linear transformations with a nonlinear ReLU activation function;
Further, the invention uses cross entropy as the loss function of the multilayer perceptron:

Loss = -(1/N) Σ_{i=1}^{N} [ y_i·log(p_i) + (1 - y_i)·log(1 - p_i) ]

wherein:
N is the total number of training samples;
p_i is the probability predicted for the i-th sample;
during training, the whole model is optimized with a stochastic gradient descent optimizer. The initial learning rate is 0.005 and is gradually halved as training proceeds, to ensure a good training effect.
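The halving schedule can be sketched as follows (the halving interval is an assumption; the text only states that the 0.005 initial rate is gradually halved as training proceeds):

```python
def halved_lr(epoch, initial_lr=0.005, halve_every=10):
    """Learning rate that starts at initial_lr and is halved every
    halve_every epochs; the interval of 10 is an illustrative choice."""
    return initial_lr * (0.5 ** (epoch // halve_every))
```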
The following describes embodiments of the invention through an algorithmic experiment and tests of the proposed processing method. The hardware test environment of the algorithm is: Ubuntu 16.04, with the open-source framework TensorFlow 1.6, an Intel i7-7700K processor, and an Nvidia GTX 1080-Ti graphics card. The comparison models are the BiLSTM, BERT, and CRF-LSTM models.
In the algorithm experiment, the data set is cMedQA2, a large-scale Chinese medical text data set. In the experiment, the medical text data in the data set are input into the algorithm models, and the accuracy of medical text understanding is used as the evaluation index of algorithm performance.
According to the experimental results, the medical text understanding accuracy of the BiLSTM model is 95.62%, that of the BERT model is 92.14%, that of the CRF-LSTM model is 93.18%, and that of the BERT model-based medical text understanding algorithm of the invention is 96.82%.
The invention also provides a BERT model-based medical text understanding system. Referring to fig. 2, a schematic diagram of an internal structure of a BERT model-based medical text understanding system according to an embodiment of the present invention is provided.
In the present embodiment, the BERT model-based medical text understanding system 1 includes at least a medical text generating means 11, a medical text processor 12, a medical text understanding means 13, a communication bus 14, and a network interface 15.
The medical text generation device 11 may be a PC (Personal Computer), a terminal device such as a smartphone, a tablet Computer, or a mobile Computer, or may be a server.
The medical text processor 12 includes at least one type of readable storage medium, including flash memory, hard disks, multimedia cards, card-type memory (e.g., SD or DX memory), magnetic memory, magnetic disks, optical disks, and the like. In some embodiments, the medical text processor 12 may be an internal storage unit of the BERT model-based medical text understanding system 1, for example a hard disk of the system 1. In other embodiments, the medical text processor 12 may be an external storage device of the system 1, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card provided on the system 1. Further, the medical text processor 12 may include both an internal storage unit and an external storage device of the system 1. The medical text processor 12 can be used not only to store the application software installed in the system 1 and various kinds of data, but also to temporarily store data that has been output or is to be output.
Medical text understanding apparatus 13 may be, in some embodiments, a Central Processing Unit (CPU), controller, microcontroller, microprocessor or other data processing chip for executing program code stored in medical text processor 12 or processing data, such as medical text understanding program instructions.
The communication bus 14 is used to enable connection communication between these components.
The network interface 15 may optionally include a standard wired interface and a wireless interface (e.g., a Wi-Fi interface), and is typically used to establish a communication connection between the system 1 and other electronic devices.
Optionally, the system 1 may further comprise a user interface, which may include a display and an input unit such as a keyboard, and may optionally also include a standard wired interface and a wireless interface. In some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode) touch device, or the like. The display, which may also be referred to as a display screen or display unit, is used for displaying the information processed in the BERT model-based medical text understanding system 1 and for displaying a visualized user interface.
Fig. 2 shows only the BERT model-based medical text understanding system 1 with components 11-15. Those skilled in the art will understand that the structure shown in fig. 2 does not limit the system 1, which may include fewer or more components than shown, combine certain components, or arrange the components differently.
In the embodiment of the system 1 shown in fig. 2, medical text understanding program instructions are stored in the medical text processor 12; the steps performed by the medical text understanding apparatus 13 when executing the medical text understanding program instructions stored in the medical text processor 12 are the same as those of the BERT model-based medical text understanding method described above, and are not repeated here.
Furthermore, an embodiment of the present invention also provides a computer-readable storage medium having stored thereon BERT model-based medical text understanding program instructions, which are executable by one or more processors to implement the following operations:
acquiring medical text data, and filtering invalid medical text data by using a sentence filtering model;
according to the filtered medical text data, generating large-scale medical text data by using a medical text generation model based on text copy;
training a medical text entity recognition model by using the generated large-scale medical field text data;
performing entity recognition on the medical text to be processed by using the trained medical text entity recognition model;
semantic extraction is carried out on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity;
and according to the semantic features of the medical text entities, understanding the medical text by using a multilayer perceptron.
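As a rough illustration only, the six operations above can be sketched as a pipeline of stubs; every function body here is a hypothetical placeholder standing in for the corresponding model in the patent, not an implementation of it.

```python
# Hedged pipeline sketch of the six operations; all bodies are toy
# placeholders for the patent's models (filter, generator, recognizer,
# extractor, perceptron).

def filter_sentences(texts):        # stands in for the sentence filtering model
    return [t for t in texts if t.strip()]

def generate_corpus(texts):         # stands in for the text-copy generation model
    return texts * 2                # placeholder for large-scale generation

def recognize_entities(text):       # stands in for the entity recognition model
    return [w for w in text.split() if w.istitle()]

def extract_semantics(entities):    # stands in for attention-based extraction
    return {e: len(e) for e in entities}

def understand(features):           # stands in for the multilayer perceptron
    return max(features, key=features.get) if features else None

texts = filter_sentences(["Patient has Fever", "", "Aspirin prescribed"])
corpus = generate_corpus(texts)
entities = recognize_entities(corpus[0])
print(understand(extract_semantics(entities)))  # Patient
```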
It should be noted that the above-mentioned numbering of the embodiments of the present invention is merely for description and does not represent the merits of the embodiments. The terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, apparatus, article, or method that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, apparatus, article, or method. Without further limitation, an element introduced by the phrase "comprising a" does not exclude the presence of other like elements in the process, apparatus, article, or method that includes the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) as described above and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (9)

1. A BERT model-based medical text understanding method, the method comprising:
acquiring medical text data, and filtering invalid medical text data by using a sentence filtering model;
according to the filtered medical text data, generating large-scale medical text data by using a medical text generation model based on text copy;
training a medical text entity recognition model by using the generated large-scale medical field text data;
performing entity recognition on the medical text to be processed by using the trained medical text entity recognition model;
semantic extraction is carried out on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity;
and according to the semantic features of the medical text entities, understanding the medical text by using a multilayer perceptron.
2. The BERT model-based medical text understanding method of claim 1, wherein the filtering out invalid medical text data using the sentence filtering model comprises:
the sentence filtering model is a BERT-based self-attention mechanism model; the process of filtering invalid medical text data by using the sentence filtering model comprises the following steps:
1) a [CLS] mark is added before the input word sequence and a [SEP] mark is added after it; the input word sequence is converted into the corresponding Token Embedding, and the Position Embedding corresponding to each word is computed; the two Embeddings corresponding to each word are added to obtain the input Embedding code;
2) the attention weight α of the input sequence vector is obtained using a global attention matrix:

α = softmax(WT)

wherein:

W is the global attention matrix, used to help the model capture the information in the input-sequence representation that is most important for classification;

T is the matrix of BERT word vectors;
3) the attention weights are multiplied by the BERT word vector representations obtained from the word vector coding layer to obtain the attention representation of the input sequence:

attention = Σ_i α_i T_i

wherein:

T_i is the i-th BERT word vector;

α_i is the attention weight of the i-th BERT word vector;
4) the sentence filtering result is output through the parameter matrix of the multilayer perceptron:

Output = sigmoid(W_0 · attention)

wherein:

W_0 is the parameter matrix of the multilayer perceptron.
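As an illustration of steps 2)-4), the attention-based filtering arithmetic can be sketched numerically. The shapes of W and W_0 and all values below are assumptions, since the patent does not fix any dimensions.

```python
import numpy as np

# Numeric sketch of the sentence-filtering steps 2)-4): attention weights
# over BERT word vectors, their weighted sum, and a sigmoid output.
# W, W0, and the toy vectors are illustrative assumptions.

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)
T = rng.normal(size=(5, 8))   # 5 toy "BERT word vectors" of dimension 8
W = rng.normal(size=(8,))     # global attention parameters (shape assumed)
W0 = rng.normal(size=(8,))    # perceptron output parameters (shape assumed)

alpha = softmax(T @ W)                            # step 2): α = softmax(WT)
attention = (alpha[:, None] * T).sum(axis=0)      # step 3): Σ_i α_i T_i
output = 1.0 / (1.0 + np.exp(-(W0 @ attention)))  # step 4): sigmoid(W0·attention)

# A sentence whose score falls below a chosen threshold would be filtered out.
print(0.0 < float(output) < 1.0)  # True
```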
3. The BERT model-based medical text understanding method of claim 2, wherein the large-scale medical text data generation using a text copy-based medical text generation model comprises:
1) an implicit variable z_t is introduced to control whether, at each decoding step, the model generates a word from the word list or copies the currently needed word from the text: z_t = 0 indicates that the decoder generates a word from the word list at the current time step, and z_t = 1 indicates that the decoder copies a word from the input text D at the current time step;
2) the medical text is generated with a decoder, where the probability of generating the t-th word, marginalized over z_t, is:

P(y_t | D, S) = P(y_t, z_t = 0 | D, S) + P(y_t, z_t = 1 | D, S)

wherein:

D is the text of the sentence filtering result;

S is the text word vector;

y_t is the t-th generated word;

z_t is the implicit variable: z_t = 0 indicates that the decoder generates a word from the word list at the current time step, and z_t = 1 indicates that the decoder copies a word from the input text D at the current time step.
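The generate-or-copy mixture of claim 3 can be illustrated with hand-picked toy probabilities; p_gen, p_copy, and the gate value below are illustrative stand-ins, not outputs of the patent's decoder.

```python
import numpy as np

# Toy sketch of the copy/generate mixture: at each step the decoder
# either generates from the vocabulary (z_t = 0) or copies from the input
# text D (z_t = 1); the output distribution marginalizes over z_t.
# All probabilities are hand-picked for illustration.

vocab = ["fever", "cough", "aspirin", "<unk>"]
p_gen = np.array([0.5, 0.3, 0.1, 0.1])   # P(y_t | z_t = 0): vocabulary softmax
p_copy = np.array([0.0, 0.8, 0.2, 0.0])  # P(y_t | z_t = 1): copy attention over D
p_z1 = 0.6                               # P(z_t = 1): predicted copy gate

# P(y_t) = P(z_t=0)·P(y_t|z_t=0) + P(z_t=1)·P(y_t|z_t=1)
p_word = (1 - p_z1) * p_gen + p_z1 * p_copy
print(vocab[int(p_word.argmax())])  # cough
```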
4. The method for understanding medical texts based on the BERT model as claimed in claim 3, wherein the training process of the medical text entity recognition model by using the generated large-scale medical field text data comprises:
1) using a bidirectional masked language model, input tokens are marked with [MASK] by a random marking method, and each [MASK]-marked token is predicted from its context;
2) two sentences are randomly selected from the medical text data, and if their [MASK] marks are marked as context marks, one sentence is considered to be the next sentence of the other;
3) the above steps are repeated until 30% of the medical text data is marked with [MASK].
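A minimal sketch of the random [MASK] marking, assuming simple whole-token masking of 30% of the tokens (the exact sampling procedure is not specified in the text):

```python
import random

# Hedged sketch of the random-[MASK] procedure of claim 4: tokens are
# masked at random until roughly 30% of them carry a [MASK] mark.
# Tokenization and the stopping rule are simplified assumptions.

def mask_tokens(tokens, ratio=0.3, seed=42):
    rng = random.Random(seed)
    n_mask = int(len(tokens) * ratio)
    positions = rng.sample(range(len(tokens)), n_mask)  # distinct positions
    masked = list(tokens)
    for p in positions:
        masked[p] = "[MASK]"
    return masked, sorted(positions)

tokens = ["patient", "reports", "persistent", "dry", "cough",
          "and", "mild", "fever", "since", "Tuesday"]
masked, where = mask_tokens(tokens)
print(masked.count("[MASK]"))  # 3 of 10 tokens (~30%) are masked
```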
5. The method of claim 4, wherein performing entity recognition on the medical text to be processed using the trained medical text entity recognition model comprises:
1) performing word segmentation processing on the text data in the large-scale medical field by adopting a jieba word segmentation tool;
2) the word frequency of each word in the segmentation result is calculated; segmented words with higher word frequency are replaced with shorter symbols, and a word boundary symbol is introduced and combined with the divided word groups so that the original word order remains unchanged;
3) whether the user-defined rules have been updated is checked, and, with the rules confirmed to be the latest, special medical terms are extracted by rule;
4) recognizing the medical text entity by adopting a BERT pre-training semantic model;
5) the results of steps 3) and 4) are combined by taking their union to retain more semantics.
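Steps 1)-2) can be illustrated as follows. The patent uses the jieba tool for segmentation; to keep the sketch self-contained the tokens below are already segmented, and the boundary symbol "▁" and the placeholder ids are assumptions.

```python
from collections import Counter

# Illustrative sketch of steps 1)-2): count word frequencies in the
# segmentation result and replace high-frequency words with shorter
# symbols, joining with a boundary symbol so word order is preserved.
# Pre-segmented tokens stand in for jieba output; "▁" and "#i" are assumed.

segmented = ["糖尿病", "患者", "血糖", "患者", "血糖", "偏高", "患者"]

freq = Counter(segmented)
ids = {w: f"#{i}" for i, (w, _) in enumerate(freq.most_common()) if freq[w] > 1}
encoded = "▁".join(ids.get(w, w) for w in segmented)
print(encoded)  # 糖尿病▁#0▁#1▁#0▁#1▁偏高▁#0
```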
6. The BERT model-based medical text understanding method of claim 5, wherein the semantic extraction of the medical text entity using the attention-based information extraction method comprises:
1) the medical text entity is input into a CNN to obtain a word vector representation, which is then passed through a two-layer Highway Network to obtain the vector representation:

C = Highway(CNN(w_i))

wherein w_i is the i-th word in the medical text entity, w_i = {c_1, c_2, ..., c_n}, where c_k is the k-th character of the word w_i;
2) bidirectional context coding is carried out on the word vector representation in the medical text entity by utilizing a context coding layer:
H=BiLSTM(C)
U=BiLSTM(C)
wherein:

H, U are the two context encoding results, respectively;

C is the word vector representation of the medical text entity;
3) the similarity matrix S of the context encoding results is computed:

S = sim(H_:t, U_:j)

wherein:

H_:t is the t-th column vector of H;

U_:j is the j-th column vector of U;

sim is the cosine similarity measure;
4) the attention weight vector G for the medical text entity is computed:

G = softmax(S_:t)

wherein:

S_:t is the t-th column of the similarity matrix S;
5) the semantic features of the medical text are output using a BiLSTM model:

M = BiLSTM(G)

wherein:

M is the semantic feature of the medical text.
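Steps 3)-4) rest on the cosine-similarity matrix S and the softmax attention G. A numeric sketch follows, with toy H and U standing in for the BiLSTM outputs and rows standing in for the columns H_:t, U_:j:

```python
import numpy as np

# Numeric sketch of steps 3)-4) of claim 6: cosine-similarity matrix S
# between two context encodings H and U, then a softmax over one column
# of S to get the attention vector G. H and U are toy matrices.

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

rng = np.random.default_rng(1)
H = rng.normal(size=(4, 6))   # 4 encoded positions, dimension 6 (assumed)
U = rng.normal(size=(4, 6))

S = np.array([[cosine(H[t], U[j]) for j in range(4)] for t in range(4)])

t = 0
e = np.exp(S[t] - S[t].max())
G = e / e.sum()               # G = softmax(S_:t): attention weights
print(round(float(G.sum()), 6))  # 1.0
```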
7. The method as claimed in claim 6, wherein the medical text understanding using the multilayer perceptron comprises:

medical text understanding is performed with a multilayer perceptron, and the understanding result y with the highest probability is taken as the output of the multilayer perceptron; the specific process is as follows:
P(y|M)=σ(MLP(M))
wherein:

M is the semantic feature of the medical text entity;

y is the medical text understanding result;

σ is the sigmoid function;

MLP is a perceptron consisting of two linear transformations and a nonlinear ReLU activation function;
the cross entropy is used as the loss function to train the multilayer perceptron:

L = -(1/N) Σ_{i=1..N} [ y_i·log ŷ_i + (1 − y_i)·log(1 − ŷ_i) ]

wherein:

N is the total number of training samples; y_i is the label of the i-th sample, and ŷ_i = P(y_i|M) is the predicted probability;
during the training process, the entire model is optimized with a stochastic gradient descent optimizer.
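The perceptron of claim 7 (two linear maps with a ReLU between them, a sigmoid output P(y|M), and a cross-entropy loss averaged over N samples) can be sketched with scalar weights; the weight values below are illustrative, not trained.

```python
import math

# Minimal sketch of claim 7: a two-layer perceptron with ReLU, a sigmoid
# output, and the averaged cross-entropy loss. Scalar weights w1, w2,
# b1, b2 are illustrative assumptions.

def mlp(m, w1=0.8, w2=1.5, b1=0.1, b2=-0.2):
    h = max(0.0, w1 * m + b1)                       # linear + ReLU
    return 1.0 / (1.0 + math.exp(-(w2 * h + b2)))   # linear + sigmoid

def cross_entropy(labels, probs):
    n = len(labels)  # N: total number of training samples
    return -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for y, p in zip(labels, probs)) / n

features = [0.2, 1.4, -0.5]           # toy semantic features M
probs = [mlp(m) for m in features]    # P(y|M) = σ(MLP(M))
loss = cross_entropy([0, 1, 0], probs)
print(loss > 0.0)  # True
```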
8. A BERT model-based medical text understanding system, the system comprising:
medical text generation means for generating large-scale medical text data using a medical text generation model based on a text copy;
the medical text processor is used for training a medical text entity recognition model by utilizing the generated large-scale medical field text data and carrying out entity recognition on a medical text to be processed by utilizing the trained medical text entity recognition model; meanwhile, semantic extraction is carried out on the medical text entity by using an attention-based information extraction method to obtain semantic features of the medical text entity;
and the medical text understanding device is used for understanding the medical text by utilizing the multilayer perceptron.
9. A computer-readable storage medium having stored thereon medical text understanding program instructions executable by one or more processors to implement the steps of the BERT model-based medical text understanding method of any one of claims 1 to 7.
CN202010977191.2A 2020-09-17 2020-09-17 Medical text understanding method and system based on BERT model Withdrawn CN112016314A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010977191.2A CN112016314A (en) 2020-09-17 2020-09-17 Medical text understanding method and system based on BERT model


Publications (1)

Publication Number Publication Date
CN112016314A true CN112016314A (en) 2020-12-01

Family

ID=73522427


Country Status (1)

Country Link
CN (1) CN112016314A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112667808A (en) * 2020-12-23 2021-04-16 沈阳新松机器人自动化股份有限公司 BERT model-based relationship extraction method and system
CN112686044A (en) * 2021-01-18 2021-04-20 华东理工大学 Medical entity zero sample classification method based on language model
CN112712804A (en) * 2020-12-23 2021-04-27 哈尔滨工业大学(威海) Speech recognition method, system, medium, computer device, terminal and application
CN112784018A (en) * 2021-01-28 2021-05-11 新华智云科技有限公司 Text similarity entity disambiguation method and system for character entity library
CN113033210A (en) * 2021-05-31 2021-06-25 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) Drug potential side effect mining method based on social media data analysis
CN114417856A (en) * 2021-12-29 2022-04-29 北京百度网讯科技有限公司 Text sparse coding method and device and electronic equipment
CN116364055A (en) * 2023-05-31 2023-06-30 中国科学院自动化研究所 Speech generation method, device, equipment and medium based on pre-training language model



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20201201