KR20210010366A

KR20210010366A - Fusion tag for preparing glucagon-like peptide-1 or analogues

Info

Publication number: KR20210010366A
Application number: KR1020200086772A
Authority: KR
Inventors: 김성건
Original assignee: 주식회사 펩진
Priority date: 2019-07-19
Filing date: 2020-07-14
Publication date: 2021-01-27
Also published as: KR20210064144A; KR102345011B1

Abstract

One embodiment of the present invention relates to a fusion tag and a fusion polypeptide including the fusion tag and glucagon-like peptide-1 or analogues. When the novel fusion tag of the present invention is used, the yield of glucagon-like peptide-1 or analogues can be improved in comparison with a conventional fusion tag. In particular, it is useful when glucagon-like peptide-1 or analogues having a relatively small molecular weight and an easily degraded amino terminal are produced by using a genetic recombination technique. Moreover, according to one embodiment of the present invention, the fusion polypeptide including the fusion tag is induced to be expressed in the form of an inclusion body in a host cell to be protected from proteolytic enzymes of the host cell and the like, thereby providing an advantage of stably producing glucagon-like peptide-1 or analogues. Accordingly, it is possible to provide a method for producing recombinant glucagon-like peptide-1 or analogues with increased stability and yield more in comparison with a conventional fusion tag.

Description

Fusion tag for producing glucagon-like peptide-1 or analogue thereof {FUSION TAG FOR PREPARING GLUCAGON-LIKE PEPTIDE-1 OR ANALOGUES}

본 발명은 신규한 융합태그로서, 상기 융합태그와 글루카곤 유사 펩타이드-1 또는 이의 유사체를 포함하는 융합 폴리펩타이드에 관한 것이다.The present invention relates to a fusion polypeptide comprising the fusion tag and glucagon-like peptide-1 or an analog thereof as a novel fusion tag.

최근 유전공학 및 생명공학 기술의 발달로 대장균, 효모, 동식물세포 등을 이용하여 유용한 외래 단백질이 많이 생산되고, 이들 단백질이 의약품 등으로 널리 이용되고 있다. 구체적으로, 면역 조절제, 효소 저해제 및 호르몬 같은 의약 및 연구용 단백질이나 반응 첨가 효소와 같은 산업용 단백질에 대한 생산 공정 기술 개발 및 산업화가 추진되고 있다. With the recent development of genetic engineering and biotechnology, many useful foreign proteins are produced using E. coli, yeast, animal and plant cells, etc., and these proteins are widely used as pharmaceuticals. Specifically, the development and industrialization of production process technologies for pharmaceutical and research proteins such as immunomodulators, enzyme inhibitors and hormones, or industrial proteins such as reaction additive enzymes are being promoted.

이중에서도 유전자 재조합 기술은 여러 목적 단백질들의 핵산을 발현벡터에 클로닝하여 재조합 발현벡터를 얻고, 이를 적당한 숙주세포에 형질전환시켜 배양함으로써 목적 단백질 또는 목적 폴리펩타이드를 생산하는 방법이다. 다만, 숙주세포에 내재된 분해 효소(예를 들어, 프로테아제(protease) 또는 펩티다제(peptidase))에 의해 목적 단백질또는 목적 폴리펩타이드의 전체 또는 일부가 분해되어 수율이 낮아지거나, 융합태그로 사용하는 펩타이드의 크기가 제조하고자 하는 목적 단백질 또는 목적 폴리펩타이드의 크기에 비해 너무 커서 수율이 현저히 떨어지는 문제가 있다. Among them, gene recombination technology is a method of producing a target protein or a target polypeptide by cloning a nucleic acid of a number of target proteins into an expression vector to obtain a recombinant expression vector, transforming it into an appropriate host cell, and culturing it. However, all or part of the target protein or target polypeptide is degraded by a degrading enzyme (e.g., protease or peptidase) inherent in the host cell, resulting in a lower yield or used as a fusion tag. There is a problem that the yield is significantly lowered because the size of the peptide to be prepared is too large for the size of the target protein or polypeptide to be produced.

따라서, 유전자 재조합 기술을 이용하여 목적 단백질 또는 목적 폴리펩타이드을 대량 생산하고자 하는 경우, 목적 단백질 또는 목적 폴리펩타이드를 안정적으로 발현시키면서도 생산 수율을 향상시킬 수 있는 융합태그를 개발하는 것이 중요하다.Therefore, in the case of mass production of a target protein or a target polypeptide using gene recombination technology, it is important to develop a fusion tag capable of stably expressing the target protein or target polypeptide while improving the production yield.

본 발명이 이루고자 하는 기술적 과제는 글루카곤 유사 펩타이드-1 또는 이의 유사체를 생산하기 위한 아미노산 서열로 이루어진 융합태그를 제공하는 것이다.The technical problem to be achieved by the present invention is to provide a fusion tag consisting of an amino acid sequence for producing glucagon-like peptide-1 or an analog thereof.

본 발명이 이루고자 하는 기술적 과제는 이상에서 언급한 기술적 과제로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The technical problem to be achieved by the present invention is not limited to the technical problems mentioned above, and other technical problems that are not mentioned can be clearly understood by those of ordinary skill in the technical field to which the present invention belongs from the following description. There will be.

상기 기술적 과제를 달성하기 위하여, 본 발명의 일측면에 따른 글루카곤 유사 펩타이드-1 또는 이의 유사체 생산용 융합태그는 서열번호6의 아미노산 서열을 포함할 수 있다.In order to achieve the above technical problem, the fusion tag for producing glucagon-like peptide-1 or an analog thereof according to an aspect of the present invention may include the amino acid sequence of SEQ ID NO: 6.

본 발명의 실시예에 있어서, 상기 융합태그는 상기 글루카곤 유사 펩타이드-1 또는 이의 유사체의 N말단에 위치할 수 있다.In an embodiment of the present invention, the fusion tag may be located at the N-terminus of the glucagon-like peptide-1 or an analog thereof.

본 발명의 실시예에 있어서, 상기 융합태그는 상기 글루카곤 유사 펩타이드-1 또는 이의 유사체와 함께 발현되어 봉입체(inclusion body)를 형성할 수 있다.In an embodiment of the present invention, the fusion tag may be expressed together with the glucagon-like peptide-1 or an analog thereof to form an inclusion body.

본 발명의 실시예에 있어서, 상기 융합태그는 상기 융합태그와 상기 글루카곤 유사 펩타이드-1 또는 이의 유사체를 연결시켜주는 링커를 포함할 수 있다.In an embodiment of the present invention, the fusion tag may include a linker that connects the fusion tag and the glucagon-like peptide-1 or an analog thereof.

본 발명의 실시예에 있어서, 상기 링커는 친화성 태그를 포함할 수 있다.In an embodiment of the present invention, the linker may include an affinity tag.

본 발명의 실시예에 있어서, 상기 링커는 프로테아제 인식 서열을 포함할 수 있다. In an embodiment of the present invention, the linker may include a protease recognition sequence.

본 발명의 실시예에 있어서, 상기 프로테아제 인식 서열은 상기 링커의 C말단에 위치할 수 있다.In an embodiment of the present invention, the protease recognition sequence may be located at the C-terminus of the linker.

본 발명의 실시예에 있어서, 상기 프로테아제 인식 서열은 담배 식각 바이러스 프로테아제 인식 서열, 엔테로키나아제 인식 서열, 유비퀴틴 카르복시 말단 가수분해효소 인식 서열, 인자Xa 인식 서열, 퓨린 인식 서열 및 이들의 조합으로 이루어진 군으로부터 선택될 수 있다.In an embodiment of the present invention, the protease recognition sequence is from the group consisting of a tobacco etching virus protease recognition sequence, an enterokinase recognition sequence, a ubiquitin carboxy terminal hydrolase recognition sequence, a factor Xa recognition sequence, a purine recognition sequence, and a combination thereof. Can be chosen.

본 발명의 실시예에 있어서, 상기 프로테아제 인식 서열은 서열번호 9의 아미노산 서열을 포함할 수 있다.In an embodiment of the present invention, the protease recognition sequence may include the amino acid sequence of SEQ ID NO: 9.

본 발명의 실시예에 따른 신규한 융합태그를 사용하면, 종래 융합태그 대비 글루카곤 유사 펩타이드-1 또는 이의 유사체의 수율을 향상시킬 수 있다. 특히, 상대적으로 분자량이 작으며 분해되기 쉬운 아미노 말단을 갖는 글루카곤 유사 펩타이드-1 또는 이의 유사체를 유전자 재조합 기술을 이용하여 제조하는 경우에 유용하다. 또한, 본 발명의 실시예에 따른 융합태그를 포함하는 융합 폴리펩타이드는 숙주세포 내에서 내포체 형태로 발현이 유도됨으로써 숙주세포의 단백질 분해효소 등으로부터 보호되어 안정적으로 글루카곤 유사 펩타이드-1 또는 이의 유사체를 생산할 수 있는 장점이 있다. 따라서, 종래의 융합태그를 사용한 경우보다 안정성 및 수율이 향상된 재조합 펩타이드 생산 방법을 제공할 수 있다.When the novel fusion tag according to an embodiment of the present invention is used, the yield of glucagon-like peptide-1 or an analog thereof can be improved compared to the conventional fusion tag. In particular, it is useful when a glucagon-like peptide-1 or an analog thereof having a relatively small molecular weight and an amino terminus that is easily degraded is prepared using genetic recombination technology. In addition, the fusion polypeptide comprising the fusion tag according to the embodiment of the present invention is protected from proteolytic enzymes of the host cell by inducing expression in the form of an inclusion body in the host cell, and stably glucagon-like peptide-1 or an analog thereof It has the advantage of being able to produce. Therefore, it is possible to provide a method for producing a recombinant peptide having improved stability and yield compared to the case of using a conventional fusion tag.

본 발명의 효과는 상기한 효과로 한정되는 것은 아니며, 본 발명의 상세한 설명 또는 특허청구범위에 기재된 발명의 구성으로부터 추론 가능한 모든 효과를 포함하는 것으로 이해되어야 한다.The effects of the present invention are not limited to the above effects, and should be understood to include all effects that can be inferred from the configuration of the invention described in the detailed description or claims of the present invention.

도 1은 재조합 대장균에서 발현된 GLP-1K28R 융합 폴리펩타이드들의 총세포 분획을 SDS-PAGE로 분석한 결과이다(레인 M: 마커 단백질, 레인 1: H6TEV-GLP-1K28R (균주 번호 PG005), 레인 2: PG43-H6TEV-GLP-1K28R(균주 번호 PG011), 레인 3: PG57-H6TEV-GLP-1K28R(균주 번호 PG035); 레인 4: PG68-H6TEV-GLP-1K28R(균주 번호 PG036); 레인 5: PG79-H6TEV-GLP-1K28R(균주 번호 PG037); 레인 6: PG88-H6TEV-GLP-1K28R(균주 번호 PG038); 레인 7: PG97-H6TEV-GLP-1K28R(균주 번호 PG039)).
도 2는 재조합 대장균에서 발현된 H6TEV-GLP-1K28R, PG97-H6TEV-GLP-1K28R, SUMO-H6TEV-GLP-1K28R의 총세포 분획을 SDS-PAGE로 분석한 결과이다(레인 M: 마커 단백질, 레인 1: H6TEV-GLP-1K28R (균주 번호 PG005), 레인 2: PG97-H6TEV-GLP-1K28R(균주 번호 PG039), 레인 3: SUMO-H6TEV-GLP-1K28R(균주 번호 PG049))
도 3은 크로마토그래피를 통해 정제한 PG97-H6TEV-GLP-1K28R 융합 폴리펩타이드를 SDS-PAGE로 분석한 결과이다(레인 M: 마커 단백질, 레인 S: 정제 전 시료, 레인 FT: 통과액 분획, 레인 E: 용출액 분획). 화살표는 G97-H6TEV-GLP-1K28R 융합 폴리펩타이드를 나타낸다.
도 4는 TEV 프로테아제를 이용하여 정제된 PG97-H6TEV-GLP-1K28R 를 절단한 후 분획을 SDS-PAGE로 분석한 결과이다(레인 M: 마커 단백질, 레인 C: TEV 프로테아제를 처리하지 않은 시료, 레인 T: TEV 프로테아제를 처리한 시료).
도 5는 역상 크로마토그래피를 이용하여 PG97-H6TEV-GLP-1K28R 로부터 절단된 PG97-H6TEV와 GLP-1K28R을 분리한 분획들을 SDS-PAGE로 분석한 결과이다(레인 M: 마커 단백질, 레인 S: 정제 전 시료, 레인 FT: 통과액 분획, 레인 W: 세척 분획, 레인 E: 용출액 분획).
도 6은 화학합성된 GLP-1K28R 의 분자량을 측정한 결과를 나타낸 것이다.
도 7은 본 발명에 따라 정제된 GLP-1K28R의 분자량을 측정한 결과를 나타낸 것이다.
도 8은 역상 HPLC 방법으로 화학합성된 GLP-1K28R과 본 발명에 따른 재조합 GLP-1K28R 의 체류시간 및 순도를 확인한 그래프이다. 도 8(a) 및 도 8(b)의 그래프는 화학합성된 GLP-1K28R과 재조합 GLP-1K28R 의 역상 HPLC 분석 크로마토그램을 나타낸다.1 is a result of SDS-PAGE analysis of the total cell fraction of GLP-1K28R fusion polypeptides expressed in recombinant E. coli (lane M: marker protein, lane 1: H6TEV-GLP-1K28R (strain number PG005), lane 2 : PG43-H6TEV-GLP-1K28R (strain number PG011), lane 3: PG57-H6TEV-GLP-1K28R (strain number PG035); lane 4: PG68-H6TEV-GLP-1K28R (strain number PG036); lane 5: PG79 -H6TEV-GLP-1K28R (strain number PG037); lane 6: PG88-H6TEV-GLP-1K28R (strain number PG038); lane 7: PG97-H6TEV-GLP-1K28R (strain number PG039)).
Figure 2 shows the results of SDS-PAGE analysis of the total cell fraction of H6TEV-GLP-1K28R, PG97-H6TEV-GLP-1K28R, and SUMO-H6TEV-GLP-1K28R expressed in recombinant E. coli (lane M: marker protein, lane 1: H6TEV-GLP-1K28R (strain number PG005), lane 2: PG97-H6TEV-GLP-1K28R (strain number PG039), lane 3: SUMO-H6TEV-GLP-1K28R (strain number PG049))
3 is a result of analysis of PG97-H6TEV-GLP-1K28R fusion polypeptide purified through chromatography by SDS-PAGE (lane M: marker protein, lane S: sample before purification, lane FT: flow-through fraction, lane E: eluate fraction). Arrows indicate G97-H6TEV-GLP-1K28R fusion polypeptide.
4 is a result of analyzing the fraction by SDS-PAGE after cleaving PG97-H6TEV-GLP-1K28R purified using TEV protease (lane M: marker protein, lane C: sample not treated with TEV protease, lane T: TEV protease-treated sample).
5 is a result of SDS-PAGE analysis of fractions separated from PG97-H6TEV-GLP-1K28R cleaved PG97-H6TEV and GLP-1K28R using reverse phase chromatography (lane M: marker protein, lane S: purification All samples, lane FT: flow-through fraction, lane W: wash fraction, lane E: eluate fraction).
6 shows the results of measuring the molecular weight of chemically synthesized GLP-1K28R.
7 shows the results of measuring the molecular weight of GLP-1K28R purified according to the present invention.
Figure 8 is a graph confirming the retention time and purity of GLP-1K28R chemically synthesized by reverse-phase HPLC method and the recombinant GLP-1K28R according to the present invention. 8(a) and 8(b) show reverse-phase HPLC chromatograms of chemically synthesized GLP-1K28R and recombinant GLP-1K28R.

이하에서는 첨부한 도면을 참조하여 본 발명을 설명하기로 한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며, 따라서 여기에서 설명하는 실시예로 한정되는 것은 아니다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.Hereinafter, the present invention will be described with reference to the accompanying drawings. However, the present invention may be implemented in various different forms, and therefore is not limited to the embodiments described herein. In the drawings, parts irrelevant to the description are omitted in order to clearly describe the present invention, and similar reference numerals are assigned to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결(접속, 접촉, 결합)"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 부재를 사이에 두고 "간접적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 구비할 수 있다는 것을 의미한다.Throughout the specification, when a part is said to be "connected (connected, contacted, bonded)" with another part, it is not only "directly connected", but also "indirectly connected" with another member in the middle. "Including the case. In addition, when a part "includes" a certain component, it means that other components may be further provided, rather than excluding other components unless specifically stated to the contrary.

본 명세서에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terms used in the present specification are only used to describe specific embodiments, and are not intended to limit the present invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. In this specification, terms such as "comprise" or "have" are intended to designate the presence of features, numbers, steps, actions, components, parts, or a combination thereof described in the specification, but one or more other features. It is to be understood that the presence or addition of elements or numbers, steps, actions, components, parts, or combinations thereof, does not preclude in advance.

이하 첨부된 도면을 참고하여 본 발명의 실시예를 상세히 설명하기로 한다.Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

본 발명의 일측면에 따른 글루카곤 유사 펩타이드-1(GLP-1) 또는 이의 유사체 생산용 융합태그는 서열번호 6의 아미노산 서열을 포함할 수 있다. The fusion tag for producing glucagon-like peptide-1 (GLP-1) or an analog thereof according to an aspect of the present invention may include the amino acid sequence of SEQ ID NO: 6.

재조합 미생물 시스템을 이용하여 여러가지 목적 펩타이드 또는 목적 단백질을 생산하는 경우, 그 목적 물질의 물성에 따라 숙주 세포 내 효소에 의한 분해, 낮은 발현 수준, 부적절한 단백질 폴딩 및/또는 낮은 mRNA 안정성 등으로 인해 수율이 저하될 우려가 있다. 예를 들어, 종래 융합 태그로 활용되는 MBP (maltose binding protein), 글투타티온 S-전달효소 (glutathione-S-transferase), 티오레독신(thioredoxin), SUMO, 유비퀴틴 등은 각각 397, 216, 106, 101, 76 개의 아미노 산을 갖는데, 비교적 저분자량의 목적 폴리펩타이드 등을 생산 시 수율이 좋지 못한 문제가 있었다.In the case of producing various peptides or proteins of interest using a recombinant microbial system, the yield may be reduced due to degradation by enzymes in host cells, low expression levels, inappropriate protein folding and/or low mRNA stability, depending on the properties of the target substances. There is a risk of deterioration. For example, MBP (maltose binding protein), glutathione-S-transferase, thioredoxin, SUMO, ubiquitin, etc., which are conventionally used as fusion tags, are 397, 216, 106, respectively. , 101, 76 amino acids, there was a problem of poor yield when producing a target polypeptide having a relatively low molecular weight.

이에 반해, 본 발명에 따른 상기 융합태그는 상대적으로 분자량이 작은 펩타이드로서, 이를 적용하여 GLP-1, GLP-2 또는 hPTH 1-34 등의 목적 폴리펩타이드 또는 목적 단백질을 생산하는 경우 기존의 융합태그들을 활용한 경우보다 향상된 수율로 목적 폴리펩타이드 또는 목적 단백질을 수득할 수 있다. 예를 들어, 재조합 융합 폴리펩타이드에서 GLP-1의 비율을 아래 표 1에 개략적으로 나타내었다.In contrast, the fusion tag according to the present invention is a peptide having a relatively small molecular weight, and when applying it to produce a target polypeptide or a target protein such as GLP-1, GLP-2 or hPTH 1-34, the existing fusion tag It is possible to obtain a target polypeptide or a target protein in an improved yield than when using them. For example, the ratio of GLP-1 in the recombinant fusion polypeptide is schematically shown in Table 1 below.

융합 파트너Fusion partner 아미노산 개수Number of amino acids 분자량 (kDa)Molecular weight (kDa) 목적 펩타이드 비율 (%)*Target peptide ratio (%)* MBP(Maltose binding protein)Maltose binding protein (MBP) 397397 44.244.2 88 글루타티온 S-전달효소Glutathione S-transferase 216216 23.823.8 1212 서열번호 6의 아미노산 서열을 포함하는 융합태그Fusion tag comprising the amino acid sequence of SEQ ID NO: 6 57 ~ 9757 ~ 97 6.3 - 10.46.3-10.4 23 - 3323-33

*GLP-1을 기준으로 링커를 포함하여 계산*Calculated including linker based on GLP-1

표 1에 나타난 바와 같이, 종래의 융합태그들은 융합 폴리펩타이드에서 GLP-1의 비율이 8% 내지 12%등에 그치는 반면, 본 발명에 따른 융합태그가 융합된 융합 폴리펩타이드에서의 GLP-1의 비율은 23% 내지 33%로 확인된다. 따라서, 동일한 농도의 융합 폴리펩타이드로부터 얻을 수 있는 GLP-1의 양이 종래 융합태그 대비 더 높으므로 최종 생산 수율이 향상될 수 있다.As shown in Table 1, the ratio of GLP-1 in the fusion polypeptide in the conventional fusion tags is only 8% to 12%, etc., while the ratio of GLP-1 in the fusion polypeptide in which the fusion tag according to the present invention is fused. Is identified as 23% to 33%. Therefore, since the amount of GLP-1 that can be obtained from the fusion polypeptide of the same concentration is higher than that of the conventional fusion tag, the final production yield can be improved.

또한, 본 발명에 따른 융합태그는 융합 폴리펩타이드의 불용성 발현을 유도하여 융합 폴리펩타이드가 불용성 내포체로서 숙주세포 내로 고농도로 축적되게 할 수 있다. 따라서, 대장균(Escherichia coli) 등의 숙주세포 내에서 프로테아제 및 펩티다제에 의해 전부 혹은 일부가 분해 혹은 절단될 우려가 있는 글루카곤 유사 펩타이드-1(GLP-1) 또는 이의 유사체를 고수율로 수득하기 용이한 장점이 있다.In addition, the fusion tag according to the present invention can induce insoluble expression of the fusion polypeptide so that the fusion polypeptide is accumulated in a host cell as an insoluble inclusion body at a high concentration. Therefore, to obtain glucagon-like peptide-1 (GLP-1) or an analog thereof in a high yield that may be degraded or cleaved in whole or in part by proteases and peptidases in host cells such as Escherichia coli. It has the advantage of being easy.

구체적으로, 상기 서열번호 6의 아미노산 서열을 포함하는 융합태그는 97개의 아미노산을 갖는 펩타이드를 포함할 수 있다. Specifically, the fusion tag including the amino acid sequence of SEQ ID NO: 6 may include a peptide having 97 amino acids.

또한, 상기 융합태그는 다양하게 변형된 펩타이드, 즉 변이체를 포함할 수 있다. 상기 변형은 융합태그의 기능을 변형시키지 않는 내에서 하나 이상의 단백질을 치환, 결실 또는 추가하는 방법을 통하여 수행될 수 있다. 이러한 다양한 펩타이드는 서열번호 6의 아미노산 서열 중 어느 하나와 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 또는 99% 이상 동일한 것일 수 있다.In addition, the fusion tag may include variously modified peptides, that is, variants. The modification may be performed through a method of substituting, deleting or adding one or more proteins without altering the function of the fusion tag. These various peptides may be 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identical to any one of the amino acid sequences of SEQ ID NO: 6.

본 발명의 또 다른 측면은, 전술한 바와 같은 융합태그, 글루카곤 유사 펩타이드-1 또는 이의 유사체, 및 상기 글루카곤 유사 펩타이드-1 또는 이의 유사체를 연결시켜주는 링커를 포함하는 융합태그를 포함하는 융합 폴리펩타이드를 제공할 수 있다.Another aspect of the present invention is a fusion polypeptide comprising a fusion tag as described above, a glucagon-like peptide-1 or an analog thereof, and a fusion tag comprising a linker linking the glucagon-like peptide-1 or an analog thereof. Can provide.

상기 링커는 친화성 태그를 포함할 수 있다. 본 발명에서 사용된 용어 "친화성 태그"는 재조합 융합 폴리펩타타이드 또는 이를 코딩하는 핵산에 도입될 수 있는 펩타이드 또는 핵산 서열로서 다양한 목적으로 사용될 수 있으며, 예를 들어 목적 폴리펩타이드 또는 목적 단백질의 정제 효율을 높이기 위한 것일 수 있다. 본 발명에서 사용될 수 있는 친화성 태그는 당 업계에 공지된 임의의 적합한 물질을 원하는 바에 따라 사용할 수 있다. 예를 들어, 본 발명에 사용되는 친화성 태그는 폴리히스티딘 태그(서열번호8)일 수 있다.The linker may include an affinity tag. The term "affinity tag" used in the present invention is a peptide or nucleic acid sequence that can be introduced into a recombinant fusion polypeptide or a nucleic acid encoding the same, and can be used for various purposes, for example, purification of a target polypeptide or a protein of interest. It may be to increase efficiency. The affinity tag that can be used in the present invention can be any suitable material known in the art as desired. For example, the affinity tag used in the present invention may be a polyhistidine tag (SEQ ID NO: 8).

또한, 상기 링커는 프로테아제 인식 서열을 포함할 수 있다. 프로테아제란, 특정 아미노산 서열을 인식하여, 인식한 서열 내의 펩타이드 결합이나 그 서열의 마지막 아미노산과 융합된 폴리펩타이드의 첫번째 아미노산과의 펩타이드 결합을 절단하는 단백질 분해효소이다. 본 발명에 따른 융합 폴리펩타이드는 프로테아제 인식 서열을 갖는 링커를 포함함으로써, 최종 단계에서 폴리 펩타이드 정제 시 절단 효소 인식 서열이 포함된 아미노 말단(친화성 태그를 사용한 경우에는 친화성 태그도 포함)과 목적 폴리펩타이드 또는 목적 단백질의 N-말단을 분리시켜 목적 폴리펩타이드 또는 목적 단백질을 회수할 수 있다.In addition, the linker may include a protease recognition sequence. A protease is a protease that recognizes a specific amino acid sequence and cleaves a peptide bond within the recognized sequence or a peptide bond with the first amino acid of a polypeptide fused with the last amino acid of the sequence. The fusion polypeptide according to the present invention includes a linker having a protease recognition sequence, so that the amino terminal containing the cleavage enzyme recognition sequence when the polypeptide is purified in the final step (including the affinity tag when using an affinity tag) and the purpose By separating the N-terminus of the polypeptide or protein of interest, the polypeptide or protein of interest can be recovered.

구체적으로, 상기 프로테아제 인식 서열은 담배 식각 바이러스(TEV) 프로테아제 인식 서열, 엔테로키나아제 인식 서열, 유비퀴틴 가수분해효소 인식 서열, 인자Xa, 퓨린 및 이들의 조합으로 이루어진 군으로부터 선택되는 인식 서열일 수 있다. 예를 들어, 상기 프로테아제 인식 서열은 서열번호9의 아미노산 서열 중 어느 하나를 포함할 수 있다.Specifically, the protease recognition sequence may be a recognition sequence selected from the group consisting of a tobacco etching virus (TEV) protease recognition sequence, an enterokinase recognition sequence, a ubiquitin hydrolase recognition sequence, factor Xa, purine, and combinations thereof. For example, the protease recognition sequence may include any one of the amino acid sequences of SEQ ID NO: 9.

상기 글루카곤 유사 펩타이드-1 또는 이의 유사체는 바람직하게 리라글루타이드(liraglutide) 전구체 펩타이드일 수 있다. The glucagon-like peptide-1 or an analog thereof may preferably be a liraglutide precursor peptide.

상기 리라글루타이드(liraglutide) 전구체 펩타이드(GLP-1K28R)는 31개의 아미노산으로 구성된 글루카곤 유사 펩타이드-1(GLP-1)에서 유래된 폴리펩타이드이고 GLP-1의 28번째 라이신이 아르기닌으로 치환(K28R)되어 있다. 이와 관련하여, 리라글루타이드는 GLP-1K28R의 20번째 라이신 잔기의 아미노그룹에 팔미트산과 글루탐산으로 구성된 N-Palmitoyl-L-glutamic acid가 결합되어 있는 형태를 갖는다. 리라글루타이드는 제2형 당뇨병 또는 비만을 치료제로 사용될 수 있으며, 일반적으로 리라글루타이드는 N-Palmitoyl-L-glutamic acid가 결합되어 있지 않은 리라글루타이드 전구체 펩타이드(GLP-1K28R)를 생산하고, 이의 20 번째 라이신 잔기에 N-Palmitoyl-L-glutamic acid를 결합시키는 공정으로 생산할 수 있다(Dunweber, Jensen et al. 2007). The liraglutide precursor peptide (GLP-1K28R) is a polypeptide derived from glucagon-like peptide-1 (GLP-1) consisting of 31 amino acids, and the 28th lysine of GLP-1 is substituted with arginine (K28R) Has been. In this regard, liraglutide has a form in which N-Palmitoyl-L-glutamic acid consisting of palmitic acid and glutamic acid is bound to the amino group of the 20th lysine residue of GLP-1K28R. Liraglutide can be used as a treatment for type 2 diabetes or obesity, and in general, liraglutide produces a liraglutide precursor peptide (GLP-1K28R) that is not bound to N-Palmitoyl-L-glutamic acid, It can be produced by binding N-Palmitoyl-L-glutamic acid to its 20th lysine residue (Dunweber, Jensen et al. 2007).

본 발명의 다른 측면은, 전술한 융합 폴리펩타이드를 코딩하는 폴리 뉴클레오타이드를 제공할 수 있다.Another aspect of the present invention can provide a polynucleotide encoding the above-described fusion polypeptide.

본 발명의 또 다른 측면은, 전술한 융합 폴리펩타이드를 코딩하는 폴리 뉴클레오타이드 분자를 포함하는 발현 벡터를 제공할 수 있다. 본 발명에서 사용된 용어 "벡터"는 숙주 세포에 도입되어 숙주 세포 유전체 내로 재조합 및 삽입될 수 있다. 상기 벡터는 선형 핵산, 플라스미드, 파지미드, 코스미드, RNA 벡터, 바이러스 벡터 및 이의 유사체들을 포함한다. 바이러스 벡터의 예로는 레트로바이러스, 아데노바이러스, 및 아데노-관련 바이러스를 포함할 수 있다. 또한, 상기 플라스미드는 항생제 내성 유전자와 같은 선별마커를 포함할 수 있고, 플라스미드를 유지하는 숙주 세포는 선택적인 조건하에서 배양될 수 있다.Another aspect of the present invention can provide an expression vector comprising a polynucleotide molecule encoding the above-described fusion polypeptide. As used herein, the term "vector" can be introduced into a host cell and recombined and inserted into the host cell genome. Such vectors include linear nucleic acids, plasmids, phagemids, cosmids, RNA vectors, viral vectors, and analogs thereof. Examples of viral vectors may include retroviruses, adenoviruses, and adeno-associated viruses. In addition, the plasmid may contain a selection marker such as an antibiotic resistance gene, and the host cell holding the plasmid may be cultured under selective conditions.

본 발명에서 사용된 용어, "유전자 발현" 또는 "발현"은, DNA서열의 전사, mRNA 전사체의 번역 및 융합 단백질 생산물 또는 이의 단편의 분비를 의미하는 것으로 이해된다.As used herein, the term "gene expression" or "expression" is understood to mean transcription of a DNA sequence, translation of an mRNA transcript, and secretion of a fusion protein product or fragment thereof.

본 발명에서 사용된 용어 "숙주 세포"는 재조합 발현 벡터가 도입될 수 있는 원핵 및 진핵 세포를 나타낸다. 본 발명에서 사용된 용어, "형질주입된", "형질전환된" 및 "형질감염된"은 당업계에 공지된 기술을 사용하여 세포 내로 핵산(예를 들어, 벡터)을 도입하는 것을 의미한다.The term "host cell" as used herein refers to prokaryotic and eukaryotic cells into which a recombinant expression vector can be introduced. As used herein, the terms "transfected", "transformed" and "transfected" refer to introducing a nucleic acid (eg, a vector) into a cell using techniques known in the art.

또한, 본 발명의 글루카곤 유사 펩타이드-1 또는 이의 유사체를 포함하는 융합 폴리펩타이드를 코딩하는 뉴클레오타이드 서열을 포함하는 발현 벡터는 유도성 발현 벡터의 일부로서 제공될 수 있다. 예를 들어, 발현벡터로 형질전환된 숙주 세포를 배양하고, 발현 벡터로부터 융합 단백질의 발현을 유도한다. 발현 벡터는 플라스미드일 수 있다. 예를 들어, 발현 벡터는 재조합 융합 단백질을 암호화하고 있는 유전자와 선별 마커에 대한 유전자를 추가로 포함하는 플라스미드이고, 숙주 세포는 플라스미드를 유지하도록 하는 선별 조건하에서 배양된다.In addition, an expression vector comprising a nucleotide sequence encoding a fusion polypeptide comprising a glucagon-like peptide-1 or an analog thereof of the present invention may be provided as part of an inducible expression vector. For example, a host cell transformed with an expression vector is cultured, and expression of the fusion protein is induced from the expression vector. The expression vector can be a plasmid. For example, the expression vector is a plasmid further comprising a gene encoding a recombinant fusion protein and a gene for a selection marker, and the host cell is cultured under selection conditions to maintain the plasmid.

상기 발현 벡터는 단백질 코딩 서열 이외에 프로모터, 리보솜 결합 부위(RBS), 전사 종결 인자, 및 번역 개시 및 정지 신호 등의 단백질 발현과 관련된 조절 요소를 포함할 수 있다. 예를 들어, 유용한 목적 폴리펩타이드의 발현 수준에 영향을 주는 RBS는 다양한 생물의 유전자나 합성을 통해 얻을 수 있다.In addition to the protein coding sequence, the expression vector may include a promoter, a ribosome binding site (RBS), a transcription termination factor, and regulatory elements related to protein expression such as translation initiation and stop signals. For example, RBS, which affects the expression level of a useful polypeptide of interest, can be obtained through genes or synthesis of various organisms.

또한, 본 발명에서 사용되는 프로모터는 조절되는 유도성 프로모터로서, 예를 들어, T7 프로모터, Lac 프로모터, Tac 프로모터 및 Trc 프로모터로부터 유래한 프로모터일 수 있다. 구체적으로, T7 프로모터는 일반적으로 Lac 계열 프로모터를 사용하는 발현 시스템에는 발현 조절자로서 LacI 유전자가 발현 시스템에 존재할 수 있다. 보통 항시적으로 발현되는 유전자인 LacI 유전자는 Lac 패밀리 프로모터의 lac 작동 인자(operator)에 결합하는 Lac 억제인자 단백질 LacI 단백질을 코딩한다. 따라서, Lac 계열 프로모터가 이용되는 경우, Lac 유전자는 또한 발현시스템에 포함되고 발현될 수 있다.Further, the promoter used in the present invention is a regulated inducible promoter, and may be, for example, a promoter derived from a T7 promoter, a Lac promoter, a Tac promoter, and a Trc promoter. Specifically, the T7 promoter may have a LacI gene as an expression regulator in an expression system using a Lac family promoter in general. The LacI gene, usually a constitutively expressed gene, encodes the Lac repressor protein LacI protein, which binds to the lac operator of the Lac family promoter. Thus, when a Lac family promoter is used, the Lac gene can also be included and expressed in an expression system.

본 발명의 다른 측면은, 상기 발현 벡터를 포함하는 숙주 세포를 제공할 수 있다. 적절한 숙주 세포는, 본 발명의 DNA 서열이 형질주입 또는 형질감염되어, 글루카곤 유사 펩타이드-1 또는 이의 유사체의 발현 및/또는 분비에 이용될 수 있으며, 본 발명에 사용될 수 있는 바람직한 숙주 세포는 대장균 세포, 불사의 하이브리도마 세포, NS/0 골수종 세포, 293 세포, 중국 햄스터 난소 세포(CHO cell), HeLa 세포, 인간 양수 유래 세포(CapT cell) 또는 COS 세포를 포함한다. 예를 들어, 본 발명에서 융합 폴리펩타이드를 발현시키는데 사용된 숙주 균주는 대장균 BL21(DE3)이며, 상기 유전자 및 이들의 사용 방법은 당 업계에 공지되어 있다.Another aspect of the present invention can provide a host cell comprising the expression vector. Suitable host cells are transfected or transfected with the DNA sequence of the present invention to be used for expression and/or secretion of glucagon-like peptide-1 or an analog thereof, and a preferred host cell that can be used in the present invention is E. coli cells. , Immortal hybridoma cells, NS/0 myeloma cells, 293 cells, Chinese hamster ovary cells (CHO cells), HeLa cells, human amniotic fluid derived cells (CapT cells) or COS cells. For example, the host strain used to express the fusion polypeptide in the present invention is E. coli BL21 (DE3), and the gene and its use method are known in the art.

또한, 재조합 융합 단백질을 발현하기 위한 최적의 조건을 결정하기 위해 고처리량 스크리닝(high throughput screening)을 수행할 수 있다. 스크리닝에서 변화될 수 있는 조건은, 숙주 세포, 숙주 세포의 유전적 배경 (예를 들어, 다양한 프로테아제와 같은 특정 유전자의 결실), 발현 구조물 내 프로모터의 유형, 발현 구조물 내 RBS의 유형, 배양 온도, 유도성 프로모터가 사용될 때 유도 시 세포 탁도, lac 계열 프로모터가 사용될 때 유도에 사용되는 IPTG의 농도, 단백질 유도기간, 배양물에 유도제 첨가 후 배양 온도, 배양물의 교반 속도, 플라스미드 유지에 대한 선별 방법, 용기 내 배양물의 부피, 및 세포 용해 방법을 포함한다.In addition, high throughput screening can be performed to determine optimal conditions for expressing the recombinant fusion protein. Conditions that can be changed in the screening include the host cell, the genetic background of the host cell (e.g., deletion of certain genes such as various proteases), the type of promoter in the expression construct, the type of RBS in the expression construct, the culture temperature, Cell turbidity during induction when an inducible promoter is used, the concentration of IPTG used for induction when a lac-based promoter is used, protein induction period, culture temperature after adding an inducer to the culture, agitation speed of the culture, selection method for plasmid maintenance, The volume of the culture in the vessel, and the cell lysis method.

또한, "최적의 숙주 균주" 또는 "최적의 발현 시스템"은 유전형(genotype)이 상이한 다양한 숙주 세포와 비교하여 발현된 재조합 융합 단백질의 품질, 발현 양, 발현 수준 및/또는 위치를 기준으로 하여 확인하거나 선택할 수 있다. 따라서, 최적의 숙주 균주는 목적 폴리펩타이드 또는 목적 단백질의 생산에 적합하도록 재조합 융합 단백질을 생산하는 균주이다. 목적 폴리펩타이드 또는 목적 단백질의 생산에 대한 높은 적합성은 생산되는 목적 폴리펩타이드 또는 목적 단백질에 따라 달라질 수 있지만 재조합 융합 단백질의 품질, 발현 양, 발현 수준, 예를 들어 목적 폴리펩타이드 또는 목적 단백질의 세포 내 분해 혹은 부분적인 절단 유무가 발생되었는지, 어느 정도의 발현양 혹은 발현 수준인지, 단백질이 적합하게 또는 바람직하게 폴딩되거나 불용성 내포체로 축적되었는지 등을 포함한다. 즉, 최적의 숙주 균주 또는 최적의 발현 시스템은 대조군 숙주 균주나 발현 시스템과 비교하여 특정 수준 이상의 생산 수율, 발현 양, 발현 수준, 회수 가능한 재조합 융합 단백질의 양, 적합하게 불용성 내포체 형태로 축적된 재조합 융합 단백질의 양을 생산할 수 있다.In addition, "optimal host strain" or "optimal expression system" is identified based on the quality, amount of expression, expression level and/or location of the expressed recombinant fusion protein compared to various host cells with different genotypes. Or you can choose. Therefore, the optimal host strain is a strain that produces a recombinant fusion protein suitable for production of a target polypeptide or a target protein. The high suitability for the production of a polypeptide of interest or a protein of interest may vary depending on the polypeptide of interest or protein of interest to be produced, but the quality, amount of expression, and level of expression of the recombinant fusion protein, for example, the intracellular It includes whether decomposition or partial cleavage has occurred, how much expression or level of expression, and whether the protein is suitably or preferably folded or accumulated into insoluble inclusion bodies. That is, the optimal host strain or optimal expression system is a production yield above a certain level, the amount of expression, the amount of expression, the amount of recombinant fusion protein that can be recovered, suitably accumulated in the form of insoluble inclusion bodies compared to the control host strain or expression system. The amount of recombinant fusion protein can be produced.

이하, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예에 대하여 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다.Hereinafter, embodiments of the present invention will be described in detail so that those skilled in the art can easily implement the present invention. However, the present invention may be implemented in various different forms and is not limited to the embodiments described herein.

실시예 1. GLP-1K28R 융합 폴리펩타이드의 제조 및 생산Example 1. Preparation and production of GLP-1K28R fusion polypeptide

실시예 1.1 GLP-1K28R 융합 폴리펩타이드 발현 플라스미드의 제작Example 1.1 Construction of GLP-1K28R fusion polypeptide expression plasmid

GLP-1K28R 융합 폴리펩타이드에 대한 유전자는 overlap extension polymerase chain reaction (OE-PCR) 방법을 이용하여 합성하였다. 이때, 상기GLP-1K28R 융합 태그는 아미노 말단 융합태그로서 PG43(서열번호 1), PG57(서열번호 2), PG68(서열번호 3), PG79(서열번호 4), PG88(서열번호 5), PG97(서열번호 6) 중 하나와6 히스티딘 태그(서열번호 8) 및 TEV 프로테아제 인식 서열(서열번호 9) 및 GLP-1K28R의 아미노산 서열(서열번호 17)이 포함되어 있다.The gene for the GLP-1K28R fusion polypeptide was synthesized using the overlap extension polymerase chain reaction (OE-PCR) method. At this time, the GLP-1K28R fusion tag is an amino terminal fusion tag, PG43 (SEQ ID NO: 1), PG57 (SEQ ID NO: 2), PG68 (SEQ ID NO: 3), PG79 (SEQ ID NO: 4), PG88 (SEQ ID NO: 5), PG97 One of (SEQ ID NO: 6) and 6 histidine tag (SEQ ID NO: 8) and TEV protease recognition sequence (SEQ ID NO: 9) and the amino acid sequence of GLP-1K28R (SEQ ID NO: 17) are included.

음성 대조군으로서 사용된 GLP-1K28R융합 폴리펩타이드(H6TEV-GLP-1K28R, 서열번호 18)는 아미노 말단 융합태그가 포함되어 있지 않고, 6 히스티딘 태그(서열번호 7), TEV 프로테아제 인식 서열(서열번호 8) 및 GLP-1K28R의 아미노산 서열(서열번호 17)이 포함되어 있다. The GLP-1K28R fusion polypeptide (H6TEV-GLP-1K28R, SEQ ID NO: 18) used as a negative control does not contain an amino terminal fusion tag, 6 histidine tag (SEQ ID NO: 7), TEV protease recognition sequence (SEQ ID NO: 8) ) And the amino acid sequence of GLP-1K28R (SEQ ID NO: 17).

양성 대조군으로 사용된 GLP-1K28R융합 폴리펩타이드(SUMO-H6TEV-GLP-1K28R, 서열번호 25)는 아미노 말단 융합태그 SUMO(서열번호 7) 와6 히스티딘 태그(서열번호 8) 및 TEV 프로테아제 인식 서열(서열번호 9) 및 GLP-1K28R의 아미노산 서열(서열번호 17)이 포함되어 있다. 상기 PG43(서열번호 1), PG57(서열번호 2), PG68(서열번호 3), PG79(서열번호 4), PG88(서열번호 5), PG97(서열번호 6)중 어느 하나를 포함하는 GLP-1K28R융합 폴리펩타이드의 아미노산 서열은 서열번호 19 내지 24에 해당한다.GLP-1K28R fusion polypeptide (SUMO-H6TEV-GLP-1K28R, SEQ ID NO: 25) used as a positive control was an amino terminal fusion tag SUMO (SEQ ID NO: 7) and 6 histidine tag (SEQ ID NO: 8) and TEV protease recognition sequence ( SEQ ID NO: 9) and the amino acid sequence of GLP-1K28R (SEQ ID NO: 17) are included. GLP- containing any one of PG43 (SEQ ID NO: 1), PG57 (SEQ ID NO: 2), PG68 (SEQ ID NO: 3), PG79 (SEQ ID NO: 4), PG88 (SEQ ID NO: 5), PG97 (SEQ ID NO: 6) The amino acid sequence of the 1K28R fusion polypeptide corresponds to SEQ ID NOs: 19 to 24.

각 융합 폴리펩타이드의 유전자는 제한 효소 NdeI, NcoI 및 XhoI의 인식 서열과 1개의 종결 코돈을 포함한다. 상기 GLP-1K28R융합 폴리펩타이드를 코딩하는 뉴클레오타이드 서열은 서열번호 59 내지 64에 해당하며, 음성 및 양성 대조군은 서열번호 58 및 서열번호 65에 해당한다.The gene of each fusion polypeptide contains the recognition sequences of the restriction enzymes NdeI, NcoI and XhoI and one stop codon. The nucleotide sequence encoding the GLP-1K28R fusion polypeptide corresponds to SEQ ID NO: 59 to 64, and the negative and positive controls correspond to SEQ ID NO: 58 and SEQ ID NO: 65.

아래 표2에 기재한 GLP-1K28R융합 폴리펩타이드 발현 플라스미드 pSGK530, pSGK497, pSGK699, pSGK700, pSGK701, pSGK702, pSGK490, pSGK752를 제작하기 위해서, OE-PCR로 합성된 GLP-1K28R 융합단백질 유전자 단편을 NdeI 및 XhoI 제한 효소를 이용하여 절단하고, T7 프로모터, Lac 작동자 및 LacI 유전자를 포함하여 IPTG에 의한 발현 조절이 가능한 발현 벡터pET26b에 클로닝하였다.In order to prepare the GLP-1K28R fusion polypeptide expression plasmid pSGK530, pSGK497, pSGK699, pSGK700, pSGK701, pSGK702, pSGK490, pSGK752 shown in Table 2 below, GLP-1K28R fusion protein gene fragment synthesized by OE-PCR and NdeI It was digested using XhoI restriction enzyme, and cloned into an expression vector pET26b capable of regulating expression by IPTG, including the T7 promoter, Lac effector, and LacI gene.

균주 Strain 숙주세포Host cell 플라스미드Plasmid 재조합 융합 폴리펩타이드Recombinant fusion polypeptide PG005PG005 E. coli BL21(DE3) E. coli BL21(DE3) pSGK530pSGK530 H6TEV-GLP-1K28RH6TEV-GLP-1K28R PG011PG011 E. coli BL21(DE3) E. coli BL21(DE3) pSGK497pSGK497 PG43-H6TEV-GLP-1K28RPG43-H6TEV-GLP-1K28R PG035PG035 E. coli BL21(DE3) E. coli BL21(DE3) pSGK699pSGK699 PG57-H6TEV-GLP-1K28RPG57-H6TEV-GLP-1K28R PG036PG036 E. coli BL21(DE3) E. coli BL21(DE3) pSGK700pSGK700 PG68-H6TEV-GLP-1K28RPG68-H6TEV-GLP-1K28R PG037PG037 E. coli BL21(DE3) E. coli BL21(DE3) pSGK701pSGK701 PG79-H6TEV-GLP-1K28RPG79-H6TEV-GLP-1K28R PG038PG038 E. coli BL21(DE3) E. coli BL21(DE3) pSGK702pSGK702 PG88-H6TEV-GLP-1K28RPG88-H6TEV-GLP-1K28R PG039PG039 E. coli BL21(DE3) E. coli BL21(DE3) pSGK490pSGK490 PG97-H6TEV-GLP-1K28RPG97-H6TEV-GLP-1K28R PG049PG049 E. coli BL21(DE3) E. coli BL21(DE3) pSGK752pSGK752 SUMO-H6TEV-GLP-1K28RSUMO-H6TEV-GLP-1K28R

제작된 GLP-1K28R융합 폴리펩타이드 발현 플라스미드들은 DNA 염기서열 확인을 통해 정확한 클로닝 여부를 확인하였다. 제작된 GLP-1K28R융합 폴리펩타이드 발현 플라스미드들을 이용하여 염화칼슘을 이용한 화학적인 방법으로 대장균 BL21(DE3) 세포를 형질전환하였다. GLP-1K28R융합 폴리펩타이드의 발현 플라스미드들이 형질도입된 대장균들은 카나마이신(kanamycin)이 50 ㎍/㎖의 농도로 포함된 LB 고체배지에서 콜로니를 형성하였다. 형질전환된 대장균들은 각각 카나마이신이 50 ㎍/㎖의 농도로 존재하는 LB 액체 배지에 배양한 후, 50% 글리세롤을 배양액과 동일한 부피로 첨가하여 세포 농축액(cell stock)을 만들고, 이를 -80℃ 온도의 냉동고에 보관하였다.The prepared GLP-1K28R fusion polypeptide expression plasmids were checked for correct cloning through DNA sequence confirmation. Using the prepared GLP-1K28R fusion polypeptide expression plasmids, E. coli BL21 (DE3) cells were transformed by a chemical method using calcium chloride. E. coli transduced with the expression plasmids of the GLP-1K28R fusion polypeptide formed colonies in LB solid medium containing kanamycin at a concentration of 50 µg/ml. Transformed E. coli were each cultured in LB liquid medium in which kanamycin was present at a concentration of 50 ㎍/㎖, and then 50% glycerol was added in the same volume as the culture solution to make a cell stock, which was at -80°C. Stored in the freezer.

실시예 1.2. 형질전환된 세포의 배양 및 GLP-1K28R의 발현Example 1.2. Culture of transformed cells and expression of GLP-1K28R

-80℃ 온도에서 보관된 GLP-1K28R융합 폴리펩타이드의 발현 플라스미드들이 형질전환된 대장균의 세포 농축액을 실온에서 녹인 후 50 ㎕를 카나마이신이 50 ㎍/㎖의 농도로 존재하는 LB 액체 배지 5 ㎖이 포함된 시험관에 첨가하여 37℃ 온도의 진탕배양기에서 12시간 동안 종균 배양하였다. 종균 배양된 대장균 2 ㎖는 카나마이신이 50 ㎍/㎖의 농도로 존재하는 LB 액체 배지 200 ㎖이 포함된 플라스크에 첨가하여 37℃ 온도의 진탕배양기에서 배양하였다. 배양 약 3시간 후 세포의 광학밀도(OD600)가 약 1.0이 되었을 때, IPTG의 최종 농도가 0.1 mM이 되도록 첨가하여 GLP-1K28R 융합 폴리펩타이드의 발현을 유도하였다. 발현 유도 4시간 후에 광학 밀도를 측정하여 세포의 광학 밀도를 측정하였다.After dissolving the cell concentrate of E. coli transformed with the expression plasmids of GLP-1K28R fusion polypeptide stored at -80℃ at room temperature, 50 µl of LB liquid medium containing kanamycin at a concentration of 50 µg/ml is included. Incubated for 12 hours in a shaking incubator at 37° C. and added to the test tube. 2 ml of the seed cultured E. coli was added to a flask containing 200 ml of LB liquid medium in which kanamycin was present at a concentration of 50 µg/ml, and cultured in a shaking incubator at 37°C. When the optical density (OD600) of the cells reached about 1.0 after about 3 hours of culture, the final concentration of IPTG was added to 0.1 mM to induce the expression of the GLP-1K28R fusion polypeptide. After 4 hours of induction of expression, the optical density was measured to measure the optical density of the cells.

실시예 1.3. 발현 수준 비교 분석을 위한 시료의 제조 Example 1.3. Preparation of samples for comparative analysis of expression levels

발현 유도가 된 세포의 광학밀도가 10.0이 되도록 세포를 농축하고 50 mM 인산 나트륨, pH 7.2 완충액으로 재현탁시킨 후 초음파 세포 파쇄기(ultrasonic processor, Cole-Parmer)를 이용하여 세포를 파쇄하였다. 파쇄된 세포는 총 세포 분획으로 표지하였다. 세포 파쇄액은 12,000×g, 4℃ 온도에서 15분간 원심분리하였다. 상등액을 회수하고 가용성 분획으로 표지하였다. 불용성 분획은 500 ㎕의 50 mM 인산 나트륨, pH 7.2 완충액으로 초음파 세포 파쇄기를 이용하여 재현탁하고 불용성 분획으로 표지하였다.The cells were concentrated so that the optical density of the cells from which expression was induced was 10.0, resuspended in 50 mM sodium phosphate, pH 7.2 buffer, and then disrupted by using an ultrasonic cell disruptor (Cole-Parmer). The disrupted cells were labeled as the total cell fraction. The cell lysate was centrifuged for 15 minutes at 12,000×g and 4°C. The supernatant was recovered and labeled as a soluble fraction. The insoluble fraction was resuspended in 500 µl of 50 mM sodium phosphate, pH 7.2 buffer using an ultrasonic cell disruptor and labeled as an insoluble fraction.

실시예 1.4. SDS-PAGE 분석을 통한 생산된 GLP-1K28R 융합 폴리펩타이드의 확인Example 1.4. Identification of the GLP-1K28R fusion polypeptide produced through SDS-PAGE analysis

50 ㎕의 총 세포, 가용성 및 불용성 분획을 각각 50 ㎕의 2배 농축된 SDS 시료 완충액(SDS sample buffer 2× concentrate, Sigma)에 혼합한 후, 95℃ 온도에서 5분간 가열하여 각 시료의 단백질이 변성될 수 있도록 하였다. 시료 내 변성된 단백질들은 16% SDS-PAGE 겔 및 TANK 완충액을 이용하여 분자량 크기에 따라 겔 내에서 분리될 수 있도록 하였다. SDS-PAGE 후 겔을 쿠마시 블루(Coomassie blue) R-250이 포함된 염색 완충액으로 염색한 후, 탈색 완충액으로 탈색하여 염색된 단백질만 보일 수 있도록 하였다. 그 결과를 도 1에 나타내었다. 50 µl of total cells, soluble and insoluble fractions were mixed in each of 50 µl of a twice-concentrated SDS sample buffer (SDS sample buffer 2×concentrate, Sigma), and then heated at 95° C. for 5 minutes to obtain protein in each sample. Allowed to be denatured. The denatured proteins in the sample were separated in the gel according to the molecular weight size using a 16% SDS-PAGE gel and TANK buffer. After SDS-PAGE, the gel was stained with a staining buffer containing Coomassie blue R-250, and then bleached with a bleaching buffer so that only the stained protein could be seen. The results are shown in FIG. 1.

도 1을 참조하면, 음성 대조군인 서열번호 1 내지 서열번호 6의 아미노산 서열을 포함하는 융합태그가 포함되지 않은 H6TEV-GLP-1K28R(5.1 kDa의 분자량)의 밴드는 SDS-PAGE 젤에서 검출되지 않아 발현 후 세포내 단백질 분해효소에 의해 분해된 것으로 보인다. 아미노 말단 융합태그인 PG43, PG57, PG68, PG79, PG88 및 PG97이 융합된 GLP-1K28R 융합 폴리펩타이드인 PG43-H6TEV-GLP-1K28R(9.8 kDa의 분자량), PG57-H6TEV-GLP-1K28R(11.3 kDa의 분자량), PG68-H6TEV-GLP-1K28R(12.5 kDa의 분자량), PG79-H6TEV-GLP-1K28R(13.7 kDa의 분자량), PG88-H6TEV-GLP-1K28R(14.7 kDa의 분자량) 및 PG97-H6TEV-GLP-1K28R(15.3 kDa의 분자량)의 발현 수준은 대조군(H6TEV-GLP-1K28R) 대비 매우 향상된 것이 확인되었다. Referring to Figure 1, the band of H6TEV-GLP-1K28R (5.1 kDa molecular weight) without the fusion tag containing the amino acid sequence of SEQ ID NO: 1 to SEQ ID NO: 6, which is a negative control, was not detected in the SDS-PAGE gel. It appears to have been degraded by intracellular proteases after expression. Amino terminal fusion tags PG43, PG57, PG68, PG79, PG88 and PG97 are fused GLP-1K28R fusion polypeptide PG43-H6TEV-GLP-1K28R (molecular weight of 9.8 kDa), PG57-H6TEV-GLP-1K28R (11.3 kDa Molecular weight), PG68-H6TEV-GLP-1K28R (molecular weight of 12.5 kDa), PG79-H6TEV-GLP-1K28R (molecular weight of 13.7 kDa), PG88-H6TEV-GLP-1K28R (molecular weight of 14.7 kDa) and PG97-H6TEV- It was confirmed that the expression level of GLP-1K28R (molecular weight of 15.3 kDa) was significantly improved compared to the control (H6TEV-GLP-1K28R).

덴시토미터 분석을 통해 PG97이 융합된 PG97-H6TEV-GLP-1K28R이 PG43, PG57, PG68, PG79 및 PG88이 융합된 폴리펩타이드들 보다 월등히 높은 것으로 확인되었다(도 1). 또한 양성 대조군인 SUMO-H6TEV-GLP-1K28R(16.2 kDa의 분자량)와 비교해도 PG97-H6TEV-GLP-1K28R의 발현수준이 월등히 높았다(도 2). 따라서 이후 실험은 PG97-H6TEV-GLP-1K28R로 진행하였다.Through densitometric analysis, it was confirmed that PG97-H6TEV-GLP-1K28R fused with PG97 was significantly higher than the polypeptides fused with PG43, PG57, PG68, PG79 and PG88 (FIG. 1). In addition, compared with the positive control SUMO-H6TEV-GLP-1K28R (molecular weight of 16.2 kDa), the expression level of PG97-H6TEV-GLP-1K28R was significantly higher (Fig. 2). Therefore, subsequent experiments were performed with PG97-H6TEV-GLP-1K28R.

실시예 2. PG97-H6TEV-GLP-1K28R의 회수 및 정제Example 2. Recovery and purification of PG97-H6TEV-GLP-1K28R

실시예 2.1. 세포 파쇄 및 불용성 내포체 회수Example 2.1. Cell disruption and recovery of insoluble inclusion bodies

플라스크 규모에서 발현된 세포의 냉동된 세포 펠렛을 50 ㎖의 50 mM 인산 나트륨, pH 7.2 완충액을 첨가하여 해동하였다. 재현탁된 세포는 초음파 세포 파쇄기(ultrasonic processor, Cole-Parmer)를 이용하여 파쇄하였다. 용해된 세포는 12,000 rpm(12,000×g) 조건에서 30분간 원심분리하고 상등액을 제거하여 재조합 융합 폴리펩타이드가 포함된 불용성 내포체 분획을 회수하였다.Frozen cell pellets of cells expressed on a flask scale were thawed by adding 50 ml of 50 mM sodium phosphate, pH 7.2 buffer. The resuspended cells were disrupted using an ultrasonic cell disruptor (ultrasonic processor, Cole-Parmer). Lysed cells were centrifuged at 12,000 rpm (12,000×g) for 30 minutes, and the supernatant was removed to recover the insoluble inclusion body fraction containing the recombinant fusion polypeptide.

실시예 2.2. 불용성 내포체 가용화Example 2.2. Solubilization of insoluble inclusion bodies

회수된 불용성 내포체 분획에 20 ㎖의 내포체 가용화 완충액(8 M 요소, 20 mM 트리스, 500 mM 염화나트륨, 50 mM 이미다졸, pH 7.4)을 첨가하고 25℃ 온도에서 4시간 동안 진탕배양하여 불용성 분획 내 내포체 형태의 재조합 융합 폴리펩타이드가 가용화될 수 있도록 하였다. 가용화된 불용성 분획 시료는 12,000Х에서 30분간 원심분리하고 상등액을 막필터(0.45/0.2 ㎛)를 통해 여과하였다.To the recovered insoluble inclusion body fraction, 20 ml of inclusion body solubilization buffer (8 M urea, 20 mM Tris, 500 mM sodium chloride, 50 mM imidazole, pH 7.4) was added, and the insoluble fraction was cultured with shaking at 25°C for 4 hours. The recombinant fusion polypeptide in the form of an inner inclusion body was allowed to solubilize. The solubilized insoluble fraction sample was centrifuged at 12,000 Х for 30 minutes, and the supernatant was filtered through a membrane filter (0.45/0.2 µm).

실시예 2.3. PG97-H6TEV-GLP-1K28R의 정제Example 2.3. Purification of PG97-H6TEV-GLP-1K28R

가용화된 불용성 분획 내의 PG97-H6TEV-GLP-1K28R의 정제를 위하여 S9 시료 펌프(Sample pump S9) 및 F9-C 분획 수집기(Fraction collector F9-C)이 장착된 AKTA pure 25 크로마토그라피 시스템(GE Healthcare)을 이용하였다. 가용화된 불용성 분획 시료는 내포체 가용화 완충액(8 M 요소, 20 mM 트리스, 500 mM 염화나트륨, 50 mM 이미다졸, pH 7.4)으로 미리 평형화된 HisTrap FF 1 ㎖ 컬럼(GE Healthcare)에 주입하였다. 주입이 완료된 후 컬럼은 5배 컬럼 부피의 평형 완충액으로 세척하고, 5배 컬럼 부피의 용출 완충액(8 M 요소, 20 mM 트리스, 500 mM 염화나트륨, 500 mM 이미다졸, pH 7.4)을 단계적으로 100%가 되도록 증가시켜 컬럼의 수지에 결합되어 있는 PG97-H6TEV-GLP-1K28R을 용출하였다. 용출 후 얻어진 분획의 결과를 각각 도3에 나타내었다.AKTA pure 25 chromatography system (GE Healthcare) equipped with S9 sample pump (Sample pump S9) and F9-C fraction collector (Fraction collector F9-C) for purification of PG97-H6TEV-GLP-1K28R in solubilized insoluble fraction Was used. Samples of the solubilized insoluble fraction were injected into a HisTrap FF 1 ml column (GE Healthcare) previously equilibrated with an inclusion body solubilization buffer (8 M urea, 20 mM Tris, 500 mM sodium chloride, 50 mM imidazole, pH 7.4). After the injection is complete, the column is washed with 5 column volumes of equilibration buffer, and 5 column volumes of elution buffer (8 M urea, 20 mM Tris, 500 mM sodium chloride, 500 mM imidazole, pH 7.4) are stepwise 100%. PG97-H6TEV-GLP-1K28R bound to the column resin was eluted. The results of the fractions obtained after elution are shown in Figure 3, respectively.

실시예 3. 프로테아제 처리를 통한 링커 서열 절단 Example 3. Linker sequence cleavage through protease treatment

약 5 ㎖의 정제된 PG97-H6TEV-GLP-1K28R융합 폴리펩타이드의 분획을 합친 후 140 ㎖의 희석 완충액(20 mM 트리스, pH 7.4)을 첨가하여 요소의 농도가 1 M이 되도록 희석하였다. 희석된 재조합 융합 폴리펩타이드에 TEV 프로테아제의 최종 농도가 500 nM이 되도록 첨가하고 실온에서 12시간 동안 절단 반응이 진행될 수 있도록 하였다.After combining the fractions of about 5 ml of purified PG97-H6TEV-GLP-1K28R fusion polypeptide, 140 ml of a dilution buffer (20 mM Tris, pH 7.4) was added to dilute the urea concentration to 1 M. The diluted recombinant fusion polypeptide was added so that the final concentration of TEV protease was 500 nM, and the cleavage reaction was allowed to proceed for 12 hours at room temperature.

TEV 프로테아제에 의한 절단 여부를 확인하기 위해, 절단 후 SDS-PAGE 분석을 수행하여 그 결과를 도4에 나타내었다. 한편, 도4를 참조하면, 15.4 kDa의 PG97-H6TEV-GLP-1K28R는 거의 100%의 수율로 N-말단 융합태그, 6 히스티딘 태그 및 TEV 프로테아제 인식서열 융합체인 PG97-H6TEV 단편(12.0 kDa)과 목적 폴리펩타이드인 GLP-1K28R(3.4 kDa)단편으로 분리되는 것을 확인하였다.In order to confirm whether or not cleavage by TEV protease, SDS-PAGE analysis was performed after cleavage and the results are shown in FIG. On the other hand, referring to Figure 4, PG97-H6TEV-GLP-1K28R of 15.4 kDa is a N-terminal fusion tag, 6 histidine tag and TEV protease recognition sequence fusion PG97-H6TEV fragment (12.0 kDa) and It was confirmed that the target polypeptide, GLP-1K28R (3.4 kDa), was separated into a fragment.

실시예 4. GLP-1K28R의 정제 Example 4. Purification of GLP-1K28R

PG97-H6TEV-GLP-1K28R융합 폴리펩타이드에서, TEV 프로테아제의 절단에 의해 유리된 GLP-1K28R의 정제를 위하여 S9 시료 펌프(Sample pump S9) 및 F9-C 분획 수집기(Fraction collector F9-C)이 장착된 AKTA pure 25 크로마토그라피 시스템(GE Healthcare)을 이용하였다. TEV 프로테아제로 절단된 시료를 결합 완충액(10 mM 인산 칼륨, 2% 아세톤니트릴 pH 7.0)으로 미리 평형화된 Resource RPC 3 ㎖ 컬럼(GE Healthcare)에 주입하였다. PG97-H6TEV은 컬럼에 결합되지 않아 통과액(Flow-through) 분획을 수집하였고 용출 완충액(10 mM 인산 칼륨, 70% 아세톤니트릴 pH 7.0)을50%가 되도록 컬럼에 통과시켜 컬럼의 수지에 약하게 결합되어 있는PG97-H6TEV-GLP-1K28R을 제거하였고 용출 완충액을100%가 되도록 컬럼에 통과시켜 컬럼의 수지에 결합되어 있는 GLP-1K28R을 정제하였다. 분획 수집기의 통과액 및 용출 분획은 상기에 기재된 SDS-PAGE 방법으로 분석하였다. In the PG97-H6TEV-GLP-1K28R fusion polypeptide, the S9 sample pump (S9) and the F9-C fraction collector (Fraction collector F9-C) are equipped for purification of GLP-1K28R released by cleavage of TEV protease. AKTA pure 25 chromatography system (GE Healthcare) was used. The sample cut with TEV protease was injected into a Resource RPC 3 ml column (GE Healthcare) previously equilibrated with a binding buffer (10 mM potassium phosphate, 2% acetonitrile pH 7.0). PG97-H6TEV was not bound to the column, so the flow-through fraction was collected, and the elution buffer (10 mM potassium phosphate, 70% acetonitrile pH 7.0) was passed through the column so that it became 50%, and weakly bound to the resin of the column. The PG97-H6TEV-GLP-1K28R was removed, and GLP-1K28R bound to the resin of the column was purified by passing the elution buffer to 100% through the column. The flow through the fraction collector and the eluted fraction were analyzed by the SDS-PAGE method described above.

도 5를 참조하면, GLP-1K28R은 Resource RPC수지에 결합된 후 고농도의 아세톤니트릴에 의해 용출되는 것을 확인하였다. 따라서PG97-H6TEV-GLP-1K28R과 PG97-H6TEV는GLP-1K28R와 아세톤니트릴의 농도를 조절함으로써 용이하게 분리 가능하므로 역상 크로마토그래피를 이용하여GLP-1K28R의 정제를 용이하게 할 수 있다. Referring to FIG. 5, it was confirmed that GLP-1K28R is eluted by a high concentration of acetonitrile after being bound to the Resource RPC resin. Therefore, since PG97-H6TEV-GLP-1K28R and PG97-H6TEV can be easily separated by controlling the concentration of GLP-1K28R and acetonitrile, purification of GLP-1K28R can be facilitated using reverse phase chromatography.

실시예 5. 절단 후 정제된 GLP-1K28R의 분자량 분석Example 5. Molecular weight analysis of purified GLP-1K28R after cleavage

완전한 형태의 PG97-H6TEV-GLP-1K28R의 발현 유무, TEV 프로테아제에 의한 정확한 절단 및 절단 후 얻어진 GLP-1K28R의 변형(modification) 유무를 확인하기 위해 MALTI-TOF MS를 이용한 분자량 분석을 실시하였다. 화학합성된 GLP-1K28R표준 물질의 분자량을 측정한 결과와 본 발명에 따라 수득된 GLP-1K28R의 분자량을 측정한 결과를 도 6 및 도 7에 나타내었다. Molecular weight analysis using MALTI-TOF MS was performed to confirm the presence or absence of expression of the complete form of PG97-H6TEV-GLP-1K28R, accurate cleavage by TEV protease, and modification of GLP-1K28R obtained after cleavage. The results of measuring the molecular weight of the chemically synthesized GLP-1K28R standard substance and the results of measuring the molecular weight of the GLP-1K28R obtained according to the present invention are shown in FIGS. 6 and 7.

도 6및 도 7을 참고하면, PG97-H6TEV-GLP-1K28R로부터 수득된 GLP-1K28R의 분자량(3382.9 Da)은 화학합성된 GLP-1K28R(3382.8 Da)의 분자량과 정확하게 일치하였으므로 대장균 내에서 단백질 분해효소(proteolytic enzyme)에 의한 아미노 혹은 카르복시 말단의 부분적인 절단이나 분해 없이 완전한 형태로 발현된 것을 확인할 수 있었다. 따라서 TEV 프로테아제는 PG97-H6TEV-GLP-1K28R내 인식서열인 ENLFQ 서열을 인식하고 마지막 아미노산인 Q(글루타민)와 GLP-1K28R의 첫번째 아미노산인 H(히스티딘) 사이의 펩타이드 결합을 정확하게 절단하는 것으로 판단된다. 6 and 7, the molecular weight (3382.9 Da) of GLP-1K28R obtained from PG97-H6TEV-GLP-1K28R was exactly the same as that of the chemically synthesized GLP-1K28R (3382.8 Da), so protein degradation in E. coli It was confirmed that the amino or carboxy terminus was expressed in its complete form without partial cleavage or degradation by the proteolytic enzyme. Therefore, TEV protease recognizes the ENLFQ sequence, which is the recognition sequence in PG97-H6TEV-GLP-1K28R, and is judged to accurately cleave the peptide bond between the last amino acid Q (glutamine) and H (histidine), the first amino acid of GLP-1K28R. .

실시예 6. 정제된 GLP-1K28R의 역상 HPLC 분석Example 6. Reverse phase HPLC analysis of purified GLP-1K28R

화학합성된 GLP-1K28R와 본 발명을 통해 생산된 재조합 GLP-1K28R을 완충액 A (0.1% 삼불화초산, 2% 아세톤니트릴)에 평형화된 C18 고성능액체크로마토그래피(HPLC) 컬럼에 주입 후 완충액 B(0.1% 삼불화초산, 80% 아세톤니트릴)의 농도를 선형으로 증가시키는 을 역상 HPLC 방법으로 화학합성된 GLP-1K28R과 재조합 GLP-1K28R의 체류시간 및 순도를 분석하였다. 그 결과, 화학합성 및 재조합 GLP-1K28R모두 동일한 체류시간을 보였고 재조합 GLP-1K28R의 순도는 95% 이상으로 확인되었다(도 8).After injecting the chemically synthesized GLP-1K28R and the recombinant GLP-1K28R produced through the present invention into a C18 high-performance liquid chromatography (HPLC) column equilibrated in buffer A (0.1% trifluoroacetic acid, 2% acetonitrile), buffer B ( The retention time and purity of the chemically synthesized GLP-1K28R and recombinant GLP-1K28R were analyzed by a reverse-phase HPLC method of linearly increasing the concentration of 0.1% trifluoroacetic acid and 80% acetonitrile). As a result, both chemical synthesis and recombinant GLP-1K28R showed the same retention time, and the purity of the recombinant GLP-1K28R was confirmed to be 95% or higher (FIG. 8).

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다.The above description of the present invention is for illustrative purposes only, and those of ordinary skill in the art to which the present invention pertains will be able to understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not limiting. For example, each component described as a single type may be implemented in a distributed manner, and similarly, components described as being distributed may also be implemented in a combined form.

본 발명의 범위는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present invention is indicated by the claims to be described later, and all changes or modified forms derived from the meaning and scope of the claims and their equivalent concepts should be interpreted as being included in the scope of the present invention.

<110> PEPGENE INC. <120> FUSION TAG FOR PREPARING GLUCAGON-LIKE PEPTIDE-1 OR ANALOGUES <130> P20200194KR <150> KR 10-2019-0087637 <151> 2019-07-19 <160> 89 <170> KoPatentIn 3.0 <210> 1 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43 <400> 1 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val 35 40 <210> 2 <211> 57 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57 <400> 2 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu 50 55 <210> 3 <211> 68 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68 <400> 3 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn 65 <210> 4 <211> 79 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79 <400> 4 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp 65 70 75 <210> 5 <211> 88 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88 <400> 5 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu 85 <210> 6 <211> 97 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97 <400> 6 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala <210> 7 <211> 98 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO <400> 7 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly <210> 8 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6 <400> 8 His His His His His His 1 5 <210> 9 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Tobacco etch virus (TEV) protease recognition site <400> 9 Glu Asn Leu Tyr Phe Gln 1 5 <210> 10 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV <400> 10 Met His His His His His His Glu Asn Leu Tyr Phe Gln 1 5 10 <210> 11 <211> 57 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43-H6TEV <400> 11 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Pro Trp His His His 35 40 45 His His His Glu Asn Leu Tyr Phe Gln 50 55 <210> 12 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57-H6TEV <400> 12 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Pro Trp His His His His His 50 55 60 His Glu Asn Leu Tyr Phe Gln 65 70 <210> 13 <211> 82 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68-H6TEV <400> 13 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Pro Trp His His His His His His Glu Asn Leu Tyr 65 70 75 80 Phe Gln <210> 14 <211> 93 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79-H6TEV <400> 14 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Pro 65 70 75 80 Trp His His His His His His Glu Asn Leu Tyr Phe Gln 85 90 <210> 15 <211> 102 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88-H6TEV <400> 15 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Pro Trp His His His His His His 85 90 95 Glu Asn Leu Tyr Phe Gln 100 <210> 16 <211> 109 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV <400> 16 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln 100 105 <210> 17 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Liraglutide precursor peptide (GLP-1K28R) <400> 17 His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 1 5 10 15 Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 20 25 30 <210> 18 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-GLP-1K28R <400> 18 Met His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 1 5 10 15 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 20 25 30 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 35 40 <210> 19 <211> 88 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43-H6TEV-GLP-1K28R <400> 19 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Pro Trp His His His 35 40 45 His His His Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr 50 55 60 Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile 65 70 75 80 Ala Trp Leu Val Arg Gly Arg Gly 85 <210> 20 <211> 102 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57-H6TEV-GLP-1K28R <400> 20 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Pro Trp His His His His His 50 55 60 His Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp 65 70 75 80 Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp 85 90 95 Leu Val Arg Gly Arg Gly 100 <210> 21 <211> 113 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68-H6TEV-GLP-1K28R <400> 21 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Pro Trp His His His His His His Glu Asn Leu Tyr 65 70 75 80 Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu 85 90 95 Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg 100 105 110 Gly <210> 22 <211> 124 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79-H6TEV-GLP-1K28R <400> 22 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Pro 65 70 75 80 Trp His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 85 90 95 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 100 105 110 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 115 120 <210> 23 <211> 133 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88-H6TEV-GLP-1K28R <400> 23 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Pro Trp His His His His His His 85 90 95 Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp Val 100 105 110 Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu 115 120 125 Val Arg Gly Arg Gly 130 <210> 24 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-GLP-1K28R <400> 24 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 100 105 110 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 115 120 125 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 130 135 140 <210> 25 <211> 141 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-GLP-1K28R <400> 25 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln His Ala 100 105 110 Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala 115 120 125 Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 130 135 140 <210> 26 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for hPTH1-34 (Teriparatide) <400> 26 Ser Val Ser Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn 1 5 10 15 Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His 20 25 30 Asn Phe <210> 27 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hPTH1-34 <400> 27 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val Ser 1 5 10 15 Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met Glu 20 25 30 Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 35 40 45 <210> 28 <211> 143 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hPTH1-34 <400> 28 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val Ser 100 105 110 Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met Glu 115 120 125 Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 130 135 140 <210> 29 <211> 144 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hPTH1-34 <400> 29 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val 100 105 110 Ser Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met 115 120 125 Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 130 135 140 <210> 30 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Teduglutide (GLP-2A2G) <400> 30 His Gly Asp Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn 1 5 10 15 Leu Ala Ala Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr 20 25 30 Asp <210> 31 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-GLP-2A2G <400> 31 Met His His His His His His Glu Asn Leu Tyr Phe Gln His Gly Asp 1 5 10 15 Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala Ala 20 25 30 Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 35 40 45 <210> 32 <211> 142 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-GLP-2A2G <400> 32 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln His Gly Asp 100 105 110 Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala Ala 115 120 125 Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 130 135 140 <210> 33 <211> 143 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-GLP-2A2G <400> 33 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln His Gly 100 105 110 Asp Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala 115 120 125 Ala Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 130 135 140 <210> 34 <211> 60 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Ecallantide <400> 34 Glu Ala Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys 1 5 10 15 Arg Ala Ala His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys 20 25 30 Glu Glu Phe Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu 35 40 45 Ser Leu Glu Glu Cys Lys Lys Met Cys Thr Arg Asp 50 55 60 <210> 35 <211> 73 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-Ecallantide <400> 35 Met His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala Met 1 5 10 15 His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala Ala 20 25 30 His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe 35 40 45 Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu 50 55 60 Glu Cys Lys Lys Met Cys Thr Arg Asp 65 70 <210> 36 <211> 169 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-Ecallantide <400> 36 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala Met 100 105 110 His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala Ala 115 120 125 His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe 130 135 140 Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu 145 150 155 160 Glu Cys Lys Lys Met Cys Thr Arg Asp 165 <210> 37 <211> 170 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-Ecallantide <400> 37 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala 100 105 110 Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala 115 120 125 Ala His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu 130 135 140 Phe Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu 145 150 155 160 Glu Glu Cys Lys Lys Met Cys Thr Arg Asp 165 170 <210> 38 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Nesiritide <400> 38 Ser Pro Lys Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp 1 5 10 15 Arg Ile Ser Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 20 25 30 <210> 39 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-Nesiritide <400> 39 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro Lys 1 5 10 15 Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile Ser 20 25 30 Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 35 40 45 <210> 40 <211> 141 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-Nesiritide <400> 40 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro Lys 100 105 110 Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile Ser 115 120 125 Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 130 135 140 <210> 41 <211> 142 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-Nesiritide <400> 41 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro 100 105 110 Lys Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile 115 120 125 Ser Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 130 135 140 <210> 42 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Human Epidermal growth factor (hEGF) <400> 42 Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 1 5 10 15 Asp Gly Val Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn 20 25 30 Cys Val Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys 35 40 45 Trp Trp Glu Leu Arg 50 <210> 43 <211> 66 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hEGF <400> 43 Met His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser Asp 1 5 10 15 Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 20 25 30 Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val 35 40 45 Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu 50 55 60 Leu Arg 65 <210> 44 <211> 162 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hEGF <400> 44 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser Asp 100 105 110 Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 115 120 125 Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val 130 135 140 Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu 145 150 155 160 Leu Arg <210> 45 <211> 163 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hEGF <400> 45 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser 100 105 110 Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly 115 120 125 Val Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val 130 135 140 Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp 145 150 155 160 Glu Leu Arg <210> 46 <211> 165 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Interferon beta 1b (INF-beta1b) <400> 46 Ser Tyr Asn Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser 1 5 10 15 Gln Lys Leu Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys 20 25 30 Asp Arg Met Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln 35 40 45 Phe Gln Lys Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn 50 55 60 Ile Phe Ala Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu 65 70 75 80 Thr Ile Val Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His 85 90 95 Leu Lys Thr Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg 100 105 110 Gly Lys Leu Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile 115 120 125 Leu His Tyr Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile 130 135 140 Val Arg Val Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr 145 150 155 160 Gly Tyr Leu Arg Asn 165 <210> 47 <211> 178 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-INF-beta1b <400> 47 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr Asn 1 5 10 15 Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys Leu 20 25 30 Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg Met 35 40 45 Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln Lys 50 55 60 Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe Ala 65 70 75 80 Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile Val 85 90 95 Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys Thr 100 105 110 Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys Leu 115 120 125 Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His Tyr 130 135 140 Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg Val 145 150 155 160 Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr Leu 165 170 175 Arg Asn <210> 48 <211> 274 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-INF-beta1b <400> 48 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr Asn 100 105 110 Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys Leu 115 120 125 Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg Met 130 135 140 Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln Lys 145 150 155 160 Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe Ala 165 170 175 Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile Val 180 185 190 Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys Thr 195 200 205 Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys Leu 210 215 220 Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His Tyr 225 230 235 240 Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg Val 245 250 255 Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr Leu 260 265 270 Arg Asn <210> 49 <211> 275 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-INF-beta1b <400> 49 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr 100 105 110 Asn Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys 115 120 125 Leu Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg 130 135 140 Met Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln 145 150 155 160 Lys Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe 165 170 175 Ala Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile 180 185 190 Val Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys 195 200 205 Thr Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys 210 215 220 Leu Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His 225 230 235 240 Tyr Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg 245 250 255 Val Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr 260 265 270 Leu Arg Asn 275 <210> 50 <211> 191 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Human growth hormone (hGH) <400> 50 Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg 1 5 10 15 Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu 20 25 30 Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro 35 40 45 Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg 50 55 60 Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu 65 70 75 80 Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val 85 90 95 Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp 100 105 110 Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu 115 120 125 Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser 130 135 140 Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr 145 150 155 160 Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe 165 170 175 Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 180 185 190 <210> 51 <211> 204 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hGH <400> 51 Met His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro Thr 1 5 10 15 Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg 20 25 30 Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr 35 40 45 Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser 50 55 60 Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr 65 70 75 80 Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile 85 90 95 Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn 100 105 110 Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys 115 120 125 Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly 130 135 140 Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp 145 150 155 160 Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu 165 170 175 Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile 180 185 190 Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 195 200 <210> 52 <211> 300 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hGH <400> 52 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro Thr 100 105 110 Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg 115 120 125 Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr 130 135 140 Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser 145 150 155 160 Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr 165 170 175 Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile 180 185 190 Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn 195 200 205 Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys 210 215 220 Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly 225 230 235 240 Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp 245 250 255 Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu 260 265 270 Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile 275 280 285 Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 290 295 300 <210> 53 <211> 301 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hGH <400> 53 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro 100 105 110 Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His 115 120 125 Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala 130 135 140 Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr 145 150 155 160 Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu 165 170 175 Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu 180 185 190 Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala 195 200 205 Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu 210 215 220 Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp 225 230 235 240 Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe 245 250 255 Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu 260 265 270 Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg 275 280 285 Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 290 295 300 <210> 54 <211> 86 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Insulin glargine precursor (INSG) <400> 54 Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 1 5 10 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg 20 25 30 Glu Ala Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro 35 40 45 Gly Ala Gly Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys 50 55 60 Arg Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln 65 70 75 80 Leu Glu Asn Tyr Cys Gly 85 <210> 55 <211> 99 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-INSG <400> 55 Met His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val Asn 1 5 10 15 Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys 20 25 30 Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu 35 40 45 Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala Gly 50 55 60 Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly Ile 65 70 75 80 Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn 85 90 95 Tyr Cys Gly <210> 56 <211> 195 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-INSG <400> 56 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val Asn 100 105 110 Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys 115 120 125 Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu 130 135 140 Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala Gly 145 150 155 160 Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly Ile 165 170 175 Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn 180 185 190 Tyr Cys Gly 195 <210> 57 <211> 196 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-INSG <400> 57 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val 100 105 110 Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val 115 120 125 Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala 130 135 140 Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala 145 150 155 160 Gly Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly 165 170 175 Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu 180 185 190 Asn Tyr Cys Gly 195 <210> 58 <211> 135 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-GLP-1K28R <400> 58 atgcatcatc atcaccacca cgaaaacctg tatttccagc atgcggaagg cacctttacc 60 agcgatgtga gcagctatct ggaaggccag gcggcgaaag aatttattgc gtggctggtg 120 cgtggccgcg gctaa 135 <210> 59 <211> 267 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG43-H6TEV-GLP-1K28R <400> 59 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcc catggcatca tcatcaccac cacgaaaacc tgtatttcca gcatgcggaa 180 ggcaccttta ccagcgatgt gagcagctat ctggaaggcc aggcggcgaa agaatttatt 240 gcgtggctgg tgcgtggccg cggctaa 267 <210> 60 <211> 309 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG57-H6TEV-GLP-1K28R <400> 60 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct gccatggcat 180 catcatcacc accacgaaaa cctgtatttc cagcatgcgg aaggcacctt taccagcgat 240 gtgagcagct atctggaagg ccaggcggcg aaagaattta ttgcgtggct ggtgcgtggc 300 cgcggctaa 309 <210> 61 <211> 342 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG68-H6TEV-GLP-1K28R <400> 61 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacccatgg catcatcatc accaccacga aaacctgtat 240 ttccagcatg cggaaggcac ctttaccagc gatgtgagca gctatctgga aggccaggcg 300 gcgaaagaat ttattgcgtg gctggtgcgt ggccgcggct aa 342 <210> 62 <211> 375 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG79-H6TEV-GLP-1K28R <400> 62 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgaccca 240 tggcatcatc atcaccacca cgaaaacctg tatttccagc atgcggaagg cacctttacc 300 agcgatgtga gcagctatct ggaaggccag gcggcgaaag aatttattgc gtggctggtg 360 cgtggccgcg gctaa 375 <210> 63 <211> 402 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG88-H6TEV-GLP-1K28R <400> 63 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaccatgg catcatcatc accaccacga aaacctgtat 300 ttccagcatg cggaaggcac ctttaccagc gatgtgagca gctatctgga aggccaggcg 360 gcgaaagaat ttattgcgtg gctggtgcgt ggccgcggct aa 402 <210> 64 <211> 423 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-GLP-1K28R <400> 64 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagcat gcggaaggca cctttaccag cgatgtgagc 360 agctatctgg aaggccaggc ggcgaaagaa tttattgcgt ggctggtgcg tggccgcggc 420 taa 423 <210> 65 <211> 426 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-GLP-1K28R <400> 65 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catgcggaag gcacctttac cagcgatgtg 360 agcagctatc tggaaggcca ggcggcgaaa gaatttattg cgtggctggt gcgtggccgc 420 ggctaa 426 <210> 66 <211> 144 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hPTH1-34 <400> 66 atgcatcatc atcaccacca cgaaaacctg tatttccagt ctgtgagtga aatacagctt 60 atgcataacc tgggaaaaca tctgaactcg atggagagag tagaatggct gcgtaagaag 120 ctgcaggatg tgcacaattt ttaa 144 <210> 67 <211> 531 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hPTH1-34 <400> 67 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catgcggaag gcacctttac cagcgatgtg 360 agcagctatc tggaaggcca ggcggcgaaa gaatttattg cgtggctggt gcgtggccgc 420 ggctaaagcg tgagcgaaat tcagctgatg cataacctgg gcaaacatct gaacagcatg 480 gaacgcgtgg aatggctgcg caaaaaactg caggatgtgc ataactttta a 531 <210> 68 <211> 435 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hPTH1-34 <400> 68 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agcgtgagcg aaattcagct gatgcataac 360 ctgggcaaac atctgaacag catggaacgc gtggaatggc tgcgcaaaaa actgcaggat 420 gtgcataact tttaa 435 <210> 69 <211> 141 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-GLP-2A2G <400> 69 atgcatcatc atcaccacca cgaaaacctg tatttccagc atggcgatgg cagctttagc 60 gatgaaatga acaccattct ggataacctg gcggcgcgcg attttattaa ctggctgatt 120 cagaccaaaa ttaccgatta a 141 <210> 70 <211> 429 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-GLP-2A2G <400> 70 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagcat ggcgatggca gctttagcga tgaaatgaac 360 accattctgg ataacctggc ggcgcgcgat tttattaact ggctgattca gaccaaaatt 420 accgattaa 429 <210> 71 <211> 432 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-GLP-2A2G <400> 71 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catggcgatg gcagctttag cgatgaaatg 360 aacaccattc tggataacct ggcggcgcgc gattttatta actggctgat tcagaccaaa 420 attaccgatt aa 432 <210> 72 <211> 222 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-Ecallantide <400> 72 atgcatcatc atcaccacca cgaaaacctg tatttccagg aagcgatgca tagcttttgc 60 gcgtttaaag cggatgatgg cccgtgccgc gcggcgcatc cgcgctggtt ttttaacatt 120 tttacccgcc agtgcgaaga atttatttat ggcggctgcg aaggcaacca gaaccgcttt 180 gaaagcctgg aagaatgcaa aaaaatgtgc acccgcgatt aa 222 <210> 73 <211> 510 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-Ecallantide <400> 73 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccaggaa gcgatgcata gcttttgcgc gtttaaagcg 360 gatgatggcc cgtgccgcgc ggcgcatccg cgctggtttt ttaacatttt tacccgccag 420 tgcgaagaat ttatttatgg cggctgcgaa ggcaaccaga accgctttga aagcctggaa 480 gaatgcaaaa aaatgtgcac ccgcgattaa 510 <210> 74 <211> 513 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-Ecallantide <400> 74 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag gaagcgatgc atagcttttg cgcgtttaaa 360 gcggatgatg gcccgtgccg cgcggcgcat ccgcgctggt tttttaacat ttttacccgc 420 cagtgcgaag aatttattta tggcggctgc gaaggcaacc agaaccgctt tgaaagcctg 480 gaagaatgca aaaaaatgtg cacccgcgat taa 513 <210> 75 <211> 138 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-Nesiritide <400> 75 atgcatcatc atcaccacca cgaaaacctg tatttccaga gcccgaaaat ggtgcagggc 60 agcggctgct ttggccgcaa aatggatcgc attagcagca gcagcggcct gggctgcaaa 120 gtgctgcgcc gccattaa 138 <210> 76 <211> 426 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-Nesiritide <400> 76 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagagc ccgaaaatgg tgcagggcag cggctgcttt 360 ggccgcaaaa tggatcgcat tagcagcagc agcggcctgg gctgcaaagt gctgcgccgc 420 cattaa 426 <210> 77 <211> 429 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-Nesiritide <400> 77 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agcccgaaaa tggtgcaggg cagcggctgc 360 tttggccgca aaatggatcg cattagcagc agcagcggcc tgggctgcaa agtgctgcgc 420 cgccattaa 429 <210> 78 <211> 201 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hEGF <400> 78 atgcatcatc atcaccacca cgaaaacctg tatttccaga actccgattc tgagtgccca 60 ctttcccacg atggatactg ccttcatgat ggcgtgtgca tgtacatcga ggctctcgat 120 aagtacgcct gcaactgcgt tgtgggttac attggagaga gatgccagta cagggatctc 180 aagtggtggg agcttaggta a 201 <210> 79 <211> 489 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hEGF <400> 79 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagaac tccgattctg agtgcccact ttcccacgat 360 ggatactgcc ttcatgatgg cgtgtgcatg tacatcgagg ctctcgataa gtacgcctgc 420 aactgcgttg tgggttacat tggagagaga tgccagtaca gggatctcaa gtggtgggag 480 cttaggtaa 489 <210> 80 <211> 492 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hEGF <400> 80 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag aactccgatt ctgagtgccc actttcccac 360 gatggatact gccttcatga tggcgtgtgc atgtacatcg aggctctcga taagtacgcc 420 tgcaactgcg ttgtgggtta cattggagag agatgccagt acagggatct caagtggtgg 480 gagcttaggt aa 492 <210> 81 <211> 537 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-INF-beta1b <400> 81 atgcatcatc atcaccacca cgaaaacctg tatttccaga gctataacct gctgggcttt 60 ctgcagcgca gcagcaactt tcagagccag aaactgctgt ggcagctgaa cggccgcctg 120 gaatattgcc tgaaagatcg catgaacttt gatattccgg aagaaattaa acagctgcag 180 cagtttcaga aagaagatgc ggcgctgacc atttatgaaa tgctgcagaa catttttgcg 240 atttttcgcc aggatagcag cagcaccggc tggaacgaaa ccattgtgga aaacctgctg 300 gcgaacgtgt atcatcagat taaccatctg aaaaccgtgc tggaagaaaa actggaaaaa 360 gaagatttta cccgcggcaa actgatgagc agcctgcatc tgaaacgcta ttatggccgc 420 attctgcatt atctgaaagc gaaagaatat agccattgcg cgtggaccat tgtgcgcgtg 480 gaaattctgc gcaactttta ttttattaac cgcctgaccg gctatctgcg caactaa 537 <210> 82 <211> 825 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-INF-beta1b <400> 82 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagagc tataacctgc tgggctttct gcagcgcagc 360 agcaactttc agagccagaa actgctgtgg cagctgaacg gccgcctgga atattgcctg 420 aaagatcgca tgaactttga tattccggaa gaaattaaac agctgcagca gtttcagaaa 480 gaagatgcgg cgctgaccat ttatgaaatg ctgcagaaca tttttgcgat ttttcgccag 540 gatagcagca gcaccggctg gaacgaaacc attgtggaaa acctgctggc gaacgtgtat 600 catcagatta accatctgaa aaccgtgctg gaagaaaaac tggaaaaaga agattttacc 660 cgcggcaaac tgatgagcag cctgcatctg aaacgctatt atggccgcat tctgcattat 720 ctgaaagcga aagaatatag ccattgcgcg tggaccattg tgcgcgtgga aattctgcgc 780 aacttttatt ttattaaccg cctgaccggc tatctgcgca actaa 825 <210> 83 <211> 828 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-INF-beta1b <400> 83 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agctataacc tgctgggctt tctgcagcgc 360 agcagcaact ttcagagcca gaaactgctg tggcagctga acggccgcct ggaatattgc 420 ctgaaagatc gcatgaactt tgatattccg gaagaaatta aacagctgca gcagtttcag 480 aaagaagatg cggcgctgac catttatgaa atgctgcaga acatttttgc gatttttcgc 540 caggatagca gcagcaccgg ctggaacgaa accattgtgg aaaacctgct ggcgaacgtg 600 tatcatcaga ttaaccatct gaaaaccgtg ctggaagaaa aactggaaaa agaagatttt 660 acccgcggca aactgatgag cagcctgcat ctgaaacgct attatggccg cattctgcat 720 tatctgaaag cgaaagaata tagccattgc gcgtggacca ttgtgcgcgt ggaaattctg 780 cgcaactttt attttattaa ccgcctgacc ggctatctgc gcaactaa 828 <210> 84 <211> 612 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hGH <400> 84 catcatcatc accaccacga aaacctgtat ttccagtttc caaccattcc actgtctcgc 60 ctgtttgata atgcaatgct gcgtgcacat cgtctgcatc agctggcatt tgatacctat 120 caggaatttg aagaagcata tattccaaaa gaacagaaat attctttcct gcaaaatcca 180 cagacctctc tgtgtttttc tgaatctatt ccaaccccat ccaatcgtga agagacccaa 240 cagaaatcta atctggaact gctgcgtatc tctctgttac tgattcagtc ttggctggaa 300 ccagtgcagt tcctgcgttc tgtgttcgca aattctctgg tttatggtgc atctgattct 360 aatgtttatg atctgctgaa agatttagag gaaggaattc agaccctgat gggtcgtctg 420 gaagatggtt ctccacgtac cggtcagatt tttaaacaga cctattctaa atttgatacc 480 aattctcata atgacgatgc actgctgaaa aactatggtc tgctgtactg ctttcgtaaa 540 gatatggata aagttgaaac ctttctgcgt attgttcagt gtcgttctgt tgaaggttct 600 tgtggttttt aa 612 <210> 85 <211> 903 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hGH <400> 85 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagttt ccaaccattc cactgtctcg cctgtttgat 360 aatgcaatgc tgcgtgcaca tcgtctgcat cagctggcat ttgataccta tcaggaattt 420 gaagaagcat atattccaaa agaacagaaa tattctttcc tgcaaaatcc acagacctct 480 ctgtgttttt ctgaatctat tccaacccca tccaatcgtg aagagaccca acagaaatct 540 aatctggaac tgctgcgtat ctctctgtta ctgattcagt cttggctgga accagtgcag 600 ttcctgcgtt ctgtgttcgc aaattctctg gtttatggtg catctgattc taatgtttat 660 gatctgctga aagatttaga ggaaggaatt cagaccctga tgggtcgtct ggaagatggt 720 tctccacgta ccggtcagat ttttaaacag acctattcta aatttgatac caattctcat 780 aatgacgatg cactgctgaa aaactatggt ctgctgtact gctttcgtaa agatatggat 840 aaagttgaaa cctttctgcg tattgttcag tgtcgttctg ttgaaggttc ttgtggtttt 900 taa 903 <210> 86 <211> 906 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hGH <400> 86 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag tttccaacca ttccactgtc tcgcctgttt 360 gataatgcaa tgctgcgtgc acatcgtctg catcagctgg catttgatac ctatcaggaa 420 tttgaagaag catatattcc aaaagaacag aaatattctt tcctgcaaaa tccacagacc 480 tctctgtgtt tttctgaatc tattccaacc ccatccaatc gtgaagagac ccaacagaaa 540 tctaatctgg aactgctgcg tatctctctg ttactgattc agtcttggct ggaaccagtg 600 cagttcctgc gttctgtgtt cgcaaattct ctggtttatg gtgcatctga ttctaatgtt 660 tatgatctgc tgaaagattt agaggaagga attcagaccc tgatgggtcg tctggaagat 720 ggttctccac gtaccggtca gatttttaaa cagacctatt ctaaatttga taccaattct 780 cataatgacg atgcactgct gaaaaactat ggtctgctgt actgctttcg taaagatatg 840 gataaagttg aaacctttct gcgtattgtt cagtgtcgtt ctgttgaagg ttcttgtggt 900 ttttaa 906 <210> 87 <211> 300 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-INSG <400> 87 atgcatcatc atcaccacca cgaaaacctg tatttccagt ttgtgaacca gcatctgtgc 60 ggcagccatc tggtggaagc gctgtatctg gtgtgcggcg aacgcggctt tttttatacc 120 ccgaaaaccc gccgcgaagc ggaagatctg caggtgggcc aggtggaact gggcggcggc 180 ccgggcgcgg gcagcctgca gccgctggcg ctggaaggca gcctgcagaa acgcggcatt 240 gtggaacagt gctgcaccag catttgcagc ctgtatcagc tggaaaacta ttgcggctaa 300 300 <210> 88 <211> 588 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-INSG <400> 88 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagttt gtgaaccagc atctgtgcgg cagccatctg 360 gtggaagcgc tgtatctggt gtgcggcgaa cgcggctttt tttatacccc gaaaacccgc 420 cgcgaagcgg aagatctgca ggtgggccag gtggaactgg gcggcggccc gggcgcgggc 480 agcctgcagc cgctggcgct ggaaggcagc ctgcagaaac gcggcattgt ggaacagtgc 540 tgcaccagca tttgcagcct gtatcagctg gaaaactatt gcggctaa 588 <210> 89 <211> 591 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-INSG <400> 89 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag tttgtgaacc agcatctgtg cggcagccat 360 ctggtggaag cgctgtatct ggtgtgcggc gaacgcggct ttttttatac cccgaaaacc 420 cgccgcgaag cggaagatct gcaggtgggc caggtggaac tgggcggcgg cccgggcgcg 480 ggcagcctgc agccgctggc gctggaaggc agcctgcaga aacgcggcat tgtggaacag 540 tgctgcacca gcatttgcag cctgtatcag ctggaaaact attgcggcta a 591 <110> PEPGENE INC. <120> FUSION TAG FOR PREPARING GLUCAGON-LIKE PEPTIDE-1 OR ANALOGUES <130> P20200194KR <150> KR 10-2019-0087637 <151> 2019-07-19 <160> 89 <170> KoPatentIn 3.0 <210> 1 <211> 43 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43 <400> 1 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val 35 40 <210> 2 <211> 57 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57 <400> 2 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu 50 55 <210> 3 <211> 68 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68 <400> 3 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn 65 <210> 4 <211> 79 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79 <400> 4 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp 65 70 75 <210> 5 <211> 88 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88 <400> 5 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu 85 <210> 6 <211> 97 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97 <400> 6 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala <210> 7 <211> 98 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO <400> 7 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly <210> 8 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6 <400> 8 His His His His His His 1 5 <210> 9 <211> 6 <212> PRT <213> Artificial Sequence <220> <223> Tobacco etch virus (TEV) protease recognition site <400> 9 Glu Asn Leu Tyr Phe Gln 1 5 <210> 10 <211> 13 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV <400> 10 Met His His His His His His Glu Asn Leu Tyr Phe Gln 1 5 10 <210> 11 <211> 57 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43-H6TEV <400> 11 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Pro Trp His His His 35 40 45 His His His Glu Asn Leu Tyr Phe Gln 50 55 <210> 12 <211> 71 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57-H6TEV <400> 12 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Pro Trp His His His His His 50 55 60 His Glu Asn Leu Tyr Phe Gln 65 70 <210> 13 <211> 82 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68-H6TEV <400> 13 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Pro Trp His His His His His His Glu Asn Leu Tyr 65 70 75 80 Phe Gln <210> 14 <211> 93 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79-H6TEV <400> 14 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Pro 65 70 75 80 Trp His His His His His His Glu Asn Leu Tyr Phe Gln 85 90 <210> 15 <211> 102 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88-H6TEV <400> 15 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Pro Trp His His His His His His 85 90 95 Glu Asn Leu Tyr Phe Gln 100 <210> 16 <211> 109 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV <400> 16 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln 100 105 <210> 17 <211> 31 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Liraglutide precursor peptide (GLP-1K28R) <400> 17 His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly 1 5 10 15 Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 20 25 30 <210> 18 <211> 44 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-GLP-1K28R <400> 18 Met His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 1 5 10 15 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 20 25 30 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 35 40 <210> 19 <211> 88 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG43-H6TEV-GLP-1K28R <400> 19 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Pro Trp His His His 35 40 45 His His His Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr 50 55 60 Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile 65 70 75 80 Ala Trp Leu Val Arg Gly Arg Gly 85 <210> 20 <211> 102 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG57-H6TEV-GLP-1K28R <400> 20 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Pro Trp His His His His His 50 55 60 His Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp 65 70 75 80 Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp 85 90 95 Leu Val Arg Gly Arg Gly 100 <210> 21 <211> 113 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG68-H6TEV-GLP-1K28R <400> 21 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Pro Trp His His His His His His Glu Asn Leu Tyr 65 70 75 80 Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu 85 90 95 Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg 100 105 110 Gly <210> 22 <211> 124 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG79-H6TEV-GLP-1K28R <400> 22 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Pro 65 70 75 80 Trp His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 85 90 95 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 100 105 110 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 115 120 <210> 23 <211> 133 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG88-H6TEV-GLP-1K28R <400> 23 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Pro Trp His His His His His His 85 90 95 Glu Asn Leu Tyr Phe Gln His Ala Glu Gly Thr Phe Thr Ser Asp Val 100 105 110 Ser Ser Tyr Leu Glu Gly Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu 115 120 125 Val Arg Gly Arg Gly 130 <210> 24 <211> 140 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-GLP-1K28R <400> 24 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln His Ala Glu 100 105 110 Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala Ala 115 120 125 Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 130 135 140 <210> 25 <211> 141 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-GLP-1K28R <400> 25 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln His Ala 100 105 110 Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly Gln Ala 115 120 125 Ala Lys Glu Phe Ile Ala Trp Leu Val Arg Gly Arg Gly 130 135 140 <210> 26 <211> 34 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for hPTH1-34 (Teriparatide) <400> 26 Ser Val Ser Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn 1 5 10 15 Ser Met Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His 20 25 30 Asn Phe <210> 27 <211> 47 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hPTH1-34 <400> 27 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val Ser 1 5 10 15 Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met Glu 20 25 30 Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 35 40 45 <210> 28 <211> 143 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hPTH1-34 <400> 28 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val Ser 100 105 110 Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met Glu 115 120 125 Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 130 135 140 <210> 29 <211> 144 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hPTH1-34 <400> 29 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Val 100 105 110 Ser Glu Ile Gln Leu Met His Asn Leu Gly Lys His Leu Asn Ser Met 115 120 125 Glu Arg Val Glu Trp Leu Arg Lys Lys Leu Gln Asp Val His Asn Phe 130 135 140 <210> 30 <211> 33 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Teduglutide (GLP-2A2G) <400> 30 His Gly Asp Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn 1 5 10 15 Leu Ala Ala Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr 20 25 30 Asp <210> 31 <211> 46 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-GLP-2A2G <400> 31 Met His His His His His His Glu Asn Leu Tyr Phe Gln His Gly Asp 1 5 10 15 Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala Ala 20 25 30 Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 35 40 45 <210> 32 <211> 142 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-GLP-2A2G <400> 32 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln His Gly Asp 100 105 110 Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala Ala 115 120 125 Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 130 135 140 <210> 33 <211> 143 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-GLP-2A2G <400> 33 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln His Gly 100 105 110 Asp Gly Ser Phe Ser Asp Glu Met Asn Thr Ile Leu Asp Asn Leu Ala 115 120 125 Ala Arg Asp Phe Ile Asn Trp Leu Ile Gln Thr Lys Ile Thr Asp 130 135 140 <210> 34 <211> 60 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Ecallantide <400> 34 Glu Ala Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys 1 5 10 15 Arg Ala Ala His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys 20 25 30 Glu Glu Phe Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu 35 40 45 Ser Leu Glu Glu Cys Lys Lys Met Cys Thr Arg Asp 50 55 60 <210> 35 <211> 73 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-Ecallantide <400> 35 Met His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala Met 1 5 10 15 His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala Ala 20 25 30 His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe 35 40 45 Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu 50 55 60 Glu Cys Lys Lys Met Cys Thr Arg Asp 65 70 <210> 36 <211> 169 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-Ecallantide <400> 36 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala Met 100 105 110 His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala Ala 115 120 125 His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe 130 135 140 Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu 145 150 155 160 Glu Cys Lys Lys Met Cys Thr Arg Asp 165 <210> 37 <211> 170 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-Ecallantide <400> 37 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Glu Ala 100 105 110 Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Arg Ala 115 120 125 Ala His Pro Arg Trp Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu 130 135 140 Phe Ile Tyr Gly Gly Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu 145 150 155 160 Glu Glu Cys Lys Lys Met Cys Thr Arg Asp 165 170 <210> 38 <211> 32 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Nesiritide <400> 38 Ser Pro Lys Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp 1 5 10 15 Arg Ile Ser Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 20 25 30 <210> 39 <211> 45 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-Nesiritide <400> 39 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro Lys 1 5 10 15 Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile Ser 20 25 30 Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 35 40 45 <210> 40 <211> 141 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-Nesiritide <400> 40 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro Lys 100 105 110 Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile Ser 115 120 125 Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 130 135 140 <210> 41 <211> 142 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-Nesiritide <400> 41 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Pro 100 105 110 Lys Met Val Gln Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Arg Ile 115 120 125 Ser Ser Ser Ser Gly Leu Gly Cys Lys Val Leu Arg Arg His 130 135 140 <210> 42 <211> 53 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Human Epidermal growth factor (hEGF) <400> 42 Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 1 5 10 15 Asp Gly Val Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn 20 25 30 Cys Val Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys 35 40 45 Trp Trp Glu Leu Arg 50 <210> 43 <211> 66 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hEGF <400> 43 Met His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser Asp 1 5 10 15 Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 20 25 30 Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val 35 40 45 Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu 50 55 60 Leu Arg 65 <210> 44 <211> 162 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hEGF <400> 44 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser Asp 100 105 110 Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 115 120 125 Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val 130 135 140 Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp Glu 145 150 155 160 Leu Arg <210> 45 <211> 163 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hEGF <400> 45 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Asn Ser 100 105 110 Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp Gly 115 120 125 Val Cys Met Tyr Ile Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val 130 135 140 Val Gly Tyr Ile Gly Glu Arg Cys Gln Tyr Arg Asp Leu Lys Trp Trp 145 150 155 160 Glu Leu Arg <210> 46 <211> 165 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Interferon beta 1b (INF-beta1b) <400> 46 Ser Tyr Asn Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser 1 5 10 15 Gln Lys Leu Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys 20 25 30 Asp Arg Met Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln 35 40 45 Phe Gln Lys Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn 50 55 60 Ile Phe Ala Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu 65 70 75 80 Thr Ile Val Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His 85 90 95 Leu Lys Thr Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg 100 105 110 Gly Lys Leu Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile 115 120 125 Leu His Tyr Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile 130 135 140 Val Arg Val Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr 145 150 155 160 Gly Tyr Leu Arg Asn 165 <210> 47 <211> 178 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-INF-beta1b <400> 47 Met His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr Asn 1 5 10 15 Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys Leu 20 25 30 Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg Met 35 40 45 Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln Lys 50 55 60 Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe Ala 65 70 75 80 Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile Val 85 90 95 Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys Thr 100 105 110 Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys Leu 115 120 125 Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His Tyr 130 135 140 Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg Val 145 150 155 160 Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr Leu 165 170 175 Arg Asn <210> 48 <211> 274 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-INF-beta1b <400> 48 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr Asn 100 105 110 Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys Leu 115 120 125 Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg Met 130 135 140 Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln Lys 145 150 155 160 Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe Ala 165 170 175 Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile Val 180 185 190 Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys Thr 195 200 205 Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys Leu 210 215 220 Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His Tyr 225 230 235 240 Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg Val 245 250 255 Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr Leu 260 265 270 Arg Asn <210> 49 <211> 275 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-INF-beta1b <400> 49 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Ser Tyr 100 105 110 Asn Leu Leu Gly Phe Leu Gln Arg Ser Ser Asn Phe Gln Ser Gln Lys 115 120 125 Leu Leu Trp Gln Leu Asn Gly Arg Leu Glu Tyr Cys Leu Lys Asp Arg 130 135 140 Met Asn Phe Asp Ile Pro Glu Glu Ile Lys Gln Leu Gln Gln Phe Gln 145 150 155 160 Lys Glu Asp Ala Ala Leu Thr Ile Tyr Glu Met Leu Gln Asn Ile Phe 165 170 175 Ala Ile Phe Arg Gln Asp Ser Ser Ser Thr Gly Trp Asn Glu Thr Ile 180 185 190 Val Glu Asn Leu Leu Ala Asn Val Tyr His Gln Ile Asn His Leu Lys 195 200 205 Thr Val Leu Glu Glu Lys Leu Glu Lys Glu Asp Phe Thr Arg Gly Lys 210 215 220 Leu Met Ser Ser Leu His Leu Lys Arg Tyr Tyr Gly Arg Ile Leu His 225 230 235 240 Tyr Leu Lys Ala Lys Glu Tyr Ser His Cys Ala Trp Thr Ile Val Arg 245 250 255 Val Glu Ile Leu Arg Asn Phe Tyr Phe Ile Asn Arg Leu Thr Gly Tyr 260 265 270 Leu Arg Asn 275 <210> 50 <211> 191 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for human growth hormone (hGH) <400> 50 Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg 1 5 10 15 Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu 20 25 30 Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro 35 40 45 Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg 50 55 60 Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu 65 70 75 80 Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val 85 90 95 Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp 100 105 110 Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu 115 120 125 Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser 130 135 140 Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr 145 150 155 160 Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe 165 170 175 Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 180 185 190 <210> 51 <211> 204 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-hGH <400> 51 Met His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro Thr 1 5 10 15 Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg 20 25 30 Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr 35 40 45 Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser 50 55 60 Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr 65 70 75 80 Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile 85 90 95 Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn 100 105 110 Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys 115 120 125 Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly 130 135 140 Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp 145 150 155 160 Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu 165 170 175 Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile 180 185 190 Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 195 200 <210> 52 <211> 300 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-hGH <400> 52 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro Thr 100 105 110 Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His Arg 115 120 125 Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr 130 135 140 Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr Ser 145 150 155 160 Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu Thr 165 170 175 Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu Ile 180 185 190 Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn 195 200 205 Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys 210 215 220 Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp Gly 225 230 235 240 Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp 245 250 255 Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu Leu 260 265 270 Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile 275 280 285 Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 290 295 300 <210> 53 <211> 301 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-hGH <400> 53 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Phe Pro 100 105 110 Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg Ala His 115 120 125 Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala 130 135 140 Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gln Thr 145 150 155 160 Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg Glu Glu 165 170 175 Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu Leu Leu 180 185 190 Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala 195 200 205 Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu 210 215 220 Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp 225 230 235 240 Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe 245 250 255 Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu 260 265 270 Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg 275 280 285 Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 290 295 300 <210> 54 <211> 86 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for Insulin glargine precursor (INSG) <400> 54 Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr 1 5 10 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg 20 25 30 Glu Ala Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro 35 40 45 Gly Ala Gly Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys 50 55 60 Arg Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln 65 70 75 80 Leu Glu Asn Tyr Cys Gly 85 <210> 55 <211> 99 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for H6TEV-INSG <400> 55 Met His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val Asn 1 5 10 15 Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys 20 25 30 Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu 35 40 45 Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala Gly 50 55 60 Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly Ile 65 70 75 80 Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn 85 90 95 Tyr Cys Gly <210> 56 <211> 195 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for PG97-H6TEV-INSG <400> 56 Met Asn Ile Arg Pro Leu His Asp Arg Val Ile Val Lys Arg Lys Glu 1 5 10 15 Val Glu Thr Lys Ser Ala Gly Gly Ile Val Leu Thr Gly Ser Ala Ala 20 25 30 Ala Lys Ser Thr Arg Gly Glu Val Leu Ala Val Gly Asn Gly Arg Ile 35 40 45 Leu Glu Asn Gly Glu Val Lys Pro Leu Asp Val Lys Val Gly Asp Ile 50 55 60 Val Ile Phe Asn Asp Gly Tyr Gly Val Lys Ser Glu Lys Ile Asp Asn 65 70 75 80 Glu Glu Val Leu Ile Met Ser Glu Ser Asp Ile Leu Ala Ile Val Glu 85 90 95 Ala His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val Asn 100 105 110 Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys 115 120 125 Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu 130 135 140 Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala Gly 145 150 155 160 Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly Ile 165 170 175 Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu Asn 180 185 190 Tyr Cys Gly 195 <210> 57 <211> 196 <212> PRT <213> Artificial Sequence <220> <223> amino acid sequence for SUMO-H6TEV-INSG <400> 57 Met Ser Asp Ser Glu Val Asn Gln Glu Ala Lys Pro Glu Val Lys Pro 1 5 10 15 Glu Val Lys Pro Glu Thr His Ile Asn Leu Lys Val Ser Asp Gly Ser 20 25 30 Ser Glu Ile Phe Phe Lys Ile Lys Lys Thr Thr Pro Leu Arg Arg Leu 35 40 45 Met Glu Ala Phe Ala Lys Arg Gln Gly Lys Glu Met Asp Ser Leu Arg 50 55 60 Phe Leu Tyr Asp Gly Ile Arg Ile Gln Ala Asp Gln Thr Pro Glu Asp 65 70 75 80 Leu Asp Met Glu Asp Asn Asp Ile Ile Glu Ala His Arg Glu Gln Ile 85 90 95 Gly Gly His His His His His His Glu Asn Leu Tyr Phe Gln Phe Val 100 105 110 Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val 115 120 125 Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala 130 135 140 Glu Asp Leu Gln Val Gly Gln Val Glu Leu Gly Gly Gly Pro Gly Ala 145 150 155 160 Gly Ser Leu Gln Pro Leu Ala Leu Glu Gly Ser Leu Gln Lys Arg Gly 165 170 175 Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu Glu 180 185 190 Asn Tyr Cys Gly 195 <210> 58 <211> 135 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-GLP-1K28R <400> 58 atgcatcatc atcaccacca cgaaaacctg tatttccagc atgcggaagg cacctttacc 60 agcgatgtga gcagctatct ggaaggccag gcggcgaaag aatttattgc gtggctggtg 120 cgtggccgcg gctaa 135 <210> 59 <211> 267 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG43-H6TEV-GLP-1K28R <400> 59 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcc catggcatca tcatcaccac cacgaaaacc tgtatttcca gcatgcggaa 180 ggcaccttta ccagcgatgt gagcagctat ctggaaggcc aggcggcgaa agaatttatt 240 gcgtggctgg tgcgtggccg cggctaa 267 <210> 60 <211> 309 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG57-H6TEV-GLP-1K28R <400> 60 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct gccatggcat 180 catcatcacc accacgaaaa cctgtatttc cagcatgcgg aaggcacctt taccagcgat 240 gtgagcagct atctggaagg ccaggcggcg aaagaattta ttgcgtggct ggtgcgtggc 300 cgcggctaa 309 <210> 61 <211> 342 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG68-H6TEV-GLP-1K28R <400> 61 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacccatgg catcatcatc accaccacga aaacctgtat 240 ttccagcatg cggaaggcac ctttaccagc gatgtgagca gctatctgga aggccaggcg 300 gcgaaagaat ttattgcgtg gctggtgcgt ggccgcggct aa 342 <210> 62 <211> 375 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG79-H6TEV-GLP-1K28R <400> 62 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgaccca 240 tggcatcatc atcaccacca cgaaaacctg tatttccagc atgcggaagg cacctttacc 300 agcgatgtga gcagctatct ggaaggccag gcggcgaaag aatttattgc gtggctggtg 360 cgtggccgcg gctaa 375 <210> 63 <211> 402 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG88-H6TEV-GLP-1K28R <400> 63 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaccatgg catcatcatc accaccacga aaacctgtat 300 ttccagcatg cggaaggcac ctttaccagc gatgtgagca gctatctgga aggccaggcg 360 gcgaaagaat ttattgcgtg gctggtgcgt ggccgcggct aa 402 <210> 64 <211> 423 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-GLP-1K28R <400> 64 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagcat gcggaaggca cctttaccag cgatgtgagc 360 agctatctgg aaggccaggc ggcgaaagaa tttattgcgt ggctggtgcg tggccgcggc 420 taa 423 <210> 65 <211> 426 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-GLP-1K28R <400> 65 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catgcggaag gcacctttac cagcgatgtg 360 agcagctatc tggaaggcca ggcggcgaaa gaatttattg cgtggctggt gcgtggccgc 420 ggctaa 426 <210> 66 <211> 144 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hPTH1-34 <400> 66 atgcatcatc atcaccacca cgaaaacctg tatttccagt ctgtgagtga aatacagctt 60 atgcataacc tgggaaaaca tctgaactcg atggagagag tagaatggct gcgtaagaag 120 ctgcaggatg tgcacaattt ttaa 144 <210> 67 <211> 531 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hPTH1-34 <400> 67 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catgcggaag gcacctttac cagcgatgtg 360 agcagctatc tggaaggcca ggcggcgaaa gaatttattg cgtggctggt gcgtggccgc 420 ggctaaagcg tgagcgaaat tcagctgatg cataacctgg gcaaacatct gaacagcatg 480 gaacgcgtgg aatggctgcg caaaaaactg caggatgtgc ataactttta a 531 <210> 68 <211> 435 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hPTH1-34 <400> 68 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agcgtgagcg aaattcagct gatgcataac 360 ctgggcaaac atctgaacag catggaacgc gtggaatggc tgcgcaaaaa actgcaggat 420 gtgcataact tttaa 435 <210> 69 <211> 141 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-GLP-2A2G <400> 69 atgcatcatc atcaccacca cgaaaacctg tatttccagc atggcgatgg cagctttagc 60 gatgaaatga acaccattct ggataacctg gcggcgcgcg attttattaa ctggctgatt 120 cagaccaaaa ttaccgatta a 141 <210> 70 <211> 429 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-GLP-2A2G <400> 70 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagcat ggcgatggca gctttagcga tgaaatgaac 360 accattctgg ataacctggc ggcgcgcgat tttattaact ggctgattca gaccaaaatt 420 accgattaa 429 <210> 71 <211> 432 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-GLP-2A2G <400> 71 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag catggcgatg gcagctttag cgatgaaatg 360 aacaccattc tggataacct ggcggcgcgc gattttatta actggctgat tcagaccaaa 420 attaccgatt aa 432 <210> 72 <211> 222 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-Ecallantide <400> 72 atgcatcatc atcaccacca cgaaaacctg tatttccagg aagcgatgca tagcttttgc 60 gcgtttaaag cggatgatgg cccgtgccgc gcggcgcatc cgcgctggtt ttttaacatt 120 tttacccgcc agtgcgaaga atttatttat ggcggctgcg aaggcaacca gaaccgcttt 180 gaaagcctgg aagaatgcaa aaaaatgtgc acccgcgatt aa 222 <210> 73 <211> 510 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-Ecallantide <400> 73 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccaggaa gcgatgcata gcttttgcgc gtttaaagcg 360 gatgatggcc cgtgccgcgc ggcgcatccg cgctggtttt ttaacatttt tacccgccag 420 tgcgaagaat ttatttatgg cggctgcgaa ggcaaccaga accgctttga aagcctggaa 480 gaatgcaaaa aaatgtgcac ccgcgattaa 510 <210> 74 <211> 513 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-Ecallantide <400> 74 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag gaagcgatgc atagcttttg cgcgtttaaa 360 gcggatgatg gcccgtgccg cgcggcgcat ccgcgctggt tttttaacat ttttacccgc 420 cagtgcgaag aatttattta tggcggctgc gaaggcaacc agaaccgctt tgaaagcctg 480 gaagaatgca aaaaaatgtg cacccgcgat taa 513 <210> 75 <211> 138 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-Nesiritide <400> 75 atgcatcatc atcaccacca cgaaaacctg tatttccaga gcccgaaaat ggtgcagggc 60 agcggctgct ttggccgcaa aatggatcgc attagcagca gcagcggcct gggctgcaaa 120 gtgctgcgcc gccattaa 138 <210> 76 <211> 426 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-Nesiritide <400> 76 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagagc ccgaaaatgg tgcagggcag cggctgcttt 360 ggccgcaaaa tggatcgcat tagcagcagc agcggcctgg gctgcaaagt gctgcgccgc 420 cattaa 426 <210> 77 <211> 429 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-Nesiritide <400> 77 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agcccgaaaa tggtgcaggg cagcggctgc 360 tttggccgca aaatggatcg cattagcagc agcagcggcc tgggctgcaa agtgctgcgc 420 cgccattaa 429 <210> 78 <211> 201 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hEGF <400> 78 atgcatcatc atcaccacca cgaaaacctg tatttccaga actccgattc tgagtgccca 60 ctttcccacg atggatactg ccttcatgat ggcgtgtgca tgtacatcga ggctctcgat 120 aagtacgcct gcaactgcgt tgtgggttac attggagaga gatgccagta cagggatctc 180 aagtggtggg agcttaggta a 201 <210> 79 <211> 489 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hEGF <400> 79 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagaac tccgattctg agtgcccact ttcccacgat 360 ggatactgcc ttcatgatgg cgtgtgcatg tacatcgagg ctctcgataa gtacgcctgc 420 aactgcgttg tgggttacat tggagagaga tgccagtaca gggatctcaa gtggtgggag 480 cttaggtaa 489 <210> 80 <211> 492 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hEGF <400> 80 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag aactccgatt ctgagtgccc actttcccac 360 gatggatact gccttcatga tggcgtgtgc atgtacatcg aggctctcga taagtacgcc 420 tgcaactgcg ttgtgggtta cattggagag agatgccagt acagggatct caagtggtgg 480 gagcttaggt aa 492 <210> 81 <211> 537 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-INF-beta1b <400> 81 atgcatcatc atcaccacca cgaaaacctg tatttccaga gctataacct gctgggcttt 60 ctgcagcgca gcagcaactt tcagagccag aaactgctgt ggcagctgaa cggccgcctg 120 gaatattgcc tgaaagatcg catgaacttt gatattccgg aagaaattaa acagctgcag 180 cagtttcaga aagaagatgc ggcgctgacc atttatgaaa tgctgcagaa catttttgcg 240 atttttcgcc aggatagcag cagcaccggc tggaacgaaa ccattgtgga aaacctgctg 300 gcgaacgtgt atcatcagat taaccatctg aaaaccgtgc tggaagaaaa actggaaaaa 360 gaagatttta cccgcggcaa actgatgagc agcctgcatc tgaaacgcta ttatggccgc 420 attctgcatt atctgaaagc gaaagaatat agccattgcg cgtggaccat tgtgcgcgtg 480 gaaattctgc gcaactttta ttttattaac cgcctgaccg gctatctgcg caactaa 537 <210> 82 <211> 825 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-INF-beta1b <400> 82 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagagc tataacctgc tgggctttct gcagcgcagc 360 agcaactttc agagccagaa actgctgtgg cagctgaacg gccgcctgga atattgcctg 420 aaagatcgca tgaactttga tattccggaa gaaattaaac agctgcagca gtttcagaaa 480 gaagatgcgg cgctgaccat ttatgaaatg ctgcagaaca tttttgcgat ttttcgccag 540 gatagcagca gcaccggctg gaacgaaacc attgtggaaa acctgctggc gaacgtgtat 600 catcagatta accatctgaa aaccgtgctg gaagaaaaac tggaaaaaga agattttacc 660 cgcggcaaac tgatgagcag cctgcatctg aaacgctatt atggccgcat tctgcattat 720 ctgaaagcga aagaatatag ccattgcgcg tggaccattg tgcgcgtgga aattctgcgc 780 aacttttatt ttattaaccg cctgaccggc tatctgcgca actaa 825 <210> 83 <211> 828 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-INF-beta1b <400> 83 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag agctataacc tgctgggctt tctgcagcgc 360 agcagcaact ttcagagcca gaaactgctg tggcagctga acggccgcct ggaatattgc 420 ctgaaagatc gcatgaactt tgatattccg gaagaaatta aacagctgca gcagtttcag 480 aaagaagatg cggcgctgac catttatgaa atgctgcaga acatttttgc gatttttcgc 540 caggatagca gcagcaccgg ctggaacgaa accattgtgg aaaacctgct ggcgaacgtg 600 tatcatcaga ttaaccatct gaaaaccgtg ctggaagaaa aactggaaaa agaagatttt 660 acccgcggca aactgatgag cagcctgcat ctgaaacgct attatggccg cattctgcat 720 tatctgaaag cgaaagaata tagccattgc gcgtggacca ttgtgcgcgt ggaaattctg 780 cgcaactttt attttattaa ccgcctgacc ggctatctgc gcaactaa 828 <210> 84 <211> 612 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-hGH <400> 84 catcatcatc accaccacga aaacctgtat ttccagtttc caaccattcc actgtctcgc 60 ctgtttgata atgcaatgct gcgtgcacat cgtctgcatc agctggcatt tgatacctat 120 caggaatttg aagaagcata tattccaaaa gaacagaaat attctttcct gcaaaatcca 180 cagacctctc tgtgtttttc tgaatctatt ccaaccccat ccaatcgtga agagacccaa 240 cagaaatcta atctggaact gctgcgtatc tctctgttac tgattcagtc ttggctggaa 300 ccagtgcagt tcctgcgttc tgtgttcgca aattctctgg tttatggtgc atctgattct 360 aatgtttatg atctgctgaa agatttagag gaaggaattc agaccctgat gggtcgtctg 420 gaagatggtt ctccacgtac cggtcagatt tttaaacaga cctattctaa atttgatacc 480 aattctcata atgacgatgc actgctgaaa aactatggtc tgctgtactg ctttcgtaaa 540 gatatggata aagttgaaac ctttctgcgt attgttcagt gtcgttctgt tgaaggttct 600 tgtggttttt aa 612 <210> 85 <211> 903 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-hGH <400> 85 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagttt ccaaccattc cactgtctcg cctgtttgat 360 aatgcaatgc tgcgtgcaca tcgtctgcat cagctggcat ttgataccta tcaggaattt 420 gaagaagcat atattccaaa agaacagaaa tattctttcc tgcaaaatcc acagacctct 480 ctgtgttttt ctgaatctat tccaacccca tccaatcgtg aagagaccca acagaaatct 540 aatctggaac tgctgcgtat ctctctgtta ctgattcagt cttggctgga accagtgcag 600 ttcctgcgtt ctgtgttcgc aaattctctg gtttatggtg catctgattc taatgtttat 660 gatctgctga aagatttaga ggaaggaatt cagaccctga tgggtcgtct ggaagatggt 720 tctccacgta ccggtcagat ttttaaacag acctattcta aatttgatac caattctcat 780 aatgacgatg cactgctgaa aaactatggt ctgctgtact gctttcgtaa agatatggat 840 aaagttgaaa cctttctgcg tattgttcag tgtcgttctg ttgaaggttc ttgtggtttt 900 taa 903 <210> 86 <211> 906 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-hGH <400> 86 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag tttccaacca ttccactgtc tcgcctgttt 360 gataatgcaa tgctgcgtgc acatcgtctg catcagctgg catttgatac ctatcaggaa 420 tttgaagaag catatattcc aaaagaacag aaatattctt tcctgcaaaa tccacagacc 480 tctctgtgtt tttctgaatc tattccaacc ccatccaatc gtgaagagac ccaacagaaa 540 tctaatctgg aactgctgcg tatctctctg ttactgattc agtcttggct ggaaccagtg 600 cagttcctgc gttctgtgtt cgcaaattct ctggtttatg gtgcatctga ttctaatgtt 660 tatgatctgc tgaaagattt agaggaagga attcagaccc tgatgggtcg tctggaagat 720 ggttctccac gtaccggtca gatttttaaa cagacctatt ctaaatttga taccaattct 780 cataatgacg atgcactgct gaaaaactat ggtctgctgt actgctttcg taaagatatg 840 gataaagttg aaacctttct gcgtattgtt cagtgtcgtt ctgttgaagg ttcttgtggt 900 ttttaa 906 <210> 87 <211> 300 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for H6TEV-INSG <400> 87 atgcatcatc atcaccacca cgaaaacctg tatttccagt ttgtgaacca gcatctgtgc 60 ggcagccatc tggtggaagc gctgtatctg gtgtgcggcg aacgcggctt tttttatacc 120 ccgaaaaccc gccgcgaagc ggaagatctg caggtgggcc aggtggaact gggcggcggc 180 ccgggcgcgg gcagcctgca gccgctggcg ctggaaggca gcctgcagaa acgcggcatt 240 gtggaacagt gctgcaccag catttgcagc ctgtatcagc tggaaaacta ttgcggctaa 300 300 <210> 88 <211> 588 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for PG97-H6TEV-INSG <400> 88 atgaatattc gtccattgca tgatcgcgtg atcgtcaagc gtaaagaagt tgaaactaaa 60 tctgctggcg gcatcgttct gaccggctct gcagcggcta aatccacccg cggcgaagtg 120 ctggctgtcg gcaatggccg tatccttgaa aatggcgaag tgaagccgct ggatgtgaaa 180 gttggcgaca tcgttatttt caacgatggc tacggtgtga aatctgagaa gatcgacaat 240 gaagaagtgt tgatcatgtc cgaaagcgac attctggcaa ttgttgaagc gcatcatcat 300 caccaccacg aaaacctgta tttccagttt gtgaaccagc atctgtgcgg cagccatctg 360 gtggaagcgc tgtatctggt gtgcggcgaa cgcggctttt tttatacccc gaaaacccgc 420 cgcgaagcgg aagatctgca ggtgggccag gtggaactgg gcggcggccc gggcgcgggc 480 agcctgcagc cgctggcgct ggaaggcagc ctgcagaaac gcggcattgt ggaacagtgc 540 tgcaccagca tttgcagcct gtatcagctg gaaaactatt gcggctaa 588 <210> 89 <211> 591 <212> DNA <213> Artificial Sequence <220> <223> nucleotide sequence for SUMO-H6TEV-INSG <400> 89 atgtcggact cagaagtcaa tcaagaagct aagccagagg tcaagccaga agtcaagcct 60 gagactcaca tcaatttaaa ggtgtccgat ggatcttcag agatcttctt caagatcaaa 120 aagaccactc ctttaagaag gctgatggaa gcgttcgcta aaagacaggg taaggaaatg 180 gactccttaa gattcttgta cgacggtatt agaattcaag ctgatcagac ccctgaagat 240 ttggacatgg aggataacga tattattgag gctcacagag aacagattgg tggtcatcat 300 catcaccacc acgaaaacct gtatttccag tttgtgaacc agcatctgtg cggcagccat 360 ctggtggaag cgctgtatct ggtgtgcggc gaacgcggct ttttttatac cccgaaaacc 420 cgccgcgaag cggaagatct gcaggtgggc caggtggaac tgggcggcgg cccgggcgcg 480 ggcagcctgc agccgctggc gctggaaggc agcctgcaga aacgcggcat tgtggaacag 540 tgctgcacca gcatttgcag cctgtatcag ctggaaaact attgcggcta a 591

Claims

A fusion tag for producing glucagon-like peptide-1 or an analog thereof comprising the amino acid sequence of SEQ ID NO: 6.

The method of claim 1,
The fusion tag is characterized in that located at the N-terminus of the glucagon-like peptide-1 or an analog thereof, a fusion tag for producing glucagon-like peptide-1 or an analog thereof.

The method of claim 1,
The fusion tag is characterized in that the expression together with the glucagon-like peptide-1 or an analog thereof to form an inclusion body, a fusion tag for producing glucagon-like peptide-1 or an analog thereof.

The method of claim 1,
The fusion tag is a fusion tag for producing glucagon-like peptide-1 or an analog thereof, characterized in that it comprises a linker that connects the fusion tag and the glucagon-like peptide-1 or an analog thereof.

The method of claim 4,
The linker is a fusion tag for producing glucagon-like peptide-1 or an analog thereof, characterized in that it comprises an affinity tag.

The method of claim 4,
The linker is characterized in that it comprises a protease recognition sequence, glucagon-like peptide-1 or a fusion tag for the production of analogs thereof.

The method of claim 6,
The protease recognition sequence is characterized in that located at the C-terminus of the linker, glucagon-like peptide-1 or a fusion tag for producing an analog thereof.

The method of claim 6,
The protease recognition sequence is selected from the group consisting of tobacco etching virus protease recognition sequence, enterokinase recognition sequence, ubiquitin carboxy terminal hydrolase recognition sequence, factor Xa recognition sequence, purine recognition sequence, and combinations thereof, glucagon-like peptide -1 or a fusion tag for the production of an analog thereof.

The method of claim 6,
The protease recognition sequence is characterized in that it comprises the amino acid sequence of SEQ ID NO: 9, glucagon-like peptide-1 or a fusion tag for the production of analogs thereof.