TW201122105A - Polycistronic expression cassettes for producing cellulosomes and applications thereof - Google Patents

Polycistronic expression cassettes for producing cellulosomes and applications thereof

Publication number
TW201122105A
Authority
TW
Taiwan
Prior art keywords
enzyme
polycistronic
protein
sequence
complex
Prior art date
Application number
TW99144330A
Other languages
Chinese (zh)
Other versions
TWI444477B (en)
Inventor
Wen-Hsiung Li
Ming-Che Shih
Chieh-Chen Huang
Jui-Jen Chang
Cheng-Yu Ho
Original Assignee
Academia Sinica
Priority date
Filing date
Publication date
Application filed by Academia Sinica
Priority to TW99144330A
Publication of TW201122105A
Application granted
Publication of TWI444477B

Landscapes

  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Enzymes And Modification Thereof (AREA)

Abstract

The present invention relates to a polycistronic expression cassette for producing a cellulosome and applications thereof.

Description

VI. Description of the Invention:

[Technical Field]

The present invention relates to a polycistronic expression cassette for producing cellulosomes, and to applications thereof.

[Prior Art]

Lignocellulose is abundant in nature and is found mainly in plants. The sugars released when it is broken down can be converted into ethanol, which makes it a feedstock for bioenergy. In the plant cell wall, however, lignocellulose forms a complex structure with other components such as hemicellulose and lignin, so a pre-treatment step is needed to improve conversion efficiency. Current pre-treatments include physical and chemical methods such as grinding, explosion, high temperature, high pressure, and acid or alkali treatment. These methods consume energy, generate toxic waste liquid that harms the environment, and readily produce by-products. Biological methods, which add enzymes or special microbial strains, avoid these drawbacks, but several classes of enzymes are needed to degrade cellulosic substrates effectively.

Clostridium thermocellum is a heat-tolerant anaerobic bacterium that degrades cellulose with high efficiency. Studies show that C. thermocellum forms, outside the cell, a high-molecular-weight and structurally complex multi-enzyme complex called the cellulosome, which degrades cellulose effectively through the synergistic action of several degradative enzymes. Besides C. thermocellum, many other microorganisms in nature have similar cellulosomes, for example Clostridium cellulovorans, Clostridium cellulolyticum, Clostridium papyrosolvens, Trichoderma longibrachiatum, Bacteroides cellulosolvens, Acetivibrio cellulolyticus, Neocallimastix frontalis, Piromyces spp., Bacillus megaterium, Bacillus licheniformis, Bacillus cellulosolvens and Ruminococcus flavefaciens.

Studies show that the C. thermocellum cellulosome consists mainly of three kinds of subunit: the scaffold protein (scaffoldin), the degradative enzymes, and the cell surface anchoring protein. The scaffoldin serves as the backbone. It has multiple type I cohesin domains, which bind the type I dockerin domains of the enzymes. It also has a type II dockerin domain, which binds the type II cohesin domain of the cell surface anchoring protein so that the cellulosome is fixed to the outer surface of the cell. Finally, it has a carbohydrate-binding domain (CBM), which binds the cellulosic carbohydrate substrate. Figure 1 shows a schematic of the structure of the C. thermocellum cellulosome.

In the C. thermocellum cellulosome, the scaffoldin (CipA) has nine type I cohesin domains and can bind nine enzymes; the cell surface anchoring proteins (OlpB, SdbA and Orf2p) can each attach one or more scaffoldins; and the enzymes include at least exoglucanases (e.g., CelS, CbhA and CelK), endoglucanases (e.g., CelR, CelA, CelF, CelN and CelB), xylanases (e.g., XynC, XynY and XynZ) and hemicellulases (e.g., LicB and ChiA). C. thermocellum adjusts the proportions of these enzymes according to its environment (for example, when faced with different carbon sources) to achieve synergistic degradation. For example, when the carbon source is microcrystalline cellulose (Avicel), the most highly expressed enzyme is CelK, followed in order by CelS, CelR, CelA, XynC and so on. When the carbon source is cellobiose, C. thermocellum shifts expression so that the most highly expressed enzyme is XynZ, followed in order by XynC, CelA, CelK, CelR, CelS and so on. Studies show that the relative amounts of the enzymes are closely related to the degradation efficiency for a particular carbon source. The molecular mechanism by which C. thermocellum regulates these proportions, however, is not yet well understood.

For related literature, see Gold et al., J. Bacteriol. 189(19): 6787-6795, 2007; Bayer et al., J. Struct. Biol. 124: 221-234, 1998; Demain et al., Microbiol. Mol. Biol. Rev. 69: 124-154, 2005; Wu, ACS Symp. Ser. 516: 251-264, 1993; Bayer et al., Curr. Opin. Biotechnol. 18(3): 237-45, 2007; Cha et al., J. Microbiol. Biotechnol. 17(11): 1782-8, 2007; and Blouzard et al., Journal of Bacteriology 189(6): 2300-9, 2007.
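The carbon-source-dependent rankings described in the prior-art discussion above can be written down as a small lookup table. The following Python sketch is only an illustration: the names `EXPRESSION_RANKING`, `top_enzyme` and `rank_of` are hypothetical, and because the text ends each list with "and so on", each list is treated as a prefix of the full ranking rather than the complete order.

```python
# Rankings as stated in the text for Clostridium thermocellum.
# Each list is only the leading part of the full ranking ("and so on").
EXPRESSION_RANKING = {
    "avicel": ["CelK", "CelS", "CelR", "CelA", "XynC"],
    "cellobiose": ["XynZ", "XynC", "CelA", "CelK", "CelR", "CelS"],
}


def top_enzyme(carbon_source):
    """Return the most highly expressed enzyme for a carbon source."""
    return EXPRESSION_RANKING[carbon_source][0]


def rank_of(enzyme, carbon_source):
    """Return the 1-based rank of an enzyme, or None if it is not listed."""
    ranking = EXPRESSION_RANKING[carbon_source]
    return ranking.index(enzyme) + 1 if enzyme in ranking else None
```

Under these assumptions, `rank_of("CelK", "avicel")` is 1 while `rank_of("CelK", "cellobiose")` is 4, which is the kind of shift in proportions the text describes.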

Artificially produced "designer cellulosomes" are regarded as an important tool for studying the hydrolysis mechanism of cellulosomes, and they could also be applied in industrial bioenergy processes. However, owing to the complexity of the cellulosome itself and the technical bottleneck of multi-gene transformation, it has not yet been possible to construct a complete or near-complete cellulosome artificially.

In early work, Escherichia coli was used to express the EngB enzyme and a scaffoldin fragment (mini-CbpA) of Clostridium cellulovorans separately, and the interaction between the EngB enzyme and the scaffoldin fragment was then observed in vitro. Fierobe et al. later proposed chimeric cellulosomes. Using recombinant techniques, they produced chimeric scaffoldins containing two cohesin domains from different bacterial species. Each cohesin domain is strongly specific for the dockerin domain of the enzymes of its own species, and the two do not cross-bind. An enzyme carrying the matching dockerin domain therefore binds precisely to the corresponding cohesin domain of the chimeric scaffoldin, so that the intended cellulosome can be assembled in vitro (J. Biol. Chem. 276: 21257-21261). This technique, however, requires purifying each recombinant protein and then assembling the cellulosome in vitro, and it has many limitations. The chosen cohesin-dockerin pairs must be specific and must not interfere with one another, or cellulosome formation is affected, and the pairs currently usable with this technique are limited to no more than three. Even if new pairs become available, a new chimeric scaffoldin has to be constructed each time and can be used only after the new pair has been confirmed not to interfere with the existing ones. The engineering involved is extensive, cumbersome and very inconvenient, and the approach offers no way of adjusting the proportions of the enzymes to achieve synergistic degradation of a particular substrate.

Cho et al., on the other hand, constructed an expression vector carrying the genes for the C. cellulovorans scaffoldin fragment (mini-CbpA) and the EngB enzyme and transformed it into Bacillus subtilis, confirming that these genes can be expressed in B. subtilis and assemble into minicellulosomes (Cho et al., 2004. Production of minicellulosomes from Clostridium cellulovorans in Bacillus subtilis WB800. Applied and Environmental Microbiology 70(9): 5704-5707).

Arai et al. constructed separate vectors for the scaffoldin fragment, the EngB enzyme and XynB. They showed that co-culturing B. subtilis carrying the genes for the scaffoldin fragment and EngB with B. subtilis carrying the genes for the scaffoldin fragment and XynB produces minicellulosomes of two kinds, one composed of the scaffoldin fragment and EngB and the other composed of the scaffoldin fragment and XynB (Arai et al., 2007. Synthesis of Clostridium cellulovorans minicellulosomes by intercellular complementation. Proc Natl Acad Sci U S A 104(5): 1456-60). In these techniques, however, the scaffoldin is only a small fragment whose few cohesin domains serve merely to test binding to an enzyme, and at most one enzyme gene is introduced into each host. The result is still far from a complete cellulosome, and the proportions of the enzymes cannot be adjusted to achieve synergistic degradation of a particular substrate.

[Summary of the Invention]

The present invention provides, for the first time, a novel technical solution for producing, in a host cell, a cellulosome containing multiple degradative enzymes, in which the proportions of the enzymes can be adjusted as required so as to mimic the synergistic degradation efficiency of microbial cellulosomes. Compared with the prior art (for example, chimeric cellulosomes), the genetic recombination steps used in the invention are simple and are not restricted by the available cohesin-dockerin pairs, so that cellulosomes can be produced simply and conveniently in a host cell.

Accordingly, in one aspect, the invention provides a polycistronic expression cassette for producing a cellulosome in a host cell, the cellulosome comprising a scaffoldin subunit and a plurality of enzymatic subunits, wherein the plurality of enzymatic subunits have a natural ranking order according to their expression levels when grown in an environment, and wherein the polycistronic expression cassette comprises:

(a) a promoter; and

(b) a polycistronic nucleotide sequence operably linked to the promoter, the polycistronic nucleotide sequence comprising a scaffoldin nucleotide sequence encoding the scaffoldin subunit and a plurality of enzyme nucleotide sequences encoding the plurality of enzymatic subunits;

wherein the plurality of enzyme nucleotide sequences are arranged in the polycistronic nucleotide sequence according to the natural ranking order, such that the ranking order of the expression levels of the plurality of enzymatic subunits under the control of the promoter matches the natural ranking order.

In another aspect, the invention provides a vector comprising the polycistronic expression cassette described above.

The invention also provides a host cell containing the vector.

The invention further provides a method for producing a cellulosome in vivo, a method for degrading lignocellulosic biomass, and a method for adjusting the relative amounts of the plurality of enzymatic subunits in a cellulosome by controlling the expression levels of those subunits.

Details of one or more embodiments of the invention are set out below. Other features and advantages of the invention will be apparent from the detailed description of the embodiments and from the claims. It is believed that a person of ordinary skill in the art to which the invention pertains can, on the basis of the description herein, utilize the invention to its fullest extent. The following description is illustrative only and does not limit the invention in any way.

[Detailed Description]

Unless otherwise indicated, all technical and scientific terms used herein have the meanings commonly understood by a person of ordinary skill in the art to which the invention pertains.

As used herein, "a" or "an", unless otherwise specified, means at least one (one or more than one).

As used herein, "cellulosome" refers to a multi-enzyme complex that is composed of several protein subunits and degrades cellulosic substrates with high efficiency. A cellulosome comprises scaffoldin subunits and a plurality of enzymatic subunits. The scaffoldin subunit serves as the backbone of the natural cellulosome and typically includes one or more type I cohesin domains, which bind the type I dockerin domains of the individual enzymatic subunits. The plurality of enzymatic subunits comprises two or more kinds of enzymatic subunit, including but not limited to cellulase, exoglucanases, endoglucanases, xylanases, hemicellulases, cellobiohydrolase, β-glucosidase, exo-β-1,4-glucanase (BGLU, EC 3.2.1.21), lichenase (β-1,3-1,4-endoglucanase), pectinase, carbohydrate esterases and proteases, whose relative expression levels are adjusted in response to different growth environments (for example, different carbon sources) to achieve synergistic degradation. A cellulosome may further include a cell surface anchoring protein subunit, which fixes the multi-enzyme complex to the cell surface. Typically, the cell surface anchoring protein subunit includes one or more type II cohesin domains, which bind the type II dockerin domain of the scaffoldin.

As used herein, "scaffoldin nucleotide sequence", "plurality of enzyme nucleotide sequences" and "cell surface anchoring protein nucleotide sequence" refer to nucleotide sequences encoding the scaffoldin subunit, the plurality of enzymatic subunits and the cell surface anchoring protein subunit, respectively. Each may encode the full-length amino acid sequence of the corresponding natural protein subunit, or a fragment or variant thereof, including genetically engineered variants, preferably having at least one corresponding cohesin domain and/or dockerin domain.

In nature, microorganisms that produce cellulosomes include, but are not limited to, Clostridium thermocellum, Clostridium cellulovorans, Clostridium cellulolyticum, Clostridium papyrosolvens, Trichoderma longibrachiatum, Bacteroides cellulosolvens, Acetivibrio cellulolyticus, Neocallimastix frontalis, Piromyces spp., Bacillus megaterium, Bacillus licheniformis, Bacillus cellulosolvens and Ruminococcus flavefaciens. Table 1 lists information on the cellulosomes of several of these microorganisms.

Table 1: Protein subunits of the cellulosomes of selected microorganisms

Clostridium thermocellum
  Scaffoldin subunit
    CipA: nine cohesin domains, which bind the dockerin domains of the enzymes
  Enzymatic subunits
    Exoglucanases: e.g., CelS, CbhA and CelK
    Endoglucanases: e.g., CelR, CelA, CelF, CelN and CelB
    Xylanases: e.g., XynC, XynY and XynZ
    Hemicellulases: e.g., LicB and ChiA
  Cell surface anchoring proteins
    OlpB: seven cohesin domains, which bind the dockerin domain of the scaffoldin
    SdbA: one cohesin domain, which binds the dockerin domain of the scaffoldin
    Orf2p: two cohesin domains, which bind the dockerin domain of the scaffoldin

Clostridium cellulolyticum
  Scaffoldin subunit
    CbpA: nine cohesin domains, which bind the dockerin domains of the enzymes, and one cell surface anchoring cohesin domain
  Enzymatic subunits
    Exoglucanases: e.g., CelF
    Endoglucanases: e.g., CelA, CelD, CelC, CelG and CelE
    Xylanases: e.g., Cel5A and Xyn10A
    Hemicellulases: e.g., Gal27A

Clostridium cellulovorans
  Scaffoldin subunit
    CbpA: nine cohesin domains, which bind the dockerin domains of the enzymes, and cell surface anchoring cohesin domain(s)
  Enzymatic subunits
    Exoglucanases: e.g., ExgS
    Endoglucanases: e.g., EngH, EngK, EngL, EngM and EngN
    Hemicellulases: e.g., ManA
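The cohesin counts listed in Table 1 for Clostridium thermocellum imply an upper bound on how many enzymatic subunits one anchored complex can carry. The sketch below works that arithmetic out in Python. It assumes, for illustration only, that every type II cohesin domain of the anchoring protein holds one scaffoldin and every type I cohesin domain of each scaffoldin holds one enzyme; the function name `max_enzyme_sites` is hypothetical.

```python
# Cohesin counts as listed in Table 1 for Clostridium thermocellum.
SCAFFOLDIN_TYPE_I_COHESINS = {"CipA": 9}
ANCHOR_TYPE_II_COHESINS = {"OlpB": 7, "SdbA": 1, "Orf2p": 2}


def max_enzyme_sites(anchor, scaffoldin="CipA"):
    """Upper bound on enzyme binding positions for one anchoring protein.

    Assumes full occupancy: one scaffoldin per anchor cohesin domain and
    one enzyme per scaffoldin cohesin domain.
    """
    scaffoldin_sites = ANCHOR_TYPE_II_COHESINS[anchor]
    enzyme_sites_each = SCAFFOLDIN_TYPE_I_COHESINS[scaffoldin]
    return scaffoldin_sites * enzyme_sites_each


for anchor in ANCHOR_TYPE_II_COHESINS:
    print(anchor, max_enzyme_sites(anchor))
```

With the Table 1 values this gives 63 positions for OlpB (7 x 9), 9 for SdbA and 18 for Orf2p. Real complexes need not be fully occupied, so these figures are ceilings rather than expected loads.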

As used herein, "expression cassette" refers to a nucleic acid construct, produced by recombinant techniques or synthetically, that allows a gene to be expressed in a cell.

As used herein, "polycistronic" describes a nucleotide sequence that encodes two or more gene products, and "polycistronic nucleotide sequence" refers to a nucleotide sequence to be transcribed that encodes two or more gene products.

Those skilled in the art will appreciate that, because of the degeneracy of the genetic code, more than one gene sequence can encode the same polypeptide.

As used herein, "natural ranking order" describes the ranking, from high to low, of the expression levels or enzyme activities of the plurality of enzymatic subunits when grown in a given environment. Taking Clostridium thermocellum as an example, when the culture environment contains microcrystalline cellulose or cellobiose as the carbon source, the enzymatic subunits of its cellulosome show a corresponding ranking of expression level or enzyme activity.

As used herein, "lignocellulosic biomass" refers to lignocellulosic material that can be degraded by a cellulosome and converted into energy. Lignocellulosic biomass includes one or more components selected from the group consisting of cellulose, hemicellulose and lignin, or any combination thereof.

In one aspect, the invention provides a polycistronic expression cassette for producing a cellulosome in a host cell, the cellulosome comprising a scaffoldin subunit and a plurality of enzymatic subunits, wherein the plurality of enzymatic subunits have a natural ranking order according to their expression levels when grown in an environment, and wherein the polycistronic expression cassette comprises:

(a) a promoter; and

(b) a polycistronic nucleotide sequence operably linked to the promoter, the polycistronic nucleotide sequence comprising a scaffoldin nucleotide sequence encoding the scaffoldin subunit and a plurality of enzyme nucleotide sequences encoding the plurality of enzymatic subunits;

wherein the plurality of enzyme nucleotide sequences are arranged in the polycistronic nucleotide sequence according to the natural ranking order, such that the ranking order of the expression levels of the plurality of enzymatic subunits under the control of the promoter matches the natural ranking order.

Among the advantages of this design, the position of each introduced nucleotide sequence in the cassette is related to its expression level. The gene positions can therefore be adjusted by design to set the relative expression levels of the corresponding products, yielding a biomimetic cellulosome that mimics the synergistic degradation efficiency of a natural microbial cellulosome.

In one embodiment, the scaffoldin subunit of the cellulosome includes a type I cohesin domain and each of the plurality of enzymatic subunits includes a type I dockerin domain. The type I cohesin domain and the type I dockerin domain bind each other, so that the scaffoldin subunit can form a complex with the plurality of enzymatic subunits. In one example, the scaffoldin subunit includes nine type I cohesin domains and thus provides nine binding positions for enzymatic subunits (e.g., CipA of Clostridium thermocellum).

In another embodiment, the cellulosome further includes a cell surface anchoring protein subunit having a type II cohesin domain, and the scaffoldin subunit further includes a type II dockerin domain. The type II cohesin domain and the type II dockerin domain bind each other, so that the scaffoldin subunit and the plurality of enzymatic subunits can form a complex with the cell surface anchoring protein subunit. In one example, the cell surface anchoring protein subunit includes one or two type II cohesin domains and thus provides one or two binding positions for scaffoldin subunits, respectively (e.g., SdbA and Orf2p of Clostridium thermocellum).
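The design rule stated above, that the enzyme coding sequences are arranged in the cassette according to the natural ranking order so that expression under the promoter follows the same order, can be sketched as a small data structure. This is a minimal illustration and not the patent's construction method. It assumes that expression falls with distance from the promoter, places the scaffoldin gene first as an arbitrary choice of the sketch, and uses a made-up `decay` factor; `Cassette`, `design_cassette`, `modeled_expression` and `matches_ranking` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cassette:
    promoter: str
    genes: tuple  # coding sequences in order of increasing distance from the promoter


def design_cassette(promoter, scaffoldin, natural_ranking):
    """Place enzyme genes in natural ranking order, highest-expressed first.

    Putting the scaffoldin gene first is an assumption of this sketch.
    """
    if len(set(natural_ranking)) != len(natural_ranking):
        raise ValueError("each enzyme may appear only once in the ranking")
    return Cassette(promoter, (scaffoldin, *natural_ranking))


def modeled_expression(cassette, decay=0.7):
    """Toy model: each position further from the promoter is expressed at
    `decay` times the level of the previous one (hypothetical value)."""
    return {gene: decay ** i for i, gene in enumerate(cassette.genes)}


def matches_ranking(cassette, natural_ranking):
    """Check that modeled expression reproduces the natural ranking order."""
    levels = modeled_expression(cassette)
    ordered = sorted(natural_ranking, key=lambda g: levels[g], reverse=True)
    return ordered == list(natural_ranking)


# Example using the Avicel ranking given in the prior-art discussion.
avicel_ranking = ["CelK", "CelS", "CelR", "CelA", "XynC"]
cassette = design_cassette("P_example", "CipA", avicel_ranking)
print(cassette.genes)
print(matches_ranking(cassette, avicel_ranking))
```

Reordering `avicel_ranking` and rebuilding the cassette is all that is needed to target a different carbon source in this toy model, which mirrors the point that gene position, rather than a new cohesin-dockerin pair, is the adjustable parameter.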

CipA)以及細九個刀角•欠早兀之結合位置,如 (可提供七個支架蛋白次單元包括七個第二型黏合域 k供尚達63個分解酶次單S之結合位置。〗則、〜、了 茅規=本發明’複數個分解酶次單元依其在-環境生長時的 在本發明之多順 等複數個=動/之位置依序排列,使得該 母人早7C在錢動子之控制下之表現量的排名 13 201122105 順f與該元赌名轉補。更具體而言,複數财解酶 ^序列依各自距離啟動子之距離之遠近㈣成—㈣順序,愈 Λ近啟動子之基因序列,表現量愈高,離啟動子越遠之之基因 ^列’表現量愈低,因此,可視需要任意調整其排列順序,使 知對應的複數個分解酶次單元在啟動子之控制下之表現量的 排名順序,與其天然排名順序相符,^產生的複數個分解酶次單 7L依競爭性方式自行結合至支架蛋白次單元提供的結合位 置’形成仿生的纖維素水解酶複合體。 、卜根據本發明,吾人可依任何天然微生物之的纖維素水解酶 複合體(特別是依所含分解酶次單元之表現量的天然排名順 序)’設計出可在宿主細胞中製造纖維素水解酶複合體之多順 隹 反子表現卡匣。根據本發明之纖維素水解酶複合體可衍生自各 種微生物,包括但不限於,熱纖維梭菌(c 嗜纖維梭菌(C_ ceZ/w/ovoram )、解纖維梭菌(c c^hMytic臟)、溶紙莎萆後菌(c papyr〇s〇h;ens)、長便表徽 菌(THc/zoiier臟./〇/7g/6rac/z/ai麵)、溶纖維素擬桿菌 ce//w/⑽/v⑽)、解纖維素醋弧菌“咖.成咖 celluhlytjcusas)、瘤罵氡蛰 N 型蛰(Ne〇callimasHxfr〇ntaUs)、 瘤胃真菌P型菌(尸z>om少ces ψρ)、巨大芽孢桿菌(如c//仏 )、地衣芽孢桿菌(Baciilus lichenif〇rmis )、溶纖維 牙抱才干ΐί (如(?///⑽ce/Zw/ow/ve似)、黄色瘤胃球菌 響 (伽m/⑽coccw /7ανφ6咖)、及解纖維醋弧菌/v./〇 celMolyticus)。 在較佳具體實施例中’本發明之多順反子表現卡匣係依來 自嗜熱性微生物之纖維素水解酶複合體而設計,由於此類微生 物之纖維素水解酶複合體所含各種分解酶次單元具熱安定 性,因此,該多順反子表現卡匣所製得之纖維素水解/酶‘在高溫 下呈現高酵素活性’有利於工業上生質能源之製程。 在一較佳實施例中,該嗜熱性微生物係熱纖維梭菌。在一 具體實施例中,支架蛋白次單元係CipA。 14 201122105 在本發明之一具體實施例中,支架蛋白次單元係CipA以 及複數個分解酶次單元係包括外切葡聚糖酶CelS及CelK、内 切葡水糖每CelA、及^木糖酶XynC及XynZ。在·—特定實例 中,多順反子核苷酸序列之分解酶核苷酸序列係依序編碼 CelS、CelK、CelA、XynC及Xynz。在又一特定實例中,多 順反子核苦酸序列係依序編碼CipA、CelS、CelK、CelA、XynC 及 Xynz 〇 在另一較佳實施例中,本發明之多順反子表現卡匣係依來 自熱纖維梭菌之纖維素水解酶複合體而設計,其天然纖維素水 解酶複合體進一步包括細胞表面錨定蛋白次單元。在一具體實 施例中,支架蛋白次單元係CipA。在又一具體實施例中,細 胞表面錨定蛋白次單元係選自由〇lpB、SdbA及0rf2p所組成 之群之任一者。在另一具體實施例中,複數個分解酶次單元包 括外切葡聚糖酶CelS及CelK、内切葡聚糖酶CelR及CelA及 聚木糖酶XynC及XynZ。 在一特定實例中’多順反子核苷酸序列之分解酶核苷酸序 列 k依序編碼 CelK、CelS ' CelR、CelA、XynC 及 XynZ .甘 中’較佳地,細胞表面錫定蛋白次單元係SdbA,更佳地多順 反子核苷酸序列係依序編碼CipA、CelK、CelS、CelR、SdbA、' CelA、XynC 及 XynZ。 在另一特定實例中,多順反子核苷酸序列之分解酶核苦酸 序列係依序編碼 XynZ、XynC、CelA、CelK、CelR 及 CelS . 
其中’較佳地’細胞表面錫定蛋白次單元係SdbA,更佳地多 順反子核杳酸序列係依序編碼CipA、XynZ、XynC、CelA'、 SdbA、CelK、CelR、及 CelS。 在本發明之又一具體實施例中,支架蛋白次單元係 CipA ’以及細胞表面錨定蛋白次單元係〇ιρΒ。 啟動子是一種核苷酸序列’所含元件可啟動操作地連接的 核酸序列之轉錄作用。至少,啟動子含有RNA聚合酶結合位 點。其可進一步含有一或多個增強子元件,其在定義上強 15 201122105 轉錄作用,或包含一或多個控制啟動子開/關狀態之調節元 件。選擇用於構建該表現卡匣的適當啟動子是由被導入表現卡 匣之宿主細胞的類型而定。當使用大腸桿菌作為宿主細胞時’ 適亨的啟動子包括’但不侷限於β_内醯胺酶和乳糖啟動子系統 (參見 Chang 寺人,施加275 : 615-624,1978) ; SP6、Τ3 和T7RNA聚合酶啟動子(Studier等人,施认公叫w〇/. 185 : 60-89,1990);λ-啟動子(Elvin 等人,Ge呢 87: 123-126,1990); 呻啟動子(Nichols和Yan0fsky,舱1〇1 : 155-164 ’ 1983) ; tac 和 trc 啟動子(Russell 等人,2〇 : 231-243,1982) ’ 以及 pCold (參見美國專利案 6,479,26〇)。 當選擇枯草桿_為社細鱗,麻性的啟動子包括pr啟 動子、Spol啟動子、Tac啟動子和LacI啟動子。這些啟動子 亦=被用於其他細菌宿主,例如,大腸桿菌(汾 、梭狀桿菌(C/o磁//膽)、黴漿菌(姆哪/似顯)、乳球 囷⑽⑽)、乳酸桿菌(1咖)、弧菌(脱响) =^綠澡你咖)。用於酵母菌(例如,釀酒酵母)CipA) and the combination of nine knives and angles, such as the combination of the seven scaffold protein subunits including seven second-type binding domains k for the combination of the 63 decomposing enzymes and the single S. Then, ~, the stipulations = the present invention 'plural number of decomposition enzyme subunits according to their in-environment growth in the multi-sequence of the present invention = position / position / order, so that the mother is 7C early Ranked under the control of Qianzizi 13 201122105 顺f and the yuan gambling name to be added. More specifically, the complex financial enzymes ^ sequence according to their distance from the promoter distance (four) into - (four) order, the more The gene sequence of the promoter is higher, and the higher the amount of expression, the farther the gene from the promoter is, the lower the amount of expression is. Therefore, the order of the genes can be arbitrarily adjusted as needed, so that the corresponding multiple decomposition enzyme subunits are The ranking order of the performance under the control of the promoter is consistent with its natural ranking order. 
The resulting multiple decomposition enzymes are 7L in a competitive manner and bind to the binding site provided by the scaffold protein subunit to form a biomimetic cellulose hydrolysis. Enzyme complex. According to the present invention, we can design a cellulolytic enzyme which can be produced in a host cell according to the cellulolytic enzyme complex of any natural microorganism (especially according to the natural ranking order of the amount of the decomposing enzyme subunit) The multi-cis scorpion complex of the complex exhibits a ruthenium. The cellulolytic enzyme complex according to the present invention can be derived from various microorganisms including, but not limited to, Clostridium thermocellum (C_ceZ/w/ovoram) ), Clostridium cellulosic (cc^hMytic dirty), lysin (c papyr〇s〇h; ens), Campylobacter sinensis (THc/zoiier dirty. / 〇 / 7g / 6rac / z / Ai surface), Bacteroides lysate ce//w/(10)/v(10)), Bacillus fluorescens "Caf. celluhlytjcusas", tumor type N (Ne〇callimasHxfr〇ntaUs), rumen fungus P type bacteria (corporate z > om less ces ψ ρ), Bacillus megaterium (such as c / / 仏), Bacillus licheniformis (Baciilus lichenif〇rmis), soluble fiber teeth only ΐ ί (such as (? / / / (10) ce / Zw /ow/ve), yellow rumen cocci (Gam/(10)coccw /7ανφ6), and Vibrio anguillarum/v./〇celMolyticu s). In a preferred embodiment, the polycistronic expression cassette of the present invention is designed according to a cellulolytic enzyme complex derived from a thermophilic microorganism, and is contained in a cellulolytic enzyme complex of such a microorganism. The various decomposing enzyme subunits have thermal stability. Therefore, the polycistronics exhibit a cellulose hydrolyzate/enzyme 'high enzyme activity at high temperature' which is beneficial to the industrial biomass energy process. In a preferred embodiment, the thermophilic microorganism is Clostridium thermocellum. In a specific embodiment, the scaffold protein subunit is CipA. 
In a specific embodiment of the present invention, the scaffold protein subunit is CipA, and the plurality of degrading enzyme subunits include the exoglucanases CelS and CelK, the endoglucanase CelA, and the xylanases XynC and XynZ. In a specific example, the degrading-enzyme-encoding portion of the polycistronic nucleotide sequence encodes, in order, CelS, CelK, CelA, XynC and XynZ. In yet another specific example, the polycistronic nucleotide sequence encodes, in order, CipA, CelS, CelK, CelA, XynC and XynZ.

In another preferred embodiment, the polycistronic expression cassette of the present invention is designed according to a cellulosome from Clostridium thermocellum, in which the natural cellulosome further comprises a cell-surface anchoring protein subunit. In a specific embodiment, the scaffold protein subunit is CipA. In still another embodiment, the cell-surface anchoring protein subunit is selected from the group consisting of OlpB, SdbA and Orf2p. In another embodiment, the plurality of degrading enzyme subunits include the exoglucanases CelS and CelK, the endoglucanases CelR and CelA, and the xylanases XynC and XynZ.

In a specific example, the degrading-enzyme-encoding portion of the polycistronic nucleotide sequence encodes, in order, CelK, CelS, CelR, CelA, XynC and XynZ; preferably, the cell-surface anchoring protein subunit is SdbA, and more preferably the polycistronic nucleotide sequence encodes, in order, CipA, CelK, CelS, CelR, SdbA, CelA, XynC and XynZ. In another specific example, the degrading-enzyme-encoding portion of the polycistronic nucleotide sequence encodes, in order, XynZ, XynC, CelA, CelK, CelR and CelS; preferably, the cell-surface anchoring protein subunit is SdbA, and more preferably the polycistronic nucleotide sequence encodes, in order, CipA, XynZ, XynC, CelA, SdbA, CelK, CelR and CelS.
In still another embodiment of the invention, the scaffold protein subunit is CipA and the cell-surface anchoring protein subunit is OlpB.

A promoter is a nucleotide sequence containing elements that initiate transcription of an operably linked nucleic acid sequence. At a minimum, a promoter contains an RNA polymerase binding site. It may further contain one or more enhancer elements, which by definition enhance transcription, or one or more regulatory elements that control the on/off state of the promoter. The choice of an appropriate promoter for constructing the expression cassette depends on the type of host cell into which the expression cassette is to be introduced. When E. coli is used as the host cell, suitable promoters include, but are not limited to, the β-lactamase and lactose promoter systems (see Chang et al., Nature 275:615-624, 1978); the SP6, T3 and T7 RNA polymerase promoters (Studier et al., Meth. Enzymol. 185:60-89, 1990); the λ promoter (Elvin et al., Gene 87:123-126, 1990); the trp promoter (Nichols and Yanofsky, Meth. Enzymol. 101:155-164, 1983); the tac and trc promoters (Russell et al., Gene 20:231-243, 1982); and pCold (see U.S. Patent No. 6,479,260). When Bacillus subtilis is selected as the host cell, exemplary promoters include the Pr promoter, the Spo1 promoter, the Tac promoter and the LacI promoter. These promoters can also be used in other bacterial hosts, for example, Escherichia coli, Clostridium, Mycoplasma, Lactococcus, Lactobacillus, Vibrio and cyanobacteria. Promoters for use in yeast (e.g., Saccharomyces cerevisiae) or other fungi (e.g., Kluyveromyces, Pichia, Aspergillus, Trichoderma and Candida) include the Lac4 promoter, the Adh4 promoter, the GapDH promoter, the Adh1 promoter, the Pgk promoter, the Aac promoter, the Pho5 promoter and the Gal7 promoter.

In a preferred embodiment, the expression cassette is constructed with an inducible promoter. Such a promoter is activated only in the presence of an inducer (e.g., tetracycline) or at a specific temperature (e.g., 40°C or above).

Conventional methods, such as genomic PCR or restriction enzyme mapping, can be used to confirm the target genes and their positions.
The expression cassette is then introduced into an appropriate host cell for producing the cellulosome. Where needed, the expression cassette can be introduced into a host cell other than the organism from which the genes originate. If the genes encode thermophilic enzymes, the host cell is preferably a mesophile. Positive clones can be identified by, for example, antibiotic resistance selection and by measuring the expected enzyme activity.

The expression cassette can be incorporated into a vector, which is then introduced into a host cell for producing the cellulosome.

In another aspect, the present invention provides a vector comprising the polycistronic expression cassette described above.

In another aspect, the present invention provides a host cell containing the aforementioned vector. The host cell is cultured so that it expresses the proteins encoded by the genes. Suitable host cells include bacteria (e.g., Lactococcus and Lactobacillus) and yeast or other fungi (e.g., Kluyveromyces, Pichia, Aspergillus, Trichoderma and Candida).

The host cell is cultured to express the cassette and thereby obtain the cellulosome. The method of the present invention may further include purifying the cellulosome. Typically, because the scaffold protein has a carbohydrate-binding domain (CBD), the purification can be carried out by exploiting this binding; alternatively, because the complex can remain associated with the cells, a simple separation method, such as centrifugation of the cell culture, can serve as the purification step. In a particular example, the host cell culture can be mixed with cellulose, and the cellulose pellet is then collected by centrifugation to achieve purification. Details are given in the Examples below.

In another aspect, the present invention provides a method of degrading lignocellulosic biomass, which comprises contacting the lignocellulosic biomass with the cellulosome described above. Typically, the lignocellulosic biomass contains one or more of cellulose, hemicellulose and lignin.

In a further aspect, the present invention provides a method of adjusting the ratio among a plurality of degrading enzyme subunits, in which the aforementioned vector is prepared and introduced into a host cell so that the plurality of degrading enzyme subunits are expressed. Specifically, the method comprises:

(1) preparing a vector comprising a polycistronic expression cassette, the polycistronic expression cassette comprising:
(a) a promoter; and
(b) a polycistronic nucleotide sequence operably linked to the promoter,
wherein the ranking of the plurality of degrading enzyme subunits corresponds to the positions of their coding sequences relative to the promoter; and

(2) introducing the vector into a host cell and expressing the plurality of degrading enzyme subunits, whereby the ratio among the degrading enzyme subunits is adjusted.

The details of specific embodiments of the invention are set out below. Other features of the invention will be apparent from the detailed description of the following examples and from the claims.

Example 1: Construction of the polycistronic expression cassette of the present invention

Using the KOD-Plus kit (TOYOBO Co., Ltd., Japan), DNA fragments encoding the Clostridium thermocellum cellulosomal proteins, namely the scaffold protein CipA, the exoglucanases CelS and CelK, the endoglucanase CelA, and the xylanases XynC and XynZ, were each amplified by PCR. CipA contains a cellulose-binding module (CBM), a surface layer homology module (SLH) and nine type I cohesin domains. The PCR products were each cloned into the plasmid pCR-XL-TOPO and introduced into E. coli host cells. Plasmid DNA was prepared from positive transformants with the Qiagen Plasmid Midi Kit (Qiagen, California) and digested with restriction enzymes; after separation and purification by agarose gel electrophoresis, the DNA fragments encoding each of the above proteins were obtained.

The DNA fragments were then ligated into the E. coli/Bacillus subtilis shuttle vector pGETS118 by the "ordered gene assembly in Bacillus subtilis" (OGAB) method proposed by Tsuge, Itaya et al. (Nucleic Acids Res. 31:e133). The shuttle vector includes a heat-inducible Pr promoter, and its copy number in Bacillus subtilis is increased by isopropyl-β-D-thiogalactopyranoside (IPTG). The designed gene order was cipA-celS-celK-celA-xynC-xynZ; this order reflects the ranking of the amounts of the degrading enzymes produced. Briefly, the DNA fragments (in equimolar amounts) were mixed with the vector, and the ligation reaction was carried out at 16°C with the Takara ligation kit ver. 1 in 2× ligation buffer (containing, among other components, Tris-HCl, 13.2 mM MgCl2, ATP and 300 mM NaCl, with reagent from Wako Pure Chemical Industries, Japan). This produced a recombinant DNA molecule in which the expression cassette includes the promoter and the polycistronic nucleotide sequence with the coding order cipA-celS-celK-celA-xynC-xynZ; its sequence is shown in Figure 8 (SEQ ID NO: 1).

Example 2: Screening for transformants containing the polycistronic expression cassette of the present invention

The polycistronic DNA obtained in Example 1 was introduced into the Bacillus subtilis mutant strains RM125 and BUSY9166. Briefly, competent Bacillus subtilis cells were first prepared by a two-step culture method (J. Bacteriol. 81(5): 741-6 (1961)); an appropriate amount of the polycistronic DNA was then mixed with 100 ml of competent Bacillus subtilis cells and incubated at 37°C for 30 minutes. 300 ml of LB medium was added to the mixture of DNA and cells, and the cells were then cultured at 37°C for 1 hour. The cultured cells were plated on LB plates containing tetracycline (10 mg/ml) to select positive transformants.

The tetracycline-resistant clones were cultured for 5 hours in medium containing 1 mM isopropyl-β-D-thiogalactopyranoside (IPTG) and then for a further 3 hours at a second temperature. The supernatant was then collected and concentrated by filtration with an Amicon filter (30 kDa cutoff). The glucanase activity of each clone was determined by measuring the fluorescence produced by the filtered sample upon irradiation.

Clones 1 and 13, which exhibited glucanase activity, were thereby identified. PCR analysis showed that the relevant genes could be amplified from the DNA of these clones, and restriction enzyme digestion analysis showed that the genes were arranged in the designed order, cipA-celS-celK-celA-xynC-xynZ.

Example 3: Enzyme activity analysis

Clones 1 and 13, together with a clone containing the empty vector (control clone), were cultured to induce expression of the cellulosomal proteins. The cultures were centrifuged at 5,000 g for 10 minutes. The supernatant was collected and concentrated at 4°C with a Sartorius Viva concentrator with a molecular weight cutoff (Sartorius, Goettingen, Germany) against exchange buffer (50 mM Tris, 10 mM CaCl2 and 5 mM DTT, pH 6.8). In addition, the cell pellet was collected, resuspended in PBS, lysed by sonication (pulse: 3 seconds; pause: 2 seconds, over 12 minutes) and then centrifuged at 13,200 rpm for 40 minutes to remove debris, giving a sample containing intracellular proteins (the "intracellular sample"). Alternatively, the cell pellet was collected and resuspended in PBS to give a sample containing intact cells. The protein content of the supernatant and intracellular samples was determined by the Bradford method, and the enzyme activities of these samples were then analyzed as described below.

3.1 Endoglucanase activity

Endoglucanase activity was determined with azurine cross-linked (AZCL) β-glucan (dye-CMC; purchased from Megazyme) as the substrate. Briefly, 1% (w/v) dye-CMC in 50 mM sodium acetate was incubated at 60°C for 3 hours with the supernatant samples and the intracellular samples, and the absorbance of each sample at 590 nm (OD590) was then measured; this value is related to the level of glucanase activity. Figures 2(a) and 2(b) show the endoglucanase activity and the endoglucanase specific activity of the samples, respectively.
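The relationship between activity and specific activity reported in Figure 2 (activity normalized to the protein content of the sample) can be sketched as follows. This is only an illustrative calculation: the function names and all numerical readings are hypothetical and are not measurements from this specification.

```python
def specific_activity(activity: float, protein_mg: float) -> float:
    """Normalize a measured enzyme activity to the protein content of the sample."""
    if protein_mg <= 0:
        raise ValueError("protein content must be positive")
    return activity / protein_mg


def fold_over_control(sample: float, control: float) -> float:
    """Express a sample value as a multiple of the control value."""
    if control <= 0:
        raise ValueError("control value must be positive")
    return sample / control


# Hypothetical readings: an activity signal (e.g. OD590) and protein content in mg.
clone_sa = specific_activity(activity=1.5, protein_mg=0.5)
control_sa = specific_activity(activity=0.5, protein_mg=0.25)

print(clone_sa, control_sa, fold_over_control(clone_sa, control_sa))
```

Comparing specific activities rather than raw activities separates a genuine gain in enzyme activity from a mere difference in the amount of protein recovered in each sample.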
The results show that the supernatant samples of clones 1 and 13 had more than twice the endoglucanase activity of the supernatant sample of the control clone. Compared with the intracellular sample of the control clone, a significantly increased endoglucanase activity was also measured in the intracellular samples of clones 1 and 13; see Figure 2(a). Compared with the control clone, clones 1 and 13 had a higher intracellular protein content, indicating that a certain amount of exogenous protein was retained in the cells; this is consistent with the specific activity (enzyme activity normalized to total protein content) shown in Figure 2(b).

3.2 Total glucanase activity

To determine total glucanase activity, the supernatant and intracellular samples were mixed with 4-methylumbelliferyl-β-D-cellobioside (MUC) in sodium acetate buffer (pH 5.0) at 60°C for 3 hours. The activity was then measured in 1% Na2CO3 by fluorometry under UV irradiation at 365 nm. Figures 2(c) and 2(d) show the total glucanase activity and the total glucanase specific activity of the samples, respectively.

The results show that the glucanase activity observed in the supernatants from clones 1 and 13 was higher than that observed in the supernatant from the control clone, see Figure 2(c), and that a certain amount of exogenous protein was retained in the cells, see Figure 2(d).

Example 4: Detection of cellulosome formation

The formation of the cellulosome in clones 1 and 13 was examined by SDS-PAGE. First, extracellular protein samples from the two clones (not boiled) were subjected to electrophoresis on a 5-15% (w/v) polyacrylamide gel containing 0.1% SDS. The polyacrylamide gel was then placed on top of an agar gel containing xylan or CMC, and the air bubbles between the two gels were removed.
The two gels were then incubated together at 40°C for 3 hours (if the agar gel contained CMC) or at 60°C overnight (if the agar gel contained xylan). Afterwards the agar gel was separated from the polyacrylamide gel, and the polyacrylamide gel was stained for protein with SYPRO Ruby. The agar gel was soaked in 1 mg/ml Congo red and then in 1 M NaCl for 10-60 minutes. Positions on the agar gel at which the CMC or xylan substrate had been degraded appeared yellow, whereas positions at which no degradation had occurred appeared dark red. The polyacrylamide gel was also incubated at 60°C for 30 minutes in a solution (pH 5.0) containing 0.2 mg/ml MUC and 50 mM NaOAc to detect glucanase activity; the extent of MUC degradation was detected by examining the fluorescence of the gel at 365 nm.

The results of the above tests show that clones 1 and 13 exhibited extracellular endoglucanase, xylanase and total glucanase activities.

Following the method described in Matsudaira, J. Biol. Chem. 262:10035-10038 (1987) or Salinovich and Montelaro, Anal. Biochem. 156:341-347 (1986), the proteins on the polyacrylamide gel were transferred to a polyvinylidene difluoride membrane (GE). The membrane was blocked with PBS containing 5% skim milk, washed, and incubated with an anti-rCipA antibody (1:5000 dilution) at 4°C for 16 hours. After several washes, the membrane was incubated with an HRP-conjugated goat secondary antibody (1:5000 dilution) and then developed.

The proteins located at the positions showing degrading enzyme activity were extracted from the gel and then analyzed by 5-15% (w/v) polyacrylamide gel electrophoresis and two-dimensional gel electrophoresis. In this analysis, the individual peptides were found to be highly similar to fragments of cellulosomal proteins including CelS, CipA and XynZ, indicating that these proteins formed a protein complex that exhibited the desired degrading enzyme activities.

Example 5: Determination of enzyme thermostability

Using the methods described above, the extracellular degrading enzyme activities of the control clone and clone 1 were tested at different temperatures with CMC and MUC as substrates. Figure 3 shows the results. Compared with the control clone, clone 1 had higher degrading enzyme activity above 50°C, and the difference became larger at higher temperatures; see Figure 3(a). In addition, the clones had similar extracellular protein contents at each test temperature, see Figure 3(b), indicating that the increased degrading enzyme activity of clone 1 at high temperature is attributable to the thermostability of the degrading enzymes expressed by clone 1.

Example 6: Other examples

6.1 Construction of other polycistronic expression cassettes and selection of transformants

Following the biomimetic strategy described above, further examples of the polycistronic expression cassette of the present invention were designed to mimic the way Clostridium thermocellum adjusts the expression ranking of the degrading enzymes of its cellulosome in response to different carbon sources. The Mode I polycistronic expression cassette was designed on the basis of microcrystalline cellulose (Avicel) as the carbon source, with the gene order cipA-celK-celS-celR-sdbA-celA-xynC-xynZ; the Mode II polycistronic expression cassette was designed on the basis of cellobiose as the carbon source, with the gene order cipA-xynZ-xynC-celA-sdbA-celK-celR-celS.

The experimental methods were the same as described above. First, DNA fragments encoding the cellulosomal proteins CipA, CelS, CelK, CelA, CelR, XynC, XynZ and SdbA were each cloned into the TOPO vector system and subjected to restriction mapping, sequence verification, and gel extraction and purification. The DNA fragments encoding the cellulosomal proteins were then ligated, in the designed order, into the shuttle vector pGETS118 by the OGAB method. The ligation products were introduced into the Bacillus subtilis mutant strain BUSY9797 (a strain with a mutation in the CI repressor), and tetracycline-resistant positive clones were selected under culture at 37°C. Restriction enzyme digestion analysis confirmed that the recombinant vectors in the positive clones carried the desired polycistronic expression cassettes: the gene order of the Mode I cassette was cipA-celK-celS-celR-sdbA-celA-xynC-xynZ (Figure 4(a)), with the sequence shown in Figure 9 (SEQ ID NO: 2), and the gene order of the Mode II cassette was cipA-xynZ-xynC-celA-sdbA-celK-celR-celS (Figure 4(b)), with the sequence shown in Figure 10 (SEQ ID NO: 3).

6.2 Gene transcription analysis

Total RNA was isolated from the Bacillus cells after 20 hours of culture with the RNeasy Protect Bacteria kit (High Pure RNA Isolation Kit, Roche). RT-PCR was carried out on a LightCycler instrument (LightCycler 480, Roche) with a reverse transcription kit (iScript cDNA Synthesis Kit, Bio-Rad), a real-time quantitative SYBR Green I RT-PCR kit (Roche 480 SYBR Green, Roche) and gene-specific primer sets (amplicon sizes of 113 to 137 bp; see below), and the data were analyzed by absolute quantification of RNA.

The results show that, in the Mode I and Mode II Bacillus subtilis clones, the most abundant transcript among the eight genes came from the gene located immediately downstream of the promoter, and the copy numbers of the remaining transcripts decreased in proportion to their increasing distance from the promoter. This indicates that the eight genes were successfully transcribed from a single Pr promoter, and that the ranking of the expression levels of the genes followed the same trend as the order of the genes in the recombinant plasmid.

6.3 Analysis of cellulosome formation

To determine whether the cellulosome of Clostridium thermocellum was expressed and assembled in the heterologous host (Bacillus subtilis), the cultures were analyzed as follows. Briefly, the Mode I and Mode II Bacillus subtilis clones were cultured, and the resulting bacterial cultures were centrifuged at 4°C. The supernatant sample was mixed with 4 mg of crystalline cellulose in 50 mM phosphate buffer (pH 7) in a final volume of 4 ml. After incubation, the cellulose was collected and washed twice with phosphate buffer. The proteins bound to the cellulose pellet were dissolved in SDS-PAGE sample buffer and then analyzed by SDS-PAGE. Figure 5 shows the results.

The results show that the Mode I and Mode II Bacillus subtilis clones expressed protein bands of the molecular weights expected for cellulosomal subunits including CelS, CelR and XynC (among them bands at approximately 82, 69.5 and 52.5 kDa). This indicates that the Mode I and Mode II clones, upon induction, expressed the various hydrolases, which assembled on the scaffold protein CipA containing the cellulose-binding module (CBM), so that they could be purified by cellulose adsorption.
6.4 Zymogram electrophoresis analysis of the cellulosome

The cultures of the Mode I and Mode II Bacillus subtilis clones were added to sample buffer, without boiling and without addition of a reducing agent, and analyzed by gradient SDS gel electrophoresis, in which the protein molecules are separated by the electrophoretic force according to differences in their intrinsic charge. Figure 6(d) shows the Coomassie brilliant blue staining result.

After electrophoresis, zymogram assays were performed with xylan and CMC as substrates, followed by Congo red staining; regions in which the substrate had been degraded by xylanase or endoglucanase appeared as clear bands against the darkly stained background. Figures 6(b) and 6(c) show the results. For glucanase activity, the gel was immersed in a buffer containing MUC, and the activity was then measured by fluorometry with a 365 nm UV light source; Figure 6(a) shows the result.

The results show that, compared with the Bacillus subtilis clone carrying only the pGETS vector, the secreted proteins of the Mode I and Mode II Bacillus subtilis clones exhibited hydrolytic activity toward the substrates. In addition, the clear zones for MUC, xylan and CMC in the zymogram gels appeared in corresponding regions (smear areas). These results are one piece of evidence that the individual proteins with hydrolase activity produced by the Mode I and Mode II Bacillus subtilis clones can bind to CipA and together form a complex.

6.5 Analysis of the degrading enzyme activities of the cellulosome

The Mode I and Mode II Bacillus subtilis clones were cultured and induced at 42°C. The supernatant and the intracellular material were prepared in a buffer containing SDS and a low concentration of DTT, and the various degrading enzyme activities were then assayed with crystalline cellulose, dye-CMC and xylan as substrates. Figure 7 shows the results.

The results show that, when the substrate was crystalline cellulose, the exoglucanase activity of the Mode I clone was markedly increased. The two clones exhibited different rankings of degrading enzyme activities, which mirrored the different orders of the nucleotide sequences in their respective expression cassettes.

In summary, the present invention provides a polycistronic expression cassette that mimics the cellulosome of a natural microorganism. The types of protein subunits can be freely selected, and the relative expression level of each protein subunit can be controlled by adjusting the gene order, so that the composition found in the natural microorganism can be mimicked and applied to the development of biomass energy. In addition, when the complex is displayed on the cell surface, the whole cell can be regarded as an immobilized enzyme particle, which facilitates purification and subsequent applications. If the polycistronic expression cassette is designed with thermostable degrading enzymes, the resulting cellulosome will be even more advantageous for biomass energy processes and can overcome a bottleneck of the prior art.

It is believed that a person of ordinary skill in the art to which the present invention pertains can, on the basis of the above description, utilize the present invention to its fullest extent. The description and claims provided should therefore be understood as illustrative only, and not as limiting the remainder of the disclosure in any way.

[Brief Description of the Drawings]

Figure 1 is a schematic diagram of the structure of the cellulosome of Clostridium thermocellum.

[...] a chart of the extracellular and intracellular cellulolytic activity of the cellulosome containing the Clostridium thermocellum CipA, [...], CelA, [...] and XynZ proteins, produced in Bacillus host cells, wherein (a) shows the endoglucanase activity measured using CMC as the substrate; (b) shows the endoglucanase specific activity, determined by normalizing the endoglucanase activity in the sample to its protein content; (c) shows the total glucanase activity measured using MUC as the substrate; and (d) shows the total glucanase specific activity, determined by normalizing the total glucanase activity in the sample to its protein content.

Figure 3 shows [...] produced in Bacillus subtilis containing the Clostridium thermocellum CipA, CelS [...], wherein [...] the glucanase activity measured at different temperatures; and (b) shows the protein content of the control clone and clone 1 at different temperatures.

Figure 4 shows, for Example 6, [...] the sequence order of Mode I and [...].

[...] results, wherein lane [...] is the control group [...].

[...] shows the results of the zymogram electrophoresis analysis of the cellulosome, wherein (a) shows the position of [...] glucanase degradation; (b) shows [...]; (c) shows the position of xylan degradation; and (d) shows the Coomassie blue staining result; and wherein lane [...] is the control, lane [...] is the Mode I sample, and lane 2 is the Mode II sample.

[...] shows the results of the hydrolase activity analysis, wherein (a) [...] the endoglucanase specific activity; and (c) shows the xylanase specific activity [...].

Figures 8-1 to 8-5 show the nucleotide sequence of cipA-celS-celK-celA-xynC-xynZ contained in the polycistronic expression cassette described in Example 1 (SEQ ID NO: 1), wherein the italic portions represent restriction enzyme sites, the underlined portions represent the individual protein-coding regions, and the shaded portions represent ribosome binding sites.

Figures 9-1 to 9-7 show the nucleotide sequence of cipA-celK-celS-celR-sdbA-celA-xynC-xynZ contained in the Mode I polycistronic expression cassette described in Example 6.1 (SEQ ID NO: 2), wherein the italic portions represent restriction enzyme sites, the underlined portions represent the individual protein-coding regions, and the shaded portions represent ribosome binding sites.

Figures 10-1 to 10-7 show the nucleotide sequence of cipA-xynZ-xynC-celA-sdbA-celK-celR-celS contained in the Mode II polycistronic expression cassette described in Example 6.1 (SEQ ID NO: 3), wherein the italic portions represent restriction enzyme sites, the underlined portions represent the individual protein-coding regions, and the shaded portions represent ribosome binding sites.
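Several of the figures described above report a specific activity, that is, an enzyme activity normalized to the protein content of the same sample. The following is a minimal sketch of that normalization, assuming the activity is expressed in units (U) and the protein content in milligrams; the function name, the units and the sample readings are illustrative assumptions and are not data from this specification.

```python
def specific_activity(activity_units: float, protein_mg: float) -> float:
    """Normalize an enzyme activity (U) to protein content (mg), giving U/mg."""
    if protein_mg <= 0:
        raise ValueError("protein content must be positive")
    return activity_units / protein_mg


# Hypothetical readings (activity in U, protein in mg); not values from this specification.
samples = {
    "control": (0.5, 2.0),
    "clone 1": (6.0, 2.5),
}

for name, (activity, protein) in samples.items():
    print(f"{name}: {specific_activity(activity, protein):.2f} U/mg")
```

Normalizing in this way allows samples with different total protein yields to be compared on the same scale.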

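The sequence listing that follows is laid out in rows of ten-base groups, each row closing with a running count of the bases read so far. The following is a minimal sketch of how such rows could be read back into one contiguous sequence and checked against their running counts, assuming that layout; the function name and the short demo rows are hypothetical, and rows containing unreadable characters would need separate handling.

```python
def read_listing_rows(rows):
    """Join bases from rows such as 'acgtacgtac gtacgtacgt 20'.

    A trailing integer on a row is treated as the running base count and
    is checked against the number of bases read so far.
    """
    parts = []
    total = 0
    for row in rows:
        tokens = row.split()
        if not tokens:
            continue  # skip blank rows
        # The condition is evaluated before pop(), so the count is only removed when present.
        expected = int(tokens.pop()) if tokens[-1].isdigit() else None
        bases = "".join(tokens).lower()
        parts.append(bases)
        total += len(bases)
        if expected is not None and expected != total:
            raise ValueError(f"row ends at {total} bases, listing says {expected}")
    return "".join(parts)


# Hypothetical demo rows, not taken from SEQ ID NO: 1.
demo = ["acgtacgtac gtacgtacgt 20", "ttgacc 26"]
sequence = read_listing_rows(demo)
print(len(sequence))
```

Checking the running count row by row localizes a dropped or merged group to a single row instead of surfacing only as a wrong total length.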
<110> Academia Sinica

<120> Polycistronic expression cassettes for producing cellulosomes and applications thereof
<130> ACA0050TW
<150> Republic of China (Taiwan) Invention Patent Application No. 098143083
<151> 2009-12-16
<160> 3
<170> PatentIn version 3.5
<210> 1
<211> 16446
<212> DNA
<213> Artificial sequence
<220>
<223> Recombinant sequence
<400> 1
ggcctgtttg gcctttggga ggaatggtag atgagaaaag tcatcagtat gctcttagtt 60
gtggctatgc tgacgacgat ttttgcggcg atgataccgc agacagtatc ggc^gccaca 120
atgacagtcg agatcggcaa agttacagca gccgttggat caaaagtaga aatacctata 180
accctgaaag gagtgccatc caaaggaatg gccaattgcg acttcgtatt gggttatgat 240
ccaaatgtgc tggaagtaac agaagtaaaa ccaggaagca taataaaaga tccggatcct 300
agcaagagct ttgatagcgc aatatatccg gatcgaaaga tgattgtatt tctgtttgca 360
gaagacagtg gaagaggaac gtatgcaata actcaggatg gagtatttgc aacaattgta 420
gccactgtca aatcagctgc agcggcaccg attactttgc ttgaagtagg tgcatttgcg 480
gacaacgatt tagtagaaat aagcacaact tttgtcgcgg gcggagtaaa tcttggtagt 540
tccgtaccga caacacagcc aaatgttccg tcagacggtg tggtagtaga aattggcaaa 600
gttacgggat ctgttggaac tacagttgaa atacctgtat atttcagagg agttccatcc 660
aaaggaatag caaactgcga ctttgtgttc agatatgatc cgaatgtatt ggaaattata 720
gggatagatc ccggagacat aatagttgac ccgaatccta ccaagagctt tgatactgca 780
atatatcctg acagaaagat aatagtattc ctgtttgcgg aagacagcgg aacaggagcg 840
tatgcaataa ctaaagacgg agtatttgca aaaataagag caactgtaaa atcaagtgct 900
ccgggctata ttactttcga cgaagtaggt ggatttgcag ataatgacct ggtagaacag 960
aaggtatcat ttatagacgg tggtgttaac gttggcaatg caacaccgac caagggagca 1020
acaccaacaa atacagctac gccgacaaaa tcagctacgg ctacgcccac caggccatcg 1080
gtaccgacaa acacaccgac aaacacaccg gcaaatacac cggtatcagg caatttgaag 1140
gttgaattct acaacagcaa tccttcagat actactaact caatcaatcc tcagttcaag 1200
gttactaata ccggaagcag tgcaattgat ttgtccaaac tcacattgag atattattat 1260
acagtagacg gacagaaaga tcagaccttc tggtgtgacc atgctgcaat aatcggcagt 1320
aacggcagct acaacggaat tacttcaaat gtaaaaggaa catttgtaaa aatgagttcc 1380
tcaacaaata acgcagacac ctaccttgaa ataagcttta caggcggaac tcttgaaccg 1440
ggtgcacatg ttcagataca aggtagattt gcaaagaatg actggagtaa ctatacacag 1500
tcaaatgact actcattcaa gtctgcttca cagtttgttg aatgggatca ggtaacagca 1560
tacttgaacg gtgttcttgt atggggtaaa gaacccggtg gcagtgtagt accatcaaca 1620
cagcctgtaa caacaccacc tgcaacaaca aaaccacctg caacaacaaa accacctgca 1680
acaacaatac cgccgtcaga tgatccgaat gcaataaaga ttaaggtgga cacagtaaat 1740
gcaaaaccgg gagacacagt aaatatacct gtaagattca gtggtatacc atccaaggga 1800
atagcaaact gtgactttgt atacagctat gacccgaatg tacttgagat aatagagata 1860
aaaccgggag aattgatagt tgacccgaat cctgacaaga gctttgatac tgcagtatat 1920
cctgacagaa agataatagt attcctgttt gcagaagaca gcggaacagg agcgtatgca 1980
ataactaaag acggagtatt tgctacgata gtagcgaaag taaaatccgg agcacctaac 2040
ggactcagtg taatcaaatt tgtagaagta ggcggatttg cgaacaatga ccttgtagaa 2100
cagaggacac agttctttga cggtggagta aatgttggag atacaacagt acctacaaca 2160
cctacaacac ctgtaacaac accgacagat gattegaatg cagtaaggat taaggtggac 2220
acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 2280
tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 2340
atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 2400
gcagtatatc ctgacagaaa gataatagta ttcctgtttg cggaagacag cggaacagga 2460
gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaatccgga 2520
gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 2580
cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagaa 2640
cctgcaacac ctacaacacc tgtaacaaca ccgacaacaa cagatgatct ggatgcagta 2700
aggattaaag tggacacagt aaatgcaaaa ccgggagaca cagtaagaat acctgtaaga 2760
ttcagcggta taccatccaa gggaatagca aactgtgact ttgtatacag ctatgacccg 2820
aatgtacttg agataataga gatagaaccg ggagacataa tagttgaccc gaatcctgac 2880
aagagctttg atactgcagt atatcctgac agaaagataa tagtattcct gtttgcggaa 2940
gacagcggaa caggagcgta tgcaataact aaagacggag tatttgctac gatagtagcg 3000
aaagtaaaat ccggagcacc taacggactc agtgtaatca aatttgtaga agtaggcgga 3060
tttgcgaaca atgaccttgt agaacagaag acacagttct ttgacggtgg agtaaatgtt 3120
ggagatacaa cagaacctgc aacacctaca acacctgtaa caacaccgac aacaacagat 3180
gatctggatg cagtaaggat taaagtggac acagtaaatg caaaaccggg agacacagta 3240
agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta 3300
tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt 3360
gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta 3420
ttcctgtttg cagaagacag cggaacagga gcgtatgcaa taactaaaga cggagtattt 3480
gctacgatag tagcgaaagt aaaagaagga gcacctaacg gactcagtgt aatcaaattt 3540
gtagaagtag gcggatttgc gaacaatgac cttgtagaac agaagacaca gttctttgac 3600
ggtggagtaa atgttggaga tacaacagaa cctgcaacac ctacaacacc tgtaacaaca 3660
ccgacaacaa cagatgatct ggatgcagta aggattaaag tggacacagt aaatgcaaaa 3720
ccgggagaca cagtaagaat acctgtaaga ttcagcggta taccatccaa gggaatagca 3780
aactgtgact ttgtatacag ctatgacccg aatgtacttg agataataga gatagaaccg 3840
ggagaattga tagttgaccc gaatcctacc aagagctttg atactgcagt atatcctgac 3900

agaaagatga tagtattcct gtttgcggaa gacagcggaa caggagcgta tgcaataact 3960
gaagatggag tatttgctac gatagtagcg aaagtaaaat ccggagcacc taacggactc 4020
agtgtaatca aatttgtaga agtaggcgga tttgcgaaca atgaccttgt agaacagaag 4080
acacagttct ttgacggtgg agtaaatgtt ggagatacaa cagaacctgc aacacctaca 4140
acacctgtaa caacaccgac aacaacagat gatctggatg cagtaaggat taaagtggac 4200
acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 4260
tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 4320
atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 4380
gcagtatatc ctgacagaaa gataatagta ttcctgtttg cagaagacag cggaacggga 4440
gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaagaagga 4500
gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 4560
cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagta 4620
cctacaacat cgccgacaac aacaccgcca gagccgacga taactccgaa caagttgaca 4680
cttaagatag gcagagcaga aggaagacct ggagacacgg tggaaatacc ggttaacttg 4740
tatggagtac ctcaaaaagg aatagcaagc ggtgacttcg tagtaagcta tgacccgaat 4800
gtacttgaga taatagagat agaaccggga gaattgatag ttgacccgaa tcctaccaag 4860
agctttgata ctgcagtata tcctgacaga aagatgatag tattcctgtt tgcggaagac 4920
agcggaacag gagcgtatgc aataactgaa gatggagtat ttgctacgat agtagcgaaa 4980
gtaaaagaag gagcacctga aggattcagt gcaatagaaa tttctgagtt tggtgcattt 5040
gcagataatg atctggtaga agtggaaact gaccttatca atggtggagt acttgtaact 5100
aataaacctg taatagaagg atataaagta tccggataca ttttgccaga cttctccttc 5160
gacgctactg ttgcaccact tgtaaaggcc ggattcaaag ttgaaatagt aggaacagaa 5220
ttgtatgcag taacagatgc aaacggatac tttgaaataa ccggagtacc tgcaaatgca 5280
agcggatata cattgaagat ttcaagagca acttacttgg acagagtaat tgcaaatgtt 5340
gtagtaacgg gagatacttc agtttcaact tcacaggctc caataatgat gtgggtagga 5400
gacatagtga aagacaattc tatcaacctg ttggacgttg cagaagttat ccgttgcttc 5460
aacgctacta aaggaagcgc aaactacgta gaagaacttg acattaatag aaacggcgca 5520
attaacatgc aagacataat gattgttcat aagcactttg gagctacatc aagtgattac 5580
gacgcacagt aacactgacg gctaacggga ggtagattta tgaatttcag aagaatgttg 5640
tgcgcagcca tagtgttgac aattgtactg tccattatgc tgccgtcaac tgtttttgct 5700
ttggaagaca agtctccaaa gttgccggat tataaaaacg accttttgta tgaaagaaca 5760
ttcgacgaag gtctttgctt tccgtggcat acttgcgaag acagtggagg aaaatgtgat 5820
ttcgctgttg ttgatgttcc aggagagcct gggaacaaag ctttccgctt gacagtaatt 5880
gacaaaggac aaaacaagtg gagtgtccag atgagacaca gaggtattac cctcgagcaa 5940
ggacatacat acacggtaag gtttacgatt tggtctgaca aatcctgtag ggtttatgct 6000
aaaattggtc agatgggtga accctatact gaatattgga acaataactg gaatccattc 6060
aaccttacac caggacagaa gcttacagtt gaacagaatt ttacaatgaa ctatcctact 6120
gatgacacat gcgagttcac attccatttg ggtggagaac ttgctgcagg tacaccttac 6180
tatgtttacc ttgatgatgt atctctctac gatcctaggt ttgtaaagcc tgttgaatat 6240
gtacttccgc agccggatgt acgtgttaac caggtaggat acttaccgtt tgcaaagaag 6300
tatgctactg ttgtatcttc ttcaaccagc ccgcttaagt ggcagcttct caattcggca 6360
aatcaggttg ttttggaagg taatacaata ccaaaaggac ttgacaaaga ttcacaggat 6420
tatgtacatt ggatagattt ctccaacttt aagactgaag gaaaaggtta ttacttcaag 6480
cttccgactg taaacagcga tacaaattac agccatcctt tcgatatcag tgctgatatt 6540
tactccaaga tgaaatttga tgcattggca ttcttctatc acaagagaag cggtattcct 6600
attgaaatgc cgtatgcagg aggagaacag tggaccagac ctgcaggaca tattggtgtt 6660
gctccgaaca aaggagacac aaatgttcct acatggcctc aggatgatga atatgcagga 6720
agacctcaaa aatattatac aaaagatgta accggtggat ggtatgatgc cggtgaccac 6780
pgtaaatatg ttgtaaacgg cggtatagct gtttggacat tgatgaacat gtatgaaagg 6840
ncaaaaatca gaggcatagc taatcaaggt gcttataaag acggtggaat gaacataccg 6900
aagagaaata acggttatcc ggacattctt gatgaagcaa gatgggaaat tgagttcttt 6960
aagaaaatgc aggtaactga aaaagaggat ccttccatag ccggaatggt acaccacaaa 7020
attcacgact tcagatggac tgctttgggt atgttgcctc acgaagatcc ccagccacgt 7080
tacttaaggc cggtaagtac ggctgcgact ttgaactttg cggcaacttt ggcacaaagt 7140
gcacgtcttt ggaaagatta tgatccgact tttgctgctg actgtttgga aaaggctgaa 7200
atagcatggc aggcggcatt aaagcatcct gatatttatg ctgagtatac tcccggtagc 7260
ggtggtcccg gaggcggacc atacaatgac gactatgtcg gagacgaatt ctactgggca 7320
gcctgcgaac tttatgtaac aacaggaaaa gacgaatata agaattacct gatgaattca 7380
cctcactatc ttgaaatgcc tgcaaagatg ggtgaaaacg gtggagcaaa cggagaagac 7440
aacggattgt ggggatgctt cacctgggga actactcaag gattgggaac tattactctt 7500
gcattagttg aaaacggatt gccgtctgca gacattcaaa aggcaagaaa caatatagct 7560
aaagctgcag acaaatggct tgagaatatt gaagagcaag gttacagact gccgatcaaa 7620
caggcggagg atgagagagg cggttatcca tggggttcaa actccttcat tttgaaccag 7680
atgatagtta tgggatacgc atatgacttt acaggcaaca gcaagtatct tgacggaatg 7740
caggatggta tgagctacct gttgggaaga aacggactgg atcagtccta tgtaacaggg 7800
tatggtgagc gtccacttca gaatcctcat gacagattct ggacgccaca gacaagtaag 7860
aaattccctg ctocacctcc gggtataatt gccggtggtc cgaactcccg tttcgaagac 7920
ccgacaataa ctgcagcagt taagaaggat acaccgccgc agaagtgcta cattgaccat 7980
acagactcat ggtcaaccaa cgagataact attaactgga atgctccgtt tgcatgggtt 8040
acagcttatc tcgatgaaat tgacttaata acaccgccag gaggagtaga cccagaagaa 8100
ccggaggtta tttatggtga ctgcaatggc gacggaaaag ttaattcaac tgacgctgtg 8160
gcattgaaga gatatatctt gagatcaggt ataagcatca acactgataa tgctgatgta 8220
aatgctgatg gcagagttaa ctctacagac ttggcaatat tgaagagata taMcttaaa 8280
gagatagatg tattgccaca taaataagcc acgagtgagg ggaagatgga gagaatggta 8340
aaaagcagaa agatttctat tctgttggca gttgcaatgc tggtatccat aatgataccc 8400
acaactgcat tcgcaggtcc tacaaaggca cctacaaaag atgggacatc ttataaggat 8460

cttttccttg aactctacgg aaaaattaaa gatcctaaga acggatattt cagcccagac 8520
gagggaattc cttatcactc aattgaaaca ttgatcgttg aagcgccgga ctacggtcac 8580
gttactacca gtgaggcttt cagctattat gtatggcttg aagcaatgta tggaaatctc 8640
acaggcaact ggtccggagt agaaacagca tggaaagtta tggaggattg gataattcct 8700
gacagcacag agcagccggg tatgtcttct tacaatccaa acagccctgc cacatatgct 8760
gacgaatatg aggatccttc atactatcct tcagagttga agtttgatac cgtaagagtt 8820
ggatccgacc ctgtacacaa cgaccttgta tccgcatacg gtcctaacat gtacctcatg 8880
cactggttga tggacgttga caactggtac ggttttggta caggaacacg ggcaacattc 8940
ataaacacct tccaaagagg tgaacaggaa tccacatggg aaaccattcc tcatccgtca 9000
atagaagagt tcaaatacgg cggaccgaac ggattccttg atttgtttac aaaggacaga 9060
tcatatgcaa aacagtggcg ttatacaaac gctcctgacg cagaaggccg tgctatacag 9120
gctgtttact gggcaaacaa atgggcaaag gagcagggta aaggttctgc cgttgcttcc 9180
gttgtatcca aggctgcaaa gatgggtgac ttcttgagaa acgacatgtt cgacaaatac 9240
ttcatgaaga tcggtgcaca ggacaagact cctgctaccg gttatgacag tgcacactac 9300
cttatggcct ggtatactgc atggggtggt ggaattggtg catcctgggc atggaagatc 9360
ggatgcagcc acgcacactt cggatatcag aacccattcc agggatgggt aagtgcaaca 9420
cagagcgact ttgctcctaa atcatccaac ggtaagagag actggacaac aagctacaag 9480
agacagcttg aattctatca gtggttgcag tcggctgaag gtggtattgc cggtggagca 9540
accaactcct ggaacggtag atatgagaaa tatcctgctg gtacgtcaac gttctatggt 9600
atggcatatg ttccgcatcc tgtatacgct gacccgggta gtaaccagtg gttcggattc 9660
caggcatggt caatgcagcg tgtaatggag tactacctcg aaacaggaga ttcatcagtt 9720
aagaatttga ttaagaagtg ggtcgactgg gtaatgagcg aaattaagct ctatgacgat 9780
ggaacatttg caattcctag cgacctcgag tggtcaggtc agcctgatac atggaccgga 9840
acatacacag gcaacccgaa cctccatgta agagtaactt cttacggtac tgaccttggt 9900
gttgcaggtt cacttgcaaa tgctcttgca acttatgccg cagctacaga aagatgggaa 9960
ggaaaacttg atacaaaagc aagagacatg gctgctgaac tggttaaccg tgcatggtac 10020
aacttctact gctctgaagg aaaaggtgtt gttactgagg aagcacgtgc tgactacaaa 10080
cgtttctttg agcaggaagt atacgttccg gcaggttgga gcggtactat gccgaacggt 10140
gacaagattc agcctggtat taagttcata gacatccgta caaaatatag acaagatcct 10200
tactacgata tagtatatca ggcatacttg agaggcgaag ctcctgtatt gaattatcac 10260
cgcttctggc atgaagttga ccttgcagtt gcaatgggtg tattggctac atacttcccg 10320
gatatgacat ataaagtacc tggtactcct tctactaaat tatacggcga cgtcaatgat 10380
gacggaaaag ttaactcaac tgacgctgta gcattgaaga gatatgtttt gagatcaggt 10440
ataagcatca acactgacaa tgccgatttg aatgaagacg gcagagttaa ttcaactgac 10500
ttaggaattt tgaagagata tattctcaaa gaaatagata cattgccgta caagaactaa 10560
cacatggtga aaaggaggaa aaaaaagtga agaacgtaaa aaaaagagta ggtgtggttt 10620
tgctgattct tgcagtgttg ggggtttata tgttggcaat gccggcaaac actgtgtcag 10680
cggcaggtgt gccttttaac acaaaatacc cctatggtcc tacttctatt gccgataatc 10740
agtcggaagt aactgcaatg ctcaaagcag aatgggaaga ctggaagagc aagagaatta 10800
cctcgaacgg tgcaggagga tacaagagag tacagcgtga tgcttccacc aattatgata 10860
cggtatccga aggtatggga tacggacttc ttttggcggt ttgctttaac gaacaggctt 10920
tgtttgacga tttataccgt tacgtaaaat ctcatttcaa tggaaacgga cttatgcact 10980
ggcacattga tgccaacaac aatgttacaa gtcatgacgg cggcgacggt gcggcaaccg 11040
atgctgatga ggatattgca cttgcgctca tatttgcgga caagttatgg ggttcttccg 11100
gtgcaataaa ctacgggcag gaagcaagga cattgataaa caatctttac aaccattgtg 11160
tagagcatgg atcctatgta ttaaagcccg gtgacagatg gggaggttca tcagtaacaa 11220
acccgtcata ttttgcgcct gcatggtaca aagtgtatgc tcaatataca ggagacacaa 11280
gatggaatca agtggcggac aagtgttacc aaattgttga agaagttaag aaatacaaca 11340
acggaaccgg ccttgttcct gactggtgta ctgcaagcgg aactccggca agcggtcaga 11400
gttacgacta caaatatgat gctacacgtt acggctggag aactgccgtg gactattcat 11460
ggtttggtga ccagagagca aaggcaaact gcgatatgct gaccaaattc tttgccagag 11520
acggggcaaa aggaatcgtt gacggataca caattcaagg ttcaaaaatt agcaacaatc 11580
acaacgcatc atttatagga cctgttgcgg cagcaagtat gacaggttac gatttgaact 11640
ttgcaaagga actttatagg gagactgttg ctgtaaagga cagtgaatat tacggatatt 11700
acggaaacag cttgagactg ctcactttgt tgtacataac aggaaacttc ccgaatcctt 11760
tgagtgacct ttccggccaa ccgacaccac cgtcgaatcc gacaccttca ttgcctcctc 11820
uggttgttta cggtgatgta aatggcgacg gtaatgttaa ctccactgat ttgactatgt 11880
taaaaagata tctgctgaag agtgttacca atataaacag agaggctgca gacgttaatc 11940
litgacggtgc gattaactcc tctgacatga ctatattaaa gagatatctg ataaagagca 12000
taccccacct accttattag cactgggtgt tttgggaggt agatctatgc tgaagaaaaa 12060
actgttgacc cttttgacag tctttgctct gctgactgtc ggtatctgcg gaagtttttt 12120
gccgttaccc aaagcatccg cagcagctct gatttacgat gattttgaaa caggtctgaa 12180
cggatgggga ccaagaggac cggaaaccgt cgaacttacc accgaggaag cttactcggg 12240
aagatacagt ttgaaggtca gcggacgtac cagcacatgg aacgggccca tggttgacaa 12300
aaccgatgtg ttgactttgg gcgaaagcta taagttgggc gtatatgtaa aattcgtggg 12360
tgattcctat tcaaatgagc aaagattcag tttgcagctt caatataacg acggagcagg 12420
agatgtatac caaaatataa aaaccgccac ggtttacaag ggaacatgga ctttgctgga 12480
aggacagctt acagttccca gccatgcaaa ggacgtaaaa atatatgtgg aaaccgaatt 12540
taaaaattct ccgagtccgc aggacttgat ggatttctat attgacgatt tcacagcaac 12600
acctgcaaat ttgcctgaaa ttgagaaaga tattccaagc ttgaaagatg tctttgccgg 12660
ttatttcaaa gtgggtggtg ccgcaactgt ggcggaactg gcgccgaagc ctgcaaaaga 12720
gcttttcctc aagcattata acagcttgac ttttggtaat gagttaaaac cggaaagtgt 12780
acttgactat gatgctacaa ttgcttatat ggaggcaaac ggaggcgacc aggttaatcc 12840
gcagataacc ttgagagcgg caagacccct gttggagttt gcgaaagaac acaacatacc 12900
tgtaagagga catacccttg tatggcacag ccagacaccg gactggttct tcagagaaaa 12960
ttactctcag gacgaaaatg ctccctgggc atccaaggaa gtaatgctgc aaaggttgga 13020

201122105 aaactacata aagaatttaa tggaagcttt ggcgaccgaa tatccgacgg ttaagttcta tgcatgggac gttgtgaatg aggctgttga tcctaatact tcagacggta tgagaactcc gggttcgaat aacaaaaatc ccggaagctc cctgtggatg caaaccgttg gaagagattt tattgttaaa gcttttgaat atgcaagaaa atatgctcct gcggattgta aactcttcta caatgactat aatgaatatg aagacagaaa atgtgatttt attattgaaa ttcttaccga acttaaagcc aaaggcctgg ttgacggtat gggtatgcaa tcccactggg ttatggatta tccaagcata agcatgtttg aaaaatccat cagaagatat gcagcattgg gattggaaat tcagcttacc gagctggata taagaaatcc tgacaacagc cagtgggctt tggaacgtca ggctaatcgt tataaggagc ttgtaacaaa attggtcgat ttgaaaaaag aaggcataaa cattacggca ttggtattct ggggaataac cgacgcgaca agctggcttg gaggatatcc gctcctgttt gacgcggaat acaaggcaaa acctgcattt tatgctatag ttaacagcgt tccgccgctt ccgacagaac cgccggttca ggttataccc ggtgatgtaa acggtgacgg tcgtgtaaat tcatccgact tgactcttat gaaaagatac cttttaaaat ccataagcga cttcccgaca ccggaaggaa aaattgcggc ggatttaaac gaagacggca aggtaaactc gacagatttg ttagcgctga aaaaactcgt tctgagagaa ctttgacact aggtgcaaaa aggaggagaa acatgtcaag aaaacttttc agtgtattac ttgttggctt gatgcttatg acatcgttgc ttgtcacaat aagcagtaca tcagcggcat ccttgccaac catgccgcct tcgggatatg accaggtaag gaacggcgtt ccgagagggc aggtcgtaaa tatttcttat ttctccacgg ccaccaacag taccaggccg gcaagagttt atttgccgcc gggatattca aaggacaaaa aatacagtgt tttgtatctc ttacacggca taggcggtag tgaaaacgac tggttcgaag ggggaggcag agccaatgtt attgccgaca atctgattgc cgagggaaaa atcaagcccc tgataattgt aacaccgaat actaacgccg ccggtccggg aatagcggac ggttatgaaa atttcacaaa agatttgctc aacagtctta ttccctatat cgaatctaac tattcagtct acaccgaccg cgaacatcgg gcgattgcag gactttcaat gggtggagga caatcgttta atattggatt gaccaatctc gataaatttg cctatattgg cccgatttca gcggctccaa acacttatcc aaatgagagg ctttttcctg acggaggaaa agctgcaagg gagaaattga aactgctctt tattgcctgc ggaaccaatg acagtctgat aggttttgga cagagagtac atgaatattg cgttgccaac aacattaacc atgtctattg gcttattcag ggcggaggac acgattttaa tgtgtggaag cccggattgt ggaatttcct tcaaatggca gatgaagccg gattgacgag ggatggaaac actccggttc cgacacccag tccaaagccg 
gctaacacac gtattgaagc ggaagattat gacggtatta attcttcaag tattgagata ataggtgttc cacctgaagg aggcagagga ataggttata ttaccagtgg tgattatctg gtatacaaga gtatagactt tggaaacgga gcaacgtcgt ttaaggccaa ggttgcaaat gcaaatactt ccaatattga acttagatta aacggtccga atggtactct cataggcaca ctctcggtaa aatccacagg agattggaat acatatgagg agcaaacttg cagcattagc aaagtcaccg gaataaatga tttgtacttg gtattcaaag gccctgtaaa catagactgg ttcacttttg gcgttgaaag cagttccaca ggtctggggg atttaaatgg tgacggaaat attaactcgt cggaccttca ggcgttaaag aggcatttgc tcggtatatc accgcttacg 第7頁 13080 13140 13200 13260 13320 13380 13440 13500 13560 13620 13680 13740 13800 13860 13920 13980 14040 14100 14160 14220 14280 14340 14400 14460 14520 14580 14640 14700 14760 14820 14880 14940 15000 15060 15120 15180 15240 15300 201122105 ggagaggctc ttttaagagc ggatgtaaat aggagcggca aagtggattc tactgactat tcagtgctga aaagatatat actccgcatt attacagagt tccccggaca aggtgatgta cagacaccca atccgtctgt tactccgaca caaactccta tccccacgat ttcgggaaat gctcttaggg attatgcgga ggcaagggga ataaaaatcg gaacatgtgt caactatccg ttttacaaca attcagatcc aacctacaac agcattttgc aaagagaatt ttcaatggtt gtatgtgaaa atgaaatgaa gtttgatgct ttgcagccga gacaaaacgt ttttgatttt tcgaaaggag accagttgct tgcttttgca gaaagaaacg gtatgcagat gaggggacat acgttgattt ggcacaatca aaacccgtca tggcttacaa acggtaactg gaaccgggat tcgctgcttg cggtaatgaa aaatcacatt accactgtta tgacccatta caaaggtaaa attgttgagt gggatgtggc aaacgaatgt atggatgatt ccggcaacgg cttaagaagc agcatatgga gaaatgtaat cggtcaggac taccttgact atgctttcag gtatgcaaga gaagcagatc ccgatgcact tcttttctac aatgattata atattgaaga cttgggtcca aagtccaatg cggtatttaa catgattaaa agtatgaagg aaagaggtgt gccgattgac ggagtaggat tccaatgcca ctttatcaat ggaatgagcc ccgagtacct tgccagcatt gatcaaaata ttaagagata tgcggaaata ggcgttatag tatcctttac cgaaatagat atacgcatac ctcagtcgga aaacccggca actgcattcc aggtacaggc aaacaactat aaggaactta tgaaaatttg tctggcaaac cccaattgca atacctttgt aatgtgggga ttcacagata aatacacatg gattccggga actttcccag gatatggcaa tccattgatt tatgacagca attacaatcc gaaaccggca 
tacaatgcaa taaaggaagc tcttatgggc tattga <210> 2 <211> 20869 <212> DNA <213> Artificial sequence <220> <223> Recombinant sequence <400> 2 ggcctgtttg gcctttggga ggaatggtag atgagaaaag tcatcagtat gctcttagtt gtggctatgc tgacgacgat ttttgcggcg atgataccgc agacagtatc ggcggccaca atgacagtcg agatcggcaa agttacagca gccgttggat caaaagtaga aatacctata accctgaaag gagtgccatc caaaggaatg gccaattgcg acttcgtatt gggttatgat ccaaatgtgc tggaagtaac agaagtaaaa ccaggaagca taataaaaga tccggatcct agcaagagct ttgatagcgc aatatatccg gatcgaaaga tgattgtatt tctgtttgca gaagacagtg gaagaggaac gtatgcaata actcaggatg gagtatttgc aacaattgta gccactgtca aatcagctgc agcggcaccg attactttgc ttgaagtagg tgcatttgcg gacaacgatt tagtagaaat aagcacaact tttgtcgcgg gcggagtaaa tcttggtagt tccgtaccga caacacagcc aaatgttccg tcagacggtg tggtagtaga aattggcaaa gttacgggat ctgttggaac tacagttgaa atacctgtat atttcagagg agttccatcc aaaggaatag caaactgcga ctttgtgttc agatatgatc cgaatgtatt ggaaattata gggatagatc ccggagacat aatagttgac ccgaatccta ccaagagctt tgatactgca 15360 15420 15480 15540 15600 15660 15720 15780 15840 15900 15960 16020 16080 16140 16200 16260 16320 16380 16440 16446 60 120 180 240 300 360 420 480 540 600 660 720 780 840 840

201122105 atatatcctg acagaaagat aatagtattc ctgtttgcgg aagacagcgg aacaggagcg tatgcaataa ctaaagacgg agtatttgca aaaataagag caactgtaaa atcaagtgct ccgggctata ttactttcga cgaagtaggt ggatttgcag ataatgacct ggtagaacag aaggtatcat ttatagacgg tggtgttaac gttggcaatg caacaccgac caagggagca acaccaacaa atacagctac gccgacaaaa tcagctacgg ctacgcccac caggccatcg gtaccgacaa acacacegac aaacacaccg gcaaatacac cggtatcagg caatttgaag gttgaattct acaacagcaa tccttcagat actactaact caatcaatcc tcagttcaag gttactaata ccggaagcag tgcaattgat ttgtccaaac tcacattgag atattattat acagtagacg gacagaaaga tcagaccttc tggtgtgacc atgctgcaat aatcggcagt aacggcagct acaacggaat tacttcaaat gtaaaaggaa catttgtaaa aatgagttcc tcaacaaata acgcagacac ctaccttgaa ataagcttta caggcggaac tcttgaaccg ggtgcacatg ttcagataca aggtagattt gcaaagaatg actggagtaa ctatacacag tcaaatgact actcattcaa gtctgcttca cagtttgttg aatgggatca ggtaacagca tacttgaacg gtgttcttgt atggggtaaa gaacccggtg gcagtgtagt accatcaaca cagcctgtaa caacaccacc tgcaacaaca aaaccacctg caacaacaaa accacctgca acaacaatac cgccgtcaga tgatccgaat gcaataaaga ttaaggtgga cacagtaaat gcaaaaccgg gagacacagt aaatatacct gtaagattca gtggtatacc atccaaggga atagcaaact gtgactttgt atacagctat gacccgaatg tacttgagat aatagagata aaaccgggag aattgatagt tgacccgaat cctgacaaga gctttgatac tgcagtatat cctgacagaa agataatagt attcctgttt gcagaagaca gcggaacagg agcgtatgca ataactaaag acggagtatt tgctacgata gtagcgaaag taaaatccgg agcacctaac ggactcagtg taatcaaatt tgtagaagta ggcggatttg cgaacaatga ccttgtagaa cagaggacac agttctttga cggtggagta aatgttggag atacaacagt acctacaaca cctacaacac ctgtaacaac accgacagat gattcgaatg cagtaaggat taaggtggac acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta ttcctgtttg cggaagacag cggaacagga gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaatccgga gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 
cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagaa cctgcaacac ctacaacacc tgtaacaaca ccgacaacaa cagatgatct ggatgcagta aggattaaag tggacacagt aaatgcaaaa ccgggagaca cagtaagaat acctgtaaga ttcagcggta taccatccaa gggaatagca aactgtgact ttgtatacag ctatgacccg aatgtacttg agataataga gatagaaccg ggagacataa tagttgaccc gaatcctgac aagagctttg atactgcagt atatcctgac agaaagataa tagtattcct gtttgcggaa gacagcggaa caggagcgta tgcaataact aaagacggag tatttgctac gatagtagcg aaagtaaaat ccggagcacc taacggactc agtgtaatca aatttgtaga agtaggcgga 第9頁 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 I860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 201122105 tttgcgaaca atgaccttgt agaacagaag acacagttct ttgacggtgg agtaaatgtt 3120 ggagatacaa cagaacctgc aacacctaca acacctgtaa caacaccgac aacaacagat 3180 gatctggatg cagtaaggat taaagtggac acagtaaatg caaaaccggg agacacagta 3240 agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta 3300 tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt 3360 gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta 3420 ttcctgtttg cagaagacag cggaacagga gcgtatgcaa taactaaaga cggagtattt 3480 gctacgatag tagcgaaagt aaaagaagga gcacctaacg gactcagtgt aatcaaattt 3540 gtagaagtag gcggatttgc gaacaatgac cttgtagaac agaagacaca gttctttgac 3600 gntggagtaa atgttggaga tacaacagaa cctgcaacac ctacaacacc tgtaacaaca 3660 ccgacaacaa cagatgatct ggatgcagta aggattaaag tggacacagt aaatgcaaaa 3720 ccgggagaca cagtaagaat acctgtaaga ttcagcggta taccatccaa gggaatagca 3780 aactgtgact ttgtatacag ctatgacccg aatgtacttg agataataga gatagaaccg 3840 ggagaattga tagttgaccc gaatcctacc aagagctttg atactgcagt atatcctgac 3900 agaaagatga tagtattcct gtttgcggaa gacagcggaa caggagcgta tgcaataact 3960 gaagatggag tatttgctac gatagtagcg aaagtaaaat ccggagcacc taacggactc 4020 agtgtaatca aatttgtaga agtaggcgga tttgcgaaca atgaccttgt agaacagaag 4080 acacagttct ttgacggtgg agtaaatgtt ggagatacaa cagaacctgc aacacctaca 
4140 acacctgtaa caacaccgac aacaacagat gatctggatg cagtaaggat taaagtggac 4200 acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 4260 tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 4320 atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 4380 gcagtatatc ctgacagaaa gataatagta ttcctgtttg cagaagacag cggaacggga 4440 j;cgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaagaagga 4500 gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 4560 cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagta 4620 cctacaacat cgccgacaac aacaccgcca gagccgacga taactccgaa caagttgaca 4680 cttaagatag gcagagcaga aggaagacct ggagacacgg tggaaatacc ggttaacttg 4740 tatggagiac ctcaaaaagg aatagcaagc ggtgacttcg tagtaagcta tgacccgaat 4800 gtacttgaga taatagagat agaaccggga gaattgatag ttgacccgaa tcctaccaag 4860 agctttgata ctgcagtata tcctgacaga aagatgatag tattcctgtt tgcggaagac 4920 agcggaacag gagcgtatgc aataactgaa gatggagtat ttgctacgat agtagcgaaa 4980 gtaaaagaag gagcacctga aggattcagt gcaatagaaa tttctgagtt tggtgcattt 5040 gcagataatg atctggtaga agtggaaact gaccttatca atggtggagt acttgtaact 5100 aataaacctg taatagaagg atataaagta tccggataca ttttgccaga cttctccttc 5160 gacjictactg ttgcaccact tgtaaaggcc ggattcaaag ttgaaatagt aggaacagaa 5220 ttgtatgcag taacagatgc aaacggatac tttgaaataa ccggagtacc tgcaaatgca 5280 agcggatata cattgaagat ttcaagagca acttacttgg acagagtaat tgcaaatgtt 5340

gtagtaacgg gagatacttc agtttcaact tcacaggctc caataatgat gtgggtagga 5400 gacatagtga aagacaattc tatcaacctg ttggacgttg cagaagttat ccgttgcttc 5460 aacgctacta aaggaagcgc aaactacgta gaagaacttg acattaatag aaacggcgca 5520 attaacatgc aagacataat gattgttcat aagcactttg gagctacatc aagtgattac 5580 gacgcacagt aacactgacg gctaacggga ggtagattta tgaatttcag aagaatgttg 5640 tgcgcagcca tagtgttgac aattgtactg tccattatgc tgccgtcaac tgtttttgct 5700 ttggaagaca agtctccaaa gttgccggat tataaaaacg accttttgta tgaaagaaca 5760 ttcgacgaag gtctttgctt tccgtggcat acttgcgaag acagtggagg aaaatgtgat 5820 ttcgctgttg ttgatgttcc aggagagcct gggaacaaag ctttccgctt gacagtaatt 5880 gacaaaggac aaaacaagtg gagtgtccag atgagacaca gaggtattac cctcgagcaa 5940 ggacatacat acacggtaag gtttacgatt tggtctgaca aatcctgtag ggtttatgct 6000 aaaattggtc agatgggtga accctatact gaatattgga acaataactg gaatccattc 6060 aaccttacac caggacagaa gcttacagtt gaacagaatt ttacaatgaa ctatcctact 6120 gatgacacat gcgagttcac attccatttg ggtggagaac ttgctgcagg tacaccttac 6180 tatgtttacc ttgatgatgt atctctctac gatcctaggt ttgtaaagcc tgt tgaatat 6240 gtacttccgc agccggatgt acgtgttaac caggtaggat acttaccgtt tgcaaagaag 6300 tatgctactg ttgtatcttc ttcaaccagc ccgcttaagt ggcagcttct caattcggca 6360 aatcaggttg ttttggaagg taatacaata ccaaaaggac ttgacaaaga ttcacaggat 6420 tatgtacatt ggatagattt ctccaacttt aagactgaag gaaaaggtta ttacttcaag 6480 cttccgactg taaacagcga tacaaattac agccatcctt tcgatatcag tgctgatatt 6540 tactccaaga tgaaatttga tgcattggca ttcttctatc acaagagaag cggtattcct 66⑻ attgaaatgc cgtatgcagg aggagaacag tggaccagac ctgcaggaca tattggtgtt 6660 gctccgaaca aaggagacac aaatgttcct acatggcctc aggatgatga atatgcagga 6720 agacctcaaa aatattatac aaaagatgta accggtggat ggtatgatgc cggtgaccac 6780 ggtaaatatg ttgtaaacgg cggtatagct gtttggacat tgatgaacat gtatgaaagg 6840 gcaaaaatca gaggcatagc taatcaaggt gcttataaag acggtggaat gaacataccg 6900 gagagaaata acggttatcc ggacattctt gatgaagcaa gatgggaaat tgagttcttt 6960 aagaaaatgc aggtaactga aaaagaggat ccttccatag ccggaatggt acaccacaaa 7020 attcacgact 
tcagatggac tgctttgggt atgttgcctc acgaagatcc ccagccacgt 7080 tacttaaggc cggtaagtac ggctgcgact ttgaactttg cggcaacttt ggcacaaagt 7140 gcacgtcttt ggaaagatta tgatccgact tttgctgctg actgtttgga aaaggctgaa 7200 atagcatggc aggcggcatt aaagcatcct gatatttatg ctgagtatac tcccggtagc 7260 ggtggtcccg gaggcggacc atacaatgac gactatgtcg gagacgaatt ctactgggca 7320 gcctgcgaac tttatgtaac aacaggaaaa gacgaatata agaattacct gatgaattca 7380 cctcactatc ttgaaatgcc tgcaaagatg ggtgaaaacg gtggagcaaa cggagaagac 7440 aacggattgt ggggatgctt cacctgggga actactcaag gattgggaac tattactctt 7500 gcattagttg aaaacggat t gccgtctgca gacattcaaa aggcaagaaa caatatagct 7560 aaagctgcag acaaatggct tgagaatatt gaagagcaag gt tacagact gccgatcaaa 7620 第11 頁 201122105 caggcggagg atgagagagg cggttatcca tggggttcaa actccttcat tttgaaccag 7680 atgatagtta tgggatacgc atatgacttt acaggcaaca gcaagtatct tgacggaatg 7740 caggatggta tgagctacct gttgggaaga aacggactgg atcagtccta tgtaacaggg 7800 tatggtgagc gtccacttca gaatcctcat gacagattct ggacgccaca gacaagtaag 7860 aaattccctg ctccacctcc gggtataatt gccggtggtc cgaactcccg tttcgaagac 7920 ccgacaataa ctgcagcagt taagaaggat acaccgccgc agaagtgcta cattgaccat 7980 acagactcat ggtcaaccaa cgagataact attaactgga atgctccgtt tgcatgggtt 8040 acagcttatc tcgatgaaat tgacttaata acaccgccag gaggagtaga cccagaagaa 8100 ccggaggtta tttatggtga ctgcaatggc gacggaaaag ttaattcaac tgacgctgtg 8160 gcattgaaga gatatatctt gagatcaggt ataagcatca acactgataa tgctgatgta 8220 aatgctgatg gcagagttaa ctctacagac ttggcaatat tgaagagata tattcttaaa 8280 gagatagatg tattgccaca taaataagcc acgagtgagg ggaagatgga gagaatggta 8340 aaaagcagaa agatttctat tctgttggca gttgcaatgc tggtatccat aatgataccc 8400 acaactgcat tcgcaggtcc tacaaaggca cctacaaaag atgggacatc ttataaggat 8460 cttttccttg aactctacgg aaaaattaaa gatcctaaga acggatattt cagcccagac 8520 gagggaattc cttatcactc aattgaaaca ttgatcgttg aagcgccgga ctacggtcac 8580 gttactacca gtgaggcttt cagctattat gtatggcttg aagcaatgta tggaaatctc 8640 acaggcaact ggtccggagt agaaacagca tggaaagtta tggaggattg gataattcct 8700 
gacagcacag agcagccggg tatgtcttct tacaatccaa acagccctgc cacatatgct 8760 gacgaatatg aggatccttc atactatcct tcagagttga agtttgatac cgtaagagtt 8820 ggatccgacc ctgtacacaa cgaccttgta tccgcatacg gtcctaacat gtacctcatg 8880 cactggttga tggacgttga caactggtac ggttttggta caggaacacg ggcaacattc 8940 ataaacacct tccaaagagg tgaacaggaa tccacatggg aaaccattcc tcatccgtca 9000 atagaagagt tcaaatacgg cggaccgaac ggattccttg atttgtttac aaaggacaga 9060 tcatatgcaa aacagtggcg ttatacaaac gctcctgacg cagaaggccg tgctatacag 9120 gctgtttact gggcaaacaa atgggcaaag gagcagggta aaggttctgc cgttgcttcc 9180 gttgtatcca aggctgcaaa gatgggtgac ttcttgagaa acgacatgtt cgacaaatac 9240 ttcatgaaga tcggtgcaca ggacaagact cctgctaccg gttatgacag tgcacactac 9300 cttatggcct ggtatactgc atggggtggt ggaattggtg catcctgggc atggaagatc 9360 ggatgcagcc acgcacactt cggatatcag aacccattcc agggatgggt aagtgcaaca 9420 cagagcgact ttgctcctaa atcatccaac ggtaagagag actggacaac aagctacaag 9480 agacagcttg aattctatca gtggttgcag tcggctgaag gtggtattgc cggtggagca 9540 accaactcct ggaacggtag atatgagaaa tatcctgctg gtacgtcaac gttctatggt 9600 atggcatatg ttccgcatcc tgtatacgct gacccgggta gtaaccagtg gttcggattc 9660 caggcatggt caatgcagcg tgtaatggag tactacctcg aaacaggaga ttcatcagtt 9720 aagaatttga ttaagaagtg ggtcgactgg gtaatgagcg aaattaagct ctatgacgat 9780 ggaacatttg caattcctag cgacctcgag tggtcaggtc agcctgatac atggaccgga 9840 acatacacag gcaacccgaa cctccatgta agagtaactt cttacggtac tgaccttggt 9900

gttgcaggtt cacttgcaaa tgctcttgca acttatgccg cagctacaga aagatgggaa 9960 ggaaaacttg atacaaaagc aagagacatg gctgctgaac tggttaaccg tgcatggtac 10020 aacttctact gctctgaagg aaaaggtgtt gttactgagg aagcacgtgc tgactacaaa 10080 cgtttctttg agcaggaagt atacgttccg gcaggttgga gcggtactat gccgaacggt 10140 gacaagattc agcctggtat taagttcata gacatccgta caaaatatag acaagatcct 10200 tactacgata tagtatatca ggcatacttg agaggcgaag ctcctgtatt gaattatcac 10260 cgcttctggc atgaagttga ccttgcagtt gcaatgggtg tattggctac atacttcccg 10320 gatatgacat ataaagtacc tggtactcct tctactaaat tatacggcga cgtcaatgat 10380 gacggaaaag ttaactcaac tgacgctgta gcattgaaga gatatgtttt gagatcaggt 10440 ataagcatca acactgacaa tgccgatttg aatgaagacg gcagagttaa ttcaactgac 10500 ttaggaattt tgaagagata tattctcaaa gaaatagata cattgccgta caagaactaa 10560 caccaagtga aagggggaga tatatagtga aaaaactcat tatcactgtt atagtatctg 10620 ctgtcctttt aactgctctt ataccgcagt tgcctgtttt tgcagcagac tataactatg 10680 gagaagcact ccaaaaagca attatgttct atgaatttca aatgtccgga aagcttcccg 10740 acaacatccg taacaactgg cgcggtgatt catgtctcgg agacggaagc gatgtaggtc 10800 ttgacctcac aggaggttgg tttgacgccg gtgaccatgt aaaattcaat ctgcctatgg 10860 cttacacagc cactatgctt gcatgggctg tgtatgagta caaggacgcg ttacaaaaaa 10920 gcggtcaatt gggctattta atggatcaga ttaaatgggc atcggactac ttcataagat 10980 gccatcccga aaaatatgta tattattatc aagtgggtaa cggtgacatg gaccacagat 11040 ggtgggtgcc ggcagaatgt atagatgttc aggcaccaag accgtcttac aaagtagatc 11100 tgtcaaatcc cggttccaca gttactgcgg gtacagctgc cgcacttgct gcaactgcct 11160 tggtattcaa agacactgat ccggcatatg ccgctctgtg catacgtcat gcaaaagaac 11220 tctttgattt tgctgaaacc actatgagtg ataaaggata taccgcagca ttgaatttct 11280 acacatctca cagtggatgg tatgacgagc tttcctgggc aggtgcatgg atttatcttg 11340 cagacggtga cgaaacttat cttgaaaaag ctgaaaagta tgtggataaa tggccaatcg 11400 aaagccagac aacttacatt gcttattcat ggggtcactg ctgggacgac gttcactacg 11460 gagcagcact tcttttggca aagat tacaa acaaatcctt atacaaagaa gcgatagaaa 11520 gacacctgga ctattggaca gttggattta atggtcagag agtcagatat 
acaccaaagg 11580 gtcttgctca cctcactgac tggggtgtat taagacatgc cactactact gcattccttg 11640 catgtgttta ttccgactgg tcagaatgtc caagggaaaa agccaatatt tacatagatt 11700 ttgccaagaa acaggctgac tatgccttag gcagcagcgg cagaagttat gtagtcggat 11760 ttggtgtaaa tcctccgcag catccgcacc acagaactgc ccacagctca tgg丨gtgaca 11820 gtcaaaaagt tcctgaatac cacagacacg ttctttacgg agcactcgta ggcggacctg 11880 atgccagcga tgcttatgtt gatgatatag gaaactatgt aacaaatgag gttgcctgcg 11940 actacaatgc cggttttgta ggattgctcg ccaagatgta tgaaaaatat ggcggaaacc 12000 ccataccaaa cttcatggct atagaagaaa aaacaaatga agaaatttat gttgaagcta 12060 ccgccaattc aaataacggt gtcgaattga aaacatacct ttacaataaa tccggatggc 12120 cggcaagagt ttgcgacaag ctttccttca gatatttcat ggaccttacg gaatatgtat 12180 ccgccggata caatcctaat gatataactg tttctataat ttacagtgca gcaccaactg 12240 caaaaatttc aaaaccaata ctttatgacg catccaaaaa catatattat tgcgaaatcg 12300 atctctccgg taccaagata ttccccggaa gcaactcaga ccaccagaaa gaaacccaat 12360 ttagaataca gcctcctgca ggcgcacctt gggacaacac caacgacttc tcctatcagg 12420 gaatcaagaa aaacggtgaa gttgtaaaag aaatgcctgt ttatgaagac ggaattctca 12480 tattcggtgt agaacccaat ggtaccggtc ctgcaacacc aacgccgaaa ccgtccgtaa 12540 atccttcacc ttcacctacg ccaacatcgg atattcttta cggtgacatc aatctggacg 12600 gaaaaattaa ctcttcagat gttacactgt taaaaagata tattgtgaag tccatagatg 12660 ttttcccaac cgctgatccg gaacggagct taatagcatc agatgtaaac ggagacggaa 12720 gggtaaactc tacagactat tcatacctta aacgttatgt cttgaaaatc ataccaacca 12780 tacccggaaa ttcatgacac gtcgtgtccg aatttattct attggcagag atttttgttt 12840 tttcggcatt ttcacaaaaa catatgaagt aagggtgagg gggtttggat gaaaacaaaa 12900 gaaaaagaag aaattttgca gtataaataa attataatga aaaatttaag agaaatttaa 12960 atgaggaggc aattatcaaa tgaggaagaa aaaaagatta atatcattac tgcttgcggt 13020 ttttatcgcc gttgcatgtc tgccggcggg aattgcaagg gcagataaag cctcgagcat 13080 tgagcttaag tttgaccgca ataagggaga agttggagat atacttattg gtaccgtaag 13140 i;ataaacaat atcaagaatt tcgcaggatt tcaggtaaac attgtatatg atccaaaagt 13200 cttaatggct 
gttgaccctg aaacggggaa agaatttact tcttcaacat ttccgccagg 13260 acgcactgta ctgaaaaaca atgcttacgg cccaatacag attgcggaca atgatccgga 13320 aaaagggata ctgaacttcg cgcttgcata ttcatatatt gcgggataca aagaaacagg 13380 agta^cggag gaaagcggca taattgcgaa aattggattt aaaatactcc agaaaaagag 13440 cactgccgta aaattccagg atacattaag catgcccgga gctatttcgg gaacacagct 13500 gtttgactgg gacggagaag ttattaccgg atatgaggta atacagccgg atgtgctgag 13560 tttgggtgac gagccttatg agacaccggg aacggatatt ccgatatccg acaatccggc 13620 agcaactccg tcatccacgc cgtcagttac tccttcaccg gaagttaaac cgactcagac 13680 gccttcgcct gcagaaaatt ctgcaaaagt ggagcttgaa cctgtgttgg ataatgcaac 13740 aggagaagca aaggcggcaa tagatgaaga aaaattaaac aaggctcttg atgaagcgaa 13800 aaaatcggaa gatgacaaac ttgtggaact taacataaag aaggttgaaa atgccgatgc 13860 ttacatacaa cagcttccgg cgaaattcct gataaaaagt gacgccgaat ataagctgag 13920 aatagctaca gagcagggaa ttatagaagt accggccaac atgctgaata ctgcggatat 13980 ttcaaagctt gtaaaaaatg actccgttgt tgaattcgtc ataagaaaag taaaagtcga 14040 tgaacttggt gcagagctca aagagaagat aggcaacagg ccggtgattg acataagcgt 14100 ggttgttgac ggcaaaaaag ttgaatggag caattacaaa gccaaggtta aaatatcaat 14160 tcct tacaag cctgatgcaa aagagctgga gaaccacgag catattgttg tactccatat 14220 tgatgacgcc ggcaaggcag tttccgtacc cagcggaaaa tatgaacctt ctttgggcgt 14280 cgttacgttt gagacgaatc at t taagcaa gtatgcggtt tcatatgttt acaagacttt 14340 cgc^gatatt ggttcatatg cctgggctaa aaagcagata gaggttttgg cttccaaagg 14400 agtaattaac ggtacatccg ataccacttt tacgccccag gcagacataa caagggcgga 14460

tttcatgata cttcttgtaa aggcactggg attgactgcc gaggttactt ccaattttga 14520 tgatgtgtcc gaaaaagact actattatga atacgtggga attgcaaaag agcttggaat 14580 tacgacagga gtcggaaaca acaagttcaa tccgaaagcc aaaattacaa gacaggatat 14640 gatggtactt acaacaaatg ctctcaggat tgcaggaaaa atatcgagca caggaacccg 14700 cgctgatgtt gaaagatttt cggacaagga ccagatagct tcatatgcgg ttgaaggcgt 14760 tgcaaccttg gtaaaagaag.gtattgtagt gggaagcggc gatattataa atccaagggg 14820 aaatgcttca agagccgaac ttgcagcaat catatacaag atttactaca agtaaaattg 148S0 ttttttgcat aagtcaagtg aaggataaac aggggatacg gcccagggtg aaaagccttt 14940 tgattgggtc gtttccctaa aattatattt ctggtaaaat attcacatgg tgaaaaggag 15000 gaaaaaaaag tgaagaacgt aaaaaaaaga gtaggtgtgg ttttgctgat tcttgcagtg 15060 ttgggggttt atatgttggc aatgccggca aacactgtgt cagcggcagg tgtgcctttt 15120 aacacaaaat acccctatgg tcctacttct attgccgata atcagtcgga agtaactgca 15180 atgctcaaag cagaatggga agactggaag agcaagagaa ttacctcgaa cggtgcagga 15240 ggatacaaga gagtacagcg tgatgcttcc accaattatg atacggtatc cgaaggtatg 15300 ggatacggac ttcttttggc ggtttgcttt aacgaacagg ctttgtttga cgatttatac 15360 cgttacgtaa aatctcattt caatggaaac ggacttatgc actggcacat tgatgccaac 15420 aacaatgtta caagtcatga cggcggcgac ggtgcggcaa ccgatgctga tgaggatatt 15480 gcacttgcgc tcatatttgc ggacaagt ta tggggttctt ccggtgcaat aaactacggg 15540 caggaagcaa ggacattgat aaacaatctt tacaaccatt gtgtagagca tggatcctat 15600 gtattaaagc ccggtgacag atggggaggt tcatcagtaa caaacccgtc atattttgcg 15660 cctgcatggt acaaagtgta tgctcaatat acaggagaca caagatggaa tcaagtggcg 15720 gacaagtgtt accaaattgt tgaagaagtt aagaaataca acaacggaac cggccttgtt 15780 cctgactggt gtactgcaag cggaactccg gcaagcggtc agagttacga ctacaaatat 15840 gatgctacac gttacggctg gagaactgcc gtggactatt catggtttgg tgaccagaga 15900 gcaaaggcaa actgcgatat gctgaccaaa ttctttgcca gagacggggc aaaaggaatc 15960 gttgacggat acacaattca aggttcaaaa at tagcaaca atcacaacgc atcatttata 16020 ggacctgttg cggcagcaag tatgacaggt tacgatttga actttgcaaa ggaactttat 16080 agggagactg ttgctgtaaa ggacagtgaa tattacggat 
attacggaaa cagcttgaga 16140 ctgctcactt tgttgtacat aacaggaaac ttcccgaatc ctttgagtga cctttccggc 16200 caaccgacac caccgtcgaa tccgacacct tcattgcctc ctcaggttgt ttacggtgat 16260 gtaaatggcg acggtaatgt taactccact gatttgacta tgttaaaaag atatctgctg 16320 aagagtgtta ccaatataaa cagagaggct gcagacgtta atcgtgacgg tgcgattaac 16380 tcctctgaca tgactatatt aaagagatat ctgataaaga gcatacccca cctaccttat 16440 tagcactggg tgttttggga ggtagatcta tgctgaagaa aaaactgttg acccttttga 16500 cagtctttgc tctgctgact gtcggtatct gcggaagttt tttgccgtta cccaaagcat 16560 ccgcagcagc tctgatttac gatgattttg aaacaggtct gaacggatgg ggaccaagag 16620 gaccggaaac cgtcgaactt accaccgagg aagcttactc gggaagatac agtttgaagg 16680 tcagcggacg taccagcaca tggaacgggc ccatggttga caaaaccgat gtgttgactt 16740 t^ggcgaaag ctataagttg ggcgtatatg taaaattcgt gggtgattcc tattcaaatg 16800 agcaaagatt cagtttgcag cttcaatata acgacggagc aggagatgta taccaaaata 16860 taaaaaccgc cacggtttac aagggaacat ggactttgct ggaaggacag cttacagttc 16920 ccagccatgc aaaggacgta aaaatatatg tggaaaccga atttaaaaat tctccgagtc 16980 c;?caggactt gatggatttc tatattgacg atttcacagc aacacctgca aatttgcctg 17040 aaattgagaa agatattcca agcttgaaag atgtctttgc cggttatttc aaagtgggtg 17100 gtgccgcaac tgtggcggaa ctggcgccga agcctgcaaa agagcttttc ctcaagcatt 17160 ataacagctt gacttttggt aatgagttaa aaccggaaag tgtacttgac tatgatgcta 17220 caattgctta tatggaggca aacggaggcg accaggttaa tccgcagata accttgagag 17280 cggcaagacc cctgttggag tttgcgaaag aacacaacat acctgtaaga ggacataccc 17340 ttgtatggca cagccagaca ccggactggt tcttcagaga aaattactct caggacgaaa 17400 atgctccctg ggcatccaag gaagtaatgc tgcaaaggtt ggaaaactac ataaagaatt 17460 taatggaagc tttggcgacc gaatatccga cggttaagtt ctatgcatgg gacgttgtga 17520 atgaggctgt tgatcctaat acttcagacg gtatgagaac tccgggttcg aataacaaaa 17580 atcccggaag ctccctgtgg atgcaaaccg ttggaagaga ttttattgtt aaagcttttg 17640 aatatgcaag aaaatatgct cctgcggatt gtaaactctt ctacaatgac tataatgaat 17700 atgaagacag aaaatgtgat tttattattg aaattcttac cgaacttaaa gccaaaggcc 17760 tggttgacgg 
tatgggtatg caatcccact gggttatgga ttatccaagc ataagcatgt 17820 ttgaaaaatc catcagaaga tatgcagcat tgggattgga aattcagctt accgagctgg 17880 atataagaaa tcctgacaac agccagtggg ctttggaacg tcaggctaat cgttataagg 17940 agcttgtaac aaaattggtc gatttgaaaa aagaaggcat aaacattacg gcattggtat 18000 tctg^ggaat aaccgacgcg acaagctggc ttggaggata tccgctcctg tttgacgcgg 18060 aatacaaggc aaaacctgca ttttatgcta tagttaacag cgttccgccg cttccgacag 18120 aaccgccggt tcaggttata cccggtgatg taaacggtga cggtcgtgta aattcatccg 18180 acttgactct tatgaaaaga taccttttaa aatccataag cgacttcccg acaccggaag 18240 gaaaaattgc ggcggattta aacgaagacg gcaaggtaaa ctcgacagat ttgttagcgc 18300 tgaaaaaact cgttctgaga gaactttgac actaggtgca aaaaggagga gaaacatgtc 18360 aagaaaactt ttcagtgtat tacttgttgg cttgatgctt atgacatcgt tgcttgtcac 18420 aataagcagt acatcagcgg catccttgcc aaccatgccg ccttcgggat atgaccaggt 18480 aaggaacggc gttccgagag ggcaggtcgt aaatatttct tatttctcca cggccaccaa 18540 cagtaccagg ccggcaagag tttatttgcc gccgggatat tcaaaggaca aaaaatacag 18600 tgttttgtat ctcttacacg gcataggcgg tagtgaaaac gactggttcg aagggggagg 18660 cagagccaat gttattgccg acaatctgat tgccgaggga aaaatcaagc ccctgataat 18720 tgtaacaccg aatactaacg ccgccggtcc gggaatagcg gacggttatg aaaatttcac 18780 aaaagatttg ctcaacagtc ttattcccta tatcgaatct aactattcag tctacaccga 18840 ccgcgaacat cgggcgattg caggactttc aatgggtgga ggacaatcgt ttaatattgg 18900 attgaccaat ctcgataaat ttgcctatat tggcccgatt tcagcggctc caaacactta 18960 tccaaatgag aggctttttc ctgacggagg aaaagctgca agggagaaat tgaaactgct 19020

ctttattgcc ttgcgttgcc taatgtgtgg gagggatgga agcggaagat aggaggcaga ctttggaaac tgaacttaga aggagattgg tgatttgtac aagcagttcc tcaggcgtta agcggatgta tatactccgc tgttactccg ggaggcaagg tccaacctac gaagtttgat gcttgctttt tcaaaacccg gaaaaatcac ggcaaacgaa aatcggtcag acttcttttc taacatgatt ccactttatc atatgcggaa ggaaaacccg ttgtctggca atggattccg tccgaaaccg tgcggaacca aacaacatta aagcccggat aacactccgg tatgacggta ggaataggtt ggagcaacgt ttaaacggtc aatacatatg ttggtattca acaggtctgg aagaggcatt aataggagcg attattacag acacaaactc ggaataaaaa aacagcattt get ttgcagc gcagaaagaa teatggetta attaccactg tgtatggatg gactaccttg tacaatgatt aaaagtatga aatggaatga ataggegtta gcaactgcat aaccccaatt ggaactttcc gcatacaatg atgacagtct accatgtcta tgtggaattt ttccgacacc ttaattette atattaccag cgtttaaggc cgaatggtac aggagcaaac aaggccctgt gggatttaaa tgctcggtat gcaaagtgga agt tccccgg ctatccccac tcggaacatg tgcaaagaga cgagacaaaa aeggtatgea caaacggtaa ttatgaccca attccggcaa actatgcttt ataatattga aggaaagagg gccccgagta tagtatcctt tccaggtaca gcaatacctt caggatatgg caataaagga gataggtttt ttggcttatt ccttcaaatg cagtccaaag aagtattgag tggtgattat caaggttgca tctcataggc ttgeageatt aaacatagac tggtgacgga atcaccgctt ttctactgac acaaggtgat gatttcggga tgtcaactat attttcaatg cgtttttgat gatgagggga ctggaaccgg ttacaaaggt eggettaaga caggtatgca agacttgggt tgtgccgatt ccttgccagc taccgaaata ggcaaacaac tgtaatgtgg caatccattg agetettatg ggacagagag cagggeggag gcagatgaag ccggctaaca ataataggtg ctggtataca aatgcaaata acactctcgg agcaaagtca tggttcactt aatattaact aegggagagg tattcagtgc gtacagacac aatgetetta ccgttttaca gttgtatgtg ttttcgaaag catacgttga gattcgctgc aaaattgttg ageageatat agagaageag ccaaagtcca gaeggagtag at tgatcaaa gatataegea tataaggaac ggattcacag atttatgaca ggctattga tacatgaata gacacgattt ccggattgac cacgtattga t tccacctga agagtataga cttccaatat taaaatccac ccggaataaa ttggcgttga cgtcggacct ctcttttaag tgaaaagata ccaatccgtc gggattatgc acaattcaga aaaatgaaat gagaccagt t tttggcacaa ttgcggtaat agtgggatgt ggagaaatgt atcccgatgc atgeggtatt gattccaatg atattaagag 
tacctcagtc ttatgaaaat ataaatacac gcaattacaa 19080 19140 19200 19260 19320 19380 19440 19500 19560 19620 19680 19740 19800 19860 19920 19980 20040 20100 20160 20220 20280 20340 20400 20460 20520 20580 20640 20700 20760 20820 20869
<210> 3
<211> 20869
<212> DNA
<213> Artificial sequence
<220>
<223> Recombinant sequence
<400> 3
ggcctgtttg gcctttggga ggaatggtag atgagaaaag tcatcagtat gctcttagtt 60 gtggctatgc tgacgacgat ttttgcggcg atgataccgc agacagtatc ggcggccaca 120 atgacagtcg agatcggcaa agttacagca gccgttggat caaaagtaga aatacctata 180 accctgaaag gagtgccatc caaaggaatg gccaattgcg acttcgtatt gggttatgat 240 ccaaatgtgc tggaagtaac agaagtaaaa ccaggaagca taataaaaga tccggatcct 300 agcaagagct ttgatagcgc aatatatccg gatcgaaaga tgattgtatt tctgtttgca 360 gaagacagtg gaagaggaac gtatgcaata actcaggatg gagtatttgc aacaattgta 420 gccactgtca aatcagctgc agcggcaccg attactttgc ttgaagtagg tgcatttgcg 480 gacaacgatt tagtagaaat aagcacaact tttgtcgcgg gcggagtaaa tcttggtagt 540 tccgtaccga caacacagcc aaatgttccg tcagacggtg tggtagtaga aattggcaaa 600 gttacgggat ctgttggaac tacagttgaa atacctgtat atttcagagg agttccatcc 660 aaaggaatag caaactgcga ctttgtgttc agatatgatc cgaatgtatt ggaaattata 720 gggatagatc ccggagacat aatagttgac ccgaatccta ccaagagctt tgatactgca 780 atatatcctg acagaaagat aatagtattc ctgtttgcgg aagacagcgg aacaggagcg 840 tatgcaataa ctaaagacgg agtatttgca aaaataagag caactgtaaa atcaagtgct 900 ccgggctata ttactttcga cgaagtaggt ggatttgcag ataatgacct ggtagaacag 960 aaggtatcat ttatagacgg tggtgttaac gttggcaatg caacaccgac caagggagca 1020 acaccaacaa atacagctac gccgacaaaa tcagctacgg ctacgcccac caggccatcg 1080 gtaccgacaa acacaccgac aaacacaccg gcaaatacac cggtatcagg caatttgaag 1140 gttgaattct acaacagcaa tccttcagat actactaact caatcaatcc tcagttcaag 1200 gttactaata ccggaagcag tgcaattgat ttgtccaaac tcacattgag atattattat 1260 acagtagacg gacagaaaga tcagaccttc tggtgtgacc atgctgcaat aatcggcagt 1320 aacgiicagct acaacggaat tacttcaaat gtaaaaggaa catttgtaaa aatgagttcc 1380 tcaacaaata acgcagacac ctaccttgaa ataagcttta caggcggaac tcttgaaccg 1440 
j^gtgcacatg ttcagataca aggtagattt gcaaagaatg actggagtaa ctatacacag 1500 tcaaatgact actcattcaa gtctgcttca cagtttgttg aatgggatca ggtaacagca 1560 tacttgaacg gtgttcttgt atggggtaaa gaacccggtg gcagtgtagt accatcaaca 1620 cagcctgtaa caacaccacc tgcaacaaca aaaccacctg caacaacaaa accacctgca 1680 acaacaatac cgccgtcaga tgatccgaat gcaataaaga ttaaggtgga cacagtaaat 1740 gcaaaaccgg gagacacagt aaatatacct gtaagattca gtggtatacc atccaaggga 1800 atagcaaact gtgactttgt atacagctat gacccgaatg tacttgagat aatagagata 1860 aaaccgggag aattgatagt tgacccgaat cctgacaaga gctttgatac tgcagtatat 1920 cctgacagaa agataatagt attcctgttt gcagaagaca gcggaacagg agcgtatgca 1980 ataactaaag acggagtatt tgctacgata gtagcgaaag taaaatccgg agcacctaac 2040 ggactcagtg taatcaaatt tgtagaagta ggcggatttg cgaacaatga ccttgtagaa 2100 cagaggacac agttctttga cggtggagta aatgttggag atacaacagt acctacaaca 2160 cctacaacac ctgtaacaac accgacagat gattcgaatg cagtaaggat taaggtggac 2220 acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 2280 tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 2340 atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 2400 第18頁 201122105ctttattgcc ttgcgttgcc taatgtgtgg gagggatgga agcggaagat aggaggcaga ctttggaaac tgaacttaga aggagattgg tgatttgtac aagcagttcc tcaggcgtta agcggatgta tatactccgc tgttactccg ggaggcaagg tccaacctac gaagtttgat gcttgctttt tcaaaacccg gaaaaatcac ggcaaacgaa aatcggtcag acttcttttc taacatgatt ccactttatc atatgcggaa ggaaaacccg ttgtctggca atggattccg tccgaaaccg tgcggaacca aacaacatta aagcccggat aacactccgg tatgacggta ggaataggtt ggagcaacgt ttaaacggtc aatacatatg ttggtattca acaggtctgg aagaggcatt aataggagcg attattacag acacaaactc ggaataaaaa aacagcattt get ttgcagc gcagaaagaa teatggetta attaccactg tgtatggatg gactaccttg tacaatgatt aaaagtatga aatggaatga ataggegtta gcaactgcat aaccccaatt ggaactttcc gcatacaatg atgacagtct accatgtcta tgtggaattt ttccgacacc ttaattette atattaccag cgtttaaggc cgaatggtac aggagcaaac aaggccctgt gggatttaaa tgctcggtat gcaaagtgga agt tccccgg ctatccccac 
tcggaacatg tgcaaagaga cgagacaaaa aeggtatgea caaacggtaa ttatgaccca attccggcaa actatgcttt ataatattga aggaaagagg gccccgagta tagtatcctt tccaggtaca gcaatacc tt caggatatgg caataaagga gataggtttt ttggcttatt ccttcaaatg cagtccaaag aagtattgag tggtgattat caaggttgca tctcataggc ttgeageatt aaacatagac tggtgacgga atcaccgctt ttctactgac acaaggtgat gatttcggga tgtcaactat attttcaatg cgtttttgat gatgagggga ctggaaccgg ttacaaaggt eggettaaga caggtatgca agacttgggt tgtgccgatt ccttgccagc taccgaaata ggcaaacaac tgtaatgtgg caatccattg agetettatg ggacagagag cagggeggag gcagatgaag ccggctaaca ataataggtg ctggtataca aatgcaaata acactctcgg agcaaagtca tggttcactt aatattaact aegggagagg tattcagtgc gtacagacac aatgetetta ccgttttaca gttgtatgtg ttttcgaaag catacgttga gattcgctgc aaaattgttg ageageatat agagaageag ccaaagtcca gaeggagtag at tgatcaaa gatataegea tataaggaac ggattcacag atttatgaca ggctattga tacatgaata gacacgattt ccggattgac cacgtattga t tccacctga agagtataga cttccaatat taaaatccac ccggaataaa ttggcgttga cgtcggacct ctcttttaag tgaaaagata ccaatccgtc gggattatgc acaattcaga gagaccagt atcccgatgc atgeggtatt gattccaatg aaaatgaaat t tttggcacaa ttgcggtaat agtgggatgt ggagaaatgt atattaagag tacct Cagtc ttatgaaaat ataaatacac gcaattacaa 19080 19140 19200 19260 19320 19380 19440 19500 19560 19620 19680 19740 19800 19860 19920 19980 20040 20100 20160 20220 20280 20340 20400 20460 20520 20580 20640 20700 20760 20820 20869 <210> 3 <211> 20869 <212> DNA caaaggaatg gccaattgcg 400> 3 ggcctgtttg gcctttggga ggaatggtag atgagaaaag tcatcagtat getettagtt gtggctatgc tgaegaegat ttttgcggcg atgataccgc agacagtatc ggcggccaca page 17 60 120 201122105 atgacagtcg agatcggcaa agttacagca gccgttggat caaaagtaga aatacctata 180 accctgaaag gagtgccatc; < 213 > artificial sequence < 220 > < 223 > recombination sequence & lt acttcgtatt gggttatgat 240 ccaaatgtgc tggaagtaac agaagtaaaa ccaggaagca taataaaaga tccggatcct 300 agcaagagct ttgatagcgc aatatatccg gatcgaaaga tgattgtatt tctgtttgca 360 gaagacagtg gaagaggaac gtatgcaata actcaggatg gagtatttgc aacaattgta 420 
gccactgtca aatcagctgc agcggcaccg attactttgc ttgaagtagg tgcatttgcg 480 gacaacgatt tagtagaaat aagcacaact tttgtcgcgg gcggagtaaa tcttggtagt 540 tccgta ccga caacacagcc aaatgttccg tcagacggtg tggtagtaga aattggcaaa 600 gttacgggat ctgttggaac tacagttgaa atacctgtat atttcagagg agttccatcc 660 aaaggaatag caaactgcga ctttgtgttc agatatgatc cgaatgtatt ggaaattata 720 gggatagatc ccggagacat aatagttgac ccgaatccta ccaagagctt tgatactgca 780 atatatcctg acagaaagat aatagtattc ctgtttgcgg aagacagcgg aacaggagcg 840 tatgcaataa ctaaagacgg agtatttgca aaaataagag caactgtaaa atcaagtgct 900 ccgggctata ttactttcga cgaagtaggt ggatttgcag ataatgacct ggtagaacag 960 aaggtatcat ttatagacgg tggtgttaac gttggcaatg caacaccgac caagggagca 1020 acaccaacaa atacagctac gccgacaaaa tcagctacgg ctacgcccac caggccatcg 1080 gtaccgacaa acacaccgac aaacacaccg gcaaatacac cggtatcagg caatttgaag 1140 gttgaattct acaacagcaa tccttcagat actactaact caatcaatcc tcagttcaag 1200 gttactaata ccggaagcag tgcaattgat ttgtccaaac tcacattgag atattattat 1260 acagtagacg gacagaaaga tcagaccttc tggtgtgacc atgctgcaat aatcggcagt 1320 aacgiicagct acaacggaat tacttcaaat gtaaaaggaa catttgtaaa aatgagttcc 1380 tcaacaaata acgcaga cac ctaccttgaa ataagcttta caggcggaac tcttgaaccg 1440 j ^ gtgcacatg ttcagataca aggtagattt gcaaagaatg actggagtaa ctatacacag 1500 tcaaatgact actcattcaa gtctgcttca cagtttgttg aatgggatca ggtaacagca 1560 tacttgaacg gtgttcttgt atggggtaaa gaacccggtg gcagtgtagt accatcaaca 1620 cagcctgtaa caacaccacc tgcaacaaca aaaccacctg caacaacaaa accacctgca 1680 acaacaatac cgccgtcaga tgatccgaat gcaataaaga ttaaggtgga cacagtaaat 1740 gcaaaaccgg gagacacagt aaatatacct gtaagattca gtggtatacc atccaaggga 1800 atagcaaact gtgactttgt atacagctat gacccgaatg tacttgagat aatagagata 1860 aaaccgggag aattgatagt tgacccgaat cctgacaaga gctttgatac tgcagtatat 1920 cctgacagaa agataatagt attcctgttt gcagaagaca gcggaacagg agcgtatgca 1980 ataactaaag acggagtatt tgctacgata gtagcgaaag taaaatccgg agcacctaac 2040 ggactcagtg taatcaaatt tgtagaagta ggcggatttg cgaacaatga ccttgtagaa 2100 cagaggacac 
agttctttga cggtggagta aatgttggag atacaacagt acctacaaca 2160 cctacaacac ctgtaacaac accgacagat gattcgaatg cagtaaggat taaggtggac 2220 acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 2280 tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 2340 atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 2400

gcagtatatc ctgacagaaa gataatagta ttcctgtttg cggaagacag cggaacagga 2460 gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaatccgga 2520 gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 2580 cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagaa 2640 cctgcaacac ctacaacacc tgtaacaaca ccgacaacaa cagatgatct ggatgcagta 2700 aggattaaag tggacacagt aaatgcaaaa ccgggagaca cagtaagaat acctgtaaga 2760 ttcagcggta taccatccaa gggaatagca aactgtgact ttgtatacag ctatgacccg 2820 aatgtacttg agataataga gatagaaccg ggagacataa tagttgaccc gaatcctgac 2880 aagagctttg atactgcagt atatcctgac agaaagataa tagtattcct gtttgcggaa 2940 gacagcggaa caggagcgta tgcaataact aaagacggag tatttgctac gatagtagcg 3000 aaagtaaaat ccggagcacc taacggactc agtgtaatca aatttgtaga agtaggcgga 3060 tttgcgaaca atgaccttgt agaacagaag acacagttct ttgacggtgg agtaaatgtt 3120 ggagatacaa cagaacctgc aacacctaca acacctgtaa caacaccgac aacaacagat 3180 gatctggatg cagtaaggat taaagtggac acagtaaatg caaaaccggg agacacagta 3240 agaatacctg taagattcag cggtatacca tccaagggaa tagcaaactg tgactttgta 3300 tacagctatg acccgaatgt acttgagata atagagatag aaccgggaga cataatagtt 3360 gacccgaatc ctgacaagag ctttgatact gcagtatatc ctgacagaaa gataatagta 3420 ttcctgtttg cagaagacag cggaacagga gcgtatgcaa taactaaaga cggagtattt 3480 gc.tacgatag tagcgaaagt aaaagaagga gcacctaacg gactcagtgt aatcaaattt 3540 gtagaagtag gcggatttgc gaacaatgac cttgtagaac agaagacaca gttctttgac 3600 ggtggagtaa atgttggaga tacaacagaa cctgcaacac ctacaacacc tgtaacaaca 3660 ccgacaacaa cagatgatct ggatgcagta aggattaaag tggacacagt aaatgcaaaa 3720 ccgggagaca cagtaagaat acctgtaaga ttcagcggta taccatccaa gggaatagca 3780 aactgtgact ttgtatacag ctatgacccg aatgtacttg agataataga gatagaaccg 3840 ggagaattga tagttgaccc gaatcctacc aagagctttg atactgcagt atatcctgac 3900 agaaagatga tagtattcct gtttgcggaa gacagcggaa caggagcgta tgcaataact 3960 gaagatggag tatttgctac gatagtagcg aaagtaaaat ccggagcacc taacggactc 4020 agtgtaatca aatttgtaga agtaggcgga tttgcgaaca atgaccttgt agaacagaag 4080 acacagttct 
ttgacggtgg agtaaatgtt ggagatacaa cagaacctgc aacacctaca 4140 acacctgtaa caacaccgac aacaacagat gatctggatg cagtaaggat taaagtggac 4200 acagtaaatg caaaaccggg agacacagta agaatacctg taagattcag cggtatacca 4260 tccaagggaa tagcaaactg tgactttgta tacagctatg acccgaatgt acttgagata 4320 atagagatag aaccgggaga cataatagtt gacccgaatc ctgacaagag ctttgatact 4380 gcagtatatc ctgacagaaa gataatagta ttcctgtttg cagaagacag cggaacggga 4440 gcgtatgcaa taactaaaga cggagtattt gctacgatag tagcgaaagt aaaagaagga 4500 gcacctaacg gactcagtgt aatcaaattt gtagaagtag gcggatttgc gaacaatgac 4560 cttgtagaac agaagacaca gttctttgac ggtggagtaa atgttggaga tacaacagta 4620 cctacaacat cgccgacaac aacaccgcca gagccgacga taactccgaa caagt tgaca 4680 第19 頁 201122105 cttaagatag gcagagcaga aggaagacct ggagacacgg tggaaatacc ggttaacttg 4740 tatggagtac ctcaaaaagg aatagcaagc ggtgacttcg tagtaagcta tgacccgaat 4800 gtacttgaga taatagagat agaaccggga gaattgatag ttgacccgaa tcctaccaag 4860 agctttgata ctgcagtata tcctgacaga aagatgatag tattcctgtt tgcggaagac 4920 agcggaacag gagcgtatgc aataactgaa gatggagtat ttgctacgat agtagcgaaa 4980 gtaaaagaag gagcacctga aggattcagt gcaatagaaa tttctgagtt tggtgcattt 5040 gcagataatg atctggtaga agtggaaact gaccttatca atggtggagt acttgtaact 5100 aataaacctg taatagaagg atataaagta tccggataca ttttgccaga cttctccttc 5160 gacgctactg ttgcaccact tgtaaaggcc ggattcaaag ttgaaatagt aggaacagaa 5220 ttgtatgcag taacagatgc aaacggatac tttgaaataa ccggagtacc tgcaaatgca 5280 agcggatata cattgaagat ttcaagagca acttacttgg acagagtaat tgcaaatgtt 5340 gtagtaacgg gagatacttc agtttcaact tcacaggctc caataatgat gtgggtagga 5400 gacatagtga aagacaattc tatcaacctg ttggacgttg cagaagttat ccgttgcttc 5460 aacgctacta aaggaagcgc aaactacgta gaagaacttg acattaatag aaacggcgca 5520 attaacatgc aagacataat gattgttcat aagcactttg gagctacatc aagtgattac 5580 ^acgcacagt aacactgagt gcaaaaagga ggagaaacat gtcaagaaaa cttttcagtg 5640 tattacttgt tggcttgatg cttatgacat cgttgcttgt cacaataagc agtacatcag 5700 cggcatcctt gccaaccatg ccgccttcgg gatatgacca ggtaaggaac ggcgttccga 5760 
gagggcaggt cgtaaatatt tcttatttct ccacggccac caacagtacc aggccggcaa 5820 iiagtttattt gccgccggga tattcaaagg acaaaaaata cagtgttttg tatctcttac 5880 acggcatagg cgg^agtgaa aacgactggt tcgaaggggg aggcagagcc aatgttattg 5940 ccgacaatct gattgccgag ggaaaaatca agcccctgat aattgtaaca ccgaatacta 6000 acgccgccgg tccgggaata gcggacggtt atgaaaattt cacaaaagat ttgctcaaca 6060 gtcttattcc ctatatcgaa tctaactatt cagtctacac cgaccgcgaa catcgggcga 6120 ttgcaggact ttcaatgggt ggaggacaat cgtttaatat tggattgacc aatctcgata 6180 aatttgccta tattggcccg atttcagcgg ctccaaacac ttatccaaat gagaggcttt 6240 ttcctgacgg aggaaaagct gcaagggaga aattgaaact gctctttatt gcctgcggaa 6300 ccaatgacag tctgataggt tttggacaga gagtacatga atattgcgtt gccaacaaca 6360 ttaaccatgt ctattggctt attcagggcg gaggacacga ttttaatgtg tggaagcccg 6420 gattgtggaa tttccttcaa atggcagatg aagccggatt gacgagggat ggaaacactc 6480 cggttccgac acccagtcca aagccggcta acacacgtat tgaagcggaa gattatgacg 6540 gtattaattc ttcaagtatt gagataatag gtgttccacc tgaaggaggc agaggaatag 6600 gttatattac cagtggtgat tatctggtat acaagagtat agactttgga aacggagcaa 6660 cgtcgtttaa ggccaaggtt gcaaatgcaa atacttccaa tattgaactt agattaaacg 6720 gtccgaatgg tactctcata ggcacactct cggtaaaatc cacaggagat tggaatacat 6780 atgaggagca aacttgcagc attagcaaag tcaccggaat aaatgatttg tacttggtat 6840 tcaaaggccc tgtaaacata gactggttca cttttggcgt tgaaagcagt tccacaggtc 6900 tggKggattt aaatggtgac ggaaatatta actcgtcgga ccttcaggcg ttaaagaggc 6960

atttgctcgg tatatcaccg cttacgggag aggctctttt aagagcggat gtaaatagga 7020 gcggcaaagt ggattctact gactattcag tgctgaaaag atatatactc cgcattatta 7080 cagagttccc cggacaaggt gatgtacaga cacccaatcc gtctgttact ccgacacaaa 7140 ctcctatccc cacgatttcg ggaaatgctc ttagggatta tgcggaggca aggggaataa 7200 aaatcggaac atgtgtcaac tatccgtttt acaacaattc agatccaacc tacaacagca 7260 ttttgcaaag agaattttca atggttgtat gtgaaaatga aatgaagttt gatgctttgc 7320 agccgagaca aaacgttttt gatttttcga aaggagacca gttgcttgct tttgcagaaa 7380 gaaacggtat gcagatgagg ggacatacgt tgatttggca caatcaaaac ccgtcatggc 7440 ttacaaacgg taactggaac cgggattcgc tgcttgcggt aatgaaaaat cacattacca 7500 ctgttatgac ccattacaaa ggtaaaattg ttgagtggga tgtggcaaac gaatgtatgg 7560 atgattccgg caacggctta agaagcagca tatggagaaa tgtaatcggt caggactacc 7620 ttgactatgc tttcaggtat gcaagagaag cagatcccga tgcacttctt ttctacaatg 7680 attataatat tgaagacttg ggtccaaagt ccaatgcggt atttaacatg attaaaagta 7740 tgaaggaaag aggtgtgccg attgacggag taggattcca atgccacttt atcaatggaa 7800 tgagccccga gtaccttgcc agcattgatc aaaatattaa gagatatgcg gaaataggcg 7860 ttatagtatc ctttaccgaa atagatatac gcatacctca gtcggaaaac ccggcaactg 7920 cattccaggt acaggcaaac aactataagg aacttatgaa aatttgtctg gcaaacccca 7980 attgcaatac ctttgtaatg tggggattca cagataaata cacatggatt ccgggaactt 8040 tcccaggata tggcaatcca ttgatttatg acagcaatta caatccgaaa ccggcataca 8100 atgcaataaa ggaagctctt atgggctatt gacaccgagt gttttgggag gtagatctat 8160 gctgaagaaa aaactgttga cccttttgac agtctttgct ctgctgactg tcggtatctg 8220 cggaagtttt ttgccgttac ccaaagcatc cgcagcagct ctgatttacg atgattttga 8280 aacaggtctg aacggatggg gaccaagagg accggaaacc gtcgaactta ccaccgagga 8340 agcttactcg ggaagataca gtttgaaggt cagcggacgt accagcacat ggaacgggcc 8400 catggttgac aaaaccgatg tgttgacttt gggcgaaagc tataagttgg gcgtatatgt 8460 aaaattcgtg ggtgattcct attcaaatga gcaaagattc agtttgcagc ttcaatataa 8520 cgacggagca ggagatgtat accaaaatat aaaaaccgcc acggtttaca agggaacatg 8580 gactttgctg gaaggacagc ttacagttcc cagccatgca aaggacgtaa aaatatatgt 8640 ggaaaccgaa 
tttaaaaatt ctccgagtcc gcaggacttg atggatttct atattgacga 8700 tttcacagca acacctgcaa atttgcctga aattgagaaa gatattccaa gcttgaaaga 8760 tgtctttgcc ggttatttca aagtgggtgg tgccgcaact gtggcggaac tggcgccgaa 8820 gcctgcaaaa gagcttttcc tcaagcatta taacagcttg acttttggta atgagttaaa 8880 accggaaagt gtacttgact atgatgctac aattgcttat atggaggcaa acggaggcga 8940 ccaggttaat ccgcagataa ccttgagagc ggcaagaccc ctgttggagt ttgcgaaaga 9000 acacaacata cctgtaagag gacataccct tgtatggcac agccagacac cggactggtt 9060 cttcagagaa aattactctc aggacgaaaa tgctccctgg gcatccaagg aagtaatgct 9120 gcaaaggttg gaaaactaca taaagaattt aatggaagct ttggcgaccg aatatccgac 9180 ggttaagttc tatgcatggg acgttgtgaa tgaggctgtt gatcctaata cttcagacgg 9240 tatgagaact ccgggttcga ataacaaaaa tcccggaagc tccctgtgga tgcaaaccgt 9300

Uigaagagat tttattgtta aagcttttga atatgcaaga aaatatgctc ctgcggattg 9360 taaactcttc tacaatgact ataatgaata tgaagacaga aaatgtgatt ttattattga 9420 aattcttacc gaacttaaag ccaaaggcct ggttgacggt atgggtatgc aatcccactg 9480 g(«ttatggat tatccaagca taagcatgtt tgaaaaatcc atcagaagat atgcagcatt 9540 gilgattggaa attcagctta ccgagctgga tataagaaat cctgacaaca gccagtgggc 9600 tttggaacgt caggctaatc gttataagga gcttgtaaca aaattggtcg atttgaaaaa 9660 agaaggcata aacattacgg cattggtatt ctggggaata accgacgcga caagctggct 9720 tggaggatat ccgctcctgt ttgacgcgga atacaaggca aaacctgcat tttatgctat 9780 agttaacagc gttccgccgc ttccgacaga accgccggtt caggttatac ccggtgatgt 9840 aaacggtgac ggtcgtgtaa attcatccga cttgactctt atgaaaagat accttttaaa 9900 atccataagc gacttcccga caccggaagg aaaaattgcg gcggatttaa acgaagacgg 9960 caaggtaaac tcgacagatt tgttagcgct gaaaaaactc gttctgagag aactttgaca 10020 cccagtgaaa aggaggaaaa aaaagtgaag aacgtaaaaa aaagagtagg tgtggttttg 10080 ctgattcttg cagtgttggg ggtttatatg ttggcaatgc cggcaaacac tgtgtcagcg 10140 gcaggtgtgc cttttaacac aaaatacccc tatggtccta cttctattgc cgataatcag 10200 tcggaagtaa ctgcaatgct caaagcagaa tgggaagact ggaagagcaa gagaattacc 10260 tcgaacggtg caggaggata caagagagta cagcgtgatg cttccaccaa ttatgatacg 10320 atatccgaag gtatgggata cggacttctt ttggcggttt gctttaacga acaggctttg 10380 tttgacgatt tataccgtta cgtaaaatct catttcaatg gaaacggact tatgcactgg 10440 cacattgatg ccaacaacaa tgttacaagt catgacggcg gcgacggtgc ggcaaccgat 10500 gctgatgagg atattgcact tgcgctcata tttgcggaca agttatgggg ttcttccggt 10560 gcaataaact acgggcagga agcaaggaca ttgataaaca atctttacaa ccattgtgta 10620 gagcatggat cctatgtatt aaagcccggt gacagatggg gaggttcatc agtaacaaac 10680 ccgtcatatt ttgcgcctgc atggtacaaa gtgtatgctc aatatacagg agacacaaga 10740 tggaatcaag tggcggacaa gtgttaccaa attgttgaag aagttaagaa atacaacaac 10800 ggaaccggcc ttgttcctga ctggtgtact gcaagcggaa ctccggcaag cggtcagagt 10860 tacgactaca aatatgatgc tacacgttac ggctggagaa ctgccgtgga ctattcatgg 10920 tttggtgacc agagagcaaa ggcaaactgc gatatgctga ccaaattctt 
tgccagagac 10980 ggggcaaaag gaatcgttga cggatacaca attcaaggtt caaaaattag caacaatcac 11040 aacgcatcat ttataggacc tgttgcggca gcaagtatga caggttacga tttgaacttt 11100 gcaaaggaac tttataggga gactgttgct gtaaaggaca gtgaatatta cggatattac 11160 ggaaacagct tgagactgct cactttgttg tacataacag gaaacttccc gaatcctttg 11220 agtgaccttt ccggccaacc gacaccaccg tcgaatccga caccttcatt gcctcctcag 11280 gttgtttacg gtgatgtaaa tggcgacggt aatgttaact ccactgattt gactatgtta 11340 aaaagatatc tgctgaagag tgttaccaat ataaacagag aggctgcaga cgttaatcgt 11400 gacggtgcga ttaactcctc tgacatgact atattaaaga gatatctgat aaagagcata 11460 ccccacctac cttattagca cgtcgtgtcc gaatttattc tattggcaga gatttttgtt 11520

ttttcggcat tttcacaaaa acatatgaag taagggtgag ggggtttgga tgaaaacaaa 11580 agaaaaagaa gaaattttgc agtataaata aattataatg aaaaatttaa gagaaattta 11640 aatgaggagg caattatcaa atgaggaaga aaaaaagatt aatatcatta ctgettgegg 11700 tttttatcgc cgttgcatgt ctgccggcgg gaattgcaag ggcagataaa gcctcgagca 11760 ttgagcttaa gtttgaccgc aataagggag aagttggaga tataettatt ggtaccgtaa 11820 ggataaacaa tatcaagaat ttcgcaggat ttcaggtaaa cattgtatat gatccaaaag 11880 tcttaatggc tgttgaccct gaaacgggga aagaatttac ttcttcaaca tttccgccag 11940 gacgcactgt actgaaaaac aatgcttacg gcccaataca gattgcggac aatgatccgg 12000 aaaaagggat actgaacttc gcgcttgcat attcatatat tgcgggatac aaagaaacag 12060 gagtagcgga ggaaagcggc ataattgcga aaattggatt taaaatactc cagaaaaaga 12120 gcactgccgt aaaattccag gatacattaa gcatgcccgg agetattteg ggaacacagc 12180 tgtttgactg ggacggagaa gttattaccg gatatgaggt aatacagccg gatgtgctga 12240 gtttgggtga cgagccttat gagacaccgg gaaeggatat tccgatatcc gacaatccgg 12300 cagcaactcc gtcatccacg ccgtcagtta ctccttcacc ggaagttaaa ccgactcaga 12360 cgccttcgcc tgcagaaaat tctgcaaaag tggagcttga acctgtgttg gataatgcaa 12420 caggagaagc aaaggcggca atagatgaag aaaaattaaa caaggctctt gatgaagega 12480 aaaaatcgga agatgacaaa cttgtggaac ttaacataaa gaaggttgaa aatgccgatg 12540 cttacataca acagcttccg gcgaaattcc tgataaaaag tgacgccgaa tataagctga 12600 gaatagctac agagcaggga attatagaag taccggccaa catgctgaat actgcggata 12660 tttcaaagct tgtaaaaaat gactccgttg ttgaattcgt cataagaaaa gtaaaagtcg 12720 atgaacttgg tgcagagctc aaagagaaga taggcaacag gccggtgatt gacataagcg 12780 tggttgttga cggcaaaaaa gttgaatgga gcaattacaa agccaaggtt aaaatatcaa 12840 ttccttacaa gcctgatgca aaagagctgg agaaccacga gcatattgtt gtactccata 12900 ttgatgacgc cggcaaggca gtttccgtac ccagcggaaa atatgaacct tctttgggcg 12960 tcgttacgtt tgagacgaat catttaagca agtatgcggt ttcatatgtt tacaagactt 13020 tcgcggatat tggttcatat gcctgggcta aaaagcagat agaggttttg gcttccaaag 13080 gagtaattaa cggtacatcc gataccactt ttacgcccca ggcagacata acaagggcgg 13140 atttcatgat acttcttgta aaggcactgg gattgactgc cgaggttact 
tccaattttg 13200 atgatgtgtc cgaaaaagac tactattatg aatacgtggg aattgcaaaa gagcttggaa 13260 ttacgacagg agtcggaaac aacaagt tea atccgaaagc caaaattaca agacaggata 13320 tgatggtact tacaacaaat gctctcagga ttgcaggaaa aatategage acaggaaccc 13380 gcgctgatgt tgaaagattt tcggacaagg accagatagc ttcatatgcg gttgaaggcg 13440 ttgcaacctt ggtaaaagaa ggtattgtag tgggaagcgg egatattata aatccaaggg 13500 gaaatgcttc aagagccgaa cttgcagcaa tcatatacaa gatttactac aagtaaaatt 13560 gttttttgca taagtcaagt gaaggataaa caggggatac ggcccagggt gaaaagcctt 13620 ttgattgggt cgtttcccta aaattatatt tctggtaaaa tattcacatg eggetaaegg 13680 gaggtagatt tatgaatttc agaagaatgt tgtgcgcagc catagtgttg acaattgtac 13740 tgtccattat gctgccgtca actgtttttg ctttggaaga caagtctcca aagttgccgg 13800 第23 頁 201122105 attataaaaa cgaccttttg tatgaaagaa cattcgacga aggtctttgc tttccgtggc 13860 atacttgcga agacagtgga ggaaaatgtg atttcgctgt tgttgatgtt ccaggagagc 13920 ctgggaacaa agctttccgc ttgacagtaa ttgacaaagg acaaaacaag tggagtgtcc 13980 agatgagaca cagaggtatt accctcgagc aaggacatac atacacggta aggtttacga 14040 tttggtctga caaatcctgt agggtttatg ctaaaattgg tcagatgggt gaaccctata 14100 ctgaatattg gaacaataac tggaatccat tcaaccttac accaggacag aagcttacag 14160 ttgaacagaa ttttacaatg aactatccta ctgatgacac atgcgagttc acattccatt 14220 tgggtggaga acttgctgca ggtacacctt actatgttta ccttgatgat gtatctctct 14280 acgatcctag gtttgtaaag cctgttgaat atgtacttcc gcagccggat gtacgtgtta 14340 accaggtagg atacttaccg tttgcaaaga agtatgctac tgttgtatct tcttcaacca 14400 gcccgcttaa gtggcagctt ctcaattcgg caaatcaggt tgttttggaa ggtaatacaa 14460 taccaaaagg acttgacaaa gattcacagg attatgtaca ttggatagat ttctccaact 14520 ttaagactga aggaaaaggt tattacttca agcttccgac tgtaaacagc gatacaaatt 14580 acagccatcc tttcgatatc agtgctgata tttactccaa gatgaaattt gatgcattgg 14640 cattcttcta tcacaagaga agcggtattc ctattgaaat gccgtatgca ggaggagaac 14700 agtggaccag acctgcagga catattggtg ttgctccgaa caaaggagac acaaatgttc 14760 ctacatggcc tcaggatgat gaatatgcag gaagacctca aaaatattat acaaaagatg 14820 taaccggtgg atggtatgat 
gccggtgacc acggtaaata tgttgtaaac ggcggtatag 14880 ctgtttggac attgatgaac atgtatgaaa gggcaaaaat cagaggcata gctaatcaag 14940 gtgcttataa agacggtgga atgaacatac cggagagaaa taacggttat ccggacattc 15000 ttgatgaagc aagatgggaa attgagttct ttaagaaaat gcaggtaact gaaaaagagg 15060 atccttccat agccggaatg gtacaccaca aaattcacga cttcagatgg actgctttgg 15120 iitatgttgcc tcacgaagat ccccagccac gttacttaag gccggtaagt acggctgcga 15180 ctttgaactt tgcggcaact ttggcacaaa gtgcacgtct ttggaaagat tatgatccga 15240 cttttgctgc tgactgtttg gaaaaggctg aaatagcatg gcaggcggca ttaaagcatc 15300 ctgatattta tgctgagtat actcccggta gcggtggtcc cggaggcgga ccatacaatg 15360 acgactatgt cggagacgaa ttctactggg cagcctgcga actttatgta acaacaggaa 15420 aagacgaata taagaattac ctgatgaatt cacctcacta tcttgaaatg cctgcaaaga 15480 tgggtgaaaa cggtggagca aacggagaag acaacggatt gtggggatgc ttcacctggg 15540 gaactactca aggattggga actattactc ttgcattagt tgaaaacgga ttgccgtctg 15600 cagacattca aaaggcaaga aacaatatag ctaaagctgc agacaaatgg cttgagaata 15660 ttgaagagca aggttacaga ctgccgatca aacaggcgga ggatgagaga ggcggttatc 15720 catggggttc aaactccttc attttgaacc agatgatagt tatgggatac gcatatgact 15780 ttacaggcaa cagcaagtat cttgacggaa tgcaggatgg tatgagctac ctgttgggaa 15840 gaaacggact ggatcagtcc tatgtaacag ggtatggtga gcgtccactt cagaatcctc 15900 atgacagatt ctggacgcca cagacaagta agaaattccc tgctccacct ccgggtataa 15960 ttgccggtgg tccgaactcc cgtttcgaag acccgacaat aactgcagca gttaagaagg 16020 atacaccgcc gcagaagtgc tacattgacc atacagactc atggtcaacc aacgagataa 16080

ctattaactg gaatgctccg tttgcatggg t tacagct ta tctcgatgaa attgacttaa 16140 taacaccgcc aggaggagta gacccagaag aaccggaggt tatttatggt gactgcaatg 16200 gcgacggaaa agttaattca actgacgctg tggcattgaa gagatatatc ttgagatcag 16260 gtataagcat caacactgat aatgctgatg taaatgctga tggcagagtt aactctacag 16320 acttggcaat attgaagaga tatattctta aagagataga tgtattgcca cataaataag 16380 ccatgggtga aagggggaga tatatagtga aaaaactcat tatcactgtt atagtatctg 16440 ctgtcctttt aactgctctt ataccgcagt tgcctgtttt tgcagcagac tataactatg 16500 gagaagcact ccaaaaagca attatgttct atgaatttca aatgtccgga aagcttcccg 16560 acaacatccg taacaactgg cgcggtgatt catgtctcgg agacggaagc gatgtaggtc 16620 ttgacctcac aggaggttgg tttgacgccg gtgaccatgt aaaattcaat ctgcctatgg 16680 cttacacagc cactatgctt gcatgggctg tgtatgagta caaggacgcg ttacaaaaaa 16740 gcggtcaatt gggctattta atggatcaga ttaaatgggc atcggactac ttcataagat 16800 gccatcccga aaaatatgta tattattatc aagtgggtaa cggtgacatg gaccacagat 16860 ggtgggtgcc ggcagaatgt atagatgttc aggcaccaag accgtcttac aaagtagatc 16920 tgtcaaatcc cggttccaca gttactgcgg gtacagctgc cgcacttgct gcaactgcct 16980 tggtattcaa agacactgat ccggcatatg ccgctctgtg catacgtcat gcaaaagaac 17040 tctttgattt tgctgaaacc actatgagtg ataaaggata taccgcagca ttgaatttct 17100 acacatctca cagtggatgg tatgacgagc tttcctgggc aggtgcatgg atttatcttg 17160 cagacggtga cgaaact tat ct tgaaaaag ctgaaaagta tgtggataaa tggccaatcg 17220 aaagccagac aacttacatt gcttattcat ggggtcactg ctgggacgac gttcactacg 17280 gagcagcact tcttttggca aagattacaa acaaatcctt atacaaagaa gcgatagaaa 17340 gacacctgga ctattggaca gttggattta atggtcagag agtcagatat acaccaaagg 17400 gtcttgctca cctcactgac tggggtgtat taagacatgc cactactact gcattccttg 17460 catgtgttta ttccgactgg tcagaatgtc caagggaaaa agccaatatt tacatagatt 17520 ttgccaagaa acaggctgac tatgccttag gcagcagcgg cagaagttat gtagtcggat 17580 ttggtgtaaa tcctccgcag catccgcacc acagaactgc ccacagctca tggtgtgaca 17640 gtcaaaaagt tcctgaatac cacagacacg ttctttacgg agcactcgta ggcggacctg 17700 atgccagcga tgcttatgtt gatgatatag gaaactatgt 
aacaaatgag gttgcctgcg 17760 actacaatgc cggttttgta ggattgctcg ccaagatgta tgaaaaatat ggcggaaacc 17820 ccataccaaa cttcatggct atagaagaaa aaacaaatga agaaatttat gttgaagcta 17880 ccgccaattc aaataacggt gtcgaattga aaacatacct ttacaataaa tccggatggc 17940 cggcaagagt ttgcgacaag ctttccttca gatatttcat ggaccttacg gaatatgtat 18000 ccgccggata caatcctaat gatataactg tttctataat ttacagtgca gcaccaactg 18060 caaaaatttc aaaaccaata ctttatgacg catccaaaaa catatattat tgcgaaatcg 18120 atctctccgg taccaagata ttccccggaa gcaactcaga ccaccagaaa gaaacccaat 18180 ttagaataca gcctcctgca ggcgcacctt gggacaacac caacgacttc tcctatcagg 18240 gaatcaagaa aaacggtgaa gttgtaaaag aaatgcctgt ttatgaagac ggaattctca 18300 tattcggtgt agaacccaat ggtaccggtc ctgcaacacc aacgccgaaa 第25 ccgtccgtaa 頁 18360 201122105 atccttcacc ttcacctacg ccaacatcgg atattcttta cggtgacatc aatctggacg 18420. gaaaaattaa ctcttcagat gttacactgt taaaaagata tattgtgaag tccatagatg 18480 ttttcccaac cgctgatccg gaacggagct taatagcatc agatgtaaac ggagacggaa 18540 gggtaaactc tacagactat tcatacctta aacgttatgt cttgaaaatc ataccaacca 18600 tacccggaaa ttcatgacac taggtgaggg gaagatggag agaatggtaa aaagcagaaa 18660 gatttctatt ctgttggcag ttgcaatgct ggtatccata atgataccca caactgcatt 18720 ciicaggtcct acaaaggcac ctacaaaaga tgggacatct tataaggatc ttttccttga 18780 actctacgga aaaattaaag atcctaagaa cggatatttc agcccagacg agggaattcc 18840 ttatcactca attgaaacat tgatcgttga agcgccggac tacggtcacg ttactaccag 18900 tj?aggctttc agctattatg tatggcttga agcaatgtat ggaaatctca caggcaactg 18960 gtccgaagta gaaacagcat ggaaagttat ggaggattgg ataattcctg acagcacaga 19020 gcagccgggt atgtcttctt acaatccaaa cagccctgcc acatatgctg acgaatatga 19080 ggatccttca tactatcctt cagagttgaa gtttgatacc gtaagagttg gatccgaccc 19140 tgtacacaac gaccttgtat ccgcatacgg tcctaacatg tacctcatgc actggttgat 19200 ggacgttgac aactggtacg gttttggtac aggaacacgg gcaacattca taaacacctt 19260 ccaaagaggt gaac谷ggaat ccacatggga aaccattcct catccgtcaa tagaagagtt 19320 caaatacggc ggaccgaacg gattccttga tttgtttaca aaggacagat catatgcaaa 19380 
acagtggcgt tatacaaacg ctcctgacgc agaaggccgt gctatacagg ctgtttactg 19440 ggcaaacaaa tgggcaaagg agcagggtaa aggttctgcc gttgcttccg ttgtatccaa 19500 ggctgcaaag atgggtgact tcttgagaaa cgacatgttc gacaaatact tcatgaagat 19560 cggtgcacag gacaagactc ctgctaccgg ttatgacagt gcacactacc ttatggcctg 19620 iitatactgca tggggtggtg gaattggtgc atcctgggca tggaagatcg gatgcagcca 19680 cgcacacttc ggatatcaga acccattcca gggatgggta agtgcaacac agagcgactt 19740 tgctcctaaa tcatccaacg gtaagagaga ctggacaaca agctacaaga gacagcttga 19800 attctatcag tggttgcagt cggctgaagg tggtattgcc ggtggagcaa ccaactcctg 19860 iiaacsgtaga tatgagaaat atcctgctgg tacgtcaacg ttctatggta tggcatatgt 19920 tccgcatcct gtatacgctg acccgggtag taaccagtgg ttcggattcc aggcatggtc 19980 aatgcagcgt gtaatggagt actacctcga aacaggagat tcatcagtta agaatttgat 20040 taagaagtgg gtcgactggg taatgagcga aattaagctc tatgacgatg gaacatttgc 20100 aattcctagc gacctcgagt ggtcaggtca gcctgataca tggaccggaa catacacagg 20160 caacccgaac ctccatgtaa gagtaacttc ttacggtact gaccttggtg ttgcaggttc 20220 acttgcaaat gctcttgcaa cttatgccgc agctacagaa agatgggaag gaaaacttga 20280 tacaaaagca agagacatgg ctgctgaact ggttaaccgt gcatggtaca acttctactg 20340 ctctgaagga aaaggtgttg ttactgagga agcacgtgct gactacaaac gtttctttga 20400 gcaggaagta tacgttccgg caggttggag cggtactatg ccgaacggtg acaagattca 20460 gcctggtatt aagttcatag acatccgtac aaaatataga caagatcctt actacgatat 20520 agtatatcag gcatacttga gaggcgaagc tcctgtattg aattatcacc gcttctggca 20580 tgaagttgac cttgcagttg caatgggtgt attggctaca tacttcccgg atatgacata 20640 taaagtacct ggtactcctt ctactaaatt atacggcgac gtcaatgatg acggaaaagt 20700 taactcaact gacgctgtag cattgaagag atatgttttg agatcaggta taagcatcaa 20760 cactgacaat gccgatttga atgaagacgg cagagttaat tcaactgact taggaatttt 20820 gaagagatat attctcaaag aaatagatac attgccgtac aagaactaa 20869


Claims (1)

VII. Claims:

1. A polycistronic expression cassette for producing a cellulosome of a microorganism, wherein the cellulosome comprises a scaffoldin subunit and a plurality of degradative enzyme subunits, and the expression levels of the plurality of degradative enzyme subunits during growth of the microorganism have a natural ranking order, wherein the polycistronic expression cassette comprises:
(a) a promoter; and
(b) a polycistronic nucleotide sequence operably linked to the promoter, the polycistronic nucleotide sequence comprising a scaffoldin nucleotide sequence encoding the scaffoldin subunit and a plurality of degradative enzyme nucleotide sequences encoding the plurality of degradative enzyme subunits, wherein the plurality of degradative enzyme nucleotide sequences are arranged sequentially in the polycistronic nucleotide sequence such that the ranking order of the expression levels of the plurality of degradative enzyme subunits under the control of the promoter matches the natural ranking order.

2. The polycistronic expression cassette of claim 1, wherein the scaffoldin subunit comprises one or more type I cohesin domains and the plurality of degradative enzyme subunits comprise type I dockerin domains, wherein the type I cohesin domains and the type I dockerin domains can bind to each other.

3. The polycistronic expression cassette of claim 1, wherein the cellulosome further comprises a cell surface anchoring protein subunit, and the polycistronic nucleotide sequence further comprises a cell surface anchoring protein nucleotide sequence encoding the cell surface anchoring protein subunit.

4. The polycistronic expression cassette of claim 3, wherein the cell surface anchoring protein subunit comprises one or more type II cohesin domains and the scaffoldin subunit further comprises a type II dockerin domain, wherein the type II cohesin domains and the type II dockerin domain can bind to each other.

5. The polycistronic expression cassette of claim 1, wherein the microorganism is one selected from the group consisting of Clostridium thermocellum, Clostridium cellulovorans, Clostridium cellulolyticum, Clostridium papyrosolvens, Trichoderma longibrachiatum, Bacteroides cellulosolvens, Acetivibrio sp. (解纖維素醋弧菌), Neocallimastix frontalis, Piromyces spp., Bacillus megaterium, Bacillus licheniformis, Bacillus sp. (溶纖維芽孢桿菌), Ruminococcus flavefaciens, and Acetivibrio cellulolyticus.

6. The polycistronic expression cassette of claim 5, wherein the microorganism is Clostridium thermocellum.

7. The polycistronic expression cassette of claim 6, wherein the scaffoldin subunit is CipA.

8. The polycistronic expression cassette of claim 7, wherein the plurality of degradative enzyme subunits comprise the exoglucanases CelS and CelK, the endoglucanase CelA, and the xylanases XynC and XynZ.

9. The polycistronic expression cassette of claim 8, wherein the degradative enzyme nucleotide sequences encode, in order, CelS, CelK, CelA, XynC and XynZ.

10. The polycistronic expression cassette of claim 9, wherein the polycistronic nucleotide sequence encodes, in order, CipA, CelS, CelK, CelA, XynC and XynZ.

11. The polycistronic expression cassette of claim 3, wherein the microorganism is Clostridium thermocellum.

12. The polycistronic expression cassette of claim 11, wherein the scaffoldin subunit is CipA.

13. The polycistronic expression cassette of claim 12, wherein the cell surface anchoring protein subunit is any one selected from the group consisting of OlpB, SdbA and Orf2p.

14. The polycistronic expression cassette of claim 13, wherein the plurality of degradative enzyme subunits comprise the exoglucanases CelS and CelK, the endoglucanases CelR and CelA, and the xylanases XynC and XynZ.

15. The polycistronic expression cassette of claim 14, wherein the degradative enzyme nucleotide sequences encode, in order, CelK, CelS, CelR, CelA, XynC and XynZ.

16. The polycistronic expression cassette of claim 15, wherein the cell surface anchoring protein subunit is SdbA.

17. The polycistronic expression cassette of claim 16, wherein the polycistronic nucleotide sequence encodes, in order, CipA, CelK, CelS, CelR, SdbA, CelA, XynC and XynZ.

18. [Text missing in the source.]

19. The polycistronic expression cassette of claim 18, wherein the cell surface anchoring protein subunit is SdbA.

20. The polycistronic expression cassette of claim 19, wherein the polycistronic nucleotide sequence encodes, in order, CipA, SdbA, CelK, CelR and CelS.

21. The polycistronic expression cassette of claim 13, wherein the cell surface anchoring protein subunit is OlpB.

22. The polycistronic expression cassette of claim [number illegible in the source], wherein the promoter is [illegible in the source].

23. A vector comprising the polycistronic expression cassette of any one of claims 1 to 22.

24. A host cell comprising the vector of claim 23.

25. The host cell of claim 24, which is mesophilic.

26. The host cell of claim 24, which is Bacillus subtilis.

27. A method for producing a cellulosome, comprising culturing the host cell of claim 24 under conditions in which the cellulosome can be expressed, so as to produce the cellulosome.

28. The method of claim 27, wherein the host cell is Bacillus subtilis.

29. The method of claim [number illegible in the source], further comprising a step of purifying the cellulosome.

30. The method of claim 29, wherein the step uses [a cellulose-based separation method; remainder illegible in the source].

31. The method of claim 29, wherein the step uses [illegible in the source].

32. A method for degrading lignocellulosic biomass, comprising contacting the host cell of claim 24 with lignocellulosic biomass.

33. A method for adjusting the relative content of a plurality of degradative enzyme subunits contained in a cellulosome, comprising:
(1) providing a vector comprising a polycistronic expression cassette, the polycistronic expression cassette comprising:
(a) a promoter; and
(b) a polycistronic nucleotide sequence operably linked to the promoter, the polycistronic nucleotide sequence comprising a scaffoldin nucleotide sequence encoding a scaffoldin subunit and a plurality of degradative enzyme nucleotide sequences encoding the plurality of degradative enzyme subunits, wherein the plurality of degradative enzyme nucleotide sequences are arranged sequentially in the polycistronic nucleotide sequence according to their positions relative to the promoter, such that the expression levels of the plurality of degradative enzyme subunits under the control of the promoter correspond to the positions of the plurality of degradative enzyme nucleotide sequences relative to the promoter; and
(2) introducing the vector into a host cell and culturing the host cell in a suitable environment to express the plurality of degradative enzyme subunits, whereby the expression levels among the plurality of degradative enzyme subunits are adjusted, and the relative content of the plurality of degradative enzyme subunits on the cellulosome is adjusted accordingly.
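The ordering principle recited in the claims, in which the degradative enzyme genes are arranged so that their expression ranking under one promoter matches a natural ranking, can be illustrated with a short sketch. It is not the applicants' procedure. It assumes, for illustration only, that a cistron nearer the promoter is expressed at a higher level than one farther away, it places the scaffoldin gene first as in the order recited for CipA, and the abundance values are hypothetical placeholders rather than data from the specification. The function name `order_cistrons` is likewise hypothetical.

```python
# Illustrative sketch only. The abundance values below are hypothetical
# placeholders, not measurements from the specification.

def order_cistrons(scaffoldin, natural_levels):
    """Return a cistron order for a polycistronic cassette.

    The scaffoldin gene is placed first, followed by the enzyme genes from
    the highest to the lowest natural expression level. This assumes
    (hypothetically) that expression falls with distance from the promoter.
    """
    enzymes = sorted(natural_levels, key=natural_levels.get, reverse=True)
    return [scaffoldin] + enzymes


# Hypothetical relative expression levels during growth of the microorganism.
natural_levels = {"XynC": 2.0, "CelA": 3.0, "CelS": 10.0, "XynZ": 1.0, "CelK": 6.0}

cassette = order_cistrons("CipA", natural_levels)
print(" -> ".join(["promoter"] + cassette))
```

With these placeholder values the sketch yields the order CipA, CelS, CelK, CelA, XynC, XynZ. A different set of measured levels would yield a different order, which is the sense in which the arrangement follows the natural ranking.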
TW99144330A 2009-12-16 2010-12-16 Polycistronic expression cassettes for producing cellulosomes and applications thereof TWI444477B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW99144330A TWI444477B (en) 2009-12-16 2010-12-16 Polycistronic expression cassettes for producing cellulosomes and applications thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
TW98143083 2009-12-16
TW99144330A TWI444477B (en) 2009-12-16 2010-12-16 Polycistronic expression cassettes for producing cellulosomes and applications thereof

Publications (2)

Publication Number Publication Date
TW201122105A TW201122105A (en) 2011-07-01
TWI444477B TWI444477B (en) 2014-07-11

Family

ID=45045916

Family Applications (1)

Application Number Title Priority Date Filing Date
TW99144330A TWI444477B (en) 2009-12-16 2010-12-16 Polycistronic expression cassettes for producing cellulosomes and applications thereof

Country Status (1)

Country Link
TW (1) TWI444477B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3485734A1 (en) * 2017-11-21 2019-05-22 Technische Universität München Method for preparing food products comprising rye

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3485734A1 (en) * 2017-11-21 2019-05-22 Technische Universität München Method for preparing food products comprising rye
WO2019101794A1 (en) * 2017-11-21 2019-05-31 Technische Universität München Method for preparing food products comprising rye
CN111615335A (en) * 2017-11-21 2020-09-01 慕尼黑工业大学 Process for preparing a food product comprising rye

Also Published As

Publication number Publication date
TWI444477B (en) 2014-07-11

Similar Documents

Publication Publication Date Title
Zou et al. Construction of a cellulase hyper-expression system in Trichoderma reesei by promoter and enzyme engineering
DK2361311T3 (en) MAKE EXPRESSIVE CELLULASES FOR SIMULTANEOUS INSURANCE AND ACTION BY CELLULOSE
NZ598403A (en) Polypeptides having cellulolytic enhancing activity and polynucleotides encoding same
MX2010012952A (en) Compositions and methods for producing fermentable carbohydrates in plants.
CN110423705A (en) For the method by product yield and yield in addition alternately electron acceptor improvement microorganism
KR20140027154A (en) Glycosyl hydrolase enzymes and uses thereof for biomass hydrolysis
HUE027643T2 (en) Construction of highly efficient cellulase compositions for enzymatic hydrolysis of cellulose
WO2010075529A2 (en) Heterologous biomass degrading enzyme expression in thermoanaerobacterium saccharolyticum
Zhu et al. Characterization of a family 5 glycoside hydrolase isolated from the outer membrane of cellulolytic Cytophaga hutchinsonii
WO2014155566A1 (en) Thermostable cellobiohydrolase
CN109852597A (en) A kind of beta galactosidase galRBM20_1 and its preparation method and application
CN104911197B (en) The acquisition methods of the natural variation body of enzyme and super heat resistant fibre disaccharide-hydrolysing enzymes
Toyama et al. A novel β-glucosidase isolated from the microbial metagenome of Lake Poraquê (Amazon, Brazil)
US9580702B2 (en) Thermostable cellobiohydrolase and amino acid substituted variant thereof
NZ597623A (en) Polypeptides having beta-glucosidase activity and polynucleotides encoding same
CN114761553A (en) Nucleic acids, vectors, host cells and methods for producing beta-fructofuranosidase from aspergillus niger
TW201122105A (en) Polycistronic expression cassettes for producing cellulosomes and applications thereof
Liu et al. Engineering of dual-functional hybrid glucanases
US20140154751A1 (en) Enhanced fermentation of cellodextrins and beta-d-glucose
JP2017175958A (en) Thermostable cellobiohydrolase
Louime et al. Molecular cloning and biochemical characterization of a family-9 endoglucanase with an unusual structure from the gliding bacteria Cytophaga hutchinsonii
Wereszka et al. A cellulase produced by the rumen protozoan Epidinium ecaudatum is of bacterial origin and has an unusual pH optimum
CN107236720B (en) Thermostable cellobiohydrolase
Rai et al. All4894 encoding a novel fasciclin (FAS-1 domain) protein of Anabaena sp. PCC7120 revealed the presence of a thermostable β-glucosidase
JP6429377B2 (en) Thermostable cellobiohydrolase