[go: up one dir, main page]

TW202217000A - Sars-cov-2 mrna domain vaccines - Google Patents

Sars-cov-2 mrna domain vaccines Download PDF

Info

Publication number
TW202217000A
TW202217000A TW110123289A TW110123289A TW202217000A TW 202217000 A TW202217000 A TW 202217000A TW 110123289 A TW110123289 A TW 110123289A TW 110123289 A TW110123289 A TW 110123289A TW 202217000 A TW202217000 A TW 202217000A
Authority
TW
Taiwan
Prior art keywords
mrna
seq
amino acid
mol
orf
Prior art date
Application number
TW110123289A
Other languages
Chinese (zh)
Inventor
瓊斯 吉洛姆 史都華
Original Assignee
美商現代公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 美商現代公司 filed Critical 美商現代公司
Publication of TW202217000A publication Critical patent/TW202217000A/en

Links

Images

Landscapes

  • Peptides Or Proteins (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)

Abstract

The disclosure relates to coronavirus ribonucleic acid (RNA) vaccines as well as methods of using the vaccines and compositions comprising the vaccines. The RNA vaccines encode domains and subunits of coronavirus.

Description

SARS-COV-2 MRNA結構域疫苗SARS-COV-2 mRNA domain vaccine

人類冠狀病毒係冠狀病毒科之高度接觸傳染性有套膜正單股RNA病毒。已知冠狀病毒科之兩個亞科會引起人類疾病。最重要的是β-冠狀病毒(β-coronaviruses,betacoronaviruses)。β-冠狀病毒係輕度至中度上呼吸道感染之常見致病因子。然而,新型冠狀病毒感染(諸如由最初於2019年12月自中國武漢市鑑別之冠狀病毒引起之感染)之爆發與高死亡率死亡人數相關。此最近鑑別之冠狀病毒(稱為嚴重急性呼吸症候群冠狀病毒2 (SARS-CoV-2),以前稱為「2019新型冠狀病毒」或「2019-nCoV」)已迅速感染數十萬人。SARS-CoV-2病毒引起之大流行病已由世界衛生組織(WHO)命名為COVID-19 (冠狀病毒疾病2019,Coronavirus Disease 2019)。SARS-CoV-2分離株(Wuhan-Hu-1)之第一個基因體序列由位於北京之中國CDC之研究人員在2020年1月10日在Virological發表,該Virological係用於分析及解釋病毒分子進化及流行病學之基於UK之討論論壇。接著將該序列於2020年1月12日保藏在GenBank中,GenBank登錄號為MN908947.1。Human coronavirus is a highly contagious mantle positive single-stranded RNA virus of the family Coronaviridae. Two subfamilies of the coronavirus family are known to cause disease in humans. The most important are beta-coronaviruses (beta-coronaviruses, betacoronaviruses). Beta-coronavirus is a common causative agent of mild to moderate upper respiratory tract infections. However, outbreaks of novel coronavirus infections, such as those caused by a coronavirus originally identified from Wuhan, China in December 2019, are associated with high mortality. This recently identified coronavirus, known as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), formerly known as "2019 novel coronavirus" or "2019-nCoV", has rapidly infected hundreds of thousands of people. The pandemic caused by the SARS-CoV-2 virus has been named COVID-19 (Coronavirus Disease 2019) by the World Health Organization (WHO). The first genome sequence of the SARS-CoV-2 isolate (Wuhan-Hu-1) was published on January 10, 2020 by researchers at the Chinese CDC in Beijing in Virological, a department for virus analysis and interpretation UK-based discussion forum for molecular evolution and epidemiology. The sequence was then deposited in GenBank on January 12, 2020, with GenBank accession number MN908947.1.

目前,尚無針對COVID-19之特異性治療或針對SARS-CoV-2感染之疫苗。與冠狀病毒感染、特別是SARS-CoV-2大流行相關之持續之健康問題及死亡率在國際上受到極大關注。由SARS-CoV-2引起之公共健康危機加强快速開發針對該等病毒之有效及安全之疫苗候選者的重要性。Currently, there is no specific treatment for COVID-19 or a vaccine against SARS-CoV-2 infection. The ongoing health problems and mortality associated with coronavirus infections, especially the SARS-CoV-2 pandemic, are of great international concern. The public health crisis caused by SARS-CoV-2 reinforces the importance of rapidly developing effective and safe vaccine candidates against these viruses.

在一些實施例中,本文提供包含一或多種信使核糖核酸(mRNA)分子之組合物(例如疫苗),該等信使核糖核酸(mRNA)分子編碼能夠引發針對SARS-CoV-2抗原之強效中和抗體反應之高免疫原性抗原。本文所述之mRNA分子用於表現SARS-CoV-2冠狀病毒刺突(S)蛋白之關鍵中和結構域,當個別地或組合用作免疫原性組合物或疫苗時,其有效誘導保護性免疫,以保護人免於天然病毒感染及/或若感染則減輕症狀。In some embodiments, provided herein are compositions (eg, vaccines) comprising one or more messenger ribonucleic acid (mRNA) molecules encoding a potent medium capable of eliciting an antigen against SARS-CoV-2 Highly immunogenic antigens that react with antibodies. The mRNA molecules described herein are used to express the key neutralizing domain of the SARS-CoV-2 coronavirus spike (S) protein, which is effective in inducing protective properties when used individually or in combination as an immunogenic composition or vaccine Immunization to protect a person from natural viral infection and/or reduce symptoms if infected.

已知β冠狀病毒之套膜S蛋白決定病毒宿主向性及進入宿主細胞,且對於SARS-CoV-2感染係至關重要的。S蛋白之組織在β冠狀病毒(諸如SARS-CoV-2、SARS-CoV、MERS-CoV、HKU1-CoV、MHV-CoV及NL63-CoV (包括兩個次單元S1及S2,其分別介導附著及膜融合))中相似。S1次單元包括N末端結構域(NTD)及受體結合結構域(RBD)。The envelope S protein of betacoronaviruses is known to determine viral host tropism and entry into host cells and is critical for SARS-CoV-2 infection. The organization of the S protein in betacoronaviruses such as SARS-CoV-2, SARS-CoV, MERS-CoV, HKU1-CoV, MHV-CoV and NL63-CoV (including two subunits S1 and S2, which mediate attachment and membrane fusion)). The S1 subunit includes an N-terminal domain (NTD) and a receptor binding domain (RBD).

次單元抗原之表現將免疫反應集中於特定次單元,對與其他相關病毒共有之抗原之其他結構域具有特異性之記憶B及T細胞之刺激最小。本文提供之數據展現,投與編碼膜結合或可溶性SARS-CoV-2 S1次單元抗原之mRNA產生針對SARS-CoV-2 RBD抗原、NTD抗原、野生型全長S蛋白及具有雙脯胺酸突變以穩定融合前構象之S蛋白中之每一者之抗體效價。如本文所示,在所有測試之劑量下,兩劑量方案(亦即,包括加强劑量)有效誘導可識別且結合至SARS-CoV-2 WT S蛋白之抗體。令人驚訝的是,即使在S1次單元中未發現雙脯胺酸突變(雙脯胺酸突變發生在S2中,而在測試之免疫原中不存在S2),當針對雙脯胺酸穩定形式之S蛋白量測時,誘導之效價最高。The presentation of subunit antigens focuses the immune response on specific subunits, with minimal stimulation of memory B and T cells specific for other domains of antigens shared by other related viruses. The data presented herein demonstrate that administration of mRNA encoding a membrane-bound or soluble SARS-CoV-2 S1 subunit antigen produces targeting of the SARS-CoV-2 RBD antigen, NTD antigen, wild-type full-length S protein, and a bisproline mutation to Antibody titers for each of the S proteins that stabilized the prefusion conformation. As shown herein, at all doses tested, the two-dose regimen (ie, including the booster dose) was effective in inducing antibodies that recognize and bind to the SARS-CoV-2 WT S protein. Surprisingly, even though no bisproline mutation was found in the S1 subunit (the bisproline mutation occurred in S2, which was not present in the tested immunogens), when targeting the bisproline stable form When the S protein was measured, the induction titer was the highest.

另外,已知NTD及RBD二者皆係結合中和病毒活性之抗體之位點。在SARS-CoV-2之情形下,RBD係刺突蛋白之受體結合位點,其結合血管收縮肽轉化酶2 (ACE2)。NTD之功能尚未完全瞭解,其似乎在結合糖部分及促進刺突蛋白自融合前構象向融合後構象之構象轉變中起作用。無論如何,NTD及RBD結構域皆誘導如本文所示之高結合抗體及中和抗體效價。In addition, both NTD and RBD are known to be sites of binding of antibodies that neutralize viral activity. In the case of SARS-CoV-2, RBD is the receptor binding site for the spike protein, which binds vasoconstrictor peptide converting enzyme 2 (ACE2). The function of NTD is not fully understood, but it appears to play a role in binding the sugar moiety and promoting the conformational transition of the spike protein from the prefusion to the postfusion conformation. Regardless, both the NTD and RBD domains induce high binding and neutralizing antibody titers as shown herein.

舉例而言,相當令人驚訝的是,本文一些實施例中提供之數據顯示,儘管來自投與編碼膜結合RBD抗原(RBD-TM)或膜結合NTD抗原(NTD-TM)之mRNA之血清顯示對SARS-CoV-2 S1/S2刺突蛋白具有免疫原性,但兩種mRNA (且因此兩種抗原)之50:50組合產生對SARS-CoV-2 S1/S2刺突蛋白出乎意料高之協同中和抗體效價。For example, it is rather surprising that the data presented in some of the examples herein show that although sera from administration of mRNA encoding membrane-bound RBD antigen (RBD-TM) or membrane-bound NTD antigen (NTD-TM) show Immunogenic for the SARS-CoV-2 S1/S2 spike protein, but a 50:50 combination of the two mRNAs (and thus the two antigens) yields unexpectedly high levels of the SARS-CoV-2 S1/S2 spike protein The synergistic neutralizing antibody titer.

因此,本揭示案之一些態樣提供包含編碼SARS-CoV-2 S蛋白之功能結構域之mRNA的組合物,該功能結構域能够誘導針對SARS-CoV-2之免疫反應,例如中和抗體反應。在一些實施例中,mRNA調配於脂質奈米顆粒中。Accordingly, some aspects of the present disclosure provide compositions comprising mRNA encoding a functional domain of the SARS-CoV-2 S protein capable of inducing an immune response, such as a neutralizing antibody response, against SARS-CoV-2 . In some embodiments, the mRNA is formulated in lipid nanoparticles.

在一些態樣中,提供包含編碼SARS-CoV-2刺突蛋白之至少兩個結構域且小於全長刺突蛋白之開放閱讀框(ORF)的mRNA。小於全長刺突蛋白之刺突蛋白係具有比全長刺突蛋白少至少一個胺基酸的刺突蛋白之一或多個結構域及/或次單元,或具有以非天然順序或序列連接在一起之一或多個結構域的融合蛋白。在一些實施例中,兩個結構域中之一者係SARS-CoV-2刺突蛋白之N末端結構域(NTD)。在一些實施例中,兩個結構域中之一者係SARS-CoV-2刺突蛋白之受體結合結構域(RBD)。在一些實施例中,ORF編碼與NTD及/或RBD連接之跨膜結構域(TD)。在一些實施例中,TD係流行性感冒血球凝集素跨膜結構域。在一些實施例中,ORF包含NTD - RBD - TM。在一些實施例中,至少兩個結構域經由可裂解或不可裂解之連接體連接。在一些實施例中,不可裂解之連接體係甘胺酸-絲胺酸(GS)連接體。在一些實施例中,GS連接體係4-15個胺基酸。在一些實施例中,連接體係泛HLA DR結合抗原決定基(pan HLA DR-binding epitope) (PADRE)。在一些實施例中,ORF編碼信號肽。在一些實施例中,信號肽與NTD連接。在一些實施例中,信號肽與RBD連接。在一些實施例中,信號肽與SARS-CoV-2异源。在一些實施例中,至少兩個結構域係可溶的。在一些實施例中,ORF編碼運輸信號結構域。在一些實施例中,運輸信號結構域係巨噬細胞標記物。在一些實施例中,巨噬細胞標記物係CD86及/或CD11b。在一些實施例中,運輸信號結構域係VSV-G胞質尾(VSVGct)。在一些實施例中,兩個結構域中之一者係SARS-CoV-2刺突蛋白之第一重複七肽:HPPHCPC (HR1)。在一些實施例中,兩個結構域中之一者係SARS-CoV-2刺突蛋白之第二重複七肽:HPPHCPC (HR2)。在一些實施例中,ORF編碼與HR1及/或HR2連接之跨膜結構域(TD)。在一些實施例中,TD係流行性感冒血球凝集素跨膜結構域。在一些實施例中,ORF編碼融合肽(FP)。在一些實施例中,ORF編碼CT尾。In some aspects, an mRNA comprising an open reading frame (ORF) that encodes at least two domains of the SARS-CoV-2 spike protein and is smaller than the full-length spike protein is provided. Spike proteins that are smaller than the full-length spike protein are those that have one or more domains and/or subunits of the spike protein that are at least one amino acid fewer than the full-length spike protein, or have a non-native order or sequence linked together A fusion protein of one or more domains. In some embodiments, one of the two domains is the N-terminal domain (NTD) of the SARS-CoV-2 spike protein. In some embodiments, one of the two domains is the receptor binding domain (RBD) of the SARS-CoV-2 spike protein. In some embodiments, the ORF encodes a transmembrane domain (TD) linked to NTD and/or RBD. In some embodiments, the TD is an influenza hemagglutinin transmembrane domain. In some embodiments, the ORF comprises NTD-RBD-TM. In some embodiments, at least two domains are linked via a cleavable or non-cleavable linker. In some embodiments, the non-cleavable linker system glycine-serine (GS) linker. In some embodiments, the GS linkage system is 4-15 amino acids. In some embodiments, the linking system is a pan HLA DR-binding epitope (PADRE). In some embodiments, the ORF encodes a signal peptide. In some embodiments, the signal peptide is linked to the NTD. In some embodiments, the signal peptide is linked to the RBD. In some embodiments, the signal peptide is heterologous to SARS-CoV-2. In some embodiments, at least two domains are soluble. In some embodiments, the ORF encodes a trafficking signaling domain. In some embodiments, the trafficking signaling domain is a macrophage marker. In some embodiments, the macrophage markers are CD86 and/or CD11b. In some embodiments, the trafficking signaling domain is a VSV-G cytoplasmic tail (VSVGct). In some embodiments, one of the two domains is the first repeat heptapeptide of the SARS-CoV-2 spike protein: HPPHCPC (HR1). In some embodiments, one of the two domains is the second repeat heptapeptide of the SARS-CoV-2 spike protein: HPPHCPC (HR2). In some embodiments, the ORF encodes a transmembrane domain (TD) linked to HR1 and/or HR2. In some embodiments, the TD is an influenza hemagglutinin transmembrane domain. In some embodiments, the ORF encodes a fusion peptide (FP). In some embodiments, the ORF encodes a CT tail.

在一些態樣中,提供包含編碼SARS-CoV-2刺突蛋白之受體結合結構域(RBD)之開放閱讀框(ORF)的mRNA。在一些實施例中,RBD係可溶的。在一些實施例中,RBD與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。In some aspects, an mRNA comprising an open reading frame (ORF) encoding the receptor binding domain (RBD) of the SARS-CoV-2 spike protein is provided. In some embodiments, the RBD is soluble. In some embodiments, the RBD is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain.

本發明之一或多個實施例之詳情在以下說明中闡述。自以下附圖及若干實施例之詳細說明以及所附申請專利範圍將明瞭本發明之其他特徵或優點。The details of one or more embodiments of the invention are set forth in the description below. Other features or advantages of the present invention will become apparent from the following drawings and detailed description of several embodiments and appended claims.

相關申请案related applications

本申請案根據35 U.S.C. § 119(e)主張於2020年6月25日提出申請之美國臨時申請案號63/044,330及於2020年8月7日提出申請之美國臨時申請案號63/063,137之權益,每一申請案皆以全文引用之方式併入本文中。This application claims U.S. Provisional Application No. 63/044,330, filed June 25, 2020, and U.S. Provisional Application No. 63/063,137, filed August 7, 2020, under 35 U.S.C. § 119(e) The rights and interests of each application are incorporated herein by reference in their entirety.

嚴重急性呼吸症候群冠狀病毒2 (SARS-CoV-2)係新近出現之高發病率及死亡率之呼吸道病毒。與2002年出現之SARS-CoV及2012年出現之中東呼吸症候群冠狀病毒(MERS-CoV)相比,SARS-CoV-2已經迅速在全世界傳播。世界衛生組織(WHO)報導,截至2020年7月6日,COVID-19之當前爆發在世界範圍內具有幾乎1150萬確診病例,其中有超過530,000例死亡。COVID-19感染之新病例呈上升趨勢,且仍然在迅速增加。因此,開發各種安全且有效之疫苗及藥物來預防及治療COVID-19且减少COVID-19在全世界所具有之嚴重影響係至關重要的。使用多種方式製得之疫苗及藥物以及具有改善之安全性及效能之疫苗係必要的。仍然需要加速針對冠狀病毒疾病2019 (COVID-19)之疫苗及治療藥物之先進設計及開發。Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a newly emerged respiratory virus with high morbidity and mortality. Compared with SARS-CoV, which emerged in 2002, and Middle East Respiratory Syndrome Coronavirus (MERS-CoV), which emerged in 2012, SARS-CoV-2 has spread rapidly around the world. The World Health Organization (WHO) reports that as of July 6, 2020, the current outbreak of COVID-19 has nearly 11.5 million confirmed cases worldwide, with more than 530,000 deaths. New cases of COVID-19 infections are on the rise and are still increasing rapidly. Therefore, it is crucial to develop various safe and effective vaccines and drugs to prevent and treat COVID-19 and reduce the severe impact of COVID-19 worldwide. There is a need for vaccines and drugs prepared using a variety of methods, as well as vaccines with improved safety and efficacy. There is still a need to accelerate advanced design and development of vaccines and therapeutics for the coronavirus disease 2019 (COVID-19).

在2020年1月7日,將嚴重急性呼吸症候群冠狀病毒2 (SARS-CoV-2)鑑別為於2019年12月出現在中國湖北省武漢市之新型肺炎之致病因子(Lu H等人,(2020) J Med Virol.四月;92(4):401-402。)。不久,該病毒在中國引起爆發且傳播至世界各地。根據SARS-CoV-2之基因體結構之分析,其屬於β-冠狀病毒(CoV) (Chan等人,2020 Emerg Microbes Infect.; 9(1):221-236)。 On January 7, 2020, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was identified as the causative agent of a novel pneumonia that appeared in Wuhan, Hubei Province, China in December 2019 (Lu H et al. (2020) J Med Virol. Apr;92(4):401-402.). Soon, the virus caused an outbreak in China and spread around the world. According to the analysis of the genome structure of SARS-CoV-2, it belongs to beta-coronavirus (CoV) (Chan et al., 2020 Emerg Microbes Infect .; 9(1):221-236).

冠狀病毒表面上之關鍵蛋白係刺突蛋白。已設計大量mRNA構築體且在本文中揭示。當在適當遞送媒劑中調配時,編碼刺突抗原、其次單元及結構域之mRNA能够誘導針對SARS-CoV-2之强免疫反應,從而產生有效且強效之mRNA疫苗。投與編碼各種刺突蛋白抗原、特別是刺突蛋白次單元及結構域抗原之mRNA導致mRNA遞送至免疫系統之免疫組織及細胞,在那裏其快速轉譯成蛋白抗原。其他免疫細胞(例如B細胞及T細胞)接著能够識別及啓動,且免疫反應發展針對編碼之蛋白之免疫反應,且最終產生針對冠狀病毒之持久保護性反應。經由使用本文揭示之編碼刺突蛋白、其次單元及結構域之高效mRNA疫苗避免低免疫原性(由於對免疫系統之呈遞差或抗原之不正確折叠而導致之蛋白疫苗開發之缺點)。The key protein on the surface of the coronavirus is the spike protein. A number of mRNA constructs have been designed and are disclosed herein. When formulated in an appropriate delivery vehicle, mRNA encoding the spike antigen, subunits and domains are capable of inducing a potent immune response against SARS-CoV-2, resulting in an effective and potent mRNA vaccine. Administration of mRNA encoding various Spike antigens, particularly Spike subunit and domain antigens, results in delivery of the mRNA to immune tissues and cells of the immune system, where it is rapidly translated into protein antigens. Other immune cells, such as B cells and T cells, are then able to recognize and initiate, and the immune response develops an immune response against the encoded protein and ultimately a durable protective response against the coronavirus. Low immunogenicity (a disadvantage of protein vaccine development due to poor presentation to the immune system or incorrect folding of the antigen) is avoided by using the highly efficient mRNA vaccines disclosed herein encoding the spike protein, its subunits and domains.

本揭示案提供引發針對冠狀病毒抗原之強效中和抗體之組合物(例如,mRNA疫苗)。在一些實施例中,組合物包括編碼至少一種(例如,一種、兩種或更多種)冠狀病毒抗原(諸如SARS-CoV-2抗原)之mRNA。在一些實施例中,mRNA編碼刺突蛋白結構域,諸如受體結合結構域(RBD)、N末端結構域(NTD)或RBD及NTD之組合。The present disclosure provides compositions (eg, mRNA vaccines) that elicit potent neutralizing antibodies against coronavirus antigens. In some embodiments, the composition includes mRNA encoding at least one (eg, one, two, or more) coronavirus antigens, such as SARS-CoV-2 antigens. In some embodiments, the mRNA encodes a spike protein domain, such as a receptor binding domain (RBD), an N-terminal domain (NTD), or a combination of RBD and NTD.

本揭示案之一些態樣提供包含編碼融合蛋白之開放閱讀框的信使核糖核酸(mRNA),該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域,例如天然存在之或异源之跨膜結構域。Some aspects of the present disclosure provide messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the receptor binding domain (RBD) and protein transmembrane structure of the SARS-CoV-2 spike protein domains, such as naturally occurring or heterologous transmembrane domains.

在一些實施例中,蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。In some embodiments, the protein transmembrane domain is an influenza hemagglutinin transmembrane domain.

在一些實施例中,融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。In some embodiments, the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77.

在一些實施例中,融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。In some embodiments, the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to the amino acid sequence of SEQ ID NO: 77 amino acid sequence.

在一些實施例中,融合蛋白包含SEQ ID NO: 77之胺基酸序列。In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO:77.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。In some embodiments, the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO:76.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。In some embodiments, the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or Nucleotide sequences that are at least 99% identical.

在一些實施例中,開放閱讀框包含SEQ ID NO: 76之核苷酸序列。In some embodiments, the open reading frame comprises the nucleotide sequence of SEQ ID NO:76.

本揭示案之其他態樣提供包含編碼融合蛋白之開放閱讀框的信使核糖核酸(mRNA),該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域及跨膜結構域。Other aspects of the present disclosure provide messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the amino (N) terminal domain and the transmembrane domain of the SARS-CoV-2 spike protein .

在一些實施例中,跨膜結構域係流行性感冒血球凝集素跨膜結構域。In some embodiments, the transmembrane domain is an influenza hemagglutinin transmembrane domain.

在一些實施例中,融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。In some embodiments, the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47.

在一些實施例中,融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。In some embodiments, the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to the amino acid sequence of SEQ ID NO: 47 amino acid sequence.

在一些實施例中,融合蛋白包含SEQ ID NO: 47之胺基酸序列。In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO:47.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。In some embodiments, the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。In some embodiments, the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or Nucleotide sequences that are at least 99% identical.

在一些實施例中,開放閱讀框包含SEQ ID NO: 46之核苷酸序列。In some embodiments, the open reading frame comprises the nucleotide sequence of SEQ ID NO:46.

本揭示案之其他態樣提供包含編碼融合蛋白之開放閱讀框的信使核糖核酸(mRNA),該融合蛋白包含視情况地經由連接體與SARS-CoV-2刺突蛋白之胺基(N)末端結構域連接之SARS-CoV-2刺突蛋白的受體結合結構域。Other aspects of the disclosure provide messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising, optionally via a linker, the amine (N) terminus of the SARS-CoV-2 spike protein Domain-linked receptor-binding domains of the SARS-CoV-2 spike protein.

在一些實施例中,融合蛋白進一步包含跨膜結構域。In some embodiments, the fusion protein further comprises a transmembrane domain.

在一些實施例中,融合蛋白包含與SEQ ID NO: 92之胺基酸序列具有至少80%一致性的胺基酸序列。In some embodiments, the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 92.

在一些實施例中,融合蛋白包含與SEQ ID NO: 92之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。In some embodiments, the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to the amino acid sequence of SEQ ID NO: 92 amino acid sequence.

在一些實施例中,融合蛋白包含SEQ ID NO: 92之胺基酸序列。In some embodiments, the fusion protein comprises the amino acid sequence of SEQ ID NO:92.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少70%一致性的核苷酸序列。In some embodiments, the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 91.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。In some embodiments, the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or Nucleotide sequences that are at least 99% identical.

在一些實施例中,開放閱讀框包含SEQ ID NO: 91之核苷酸序列。In some embodiments, the open reading frame comprises the nucleotide sequence of SEQ ID NO:91.

在一些實施例中,mRNA進一步包含5'非轉譯區(UTR),該5'非轉譯區視情況地包含SEQ ID NO: 131或2之核苷酸序列。In some embodiments, the mRNA further comprises a 5' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 131 or 2, as appropriate.

在一些實施例中,mRNA進一步包含3'非轉譯區(UTR),該3'非轉譯區視情況地包含SEQ ID NO: 132或4之核苷酸序列。In some embodiments, the mRNA further comprises a 3' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 132 or 4, as appropriate.

在一些實施例中,mRNA進一步包含5'端帽,視情況地7mG(5')ppp(5')NlmpNp。In some embodiments, the mRNA further comprises a 5' end cap, optionally 7mG(5')ppp(5')NlmpNp.

在一些實施例中,mRNA進一步包含聚A尾,該聚A尾視情況地具有約100個核苷酸之長度。In some embodiments, the mRNA further comprises a poly-A tail, optionally having a length of about 100 nucleotides.

在一些實施例中,mRNA包含化學修飾,視情況地1-甲基假尿苷。In some embodiments, the mRNA comprises a chemical modification, optionally 1-methylpseudouridine.

本揭示案之一些態樣提供包含前述段落中任一者之mRNA之組合物。Some aspects of the present disclosure provide compositions comprising the mRNA of any of the preceding paragraphs.

本揭示案之其他態樣提供包含前述段落中任一者之至少兩種mRNA之組合物。Other aspects of the present disclosure provide compositions comprising at least two mRNAs of any of the preceding paragraphs.

本揭示案之其他態樣提供組合物,該組合物包含:(a) 包含編碼融合蛋白之開放閱讀框的信使核糖核酸(mRNA),該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域;(b) 包含編碼融合蛋白之開放閱讀框之mRNA,該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域及跨膜結構域。在一些實施例中,(a)之mRNA對(b)之mRNA之比率為約1:1,例如1:2、1:3、2:1或3:1。在一些實施例中,組合物之至少50%之mRNA係(a)之mRNA。舉例而言,組合物之至少55%、60%、65%、70%、75%、80%、85%、90%或95%之mRNA係(a)之mRNA。在一些實施例中,組合物之至少50%之mRNA係(b)之mRNA。舉例而言,組合物之至少55%、60%、65%、70%、75%、80%、85%、90%或95%之mRNA係(b)之mRNA。Other aspects of the present disclosure provide compositions comprising: (a) messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising a receptor for the SARS-CoV-2 spike protein Binding domain (RBD) and protein transmembrane domain; (b) mRNA comprising an open reading frame encoding a fusion protein comprising the amino (N) terminal domain and transmembrane domain of the SARS-CoV-2 spike protein membrane domain. In some embodiments, the ratio of mRNA of (a) to mRNA of (b) is about 1:1, eg, 1:2, 1:3, 2:1, or 3:1. In some embodiments, at least 50% of the mRNA of the composition is the mRNA of (a). For example, at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 95% of the mRNA of the composition is the mRNA of (a). In some embodiments, at least 50% of the mRNA of the composition is the mRNA of (b). For example, at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90% or 95% of the mRNA of the composition is the mRNA of (b).

在一些實施例中,蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。In some embodiments, the protein transmembrane domain is an influenza hemagglutinin transmembrane domain.

在一些實施例中,(a)之融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。In some embodiments, the fusion protein of (a) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77.

在一些實施例中,(a)之融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。In some embodiments, the fusion protein of (a) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least the amino acid sequence of SEQ ID NO: 77 99% identical amino acid sequence.

在一些實施例中,(a)之融合蛋白包含SEQ ID NO: 77之胺基酸序列。In some embodiments, the fusion protein of (a) comprises the amino acid sequence of SEQ ID NO:77.

在一些實施例中,(a)之開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。In some embodiments, the open reading frame of (a) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 76.

在一些實施例中,(a)之開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。In some embodiments, the open reading frame of (a) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97% of the nucleotide sequence of SEQ ID Nucleotide sequences that are at least 98% or at least 99% identical.

在一些實施例中,(a)之開放閱讀框包含SEQ ID NO: 76之核苷酸序列。In some embodiments, the open reading frame of (a) comprises the nucleotide sequence of SEQ ID NO:76.

在一些實施例中,(b)之融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。In some embodiments, the fusion protein of (b) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47.

在一些實施例中,(b)之融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。In some embodiments, the fusion protein of (b) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least the amino acid sequence of SEQ ID NO: 47 99% identical amino acid sequence.

在一些實施例中,(b)之融合蛋白包含SEQ ID NO: 47之胺基酸序列。In some embodiments, the fusion protein of (b) comprises the amino acid sequence of SEQ ID NO:47.

在一些實施例中,(b)之開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。In some embodiments, the open reading frame of (b) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46.

在一些實施例中,(b)之開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。In some embodiments, the open reading frame of (b) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97% of the nucleotide sequence of SEQ ID NO: 46, Nucleotide sequences that are at least 98% or at least 99% identical.

在一些實施例中,(b)之開放閱讀框包含SEQ ID NO: 46之核苷酸序列。In some embodiments, the open reading frame of (b) comprises the nucleotide sequence of SEQ ID NO:46.

在一些實施例中,mRNA調配於脂質奈米顆粒中。In some embodiments, the mRNA is formulated in lipid nanoparticles.

在一些實施例中,組合物進一步包含脂質奈米顆粒。In some embodiments, the composition further comprises lipid nanoparticles.

在一些實施例中,(a)之mRNA調配於脂質奈米顆粒中,且(b)之mRNA調配於脂質奈米顆粒中。In some embodiments, the mRNA of (a) is formulated in lipid nanoparticles, and the mRNA of (b) is formulated in lipid nanoparticles.

在一些實施例中,脂質奈米顆粒包含陽離子脂質。In some embodiments, the lipid nanoparticles comprise cationic lipids.

在一些實施例中,脂質奈米顆粒進一步包含中性脂質。In some embodiments, the lipid nanoparticles further comprise neutral lipids.

在一些實施例中,脂質奈米顆粒進一步包含固醇。In some embodiments, the lipid nanoparticle further comprises a sterol.

在一些實施例中,脂質奈米顆粒進一步包含聚乙二醇(PEG)改質之脂質。In some embodiments, the lipid nanoparticles further comprise polyethylene glycol (PEG) modified lipids.

在一些實施例中,脂質奈米顆粒包含可電離之陽離子脂質、中性脂質、固醇及PEG改質之脂質。In some embodiments, lipid nanoparticles comprise ionizable cationic lipids, neutral lipids, sterols, and PEG-modified lipids.

在一些實施例中,可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1)。In some embodiments, the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxy-6-(undecyloxy)hexyl)amino)octanoic acid heptadecan-9 - base ester (compound 1).

在一些實施例中,中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC)。In some embodiments, the neutral lipid is 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC).

在一些實施例中,固醇係膽固醇。In some embodiments, the sterol is cholesterol.

在一些實施例中,PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。In some embodiments, the PEG-modified lipid is 1,2 dimyristoyl-sn-glycero-methoxypolyethylene glycol (PEG2000 DMG).

在一些實施例中,脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise 20-60 mol% ionizable cationic lipids, 5-25 mol% neutral lipids, 25-55 mol% sterols, and 0.5-15 mol% PEG-modified lipids.

在一些實施例中,脂質奈米顆粒包含:47 mol%可電離之陽離子脂質;11.5 mol%中性脂質;38.5 mol%固醇;及3.0 mol% PEG改質之脂質;48 mol%可電離之陽離子脂質;11 mol%中性脂質;38.5 mol%固醇;及2.5 mol% PEG改質之脂質;49 mol%可電離之陽離子脂質;10.5 mol%中性脂質;38.5 mol%固醇;及2.0 mol% PEG改質之脂質;50 mol%可電離之陽離子脂質;10 mol%中性脂質;38.5 mol%固醇;及1.5 mol% PEG改質之脂質;或51 mol%可電離之陽離子脂質;9.5 mol%中性脂質;38.5 mol%固醇;及1.0 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise: 47 mol% ionizable cationic lipids; 11.5 mol% neutral lipids; 38.5 mol% sterols; and 3.0 mol% PEG-modified lipids; 48 mol% ionizable lipids 11 mol% neutral lipids; 38.5 mol% sterols; and 2.5 mol% PEG-modified lipids; 49 mol% ionizable cationic lipids; 10.5 mol% neutral lipids; 38.5 mol% sterols; and 2.0 mol% PEG-modified lipids; 50 mol% ionizable cationic lipids; 10 mol% neutral lipids; 38.5 mol% sterols; and 1.5 mol% PEG-modified lipids; or 51 mol% ionizable cationic lipids; 9.5 mol% neutral lipids; 38.5 mol% sterols; and 1.0 mol% PEG-modified lipids.

在一些實施例中,脂質奈米顆粒包含47 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及3.0 mol% PEG2000 DMG;48 mol%化合物1;11 mol% DSPC;38.5 mol%膽固醇;及2.5 mol% PEG2000 DMG;49 mol%化合物1;10.5 mol% DSPC;38.5 mol%膽固醇;及2.0 mol% PEG2000 DMG;50 mol%化合物1;10 mol% DSPC;38.5 mol%膽固醇;及1.5 mol% PEG2000 DMG;或51 mol%化合物1;9.5 mol% DSPC;38.5 mol%膽固醇;及1.0 mol% PEG2000 DMG。In some embodiments, the lipid nanoparticle comprises 47 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 3.0 mol% PEG2000 DMG; 48 mol% Compound 1; 11 mol% DSPC; 38.5 mol% cholesterol; and 2.5 mol% PEG2000 DMG; 49 mol% Compound 1; 10.5 mol% DSPC; 38.5 mol% cholesterol; and 2.0 mol% PEG2000 DMG; 50 mol% Compound 1; 10 mol% DSPC; 38.5 mol% cholesterol; and 1.5 mol% PEG2000 DMG; or 51 mol% Compound 1; 9.5 mol% DSPC; 38.5 mol% cholesterol; and 1.0 mol% PEG2000 DMG.

本揭示案之其他態樣提供了一種方法,該方法包括向個體投與在個體中有效誘導針對SARS-CoV-2之中和抗體反應之量的後述申請專利範圍中任一項之mRNA或組合物。Other aspects of the present disclosure provide a method comprising administering to an individual an mRNA or combination of any of the following claims in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual thing.

本揭示案之其他態樣提供了一種方法,該方法包括向個體投與在個體中有效誘導針對SARS-CoV-2之T細胞免疫反應之量的後述申請專利範圍中任一項之mRNA或組合物。Other aspects of the present disclosure provide a method comprising administering to an individual an mRNA or combination of any of the following claims in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual thing.

本揭示案之一些態樣提供包含開放閱讀框(ORF)之信使核糖核酸(mRNA),該開放閱讀框編碼能夠誘導針對SARS-CoV-2之免疫反應(諸如中和抗體反應)之冠狀病毒抗原,其中抗原包含SARS-CoV-2之蛋白質片段或功能性蛋白質結構域,視情況地其中RNA調配於脂質奈米顆粒中。Some aspects of the present disclosure provide messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding a coronavirus antigen capable of inducing an immune response (such as a neutralizing antibody response) against SARS-CoV-2 , wherein the antigen comprises a protein fragment or functional protein domain of SARS-CoV-2, optionally wherein the RNA is formulated in lipid nanoparticles.

在一些實施例中,抗原係功能性蛋白質結構域。In some embodiments, the antigen is a functional protein domain.

在一些實施例中,蛋白質結構域係SARS-CoV-2刺突蛋白之N末端結構域(NTD)。In some embodiments, the protein domain is the N-terminal domain (NTD) of the SARS-CoV-2 spike protein.

在一些實施例中,NTD與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。In some embodiments, the NTD is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain.

在一些實施例中,抗原包含與SEQ ID NO: 47之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 47之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 47 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:47.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 46之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 46.

在一些實施例中,蛋白質結構域係SARS-CoV-2刺突蛋白之受體結合結構域(RBD)。In some embodiments, the protein domain is the receptor binding domain (RBD) of the SARS-CoV-2 spike protein.

在一些實施例中,RBD係可溶的。In some embodiments, the RBD is soluble.

在一些實施例中,抗原包含與SEQ ID NO: 62之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 62之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 62 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:62.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 61之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 61之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 61.

在一些實施例中,RBD與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。In some embodiments, the RBD is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain.

在一些實施例中,抗原包含與SEQ ID NO: 77之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 77之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 77 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:77.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 76之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:76.

在一些實施例中,NTD與SARS-CoV-2刺突蛋白之RBD連接以形成NTD-RBD融合蛋白。In some embodiments, the NTD is linked to the RBD of the SARS-CoV-2 spike protein to form an NTD-RBD fusion protein.

在一些實施例中,NTD-RBD融合體與跨膜結構域(TM)連接,視情況與流行性感冒血球凝集素跨膜結構域連接,以形成NTD-RBD-TM蛋白。In some embodiments, the NTD-RBD fusion is linked to a transmembrane domain (TM), optionally linked to an influenza hemagglutinin transmembrane domain, to form an NTD-RBD-TM protein.

在一些實施例中,抗原包含與SEQ ID NO: 92之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 92之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 92 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:92.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 91之核苷酸序列。In some embodiments, an open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 91.

在一些實施例中,NTD-RBD融合體包含C末端截短。In some embodiments, the NTD-RBD fusion comprises a C-terminal truncation.

在一些實施例中,抗原包含與SEQ ID NO: 107之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 107之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 107 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 107.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 106之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 106之核苷酸序列。In some embodiments, an open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 106.

在一些實施例中,NTD及/或RBD包括延伸區。In some embodiments, NTDs and/or RBDs include extension regions.

在一些實施例中,抗原包含與SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95% of the amino acid sequence of any one of SEQ ID NOs: 59, 86, 89, 116, 119, or 122 , at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises any one of SEQ ID NOs: 59, 86, 89, 116, 119 or 122 the amino acid sequence.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 70%, at least 75%, at least 80%, at least the 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises SEQ ID NOs: 58, 85, 88, 115, The nucleotide sequence of any of 118 or 121.

在一些實施例中,蛋白質結構域係SARS-CoV-2刺突蛋白之S1次單元結構域。In some embodiments, the protein domain is the S1 subunit domain of the SARS-CoV-2 spike protein.

在一些實施例中,S1次單元係可溶的。In some embodiments, the S1 subunit is soluble.

在一些實施例中,抗原包含與SEQ ID NO: 5之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 5之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 5 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:5.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 3之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 3之核苷酸序列。In some embodiments, an open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 3.

在一些實施例中,S1次單元與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。In some embodiments, the S1 subunit is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain.

在一些實施例中,抗原包含與SEQ ID NO: 17之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 17之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 17 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 17.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 16之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 16之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 16.

在一些實施例中,S1次單元經修飾以去除S蛋白之RBD或RBD之一部分。In some embodiments, the S1 subunit is modified to remove the RBD or a portion of the RBD of the S protein.

在一些實施例中,抗原包含與SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95% of the amino acid sequence of any one of SEQ ID NOs: 20, 23, 26, 29, 32, or 35 , at least 96%, at least 97%, at least 98% or at least 99% identical amino acid sequences, optionally wherein the antigen comprises any one of SEQ ID NOs: 20, 23, 26, 29, 32 or 35 the amino acid sequence.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises SEQ ID NOs: 19, 22, 25, 28, The nucleotide sequence of any of 31 or 34.

在一些實施例中,S1次單元與S蛋白之S2次單元連接。In some embodiments, the S1 subunit is linked to the S2 subunit of the S protein.

在一些實施例中,S2次單元來自SARS-CoV-2 S蛋白。In some embodiments, the S2 subunit is from the SARS-CoV-2 S protein.

在一些實施例中,S1次單元來自HKU1 S蛋白。In some embodiments, the S1 subunit is from the HKU1 S protein.

在一些實施例中,抗原包含與SEQ ID NO: 38之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 38之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 38 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:38.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 37之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 37之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:37.

在一些實施例中,S1次單元來自OC43 S蛋白。In some embodiments, the S1 subunit is from an OC43 S protein.

在一些實施例中,抗原包含與SEQ ID NO: 41之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 41之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% of the amino acid sequence of SEQ ID NO: 41 % identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 41.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 40之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 40之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, A nucleotide sequence that is at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 40.

在一些實施例中,抗原進一步包含支架結構域,視情況地選自鐵蛋白、二氧四氫蝶啶(lumazine)合成酶及摺疊子。In some embodiments, the antigen further comprises a scaffold domain, optionally selected from the group consisting of ferritin, lumazine synthetase, and foldon.

在一些實施例中,支架結構域係鐵蛋白。In some embodiments, the scaffold domain is ferritin.

在一些實施例中,抗原包含與SEQ ID NO: 8或65之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 8或65之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or An amino acid sequence of at least 99% identity wherein the antigen comprises the amino acid sequence of SEQ ID NO: 8 or 65, as appropriate.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 7或64之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 7或64之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97% of the nucleotide sequence of SEQ ID NO: 7 or 64 %, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 7 or 64.

在一些實施例中,支架結構域係二氧四氫蝶啶合成酶。In some embodiments, the scaffold domain is a dioxytetrahydropteridine synthase.

在一些實施例中,抗原包含與SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96% of the amino acid sequence of any one of SEQ ID NOs: 11, 14, 68, or 71 , an amino acid sequence of at least 97%, at least 98%, or at least 99% identity, optionally wherein the antigen comprises the amino acid sequence of any one of SEQ ID NOs: 11, 14, 68, or 71.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least A nucleotide sequence that is 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises any one of SEQ ID NOs: 10, 13, 67 or 70 the nucleotide sequence.

在一些實施例中,支架結構域係摺疊子。In some embodiments, the scaffold domain is a foldon.

在一些實施例中,抗原包含與SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列。In some embodiments, the antigen comprises at least 80%, at least 85%, at least 90% of the amino acid sequence of any one of SEQ ID NOs: 44, 50, 74, 80, 83, 101, 104, or 113 , at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises SEQ ID NOs: 44, 50, 74, 80, 83, 101, The amino acid sequence of any of 104 or 113.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 70%, at least 75%, at least 70%, at least A nucleotide sequence that is 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises SEQ ID NOs: 43, 49, The nucleotide sequence of any of 73, 79, 82, 100, 103 or 112.

在一些實施例中,抗原進一步包含運輸信號,視情況地選自巨噬細胞標記物,視情況地CD86、CD11B及/或VSVGct。In some embodiments, the antigen further comprises a trafficking signal, optionally selected from macrophage markers, optionally CD86, CD11B and/or VSVGct.

在一些實施例中,抗原包含與SEQ ID NO: 95、98或110中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 95、98或110中之任一者之胺基酸序列。In some embodiments the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least An amino acid sequence that is 97%, at least 98%, or at least 99% identical, as appropriate, wherein the antigen comprises the amino acid sequence of any of SEQ ID NOs: 95, 98, or 110.

在一些實施例中,開放閱讀框包含與SEQ ID NO: 94、97或109中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 94、97或109中之任一者之核苷酸序列。In some embodiments, the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95% of the nucleotide sequence of any one of SEQ ID NOs: 94, 97, or 109 , a nucleotide sequence of at least 96%, at least 97%, at least 98%, or at least 99% identity, optionally wherein the open reading frame comprises the nucleotides of any one of SEQ ID NOs: 94, 97, or 109 sequence.

在一些實施例中,mRNA調配於脂質奈米顆粒中。In some embodiments, the mRNA is formulated in lipid nanoparticles.

在一些實施例中,脂質奈米顆粒包含陽離子脂質(視情況地可電離之陽離子脂質)、中性脂質、固醇及/或聚乙二醇(PEG)改質之脂質。可電離之陽離子脂質在本文中可與可電離之脂質及陽離子脂質互換使用以指可電離之脂質。在一些實施例中,脂質奈米顆粒包含40-50 mol%可電離之脂質,視情況地45-50 mol%,例如45-46 mol%、46-47 mol%、47-48 mol%、48-49 mol%或49-50 mol%,例如約45 mol%、45.5 mol%、46 mol%、46.5 mol%、47 mol%、47.5 mol%、48 mol%、48.5 mol%、49 mol%或49.5 mol%。在一些實施例中,脂質奈米顆粒包含30-45 mol%固醇,視情況地35-40 mol%,例如30-31 mol%、31-32 mol%、32-33 mol%、33-34 mol%、35-35 mol%、35-36 mol%、36-37 mol%、38-38 mol%、38-39 mol%或39-40 mol%。在一些實施例中,脂質奈米顆粒包含5-15 mol%輔助脂質,視情況地10-12 mol%,例如5-6 mol%、6-7 mol%、7-8 mol%、8-9 mol%、9-10 mol%、10-11 mol%、11-12 mol%、12-13 mol%、13-14 mol%或14-15 mol%。在一些實施例中,脂質奈米顆粒包含1-5% PEG脂質,視情況地1-3 mol%,例如1.5-2.5 mol%、1-2 mol%、2-3 mol%、3-4 mol%或4-5 mol%。In some embodiments, the lipid nanoparticles comprise cationic lipids (optionally ionizable cationic lipids), neutral lipids, sterols, and/or polyethylene glycol (PEG) modified lipids. Ionizable cationic lipids are used interchangeably herein with ionizable lipids and cationic lipids to refer to ionizable lipids. In some embodiments, the lipid nanoparticle comprises 40-50 mol% ionizable lipid, optionally 45-50 mol%, eg, 45-46 mol%, 46-47 mol%, 47-48 mol%, 48 -49 mol% or 49-50 mol%, such as about 45 mol%, 45.5 mol%, 46 mol%, 46.5 mol%, 47 mol%, 47.5 mol%, 48 mol%, 48.5 mol%, 49 mol% or 49.5 mol%. In some embodiments, the lipid nanoparticle comprises 30-45 mol% sterol, optionally 35-40 mol%, eg, 30-31 mol%, 31-32 mol%, 32-33 mol%, 33-34 mol% mol%, 35-35 mol%, 35-36 mol%, 36-37 mol%, 38-38 mol%, 38-39 mol%, or 39-40 mol%. In some embodiments, the lipid nanoparticle comprises 5-15 mol% helper lipid, optionally 10-12 mol%, eg, 5-6 mol%, 6-7 mol%, 7-8 mol%, 8-9 mol% mol%, 9-10 mol%, 10-11 mol%, 11-12 mol%, 12-13 mol%, 13-14 mol% or 14-15 mol%. In some embodiments, lipid nanoparticles comprise 1-5% PEG lipid, optionally 1-3 mol%, eg, 1.5-2.5 mol%, 1-2 mol%, 2-3 mol%, 3-4 mol% % or 4-5 mol%.

在一些實施例中,可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1),中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC),固醇係膽固醇,且/或PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。In some embodiments, the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxy-6-(undecyloxy)hexyl)amino)octanoic acid heptadecan-9 - base ester (compound 1), neutral lipid system 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC), sterol system cholesterol, and/or PEG-modified lipid system 1 , 2 dimyristyl-sn-glycerol-methoxy polyethylene glycol (PEG2000 DMG).

在一些實施例中,脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise 20-60 mol% ionizable cationic lipids, 5-25 mol% neutral lipids, 25-55 mol% sterols, and 0.5-15 mol% PEG-modified lipids.

在一些實施例中,脂質奈米顆粒包含:47 mol%可電離之陽離子脂質;11.5 mol%中性脂質;38.5 mol%固醇;及3.0 mol% PEG改質之脂質;48 mol%可電離之陽離子脂質;11 mol%中性脂質;38.5 mol%固醇;及2.5 mol% PEG改質之脂質;49 mol%可電離之陽離子脂質;10.5 mol%中性脂質;38.5 mol%固醇;及2.0 mol% PEG改質之脂質;50 mol%可電離之陽離子脂質;10 mol%中性脂質;38.5 mol%固醇;及1.5 mol% PEG改質之脂質;或51 mol%可電離之陽離子脂質;9.5 mol%中性脂質;38.5 mol%固醇;及1.0 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise: 47 mol% ionizable cationic lipids; 11.5 mol% neutral lipids; 38.5 mol% sterols; and 3.0 mol% PEG-modified lipids; 48 mol% ionizable lipids 11 mol% neutral lipids; 38.5 mol% sterols; and 2.5 mol% PEG-modified lipids; 49 mol% ionizable cationic lipids; 10.5 mol% neutral lipids; 38.5 mol% sterols; and 2.0 mol% PEG-modified lipids; 50 mol% ionizable cationic lipids; 10 mol% neutral lipids; 38.5 mol% sterols; and 1.5 mol% PEG-modified lipids; or 51 mol% ionizable cationic lipids; 9.5 mol% neutral lipids; 38.5 mol% sterols; and 1.0 mol% PEG-modified lipids.

在一些實施例中,脂質奈米顆粒包含47 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及3.0 mol% PEG2000 DMG;48 mol%化合物1;11 mol% DSPC;38.5 mol%膽固醇;及2.5 mol% PEG2000 DMG;49 mol%化合物1;10.5 mol% DSPC;38.5 mol%膽固醇;及2.0 mol% PEG2000 DMG;50 mol%化合物1;10 mol% DSPC;38.5 mol%膽固醇;及1.5 mol% PEG2000 DMG;或51 mol%化合物1;9.5 mol% DSPC;38.5 mol%膽固醇;及1.0 mol% PEG2000 DMG。In some embodiments, the lipid nanoparticle comprises 47 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 3.0 mol% PEG2000 DMG; 48 mol% Compound 1; 11 mol% DSPC; 38.5 mol% cholesterol; and 2.5 mol% PEG2000 DMG; 49 mol% Compound 1; 10.5 mol% DSPC; 38.5 mol% cholesterol; and 2.0 mol% PEG2000 DMG; 50 mol% Compound 1; 10 mol% DSPC; 38.5 mol% cholesterol; and 1.5 mol% PEG2000 DMG; or 51 mol% Compound 1; 9.5 mol% DSPC; 38.5 mol% cholesterol; and 1.0 mol% PEG2000 DMG.

國際申請案號PCT/US2016/058327 (公開案號WO2017/070626)及國際申請案號PCT/US2018/022777 (公開案號WO2018/170347)之全部內容以引用方式併入本文中。 SARS-Cov-2 The entire contents of International Application No. PCT/US2016/058327 (Publication No. WO2017/070626) and International Application No. PCT/US2018/022777 (Publication No. WO2018/170347) are incorporated herein by reference. SARS-Cov-2

SARS-CoV-2之基因體係單股正義RNA (+ssRNA),大小為29.8-30 kb,編碼約9860個胺基酸(Chan等人,2000,同上;Kim等人,2020 Cell,5月14日;181(4):914-921,e10)。SARS-CoV-2係具有5'-端帽及3'-聚A尾之多順反子mRNA。將SARS-CoV-2基因體組織成編碼結構蛋白及非結構蛋白(Nsp)之特定基因。結構蛋白在基因體中之順序係5'-複製酶(開放閱讀框(ORF)1/ab)-結構蛋白[刺突(S)-套膜(E)-膜(M)-核酸蛋白殼(N)]-3'。冠狀病毒之基因體包括可變數量之編碼輔助蛋白、非結構蛋白及結構蛋白之開放閱讀框(Song等人,2019 Viruses;11(1):第59頁)。大部分抗原性肽位於結構蛋白中(Cui等人,2019 Nat. Rev. Microbiol.;17(3):181-192)。刺突表面醣蛋白(S)、小套膜蛋白(E)、基質蛋白(M)及核酸蛋白殼蛋白(N)係四種主要之結構蛋白。由於S-蛋白有助於細胞向性,其能够誘導中和抗體(NAb)及保護性免疫,故可將其視為所有其他結構蛋白中冠狀病毒疫苗開發中最重要之靶標之一。此外,胺基酸序列分析顯示S-蛋白含有冠狀病毒中之保守區,此可為開發通用疫苗之基礎。 抗原 The genetic system of SARS-CoV-2 single-stranded positive-sense RNA (+ssRNA), 29.8-30 kb in size, encodes about 9860 amino acids (Chan et al., 2000, supra; Kim et al., 2020 Cell, May 14 Sun;181(4):914-921, e10). SARS-CoV-2 is a polycistronic mRNA with a 5'-end cap and a 3'-poly A tail. The SARS-CoV-2 genome is organized into specific genes encoding structural and nonstructural proteins (Nsp). The sequence of structural proteins in the genome is 5'-replicase (open reading frame (ORF) 1/ab)-structural protein [spike (S)-mantle (E)-membrane (M)-nucleoprotein shell ( N)]-3'. The genome of coronaviruses includes a variable number of open reading frames encoding accessory, nonstructural, and structural proteins (Song et al., 2019 Viruses; 11(1): p. 59). Most antigenic peptides are located in structural proteins (Cui et al., 2019 Nat. Rev. Microbiol.; 17(3):181-192). Spike surface glycoprotein (S), small envelope protein (E), matrix protein (M) and nucleic acid protein capsid protein (N) are four major structural proteins. Since the S-protein contributes to cell tropism, its ability to induce neutralizing antibodies (NAbs) and protective immunity, it can be regarded as one of the most important targets in coronavirus vaccine development among all other structural proteins. In addition, amino acid sequence analysis showed that the S-protein contains a conserved region in coronaviruses, which can be the basis for the development of a universal vaccine. antigen

本發明之組合物(例如疫苗組合物)之特徵在於設計用於編碼相關抗原(例如,源自β冠狀病毒結構蛋白之抗原,特別是源自SARS-CoV-2刺突蛋白之抗原)之核酸,特別是mRNA。本發明之組合物(例如疫苗組合物)不包含抗原本身,而是包含一旦遞送至細胞、組織或個體即編碼抗原或抗原性序列之核酸,特別是mRNA。核酸分子(特別是mRNA)之遞送係藉由將該等核酸分子調配於適當載劑或遞送媒劑(例如,脂質奈米顆粒)中來實現,使得在投與至細胞、組織或個體後,核酸由細胞攝取,該等細胞進而表現由核酸(例如mRNA)編碼之蛋白質。如本文所用之術語「抗原」係指諸如蛋白質(例如醣蛋白)、多肽、肽或諸如此類等物質,其引發免疫反應,例如當存在於個體中時(例如,當存在於人類或哺乳動物個體中時)引發免疫反應。本發明至少部分基於以下理解:當mRNA編碼之抗原自投與至細胞或個體之mRNA表現時,可引起免疫系統產生針對表現之抗原之免疫反應,例如可觸發針對表現抗原之抗體(例如結合及/或中和抗體)之產生,可觸發對表現之抗原特異性之B細胞及或T細胞反應,且最終可引起針對隨後遇到抗原或與抗原相關之病原體之保護性或預防性反應。較佳mRNA編碼之抗原係「病毒抗原」。如本文所用之術語「病毒抗原」係指源自病毒、例如源自致病病毒之抗原。如本文所用之術語抗原可指全長蛋白,例如全長病毒蛋白,或可指蛋白之片段(例如多肽或肽片段)、次單元或結構域,例如病毒蛋白次單元或結構域。Compositions (eg vaccine compositions) of the invention are characterized by nucleic acids designed to encode relevant antigens (eg, antigens derived from betacoronavirus structural proteins, particularly antigens derived from the SARS-CoV-2 spike protein) , especially mRNA. The compositions (eg vaccine compositions) of the invention do not comprise the antigen itself, but rather comprise nucleic acid, particularly mRNA, that encodes the antigen or antigenic sequence once delivered to a cell, tissue or individual. Delivery of nucleic acid molecules, particularly mRNA, is accomplished by formulating the nucleic acid molecules in a suitable carrier or delivery vehicle (eg, lipid nanoparticles) such that upon administration to a cell, tissue or individual, Nucleic acids are taken up by cells, which in turn express proteins encoded by nucleic acids (eg, mRNA). The term "antigen" as used herein refers to substances such as proteins (eg, glycoproteins), polypeptides, peptides, or the like, which elicit an immune response, eg, when present in an individual (eg, when present in a human or mammalian individual). time) triggers an immune response. The invention is based, at least in part, on the understanding that, when an mRNA-encoded antigen is self-administered to the expression of the mRNA in a cell or individual, the immune system can be induced to mount an immune response against the expressed antigen, for example, antibodies to the expressed antigen can be triggered (e.g., binding and The production of neutralizing antibodies) can trigger a B-cell and or T-cell response specific for the presented antigen and, ultimately, can elicit a protective or preventive response against subsequently encountered antigen or antigen-related pathogens. A preferred mRNA-encoded antigen is a "viral antigen". The term "viral antigen" as used herein refers to an antigen derived from a virus, eg, from a pathogenic virus. The term antigen as used herein may refer to a full-length protein, eg, a full-length viral protein, or may refer to a fragment (eg, a polypeptide or peptide fragment), subunit or domain of a protein, eg, a viral protein subunit or domain.

許多蛋白質具有四級或三維結構,其由一個以上多肽或若干締合成寡聚分子之多肽鏈組成。如本文所用之術語「次單元」係指單一蛋白質分子,例如,由處理新生蛋白質分子產生之多肽或多肽鏈,該次單元與其他蛋白質分子(例如次單元或鏈)組裝(或「共組裝」)形成蛋白質複合體。蛋白質可具有相對小數目之次單元,且因此闡述為「寡聚的」,或可由大數目之次單元組成且因此闡述為「多聚的」。寡聚或多聚蛋白質之次單元可為相同的、同源的或完全不相似的,且專用於不同之任務。Many proteins have a quaternary or three-dimensional structure consisting of more than one polypeptide or several polypeptide chains associated into oligomeric molecules. The term "subunit" as used herein refers to a single protein molecule, eg, a polypeptide or polypeptide chain resulting from processing a nascent protein molecule, that assembles (or "co-assembles" with other protein molecules (eg, subunits or chains) ) to form protein complexes. A protein may have a relatively small number of subunits and thus be described as "oligomeric", or may consist of a large number of subunits and thus be described as "polymeric". Subunits of oligomeric or multimeric proteins can be identical, homologous or completely dissimilar, and dedicated to different tasks.

蛋白質或蛋白質次單元可進一步包含結構域。如本文所用之術語「結構域」係指蛋白質內不同之功能及/或結構單元。通常,「結構域」負責特定功能或相互作用,從而有助於蛋白質之總體作用。結構域可存在於各種生物環境中。相似結構域(亦即,共享結構、功能及/或序列同源性之結構域)可存在於單一蛋白質內或可存在於具有相似或不同功能之不同蛋白質內。蛋白質結構域通常係給定蛋白質三級結構或序列之保守部分,其可獨立於蛋白質或其次單元之其餘部分起作用及存在。The protein or protein subunit may further comprise domains. The term "domain" as used herein refers to distinct functional and/or structural units within a protein. Typically, "domains" are responsible for specific functions or interactions that contribute to the overall functioning of the protein. Domains can exist in a variety of biological contexts. Similar domains (ie, domains that share structural, functional and/or sequence homology) may exist within a single protein or may exist within different proteins with similar or different functions. A protein domain is typically a conserved portion of a given protein's tertiary structure or sequence that can function and exist independently of the rest of the protein or its subunits.

在結構及分子生物學中,相同、同源或相似之次單元或結構域可幫助對新近鑑別或新穎蛋白質進行分類,正如在SARS-CoV-2病毒基因體序列公佈後立即進行之那樣。In structural and molecular biology, identical, homologous, or similar subunits or domains can aid in the classification of newly identified or novel proteins, as was done immediately after the publication of the SARS-CoV-2 viral genome sequence.

如本文所用之術語抗原不同於術語「抗原決定基」,該抗原決定基係抗原之亞結構,例如多肽或碳水化合物結構,其可由抗原結合位點識別,但不足以誘導免疫反應。業內闡述以分離形式遞送至個體或免疫細胞之蛋白抗原,例如分離之蛋白質、多肽或肽抗原,然而,蛋白抗原之設計、測試、驗證及產生可為昂貴且耗時的,尤其當大規模產生蛋白時。相反,mRNA技術可用於快速設計及測試編碼各種抗原之mRNA構築體。此外,結合調配於適當遞送媒劑(例如,脂質奈米顆粒)中之mRNA之快速產生可快速進行且可快速大規模產生mRNA疫苗。潜在益處亦來自於以下事實:由本發明之mRNA編碼之抗原由個體之細胞表現,例如由人體表現,且因此,個體(例如人體)用作「工廠」以產生抗原,其進而引發期望免疫反應。The term antigen as used herein differs from the term "epitope," which is a substructure of an antigen, such as a polypeptide or carbohydrate structure, that is recognized by the antigen binding site, but is not sufficient to induce an immune response. Protein antigens, such as isolated protein, polypeptide or peptide antigens, are described in the art for delivery to an individual or immune cell in isolated form, however, the design, testing, validation and production of protein antigens can be expensive and time consuming, especially when produced on a large scale protein time. In contrast, mRNA technology can be used to rapidly design and test mRNA constructs encoding various antigens. In addition, rapid production of mRNA in combination with formulated in an appropriate delivery vehicle (eg, lipid nanoparticles) can be performed rapidly and mRNA vaccines can be produced rapidly on a large scale. Potential benefits also arise from the fact that the antigens encoded by the mRNAs of the invention are expressed by cells of the individual, eg, by the human body, and thus, the individual (eg, the human body) acts as a "factory" to generate antigens, which in turn elicit the desired immune response.

在較佳態樣中,抗原係能够誘導免疫反應(例如,引起免疫系統產生針對抗原之抗體)之蛋白質。在本文中,除非另有說明,否則術語「抗原」之使用包括免疫原性蛋白以及衍生自免疫原性蛋白之多肽或肽,例如免疫原性片段(誘導(或能够誘導)對抗原之免疫反應之免疫原性片段)。應理解,術語「蛋白質」涵蓋多肽及肽,且術語「抗原」涵蓋抗原性片段。其他分子可為抗原性的,諸如細菌多醣或蛋白及多醣結構之組合,但對於本文包括之病毒疫苗,源自β冠狀病毒SARS-CoV-2之病毒蛋白、病毒蛋白片段及設計及/或突變之蛋白係本文特徵性之抗原。 核酸 /mRNA In preferred aspects, the antigen is a protein capable of inducing an immune response (eg, causing the immune system to produce antibodies against the antigen). As used herein, unless otherwise stated, the use of the term "antigen" includes immunogenic proteins as well as polypeptides or peptides derived from immunogenic proteins, such as immunogenic fragments that induce (or are capable of inducing) an immune response to the antigen the immunogenic fragment). It is to be understood that the term "protein" encompasses polypeptides and peptides, and the term "antigen" encompasses antigenic fragments. Other molecules may be antigenic, such as bacterial polysaccharides or combinations of proteins and polysaccharide structures, but for viral vaccines included herein, viral proteins, viral protein fragments and designs and/or mutations derived from betacoronavirus SARS-CoV-2 The protein is the antigen characterized herein. Nucleic acid /mRNA

本文所述之疫苗技術之特徵在於設計用於編碼相關抗原(例如β冠狀病毒刺突蛋白抗原、其次單元、結構域或片段(例如抗原性片段))之核酸,特別是信使RNA (mRNA)。本發明之核酸(例如mRNA)較佳調配於適當載劑或遞送媒劑(例如,脂質奈米顆粒)中,使得核酸(例如mRNA)適於活體內使用。核酸(例如mRNA)當經適當調配時能够遞送至個體(例如人類個體)內之細胞及/或組織,以完成由該等核酸編碼之蛋白質之轉譯。The vaccine technologies described herein feature nucleic acids, particularly messenger RNA (mRNA), designed to encode relevant antigens (eg, betacoronavirus spike protein antigens, subunits, domains or fragments (eg, antigenic fragments)). The nucleic acid (eg, mRNA) of the present invention is preferably formulated in a suitable carrier or delivery vehicle (eg, lipid nanoparticle), making the nucleic acid (eg, mRNA) suitable for in vivo use. Nucleic acids (eg, mRNAs), when appropriately formulated, can be delivered to cells and/or tissues within an individual (eg, a human individual) to accomplish translation of the proteins encoded by the nucleic acids.

核酸分子係由携帶遺傳資訊之連接核苷酸構成之大分子,且藉由指導蛋白質合成過程,指導大部分(若並非全部)細胞功能。核酸包含核苷酸(核苷酸單體)之聚合物。因此,核酸亦稱為聚核苷酸(亦稱為聚核苷酸鏈)。核酸之兩個主要類別係去氧核糖核酸(DNA)及核糖核酸(RNA)。DNA構成所有自由生活之生物體及大部分病毒中之遺傳物質。RNA係某些病毒之遺傳物質,但亦在所有活細胞中發現,在該等活細胞中,RNA在細胞過程中、最顯著的是在蛋白質之製備中起重要作用。Nucleic acid molecules are macromolecules composed of linked nucleotides that carry genetic information and direct most, if not all, cellular functions by directing the process of protein synthesis. Nucleic acids comprise polymers of nucleotides (nucleotide monomers). Accordingly, nucleic acids are also known as polynucleotides (also known as polynucleotide chains). The two main classes of nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). DNA constitutes the genetic material in all free-living organisms and in most viruses. RNA is the genetic material of some viruses, but is also found in all living cells where it plays an important role in cellular processes, most notably in the production of proteins.

核苷係核酸(諸如DNA及RNA)之結構次單元。核苷由共價附接至五碳碳水化合物核糖或「糖」(其為核糖或去氧核糖)之含氮鹼基(核鹼基)、通常為嘧啶(胞嘧啶、胸腺嘧啶或尿嘧啶)或嘌呤(腺嘌呤或鳥嘌呤)構成。核苷酸由含氮鹼基、糖(核糖或去氧核糖)及一至三個磷酸基團組成。本質上,核苷酸僅僅係具有一或多個額外磷酸基團之核苷。Nucleosides are structural subunits of nucleic acids such as DNA and RNA. Nucleosides consist of a nitrogenous base (nucleobase), usually a pyrimidine (cytosine, thymine, or uracil), covalently attached to the five-carbon carbohydrate ribose or "sugar" (which is ribose or deoxyribose) or purine (adenine or guanine). Nucleotides consist of a nitrogenous base, a sugar (ribose or deoxyribose), and one to three phosphate groups. Essentially, nucleotides are simply nucleosides with one or more additional phosphate groups.

核酸分子(DNA及RNA)由在一個核苷酸之糖鹼基與相鄰核苷酸之磷酸基團之間藉由化學鍵(稱為酯鍵)在鏈中彼此連接的核苷酸構成。糖在每個核苷酸之3'端,且磷酸在每個核苷酸之5'端。附接至一個核苷酸上之糖之5'碳之磷酸基團與下一個核苷酸之3'碳上之游離羥基形成酯鍵。該等鍵稱為磷酸二酯鍵,且當合成分子時,糖-磷酸主鏈闡述為在5'至3'方向上延伸或生長。Nucleic acid molecules (DNA and RNA) are composed of nucleotides linked to each other in a chain by chemical bonds (called ester bonds) between the sugar bases of one nucleotide and the phosphate groups of adjacent nucleotides. The sugar is at the 3' end of each nucleotide and the phosphate is at the 5' end of each nucleotide. The phosphate group attached to the 5' carbon of the sugar on one nucleotide forms an ester bond with the free hydroxyl group on the 3' carbon of the next nucleotide. These bonds are called phosphodiester bonds, and when synthesizing the molecule, the sugar-phosphate backbone is illustrated as extending or growing in the 5' to 3' direction.

核酸之核鹼基部分之特徵在於嘌呤鹼基:腺嘌呤(A)及鳥嘌呤(G);及嘧啶鹼基:胞嘧啶(C)、DNA中之胸腺嘧啶(T)及RNA中之尿嘧啶(U)。核酸之糖部分之特徵在於DNA中之去氧核糖、RNA中之核糖。五個核苷通常分別縮寫為其單字母代碼A、G、C、T及U。然而,胸苷更通常地寫為「dT」 (「d」代表「去氧」),此乃因其含有2'-去氧呋喃核糖部分而非尿苷中發現之呋喃核糖環。此係由於胸苷在去氧核糖核酸(DNA)而非核糖核酸(RNA)中發現。相反,尿苷在RNA中而非DNA中發現。其餘三個核苷可在RNA及DNA二者中發現。在RNA中,其將表示為A、C及G,而在DNA中,其將表示為dA、dC及dG。The nucleobase portion of nucleic acids is characterized by the purine bases: adenine (A) and guanine (G); and the pyrimidine bases: cytosine (C), thymine (T) in DNA, and uracil in RNA (U). The sugar moiety of nucleic acids is characterized by deoxyribose in DNA and ribose in RNA. The five nucleosides are commonly abbreviated by their one-letter codes A, G, C, T, and U, respectively. However, thymidine is more commonly written as "dT" ("d" stands for "deoxy") because it contains a 2'-deoxyribofuranose moiety rather than the ribofuranose ring found in uridine. This is due to the fact that thymidine is found in deoxyribonucleic acid (DNA) rather than ribonucleic acid (RNA). In contrast, uridine is found in RNA rather than DNA. The remaining three nucleosides can be found in both RNA and DNA. In RNA it will be denoted A, C and G, and in DNA it will be denoted dA, dC and dG.

熟習此項技術者理解,除非另有說明,否則本申請案中所闡釋之核酸序列可在代表性DNA序列中列舉「T」,但在序列表示mRNA之情況下,「T」將被取代為「U」。因此,所揭示及由本文特定序列識別號鑑別之任何DNA亦揭示與DNA互補之相應mRNA序列,其中DNA序列之每個「T」經「U」取代。It will be understood by those skilled in the art that unless otherwise stated, the nucleic acid sequences illustrated in this application may recite a "T" in a representative DNA sequence, but where the sequence represents mRNA, the "T" will be replaced by "U". Thus, any DNA disclosed and identified by a specific sequence identification number herein also discloses the corresponding mRNA sequence complementary to the DNA, wherein each "T" of the DNA sequence is substituted with a "U".

核酸可為或可包括(例如)去氧核糖核酸(DNA)、核糖核酸(RNA) (例如mRNA)、蘇糖核酸(TNA)、二醇核酸(GNA)、肽核酸(PNA)、鎖核酸(LNA,包括具有β-D-核糖組態之LNA、具有α-L-核糖組態之α-LNA (LNA之非鏡像异構物)、具有2'-胺基官能化之2'-胺基-LNA及具有2'-胺基官能化之2'-胺基-α-LNA)、乙烯核酸(ENA)、環己烯基核酸(CeNA)及/或其嵌合體及/或組合。Nucleic acids can be or can include, for example, deoxyribonucleic acid (DNA), ribonucleic acid (RNA) (eg, mRNA), threose nucleic acid (TNA), glycol nucleic acid (GNA), peptide nucleic acid (PNA), locked nucleic acid ( LNAs, including LNAs with a β-D-ribose configuration, α-LNAs with an α-L-ribose configuration (a mirror isomer of LNA), 2'-amine functionalized with a 2'-amine group -LNA and 2'-amino-α-LNA with 2'-amino functionalization), ethylene nucleic acid (ENA), cyclohexenyl nucleic acid (CeNA) and/or chimeras and/or combinations thereof.

本發明之特徵在於信使RNA (mRNA),特別是設計用於編碼相關抗原(例如β冠狀病毒刺突蛋白抗原、其次單元、結構域或片段(例如抗原性片段))之mRNA。信使RNA (mRNA) (一種RNA亞型)係對應於基因之遺傳序列之單股RNA分子。在轉錄過程中產生mRNA,其中單股DNA藉由RNA聚合酶解碼,且合成、亦即轉錄mRNA。mRNA在合成蛋白質、亦即轉譯之過程中由核糖體讀取。因此,信使RNA (mRNA)係編碼(至少一種)蛋白質(天然存在的、非天然存在的或經改質之胺基酸聚合物)的RNA,且可經轉譯以在活體外、活體內、原位或離體産生編碼之蛋白質。The invention features messenger RNA (mRNA), particularly mRNA designed to encode a relevant antigen (eg, betacoronavirus spike protein antigen, subunits, domains or fragments (eg, antigenic fragments)). Messenger RNA (mRNA), a subtype of RNA, is a single-stranded RNA molecule that corresponds to the genetic sequence of a gene. mRNA is produced during transcription, in which single-stranded DNA is decoded by RNA polymerase, and mRNA is synthesized, ie, transcribed. mRNA is read by ribosomes during protein synthesis, ie, translation. Thus, messenger RNA (mRNA) is RNA that encodes (at least one) protein (naturally-occurring, non-naturally-occurring or modified amino acid polymer), and can be translated for in vitro, in vivo, original The encoded protein is produced in situ or ex vivo.

本揭示案之組合物包含(至少一種)具有編碼冠狀病毒抗原之開放閱讀框(ORF)之mRNA。在一些實施例中,mRNA進一步包含5ꞌ UTR、3ꞌ UTR、聚(A)尾及/或5ꞌ端帽或端帽類似物。開放閱讀框(ORF)係一段連續之DNA或RNA,以起始密碼子(例如,甲硫胺酸(ATG或AUG))開始且以終止密碼子(例如TAA、TAG或TGA,或UAA、UAG或UGA)結束。ORF通常編碼蛋白質。應理解,本文揭示之序列可進一步包含額外元件,例如5'及3' UTR,但與ORF不同,彼等元件不一定存在於本揭示案之mRNA中。亦應理解,本發明之mRNA (例如本揭示案之β冠狀病毒疫苗中特徵性之mRNA)可包括任何5'非轉譯區(UTR)及/或任何3' UTR。示例性UTR序列提供於序列表中(例如,SEQ ID NO: 2、4、131及132);然而,其他UTR序列可使用或替換本文所述之任何UTR序列。亦可自本文提供之mRNA省略UTR。Compositions of the present disclosure comprise (at least one) mRNA having an open reading frame (ORF) encoding a coronavirus antigen. In some embodiments, the mRNA further comprises a 5ꞌ UTR, a 3ꞌ UTR, a poly(A) tail, and/or a 5ꞌ end cap or end cap analog. An open reading frame (ORF) is a continuous stretch of DNA or RNA that begins with a start codon (eg, methionine (ATG or AUG)) and ends with a stop codon (eg, TAA, TAG, or TGA, or UAA, UAG) or UGA) ends. ORFs usually encode proteins. It is understood that the sequences disclosed herein may further comprise additional elements, such as 5' and 3' UTRs, but unlike ORFs, these elements are not necessarily present in the mRNAs of the present disclosure. It should also be understood that the mRNAs of the present invention (eg, mRNAs characterized in the betacoronavirus vaccines of the present disclosure) may include any 5' untranslated region (UTR) and/or any 3' UTR. Exemplary UTR sequences are provided in the Sequence Listing (eg, SEQ ID NOs: 2, 4, 131, and 132); however, other UTR sequences may use or replace any of the UTR sequences described herein. UTRs can also be omitted from the mRNAs provided herein.

在一些實施例中,組合物包含mRNA,該mRNA包含與SEQ ID NO: 45、75或90中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%一致性的核苷酸序列。在一些實施例中,組合物包含mRNA,該mRNA包含與表1至15中之序列中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%一致性的核苷酸序列。In some embodiments, the composition comprises mRNA comprising at least 70%, at least 75%, at least 80%, at least 85% of the nucleotide sequence of any one of SEQ ID NOs: 45, 75 or 90 , at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical nucleotide sequences. In some embodiments, the composition comprises mRNA comprising at least 70%, at least 75%, at least 80%, at least 85%, at least 70%, at least 75%, at least 80%, at least 85%, at least a nucleotide sequence of any of the sequences in Tables 1-15 Nucleotide sequences that are 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% identical.

在一些實施例中,組合物包含mRNA,該mRNA包含與SEQ ID NO: 46、76或91中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%一致性的ORF。在一些實施例中,組合物包含mRNA,該mRNA包含與表1至15中之序列中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%或100%一致性的ORF。In some embodiments, the composition comprises mRNA comprising at least 70%, at least 75%, at least 80%, at least 85% of the nucleotide sequence of any one of SEQ ID NOs: 46, 76, or 91 , at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical ORFs. In some embodiments, the composition comprises mRNA comprising at least 70%, at least 75%, at least 80%, at least 85%, at least 70%, at least 75%, at least 80%, at least 85%, at least a nucleotide sequence of any of the sequences in Tables 1-15 ORFs of 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity.

表1至15中提供本揭示案之組合物之冠狀病毒抗原及編碼冠狀病毒抗原之mRNA之示例性序列。Exemplary sequences of coronavirus antigens and mRNA encoding coronavirus antigens of the compositions of the present disclosure are provided in Tables 1-15.

應理解,本文所述mRNA編碼之任何一種抗原可包含或不包含信號序列。 編碼之冠狀病毒刺突 (S) 蛋白抗原 It will be appreciated that any of the antigens encoded by the mRNAs described herein may or may not contain a signal sequence. Encoded coronavirus spike (S) protein antigen

已知之β-冠狀病毒之套膜刺突(S)蛋白决定病毒宿主向性及進入宿主細胞。冠狀病毒刺突(S)蛋白係疫苗設計之選擇抗原,此乃因其可誘導中和抗體及保護性免疫。S蛋白對於SARS-CoV-2感染至關重要。S蛋白之組織在β-冠狀病毒(諸如SARS-CoV-2、SARS-CoV、MERS-CoV、HKU1-CoV、MHV-CoV及NL63-CoV)中係相似的。The known mantle spike (S) protein of betacoronaviruses determines viral host tropism and entry into host cells. The coronavirus spike (S) protein is the antigen of choice for vaccine design because it induces neutralizing antibodies and protective immunity. The S protein is essential for SARS-CoV-2 infection. The organization of the S protein is similar in betacoronaviruses such as SARS-CoV-2, SARS-CoV, MERS-CoV, HKU1-CoV, MHV-CoV and NL63-CoV.

如本文所用之術語「刺突蛋白」係指形成自病毒(包括β冠狀病毒)之套膜(病毒表面)突出之同三聚體的醣蛋白。三聚化刺突蛋白藉由與宿主細胞表面上之受體結合、之後融合病毒及宿主細胞膜而促進病毒粒子進入宿主細胞。S蛋白係高度醣基化之大的I型跨膜融合蛋白,其由1,160至1,400個胺基酸組成,此取决於病毒之類型。β冠狀病毒刺突蛋白包含約1100至1500個胺基酸,且包含如 1所闡釋之結構(亦即,結構域組成及組織)。SARS-CoV-2刺突(S)蛋白係疫苗設計之選擇抗原,此乃因其可誘導中和抗體及保護性免疫。本發明之mRNA經設計用於產生SARS-CoV-2刺突蛋白(亦即,編碼刺突蛋白,使得當mRNA遞送至細胞或組織(例如個體之細胞或組織)時刺突蛋白表現),以及其抗原性變異體。熟習此項技術者將理解,儘管基本上全長或完整刺突蛋白對於病毒(例如β冠狀病毒)實施其促進病毒進入宿主細胞之預期功能可為必需的,但當主要尋求引發針對刺突蛋白之免疫反應時,可耐受刺突蛋白結構及/或序列之一定量之變化。舉例而言,可耐受例如自編碼之刺突蛋白(例如編碼之刺突蛋白抗原)之N末端或C末端之一個至幾個、可能至多5個或至多10個胺基酸之少量截短,而不改變蛋白質之抗原性質。同樣,可耐受編碼之刺突蛋白(例如編碼之刺突蛋白抗原)之一個至幾個、可能至多5個或至多10個胺基酸(或更多)之變化(例如保守取代),而不改變蛋白質之抗原性質。在示例性實施例中,刺突蛋白(例如編碼之刺突蛋白抗原)具有表1-15之序列中之任一者中所闡釋之胺基酸序列(例如,源自如SEQ ID NO: 125所闡釋之胺基酸序列)。在其他實施例中,與具有表1-15之序列中之任一者中所闡釋之胺基酸序列(例如,源自如SEQ ID NO: 125所闡釋之胺基酸序列)之刺突蛋白相比(當與其比對時),刺突蛋白(例如編碼之刺突蛋白抗原)具有不大於100、不大於90、不大於80、不大於70、不大於60、不大於50、不大於40、不大於30、不大於20、不大於10或不大於5個胺基酸取代及/或缺失。在編碼之刺突蛋白序列中進行微小變化之情況下,變異體較佳具有與參考刺突蛋白序列相同之活性及/或具有與參考刺突蛋白相同之免疫特異性,如例如在免疫分析(例如酶聯免疫吸附分析(ELISA分析))中所測定。 The term "spike protein" as used herein refers to a glycoprotein that forms a homotrimer protruding from the envelope (viral surface) of viruses, including betacoronaviruses. Trimeric spike proteins facilitate viral entry into host cells by binding to receptors on the host cell surface, followed by fusion of the viral and host cell membranes. The S protein is a large, highly glycosylated type I transmembrane fusion protein consisting of 1,160 to 1,400 amino acids, depending on the type of virus. Betacoronavirus spike proteins comprise approximately 1100 to 1500 amino acids and comprise the structure (i.e., domain composition and organization) as illustrated in Figure 1 . The SARS-CoV-2 spike (S) protein is the antigen of choice for vaccine design because of its ability to induce neutralizing antibodies and protective immunity. The mRNA of the invention is designed to produce the SARS-CoV-2 spike protein (ie, encodes the spike protein such that the spike protein behaves when the mRNA is delivered to a cell or tissue (eg, a cell or tissue of an individual)), and its antigenic variants. Those skilled in the art will appreciate that although substantially full-length or intact spike proteins may be necessary for a virus (eg, betacoronavirus) to perform its intended function of facilitating viral entry into host cells, when primary elicitation of targeting of the spike protein is sought. A quantitative change in the structure and/or sequence of the spike protein is tolerated during an immune response. For example, minor truncations such as one to several, possibly up to 5 or up to 10 amino acids from the N-terminus or C-terminus of the encoded spike protein (eg, the encoded spike protein antigen) can be tolerated , without changing the antigenic properties of the protein. Likewise, one to several, possibly up to 5 or up to 10 amino acid (or more) changes (eg, conservative substitutions) in the encoded spike protein (eg, the encoded spike protein antigen) are tolerated, while Does not alter the antigenic properties of the protein. In exemplary embodiments, the spike protein (eg, the encoded spike protein antigen) has the amino acid sequence (eg, derived from, eg, SEQ ID NO: 125) as set forth in any of the sequences of Tables 1-15 the amino acid sequence explained). In other embodiments, with a spike protein having the amino acid sequence set forth in any of the sequences of Tables 1-15 (eg, derived from the amino acid sequence set forth as SEQ ID NO: 125) Compared (when aligned to) the spike protein (eg, the encoded spike protein antigen) has no greater than 100, no greater than 90, no greater than 80, no greater than 70, no greater than 60, no greater than 50, no greater than 40 , no more than 30, no more than 20, no more than 10, or no more than 5 amino acid substitutions and/or deletions. Where minor changes are made in the encoded spike protein sequence, the variant preferably has the same activity and/or the same immunospecificity as the reference spike protein sequence, as for example in an immunoassay ( For example, as determined in an enzyme-linked immunosorbent assay (ELISA assay).

冠狀病毒之S蛋白可分為兩個重要之功能次單元,其中包括N末端S1次單元,其形成S蛋白之球狀頭部;及C末端S2區,其形成蛋白之莖且直接嵌入病毒套膜中。在與潜在宿主細胞相互作用時,S1次單元將識別且結合至宿主細胞上之受體,尤其是血管收縮肽轉化酶2 (ACE2)受體,而作為S蛋白之最保守組分之S2次單元將負責將病毒之套膜與宿主細胞膜融合。(參見例如Shang等人,PLoS Pathog. 2020年3月;16(3):e1008392。)。三聚化S蛋白三聚體之每個單體含有兩個次單元S1及S2,分別介導附著及膜融合。參見例如 1。作為活體內感染過程之一部分,兩個次單元藉由酶裂解過程彼此分離。S蛋白首先藉由弗林蛋白酶(furin)介導之裂解在感染細胞中在S1/S2位點進行裂解,在活體內,隨後在S1內之S2'位點發生絲胺酸蛋白酶介導之裂解事件。在SARS-CoV2中,S1/S2裂解位點在胺基酸676 - TQTNSPRRAR/SVA - 688 (參考SEQ ID NO: 127)。S2'裂解位點在胺基酸811 - KPSKR/SFI - 818 (參考SEQ ID NO: 126)。 The S protein of coronavirus can be divided into two important functional subunits, including the N-terminal S1 subunit, which forms the globular head of the S protein; and the C-terminal S2 region, which forms the stem of the protein and directly embeds the viral jacket in the membrane. When interacting with a potential host cell, the S1 subunit will recognize and bind to receptors on the host cell, especially the vasoconstrictor peptide-converting enzyme 2 (ACE2) receptor, while the S2 subunit, the most conserved component of the S protein The unit will be responsible for fusing the viral envelope with the host cell membrane. (See eg, Shang et al., PLoS Pathog. 2020 Mar;16(3):e1008392.). Each monomer of the trimerized S protein trimer contains two subunits, S1 and S2, which mediate attachment and membrane fusion, respectively. See e.g. Figure 1 . As part of the in vivo infection process, the two subunits are separated from each other by an enzymatic cleavage process. The S protein is first cleaved at the S1/S2 site in infected cells by furin-mediated cleavage, followed by serine protease-mediated cleavage at the S2' site within S1 in vivo event. In SARS-CoV2, the S1/S2 cleavage site is at amino acid 676 - TQTNSPRRAR/SVA - 688 (refer to SEQ ID NO: 127). The S2' cleavage site is at amino acid 811 - KPSKR/SFI - 818 (refer to SEQ ID NO: 126).

如本文所用,例如在設計由本發明之核酸(例如mRNA)編碼之SARS-CoV-2 S蛋白抗原之情况下,術語「S1次單元」(例如,S1次單元抗原)係指刺突蛋白之N末端次單元,以S蛋白N末端開始且以S1/S2裂解位點結束,而術語「S2次單元」 (例如,S2次單元抗原)係指刺突蛋白之C末端次單元,以S1/S2裂解位點開始且以刺突蛋白之C末端結束。如上文所述,熟習此項技術者將理解,儘管基本上全長或完整刺突蛋白S1或S2次單元可分別為受體結合或膜融合所必需的,但當主要尋求引發針對刺突蛋白次單元之免疫反應時,可耐受S1或S2結構及/或序列之一定量之變化。舉例而言,可耐受例如自編碼之次單元(例如編碼之S1或S2蛋白抗原)之N或C末端一個至幾個、可能至多4、5、6、7、8、9或10個胺基酸之少量截短,而不改變蛋白質之抗原性質。同樣,可耐受編碼之刺突蛋白次單元(例如編碼之S1或S2蛋白抗原)之一個至幾個、可能至多4、5、6、7、8、9或10個胺基酸(或更多)之變化(例如保守取代),而不改變蛋白質之抗原性質。在示例性實施例中,刺突蛋白(例如編碼之刺突蛋白抗原)具有表1-15之序列中之任一者中所闡釋之胺基酸序列(例如,源自如SEQ ID NO: 125所闡釋之胺基酸序列)。在其他實施例中,與包含具有如SEQ ID NO: 125所闡釋之胺基酸序列之刺突蛋白之胺基酸1-685或由胺基酸1-685組成之刺突蛋白S1次單元或包含胺基酸686-1273或由胺基酸686-1273組成的刺突蛋白S2次單元相比(當與其比對時),刺突蛋白次單元(例如編碼之S1或S2蛋白抗原)具有不大於50、不大於40、不大於30、不大於20、不大於10或不大於5個胺基酸取代及/或缺失。當在編碼之刺突蛋白次單元序列中進行微小變化時,變異體較佳具有與參考刺突蛋白次單元序列相同之活性及/或具有與參考刺突蛋白次單元相同之免疫特異性,如例如在免疫分析(例如酶聯免疫吸附分析(ELISA分析))中所測定。As used herein, eg, in the context of designing a SARS-CoV-2 S protein antigen encoded by a nucleic acid (eg, mRNA) of the invention, the term "S1 subunit" (eg, S1 subunit antigen) refers to the N of the spike protein The terminal subunit, starting at the N-terminus of the S protein and ending with the S1/S2 cleavage site, and the term "S2 subunit" (e.g., the S2 subunit antigen) refers to the C-terminal subunit of the spike protein, beginning with S1/S2 The cleavage site begins and ends with the C-terminus of the spike protein. As noted above, those skilled in the art will appreciate that while substantially full-length or complete Spike protein S1 or S2 subunits may be required for receptor binding or membrane fusion, respectively, when the primary goal is to elicit targeting of the Spike protein subunits Quantitative changes in S1 or S2 structure and/or sequence are tolerated during the immune response of the unit. For example, one to several, possibly up to 4, 5, 6, 7, 8, 9 or 10 amines can be tolerated, eg, from the N or C terminus of the encoded subunit (eg, the encoded S1 or S2 protein antigen) Minor truncations of amino acids without altering the antigenic properties of the protein. Likewise, one to several, possibly up to 4, 5, 6, 7, 8, 9, or 10 amino acids (or more) of the encoded spike protein subunit (eg, the encoded S1 or S2 protein antigen) may be tolerated. multiple) changes (such as conservative substitutions) without changing the antigenic properties of the protein. In exemplary embodiments, the spike protein (eg, the encoded spike protein antigen) has the amino acid sequence (eg, derived from, eg, SEQ ID NO: 125) as set forth in any of the sequences of Tables 1-15 the amino acid sequence explained). In other embodiments, with a spike protein S1 subunit comprising or consisting of amino acids 1-685 of a spike protein having the amino acid sequence set forth in SEQ ID NO: 125 or Spike protein subunits (eg, the encoded S1 or S2 protein antigens) have different properties compared to (when aligned with) the spike protein S2 subunits comprising or consisting of amino acids 686-1273. More than 50, no more than 40, no more than 30, no more than 20, no more than 10, or no more than 5 amino acid substitutions and/or deletions. When minor changes are made in the encoded Spike subunit sequence, the variant preferably has the same activity and/or the same immunospecificity as the reference Spike subunit sequence, such as For example, as determined in an immunoassay, such as an enzyme-linked immunosorbent assay (ELISA assay).

SARS-CoV-2刺突蛋白之S1及S2次單元進一步包括藉由結構及功能容易辨別之結構域,其特徵又在於設計由核酸疫苗(特別是本發明之mRNA疫苗)編碼之抗原。在S1次單元內,結構域包括N末端結構域(NTD)及受體結合結構域(RBD),該RBD結構域進一步包括受體結合基元(RBM)。野生型S1次單元亦包括在NTD結構域之N末端之信號肽(SD)以及第一子結構域(SD1)及第二子結構域(SD2)。在S2次單元內,結構域包括融合肽(FP)、七肽重複區1 (HR1)、七肽重複區2 (HR2)、跨膜結構域(TM)及細胞質結構域,亦稱為胞質尾區(CT) (Lu R.等人,同上;Wan等人, J. Virol.2020年3月, 94 (7) e00127-20)。HR1及HR2結構域可稱為SARS-CoV-2之「融合核心區」 (Xia等人,2020 Cell Mol Immunol.1月; 17(1):1-12。)。 1繪示SARS-CoV-2刺突蛋白中之結構域架構。S1次單元包括N末端結構域(NTD)、連接體區、受體結合結構域(RBD)、第一子結構域(SD1)及第二子結構域(SD2)。S1次單元可經修飾以添加C末端跨膜結構域(TM),或者其可為可溶的。S2次單元尤其包括第一七肽重複區(HR1)、第二七肽重複區(HR2)、跨膜結構域(TM)及胞質尾區。可溶性S2次單元可在沒有TM結構域之情况下產生。 The S1 and S2 subunits of the SARS-CoV-2 spike protein further include domains that are easily discernible by structure and function, and are further characterized by designing antigens encoded by nucleic acid vaccines, especially mRNA vaccines of the present invention. Within the S1 subunit, the domains include an N-terminal domain (NTD) and a receptor binding domain (RBD), which further includes a receptor binding motif (RBM). The wild-type S1 subunit also includes a signal peptide (SD) at the N-terminus of the NTD domain and a first subdomain (SD1) and a second subdomain (SD2). Within the S2 subunit, the domains include fusion peptide (FP), heptapeptide repeat 1 (HR1), heptapeptide repeat 2 (HR2), transmembrane domain (TM), and the cytoplasmic domain, also known as the cytoplasmic domain Tail region (CT) (Lu R. et al, supra; Wan et al, J. Virol. Mar 2020, 94(7) e00127-20). The HR1 and HR2 domains may be referred to as the "fusion core region" of SARS-CoV-2 (Xia et al., 2020 Cell Mol Immunol. Jan; 17(1): 1-12.). Figure 1 depicts the domain architecture in the SARS-CoV-2 spike protein. The S1 subunit includes an N-terminal domain (NTD), a linker region, a receptor binding domain (RBD), a first subdomain (SD1), and a second subdomain (SD2). The S1 subunit can be modified to add a C-terminal transmembrane domain (TM), or it can be soluble. The S2 subunit includes, inter alia, the first heptapeptide repeat region (HR1), the second heptapeptide repeat region (HR2), the transmembrane domain (TM), and the cytoplasmic tail region. Soluble S2 subunits can be produced without the TM domain.

S1之NTD及RBD係用於本發明之疫苗設計方法之良好抗原,此乃因該等結構域已顯示係β冠狀病毒感染個體中之中和抗體之靶標。如本文所用,例如,在抗原設計(該抗原由本發明之mRNA編碼且例如由本發明之mRNA疫苗表現)之背景下,術語「N末端結構域」或「NTD」係指SARS-CoV-2 S1次單元內之結構域,其包含約290個胺基酸長度,與具有如SEQ ID NO: 125所闡釋之胺基酸序列之刺突蛋白之S1次單元的胺基酸1-290具有一致性。如本文所用,例如,在抗原設計(該抗原由本發明之mRNA編碼且例如由本發明之mRNA疫苗表現)之背景下,術語「受體結合結構域」或「RBD」係指SARS-CoV-2之S1次單元內之結構域,其包含約175-225個胺基酸長度,與具有如SEQ ID NO: 125所闡釋之胺基酸序列之刺突蛋白之S1次單元的胺基酸316-517具有一致性。如本文所用之術語「受體結合基元」係指RBD中直接接觸ACE2受體之部分。預測表現之RBD特異性結合至血管收縮肽轉化酶2 (ACE2)作為其受體及/或與RBD結合及/或中和抗體(例如CR3022)特異性反應。The NTD and RBD of S1 are good antigens for use in the vaccine design methods of the present invention, as these domains have been shown to be targets for neutralizing antibodies in betacoronavirus-infected individuals. As used herein, for example, in the context of antigen design (the antigen is encoded by the mRNA of the invention and expressed, for example, by the mRNA vaccine of the invention), the term "N-terminal domain" or "NTD" refers to SARS-CoV-2 S1 time The domain within the unit, comprising about 290 amino acids in length, is identical to amino acids 1-290 of the S1 subunit of the spike protein having the amino acid sequence set forth in SEQ ID NO: 125. As used herein, eg, in the context of antigen design (the antigen is encoded by the mRNA of the invention and expressed, eg, by the mRNA vaccine of the invention), the term "receptor binding domain" or "RBD" refers to the A domain within the S1 subunit comprising about 175-225 amino acids in length with amino acids 316-517 of the S1 subunit of the spike protein having the amino acid sequence set forth in SEQ ID NO: 125 Consistent. The term "receptor binding motif" as used herein refers to the portion of the RBD that directly contacts the ACE2 receptor. The RBDs predicted to appear specifically bind to vasoconstrictor peptide converting enzyme 2 (ACE2) as its receptor and/or react specifically with RBD binding and/or neutralizing antibodies (eg, CR3022).

本文提供之組合物包括mRNA,其可編碼任一或多個全長或部分(截短或其他序列缺失) S蛋白次單元(例如,S1或S2次單元)、S蛋白次單元之一或多個結構域或結構域之組合(例如,NTD、RBD或NTD-RBD融合體,有或無SD1及/或SD2)或全長或部分S1及S2蛋白次單元之嵌合體。本文考慮其他S蛋白次單元及/或結構域組態。Compositions provided herein include mRNAs that can encode any or more full-length or partial (truncated or other sequence deletions) S protein subunits (eg, S1 or S2 subunits), one or more of S protein subunits Domains or combinations of domains (eg, NTD, RBD or NTD-RBD fusions, with or without SD1 and/or SD2) or chimeras of full or partial S1 and S2 protein subunits. Other S protein subunit and/or domain configurations are contemplated herein.

2 及圖 6繪示源自SARS-CoV-2刺突蛋白之示例性結構域及次單元抗原。圖2A及圖2B分別繪示可溶性及跨膜RBD抗原。跨膜NTD抗原示於圖2C中。圖2D-圖2F及圖2I中所示之結構域抗原表示NTD及RBD之示例性融合蛋白,每種具有SP及TM結構域。構築體中之兩者亦具有末端運輸結構域(CD86及/或CD11b)。結構域經由連接體、特別是GS連接體或PADRE連接體(圖2I)連接。具有在NTD結構域之N末端之RBD結構域的結構域構築體繪示於圖2G及圖2H中。每個構築體亦可包括SP及/或TM結構域。 編碼之次單元抗原 2 and 6 depict exemplary domains and subunit antigens derived from the SARS - CoV-2 spike protein. 2A and 2B depict soluble and transmembrane RBD antigens, respectively. Transmembrane NTD antigens are shown in Figure 2C. The domain antigens shown in Figures 2D-2F and 2I represent exemplary fusion proteins of NTD and RBD, each with SP and TM domains. Both of the constructs also have terminal trafficking domains (CD86 and/or CD11b). The domains are linked via linkers, in particular GS linkers or PADRE linkers (Figure 2I). Domain constructs with the RBD domain at the N-terminus of the NTD domain are depicted in Figures 2G and 2H. Each construct may also include SP and/or TM domains. encoded subunit antigen

本揭示案之一些態樣提供包含編碼SARS-CoV-2 S蛋白之(至少一個)次單元之mRNA的組合物。在一些實施例中,mRNA編碼S1次單元(例如全長或部分)。在其他實施例中,mRNA編碼S2次單元(例如全長或部分)。在其他實施例中,mRNA編碼嵌合S1-S2蛋白,其中一個次單元來自SARS-CoV-2 S蛋白,而另一次單元來自另一生物體,例如病毒,諸如流行性感冒病毒。由本揭示案之mRNA編碼之SARS-CoV-2次單元(S1及/或S2)可為可溶的或膜結合的(例如,與跨膜結構域連接)。基於S2之示例性抗原設計示於圖6中。圖6A繪示全長S2,包括FP、HR1、HR2、TM及CT結構域。由次單元之間之連接體構成之一種形式之S2示於圖6B中。無CT結構域之結構域抗原示於圖6C及圖6D中。 可溶性次單元抗原 Some aspects of the present disclosure provide compositions comprising mRNA encoding (at least one) subunit of the SARS-CoV-2 S protein. In some embodiments, the mRNA encodes the S1 subunit (eg, full-length or partial). In other embodiments, the mRNA encodes the S2 subunit (eg, full-length or partial). In other embodiments, the mRNA encodes a chimeric S1-S2 protein, wherein one subunit is from the SARS-CoV-2 S protein and the other subunit is from another organism, eg, a virus, such as influenza virus. The SARS-CoV-2 subunits (S1 and/or S2) encoded by the mRNAs of the present disclosure may be soluble or membrane bound (eg, linked to a transmembrane domain). An exemplary antigen design based on S2 is shown in FIG. 6 . Figure 6A depicts full-length S2, including FP, HRl, HR2, TM and CT domains. One form of S2 consisting of links between subunits is shown in Figure 6B. Domain antigens without the CT domain are shown in Figures 6C and 6D. soluble subunit antigen

可溶性蛋白存在於細胞之細胞質中或自細胞分泌(例如,不結合膜)。細胞分泌之可溶性抗原可由補體調理,且由淋巴結中之濾泡樹突細胞捕獲,在這裏其可由對表現之蛋白質上存在之抗原決定基特異性之B細胞識別。次單元抗原之表現進一步允許將免疫反應集中於特定次單元,且對與其他相關病毒共有之抗原之其他結構域具有特異性之記憶B及T細胞之刺激最小。不受限於理論,據信SARS-CoV-2 S1次單元(包括NTD、RBD及在一些情况下SARS-CoV-2 S1次單元之插入多肽)以可溶形式之呈遞,產生S1次單元特異性免疫反應。因此,在一些實施例中,本文提供之mRNA編碼可溶性SARS-CoV-2 S1次單元抗原及/或可溶性SARS-CoV-2 S2次單元抗原。可溶性SARS-CoV-2 S1次單元抗原及編碼其之mRNA之非限制性實例提供於下文 1A 1B中。本文提供可溶性SARS-CoV-2次單元抗原之其他實例。 1A. 可溶性次單元抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 SARS-CoV-2可溶性S1次單元 3 5 1B. 可溶性次單元抗原 SARS-CoV-2可溶性S1次單元 SEQ ID NO: 1自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 3及3' UTR SEQ ID NO: 4。 1 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 3 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 5 聚A尾 100 nt    膜結合之次單元抗原 Soluble proteins are present in the cytoplasm of cells or secreted from cells (eg, do not bind membranes). Soluble antigens secreted by cells can be opsonized by complement and captured by follicular dendritic cells in lymph nodes, where they are recognized by B cells specific for epitopes present on the expressed protein. The presentation of subunit antigens further allows to focus the immune response on specific subunits with minimal stimulation of memory B and T cells specific for other domains of antigens shared by other related viruses. Without being bound by theory, it is believed that presentation of the SARS-CoV-2 S1 subunits (including NTDs, RBDs, and in some cases intervening polypeptides of the SARS-CoV-2 S1 subunits) in a soluble form yields S1 subunit-specific Sexual immune response. Thus, in some embodiments, the mRNA provided herein encodes a soluble SARS-CoV-2 S1 subunit antigen and/or a soluble SARS-CoV-2 S2 subunit antigen. Non-limiting examples of soluble SARS-CoV-2 S1 subunit antigens and mRNAs encoding them are provided in Tables 1A and 1B below . Additional examples of soluble SARS-CoV-2 subunit antigens are provided herein. Table 1A. Soluble Subunit Antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 soluble S1 subunit 3 5 Table 1B. Soluble Subunit Antigens SARS-CoV-2 soluble S1 subunit SEQ ID NO: 1 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 3 and 3' UTR SEQ ID NO: 4. 1 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 3 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 5 Poly A tail 100nt membrane bound subunit antigen

膜結合之蛋白錨定在細胞膜中(不可溶的)。不受限於理論,據信抗原呈遞細胞將嵌入之抗原攜帶至引流淋巴結以產生強烈免疫反應。引流淋巴結中出現之生發中心反應包括CD4 +TFH 細胞與B細胞之間之延長接觸,允許共刺激及局部細胞介素信號,諸如IL-4及IL-21,其有利於對所呈遞抗原特異之B細胞之複製及類別轉換至IgG1之產生,其各自可促進長壽命之漿細胞及記憶B細胞之產生。因此,在一些實施例中,mRNA編碼膜結合之SARS-CoV-2 S1次單元抗原及/或膜結合之SARS-CoV-2 S2次單元抗原。在一些實施例中,膜結合抗原(例如,S1次單元、S2次單元、NTD、RBD或其任何組合)與跨膜結構域(例如天然存在之跨膜結構域或異源跨膜結構域(源自異源蛋白質),其負責將蛋白質錨定在細胞膜中)連接。膜結合之SARS-CoV-2 S1次單元抗原及SARS-CoV-2 S2次單元抗原及編碼其之mRNA的非限制性實例提供於下文 2A 2B中。本文考慮其他膜結合之SARS-CoV-2 S1次單元抗原。 2A. 膜結合之次單元抗原 名稱 SEQ ID NO:    mRNA ORF 蛋白質 與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-666-TM) 16 17 與跨膜結構域連接之SARS-CoV-2 S2次單元(S2-TM) 145 146 2B. 膜結合之次單元抗原 與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-666-TM) SEQ ID NO: 15自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 16及3' UTR SEQ ID NO: 4。 15 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 16 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSGGGSILAIYSTVASSLVLLVSLGAISF 17 聚A尾 100 nt    包含跨膜結構域之 SARS-CoV-2 S2 次單元 (S2-TM) SEQ ID NO: 147自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 145及3' UTR SEQ ID NO: 4。 147 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 145 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 146 聚A尾 100 nt    次單元抗原截短及 RBD 缺失 Membrane-bound proteins are anchored in the cell membrane (insoluble). Without being bound by theory, it is believed that antigen presenting cells carry embedded antigen to draining lymph nodes to generate a strong immune response. Germinal center responses occurring in draining lymph nodes include prolonged contacts between CD4 + TFH cells and B cells, allowing for co-stimulation and local intercellular signaling, such as IL-4 and IL-21, which favors antigen specificity for presented antigens. Replication and class switching of B cells to the production of IgGl, each of which promotes the production of long-lived plasma cells and memory B cells. Thus, in some embodiments, the mRNA encodes a membrane-bound SARS-CoV-2 S1 subunit antigen and/or a membrane-bound SARS-CoV-2 S2 subunit antigen. In some embodiments, a membrane-bound antigen (eg, S1 subunit, S2 subunit, NTD, RBD, or any combination thereof) is associated with a transmembrane domain (eg, a naturally occurring transmembrane domain or a heterologous transmembrane domain ( derived from a heterologous protein), which is responsible for anchoring the protein in the cell membrane) ligation. Non-limiting examples of membrane-bound SARS-CoV-2 S1 subunit antigens and SARS-CoV-2 S2 subunit antigens and mRNAs encoding them are provided in Tables 2A and 2B below . Other membrane-bound SARS-CoV-2 S1 subunit antigens are considered here. Table 2A. Membrane-bound subunit antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 S1 subunit linked to transmembrane domain (S1-666-TM) 16 17 SARS-CoV-2 S2 subunit (S2-TM) linked to the transmembrane domain 145 146 Table 2B. Membrane-bound subunit antigens SARS-CoV-2 S1 subunit linked to transmembrane domain (S1-666-TM) SEQ ID NO: 15 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 16 and 3' UTR SEQ ID NO: 4. 15 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 16 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSGGGSILAIYSTVASSLVLLVSLGAISF 17 Poly A tail 100nt SARS-CoV-2 S2 subunit (S2-TM) comprising a transmembrane domain SEQ ID NO: 147 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 145 and 3' UTR SEQ ID NO: 4. 147 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGC UGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 145 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 146 Poly A tail 100nt Subunit antigen truncation and RBD deletion

在一些實施例中,組合物包含編碼已經修飾以去除RBD或RBD之一部分之S1次單元的mRNA。S1次單元之截短為免疫系統提供較少之抗原決定基來識別,從而使免疫反應偏向於其餘抗原決定基,此可選擇針對對於病毒中和重要之特定抗原決定基之抗體。RBD之截短或部分缺失可防止表現之蛋白質或携帶其之細胞與受體ACE2相互作用,使其更可能到達淋巴結且刺激期望免疫反應。此外,去除RBD可防止以前針對相關病毒產生之交叉反應性抗體造成之抗原決定基掩蔽,且因此將引發之免疫反應特異性地集中於期望抗原。另外,去除RBD可改變表現之次單元之構象,允許對該等替代構象抗原決定基特異之B細胞攝取綫性肽且將綫性肽呈遞給T細胞,從而間接增强CD4 +T細胞對仍以天然構象存在之彼等抗原決定基之反應。 In some embodiments, the composition comprises an mRNA encoding an S1 subunit that has been modified to remove an RBD or a portion of an RBD. Truncation of the S1 subunit provides the immune system with fewer epitopes to recognize, thereby biasing the immune response to the remaining epitopes, which select antibodies against specific epitopes important for virus neutralization. Truncation or partial deletion of the RBD prevents the expressed protein or cells carrying it from interacting with the receptor ACE2, making it more likely to reach lymph nodes and stimulate the desired immune response. In addition, removal of RBD prevents epitope masking by cross-reactive antibodies previously raised against related viruses, and thus focuses the elicited immune response specifically to the desired antigen. In addition, removal of RBD alters the conformation of the expressed subunit, allowing B cells specific for these alternative conformational epitopes to uptake and present linear peptides to T cells, thereby indirectly enhancing the ability of CD4 + T cells to remain viable. The reaction of those epitopes in their native conformation.

在一些實施例中,組合物包含編碼S1次單元之mRNA,該S1次單元已經修飾以去除RBD或RBD之一部分,其中S2次單元含有聚醣。聚醣藉由經由天冬醯胺殘基之N-連接醣基化或絲胺酸或蘇胺酸殘基上之O-連接醣基化與蛋白質附接。在蛋白質之一些組分上存在聚醣屏蔽物可掩蔽肽抗原決定基,從而將抗體反應集中於其他暴露之肽抗原決定基。此外,醣基化蛋白亦引發識別包被聚醣之抗體。識別聚醣抗原決定基之B細胞將攝取綫性肽抗原決定基且將其呈遞給CD4 +T細胞,從而加強CD4 +T細胞對遍及蛋白發現之綫性抗原決定基之反應。 In some embodiments, the composition comprises mRNA encoding an S1 subunit that has been modified to remove an RBD or a portion of an RBD, wherein the S2 subunit contains a glycan. Glycans are attached to proteins by N-linked glycosylation via asparagine residues or O-linked glycosylation on serine or threonine residues. The presence of glycan shields on some components of a protein can mask peptide epitopes, thereby focusing antibody responses on other exposed peptide epitopes. In addition, glycosylated proteins also elicit antibodies that recognize coated glycans. B cells recognizing the glycan epitope will take up the linear peptide epitope and present it to CD4 + T cells, thereby enhancing the CD4 + T cell response to the linear epitope found on ubiquitin.

截短之SARS-CoV-2 S1次單元抗原及編碼其之mRNA之非限制性實例提供於下文 3A 3B中。 Non-limiting examples of truncated SARS-CoV-2 S1 subunit antigens and mRNAs encoding them are provided in Tables 3A and 3B below .

具有RBD缺失之SARS-CoV-2 S1次單元及編碼其之mRNA之非限制性實例提供於下文 4A 4B中。 3A. 次單元抗原截短 名稱 SEQ ID NO: mRNA ORF 蛋白質 經截短且與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-531-TM) 19 20 經截短且與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-594-TM) 22 23 經聚G截短且與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-594-聚G-TM) 25 26 經聚G/DS截短且與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-594-聚G-DS-TM) 28 29 經截短且與跨膜結構域連接之SARS-CoV-2 S1次單元(S1-666-TM) 150 151 3B. 次單元抗原截短 經截短且與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-531-TM) SEQ ID NO: 18自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 19及3' UTR SEQ ID NO: 4。 18 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 19 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTsgggsilaiystvasslvllvslgaisf 20 聚A尾 100 nt    經截短且與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-594-TM) SEQ ID NO: 21自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 22及3' UTR SEQ ID NO: 4。 21 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 22 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf 23 聚A尾 100 nt    經聚 G 截短且與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-594- G-TM) SEQ ID NO: 24自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 25及3' UTR SEQ ID NO: 4。 24 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGGCAGCGGCGGCGGCAGCGGCGGAGGCAGCGGAGGAGGCAGCGGCGGAGGCAGUGGAGGCCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 25 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVGSGGGSGGGSGGGSGGGSGGQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf 26 聚A尾 100 nt    經聚 G/DS 截短且與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-594- G-DS-TM) SEQ ID NO: 27自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 28及3' UTR SEQ ID NO: 4。 27 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGGCAGCGGCGGCGGCAGCGGCGGAGGCAGCGGAGGAGGCAGCGGCGGAGGCAGUGGAGGCCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUUGCCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 28 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVGSGGGSGGGSGGGSGGGSGGQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFCQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf 29 聚A尾 100 nt    經截短且與跨膜結構域連接之 SARS-CoV-2 S1 次單元 (S1-666-TM) SEQ ID NO: 149自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 150及3' UTR SEQ ID NO: 4。 149 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 150 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 151 聚A尾 100 nt    4A. 次單元抗原 RBD 缺失 名稱 SEQ ID NO: mRNA ORF 蛋白質 缺失RBD之SARS-CoV-2 S1次單元及添加聚醣之S2次單元 31 32 缺失RBD及NTD之SARS-CoV-2 S1次單元及添加聚醣之S2次單元 34 35 4B. 次單元抗原 RBD 缺失 缺失 RBD SARS-CoV-2 S1 次單元及添加聚醣之 S2 次單元 SEQ ID NO: 30自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 31及3' UTR SEQ ID NO: 4。 30 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCGGCGGCAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCAACGCCACCGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 31 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFGGSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPNATVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 32 聚A尾 100 nt    缺失 RBD NTD SARS-CoV-2 S1 次單元及添加聚醣之 S2 次單元 SEQ ID NO: 33自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 34及3' UTR SEQ ID NO: 4。 33 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCGGCGGCAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGAACUACACCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCAACGCCACCGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 34 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFGGSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLNYTSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPNATVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 35 聚A尾 100 nt    嵌合 S1-S2 次單元抗原 Non-limiting examples of SARS-CoV-2 S1 subunits with RBD deletions and mRNAs encoding them are provided in Tables 4A and 4B below . Table 3A. Subunit antigenic truncations name SEQ ID NO: mRNA ORF protein Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-531-TM) 19 20 Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-594-TM) twenty two twenty three SARS-CoV-2 S1 subunit truncated by poly-G and linked to the transmembrane domain (S1-594-poly-G-TM) 25 26 SARS-CoV-2 S1 subunit truncated by polyG/DS and linked to the transmembrane domain (S1-594-polyG-DS-TM) 28 29 Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-666-TM) 150 151 Table 3B. Subunit antigenic truncations Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-531-TM) SEQ ID NO: 18 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 19 and 3' UTR SEQ ID NO: 4. 18 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 19 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTsgggsilaiystvasslvllvslgaisf 20 Poly A tail 100nt Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-594-TM) SEQ ID NO: 21 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 22 and 3' UTR SEQ ID NO: 4. twenty one Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC twenty two 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf twenty three Poly A tail 100nt SARS-CoV-2 S1 subunit truncated by poly- G and linked to the transmembrane domain (S1-594 -poly -G-TM) SEQ ID NO: 24 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 25 and 3' UTR SEQ ID NO: 4. twenty four Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGGCAGCGGCGGCGGCAGCGGCGGAGGCAGCGGAGGAGGCAGCGGCGGAGGCAGUGGAGGCCAGCCCACCGAGAGCA UCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 25 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVGSGGGSGGGSGGGSGGGSGGQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf 26 Poly A tail 100nt SARS-CoV-2 S1 subunit truncated by polyG /DS and linked to the transmembrane domain (S1-594 - polyG -DS-TM) SEQ ID NO: 27 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 28 and 3' UTR SEQ ID NO: 4. 27 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGGCAGCGGCGGCGGCAGCGGCGGAGGCAGCGGAGGAGGCAGCGGCGGAGGCAGUGGAGGCCAGCCCACCGAGAGCA UCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUUGCCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 28 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVGSGGGSGGGSGGGSGGGSGGQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFCQFGRDIADTTDAVRDPQTLEILDITPCSFGGsgggsilaiystvasslvllvslgaisf 29 Poly A tail 100nt Truncated SARS-CoV-2 S1 subunit linked to the transmembrane domain (S1-666-TM) SEQ ID NO: 149 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 150 and 3' UTR SEQ ID NO: 4. 149 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 150 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 151 Poly A tail 100nt Table 4A. Subunit antigen RBD deletion name SEQ ID NO: mRNA ORF protein SARS-CoV-2 S1 subunit with RBD deletion and S2 subunit with added glycan 31 32 SARS-CoV-2 S1 subunit with deletion of RBD and NTD and S2 subunit with added glycan 34 35 Table 4B. Subunit antigen RBD deletion SARS-CoV-2 S1 subunit with RBD deletion and S2 subunit with added glycan SEQ ID NO: 30 consists of the following from 5' to 3': 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 31 and 3' UTR SEQ ID NO: 4. 30 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCGGCGGCAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCA GCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCU GAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCAACGCCACCGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCAC UACACC 31 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFGGSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPNATVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLH YT 32 Poly A tail 100nt SARS-CoV-2 S1 subunit with deletion of RBD and NTD and S2 subunit with added glycan SEQ ID NO: 33 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 34 and 3' UTR SEQ ID NO: 4. 33 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCGGCGGCAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGAACUACACCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGU ACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCAACGCCACCGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUA CGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 34 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFGGSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLNYTSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPNATVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 35 Poly A tail 100nt Chimeric S1-S2 subunit antigen

在一些實施例中,組合物包含編碼嵌合蛋白(例如具有來自一種病毒之S蛋白之S1次單元及來自另一種不同病毒之S蛋白之S2次單元之嵌合S1-S2蛋白)之mRNA。舉例而言,S2次單元可來自SARS-CoV-2,而S1次單元可來自HKU1。作為另一實例,S2次單元可來自SARS-CoV-2,而S1次單元可來自OC43。該等嵌合蛋白可能由藉由先前暴露而產生之對HKU1或OC43之S1次單元特異之循環抗體調理,促進巨噬細胞及樹突細胞有效地攝取SARS-CoV-2 S2次單元肽且將其交叉呈遞給CD4 +T細胞。藉由循環抗體之調理作用亦促進由濾泡樹突細胞之捕獲,用於呈遞給具有對SARS-CoV-2 S2次單元抗原決定基特異之受體的B細胞。嵌合S1/S2次單元構築體及編碼其之mRNA之非限制性實例提供於下文 5A 5B中。 5A. 嵌合 S1 次單元 -S2 次單元抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與HKU1 S1次單元連接之SARS-CoV-2 S2次單元 37 38 與OC43 S1次單元連接之SARS-CoV-2 S2次單元 40 41 5B. 嵌合 S1 次單元 -S2 次單元抗原 HKU1 S1 次單元連接之 SARS-CoV-2 S2 次單元 SEQ ID NO: 36自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 37及3' UTR SEQ ID NO: 4。 36 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCCUGAUCAUCUUCAUCCUGCCCACCACCCUGGCCGUGAUCGGCGACUUCAACUGCACCAACAGCUUCAUCAACGACUACAACAAGACCAUCCCUCGGAUCAGCGAAGACGUGGUGGACGUGUCCCUGGGCCUGGGCACCUACUACGUGCUGAACCGGGUGUACCUGAACACCACACUGCUGUUCACCGGCUACUUCCCCAAGAGCGGCGCCAACUUCCGGGACCUGGCCCUGAAGGGCAGCAUCUACCUGAGCACCUUGUGGUACAAGCCUCCCUUCCUGAGCGACUUCAAUAACGGCAUCUUCUCUAAGGUGAAGAACACCAAGCUGUACGUAAACAACACCCUGUACAGCGAGUUCAGCACCAUCGUGAUCGGCAGCGUGUUCGUCAACACCAGCUACACCAUCGUGGUGCAGCCCCACAACGGCAUCCUGGAGAUCACCGCCUGCCAGUACACCAUGUGCGAGUACCCUCACACCGUGUGCAAGAGCAAGGGCUCCAUCCGGAACGAGAGCUGGCACAUCGACAGCAGCGAGCCGCUGUGCCUGUUCAAGAAGAACUUCACCUACAACGUGAGCGCCGACUGGCUGUACUUCCACUUCUACCAGGAGCGGGGCGUGUUCUACGCCUACUACGCCGACGUGGGCAUGCCAACCACCUUCCUGUUCAGCCUGUACCUGGGCACCAUCCUGAGCCACUACUACGUGAUGCCCCUGACCUGCAACGCCAUCAGCUCAAACACCGACAACGAGACCCUGGAGUACUGGGUGACUCCACUGAGCCGGCGGCAGUACCUGCUGAACUUCGACGAGCACGGCGUGAUCACCAACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCGGCUUCACCGUGAAGCCCGUAGCCACCGUGUACCGGCGGAUCCCCAACCUGCCCGACUGCGACAUCGACAACUGGCUGAACAACGUCAGCGUGCCCAGCCCACUGAACUGGGAGCGGCGGAUCUUCAGCAACUGCAACUUCAAUCUGAGCACCCUGCUGCGGCUGGUGCACGUGGACAGCUUCAGCUGCAACAACCUGGACAAGAGCAAGAUCUUCGGUAGCUGCUUCAACAGCAUCACCGUGGACAAGUUCGCCAUCCCUAACCGGCGGCGGGACGAUCUGCAGCUGGGCAGCAGCGGCUUCCUGCAGAGCAGCAACUACAAGAUCGACAUCAGCAGCUCAAGCUGCCAGCUGUACUACAGCCUGCCCCUGGUGAACGUGACCAUCAACAACUUCAACCCCAGCAGCUGGAACCGGCGGUACGGCUUCGGCAGCUUCAACCUGAGCAGCUACGACGUGGUGUACAGCGACCACUGCUUCAGCGUGAACAGCGACUUCUGCCCCUGUGCCGACCCUAGCGUGGUGAACAGCUGCGCCAAGAGCAAGCCUCCCAGCGCCAUUUGCCCCGCCGGCACCAAGUACCGGCACUGCGACCUGGACACCACCCUGUACGUGAAGAACUGGUGCCGGUGCAGCUGCCUGCCCGACCCCAUCAGCACCUACAGCCCCAACACCUGUCCCCAGAAGAAGGUGGUGGUGGGUAUCGGCGAGCACUGUCCCGGCCUGGGCAUCAACGAGGAGAAGUGCGGCACCCAGCUGAACCACAGCAGCUGCUUCUGUAGCCCCGACGCCUUCCUGGGCUGGAGCUUCGACAGCUGCAUCAGCAACAACCGGUGCAACAUCUUUAGCAACUUCAUCUUCAACGGAAUCAACAGCGGCACCACCUGCAGCAACGACCUGCUGUAUAGCAACACCGAGAUCAGCACCGGCGUGUGCGUGAACUACGACCUGUACGGCAUCACCGGCCAGGGCAUCUUCAAGGAGGUGAGCGCCGCCUACUACAACAACUGGCAGAACCUGCUGUACGACAGCAACGGCAACAUCAUCGGCUUCAAGGACUUUCUGACCAACAAGACCUACACCAUCCUGCCCUGCUACAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 37 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFLIIFILPTTLAVIGDFNCTNSFINDYNKTIPRISEDVVDVSLGLGTYYVLNRVYLNTTLLFTGYFPKSGANFRDLALKGSIYLSTLWYKPPFLSDFNNGIFSKVKNTKLYVNNTLYSEFSTIVIGSVFVNTSYTIVVQPHNGILEITACQYTMCEYPHTVCKSKGSIRNESWHIDSSEPLCLFKKNFTYNVSADWLYFHFYQERGVFYAYYADVGMPTTFLFSLYLGTILSHYYVMPLTCNAISSNTDNETLEYWVTPLSRRQYLLNFDEHGVITNAVDCALDPLSETKCTLKSFTVEKGIYQTSGFTVKPVATVYRRIPNLPDCDIDNWLNNVSVPSPLNWERRIFSNCNFNLSTLLRLVHVDSFSCNNLDKSKIFGSCFNSITVDKFAIPNRRRDDLQLGSSGFLQSSNYKIDISSSSCQLYYSLPLVNVTINNFNPSSWNRRYGFGSFNLSSYDVVYSDHCFSVNSDFCPCADPSVVNSCAKSKPPSAICPAGTKYRHCDLDTTLYVKNWCRCSCLPDPISTYSPNTCPQKKVVVGIGEHCPGLGINEEKCGTQLNHSSCFCSPDAFLGWSFDSCISNNRCNIFSNFIFNGINSGTTCSNDLLYSNTEISTGVCVNYDLYGITGQGIFKEVSAAYYNNWQNLLYDSNGNIIGFKDFLTNKTYTILPCYSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 38 聚A尾 100 nt    OC43 S1 次單元連接之 SARS-CoV-2 S2 次單元 SEQ ID NO: 39自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 40及3' UTR SEQ ID NO: 4。 39 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCCUGAUCCUGCUGAUCAGCCUGCCCACCGCCUUCGCCGUGAUCGGCGACCUGAAGUGCACCAGCGACAACAUCAACGACAAGGACACCGGCCCACCACCCAUCAGCACCGACACCGUGGACGUGACCAACGGCCUGGGCACCUACUACGUGCUGGACCGGGUGUACCUGAACACCACCCUGUUCCUGAACGGCUACUACCCCACCAGCGGCAGCACCUACCGGAAUAUGGCCCUGAAGGGCAGCGUGCUGCUGAGCCGGCUGUGGUUCAAGCCACCAUUCCUGAGCGACUUCAUCAACGGAAUCUUCGCCAAGGUGAAGAACACCAAGGUGAUCAAGGACCGGGUGAUGUACAGCGAGUUCCCCGCCAUCACCAUUGGCAGUACCUUCGUGAACACCAGCUACAGCGUGGUGGUGCAGCCCCGGACCAUCAACAGCACCCAGGACGGCGACAACAAGCUGCAGGGCCUGCUGGAGGUGAGCGUGUGCCAGUACAACAUGUGCGAGUACCCUCAGACCAUCUGCCACCCCAACCUGGGCAACCACCGGAAGGAGCUGUGGCACCUGGACACCGGCGUGGUGAGCUGCCUGUACAAGCGGAACUUCACCUACGACGUAAACGCCGACUACCUGUACUUCCACUUCUACCAGGAGGGCGGCACCUUCUACGCCUACUUCACCGACACGGGCGUGGUGACCAAGUUCCUGUUCAACGUGUACCUGGGCAUGGCCCUGAGCCACUACUACGUGAUGCCCCUGACCUGUAACAGCAAGCUGACCCUGGAGUACUGGGUGACCCCUCUGACCAGCCGGCAGUACCUGCUGGCCUUCAACCAGGACGGCAUCAUCUUCAACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAACGGCUACACCGUGCAGCCCAUCGCCGACGUGUACCGGCGGAAGCCCAACCUGCCCAACUGCAACAUCGAGGCCUGGCUGAACGACAAGAGCGUGCCCUCGCCCCUGAACUGGGAGCGGAAGACCUUCAGCAACUGCAACUUCAACAUGAGCAGUCUGAUGAGCUUCAUCCAGGCCGACAGCUUCACCUGCAACAACAUCGACGCCGCCAAGAUCUACGGCAUGUGCUUCAGCAGCAUCACCAUCGACAAGUUUGCCAUCCCCAACGGCCGGAAGGUGGACCUGCAGCUGGGCAACCUGGGCUACCUGCAGAGCUUCAACUACCGGAUCGACACCACCGCCACCUCUUGCCAGCUGUACUACAACCUGCCCGCCGCCAACGUGAGCGUGAGCCGGUUCAACCCCAGCACCUGGAACAAGCGGUUCGGCUUCAUUGAGGACAGCGUGUUCAAACCCCGGCCCGCAGGAGUACUGACCAACCACGACGUGGUGUACGCCCAGCACUGCUUCAAGGCACCCAAGAACUUCUGCCCCUGCAAGCUGAACGGCAGCUGUGUGGGCUCUGGCCCCGGUAAGAACAACGGCAUAGGGACUUGCCCGGCAGGGACCAACUACCUGACCUGCGACAACCUGUGCACACCCGACCCCAUCACCUUCACCGGCACCUACAAGUGUCCCCAGACCAAGAGCCUGGUGGGCAUCGGCGAGCACUGCAGCGGCCUGGCCGUGAAGAGCGACUACUGCGGCGGCAACAGCUGCACCUGUCGGCCCCAGGCCUUCCUGGGCUGGAGCGCCGACAGCUGCCUGCAGGGCGACAAGUGCAACAUCUUUGCCAACUUCAUCCUGCACGACGUGAACAGCGGCCUGACCUGCAGCACCGACCUGCAGAAGGCCAACACCGACAUCAUCCUGGGCGUGUGCGUGAACUACGACUUGUACGGCAUCCUGGGCCAGGGCAUCUUCGUGGAGGUGAACGCCACCUACUACAACAGCUGGCAGAACCUGCUGUACGACAGCAACGGCAACCUGUACGGCUUCCGGGACUACAUCAUCAACCGGACCUUCAUGAUCCGGAGCUGCUACAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 40 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFLILLISLPTAFAVIGDLKCTSDNINDKDTGPPPISTDTVDVTNGLGTYYVLDRVYLNTTLFLNGYYPTSGSTYRNMALKGSVLLSRLWFKPPFLSDFINGIFAKVKNTKVIKDRVMYSEFPAITIGSTFVNTSYSVVVQPRTINSTQDGDNKLQGLLEVSVCQYNMCEYPQTICHPNLGNHRKELWHLDTGVVSCLYKRNFTYDVNADYLYFHFYQEGGTFYAYFTDTGVVTKFLFNVYLGMALSHYYVMPLTCNSKLTLEYWVTPLTSRQYLLAFNQDGIIFNAVDCALDPLSETKCTLKSFTVEKGIYQTNGYTVQPIADVYRRKPNLPNCNIEAWLNDKSVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCNNIDAAKIYGMCFSSITIDKFAIPNGRKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAANVSVSRFNPSTWNKRFGFIEDSVFKPRPAGVLTNHDVVYAQHCFKAPKNFCPCKLNGSCVGSGPGKNNGIGTCPAGTNYLTCDNLCTPDPITFTGTYKCPQTKSLVGIGEHCSGLAVKSDYCGGNSCTCRPQAFLGWSADSCLQGDKCNIFANFILHDVNSGLTCSTDLQKANTDIILGVCVNYDLYGILGQGIFVEVNATYYNSWQNLLYDSNGNLYGFRDYIINRTFMIRSCYSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 41 聚A尾 100 nt    編碼之結構域抗原 In some embodiments, the composition comprises mRNA encoding a chimeric protein (eg, a chimeric S1-S2 protein having the S1 subunit of the S protein from one virus and the S2 subunit of the S protein from a different virus). For example, the S2 subunit can be from SARS-CoV-2, and the S1 subunit can be from HKU1. As another example, the S2 subunit can be from SARS-CoV-2, and the S1 subunit can be from OC43. These chimeric proteins may be opsonized by circulating antibodies specific for the S1 subunit of HKU1 or OC43 generated by prior exposure to promote efficient uptake of the SARS-CoV-2 S2 subunit peptide by macrophages and dendritic cells and to It is cross-presented to CD4 + T cells. Opsonization by circulating antibodies also facilitates capture by follicular dendritic cells for presentation to B cells with receptors specific for the SARS-CoV-2 S2 subunit epitope. Non-limiting examples of chimeric S1/S2 subunit constructs and the mRNAs encoding them are provided in Table 5A and Table 5B below. Table 5A. Chimeric S1 Subunit - S2 Subunit Antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 S2 subunit linked to HKU1 S1 subunit 37 38 SARS-CoV-2 S2 subunit linked to OC43 S1 subunit 40 41 Table 5B. Chimeric S1 Subunit - S2 Subunit Antigens SARS-CoV-2 S2 subunit linked to HKU1 S1 subunit SEQ ID NO: 36 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 37 and 3' UTR SEQ ID NO: 4. 36 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCCUGAUCAUCUUCAUCCUGCCCACCACCCUGGCCGUGAUCGGCGACUUCAACUGCACCAACAGCUUCAUCAACGACUACAACAAGACCAUCCCUCGGAUCAGCGAAGACGUGGUGGACGUGUCCCUGGGCCUGGGCACCUACUACGUGCUGAACCGGGUGUACCUGAACACCACACUGCUGUUCACCGGCUACUUCCCCAAGAGCGGCGCCAACUUCCGGGACCUGGCCCUGAAGGGCAGCAUCUACCUGAGCACCUUGUGGUACAAGCCUCCCUUCCUGAGCGACUUCAAUAACGGCAUCUUCUCUAAGGUGAAGAACACCAAGCUGUACGUAAACAACACCCUGUACAGCGAGUUCAGCACCAUCGUGAUCGGCAGCGUGUUCGUCAACACCAGCUACACCAUCGUGGUGCAGCCCCACAACGGCAUCCUGGAGAUCACCGCCUGCCAGUACACCAUGUGCGAGUACCCUCACACCGUGUGCAAGAGCAAGGGCUCCAUCCGGAACGAGAGCUGGCACAUCGACAGCAGCGAGCCGCUGUGCCUGUUCAAGAAGAACUUCACCUACAACGUGAGCGCCGACUGGCUGUACUUCCACUUCUACCAGGAGCGGGGCGUGUUCUACGCCUACUACGCCGACGUGGGCAUGCCAACCACCUUCCUGUUCAGCCUGUACCUGGGCACCAUCCUGAGCCACUACUACGUGAUGCCCCUGACCUGCAACGCCAUCAGCUCAAACACCGACAACGAGACCCUGGAGUACUGGGUGACUCCACUGAGCCGGCGGCAGUACCUGCUGAACUUCGACGAGCACGGCGUGAUCACCAACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCGGCUUCACCGUGAAGCCCGUAGCCACCGUGUACCGGCGGAUCCCCAACCUGCCCGACUGCGACAUCGACAACUGGCUGA ACAACGUCAGCGUGCCCAGCCCACUGAACUGGGAGCGGCGGAUCUUCAGCAACUGCAACUUCAAUCUGAGCACCCUGCUGCGGCUGGUGCACGUGGACAGCUUCAGCUGCAACAACCUGGACAAGAGCAAGAUCUUCGGUAGCUGCUUCAACAGCAUCACCGUGGACAAGUUCGCCAUCCCUAACCGGCGGCGGGACGAUCUGCAGCUGGGCAGCAGCGGCUUCCUGCAGAGCAGCAACUACAAGAUCGACAUCAGCAGCUCAAGCUGCCAGCUGUACUACAGCCUGCCCCUGGUGAACGUGACCAUCAACAACUUCAACCCCAGCAGCUGGAACCGGCGGUACGGCUUCGGCAGCUUCAACCUGAGCAGCUACGACGUGGUGUACAGCGACCACUGCUUCAGCGUGAACAGCGACUUCUGCCCCUGUGCCGACCCUAGCGUGGUGAACAGCUGCGCCAAGAGCAAGCCUCCCAGCGCCAUUUGCCCCGCCGGCACCAAGUACCGGCACUGCGACCUGGACACCACCCUGUACGUGAAGAACUGGUGCCGGUGCAGCUGCCUGCCCGACCCCAUCAGCACCUACAGCCCCAACACCUGUCCCCAGAAGAAGGUGGUGGUGGGUAUCGGCGAGCACUGUCCCGGCCUGGGCAUCAACGAGGAGAAGUGCGGCACCCAGCUGAACCACAGCAGCUGCUUCUGUAGCCCCGACGCCUUCCUGGGCUGGAGCUUCGACAGCUGCAUCAGCAACAACCGGUGCAACAUCUUUAGCAACUUCAUCUUCAACGGAAUCAACAGCGGCACCACCUGCAGCAACGACCUGCUGUAUAGCAACACCGAGAUCAGCACCGGCGUGUGCGUGAACUACGACCUGUACGGCAUCACCGGCCAGGGCAUCUUCAAGGAGGUGAGCGCCGCCUACUACAACAACUGGCAGAACCUGCUGUACGACAGCAACGGCAACAUCAUCGGCUUCAAGGACUUUCUGACCAACAAGACCUA CACCAUCCUGCCCUGCUACAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAG AACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCA GCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 37 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFLIIFILPTTLAVIGDFNCTNSFINDYNKTIPRISEDVVDVSLGLGTYYVLNRVYLNTTLLFTGYFPKSGANFRDLALKGSIYLSTLWYKPPFLSDFNNGIFSKVKNTKLYVNNTLYSEFSTIVIGSVFVNTSYTIVVQPHNGILEITACQYTMCEYPHTVCKSKGSIRNESWHIDSSEPLCLFKKNFTYNVSADWLYFHFYQERGVFYAYYADVGMPTTFLFSLYLGTILSHYYVMPLTCNAISSNTDNETLEYWVTPLSRRQYLLNFDEHGVITNAVDCALDPLSETKCTLKSFTVEKGIYQTSGFTVKPVATVYRRIPNLPDCDIDNWLNNVSVPSPLNWERRIFSNCNFNLSTLLRLVHVDSFSCNNLDKSKIFGSCFNSITVDKFAIPNRRRDDLQLGSSGFLQSSNYKIDISSSSCQLYYSLPLVNVTINNFNPSSWNRRYGFGSFNLSSYDVVYSDHCFSVNSDFCPCADPSVVNSCAKSKPPSAICPAGTKYRHCDLDTTLYVKNWCRCSCLPDPISTYSPNTCPQKKVVVGIGEHCPGLGINEEKCGTQLNHSSCFCSPDAFLGWSFDSCISNNRCNIFSNFIFNGINSGTTCSNDLLYSNTEISTGVCVNYDLYGITGQGIFKEVSAAYYNNWQNLLYDSNGNIIGFKDFLTNKTYTILPCYSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYE NQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 38 Poly A tail 100nt SARS-CoV-2 S2 subunit linked to OC43 S1 subunit SEQ ID NO: 39 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 40 and 3' UTR SEQ ID NO: 4. 39 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCCUGAUCCUGCUGAUCAGCCUGCCCACCGCCUUCGCCGUGAUCGGCGACCUGAAGUGCACCAGCGACAACAUCAACGACAAGGACACCGGCCCACCACCCAUCAGCACCGACACCGUGGACGUGACCAACGGCCUGGGCACCUACUACGUGCUGGACCGGGUGUACCUGAACACCACCCUGUUCCUGAACGGCUACUACCCCACCAGCGGCAGCACCUACCGGAAUAUGGCCCUGAAGGGCAGCGUGCUGCUGAGCCGGCUGUGGUUCAAGCCACCAUUCCUGAGCGACUUCAUCAACGGAAUCUUCGCCAAGGUGAAGAACACCAAGGUGAUCAAGGACCGGGUGAUGUACAGCGAGUUCCCCGCCAUCACCAUUGGCAGUACCUUCGUGAACACCAGCUACAGCGUGGUGGUGCAGCCCCGGACCAUCAACAGCACCCAGGACGGCGACAACAAGCUGCAGGGCCUGCUGGAGGUGAGCGUGUGCCAGUACAACAUGUGCGAGUACCCUCAGACCAUCUGCCACCCCAACCUGGGCAACCACCGGAAGGAGCUGUGGCACCUGGACACCGGCGUGGUGAGCUGCCUGUACAAGCGGAACUUCACCUACGACGUAAACGCCGACUACCUGUACUUCCACUUCUACCAGGAGGGCGGCACCUUCUACGCCUACUUCACCGACACGGGCGUGGUGACCAAGUUCCUGUUCAACGUGUACCUGGGCAUGGCCCUGAGCCACUACUACGUGAUGCCCCUGACCUGUAACAGCAAGCUGACCCUGGAGUACUGGGUGACCCCUCUGACCAGCCGGCAGUACCUGCUGGCCUUCAACCAGGACGGCAUCAUCUUCAACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAACGGCUACACCGUGCAGCCCAUCGCCGACGUGUACCGGCGGAAGCCCAACCUGCCCA ACUGCAACAUCGAGGCCUGGCUGAACGACAAGAGCGUGCCCUCGCCCCUGAACUGGGAGCGGAAGACCUUCAGCAACUGCAACUUCAACAUGAGCAGUCUGAUGAGCUUCAUCCAGGCCGACAGCUUCACCUGCAACAACAUCGACGCCGCCAAGAUCUACGGCAUGUGCUUCAGCAGCAUCACCAUCGACAAGUUUGCCAUCCCCAACGGCCGGAAGGUGGACCUGCAGCUGGGCAACCUGGGCUACCUGCAGAGCUUCAACUACCGGAUCGACACCACCGCCACCUCUUGCCAGCUGUACUACAACCUGCCCGCCGCCAACGUGAGCGUGAGCCGGUUCAACCCCAGCACCUGGAACAAGCGGUUCGGCUUCAUUGAGGACAGCGUGUUCAAACCCCGGCCCGCAGGAGUACUGACCAACCACGACGUGGUGUACGCCCAGCACUGCUUCAAGGCACCCAAGAACUUCUGCCCCUGCAAGCUGAACGGCAGCUGUGUGGGCUCUGGCCCCGGUAAGAACAACGGCAUAGGGACUUGCCCGGCAGGGACCAACUACCUGACCUGCGACAACCUGUGCACACCCGACCCCAUCACCUUCACCGGCACCUACAAGUGUCCCCAGACCAAGAGCCUGGUGGGCAUCGGCGAGCACUGCAGCGGCCUGGCCGUGAAGAGCGACUACUGCGGCGGCAACAGCUGCACCUGUCGGCCCCAGGCCUUCCUGGGCUGGAGCGCCGACAGCUGCCUGCAGGGCGACAAGUGCAACAUCUUUGCCAACUUCAUCCUGCACGACGUGAACAGCGGCCUGACCUGCAGCACCGACCUGCAGAAGGCCAACACCGACAUCAUCCUGGGCGUGUGCGUGAACUACGACUUGUACGGCAUCCUGGGCCAGGGCAUCUUCGUGGAGGUGAACGCCACCUACUACAACAGCUGGCAGAACCUGCUGUACGACAGCAACGGCAACCUGUACGGCUUCCGGGACUACAUCAUCAACCG GACCUUCAUGAUCCGGAGCUGCUACAGCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUG UACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACCCUCCCGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCU GCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 40 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFLILLISLPTAFAVIGDLKCTSDNINDKDTGPPPISTDTVDVTNGLGTYYVLDRVYLNTTLFLNGYYPTSGSTYRNMALKGSVLLSRLWFKPPFLSDFINGIFAKVKNTKVIKDRVMYSEFPAITIGSTFVNTSYSVVVQPRTINSTQDGDNKLQGLLEVSVCQYNMCEYPQTICHPNLGNHRKELWHLDTGVVSCLYKRNFTYDVNADYLYFHFYQEGGTFYAYFTDTGVVTKFLFNVYLGMALSHYYVMPLTCNSKLTLEYWVTPLTSRQYLLAFNQDGIIFNAVDCALDPLSETKCTLKSFTVEKGIYQTNGYTVQPIADVYRRKPNLPNCNIEAWLNDKSVPSPLNWERKTFSNCNFNMSSLMSFIQADSFTCNNIDAAKIYGMCFSSITIDKFAIPNGRKVDLQLGNLGYLQSFNYRIDTTATSCQLYYNLPAANVSVSRFNPSTWNKRFGFIEDSVFKPRPAGVLTNHDVVYAQHCFKAPKNFCPCKLNGSCVGSGPGKNNGIGTCPAGTNYLTCDNLCTPDPITFTGTYKCPQTKSLVGIGEHCSGLAVKSDYCGGNSCTCRPQAFLGWSADSCLQGDKCNIFANFILHDVNSGLTCSTDLQKANTDIILGVCVNYDLYGILGQGIFVEVNATYYNSWQNLLYDSNGNLYGFRDYIINRTFMIRSCYSGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVL YENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDPPEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 41 Poly A tail 100nt encoded domain antigen

本揭示案之其他態樣提供包含編碼S蛋白之SARS-CoV-2 S1次單元之(至少一個)子結構域之mRNA的組合物。子結構域可為N末端結構域(NTD)或受體結合結構域(RBD)(有或無SD1及/或SD2)。在一些實施例中,mRNA編碼NTD及RBD (有或無SD1及/或SD2)之組合(例如,非天然組合)。在一些實施例中,NTD及/或RBD與跨膜結構域(有或無SD1及/或SD2)連接。在一些實施例中,mRNA編碼S蛋白之SARS-CoV-2 S1次單元之兩個子結構域(NTD及RBD),其經突變以包含半胱胺酸殘基。在一些實施例中,該等突變導致二硫鍵之形成。作為實例,mRNA可編碼包含F43C突變之NTD及包含Q563C突變之RBD,最終產生經由二硫鍵與RBD連接之NTD。 N 末端結構域 (NTD) 構築體 Other aspects of the disclosure provide compositions comprising mRNA encoding a (at least one) subdomain of the SARS-CoV-2 S1 subunit of the S protein. A subdomain can be an N-terminal domain (NTD) or a receptor binding domain (RBD) (with or without SD1 and/or SD2). In some embodiments, the mRNA encodes a combination (eg, a non-natural combination) of NTD and RBD (with or without SD1 and/or SD2). In some embodiments, the NTD and/or RBD is linked to the transmembrane domain (with or without SD1 and/or SD2). In some embodiments, the mRNA encodes two subdomains (NTD and RBD) of the SARS-CoV-2 S1 subunit of the S protein, which are mutated to include cysteine residues. In some embodiments, the mutations result in the formation of disulfide bonds. As an example, an mRNA can encode an NTD comprising the F43C mutation and an RBD comprising the Q563C mutation, ultimately resulting in an NTD linked to the RBD via a disulfide bond. N -terminal domain (NTD) construct

在一些實施例中,本文提供之mRNA編碼SARS-CoV-2 S蛋白之S1次單元之NTD。某些β冠狀病毒之NTD會引發保護性位準之抗體。對其他β冠狀病毒(諸如MERS)之NTD特異之抗體藉由防止膜融合及病毒進入起作用(Zhou H等人, Nat Commun.2019; 3068),提供不同於防止病毒附著至ACE2之第二種中和機制。由本揭示案之mRNA編碼之SARS-CoV-2 NTD可為可溶的或膜結合的。膜結合之SARS-CoV-2 NTD抗原及編碼其之mRNA之非限制性實例提供於下文 6A 6B中。 6A. 膜結合之 NTD 抗原 名稱 SEQ ID NO:    mRNA ORF 蛋白質 與跨膜結構域連接之SARS-CoV-2 NTD (NTD-TM) 46 47 6B. 膜結合之 NTD 抗原 與跨膜結構域連接之 SARS-CoV-2 NTD (NTD-TM) SEQ ID NO: 45自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 46及3' UTR SEQ ID NO: 4。 45 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 46 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDSGGGSILAIYSTVASSLVLLVSLGAISF 47 聚A尾 100 nt    受體結合結構域 (RBD) 構築體 In some embodiments, the mRNA provided herein encodes the NTD of the S1 subunit of the SARS-CoV-2 S protein. Certain betacoronavirus NTDs elicit protective levels of antibodies. Antibodies specific for the NTD of other betacoronaviruses, such as MERS, function by preventing membrane fusion and viral entry (Zhou H et al., Nat Commun. 2019; 3068), providing a second alternative to preventing viral attachment to ACE2 Neutralization mechanism. The SARS-CoV-2 NTDs encoded by the mRNAs of the present disclosure may be soluble or membrane bound. Non-limiting examples of membrane-bound SARS-CoV-2 NTD antigens and mRNAs encoding them are provided in Tables 6A and 6B below . Table 6A. Membrane bound NTD antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD linked to transmembrane domain (NTD-TM) 46 47 Table 6B. Membrane bound NTD antigens SARS-CoV-2 NTD linked to transmembrane domain (NTD-TM) SEQ ID NO: 45 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 46 and 3' UTR SEQ ID NO: 4. 45 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 46 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDSGGGSILAIYSTVASSLVLLVSLGAISF 47 Poly A tail 100nt Receptor binding domain (RBD) construct

在其他實施例中,本文提供之mRNA編碼SARS-CoV-2 S蛋白之S1次單元之RBD。RBD結合宿主細胞上之ACE2受體,此介導病毒附著至細胞。附著對於病毒進入細胞及複製係必需的。因此,阻斷病毒附著至細胞中之RBD靶向抗體反應有效地中和細胞外病毒顆粒,從而防止增殖且促進對中和之病毒顆粒之其他組分之進一步免疫反應。由本揭示案之mRNA編碼之SARS-CoV-2 RBD可為可溶的或膜結合的(例如,與跨膜結構域連接)。 可溶性 RBD 抗原 In other embodiments, the mRNA provided herein encodes the RBD of the S1 subunit of the SARS-CoV-2 S protein. RBD binds to ACE2 receptors on host cells, which mediate viral attachment to cells. Attachment is necessary for virus entry into cells and replication lines. Thus, RBD-targeted antibody responses that block viral attachment into cells effectively neutralize extracellular viral particles, preventing proliferation and promoting further immune responses to other components of the neutralized viral particles. The SARS-CoV-2 RBD encoded by the mRNA of the present disclosure can be soluble or membrane bound (eg, linked to a transmembrane domain). soluble RBD antigen

在一些實施例中,mRNA編碼可溶性SARS-CoV-2 RBD。樹突細胞藉由胞飲作用對可溶性蛋白進行取樣,且在遷移至引流淋巴結時,將包含取樣之蛋白質之綫性肽呈遞至CD4 +T細胞。該等CD4 +T細胞向已經識別、攝取及呈遞來自RBD之抗原決定基之B細胞提供增殖信號,因此投與特异性RBD而沒有SARS-CoV-2刺突蛋白之其他組分預計將免疫反應集中於RBD中存在之抗原決定基。可溶性SARS-CoV-2 RBD及編碼其之mRNA之非限制性實例提供於下文 7A 7B中。 7. 可溶性 RBD 抗原 名稱 SEQ ID NO:    mRNA ORF 蛋白質 SARS-CoV-2可溶性RBD 61 62 7B. 可溶性 RBD 抗原 SARS-CoV-2 可溶性 RBD SEQ ID NO: 60自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 61及3' UTR SEQ ID NO: 4。 60 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAG 61 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPK 62 聚A尾 100 nt    膜結合之 RBD 抗原 In some embodiments, the mRNA encodes a soluble SARS-CoV-2 RBD. Dendritic cells sample soluble proteins by pinocytosis and, upon migration to draining lymph nodes, present linear peptides containing the sampled proteins to CD4 + T cells. These CD4 + T cells provide proliferative signals to B cells that have recognized, taken up and presented epitopes from RBDs, so administration of specific RBDs without other components of the SARS-CoV-2 spike protein is expected to result in an immune response Focus on epitopes present in RBD. Non-limiting examples of soluble SARS-CoV-2 RBDs and mRNAs encoding them are provided in Tables 7A and 7B below . Table 7. Soluble RBD Antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 soluble RBD 61 62 Table 7B. Soluble RBD Antigens SARS-CoV-2 soluble RBD SEQ ID NO: 60 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 61 and 3' UTR SEQ ID NO: 4. 60 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAG 61 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPK 62 Poly A tail 100nt Membrane bound RBD antigen

在一些實施例中,mRNA編碼膜結合之SARS-CoV-2 RBD。預計表現膜結合之RBD之細胞將該等膜結合之抗原攜帶至引流淋巴結,且促進RBD特異性B細胞對抗原決定基之有效識別。由於B細胞表面含有許多表面結合之抗體且表現細胞含有許多膜結合之RBD之拷貝,故預計B細胞最初識別抗原後會發生B細胞受體之交聯,從而經由親合效應刺激強反應。膜結合之SARS-CoV-2 RBD及編碼其之mRNA之非限制性實例提供於下文 8A 8B中。 8A. 膜結合之 RBD 抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與跨膜結構域連接之SARS-CoV-2 RBD (RBD-TM) 76 77 8B. 膜結合之 RBD 抗原 與跨膜結構域連接之 SARS-CoV-2 RBD (RBD-TM) SEQ ID NO: 75自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 76及3' UTR SEQ ID NO: 4。 75 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 76 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKSGGGSILAIYSTVASSLVLLVSLGAISF 77 聚A尾 100 nt    結構域融合抗原 In some embodiments, the mRNA encodes a membrane-bound SARS-CoV-2 RBD. Cells expressing membrane-bound RBDs are expected to carry these membrane-bound antigens to draining lymph nodes and promote efficient recognition of epitopes by RBD-specific B cells. Since B cells contain many surface-bound antibodies on the surface and expressing cells contain many copies of membrane-bound RBDs, it is expected that cross-linking of B cell receptors occurs after initial recognition of antigen by B cells, thereby stimulating strong responses via affinity effects. Non-limiting examples of membrane-bound SARS-CoV-2 RBDs and the mRNAs encoding them are provided in Tables 8A and 8B below . Table 8A. Membrane bound RBD antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 RBD linked to transmembrane domain (RBD-TM) 76 77 Table 8B. Membrane bound RBD antigens SARS-CoV-2 RBD linked to transmembrane domain (RBD-TM) SEQ ID NO: 75 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 76 and 3' UTR SEQ ID NO: 4. 75 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 76 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGAISFPKSGGGAI 77 Poly A tail 100nt domain fusion antigen

在其他實施例中,本文提供之mRNA編碼SARS-CoV-2 NTD-RBD融合蛋白。舉例而言,S蛋白之SARS-CoV-2 S1次單元之NTD及RBD可經由連接體(諸如短胺基酸(例如,甘胺酸-絲胺酸)連接體)彼此連接,以允許結構域之間之撓性/鉸接及空間。在另一實施例中,可使用包含抗原性抗原決定基(例如II類通用T細胞抗原決定基,諸如PADRE)之連接體。在一些實施例中,跨膜區例如經由另一短胺基酸(例如,甘胺酸-絲胺酸或PADRE)連接體與NTD-RBD融合體連接以獲得撓性且允許膜與抗原之間存在合理距離。不受限於理論,據信此種膜結合之串聯組態在一個開放閱讀框中呈遞大部分(若並非全部)已知中和抗原決定基及保護性抗原決定基。投與該融合蛋白接著應將免疫反應集中於已知保護性抗原決定基,且减少對非保護性抗原決定基特異之抗體及T細胞之不必要產生。此外,針對不同結構域之抗體可經由不同機制、例如藉由阻斷與宿主細胞之附著或防止結合之病毒經歷膜融合且進入宿主細胞來中和病毒顆粒。因此,由包含不同結構域之融合蛋白引發之廣泛反應在進化上可能更强,需要多個不同突變來逃避疫苗誘導之免疫。SARS-CoV-2 NTD-RBD融合蛋白及編碼其之mRNA之非限制性實例提供於下文 9A 9B中。 連接體 In other embodiments, the mRNA provided herein encodes a SARS-CoV-2 NTD-RBD fusion protein. For example, the NTD and RBD of the SARS-CoV-2 S1 subunit of the S protein can be linked to each other via a linker, such as a short amino acid (eg, glycine-serine) linker, to allow the domains Flexibility/hinged and space between. In another embodiment, linkers comprising antigenic epitopes (eg, class II universal T cell epitopes such as PADRE) can be used. In some embodiments, the transmembrane region is linked to the NTD-RBD fusion, eg, via another short amino acid (eg, glycine-serine or PADRE) linker to obtain flexibility and allow for the separation between the membrane and the antigen A reasonable distance exists. Without being bound by theory, it is believed that this membrane-bound tandem configuration presents most, if not all, known neutralizing and protective epitopes in one open reading frame. Administration of the fusion protein should then focus the immune response on known protective epitopes and reduce unwanted production of antibodies and T cells specific for non-protective epitopes. In addition, antibodies directed against different domains can neutralize viral particles through different mechanisms, such as by blocking attachment to the host cell or preventing the bound virus from undergoing membrane fusion and entering the host cell. Thus, the broad response elicited by fusion proteins comprising different domains may be evolutionarily stronger, requiring multiple distinct mutations to evade vaccine-induced immunity. Non-limiting examples of SARS-CoV-2 NTD-RBD fusion proteins and mRNAs encoding them are provided in Table 9A and Table 9B below. linker

根據本揭示案可使用多種連接體。如本文提供之連接體僅僅係將兩個其他胺基酸序列人工連接在一起之胺基酸序列。本文所用之連接體可為可裂解的或不可裂解的。可裂解之連接體允許將mRNA轉譯成多肽,之後裂解連接體,允許獨立地釋放每一個別組分。不可裂解之連接體保持連接一或多個蛋白質次單元,允許整個蛋白質執行需要組分次單元緊密接近之功能。該等連接體之非限制性實例包括甘胺酸-絲胺酸(GS)連接體(不可裂解的);以及F2A連接體、P2A連接體、T2A連接體及E2A連接體(可裂解的)。本文可使用其他連接法。A variety of linkers can be used in accordance with the present disclosure. Linkers as provided herein are simply amino acid sequences that artificially link together two other amino acid sequences. Linkers as used herein can be cleavable or non-cleavable. The cleavable linker allows translation of the mRNA into a polypeptide, followed by cleavage of the linker, allowing each individual component to be released independently. Non-cleavable linkers remain linked to one or more protein subunits, allowing the entire protein to perform functions that require close proximity of the subunits. Non-limiting examples of such linkers include glycine-serine (GS) linkers (non-cleavable); and F2A linkers, P2A linkers, T2A linkers, and E2A linkers (cleavable). Other connection methods may be used in this article.

在一些實施例中,連接體係GS連接體。GS連接體係包括甘胺酸及絲胺酸胺基酸重複之多肽連接體。其等包含撓性及親水性殘基,且可用於實施蛋白質次單元之融合,而不干擾蛋白質結構域之摺疊及功能,且不形成二級結構。在一些實施例中,mRNA編碼包含3至20個胺基酸長之GS連接體之融合蛋白。舉例而言,GS連接體可具有之長度為(或具有之長度至少為)3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19或20個胺基酸。在一些實施例中,GS連接體係(或至少係) 15個胺基酸長(例如,GGSGGSGGSGGSGGG (SEQ ID NO: 133))。在一些實施例中,GS連接體係(或至少係) 8個胺基酸長(例如,GGGSGGGS (SEQ ID NO: 134))。在一些實施例中,GS連接體係(或至少係) 7個胺基酸長(例如,GGGSGGG (SEQ ID NO: 135))。在一些實施例中,GS連接體係(或至少係) 4個胺基酸長(例如,GGGS (SEQ ID NO: 136))。在一些實施例中,GS連接體包含(GGGS)n (SEQ ID NO: 136),其中n為1-5之任一整數。在一些實施例中,GS連接體係(或至少係) 4個胺基酸長(例如,GSGG (SEQ ID NO: 152))。在一些實施例中,GS連接體包含(GSGG)n (SEQ ID NO: 152),其中n為1-5之任一整數。In some embodiments, the ligation system is a GS linker. The GS linker system includes polypeptide linkers of glycine and serine amino acid repeats. These contain flexible and hydrophilic residues and can be used to effect fusion of protein subunits without interfering with the folding and function of the protein domains and without forming secondary structures. In some embodiments, the mRNA encodes a fusion protein comprising a 3 to 20 amino acid long GS linker. For example, a GS linker can have a length of (or have a length of at least) 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 amino acids. In some embodiments, the GS linker is (or at least is) 15 amino acids long (eg, GGSGGSGGSGGSGGGG (SEQ ID NO: 133)). In some embodiments, the GS linker is (or at least is) 8 amino acids long (eg, GGGSGGGS (SEQ ID NO: 134)). In some embodiments, the GS linker is (or at least is) 7 amino acids long (eg, GGGSGGG (SEQ ID NO: 135)). In some embodiments, the GS linker is (or at least is) 4 amino acids long (eg, GGGS (SEQ ID NO: 136)). In some embodiments, the GS linker comprises (GGGS)n (SEQ ID NO: 136), wherein n is any integer from 1-5. In some embodiments, the GS linker is (or at least is) 4 amino acids long (eg, GSGG (SEQ ID NO: 152)). In some embodiments, the GS linker comprises (GSGG)n (SEQ ID NO: 152), wherein n is any integer from 1-5.

在一些實施例中,連接體係甘胺酸連接體,例如具有3個胺基酸之長度(或至少3個胺基酸之長度) (例如,GGG)。In some embodiments, the linking system is a glycine linker, eg, having a length of 3 amino acids (or at least 3 amino acids in length) (eg, GGG).

在一些實施例中,由mRNA疫苗編碼之蛋白質包括一個以上連接體,其可彼此相同或不同(例如,在同一S蛋白構築體中之GGGSGGG (SEQ ID NO: 135)及GGGS (SEQ ID NO: 136))。In some embodiments, the protein encoded by the mRNA vaccine includes more than one linker, which may be the same or different from each other (e.g., GGGSGGG (SEQ ID NO: 135) and GGGS (SEQ ID NO: 135) in the same S protein construct 136)).

在一些實施例中,連接體包含編碼泛HLA DR結合抗原決定基(PADRE)之mRNA (例如,AKFVAAWTLKAAA (SEQ ID NO: 148))。PADRE係免疫顯性輔助CD4 T細胞抗原決定基及强效免疫原(參見,例如Alexander J.等人,J of Immuno. 164(3): 1625-33,以引用方式併入本文中)。 9A. 結構域融合抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與跨膜結構域連接之SARS-CoV-2 NTD-RBD (NTD-RBD-TM) 91 92 與跨膜結構域連接之SARS-CoV-2 RBD-NTD (RBD-NTD-TM) 139 140 與跨膜結構域連接之SARS-CoV-2 NTD-PADRE-RBD (NTD-PADRE-RBD-TM) 142 143 9B. 結構域融合抗原 與跨膜結構域連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-TM) SEQ ID NO: 90自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 91及3' UTR SEQ ID NO: 4。 90 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 91 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisf 92 聚A尾 100 nt    與跨膜結構域連接之 SARS-CoV-2 RBD-NTD (RBD-NTD-TM) SEQ ID NO: 141自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 139及3' UTR SEQ ID NO: 4。 141 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCGGAGGCGGAAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 139 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKSGGGSGGGSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDSGGGSILAIYSTVASSLVLLVSLGAISF 140 聚A尾 100 nt    與跨膜結構域連接之 SARS-CoV-2 NTD-PADRE-RBD (NTD-PADRE-RBD-TM) SEQ ID NO: 144自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 142及3' UTR SEQ ID NO: 4。 144 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGCCAAGUUCGUGGCCGCCUGGACUCUGAAGGCCGCAGCCGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 142 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDGGAKFVAAWTLKAAAGGPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKSGGGSILAIYSTVASSLVLLVSLGAISF 143 聚A尾 100 nt    運輸信號 In some embodiments, the linker comprises mRNA encoding a pan-HLA DR-binding epitope (PADRE) (eg, AKFVAAWTLKAAA (SEQ ID NO: 148)). PADRE is an immunodominant helper CD4 T cell epitope and potent immunogen (see, eg, Alexander J. et al., J of Immuno. 164(3): 1625-33, incorporated herein by reference). Table 9A. Domain fusion antigens name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD-RBD linked to transmembrane domain (NTD-RBD-TM) 91 92 SARS-CoV-2 RBD-NTD linked to transmembrane domain (RBD-NTD-TM) 139 140 SARS-CoV-2 NTD-PADRE-RBD linked to transmembrane domain (NTD-PADRE-RBD-TM) 142 143 Table 9B. Domain fusion antigens SARS-CoV-2 NTD-RBD linked to transmembrane domain (NTD-RBD-TM) SEQ ID NO: 90 consists of the following from 5' to 3': 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 91 and 3' UTR SEQ ID NO: 4. 90 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 91 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisf 92 Poly A tail 100nt SARS-CoV-2 RBD-NTD linked to transmembrane domain (RBD-NTD-TM) SEQ ID NO: 141 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 139 and 3' UTR SEQ ID NO: 4. 141 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCGGAGGCGGAAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGC UGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 139 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKSGGGSGGGSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDSGGGSILAIYSTVASSLVLLVSLGAISF 140 Poly A tail 100nt SARS-CoV-2 NTD-PADRE-RBD linked to transmembrane domain (NTD-PADRE-RBD-TM) SEQ ID NO: 144 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 142 and 3' UTR SEQ ID NO: 4. 144 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGCCAAGUUCGUGGCCGCCUGGACUCUGAAGGCCGCAGCCGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGA AGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 142 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDGGAKFVAAWTLKAAAGGPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKSGGGSILAIYSTVASSLVLLVSLGAISF 143 Poly A tail 100nt transport signal

在一些實施例中,mRNA編碼與高爾基體(Golgi)運輸信號連接之SARS-CoV-2 S蛋白結構域(例如,NTD、RBD或NTD-RBD融合體)。該等信號之非限制性實例包括巨噬細胞標記物,諸如CD86及/或CD11b,其係高度表現的,且細胞內區可控制自高爾基體至細胞表面之有效輸出。本文可使用其他細胞運輸信號(序列),例如VSV-G胞質尾(VSVGct)。預計將編碼之蛋白質更有效地運輸至細胞表面會增加B細胞對抗原之識別,且因此促進產生針對編碼之SARS-CoV-2 S蛋白結構域之抗體。與運輸信號連接之SARS-CoV-2抗原及編碼其之mRNA之非限制性實例提供於下文 10A 10B中。 10A. 與運輸信號連接之結構域融合抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與跨膜結構域及huCD86連接之SARS-CoV-2 NTD-RBD (NTD-RBD-TM-CD86) 94 95 與跨膜結構域及huCD11B連接之SARS-CoV-2 NTD-RBD (NTD-RBD-TM-CD11B) 97 98 與VSVGct連接之SARS-CoV-2 NTD-RBD (NTD-RBD-VSVGct) 109 110 10B. 與運輸信號連接之結構域融合抗原 與跨膜結構域及 huCD86 連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-TM-CD86) SEQ ID NO: 93自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 94及3' UTR SEQ ID NO: 4。 93 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCAAGAAGAAGAAGCGGCCACGGAACUCCUACAAGUGCGGCACCAACACCAUGGAGCGGGAGGAGAGCGAGCAGACCAAGAAGCGGGAGAAGAUCCACAUUCCUGAACGGUCCGACGAAGCCCAGCGGGUGUUCAAGAGCAGCAAGACCAGCAGCUGCGACAAGAGCGACACCUGCUUC 94 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfKKKKRPRNSYKCGTNTMEREESEQTKKREKIHIPERSDEAQRVFKSSKTSSCDKSDTCF 95 聚A尾 100 nt    與跨膜結構域及 huCD11B 連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-TM-CD11B) SEQ ID NO: 96自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 97及3' UTR SEQ ID NO: 4。 96 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCAAGCGGCAGUACAAGGACAUGAUGAGCGAGGGAGGACCACCUGGCGCUGAGCCACAG 97 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfKRQYKDMMSEGGPPGAEPQ 98 聚A尾 100 nt    VSVGct 連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-VSVGct) SEQ ID NO: 108自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 109及3' UTR SEQ ID NO: 4。 108 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAGCAGCAUCGCCAGCUUCUUCUUCAUCAUCGGGCUGAUCAUCGGCCUCUUCCUGGUGCUGCGGGUGGGCAUCCACCUGUGCAUCAAGCUGAAGCACACCAAGAAGAGACAGAUCUACACCGACAUCGAGAUGAACCGGCUGGGCAAG 109 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsSSIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK 110 聚A尾 100 nt    結構域融合 C 末端截短 In some embodiments, the mRNA encodes a SARS-CoV-2 S protein domain (eg, NTD, RBD, or NTD-RBD fusion) linked to a Golgi trafficking signal. Non-limiting examples of such signals include macrophage markers, such as CD86 and/or CD11b, which are highly expressed and the intracellular domain that controls efficient export from the Golgi to the cell surface. Other cellular trafficking signals (sequences) such as the VSV-G cytoplasmic tail (VSVGct) can be used herein. More efficient transport of the encoded protein to the cell surface is expected to increase B cell recognition of the antigen and thus facilitate the production of antibodies against the encoded SARS-CoV-2 S protein domain. Non-limiting examples of SARS-CoV-2 antigens linked to trafficking signals and mRNAs encoding them are provided in Table 10A and Table 10B below. Table 10A. Domain fusion antigens linked to trafficking signals name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD-RBD linked to transmembrane domain and huCD86 (NTD-RBD-TM-CD86) 94 95 SARS-CoV-2 NTD-RBD linked to transmembrane domain and huCD11B (NTD-RBD-TM-CD11B) 97 98 SARS-CoV-2 NTD-RBD linked to VSVGct (NTD-RBD-VSVGct) 109 110 Table 10B. Domain fusion antigens linked to trafficking signals SARS-CoV-2 NTD-RBD linked to transmembrane domain and huCD86 (NTD-RBD-TM-CD86) SEQ ID NO: 93 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 94 and 3' UTR SEQ ID NO: 4. 93 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCAAGAAGAAGAAGCGGCCACGGAACUCCUACAAGUGCGGCACCAACACCAUGGAGCGGGAGGAGAGCGAGCAGACCAAGAAGCGGGAGAAGAUCCACAUUCCUGAACGGUCCGACGAAGCCCAGCGGGUGUUCAAGAGCAGCAAGACCAGCAGCUGCGACAAGAGCGACACCUGCUUC 94 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfKKKKRPRNSYKCGTNTMEREESEQTKKREKIHIPERSDEAQRVFKSSKTSSCDKSDTCF 95 Poly A tail 100nt SARS-CoV-2 NTD-RBD linked to transmembrane domain and huCD11B (NTD-RBD-TM-CD11B) SEQ ID NO: 96 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 97 and 3' UTR SEQ ID NO: 4. 96 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCAAGCGGCAGUACAAGGACAUGAUGAGCGAGGGAGGACCACCUGGCGCUGAGCCACAG 97 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfKRQYKDMMSEGGPPGAEPQ 98 Poly A tail 100nt SARS-CoV-2 NTD-RBD linked to VSVGct ( NTD-RBD-VSVGct) SEQ ID NO: 108 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 109 and 3' UTR SEQ ID NO: 4. 108 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAGCAGCAUCGCCAGCUUCUUCUUCAUCAUCGGGCUGAUCAUCGGCCUCUUCCUGGUGCUGCGGGUGGGCAUCCACCUGUGCAUCAAGCUGAAGCACACCAAGAAGAGACAGAUCUACACCGACAUCGAGAUGAACCGGCUGGGCAAG 109 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsSSIASFFFIIGLIIGLFLVLRVGIHLCIKLKHTKKRQIYTDIEMNRLGK 110 Poly A tail 100nt Domain fusion C- terminal truncation

在其他實施例中,本文提供之mRNA編碼SARS-CoV-2 NTD-RBD融合蛋白,其中C末端結構域之某一部分已經截短/缺失。在一個實施例中,NTD-RBD融合蛋白之C末端結構域缺失13個(或至少13個)胺基酸。預計該等胺基酸之缺失會增加抗原決定基於抗體中之暴露,從而刺激對NTD及RBD結構域上存在之保護性抗原決定基之更强之免疫反應。In other embodiments, the mRNA provided herein encodes a SARS-CoV-2 NTD-RBD fusion protein in which a certain portion of the C-terminal domain has been truncated/deleted. In one embodiment, 13 (or at least 13) amino acids are deleted from the C-terminal domain of the NTD-RBD fusion protein. Deletion of these amino acids is expected to increase epitope exposure in the antibody, thereby stimulating a stronger immune response to the protective epitopes present on the NTD and RBD domains.

具有C末端截短之SARS-CoV-2結構域融合抗原及編碼其之mRNA之非限制性實例提供於下文 11A 11B中。 11A. 結構域融合 C 末端截短 名稱 SEQ ID NO: mRNA ORF 蛋白質 具有13個胺基酸之C末端截短之SARS-CoV-2 NTD-RBD (NTD-RBD-∆13) 106 107 11B. 結構域融合 C 末端截短 具有 13 個胺基酸之 C 末端截短之 SARS-CoV-2 NTD-RBD (NTD-RBD- 13) SEQ ID NO: 105自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 106及3' UTR SEQ ID NO: 4。 105 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGAC 106 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiysLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDD 107 聚A尾 100 nt    結構域延伸 Non-limiting examples of SARS-CoV-2 domain fusion antigens with C-terminal truncations and mRNAs encoding them are provided in Table 11A and Table 11B below. Table 11A. Domain fusion C- terminal truncation name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD-RBD with C-terminal truncation of 13 amino acids (NTD-RBD-∆13) 106 107 Table 11B. Domain fusion C- terminal truncation SARS-CoV-2 NTD-RBD with C - terminal truncation of 13 amino acids (NTD-RBD- 13) SEQ ID NO: 105 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 106 and 3' UTR SEQ ID NO: 4. 105 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGAC 106 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiysLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDD 107 Poly A tail 100nt domain extension

在一些實施例中,SARS-CoV-2 S蛋白結構域抗原包括「延伸」區,其包括與本業內理解之NTD結構域或RBD結構域相鄰及/或側接之序列。RBD_EXT系列涵蓋SD1 (子結構域1)。NTD_EXT系列涵蓋NTD中之C末端螺旋。一些B細胞及抗體識別僅在正確摺疊而非變性形式之SARS-CoV-2 S蛋白NTD及RBD中發現之構象抗原決定基。包括與NTD及RBD結構域相鄰及/或側接之序列不僅可為抗原提供額外B細胞抗原決定基,且可潜在地達成彼等結構域之更優化摺疊,且用對可在任一結構域邊緣上發現之抗原決定基特異之抗體刺激B細胞。此外,包括該等延伸序列可因此增加NTD或RBD與表現細胞膜之間之距離,從而增加兩個結構域於抗體中之暴露,若表現之蛋白質太靠近細胞表面,則抗體結合效率較低。最後,包括延伸序列會增加肽之集合,該等肽可潜在地由識別NTD或RBD抗原決定基之B細胞呈遞給CD4 +T細胞,接著加工完整蛋白用於抗原呈遞,從而增加NTD或RBD特異性B細胞接受足够T細胞輔助之機會。SARS-CoV-2結構域延伸及編碼其之mRNA之非限制性實例提供於下文 12A 12B中。 12. 結構域延伸 名稱 SEQ ID NO:    mRNA ORF 蛋白質 SARS-CoV-2 NTD DS延伸區連接至跨膜結構域(NTD-EXT-F43C-TM) 52 53 SARS-CoV-2 NTD DS延伸區連接至跨膜結構域(NTD-F43C-EXT-TM) 55 56 SARS-CoV-2 NTD延伸區連接至跨膜結構域(NTD-EXT-TM) 58 59 SARS-CoV-2 RBD延伸區連接至跨膜結構域(RBD-EXT-TM) 85 86 SARS-CoV-2 RBD DS延伸區連接至跨膜結構域(RBD-Q563D-EXT-TM) 88 89 SARS-CoV-2 NTD-RBD延伸區連接至跨膜結構域(NTD-RBD-EXT-TM) 115 116 SARS-CoV-2 NTD延伸區-RBD連接至跨膜結構域(NTD-EXT-RBD-TM) 118 119 SARS-CoV-2 NTD延伸區-RBD-延伸區連接至跨膜結構域(NTD-EXT-RBD-EXT-TM) 121 122 12B. 結構域延伸 SARS-CoV-2 NTD DS 延伸區連接至跨膜結構域 (NTD-EXT-F43C-TM) SEQ ID NO: 51自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 52及3' UTR SEQ ID NO: 4。 51 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 52 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsgggsilaiystvasslvllvslgaisf 53 聚A尾 100 nt    SARS-CoV-2 NTD DS 延伸區連接至跨膜結構域 (NTD-F43C-EXT-TM) SEQ ID NO: 54自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 55及3' UTR SEQ ID NO: 4。 54 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 55 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTsgggsilaiystvasslvllvslgaisf 56 聚A尾 100 nt    SARS-CoV-2 NTD 延伸區連接至跨膜結構域 (NTD-EXT-TM) SEQ ID NO: 57自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 58及3' UTR SEQ ID NO: 4。 57 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 58 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTsgggsilaiystvasslvllvslgaisf 59 聚A尾 100 nt    SARS-CoV-2 RBD 延伸區連接至跨膜結構域 (RBD-EXT-TM) SEQ ID NO: 84自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 85及3' UTR SEQ ID NO: 4。 84 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 85 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 mysmqlascvtltlvllvnsQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 86 聚A尾 100 nt    SARS-CoV-2 RBD DS 延伸區連接至跨膜結構域 (RBD-Q563D-EXT-TM) SEQ ID NO: 87自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 88及3' UTR SEQ ID NO: 4。 87 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUUGCCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 88 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 mysmqlascvtltlvllvnsQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFCQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 89 聚A尾 100 nt    SARS-CoV-2 NTD-RBD 延伸區連接至跨膜結構域 (NTD-RBD-EXT-TM) SEQ ID NO: 114自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 115及3' UTR SEQ ID NO: 4。 114 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 115 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 116 聚A尾 100 nt    SARS-CoV-2 NTD 延伸區 -RBD 連接至跨膜結構域 (NTD-EXT-RBD-TM) SEQ ID NO: 117自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 118及3' UTR SEQ ID NO: 4。 117 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 118 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisf 119 聚A尾 100 nt    SARS-CoV-2 NTD 延伸區 -RBD- 延伸區連接至跨膜結構域 (NTD-EXT-RBD-EXT-TM) SEQ ID NO: 120自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 121及3' UTR SEQ ID NO: 4。 120 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGGAGGCGGAUCGGGAGGCGGACAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 121 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTgggsgggQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 122 聚A尾 100 nt    結構域混合物 In some embodiments, the SARS-CoV-2 S protein domain antigen includes an "extended" region that includes sequences adjacent to and/or flanking an NTD or RBD domain as understood in the art. The RBD_EXT family covers SD1 (subdomain 1). The NTD_EXT series covers the C-terminal helix in NTD. Some B cells and antibodies recognize conformational epitopes found only in properly folded, non-denatured forms of the SARS-CoV-2 S protein NTD and RBD. Including sequences adjacent to and/or flanking the NTD and RBD domains can not only provide additional B-cell epitopes to the antigen, but can potentially achieve more optimal folding of those domains, and use pairs that can be used in either domain. Antibodies specific for epitopes found on the edge stimulate B cells. In addition, the inclusion of these extended sequences can thus increase the distance between the NTD or RBD and the expressed cell membrane, thereby increasing the exposure of both domains to the antibody, which binds less efficiently if the expressed protein is too close to the cell surface. Finally, the inclusion of extended sequences increases the collection of peptides that can potentially be presented to CD4 + T cells by B cells recognizing NTD or RBD epitopes, followed by processing of the intact protein for antigen presentation, thereby increasing NTD or RBD specificity Sexual B cells receive adequate T cell help. Non-limiting examples of SARS-CoV-2 domain extensions and mRNAs encoding them are provided in Table 12A and Table 12B below. Table 12. Domain extensions name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD DS extension linked to transmembrane domain (NTD-EXT-F43C-TM) 52 53 SARS-CoV-2 NTD DS extension linked to transmembrane domain (NTD-F43C-EXT-TM) 55 56 SARS-CoV-2 NTD extension linked to transmembrane domain (NTD-EXT-TM) 58 59 SARS-CoV-2 RBD extension linked to transmembrane domain (RBD-EXT-TM) 85 86 SARS-CoV-2 RBD DS extension linked to transmembrane domain (RBD-Q563D-EXT-TM) 88 89 SARS-CoV-2 NTD-RBD extension linked to transmembrane domain (NTD-RBD-EXT-TM) 115 116 SARS-CoV-2 NTD extension-RBD linked to transmembrane domain (NTD-EXT-RBD-TM) 118 119 SARS-CoV-2 NTD extension-RBD- extension linked to transmembrane domain (NTD-EXT-RBD-EXT-TM) 121 122 Table 12B. Domain extensions SARS-CoV-2 NTD DS extension linked to transmembrane domain (NTD-EXT-F43C-TM) SEQ ID NO: 51 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 52 and 3' UTR SEQ ID NO: 4. 51 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 52 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsgggsilaiystvasslvllvslgaisf 53 Poly A tail 100nt SARS-CoV-2 NTD DS extension linked to transmembrane domain (NTD-F43C-EXT-TM) SEQ ID NO: 54 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 55 and 3' UTR SEQ ID NO: 4. 54 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUGCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCA GCUUC 55 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVCRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTsgggsilaiystvasslvllvslgaisf 56 Poly A tail 100nt SARS-CoV-2 NTD extension linked to transmembrane domain (NTD-EXT-TM) SEQ ID NO: 57 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 58 and 3' UTR SEQ ID NO: 4. 57 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCA GCUUC 58 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTsgggsilaiystvasslvllvslgaisf 59 Poly A tail 100nt SARS-CoV-2 RBD extension linked to transmembrane domain (RBD-EXT-TM) SEQ ID NO: 84 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 85 and 3' UTR SEQ ID NO: 4. 84 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 85 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence mysmqlascvtltlvllvnsQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 86 Poly A tail 100nt SARS-CoV-2 RBD DS extension linked to transmembrane domain (RBD-Q563D-EXT-TM) SEQ ID NO: 87 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 88 and 3' UTR SEQ ID NO: 4. 87 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUUGCCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 88 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence mysmqlascvtltlvllvnsQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFCQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 89 Poly A tail 100nt SARS-CoV-2 NTD-RBD extension linked to transmembrane domain (NTD-RBD-EXT-TM) SEQ ID NO: 114 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 115 and 3' UTR SEQ ID NO: 4. 114 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGA ACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 115 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 116 Poly A tail 100nt SARS-CoV-2 NTD extension - RBD linked to transmembrane domain (NTD-EXT-RBD-TM) SEQ ID NO: 117 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 118 and 3' UTR SEQ ID NO: 4. 117 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCA GCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 118 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisf 119 Poly A tail 100nt SARS-CoV-2 NTD extension - RBD- extension linked to transmembrane domain (NTD-EXT-RBD-EXT-TM) SEQ ID NO: 120 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 121 and 3' UTR SEQ ID NO: 4. 120 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGGAGGCGGAUCGGGAGGCGGACAGCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCC CCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 121 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTgggsgggQRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSsgggsilaiystvasslvllvslgaisf 122 Poly A tail 100nt domain mixture

在一些態樣中,本揭示案提供包含編碼SARS-CoV-2 S蛋白子結構域之mRNA之混合物的組合物。在一個實例中,組合物包含編碼NTD (有或無SD1、SD2及/或跨膜結構域)之mRNA及編碼RBD (有或無SD1、SD2及/或跨膜結構域)之mRNA之混合物。在一些實施例中,組合物包含編碼與跨膜結構域連接之NTD (例如SEQ ID NO: 47)之mRNA (例如SEQ ID NO: 45或46)及編碼與跨膜結構域連接之RBD (例如SEQ ID NO: 77)之mRNA (例如SEQ ID NO: 75或76)。In some aspects, the present disclosure provides compositions comprising mixtures of mRNAs encoding SARS-CoV-2 S protein subdomains. In one example, the composition comprises a mixture of mRNA encoding NTD (with or without SD1, SD2 and/or transmembrane domains) and mRNA encoding RBD (with or without SD1, SD2 and/or transmembrane domains). In some embodiments, the composition comprises an mRNA (e.g., SEQ ID NO: 45 or 46) encoding an NTD (e.g., SEQ ID NO: 47) linked to a transmembrane domain and an RBD (e.g., SEQ ID NO: 45 or 46) encoding a transmembrane domain SEQ ID NO: 77) mRNA (eg, SEQ ID NO: 75 or 76).

組合物中一種mRNA對另一種mRNA之濃度之比率可為1:1 (50:50)、1:2、1:3、1:4或1:5。在一些實施例中,該比率為1:1。舉例而言,組合物可包含1:1比率之編碼與跨膜結構域連接之NTD (例如SEQ ID NO: 47)之mRNA (例如SEQ ID NO: 45或46)對編碼與跨膜結構域連接之RBD (例如SEQ ID NO: 77)之mRNA (例如SEQ ID NO: 75或76)。在一些實施例中,該比率為1:2。舉例而言,組合物可包含1:2比率之編碼與跨膜結構域連接之NTD (例如SEQ ID NO: 47)之mRNA (例如SEQ ID NO: 45或46)對編碼與跨膜結構域連接之RBD (例如SEQ ID NO: 77)之mRNA (例如SEQ ID NO: 75或76)。作為另一實例,組合物可包含1:2比率之編碼與跨膜結構域連接之RBD (例如,SEQ ID NO: 77)之mRNA (例如SEQ ID NO: 75或76)對編碼與跨膜結構域連接之NTD (例如,SEQ ID NO: 47)之mRNA(例如,SEQ ID NO: 45或46)。編碼不同抗原之不同mRNA可刺激不同強度之免疫反應(Magini D等人,PLoS ONE.2016; 11:e0161193),且投與等莫耳比率之編碼兩種不同抗原之兩種mRNA可引起對一者而非另一者之免疫反應(John S等人, Vaccine.2018; 36:1689-1699)。對共遞送之mRNA之比率之操縱可用於引發以相等功效靶向期望抗原之廣泛免疫反應。 編碼之奈米顆粒抗原 The ratio of the concentration of one mRNA to the other mRNA in the composition may be 1:1 (50:50), 1:2, 1:3, 1:4 or 1:5. In some embodiments, the ratio is 1:1. For example, a composition may comprise a 1:1 ratio of mRNA (eg, SEQ ID NO: 45 or 46) encoding an NTD linked to a transmembrane domain (eg, SEQ ID NO: 47) to a pair encoding the NTD linked to the transmembrane domain mRNA (eg, SEQ ID NO: 75 or 76) of the RBD (eg, SEQ ID NO: 77). In some embodiments, the ratio is 1:2. For example, a composition may comprise a 1:2 ratio of mRNA (eg, SEQ ID NO: 45 or 46) encoding an NTD linked to a transmembrane domain (eg, SEQ ID NO: 47) to a pair encoding the NTD linked to the transmembrane domain mRNA (eg, SEQ ID NO: 75 or 76) of the RBD (eg, SEQ ID NO: 77). As another example, a composition may comprise a 1:2 ratio of mRNA (e.g., SEQ ID NO: 75 or 76) encoding an RBD (e.g., SEQ ID NO: 77) linked to the transmembrane domain to the encoding and transmembrane structure mRNA (eg, SEQ ID NO: 45 or 46) of a domain-linked NTD (eg, SEQ ID NO: 47). Different mRNAs encoding different antigens can stimulate immune responses of different strengths (Magini D et al., PLoS ONE. 2016; 11:e0161193), and administration of two mRNAs encoding two different antigens in equimolar ratios can elicit a one-to-one response immune response to one but not the other (John S et al., Vaccine. 2018; 36:1689-1699). Manipulation of the ratio of co-delivered mRNAs can be used to elicit a broad immune response targeting the desired antigen with equal efficacy. Encoded Nanoparticle Antigen

在一些實施例中,本文提供之mRNA疫苗編碼包含與支架結構域連接之冠狀病毒抗原之融合蛋白。在一些實施例中,支架結構域賦予由本揭示案之mRNA編碼之抗原期望之性質。舉例而言,支架結構域可例如藉由改變抗原之結構、改變抗原之攝取及加工及/或使抗原結合至另一分子來改善抗原之免疫原性。在一些實施例中,與抗原連接之支架結構域促進抗原自組裝成病毒奈米顆粒或更大之蛋白質摺疊免疫原。可如本文提供使用之支架結構域之非限制性實例包括鐵蛋白結構域、二氧四氫蝶啶合成酶結構域、摺疊子結構域及包封蛋白(encapsulin)結構域。可使用其他支架結構域。 鐵蛋白 In some embodiments, the mRNA vaccines provided herein encode fusion proteins comprising a coronavirus antigen linked to a scaffold domain. In some embodiments, the scaffold domain confers desirable properties to the antigen encoded by the mRNA of the present disclosure. For example, scaffold domains can improve the immunogenicity of an antigen, eg, by altering the structure of the antigen, altering the uptake and processing of the antigen, and/or binding the antigen to another molecule. In some embodiments, the scaffold domain linked to the antigen promotes self-assembly of the antigen into viral nanoparticles or larger protein-folded immunogens. Non-limiting examples of scaffold domains that can be used as provided herein include ferritin domains, dioxytetrahydropteridine synthase domains, foldon domains, and encapsulin domains. Other scaffold domains can be used. Ferritin

在一些實施例中,鐵蛋白結構域用作支架結構域。鐵蛋白係主要功能為在細胞內儲存鐵之蛋白質。鐵蛋白由二十四(24)個次單元組成,每個次單元由自組裝成具有八面體對稱性之四級結構之四α螺旋束構成(Cho K. J.等人, J Mol Biol.2009;390: 83-98;Granier T.等人, J Biol Inorg Chem.2003;8: 105-111;及Lawson D.M.等人, Nature.1991;349: 541-544)。鐵蛋白自組裝成具有强的熱穩定性及化學穩定性之奈米顆粒。預計以此方式將抗原包封在鐵蛋白奈米顆粒內既會延遲抗原之降解亦會聚集個別抗原,其中每個奈米顆粒含有二十四(24)個抗原次單元。相同抗原之多個拷貝之聚集增强由樹突細胞之抗原攝取及遷移,以及更强之CD4 +及CD8 +T細胞反應(Kastenmüller K等人, J Clin Invest.2011; 121(5):1782-96)。因此,鐵蛋白奈米顆粒係非常適合抗原呈遞及疫苗開發之平臺。 In some embodiments, a ferritin domain is used as a scaffold domain. Ferritin is a protein whose main function is to store iron in cells. Ferritin consists of twenty-four (24) subunits, each consisting of four alpha helical bundles that self-assemble into a quaternary structure with octahedral symmetry (Cho KJ et al., J Mol Biol. 2009; 390: 83-98; Granier T. et al, J Biol Inorg Chem. 2003; 8: 105-111; and Lawson DM et al, Nature. 1991; 349: 541-544). Ferritin self-assembles into nanoparticles with strong thermal and chemical stability. Encapsulation of antigens within ferritin nanoparticles in this manner is expected to both delay degradation of the antigens and aggregate individual antigens, each nanoparticle containing twenty-four (24) antigenic subunits. Aggregation of multiple copies of the same antigen enhances antigen uptake and migration by dendritic cells, and stronger CD4 + and CD8 + T cell responses (Kastenmüller K et al, J Clin Invest. 2011; 121(5):1782-96 ). Therefore, ferritin nanoparticles are very suitable platforms for antigen presentation and vaccine development.

在一些實施例中,本文提供之mRNA編碼例如經由甘胺酸(例如,GGG)連接體結構域與鐵蛋白結構域連接之RBD。可使用其他連接體。In some embodiments, the mRNA provided herein encodes an RBD linked to a ferritin domain, eg, via a glycine (eg, GGG) linker domain. Other linkers can be used.

在其他實施例中,本文提供之mRNA編碼例如經由甘胺酸(例如,GGG)連接體與鐵蛋白結構域連接之S蛋白之S1結構域。如本文中別處指示,可使用其他連接體。In other embodiments, the mRNA provided herein encodes the S1 domain of the S protein, eg, linked to the ferritin domain via a glycine (eg, GGG) linker. Other linkers can be used as indicated elsewhere herein.

與鐵蛋白結構域連接之SARS-CoV-2抗原及編碼其之mRNA之非限制性實例提供於下文 13A 13B中。 13A. 與鐵蛋白結構域連接之抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與鐵蛋白連接之SARS-CoV-2 S1次單元(S1-鐵蛋白) 7 8 與鐵蛋白連接之SARS-CoV-2 RBD (RBD-鐵蛋白) 64 65 13B. 與鐵蛋白結構域連接之抗原 與鐵蛋白連接之SARS-CoV-2 S1次單元(S1-鐵蛋白) SEQ ID NO: 6自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 7及3' UTR SEQ ID NO: 4。 6 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAGGAGGAGGCAGCGGCGGCGAUAUCAUCAAGCUUCUGAACGAGCAAGUUAACAAGGAAAUGCAGAGCAGUAAUCUCUACAUGAGCAUGAGCAGCUGGUGCUACACCCACUCCCUGGACGGAGCAGGCCUCUUCCUGUUCGACCACGCAGCCGAGGAGUACGAGCACGCUAAGAAGUUGAUCAUUUUCUUGAACGAGAACAACGUGCCCGUGCAGCUAACGUCAAUCAGCGCACCUGAGCACAAGUUCGAGGGCCUGACCCAGAUCUUCCAGAAGGCCUACGAACACGAACAGCACAUCUCCGAGAGCAUCAACAAUAUUGUGGAUCACGCUAUCAAGUCCAAGGACCACGCUACCUUCAACUUCCUGCAGUGGUACGUGGCCGAGCAACAUGAGGAGGAGGUGCUGUUCAAGGACAUCCUGGACAAGAUCGAGCUGAUCGGUAAUGAGAAUCACGGCCUGUACCUGGCCGACCAGUACGUGAAGGGCAUCGCCAAGAGCCGGAAGUCAGGCUCA 7 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSgggSGGDIIKLLNEQVNKEMQSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKGIAKSRKSGS 8 聚A尾 100 nt    與鐵蛋白連接之 SARS-CoV-2 RBD (RBD- 鐵蛋白 ) SEQ ID NO: 63自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 64及3' UTR SEQ ID NO: 4。 63 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGGGAGGAGGCAGCGGCGGCGAUAUCAUCAAGCUUCUGAACGAGCAAGUUAACAAGGAAAUGCAGAGCAGUAAUCUCUACAUGAGCAUGAGCAGCUGGUGCUACACCCACUCCCUGGACGGAGCAGGCCUCUUCCUGUUCGACCACGCAGCCGAGGAGUACGAGCACGCUAAGAAGUUGAUCAUUUUCUUGAACGAGAACAACGUGCCCGUGCAGCUAACGUCAAUCAGCGCACCUGAGCACAAGUUCGAGGGCCUGACCCAGAUCUUCCAGAAGGCCUACGAACACGAACAGCACAUCUCCGAGAGCAUCAACAAUAUUGUGGAUCACGCUAUCAAGUCCAAGGACCACGCUACCUUCAACUUCCUGCAGUGGUACGUGGCCGAGCAACAUGAGGAGGAGGUGCUGUUCAAGGACAUCCUGGACAAGAUCGAGCUGAUCGGUAAUGAGAAUCACGGCCUGUACCUGGCCGACCAGUACGUGAAGGGCAUCGCCAAGAGCCGGAAGUCAGGCUCA 64 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKGGGSGGDIIKLLNEQVNKEMQSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKGIAKSRKSGS 65 聚A尾 100 nt    二氧四氫蝶啶合成酶 Non-limiting examples of SARS-CoV-2 antigens linked to ferritin domains and mRNAs encoding them are provided in Table 13A and Table 13B below. Table 13A. Antigens linked to ferritin domains name SEQ ID NO: mRNA ORF protein SARS-CoV-2 S1 subunit linked to ferritin (S1-ferritin) 7 8 SARS-CoV-2 RBD linked to ferritin (RBD-ferritin) 64 65 Table 13B. Antigens linked to ferritin domains SARS-CoV-2 S1 subunit linked to ferritin (S1-ferritin) SEQ ID NO: 6 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 7 and 3' UTR SEQ ID NO: 4. 6 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAGGAGGAGGCAGCGGCGGCGAUAUCAUCAAGCUUCUGAACGAGCAAGUUAACAAGGAAAUGCAGAGCAGUAAUCUCUACAUGAGCAUGAGCAGCUGGUGCUACACCCACUCCCUGGACGGAGCAGGCCUCUUCCUGUUCGACCACGCAGCCGAGGAGUACGAGCACGCUAAGAAGUUGAUCAUUUUCUUGAACGAGAACAACGUGCCCGUGCAGCUAACGUCAAUCAGCGCACCUGAGCACAAGUUCGAGGGCCUGACCCAGAUCUUCCAGAAGGCCUACGAACACGAACAGCACAUCUCCGAGAGCAUCAACAAUAUUGUGGAUCACGCUAUCAAGUCCAAGGACCACGCUACCUUCAACUUCCUGCAGUGGUACGUGGCCGAGCAACAUGAGGAGGAGGUGCUGUUCAAGGACAUCCUGGACAAGAUCGAGCUGAUCGGUAAUGAGAAUCACGGCCUGUACCUGGCCGACCAGUACGUGAAGGGCAUCGCCAAGAGCCGGAAGUCAGGCUCA 7 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSgggSGGDIIKLLNEQVNKEMQSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKGIAKSRKSGS 8 Poly A tail 100nt SARS-CoV-2 RBD linked to ferritin (RBD - ferritin ) SEQ ID NO: 63 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 64 and 3' UTR SEQ ID NO: 4. 63 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGGGAGGAGGCAGCGGCGGCGAUAUCAUCAAGCUUCUGAACGAGCAAGUUAACAAGGAAAUGCAGAGCAGUAAUCUCUACAUGAGCAUGAGCAGCUGGUGCUACACCCACUCCCUGGACGGAGCAGGCCUCUUCCUGUUCGACCACGCAGCCGAGGAGUACGAGCACGCUAAGAAGUUGAUCAUUUUCUUGAACGAGAACAACGUGCCCGUGCAGCUAACGUCAAUCAGCGCACCUGAGCACAAGUUCGAGGGCCUGACCCAGAUCUUCCAGAAGGCCUACGAACACGAACAGCACAUCUCCGAGAGCAUCAACAAUAUUGUGGAUCACGCUAUCAAGUCCA AGGACCACGCUACCUUCAACUUCCUGCAGUGGUACGUGGCCGAGCAACAUGAGGAGGAGGUGCUGUUCAAGGACAUCCUGGACAAGAUCGAGCUGAUCGGUAAUGAGAAUCACGGCCUGUACCUGGCCGACCAGUACGUGAAGGGCAUCGCCAAGAGCCGGAAGUCAGGCUCA 64 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKGGGSGGDIIKLLNEQVNKEMQSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKFEGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKGIAKSRKSGS 65 Poly A tail 100nt dioxytetrahydropteridine synthase

在一些實施例中,二氧四氫蝶啶合成酶結構域用作支架結構域。二氧四氫蝶啶合成酶係一種在多種生物體(包括古細菌、細菌、真菌、植物及真細菌)中負責核黃素生物合成之倒數第二個催化步驟之酶。二氧四氫蝶啶合成酶由同寡聚體構成,該等同寡聚體根據其來源之種類而大小及次單元數不同,包括五聚體、十聚體及二十面六十聚體(icosahedral sixty-mer)。二氧四氫蝶啶合成酶單體長度為150個胺基酸,且包括具有側翼串聯α-螺旋之β-摺疊。已報導二氧四氫蝶啶合成酶之不同四級結構,闡釋其形態多樣性:自同五聚體至形成150 Å直徑之衣殼之十二(12)個五聚體之對稱裝配。在二氧四氫蝶啶合成酶表面之抗原呈遞導致以有序陣列展示之抗原之高局部濃度。該等重複結構使得B細胞受體能够交聯,且經由親合效應導致强免疫反應。In some embodiments, a dioxytetrahydropteridine synthase domain is used as a scaffold domain. Dioxytetrahydropteridine synthase is an enzyme responsible for the penultimate catalytic step of riboflavin biosynthesis in a variety of organisms including archaea, bacteria, fungi, plants and eubacteria. Dioxytetrahydropteridine synthase is composed of homo-oligomers that vary in size and number of subunits according to the type of source, including pentamers, decamers, and icosahedral hexamers ( icosahedral sixty-mer). The dioxytetrahydropteridine synthase monomer is 150 amino acids in length and includes a β-sheet with flanking tandem α-helices. Different quaternary structures of dioxytetrahydropteridine synthases have been reported, illustrating their morphological diversity: symmetrical assembly of twelve (12) pentamers ranging from homopentamers to form 150 Å diameter capsids. Antigen presentation on the surface of dioxytetrahydropteridine synthase results in high local concentrations of antigen displayed in ordered arrays. These repeating structures enable cross-linking of B cell receptors and lead to strong immune responses via affinity effects.

在一些實施例中,本文提供之mRNA編碼(例如)經由甘胺酸-絲胺酸(例如,GGS)與二氧四氫蝶啶合成酶結構域連接之RBD。可使用其他連接體。In some embodiments, the mRNA provided herein encodes an RBD linked to a dioxytetrahydropteridine synthase domain, eg, via a glycine-serine (eg, GGS). Other linkers can be used.

在其他實施例中,本文提供之mRNA編碼(例如)經由甘胺酸-絲胺酸(例如,GGS)連接體與二氧四氫蝶啶合成酶結構域連接之S蛋白之S1結構域。如本文中別處指示,可使用其他連接體。In other embodiments, the mRNA provided herein encodes the S1 domain of the S protein linked, eg, via a glycine-serine (eg, GGS) linker to a dioxytetrahydropteridine synthase domain. Other linkers can be used as indicated elsewhere herein.

與摺疊子結構域連接之SARS-CoV-2抗原及編碼其之mRNA之非限制性實例提供於下文 14A 14B中。 14A. 與二氧四氫蝶啶合成酶結構域連接之抗原 名稱 SEQ ID NO: 與二氧四氫蝶啶合成酶C末端連接之SARS-CoV-2可溶性S1 (LS-S1) 10 11 與二氧四氫蝶啶合成酶N末端連接之SARS-CoV-2可溶性S1 (S1-LS) 13 14 與二氧四氫蝶啶合成酶C末端連接之SARS-CoV-2 RBD (LS-RBD) 67 68 與二氧四氫蝶啶合成酶N末端連接之SARS-CoV-2 RBD (RBD-LS) 70 71 14B. 與二氧四氫蝶啶合成酶結構域連接之抗原 與二氧四氫蝶啶合成酶C末端連接之SARS-CoV-2可溶性S1 (LS-S1) SEQ ID NO: 9自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 10及3' UTR SEQ ID NO: 4。 9 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGGGCAUCCUGCCCAGCCCUGGCAUGCCCGCUCUGCUGAGCCUGGUGAGCCUGCUGAGCGUGCUGCUGAUGGGCUGCGUGGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGGGGCGGAAGUGGAGGCUCUGGUGGCAGCGGAGGAUCUGGCGGCGGCACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 10 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MGILPSPGMPALLSLVSLLSVLLMGCVAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLRGGSGGSGGSGGSGGGTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 11 聚A尾 100 nt    與二氧四氫蝶啶合成酶N末端連接之SARS-CoV-2可溶性S1 (S1-LS) SEQ ID NO: 12自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 13及3' UTR SEQ ID NO: 4。 12 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAGGAGGAGGCUCCGGAGGCGGUAGCGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGG 13 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSgggsgggsAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLR 14 聚A尾 100 nt    與二氧四氫蝶啶合成酶 C 末端連接之 SARS-CoV-2 RBD (LS-RBD) SEQ ID NO: 66自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 67及3' UTR SEQ ID NO: 4。 66 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGGGCAUCCUGCCCAGCCCUGGCAUGCCCGCUCUGCUGAGCCUGGUGAGCCUGCUGAGCGUGCUGCUGAUGGGCUGCGUGGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGGGGCGGAAGUGGAGGCUCUGGUGGCAGCGGAGGAUCUGGCGGCGGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAG 67 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MGILPSPGMPALLSLVSLLSVLLMGCVAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLRGGSGGSGGSGGSGGGQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPK 68 聚A尾 100 nt    與二氧四氫蝶啶合成酶 N 末端連接之 SARS-CoV-2 RBD (RBD-LS) SEQ ID NO: 69自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 70及3' UTR SEQ ID NO: 4。 69 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGGGAGGAGGCUCCGGAGGCGGUAGCGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGG 70 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKgggsgggsAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLR 71 聚A尾 100 nt    摺疊子 Non-limiting examples of SARS-CoV-2 antigens linked to the foldon domains and mRNAs encoding them are provided in Table 14A and Table 14B below. Table 14A. Antigens linked to dioxytetrahydropteridine synthase domains name SEQ ID NO: SARS-CoV-2 soluble S1 (LS-S1) linked to the C-terminus of dioxytetrahydropteridine synthase 10 11 SARS-CoV-2 soluble S1 (S1-LS) linked to the N-terminus of dioxytetrahydropteridine synthase 13 14 SARS-CoV-2 RBD linked to the C-terminus of dioxytetrahydropteridine synthase (LS-RBD) 67 68 SARS-CoV-2 RBD linked to the N-terminus of dioxytetrahydropteridine synthase (RBD-LS) 70 71 Table 14B. Antigens linked to dioxytetrahydropteridine synthase domains SARS-CoV-2 soluble S1 (LS-S1) linked to the C-terminus of dioxytetrahydropteridine synthase SEQ ID NO: 9 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 10 and 3' UTR SEQ ID NO: 4. 9 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGGGCAUCCUGCCCAGCCCUGGCAUGCCCGCUCUGCUGAGCCUGGUGAGCCUGCUGAGCGUGCUGCUGAUGGGCUGCGUGGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGGGGCGGAAGUGGAGGCUCUGGUGGCAGCGGAGGAUCUGGCGGCGGCACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCU GGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGG CUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCA 10 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MGILPSPGMPALLSLVSLLSVLLMGCVAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLRGGSGGSGGSGGSGGGTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNS 11 Poly A tail 100nt SARS-CoV-2 soluble S1 (S1-LS) linked to the N-terminus of dioxytetrahydropteridine synthase SEQ ID NO: 12 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 13 and 3' UTR SEQ ID NO: 4. 12 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCAGGAGGAGGCUCCGGAGGCGGUAGCGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGG 13 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSgggsgggsAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLR 14 Poly A tail 100nt SARS-CoV-2 RBD linked to the C - terminus of dioxytetrahydropteridine synthase (LS-RBD) SEQ ID NO: 66 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 67 and 3' UTR SEQ ID NO: 4. 66 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGGGCAUCCUGCCCAGCCCUGGCAUGCCCGCUCUGCUGAGCCUGGUGAGCCUGCUGAGCGUGCUGCUGAUGGGCUGCGUGGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCGACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGGGGCGGAAGUGGAGGCUCUGGUGGCAGCGGAGGAUCUGGCGGCGGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGA AGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUUGUGUGGCCCCAAG 67 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MGILPSPGMPALLSLVSLLSVLLMGCVAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLRGGSGGSGGSGGSGGGQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPK 68 Poly A tail 100nt SARS-CoV-2 RBD linked to the N - terminus of dioxytetrahydropteridine synthase (RBD-LS) SEQ ID NO: 69 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 70 and 3' UTR SEQ ID NO: 4. 69 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGGGAGGAGGCUCCGGAGGCGGUAGCGCUGAGACCGGCAUGCAGAUCUACGAGGGCAAGCUGACCGCAGAGGGCCUGCGGUUCGGCAUCGUGGCCAGCCGCGCCAACCACGCUCUGGUGGACCGGCUUGUGGAGGGCGCUAUCGACGCCAUCGUGAGACACGGCGGCCGGGAAGAGGACAUCACCCUGGUGCGGGUGUGCGGCAGCUGGGAGAUUCCCGUCGCCGCCGGAGAACUGGCCCGGAAGGAGGACAUCGACGCCGUGAUCGCCAUCGGCGUGCUGUGCAGAGGCGCCACGCCCAGCUUCGACUACAUCGCCAGCGAGGUGAGCAAGGGCCUGGCCG ACCUGAGCCUGGAGCUGCGGAAGCCCAUCACCUUCGGCGUGAUCACCGCCGACACCCUGGAGCAGGCCAUCGAGGCCGCAGGCACCUGCCACGGCAACAAGGGCUGGGAAGCCGCCCUGUGCGCCAUCGAGAUGGCCAACCUGUUCAAGAGCCUGCGG 70 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MYSMQLASCVTLTLVLLVNSQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKgggsgggsAETGMQIYEGKLTAEGLRFGIVASRANHALVDRLVEGAIDAIVRHGGREEDITLVRVCGSWEIPVAAGELARKEDIDAVIAIGVLCRGATPSFDYIASEVSKGLADLSLELRKPITFGVITADTLEQAIEAAGTCHGNKGWEAALCAIEMANLFKSLR 71 Poly A tail 100nt folder

在一些實施例中,摺疊子結構域用作支架結構域。T4彈力素之C末端結構域(摺疊子)對於形成彈力素三聚體結構係必需的,且可用作人工三聚化結構域(參見例如Meier S.等人, Journal of Molecular Biology2004年12月3日;344(4): 1051-1069;Tao Y等人, Structure1997年6月15日;5(6):789-98)。當與S蛋白胞外結構域融合時,摺疊子結構域促進S蛋白之正確三聚化,從而避免蛋白質之錯誤摺疊。導致S蛋白融合前構象產生之該過程導致表現、構象均質性增加,且引發強效中和抗體反應。 In some embodiments, the Foldon domain is used as a scaffold domain. The C-terminal domain (foldon) of T4 elastin is necessary for the formation of the elastin trimer structure and can be used as an artificial trimerization domain (see eg Meier S. et al, Journal of Molecular Biology 2004 12 Jun 3;344(4):1051-1069; Tao Y et al. Structure 1997 Jun 15;5(6):789-98). When fused to the S protein extracellular domain, the Foldon domain promotes correct trimerization of the S protein, thereby avoiding protein misfolding. This process leading to the production of the prefusion conformation of the S protein results in increased expression, conformational homogeneity, and elicits a potent neutralizing antibody response.

不受限於理論,認為此組態將導致摺疊子在蛋白質之細胞內區上在很大程度上免疫原性沈默。與摺疊子結構域連接之SARS-CoV-2抗原及編碼其之mRNA之非限制性實例提供於下文 15A 15B中。 15A. 與摺疊子結構域連接之抗原 名稱 SEQ ID NO: mRNA ORF 蛋白質 與摺疊子結構域連接之SARS-CoV-2 NTD 43 44 與跨膜結構域及摺疊子結構域連接之SARS-CoV-2 NTD 49 50 與摺疊子結構域連接之SARS-CoV-2 RBD 73 74 與摺疊子結構域及跨膜結構域連接之SARS-CoV-2 RBD (RBD-FD-TM) 79 80 與跨膜結構域及摺疊子結構域連接之SARS-CoV-2 RBD (RBD-TM-FD) 82 83 與摺疊子結構域及跨膜結構域連接之SARS-CoV-2 NTD-RBD (NTD-RBD-FD-TM) 100 101 與跨膜結構域及摺疊子結構域連接之SARS-CoV-2 NTD-RBD (NTD-RBD-TM-FD) 103 104 與摺疊子結構域連接之SARS-CoV-2 NTD-RBD 112 113 15B. 與摺疊子結構域連接之抗原 與摺疊子結構域連接之 SARS-CoV-2 NTD SEQ ID NO: 42自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 43及3' UTR SEQ ID NO: 4。 42 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 43 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLG 44 聚A尾 100 nt    與跨膜結構域及摺疊子結構域連接之 SARS-CoV-2 NTD SEQ ID NO: 48自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 49及3' UTR SEQ ID NO: 4。 48 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 49 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 50 聚A尾 100 nt    與摺疊子結構域連接之 SARS-CoV-2 RBD SEQ ID NO: 72自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 73及3' UTR SEQ ID NO: 4。 72 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 73 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLG 74 聚A尾 100 nt    與摺疊子結構域及跨膜結構域連接之 SARS-CoV-2 RBD (RBD-FD-TM) SEQ ID NO: 78自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 79及3' UTR SEQ ID NO: 4。 78 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 79 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGggsilaiystvasslvllvslgaisf 80 聚A尾 100 nt    與跨膜結構域及摺疊子結構域連接之 SARS-CoV-2 RBD (RBD-TM-FD) SEQ ID NO: 81自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 82及3' UTR SEQ ID NO: 4。 81 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 82 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 83 聚A尾 100 nt    與摺疊子結構域及跨膜結構域連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-FD-TM) SEQ ID NO: 99自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 100及3' UTR SEQ ID NO: 4。 99 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 100 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGggsilaiystvasslvllvslgaisf 101 聚A尾 100 nt    與跨膜結構域及摺疊子結構域連接之 SARS-CoV-2 NTD-RBD (NTD-RBD-TM-FD) SEQ ID NO: 102自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 103及3' UTR SEQ ID NO: 4。 102 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 103 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 104 聚A尾 100 nt    與摺疊子結構域連接之 SARS-CoV-2 NTD-RBD SEQ ID NO: 111自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 112及3' UTR SEQ ID NO: 4。 111 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 112 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLG 113 聚A尾 100 nt    包封蛋白 Without being bound by theory, it is believed that this configuration will result in the largely immunogenic silencing of the foldon on the intracellular region of the protein. Non-limiting examples of SARS-CoV-2 antigens linked to the foldon domains and mRNAs encoding them are provided in Table 15A and Table 15B below. Table 15A. Antigens linked to foldon domains name SEQ ID NO: mRNA ORF protein SARS-CoV-2 NTD linked to the foldon domain 43 44 SARS-CoV-2 NTD linked to transmembrane and foldon domains 49 50 SARS-CoV-2 RBD linked to the foldon domain 73 74 SARS-CoV-2 RBD Linked to Foldon and Transmembrane Domains (RBD-FD-TM) 79 80 SARS-CoV-2 RBD linked to transmembrane and foldon domains (RBD-TM-FD) 82 83 SARS-CoV-2 NTD-RBD linked to foldon and transmembrane domains (NTD-RBD-FD-TM) 100 101 SARS-CoV-2 NTD-RBD linked to transmembrane and foldon domains (NTD-RBD-TM-FD) 103 104 SARS-CoV-2 NTD-RBD linked to the foldon domain 112 113 Table 15B. Antigens linked to foldon domains SARS-CoV-2 NTD linked to the foldon domain SEQ ID NO: 42 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 43 and 3' UTR SEQ ID NO: 4. 42 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 43 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLG 44 Poly A tail 100nt SARS-CoV-2 NTD linked to transmembrane and foldon domains SEQ ID NO: 48 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 49 and 3' UTR SEQ ID NO: 4. 48 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAG ACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 49 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 50 Poly A tail 100nt SARS-CoV-2 RBD linked to the foldon domain SEQ ID NO: 72 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 73 and 3' UTR SEQ ID NO: 4. 72 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 73 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFELLHAPATFGGCGPKsggleapgg 74 Poly A tail 100nt SARS-CoV-2 RBD Linked to Foldon and Transmembrane Domains (RBD-FD-TM) SEQ ID NO: 78 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 79 and 3' UTR SEQ ID NO: 4. 78 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 79 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGggsilaiystvasslvllvslgaisf 80 Poly A tail 100nt SARS-CoV-2 RBD linked to transmembrane and foldon domains (RBD-TM-FD) SEQ ID NO: 81 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 82 and 3' UTR SEQ ID NO: 4. 81 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUACAGCAUGCAGCUGGCUAGCUGCGUGACCCUGACCCUGGUGCUGCUGGUGAACAGCCAGCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 82 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence mysmqlascvtltlvllvnsQPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 83 Poly A tail 100nt SARS-CoV-2 NTD-RBD linked to foldon and transmembrane domains (NTD-RBD-FD-TM) SEQ ID NO: 99 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 100 and 3' UTR SEQ ID NO: 4. 99 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUC 100 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGggsilaiystvasslvllvslgaisf 101 Poly A tail 100nt SARS-CoV-2 NTD-RBD linked to transmembrane and foldon domains (NTD-RBD-TM-FD) SEQ ID NO: 102 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 103 and 3' UTR SEQ ID NO: 4. 102 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCAGCAUCCUGGCCAUCUACAGCACCGUGGCCAGCAGCCUGGUGCUGCUGGUGAGCCUGGGCGCCAUCAGCUUCGGCGGAGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGCAAG 103 3' UTR UGAUAAUAGGCUGGAGCCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsgggsilaiystvasslvllvslgaisfgggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLGk 104 Poly A tail 100nt SARS-CoV-2 NTD-RBD linked to the foldon domain SEQ ID NO: 111 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 112 and 3' UTR SEQ ID NO: 4. 111 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACGGAGGCGGAUCGGGAGGCGGACCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACA GCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGUCUGGCGGAGGCGGCAGCGCCAUCGGCGGCUACAUCCCCGAGGCCCCUAGAGACGGCCAGGCCUACGUGCGGAAGGACGGCGAGUGGGUGCUGCUGAGCACCUUCCUGGGC 112 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDgggsgggPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKsggggSAIGGYIPEAPRDGQAYVRKDGEWVLLSTFLG 113 Poly A tail 100nt Encapsulated protein

在一些實施例中,包封蛋白結構域用作支架結構域。包封蛋白係自適溫性海栖熱袍菌( Thermotoga maritima)分離之蛋白質籠形奈米顆粒。包封蛋白由相同31 kDa單體之60個拷貝組裝而成,該等單體具有薄的二十面體T = 1對稱之籠形結構,其內徑及外徑分別為20 nm及24 nm (Sutter M.等人, Nat Struct Mol Biol.2008;15: 939-947)。儘管包封蛋白在海栖熱袍菌中之確切功能尚未清楚地為人們所瞭解,但最近其晶體結構已得以解决,且其功能假定為包封諸如DyP (染料脫色過氧化物酶)及Flp (鐵蛋白樣蛋白)等蛋白質之細胞隔室,該等蛋白質參與氧化應激反應30 (Rahmanpour R.等人, FEBS J.2013;280: 2097-2104)。使用包封蛋白進行奈米顆粒構築使得蛋白質抗原能够在奈米顆粒表面上展示,且能够將諸如mRNA等貨物包封在奈米顆粒本身內。以前之基於包封蛋白奈米顆粒之疫苗已引發對表面展示抗原及貨物蛋白本身之强免疫反應(Lagoutte P.等人, Vaccine.2018;36(25): 3622-3628)。 In some embodiments, an encapsulated protein domain is used as a scaffold domain. The encapsulated protein is a protein cage-shaped nanoparticle isolated from thermotoga maritima . The encapsulated protein is assembled from 60 copies of the same 31 kDa monomer with a thin icosahedral T=1 symmetric cage with inner and outer diameters of 20 nm and 24 nm, respectively (Sutter M. et al., Nat Struct Mol Biol. 2008; 15: 939-947). Although the exact function of the encapsulating protein in Thermotoga maris is not clearly understood, its crystal structure has recently been solved and its function is postulated to encapsulate enzymes such as DyP (dye depigmenting peroxidase) and Flp (ferritin-like protein) and other proteins involved in the oxidative stress response 30 (Rahmanpour R. et al., FEBS J. 2013; 280: 2097-2104). Nanoparticle construction using encapsulated proteins enables protein antigens to be displayed on the nanoparticle surface and to encapsulate cargoes such as mRNA within the nanoparticles themselves. Previous vaccines based on encapsulated protein nanoparticles have elicited strong immune responses to surface displayed antigens and the cargo protein itself (Lagoutte P. et al., Vaccine. 2018;36(25):3622-3628).

在一些實施例中,本文提供之mRNA編碼與包封蛋白結構域連接之S蛋白結構域(例如,S1、S2、RBD及/或NTD)。 融合蛋白 In some embodiments, the mRNA provided herein encodes a S protein domain (eg, S1, S2, RBD, and/or NTD) linked to an encapsulate protein domain. fusion protein

在一些實施例中,本揭示案之組合物包括編碼抗原性融合蛋白之mRNA。因此,編碼之一或多種抗原可包括接合在一起之兩種或更多種蛋白質(例如,蛋白質及/或蛋白質片段)。或者,與蛋白抗原融合之蛋白質不促進對其自身之强免疫反應,而是促進對冠狀病毒抗原之强免疫反應。在一些實施例中,抗原性融合蛋白保留來自每一原始蛋白質之功能性質。In some embodiments, the compositions of the present disclosure include mRNA encoding an antigenic fusion protein. Thus, encoding one or more antigens can include two or more proteins (eg, proteins and/or protein fragments) joined together. Alternatively, the protein fused to the protein antigen does not promote a strong immune response to itself, but rather a strong immune response to the coronavirus antigen. In some embodiments, the antigenic fusion proteins retain functional properties from each original protein.

在一些實施例中,融合蛋白包含來自SARS-CoV-2刺突蛋白之受體結合結構域。In some embodiments, the fusion protein comprises a receptor binding domain from the SARS-CoV-2 spike protein.

在一些實施例中,融合蛋白包含來自SARS-CoV-2刺突蛋白之N末端結構域。In some embodiments, the fusion protein comprises the N-terminal domain from the SARS-CoV-2 spike protein.

在一些實施例中,融合蛋白包含跨膜結構域。在一些實施例中,跨膜結構域可來自不為SARS-CoV-2之病毒。舉例而言,跨膜結構域可來自流行性感冒血球凝集素跨膜結構域,其已展現有效地將蛋白質錨定在細胞表面。 變異體 In some embodiments, the fusion protein comprises a transmembrane domain. In some embodiments, the transmembrane domain can be from a virus other than SARS-CoV-2. For example, the transmembrane domain can be derived from the influenza hemagglutinin transmembrane domain, which has been shown to efficiently anchor proteins to the cell surface. variant

在一些實施例中,本揭示案之組合物包括編碼冠狀病毒抗原變異體之RNA。抗原變異體或其他多肽變異體係指其胺基酸序列與野生型、天然或參照序列不同之分子。與天然或參照序列相比,抗原/多肽變異體可在胺基酸序列內之某些位置處具有取代、缺失及/或插入。通常,變異體與野生型、天然或參照序列具有至少50%之一致性。在一些實施例中,變異體與野生型、天然或參照序列共享至少80%或至少90%之一致性。In some embodiments, the compositions of the present disclosure include RNA encoding a coronavirus antigenic variant. An antigenic variant or other system of polypeptide variation refers to a molecule whose amino acid sequence differs from the wild-type, native, or reference sequence. Antigen/polypeptide variants may have substitutions, deletions and/or insertions at certain positions within the amino acid sequence compared to the native or reference sequence. Typically, the variant is at least 50% identical to the wild-type, native, or reference sequence. In some embodiments, the variant shares at least 80% or at least 90% identity with the wild-type, native or reference sequence.

由本揭示案之核酸編碼之變異體抗原/多肽可含有賦予多種期望性質中之任一者之胺基酸變化,例如,在個體中增强其免疫原性、增强其表現及/或改善其穩定性或PK/PD性質。變異體抗原/多肽可使用常規誘變技術製備,且適當地進行分析以確定其是否具有期望性質。用以測定表現水準及免疫原性之分析為業內所熟知,且示例性之此類分析在實例部分中闡釋。類似地,可使用業內公認之技術、例如藉由測定抗原在接種之個體中隨時間推移之表現及/或藉由觀察誘導之免疫反應之耐久性,來量測蛋白質變異體之PK/PD性質。由變異體核酸編碼之蛋白質之穩定性可藉由分析熱穩定性或尿素變性時之穩定性來量測,或可使用電腦預測來量測。用於該等實驗及電腦測定之方法為業內已知。The variant antigens/polypeptides encoded by the nucleic acids of the present disclosure may contain amino acid changes that confer any of a variety of desirable properties, eg, enhancing their immunogenicity, enhancing their performance, and/or improving their stability in an individual or PK/PD properties. Variant antigens/polypeptides can be prepared using conventional mutagenesis techniques and suitably analyzed to determine whether they have the desired properties. Assays to determine expression levels and immunogenicity are well known in the art, and exemplary such assays are illustrated in the Examples section. Similarly, the PK/PD properties of protein variants can be measured using art-recognized techniques, such as by measuring the performance of the antigen in vaccinated individuals over time and/or by observing the durability of the induced immune response . The stability of the protein encoded by the variant nucleic acid can be measured by analyzing thermal stability or stability upon urea denaturation, or can be measured using computer prediction. The methods used for these experiments and computer assays are known in the art.

在一些實施例中,組合物包含mRNA或mRNA ORF,其包含本文提供之序列中之任一者之核苷酸序列(參見,例如,序列表),或包含與本文提供之序列中之任一者之核苷酸序列至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致的核苷酸序列。In some embodiments, the composition comprises an mRNA or mRNA ORF comprising the nucleotide sequence of any of the sequences provided herein (see, eg, the Sequence Listing), or comprising any of the sequences provided herein The nucleotide sequences of these are at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical.

術語「一致性」係指如藉由比較序列確定之兩種或更多種多肽(例如抗原)或聚核苷酸(核酸)之序列之間之關係。一致性亦係指如藉由兩個或更多個胺基酸殘基或核酸殘基之串之間之匹配數確定的序列之間或之中之序列相關性程度。一致性度量由特定數學模型或電腦程式(例如,「算法」)處理之具有空位比對(若有的話)之兩個或更多個序列中之較小者間之一致性匹配百分比。相關抗原或核酸之一致性可容易地藉由已知方法計算。「一致性百分比(%)」當應用於多肽或聚核苷酸序列時,定義為在比對序列且引入空位(若需要)以實現最大一致性百分比之後,候選胺基酸或核酸序列中與第二序列之胺基酸序列或核酸序列中之殘基(胺基酸殘基或核酸殘基)一致之殘基之百分數。用於比對之方法及電腦程式為業內所熟知。應理解,一致性取决於一致性百分比之計算,但由於計算中引入之空位及罰分,一致性之值可能不同。通常,特定聚核苷酸或多肽(例如抗原)之變異體與該特定參照聚核苷酸或多肽具有如藉由本文所述及熟習此項技術者已知之序列比對程式及參數所確定之至少40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%但小於100%之序列一致性。用於比對之該等工具包括BLAST套件之彼等工具(Stephen F. Altschul等人(1997),「Gapped BLAST and PSI-BLAST: a new generation of protein database search programs」,Nucleic Acids Res. 25:3389-3402)。另一種常用局部比對技術係基於Smith-Waterman算法(Smith, T.F.及Waterman, M.S. (1981) 「Identification of common molecular subsequences.」J. Mol. Biol. 147:195-197)。基於動態規劃之一般全局比對技術係Needleman-Wunsch算法(Needleman, S.B.及Wunsch, C.D. (1970) 「A general method applicable to the search for similarities in the amino acid sequences of two proteins.」J. Mol. Biol. 48:443-453)。最近,已開發一種快速最佳全局序列比對算法(FOGSAA),據稱其比其他最佳全局比對方法(包括Needleman-Wunsch算法)更快地產生核苷酸及蛋白質序列之全局比對。The term "identity" refers to the relationship between the sequences of two or more polypeptides (eg, antigens) or polynucleotides (nucleic acids) as determined by comparing the sequences. Identity also refers to the degree of sequence relatedness between or among sequences as determined by the number of matches between strings of two or more amino acid residues or nucleic acid residues. A measure of identity is the percent identity match between the smaller of two or more sequences with gapped alignments, if any, processed by a particular mathematical model or computer program (eg, an "algorithm"). The identity of related antigens or nucleic acids can be readily calculated by known methods. "Percent identity (%)", when applied to polypeptide or polynucleotide sequences, is defined as the difference in the candidate amino acid or nucleic acid sequence with the The percentage of residues in which the amino acid sequence of the second sequence or the residues (amino acid residues or nucleic acid residues) in the nucleic acid sequence are identical. Methods and computer programs for alignment are well known in the art. It should be understood that agreement depends on the calculation of percent agreement, but the value of agreement may vary due to gaps and penalties introduced in the calculation. Typically, a variant of a particular polynucleotide or polypeptide (eg, an antigen) has the properties as determined by sequence alignment programs and parameters known to those skilled in the art as described herein and to that particular reference polynucleotide or polypeptide. At least 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96 %, 97%, 98%, 99% but less than 100% sequence identity. Such tools for alignment include those of the BLAST suite (Stephen F. Altschul et al. (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25: 3389-3402). Another commonly used local alignment technique is based on the Smith-Waterman algorithm (Smith, T.F. and Waterman, M.S. (1981) "Identification of common molecular subsequences." J. Mol. Biol. 147:195-197). A general global alignment technique based on dynamic programming is the Needleman-Wunsch algorithm (Needleman, S.B. and Wunsch, C.D. (1970) "A general method applicable to the search for similarities in the amino acid sequences of two proteins." J. Mol. Biol 48:443-453). Recently, a Fast Optimal Global Sequence Alignment Algorithm (FOGSAA) has been developed which is said to produce global alignments of nucleotide and protein sequences faster than other optimal global alignment methods, including the Needleman-Wunsch algorithm.

因此,編碼相對於參照序列、特別是本文揭示之多肽(例如抗原)序列含有取代、插入及/或添加、缺失及共價修飾之肽或多肽的聚核苷酸包括在本揭示案之範圍內。舉例而言,序列標籤或胺基酸(諸如一或多個離胺酸)可添加至肽序列中(例如,在N末端或C末端)。序列標籤可用於肽偵測、純化或定位。離胺酸可用於增加肽之溶解度或允許生物素化。或者,位於肽或蛋白質之胺基酸序列之羧基及胺基末端區之胺基酸殘基可視情况地缺失,以提供截短之序列。根據序列之用途,例如作為可溶或連接至固體載體之更大序列之一部分之序列之表現,某些胺基酸(例如,C末端或N末端殘基)可或者缺失。在一些實施例中,信號序列、終止序列、跨膜結構域、連接體、多聚化結構域(諸如,摺疊子區)及諸如此類(或編碼其)之序列可用實現相同或相似功能之替代序列取代。在一些實施例中,可例如藉由引入更大之胺基酸填充蛋白質核心中之空腔以改善穩定性。在其他實施例中,可用疏水性殘基置換包埋之氫鍵網路以改善穩定性。在其他實施例中,可去除醣基化位點且用適當殘基置換。熟習此項技術者可容易地鑑別該等序列。亦應理解,本文提供之一些序列含有可在例如用於製備mRNA疫苗之前缺失之序列標籤或末端肽序列(例如,在N末端或C末端)。Accordingly, polynucleotides encoding peptides or polypeptides containing substitutions, insertions and/or additions, deletions, and covalent modifications relative to a reference sequence, particularly a polypeptide (eg, antigen) sequence disclosed herein are included within the scope of the present disclosure . For example, sequence tags or amino acids such as one or more lysines can be added to the peptide sequence (eg, at the N-terminus or C-terminus). Sequence tags can be used for peptide detection, purification or localization. Lysine can be used to increase the solubility of peptides or to allow biotinylation. Alternatively, amino acid residues located in the carboxyl and amino terminal regions of the amino acid sequence of a peptide or protein can optionally be deleted to provide a truncated sequence. Certain amino acids (eg, C-terminal or N-terminal residues) may or may not be deleted depending on the use of the sequence, eg, the representation of the sequence as part of a larger sequence that is soluble or linked to a solid support. In some embodiments, signal sequences, termination sequences, transmembrane domains, linkers, multimerization domains (such as foldon regions), and the like (or sequences encoding the same) may be used as alternative sequences that perform the same or similar functions replace. In some embodiments, stability can be improved by filling cavities in the protein core, eg, by introducing larger amino acids. In other embodiments, the embedded hydrogen bonding network can be replaced with hydrophobic residues to improve stability. In other embodiments, glycosylation sites can be removed and replaced with appropriate residues. Those skilled in the art can readily identify such sequences. It will also be understood that some of the sequences provided herein contain sequence tags or terminal peptide sequences (eg, at the N-terminus or C-terminus) that may be deleted, eg, prior to use in the preparation of mRNA vaccines.

如由熟習此項技術者認識到,亦將蛋白質片段、功能蛋白質結構域及同源蛋白質視為在相關冠狀病毒抗原之範圍內。舉例而言,本文提供參照蛋白質之任何蛋白質片段(意指比參照抗原序列短至少一個胺基酸殘基、但在其他方面相同之多肽序列),條件係該片段係免疫原性的且賦予針對冠狀病毒之保護性免疫反應。除了與參照蛋白質相同但截短之變異體之外,在一些實施例中,如本文提供或參考之任何序列中所示,抗原包括2、3、4、5、6、7、8、9、10或更多個突變。抗原/抗原性多肽之長度範圍可為約4、6或8個胺基酸至全長蛋白質。 穩定元件 As recognized by those skilled in the art, protein fragments, functional protein domains and homologous proteins are also considered to be within the scope of relevant coronavirus antigens. For example, provided herein is any protein fragment (meaning a polypeptide sequence that is at least one amino acid residue shorter than a reference antigen sequence, but otherwise identical) of a reference protein, provided that the fragment is immunogenic and confers a target to Protective immune response to coronavirus. In addition to identical but truncated variants of the reference protein, in some embodiments, as shown in any of the sequences provided or referenced herein, the antigens include 2, 3, 4, 5, 6, 7, 8, 9, 10 or more mutations. Antigens/antigenic polypeptides can range in length from about 4, 6 or 8 amino acids to full length proteins. Stabilizing element

天然存在之真核mRNA分子除了諸如5'-端帽結構或3'-聚(A)尾等其他結構特徵之外,亦可含有穩定元件,包括但不限於在其5'端(5' UTR)及/或在其3'端(3 ' UTR)之非轉譯區(UTR)。5' UTR及3' UTR二者皆通常係自基因體DNA轉錄且係成熟前mRNA之元件。通常在mRNA加工期間向轉錄之(成熟前) mRNA中添加成熟mRNA之特徵性結構特徵(諸如5'-端帽及3'-聚(A)尾)。Naturally occurring eukaryotic mRNA molecules may also contain stabilizing elements, including but not limited to, at their 5' end (5' UTR ) and/or the untranslated region (UTR) at its 3' end (3' UTR). Both the 5' UTR and the 3' UTR are typically transcribed from genomic DNA and are elements of pre-mature mRNA. Structural features characteristic of mature mRNAs (such as 5'-end caps and 3'-poly(A) tails) are typically added to transcribed (pre-mature) mRNAs during mRNA processing.

在一些實施例中,組合物包括具有編碼至少一種具有至少一個修飾之抗原性多肽之開放閱讀框、至少一個5'末端端帽之mRNA,且調配在脂質奈米顆粒內。聚核苷酸之5'-加端帽可根據製造商之方案在活體外轉錄反應期間使用以下化學RNA端帽類似物同時完成,以產生5'-鳥苷端帽結構:3´-O-Me-m7G(5')ppp(5') G [ARCA端帽];G(5')ppp(5')A;G(5')ppp(5')G;m7G(5')ppp(5')A;m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA)。經修飾之RNA之5'-加端帽可在轉錄後使用牛痘病毒加端帽酶完成,以產生「端帽0」結構:m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA)。端帽1結構可使用牛痘病毒加端帽酶及2'-O甲基轉移酶產生,以產生:m7G(5')ppp(5')G-2′-O-甲基。端帽2結構可自端帽1結構、隨後使用2'-O甲基轉移酶對5'-倒數第三個核苷酸進行2'-O-甲基化來產生。端帽3結構可自端帽2結構、隨後使用2'-O甲基轉移酶對5'-倒數第四個核苷酸進行2'-O-甲基化來產生。酶可源自重組來源。In some embodiments, the composition includes an mRNA having an open reading frame encoding at least one antigenic polypeptide with at least one modification, at least one 5' end cap, and formulated within a lipid nanoparticle. 5'-end capping of polynucleotides can be accomplished simultaneously during in vitro transcription reactions according to the manufacturer's protocol using the following chemical RNA end cap analogs to generate 5'-guanosine end cap structures: 3´-O- Me-m7G(5')ppp(5')G [ARCA end caps]; G(5')ppp(5')A; G(5')ppp(5')G; m7G(5')ppp( 5')A; m7G(5')ppp(5')G (New England BioLabs, Ipswich, MA). 5'-end capping of the modified RNA can be accomplished post-transcriptionally using the vaccinia virus capping enzyme to generate the "end cap 0" structure: m7G(5')ppp(5')G (New England BioLabs, Ipswich , MA). The endcap 1 structure can be generated using vaccinia virus endcapase and 2'-O methyltransferase to produce: m7G(5')ppp(5')G-2'-O-methyl. The end cap 2 structure can be generated from the end cap 1 structure followed by 2'-O-methylation of the 5'-third to last nucleotide using a 2'-O methyltransferase. The end cap 3 structure can be generated from the end cap 2 structure followed by 2'-O-methylation of the 5'-fourth-to-last nucleotide using a 2'-O methyltransferase. Enzymes can be derived from recombinant sources.

3'-聚(A)尾通常係添加至轉錄之mRNA之3'端之一段腺嘌呤核苷酸。在一些情况下,其可包含高達約400個腺嘌呤核苷酸。在一些實施例中,3'-聚(A)尾之長度對於個別mRNA之穩定性可為必需之要素。The 3'-poly(A) tail is usually a stretch of adenine nucleotides added to the 3' end of the transcribed mRNA. In some cases, it may contain up to about 400 adenine nucleotides. In some embodiments, the length of the 3'-poly(A) tail may be an essential element for the stability of the individual mRNA.

在一些實施例中,組合物包括穩定元件。穩定元件可包括例如組蛋白莖環。已鑑別莖環結合蛋白(SLBP),一種32 kDa之蛋白質。其與細胞核及細胞質二者中組蛋白信使3'端之組蛋白莖環結合。其表現水準受細胞週期之調控;其在S期達到峰值,此時組蛋白mRNA水準亦升高。已顯示該蛋白質對於U7 snRNP對組蛋白前mRNA之有效3'端加工係至關重要的。SLBP在加工後繼續與莖環結合,且接著刺激成熟組蛋白mRNA在細胞質中轉譯成組蛋白。SLBP之RNA結合結構域在後生動物及原生動物中係保守的;其與組蛋白莖環之結合取决於環之結構。最小結合位點包括相對於莖環之5'之至少三個核苷酸及3'之兩個核苷酸。In some embodiments, the composition includes a stabilizing element. Stabilizing elements may include, for example, histone stem-loops. Stem-loop binding protein (SLBP), a 32 kDa protein, has been identified. It binds to the histone stem-loop at the 3' end of histone messengers in both the nucleus and cytoplasm. Its level of expression is regulated by the cell cycle; it peaks in S phase, when histone mRNA levels are also elevated. This protein has been shown to be critical for efficient 3' end processing of histone pre-mRNA by U7 snRNP. SLBP continues to bind to the stem-loop after processing and then stimulates the translation of mature histone mRNA into histones in the cytoplasm. The RNA binding domain of SLBP is conserved in metazoans and protozoa; its binding to the histone stem-loop depends on the structure of the loop. The minimal binding site includes at least three nucleotides 5' and two nucleotides 3' relative to the stem loop.

在一些實施例中,mRNA包括編碼區、至少一個組蛋白莖環及視情况選用之聚(A)序列或聚腺苷酸化信號。聚(A)序列或聚腺苷酸化信號通常應增强編碼之蛋白質之表現水準。在一些實施例中,編碼之蛋白質不為組蛋白、報導基因蛋白(例如螢光素酶、GFP、EGFP、β-半乳糖苷酶、EGFP)或標記物或選擇蛋白(例如α-球蛋白、半乳糖激酶及黃嘌呤:鳥嘌呤磷酸核糖基轉移酶(GPT))。In some embodiments, the mRNA includes a coding region, at least one histone stem-loop, and optionally a poly(A) sequence or a polyadenylation signal. The poly(A) sequence or polyadenylation signal should generally enhance the level of expression of the encoded protein. In some embodiments, the encoded protein is not a histone, reporter protein (eg, luciferase, GFP, EGFP, β-galactosidase, EGFP) or marker or selectin (eg, α-globulin, Galactokinase and xanthine:guanine phosphoribosyltransferase (GPT)).

在一些實施例中,mRNA包括聚(A)序列或聚腺苷酸化信號與至少一個組蛋白莖環之組合,即使二者在性質上代表替代機制,但協同地作用以增加蛋白質表現超過用任一個別元件觀察到之水準。聚(A)及至少一種組蛋白莖環之組合之協同效應不依賴於元件之順序或聚(A)序列之長度。In some embodiments, the mRNA includes a poly(A) sequence or a polyadenylation signal in combination with at least one histone stem-loop, even though the two represent alternative mechanisms in nature, but act synergistically to increase protein expression over using either A level observed by another component. The synergistic effect of the combination of poly(A) and at least one histone stem-loop is independent of the order of the elements or the length of the poly(A) sequence.

在一些實施例中,mRNA不包括組蛋白下游元件(HDE)。「組蛋白下游元件」 (HDE)包括在天然存在之莖環之3'之大約15至20個核苷酸之富含嘌呤之聚核苷酸段,其代表U7 snRNA之結合位點,該U7 snRNA參與組蛋白前mRNA加工成成熟組蛋白mRNA。在一些實施例中,核酸不包括內含子。In some embodiments, the mRNA does not include histone downstream elements (HDEs). "Histone Downstream Elements" (HDEs) include stretches of purine-rich polynucleotides of approximately 15 to 20 nucleotides 3' to naturally occurring stem loops that represent the binding site for U7 snRNA, the U7 snRNA is involved in the processing of histone pre-mRNA into mature histone mRNA. In some embodiments, the nucleic acid does not include introns.

mRNA可含有或可不含有增强子及/或啓動子序列,其可經修飾或未經修飾,或者其可經活化或未經活化。在一些實施例中,組蛋白莖環通常源自組蛋白基因,且包括由間隔體分開之兩個相鄰之部分或完全反向互補序列之分子內鹼基配對,該間隔體由形成結構之環之短序列組成。未配對之環區通常不能與莖環元件中之任一者鹼基配對。其由於係許多RNA二級結構之關鍵組分而更經常地出現於RNA中,但亦可存在於單股DNA中。莖環結構之穩定性通常取决於長度、失配或膨出部之數目及配對區之鹼基組成。在一些實施例中,搖擺鹼基配對(非沃森-克裏克鹼基配對(non-Watson-Crick base pairing))可產生。在一些實施例中,至少一種組蛋白莖環序列包含15至45個核苷酸之長度。mRNA may or may not contain enhancer and/or promoter sequences, which may or may not be modified, or which may or may not be activated. In some embodiments, a histone stem-loop is typically derived from a histone gene and includes intramolecular base pairing of two adjacent partially or fully reverse complementary sequences separated by a spacer formed by a structural A short sequence of loops. Unpaired loop regions are generally unable to base pair with any of the stem-loop elements. It occurs more frequently in RNA as it is a key component of many RNA secondary structures, but can also be found in single-stranded DNA. The stability of the stem-loop structure generally depends on the length, the number of mismatches or bulges, and the base composition of the paired regions. In some embodiments, rocking base pairing (non-Watson-Crick base pairing) can be generated. In some embodiments, the at least one histone stem-loop sequence comprises a length of 15 to 45 nucleotides.

在一些實施例中,mRNA去除一或多個富含AU之序列。有時稱為AURES之該等序列係在3'UTR中發現之不穩定序列。AURES可自RNA疫苗去除。或者,AURES可保留在RNA疫苗中。 信號肽 In some embodiments, the mRNA removes one or more AU-rich sequences. These sequences, sometimes referred to as AURES, are unstable sequences found in the 3'UTR. AURES can be removed from RNA vaccines. Alternatively, AURES can be retained in the RNA vaccine. signal peptide

在一些實施例中,組合物包含具有ORF之mRNA,該ORF編碼與冠狀病毒抗原融合之信號肽。包含蛋白質N末端15-60個胺基酸之信號肽通常係分泌路徑上跨膜易位所需要的,且因此,普遍控制真核及原核生物中大部分蛋白質進入分泌路徑。在真核生物中,新生前體蛋白(先驅蛋白)之信號肽將核糖體引導至粗內質網(ER)膜,且起始生長之肽鏈穿過其轉運以進行加工。ER加工產生成熟蛋白質,其中信號肽通常藉由宿主細胞之ER駐留信號肽酶自前體蛋白裂解,或者其保持未裂解且作為膜錨起作用。信號肽亦可促進蛋白質靶向細胞膜。In some embodiments, the composition comprises mRNA with an ORF encoding a signal peptide fused to a coronavirus antigen. Signal peptides comprising the N-terminal 15-60 amino acids of proteins are generally required for transmembrane translocation on the secretory pathway, and thus, generally control the entry of most proteins into the secretory pathway in eukaryotes and prokaryotes. In eukaryotes, the signal peptide of a nascent precursor protein (pioneer protein) guides ribosomes to the crude endoplasmic reticulum (ER) membrane, and the growth-initiating peptide chain is transported across it for processing. ER processing produces mature proteins in which the signal peptide is typically cleaved from the precursor protein by the host cell's ER-resident signal peptidase, or it remains uncleaved and functions as a membrane anchor. Signal peptides can also facilitate protein targeting to the cell membrane.

信號肽之長度可為15-60個胺基酸。舉例而言,信號肽之長度可為15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59或60個胺基酸。在一些實施例中,信號肽之長度為20-60、25-60、30-60、35-60、40-60、45- 60、50-60、55-60、15-55、20-55、25-55、30-55、35-55、40-55、45-55、50-55、15-50、20-50、25-50、30-50、35-50、40-50、45-50、15-45、20-45、25-45、30-45、35-45、40-45、15-40、20-40、25-40、30-40、35-40、15-35、20-35、25-35、30-35、15-30、20-30、25-30、15-25、20-25或15-20個胺基酸。The signal peptide can be 15-60 amino acids in length. For example, the length of the signal peptide can be 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59 or 60 amino acids. In some embodiments, the length of the signal peptide is 20-60, 25-60, 30-60, 35-60, 40-60, 45-60, 50-60, 55-60, 15-55, 20-55 , 25-55, 30-55, 35-55, 40-55, 45-55, 50-55, 15-50, 20-50, 25-50, 30-50, 35-50, 40-50, 45 -50, 15-45, 20-45, 25-45, 30-45, 35-45, 40-45, 15-40, 20-40, 25-40, 30-40, 35-40, 15-35 , 20-35, 25-35, 30-35, 15-30, 20-30, 25-30, 15-25, 20-25, or 15-20 amino acids.

來自異源基因之信號肽(其調控自然界中除冠狀病毒抗原外之基因之表現)為業內已知,且可測試其期望性質,且接著納入本揭示案之核酸中。 序列最佳化 Signal peptides from heterologous genes that regulate the expression of genes other than coronavirus antigens in nature are known in the art and can be tested for their desired properties and then incorporated into the nucleic acids of the present disclosure. sequence optimization

在一些實施例中,對編碼本揭示案之抗原之ORF進行密碼子最佳化。密碼子最佳化方法為業內已知。舉例而言,本文提供之任何一或多個序列之ORF可為密碼子最佳化的。在一些實施例中,密碼子最佳化可用於匹配靶及宿主生物體中之密碼子頻率以確保正確摺疊;偏向GC含量以增加mRNA穩定性或减少二級結構;最小化可損害基因構築或表現之串聯重複密碼子或鹼基串(base run);定制轉錄及轉譯控制區;插入或去除蛋白質運輸序列;在編碼之蛋白質中去除/添加轉譯後修飾位點(例如,醣基化位點);添加、去除或改組蛋白結構域;插入或缺失限制性位點;修飾核糖體結合位點及mRNA降解位點;調整轉譯率以允許蛋白質之各種結構域正確摺疊;或减少或消除聚核苷酸內之問題二級結構。密碼子最佳化工具、算法及服務為業內已知-非限制性實例包括來自GeneArt (Life Technologies)、DNA2.0 (Menlo Park CA)及/或專有方法之服務。在一些實施例中,使用最佳化算法最佳化開放閱讀框(ORF)序列。In some embodiments, ORFs encoding antigens of the present disclosure are codon-optimized. Codon optimization methods are known in the art. For example, the ORF of any one or more of the sequences provided herein may be codon-optimized. In some embodiments, codon optimization can be used to match codon frequencies in the target and host organisms to ensure correct folding; bias toward GC content to increase mRNA stability or reduce secondary structure; minimize damage to gene architecture or Representation of tandem repeat codons or base runs; custom transcription and translation control regions; insertion or removal of protein trafficking sequences; removal/addition of post-translational modification sites (eg, glycosylation sites) in the encoded protein ); add, remove, or shuffle protein domains; insert or delete restriction sites; modify ribosome binding sites and mRNA degradation sites; adjust translation rates to allow correct folding of various domains of the protein; or reduce or eliminate polynucleation Problematic secondary structure within nucleotides. Codon optimization tools, algorithms and services are known in the art - non-limiting examples include services from GeneArt (Life Technologies), DNA2.0 (Menlo Park CA) and/or proprietary methods. In some embodiments, an optimization algorithm is used to optimize open reading frame (ORF) sequences.

在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列ORF (例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享小於95%之序列一致性。在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享小於90%之序列一致性。在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享小於85%之序列一致性。在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享小於80%之序列一致性。在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享小於75%之序列一致性。In some embodiments, the codon-optimized sequence shares less than 95% sequence identity with a naturally-occurring or wild-type sequence ORF (eg, a naturally-occurring or wild-type mRNA sequence encoding a coronavirus antigen). In some embodiments, the codon-optimized sequence shares less than 90% sequence identity with a naturally-occurring or wild-type sequence (eg, a naturally-occurring or wild-type mRNA sequence encoding a coronavirus antigen). In some embodiments, the codon-optimized sequence shares less than 85% sequence identity with a naturally-occurring or wild-type sequence (eg, a naturally-occurring or wild-type mRNA sequence encoding a coronavirus antigen). In some embodiments, the codon-optimized sequence shares less than 80% sequence identity with a naturally-occurring or wild-type sequence (eg, a naturally-occurring or wild-type mRNA sequence encoding a coronavirus antigen). In some embodiments, the codon-optimized sequence shares less than 75% sequence identity with a naturally-occurring or wild-type sequence (eg, a naturally-occurring or wild-type mRNA sequence encoding a coronavirus antigen).

在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享介於65%與85%之間(例如,介於約67%與約85%之間或介於約67%與約80%之間)之序列一致性。在一些實施例中,密碼子最佳化之序列與天然存在或野生型序列(例如,編碼冠狀病毒抗原之天然存在或野生型mRNA序列)共享介於65%與75%或約80%之間之序列一致性。In some embodiments, the codon-optimized sequence shares between 65% and 85% (eg, between 65% and 85%) with a naturally occurring or wild-type sequence (eg, a naturally occurring or wild-type mRNA sequence encoding a coronavirus antigen). between about 67% and about 85% or between about 67% and about 80%) sequence identity. In some embodiments, the codon-optimized sequence shares between 65% and 75% or about 80% with a naturally occurring or wild-type sequence (eg, a naturally occurring or wild-type mRNA sequence encoding a coronavirus antigen) sequence consistency.

在一些實施例中,密碼子最佳化之序列編碼之抗原與由非密碼子最佳化之序列編碼之冠狀病毒抗原之免疫原性一樣,或比其免疫原性更高(例如,高至少10%、至少20%、至少30%、至少40%、至少50%、至少100%或至少200%)。In some embodiments, the antigen encoded by the codon-optimized sequence is as immunogenic as, or more immunogenic (eg, at least higher than the coronavirus antigen encoded by the non-codon-optimized sequence) 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 100%, or at least 200%).

經修飾之mRNA在轉染至哺乳動物宿主細胞中時具有12-18小時之間或大於18小時、例如24小時、36小時、48小時、60小時、72小時或大於72小時之穩定性,且能够由哺乳動物宿主細胞表現。The modified mRNA is stable when transfected into a mammalian host cell for between 12-18 hours or greater than 18 hours, such as 24 hours, 36 hours, 48 hours, 60 hours, 72 hours or greater than 72 hours, and Can be expressed by mammalian host cells.

在一些實施例中,密碼子最佳化之RNA可為其中G/C之水準增加之RNA。核酸分子(例如mRNA)之G/C含量可影響RNA之穩定性。具有增加量之鳥嘌呤(G)及/或胞嘧啶(C)殘基之RNA可在功能上比含有大量腺嘌呤(A)及胸腺嘧啶(T)或尿嘧啶(U)核苷酸之mRNA更穩定。作為實例,WO02/098443揭示一種醫藥組合物,含有藉由轉譯區中之序列修飾而穩定之mRNA。由於遺傳密碼之簡併性,修飾藉由用促進更大RNA穩定性而不改變所得胺基酸之彼等密碼子取代現有密碼子來起作用。該方法限於RNA之編碼區。 化學上未經修飾之核苷酸 In some embodiments, the codon-optimized RNA can be an RNA in which the level of G/C is increased. The G/C content of a nucleic acid molecule (eg, mRNA) can affect RNA stability. RNAs with increased amounts of guanine (G) and/or cytosine (C) residues are functionally more functional than mRNAs that contain larger amounts of adenine (A) and thymine (T) or uracil (U) nucleotides more stable. As an example, WO02/098443 discloses a pharmaceutical composition containing mRNA stabilized by sequence modifications in the translation region. Due to the degeneracy of the genetic code, modifications work by replacing existing codons with those that promote greater RNA stability without altering the resulting amino acid. This method is limited to the coding region of RNA. Chemically unmodified nucleotides

在一些實施例中,mRNA未經化學修飾,且包含由腺苷、鳥苷、胞嘧啶及尿苷組成之標準核糖核苷酸。在一些實施例中,本揭示案之核苷酸及核苷包含標準核苷殘基,諸如存在於轉錄之RNA中之彼等核苷殘基(例如A、G、C或U)。在一些實施例中,本揭示案之核苷酸及核苷包含標準去氧核糖核苷,例如存在於DNA中之彼等去氧核糖核苷(例如dA、dG、dC或dT)。 化學修飾 In some embodiments, the mRNA is not chemically modified and comprises standard ribonucleotides consisting of adenosine, guanosine, cytosine, and uridine. In some embodiments, the nucleotides and nucleosides of the present disclosure comprise standard nucleoside residues, such as those found in transcribed RNA (eg, A, G, C, or U). In some embodiments, the nucleotides and nucleosides of the present disclosure comprise standard deoxyribonucleosides, such as those found in DNA (eg, dA, dG, dC, or dT). chemical modification

在一些實施例中,本揭示案之組合物包含具有編碼冠狀病毒抗原之開放閱讀框之mRNA,其中核酸包含可為標準之(未經修飾之)或如業內已知修飾之核苷酸及/或核苷。在一些實施例中,本揭示案之核苷酸及核苷包含經修飾之核苷酸或核苷。該等經修飾之核苷酸及核苷可為天然存在之經修飾之核苷酸及核苷或非天然存在之經修飾之核苷酸及核苷。該等修飾可包括如業內所公認之在核苷酸及/或核苷之糖、主鏈或核鹼基部分之彼等修飾。In some embodiments, the compositions of the present disclosure comprise mRNA having an open reading frame encoding a coronavirus antigen, wherein the nucleic acid comprises nucleotides that may be standard (unmodified) or modified as known in the art and/or or nucleosides. In some embodiments, the nucleotides and nucleosides of the present disclosure comprise modified nucleotides or nucleosides. Such modified nucleotides and nucleosides can be naturally occurring modified nucleotides and nucleosides or non-naturally occurring modified nucleotides and nucleosides. Such modifications may include such modifications in the sugar, backbone or nucleobase moieties of nucleotides and/or nucleosides, as recognized in the art.

在一些實施例中,本揭示案之天然存在之經修飾之核苷酸或核苷係業內通常已知或公認之核苷酸或核苷。該等天然存在之經修飾之核苷酸及核苷之非限制性實例尤其可在廣泛公認之MODOMICS數據庫中發現。In some embodiments, the naturally-occurring modified nucleotides or nucleosides of the present disclosure are nucleotides or nucleosides generally known or recognized in the art. Non-limiting examples of such naturally occurring modified nucleotides and nucleosides can be found, inter alia, in the widely recognized MODOMICS database.

在一些實施例中,本揭示案之非天然存在之經修飾之核苷酸或核苷係業內通常已知或公認之核苷酸或核苷。該等非天然存在之經修飾之核苷酸及核苷之非限制性實例尤其可參見公開之美國申請案號 PCT/US2012/058519;PCT/US2013/075177;PCT/US2014/058897;PCT/US2014/058891;PCT/US2014/070413;PCT/US2015/36773;PCT/US2015/36759;PCT/US2015/36771;或PCT/IB2017/051367,所有該等申請案皆以引用方式併入本文中。In some embodiments, the non-naturally occurring modified nucleotides or nucleosides of the present disclosure are nucleotides or nucleosides generally known or recognized in the art. Non-limiting examples of such non-naturally occurring modified nucleotides and nucleosides can be found in, inter alia, Published US Application Nos. PCT/US2012/058519; PCT/US2013/075177; PCT/US2014/058897; PCT/US2014 /058891; PCT/US2014/070413; PCT/US2015/36773; PCT/US2015/36759; PCT/US2015/36771; or PCT/IB2017/051367, all of which are incorporated herein by reference.

因此,本揭示案之核酸(例如,DNA核酸及RNA核酸,諸如mRNA核酸)可包含標準核苷酸及核苷、天然存在之核苷酸及核苷、非天然存在之核苷酸及核苷或其任一組合。Accordingly, nucleic acids (eg, DNA nucleic acids and RNA nucleic acids, such as mRNA nucleic acids) of the present disclosure can include standard nucleotides and nucleosides, naturally occurring nucleotides and nucleosides, non-naturally occurring nucleotides and nucleosides or any combination thereof.

在一些實施例中,本揭示案之核酸(例如,DNA核酸及RNA核酸,諸如mRNA核酸)包含多種(一種以上)不同類型之標準及/或經修飾之核苷酸及核苷。在一些實施例中,核酸之特定區含有一種、兩種或更多種(視情况地不同)類型之標準及/或經修飾之核苷酸及核苷。In some embodiments, nucleic acids (eg, DNA nucleic acids and RNA nucleic acids, such as mRNA nucleic acids) of the present disclosure comprise multiple (more than one) different types of standard and/or modified nucleotides and nucleosides. In some embodiments, a particular region of a nucleic acid contains one, two or more (as appropriate) types of standard and/or modified nucleotides and nucleosides.

在一些實施例中,相對於包含標準核苷酸及核苷之未經修飾之核酸,引入細胞或生物體之經修飾之RNA核酸(例如,經修飾之mRNA核酸)分別在細胞或生物體中展現降低之降解。In some embodiments, the modified RNA nucleic acid (eg, modified mRNA nucleic acid) introduced into the cell or organism is in the cell or organism, respectively, relative to the unmodified nucleic acid comprising standard nucleotides and nucleosides Shows reduced degradation.

在一些實施例中,相對於包含標準核苷酸及核苷之未經修飾之核酸,引入細胞或生物體中之經修飾之RNA核酸(例如,經修飾之mRNA核酸)可分別在細胞或生物體中展現降低之免疫原性(例如,降低之先天反應)。In some embodiments, a modified RNA nucleic acid (eg, a modified mRNA nucleic acid) introduced into a cell or organism, relative to an unmodified nucleic acid comprising standard nucleotides and nucleosides, can be used in the cell or organism, respectively. A reduced immunogenicity is exhibited in vivo (eg, a reduced innate response).

在一些實施例中,核酸(例如RNA核酸,諸如mRNA核酸)包含非天然修飾之核苷酸,其在核酸之合成或合成後引入以實現期望功能或性質。修飾可存在於核苷酸間鍵聯、嘌呤或嘧啶鹼基或糖上。修飾可用化學合成或用聚合酶在鏈之末端或鏈中之任何其他地方引入。核酸之任何區可經化學修飾。In some embodiments, nucleic acids (eg, RNA nucleic acids, such as mRNA nucleic acids) comprise non-naturally modified nucleotides that are introduced during or after synthesis of the nucleic acid to achieve a desired function or property. Modifications can exist on internucleotide linkages, purine or pyrimidine bases, or sugars. Modifications can be introduced at the end of the chain or anywhere else in the chain by chemical synthesis or with a polymerase. Any region of a nucleic acid can be chemically modified.

本揭示案提供核酸(例如RNA核酸,諸如mRNA核酸)之經修飾之核苷及核苷酸。「核苷」係指含有糖分子(例如戊醣或核糖)或其衍生物與有機鹼基(例如嘌呤或嘧啶)或其衍生物(在本文中亦稱為「核鹼基」)組合之化合物。「核苷酸」係指包括磷酸基團之核苷。經修飾之核苷酸可藉由任何有用之方法、例如化學、酶促或重組方法合成,以包括一或多個經修飾之或非天然之核苷。核酸可包含連接之核苷之一或多個區。該等區可具有可變之主鏈鍵聯。鍵聯可為標準磷酸二酯鍵聯,在該情形下,核酸將包含核苷酸之區。The present disclosure provides modified nucleosides and nucleotides of nucleic acids (eg, RNA nucleic acids, such as mRNA nucleic acids). "Nucleoside" refers to a compound containing a sugar molecule (eg, pentose or ribose) or a derivative thereof in combination with an organic base (eg, purine or pyrimidine) or a derivative thereof (also referred to herein as a "nucleobase") . "Nucleotide" refers to a nucleoside that includes a phosphate group. Modified nucleotides can be synthesized by any useful method, such as chemical, enzymatic or recombinant methods, to include one or more modified or non-natural nucleosides. Nucleic acids may comprise one or more regions of linked nucleosides. The regions may have variable backbone linkages. The linkage can be a standard phosphodiester linkage, in which case the nucleic acid will comprise a region of nucleotides.

經修飾之核苷酸鹼基配對不僅涵蓋標準腺苷-胸腺嘧啶、腺苷-尿嘧啶或鳥苷-胞嘧啶鹼基對,且亦包括在核苷酸及/或包含非標準或經修飾之鹼基之經修飾之核苷酸之間形成之鹼基對,其中氫鍵供體及氫鍵受體之排列允許在非標準鹼基與標準鹼基之間或在兩個互補非標準鹼基結構之間、例如在具有至少一個化學修飾之彼等核酸中之氫鍵。該非標準鹼基配對之一個實例係經修飾之核苷酸肌苷與腺嘌呤、胞嘧啶或尿嘧啶之間之鹼基配對。鹼基/糖或連接體之任何組合可納入本揭示案之核酸中。Modified nucleotide base pairings encompass not only standard adenosine-thymine, adenosine-uracil, or guanosine-cytosine base pairs, but also include nucleotides and/or include non-standard or modified base pairs. A base pair formed between modified nucleotides of bases in which the hydrogen bond donor and hydrogen bond acceptor are arranged to allow between a non-standard base and a standard base or between two complementary non-standard bases Hydrogen bonds between structures, such as in those nucleic acids with at least one chemical modification. An example of such non-standard base pairing is the base pairing between the modified nucleotides inosine and adenine, cytosine or uracil. Any combination of bases/sugars or linkers can be incorporated into the nucleic acids of the present disclosure.

在一些實施例中,核酸(例如RNA核酸,諸如mRNA核酸)中經修飾之核鹼基包含1-甲基-假尿苷(m1ψ)、1-乙基-假尿苷(e1ψ)、5-甲氧基-尿苷(mo5U)、5-甲基-胞苷(m5C)及/或假尿苷(ψ)。在一些實施例中,核酸(例如RNA核酸,諸如mRNA核酸)中之經修飾之核鹼基包含5-甲氧基甲基尿苷、5-甲硫基尿苷、1-甲氧基甲基假尿苷、5-甲基胞苷及/或5-甲氧基胞苷。在一些實施例中,多核糖核苷酸包括至少兩種(例如,2、3、4或更多種)任何上文所提及之經修飾之核鹼基之組合,包括但不限於化學修飾。In some embodiments, the modified nucleobases in nucleic acids (eg, RNA nucleic acids, such as mRNA nucleic acids) comprise 1-methyl-pseudouridine (m1ψ), 1-ethyl-pseudouridine (e1ψ), 5- Methoxy-uridine (mo5U), 5-methyl-cytidine (m5C) and/or pseudouridine (ψ). In some embodiments, the modified nucleobases in nucleic acids (eg, RNA nucleic acids, such as mRNA nucleic acids) comprise 5-methoxymethyluridine, 5-methylthiouridine, 1-methoxymethyl Pseudouridine, 5-methylcytidine and/or 5-methoxycytidine. In some embodiments, the polyribonucleotide includes a combination of at least two (eg, 2, 3, 4, or more) any of the above-mentioned modified nucleobases, including but not limited to chemical modifications .

在一些實施例中,本揭示案之mRNA包含在核酸之一或多個或所有尿苷位置之1-甲基-假尿苷(m1ψ)取代。In some embodiments, the mRNA of the present disclosure comprises a 1-methyl-pseudouridine (m1ψ) substitution at one or more or all uridine positions of the nucleic acid.

在一些實施例中,本揭示案之mRNA包含在核酸之一或多個或所有尿苷位置之1-甲基-假尿苷(m1ψ)取代及在核酸之一或多個或所有胞苷位置之5-甲基胞苷取代。In some embodiments, the mRNA of the present disclosure comprises a 1-methyl-pseudouridine (m1ψ) substitution at one or more or all uridine positions of the nucleic acid and at one or more or all cytidine positions of the nucleic acid 5-methylcytidine substitution.

在一些實施例中,本揭示案之mRNA包含在核酸之一或多個或所有尿苷位置之假尿苷(ψ)取代。In some embodiments, the mRNAs of the present disclosure comprise pseudouridine (ψ) substitutions at one or more or all uridine positions of the nucleic acid.

在一些實施例中,本揭示案之mRNA包含在核酸之一或多個或所有尿苷位置之假尿苷(ψ)取代及在核酸之一或多個或所有胞苷位置之5-甲基胞苷取代。In some embodiments, mRNAs of the present disclosure comprise pseudouridine (ψ) substitutions at one or more or all uridine positions of the nucleic acid and a 5-methyl group at one or more or all cytidine positions of the nucleic acid Cytidine substitution.

在一些實施例中,本揭示案之mRNA包含在核酸之一或多個或所有尿苷位置之尿苷。In some embodiments, the mRNA of the present disclosure comprises uridine at one or more or all uridine positions of the nucleic acid.

在一些實施例中,對於特定修飾,對mRNA進行均勻修飾(例如,完全修飾、在整個序列中修飾)。舉例而言,核酸可用1-甲基-假尿苷均勻修飾,此意味著mRNA序列中之所有尿苷殘基皆用1-甲基-假尿苷置換。類似地,藉由用經修飾之殘基(諸如上文所闡釋之彼等殘基)置換,可針對序列中存在之任何類型之核苷殘基對核酸進行均勻修飾。In some embodiments, for a particular modification, the mRNA is modified uniformly (eg, fully modified, modified throughout the sequence). For example, nucleic acids can be uniformly modified with 1-methyl-pseudouridine, which means that all uridine residues in the mRNA sequence are replaced with 1-methyl-pseudouridine. Similarly, nucleic acids can be uniformly modified for any type of nucleoside residue present in the sequence by replacement with modified residues, such as those explained above.

本揭示案之核酸可沿著分子之整個長度經部分或完全修飾。舉例而言,在本揭示案之核酸中,或在其預定序列區中(例如,在包括或不包括聚(A)尾之mRNA中),可均勻地修飾一或多種或所有或給定類型之核苷酸(例如,嘌呤或嘧啶,或任一或多種或所有A、G、U、C)。在一些實施例中,本揭示案之核酸中(或其序列區中)之所有核苷酸X皆係經修飾之核苷酸,其中X可為核苷酸A、G、U、C中之任一者,或組合A+G、A+U、A+C、G+U、G+C、U+C、A+G+U、A+G+C、G+U+C或A+G+C中之任一者。Nucleic acids of the present disclosure can be partially or fully modified along the entire length of the molecule. For example, in a nucleic acid of the present disclosure, or in a predetermined sequence region thereof (eg, in an mRNA that includes or does not include a poly(A) tail), one or more or all or a given type can be modified uniformly nucleotides (eg, purines or pyrimidines, or any or more or all of A, G, U, C). In some embodiments, all nucleotides X in the nucleic acids of the present disclosure (or in their sequence regions) are modified nucleotides, wherein X can be one of nucleotides A, G, U, C Any, or a combination of A+G, A+U, A+C, G+U, G+C, U+C, A+G+U, A+G+C, G+U+C, or A+ Any of G+C.

核酸可含有約1%至約100%之經修飾之核苷酸(相對於總核苷酸含量,或相對於一或多種類型之核苷酸,亦即A、G、U或C中之任一或多者)或任何居間百分比(例如,1%至20%、1%至25%、1%至50%、1%至60%、1%至70%、1%至80%、1%至90%、1%至95%、10%至20%、10%至25%、10%至50%、10%至60%、10%至70%、10%至80%、10%至90%、10%至95%、10%至100%、20%至25%、20%至50%、20%至60%、20%至70%、20%至80%、20%至90%、20%至95%、20%至100%、50%至60%、50%至70%、50%至80%、50%至90%、50%至95%、50%至100%、70%至80%、70%至90%、70%至95%、70%至100%、80%至90%、80%至95%、80%至100%、90%至95%、90%至100%及95%至100%)。應理解,未經修飾之A、G、U或C之存在佔任何其餘百分數。Nucleic acids may contain from about 1% to about 100% modified nucleotides (relative to total nucleotide content, or to one or more types of nucleotides, ie, any of A, G, U, or C). one or more) or any intermediate percentage (eg, 1% to 20%, 1% to 25%, 1% to 50%, 1% to 60%, 1% to 70%, 1% to 80%, 1% to 90%, 1% to 95%, 10% to 20%, 10% to 25%, 10% to 50%, 10% to 60%, 10% to 70%, 10% to 80%, 10% to 90% %, 10% to 95%, 10% to 100%, 20% to 25%, 20% to 50%, 20% to 60%, 20% to 70%, 20% to 80%, 20% to 90%, 20% to 95%, 20% to 100%, 50% to 60%, 50% to 70%, 50% to 80%, 50% to 90%, 50% to 95%, 50% to 100%, 70% to 80%, 70% to 90%, 70% to 95%, 70% to 100%, 80% to 90%, 80% to 95%, 80% to 100%, 90% to 95%, 90% to 100 % and 95% to 100%). It should be understood that the presence of unmodified A, G, U or C accounts for any remaining percentages.

mRNA可含有最少1%且最多100%之經修飾之核苷酸或任何居間百分數,諸如至少5%之經修飾之核苷酸、至少10%之經修飾之核苷酸、至少25%之經修飾之核苷酸、至少50%之經修飾之核苷酸、至少80%之經修飾之核苷酸或至少90%之經修飾之核苷酸。舉例而言,核酸可含有經修飾之嘧啶,諸如經修飾之尿嘧啶或胞嘧啶。在一些實施例中,核酸中至少5%、至少10%、至少25%、至少50%、至少80%、至少90%或100%之尿嘧啶由經修飾之尿嘧啶(例如5-取代之尿嘧啶)置換。經修飾之尿嘧啶可由具有單一獨特結構之化合物置換,或可由具有不同結構(例如,2、3、4或更多種獨特結構)之複數種化合物置換。在一些實施例中,核酸中至少5%、至少10%、至少25%、至少50%、至少80%、至少90%或100%之胞嘧啶由經修飾之胞嘧啶(例如5-取代之胞嘧啶)置換。經修飾之胞嘧啶可由具有單一獨特結構之化合物置換,或可由具有不同結構(例如,2、3、4或更多種獨特結構)之複數種化合物置換。 非轉譯區 (UTR) mRNA may contain a minimum of 1% and a maximum of 100% modified nucleotides or any intermediate percentages, such as at least 5% modified nucleotides, at least 10% modified nucleotides, at least 25% modified nucleotides Modified nucleotides, at least 50% modified nucleotides, at least 80% modified nucleotides, or at least 90% modified nucleotides. For example, nucleic acids can contain modified pyrimidines, such as modified uracil or cytosine. In some embodiments, at least 5%, at least 10%, at least 25%, at least 50%, at least 80%, at least 90%, or 100% of the uracils in the nucleic acid consist of modified uracils (eg, 5-substituted uracils pyrimidine) replacement. The modified uracil can be replaced by a compound having a single unique structure, or can be replaced by a plurality of compounds having different structures (eg, 2, 3, 4, or more unique structures). In some embodiments, at least 5%, at least 10%, at least 25%, at least 50%, at least 80%, at least 90%, or 100% of the cytosines in the nucleic acid are modified cytosines (eg, 5-substituted cytosines) pyrimidine) replacement. Modified cytosines can be replaced by compounds with a single unique structure, or can be replaced by multiple compounds with different structures (eg, 2, 3, 4, or more unique structures). Untranslated Region (UTR)

本揭示案之mRNA可包含一或多個充當或用作非轉譯區之區或部分。當mRNA經設計為編碼至少一種相關抗原時,核酸可包含該等非轉譯區(UTR)中之一或多者。核酸之野生型非轉譯區經轉錄,而非經轉譯。在mRNA中,5' UTR在轉錄起始位點開始,且繼續至起始密碼子,但不包括起始密碼子;而3' UTR在終止密碼子之後立即開始,且繼續至轉錄終止信號。愈來愈多之證據表明UTR在核酸分子之穩定性及轉譯方面所起之調控作用。UTR之調控特徵可納入本揭示案之聚核苷酸中,以尤其增强分子之穩定性。亦可納入特定特徵以確保在轉錄物被錯誤導向不期望器官部位之情形下轉錄物之受控下調。多種5' UTR及3' UTR序列為業內已知及可獲得的。The mRNAs of the present disclosure may comprise one or more regions or portions that serve or function as untranslated regions. When the mRNA is designed to encode at least one relevant antigen, the nucleic acid may comprise one or more of these untranslated regions (UTRs). The wild-type untranslated region of the nucleic acid is transcribed, not translated. In mRNA, the 5'UTR begins at the transcription initiation site and continues to, but not including, the initiation codon; while the 3'UTR begins immediately after the stop codon and continues to the transcription termination signal. There is increasing evidence that UTRs play a regulatory role in the stability and translation of nucleic acid molecules. Regulatory features of UTRs can be incorporated into the polynucleotides of the present disclosure to, inter alia, enhance the stability of the molecule. Specific features may also be incorporated to ensure controlled downregulation of transcripts in the event that they are misdirected to undesired organ sites. Various 5'UTR and 3'UTR sequences are known and available in the art.

5ꞌ UTR係mRNA之直接在起始密碼子(核糖體轉譯之mRNA轉錄物之第一個密碼子)上游(5ꞌ)之區。5ꞌ UTR不編碼蛋白質(係非編碼的)。天然5' UTR具有在轉譯起始中起作用之特徵。其具有如Kozak序列之記號,通常已知Kozak序列參與核糖體起始許多基因轉譯之過程。Kozak序列具有共有CCR(A/G)CCAUGG (SEQ ID NO: 128),其中R係起始密碼子(AUG)上游三個鹼基處之嘌呤(腺嘌呤或鳥嘌呤),起始密碼子(AUG)後接另一『G』。亦已知5' UTR形成參與延伸因子結合之二級結構。The 5ꞌ UTR is the region of the mRNA immediately upstream (5ꞌ) of the initiation codon (the first codon of the ribosome-translated mRNA transcript). 5ꞌ UTR does not encode protein (it is non-coding). The native 5'UTR is characterized by a role in translation initiation. It has a signature like the Kozak sequence, which is generally known to be involved in the process by which the ribosome initiates the translation of many genes. The Kozak sequence has a consensus CCR(A/G)CCAUGG (SEQ ID NO: 128), where R is a purine (adenine or guanine) three bases upstream of the initiation codon (AUG), and the initiation codon ( AUG) followed by another "G". The 5'UTR is also known to form secondary structures involved in elongation factor binding.

在本揭示案之一些實施例中,5' UTR係異源UTR,亦即,係在自然界中發現之與不同ORF相關之UTR。在另一實施例中,5' UTR係合成UTR,亦即,自然界中不出現。合成之UTR包括已經突變以改善其性質之UTR,例如,增加基因表現之UTR,以及完全合成之UTR。示例性5' UTR包括非洲爪蟾屬(Xenopus)或人類源a-球蛋白或b-球蛋白(8278063;9012219)、人類細胞色素b-245a多肽及羥基類固醇(17b)去氫酶及烟草蝕紋病毒(US8278063, 9012219)。亦可使用CMV立即早期1 (IE1)基因(US20140206753、WO2013/185069)、序列GGGAUCCUACC (SEQ ID NO: 129) (WO2014144196)。在另一實施例中,TOP基因之5' UTR係缺乏5' TOP基元(寡嘧啶束)之TOP基因之5' UTR (例如,WO/2015101414、WO2015101415、WO/2015/062738、WO2015024667、WO2015024667);可使用源自核糖體蛋白大32 (L32)基因之5' UTR元件(WO/2015101414、WO2015101415、WO/2015/062738)、源自羥基類固醇(17-β)去氫酶4基因(HSD17B4)之5'UTR之5' UTR元件(WO2015024667)或源自ATP5A1之5' UTR之5' UTR元件(WO2015024667)。在一些實施例中,使用內部核糖體進入位點(IRES)代替5' UTR。In some embodiments of the present disclosure, the 5' UTR is a heterologous UTR, ie, a UTR that is found in nature to be associated with a different ORF. In another embodiment, the 5' UTR is a synthetic UTR, ie, does not occur in nature. Synthetic UTRs include UTRs that have been mutated to improve their properties, eg, UTRs that increase gene expression, as well as fully synthetic UTRs. Exemplary 5' UTRs include Xenopus or human derived a-globulin or b-globin (8278063; 9012219), human cytochrome b-245a polypeptide and hydroxysteroid (17b) dehydrogenase and tobacco etch Streak virus (US8278063, 9012219). The CMV immediate early 1 (IE1) gene (US20140206753, WO2013/185069), sequence GGGAUCCUACC (SEQ ID NO: 129) (WO2014144196) can also be used. In another embodiment, the 5' UTR of the TOP gene is the 5' UTR of the TOP gene lacking the 5' TOP motif (oligopyrimidine tract) (eg, WO/2015101414, WO2015101415, WO/2015/062738, WO2015024667, WO2015024667 ); the 5' UTR element derived from the ribosomal protein large 32 (L32) gene (WO/2015101414, WO2015101415, WO/2015/062738), the hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4) can be used ) of the 5' UTR element (WO2015024667) or the 5' UTR element derived from the 5' UTR of ATP5A1 (WO2015024667). In some embodiments, an internal ribosome entry site (IRES) is used in place of the 5' UTR.

在一些實施例中,本揭示案之5' UTR包含選自SEQ ID NO: 131及SEQ ID NO: 2之序列。In some embodiments, the 5' UTR of the present disclosure comprises a sequence selected from the group consisting of SEQ ID NO: 131 and SEQ ID NO: 2.

3ꞌ UTR係mRNA之直接在終止密碼子(mRNA轉錄物之發出轉譯終止信號之密碼子)下游(3ꞌ)之區。3ꞌ UTR不編碼蛋白質(係非編碼的)。已知天然或野生型3 ' UTR具有嵌入其中之腺苷及尿苷之段。該等富AU之記號在具有高周轉率之基因中特別普遍。富AU之元件(ARE)基於其序列特徵及功能特性可分成三類(Chen等人,1995):I類ARE在富U之區內含有AUUUA基元之若干分散之拷貝。C-Myc及MyoD含有I類ARE。II類ARE具有兩個或更多個重疊UUAUUUA(U/A)(U/A) (SEQ ID NO: 130)九聚體。含有此種類型之ARE之分子包括GM-CSF及TNF-α。III類ARE定義不太明確。該等富U之區不含AUUUA基元。c-Jun及肌細胞生成素係此類別之兩個充分研究之實例。已知大多數結合至ARE之蛋白質使信使不穩定,而據記載ELAV家族之成員(最著名的是HuR)增加mRNA之穩定性。HuR結合至所有三個類別之ARE。將HuR特異性結合位點改造至核酸分子之3 'UTR中將導致HuR結合,且因此導致活體內信使之穩定。The 3ꞌ UTR is the region of the mRNA immediately downstream (3ꞌ) of the stop codon (the codon that signals the termination of translation of the mRNA transcript). 3ꞌ UTRs do not encode proteins (are non-coding). Natural or wild-type 3'UTRs are known to have stretches of adenosine and uridine embedded therein. These AU-rich markers are particularly prevalent in genes with high turnover. AU-rich elements (AREs) can be divided into three classes based on their sequence characteristics and functional properties (Chen et al., 1995): Class I AREs contain scattered copies of the AUUUA motif within the U-rich region. C-Myc and MyoD contain class I AREs. Class II AREs have two or more overlapping UUAUUUA(U/A)(U/A) (SEQ ID NO: 130) nonamers. Molecules containing AREs of this type include GM-CSF and TNF-alpha. Class III AREs are less well-defined. These U-rich regions do not contain AUUUA primitives. c-Jun and myogenin are two well-studied examples of this class. Most proteins that bind to AREs are known to destabilize messengers, while members of the ELAV family, most notably HuR, have been documented to increase mRNA stability. HuR binds to all three classes of AREs. Engineering a HuR-specific binding site into the 3'UTR of a nucleic acid molecule will result in HuR binding and thus in vivo messenger stabilization.

3' UTR富AU元件(ARE)之引入、去除或修飾可用於調節本揭示案之核酸(例如RNA)之穩定性。當工程改造特定核酸時,可引入ARE之一或多個拷貝以使本揭示案之核酸更不穩定,從而縮減轉譯且减少所得蛋白質之產生。同樣,可鑑別及去除或突變ARE以增加細胞內穩定性,從而增加所得蛋白質之轉譯及產生。可在相關細胞株中使用本揭示案之核酸執行轉染實驗,且可在轉染後之不同時間點分析蛋白質產生。舉例而言,可用不同ARE工程改造分子,且藉由對相關蛋白質使用ELISA套組且分析在轉染後6小時、12小時、24小時、48小時及7天產生之蛋白質,來轉染細胞。Introduction, removal or modification of 3'UTR AU-rich elements (AREs) can be used to modulate the stability of nucleic acids (eg, RNAs) of the present disclosure. When engineering a particular nucleic acid, one or more copies of the ARE can be introduced to make the nucleic acid of the present disclosure more unstable, thereby curtailing translation and reducing the production of the resulting protein. Likewise, AREs can be identified and removed or mutated to increase intracellular stability, thereby increasing translation and production of the resulting protein. Transfection experiments using the nucleic acids of the present disclosure can be performed in relevant cell lines, and protein production can be analyzed at various time points after transfection. For example, the molecules can be engineered with different AREs and cells can be transfected by using an ELISA kit for the relevant protein and analyzing the protein produced at 6 hours, 12 hours, 24 hours, 48 hours and 7 days after transfection.

3' UTR可為異源的或合成的。關於3' UTR,球蛋白UTR (包括非洲爪蟾屬β-球蛋白UTR及人類β-球蛋白UTR)為業內已知(8278063、9012219、US20110086907)。藉由首尾相連地選殖兩個連續之人類β-球蛋白3' UTR,已開發在一些細胞類型中具有增强之穩定性之經修飾之β-球蛋白構築體,且其為業內所熟知(US2012/0195936、WO2014/071963)。另外,a2-球蛋白、a1-球蛋白、UTR及其突變體亦為業內已知(WO2015101415、WO2015024667)。在非專利文獻中mRNA構築體中闡述之其他3' UTR包括CYBA (Ferizi等人,2015)及白蛋白(Thess等人,2015)。其他示例性3' UTR包括牛或人類生長激素(野生型或經修飾) (WO2013/185069、US20140206753、WO2014152774)、兔β球蛋白及B型肝炎病毒(HBV)之3′ UTR,α-球蛋白3' UTR及病毒VEEV 3' UTR序列亦為業內已知。在一些實施例中,使用序列UUUGAAUU (WO2014144196)。在一些實施例中,使用人類及小鼠核糖體蛋白之3 'UTR。其他實例包括rps9 3'UTR (WO2015101414)、FIG4 (WO2015101415)及人類白蛋白7 (WO2015101415)。The 3' UTR can be heterologous or synthetic. Regarding 3' UTRs, globulin UTRs (including Xenopus β-globin UTR and human β-globin UTR) are known in the art (8278063, 9012219, US20110086907). By colonizing two consecutive human β-globin 3' UTRs end-to-end, modified β-globin constructs with enhanced stability in some cell types have been developed and are well known in the art ( US2012/0195936, WO2014/071963). In addition, a2-globulin, a1-globulin, UTR and mutants thereof are also known in the art (WO2015101415, WO2015024667). Other 3' UTRs described in mRNA constructs in the non-patent literature include CYBA (Ferizi et al., 2015) and albumin (Thess et al., 2015). Other exemplary 3' UTRs include bovine or human growth hormone (wild type or modified) (WO2013/185069, US20140206753, WO2014152774), rabbit beta globulin and the 3' UTR of hepatitis B virus (HBV), alpha-globulin 3' UTR and viral VEEV 3' UTR sequences are also known in the art. In some embodiments, the sequence UUUGAAUU (WO2014144196) is used. In some embodiments, the 3'UTRs of human and mouse ribosomal proteins are used. Other examples include rps9 3'UTR (WO2015101414), FIG4 (WO2015101415) and human albumin 7 (WO2015101415).

在一些實施例中,本揭示案之3ꞌ UTR包含選自SEQ ID NO: 132及SEQ ID NO: 4之序列。In some embodiments, the 3ꞌ UTR of the present disclosure comprises a sequence selected from the group consisting of SEQ ID NO: 132 and SEQ ID NO: 4.

熟習此項技術者將理解,異源或合成之5' UTR可與任何期望之3' UTR序列一起使用。舉例而言,異源5' UTR可與合成3' UTR或異源3'' UTR一起使用。Those skilled in the art will understand that a heterologous or synthetic 5'UTR can be used with any desired 3'UTR sequence. For example, a heterologous 5' UTR can be used with a synthetic 3' UTR or a heterologous 3' UTR.

非UTR序列亦可用作核酸內之區或亞區。舉例而言,內含子或一部分內含子序列可納入本揭示案之核酸之區中。納入內含子序列可增加蛋白質產生及核酸含量。Non-UTR sequences can also be used as regions or subregions within a nucleic acid. For example, introns or portions of intron sequences can be incorporated into regions of the nucleic acids of the present disclosure. Incorporation of intron sequences can increase protein production and nucleic acid content.

特徵之組合可包括在側翼區中且可包含在其他特徵內。舉例而言,ORF可以側接5' UTR,其可含有强Kozak轉譯起始信號;及/或3' UTR,其可包括用於作為模板添加聚A尾之寡(dT)序列。5' UTR可包含來自相同及/或不同基因之第一聚核苷酸片段及第二聚核苷酸片段,諸如美國專利申請公開案號20100293625及PCT/US2014/069155中所述之5' UTR,該等公開案以全文引用之方式併入本文中。Combinations of features may be included in flanking regions and may be included within other features. For example, the ORF can be flanked by a 5' UTR, which can contain a strong Kozak translation initiation signal; and/or a 3' UTR, which can include an oligo (dT) sequence for adding a poly-A tail as a template. The 5'UTR may comprise a first polynucleotide fragment and a second polynucleotide fragment from the same and/or different genes, such as the 5'UTRs described in US Patent Application Publication No. 20100293625 and PCT/US2014/069155 , these publications are incorporated herein by reference in their entirety.

應理解,來自任何基因之任何UTR均可納入核酸之區中。此外,可利用任何已知基因之多個野生型UTR。本揭示案之範圍內亦提供不為野生型區之變異體之人工UTR。該等UTR或其部分可以與選出該等UTR之轉錄物相同之方向放置,或可改變方向或位置。因此,5'或3' UTR可倒置、縮短、延長,與一或多個其他5' UTR或3' UTR一起製備。如本文之術語「改變之」用在關於UTR序列時,意指相對於參考序列,UTR已經以某種方式改變。舉例而言,3′ UTR或5′ UTR可藉由如上文教示之方向或位置之改變而相對於野生型或天然UTR改變,或可藉由包括額外核苷酸、缺失核苷酸、交換或轉座核苷酸而改變。產生「改變之」UTR (無論3'或5 ')之該等改變中之任一者包含變異體UTR。It is understood that any UTR from any gene can be incorporated into a region of nucleic acid. In addition, multiple wild-type UTRs for any known gene can be utilized. Artificial UTRs that are not variants of the wild-type region are also provided within the scope of this disclosure. The UTRs or portions thereof may be placed in the same orientation as the transcripts from which the UTRs were selected, or the orientation or position may be altered. Thus, a 5' or 3' UTR can be inverted, shortened, elongated, and prepared with one or more other 5' UTRs or 3' UTRs. The term "altered" as used herein in reference to a UTR sequence means that the UTR has been altered in some way relative to the reference sequence. For example, a 3' UTR or 5' UTR can be altered relative to a wild-type or native UTR by a change in orientation or position as taught above, or by the inclusion of additional nucleotides, deletions of nucleotides, exchanges or change by transposition of nucleotides. Any of these changes that result in an "altered" UTR (whether 3' or 5') comprises a variant UTR.

在一些實施例中,可使用雙重、三重或四重UTR,例如5' UTR或3' UTR。如本文所用之「雙重」UTR係其中串聯或實質上串聯編碼相同UTR之兩個拷貝之UTR。舉例而言,可使用雙重β-球蛋白3' UTR,如美國專利公開案20100129877中所述,該專利之全部內容以引用方式併入本文中。In some embodiments, double, triple or quadruple UTRs may be used, eg, 5' UTRs or 3' UTRs. A "double" UTR as used herein is a UTR in which two copies of the same UTR are encoded in tandem or substantially in tandem. For example, dual beta-globin 3' UTRs can be used, as described in US Patent Publication 20100129877, which is incorporated herein by reference in its entirety.

具有模式化UTR亦在本揭示案之範圍內。如本文所用之「模式化UTR」係反映重複或交替模式之彼等UTR,諸如重複一次、兩次或超過3次之ABABAB或AABBAABBAABB或ABCABCABC或其變異體。在該等模式中,每個字母A、B或C代表核苷酸水準之不同UTR。It is also within the scope of this disclosure to have a patterned UTR. As used herein, "patterned UTRs" reflect repeating or alternating patterns of those UTRs, such as ABABAB or AABBAABBAABB or ABCABCABC or variants thereof repeated once, twice, or more than three times. In these patterns, each letter A, B or C represents a different UTR at the nucleotide level.

在一些實施例中,側翼區選自其蛋白質共享共同功能、結構、特徵或性質之轉錄物家族。舉例而言,相關多肽可屬於在特定細胞、組織中或在發育期間之某個時間表現之蛋白質家族。來自該等基因中之任一者之UTR可交換為相同或不同蛋白質家族之任何其他UTR,以產生新的聚核苷酸。如本文所用之「蛋白質家族」以最廣泛之意義使用,指共享至少一種功能、結構、特徵、定位、起源或表現模式之一組兩種或更多種相關多肽。In some embodiments, the flanking regions are selected from a family of transcripts whose proteins share a common function, structure, characteristic or property. For example, related polypeptides may belong to a family of proteins that are expressed in a particular cell, tissue, or at some point during development. UTRs from any of these genes can be exchanged for any other UTRs of the same or different protein families to generate new polynucleotides. A "family of proteins" as used herein is used in the broadest sense to refer to a group of two or more related polypeptides that share at least one function, structure, characteristic, localization, origin, or mode of expression.

非轉譯區亦可包括轉譯增强子元件(TEE)。作為非限制性實例,TEE可包括美國申請案號20090226470中闡述之彼等TEE及業內已知之彼等TEE,該專利之全文以引用方式併入本文中。 RNA 之活體外轉錄 Untranslated regions may also include translational enhancer elements (TEEs). By way of non-limiting example, TEEs may include those TEEs set forth in US Application No. 20090226470 and those TEEs known in the art, incorporated herein by reference in its entirety. In vitro transcription of RNA

編碼本文所述之聚核苷酸之cDNA可使用活體外轉錄(IVT)系統轉錄。RNA之活體外轉錄為業內已知,且闡述於國際公開案WO 2014/152027中,該公開案之全文以引用方式併入本文中。在一些實施例中,本揭示案之RNA係根據WO 2018/053209及WO 2019/036682中闡述之任一或多種方法製備,該等專利各自以引用方式併入本文中。cDNAs encoding the polynucleotides described herein can be transcribed using an in vitro transcription (IVT) system. In vitro transcription of RNA is known in the art and described in International Publication WO 2014/152027, which is incorporated herein by reference in its entirety. In some embodiments, the RNAs of the present disclosure are prepared according to any one or more of the methods set forth in WO 2018/053209 and WO 2019/036682, each of which is incorporated herein by reference.

在一些實施例中,RNA轉錄物係使用非擴增之綫性化DNA模板在活體外轉錄反應中產生,以產生RNA轉錄物。在一些實施例中,模板DNA係分離之DNA。在一些實施例中,模板DNA係cDNA。在一些實施例中,cDNA藉由mRNA (例如但不限於冠狀病毒mRNA)之反轉錄形成。在一些實施例中,用質體DNA模板轉染細胞,例如細菌細胞,例如大腸杆菌,例如DH-1細胞。在一些實施例中,培養轉染之細胞以複製質體DNA,接著分離及純化該質體DNA。在一些實施例中,DNA模板包括RNA聚合酶啓動子,例如位於相關基因5'且與其可操作連接之T7啓動子。In some embodiments, RNA transcripts are produced in an in vitro transcription reaction using a non-amplified linearized DNA template to generate RNA transcripts. In some embodiments, the template DNA is isolated DNA. In some embodiments, the template DNA is cDNA. In some embodiments, cDNA is formed by reverse transcription of mRNA, such as, but not limited to, coronavirus mRNA. In some embodiments, cells, eg, bacterial cells, eg, E. coli, eg, DH-1 cells, are transfected with a plastid DNA template. In some embodiments, the transfected cells are cultured to replicate plastid DNA, followed by isolation and purification of the plastid DNA. In some embodiments, the DNA template includes an RNA polymerase promoter, such as the T7 promoter located 5' to and operably linked to the gene of interest.

在一些實施例中,活體外轉錄模板編碼5'非轉譯(UTR)區,含有開放閱讀框,且編碼3' UTR及聚(A)尾。活體外轉錄模板之特定核酸序列組成及長度將取决於模板編碼之mRNA。In some embodiments, the in vitro transcription template encodes a 5' untranslated (UTR) region, contains an open reading frame, and encodes a 3' UTR and a poly(A) tail. The specific nucleic acid sequence composition and length of an in vitro transcribed template will depend on the mRNA encoded by the template.

「5'非轉譯區」 (UTR)係指不編碼多肽之mRNA之直接位於起始密碼子(亦即,核糖體轉譯之mRNA轉錄物之第一個密碼子)上游(亦即,5')之區。當產生RNA轉錄物時,5' UTR可包含啓動子序列。該等啓動子序列為業內已知。應理解,該等啓動子序列將不存在於本揭示案之疫苗中。A "5' untranslated region" (UTR) refers to the mRNA that does not encode a polypeptide that is directly upstream (ie, 5') of the initiation codon (ie, the first codon of a ribosome-translated mRNA transcript). area. When producing RNA transcripts, the 5' UTR may contain a promoter sequence. Such promoter sequences are known in the art. It will be understood that such promoter sequences will not be present in the vaccines of the present disclosure.

「3'非轉譯區」 (UTR)係指不編碼多肽之mRNA之直接位於終止密碼子(亦即,發出轉譯終止信號之mRNA轉錄物之密碼子)下游(亦即,3')之區。A "3' untranslated region" (UTR) refers to the region of an mRNA that does not encode a polypeptide that is immediately downstream (ie, 3') of a stop codon (ie, the codon of the mRNA transcript that signals translation termination).

「開放閱讀框」係以起始密碼子(例如,甲硫胺酸(ATG))開始且以終止密碼子(例如,TAA、TAG或TGA)結束之DNA之連續段,且編碼多肽。An "open reading frame" is a contiguous stretch of DNA that begins with a start codon (eg, methionine (ATG)) and ends with a stop codon (eg, TAA, TAG, or TGA), and encodes a polypeptide.

「聚(A)尾」係mRNA之位於含有多個連續單磷酸腺苷之3' UTR之下游、例如直接下游(亦即,3')的區。聚A尾可含有10至300個單磷酸腺苷。舉例而言,聚(A)尾可含有10、20、30、40、50、60、70、80、90、100、110、120、130、140、150、160、170、180、190、200、210、220、230、240、250、260、270、280、290或300個單磷酸腺苷。在一些實施例中,聚(A)尾含有50至250個單磷酸腺苷。在相關生物學環境中(例如,在細胞中,活體內),聚(A)尾用於保護mRNA免於例如在細胞質中之酶促降解,且幫助轉錄終止,及/或mRNA自細胞核輸出及轉譯。A "poly(A) tail" is a region of an mRNA that is located downstream, eg, immediately downstream (ie, 3') of the 3' UTR containing multiple consecutive adenosine monophosphates. Poly A tails can contain from 10 to 300 adenosine monophosphates. For example, poly(A) tails can contain 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200 , 210, 220, 230, 240, 250, 260, 270, 280, 290 or 300 adenosine monophosphates. In some embodiments, the poly(A) tail contains 50 to 250 adenosine monophosphates. In relevant biological contexts (eg, in cells, in vivo), poly(A) tails are used to protect mRNA from enzymatic degradation, eg, in the cytoplasm, and to aid in transcription termination, and/or mRNA export from the nucleus and translate.

在一些實施例中,核酸包括200至3,000個核苷酸。舉例而言,核酸可包括200至500、200至1000、200至1500、200至3000、500至1000、500至1500、500至2000、500至3000、1000至1500、1000至2000、1000至3000、1500至3000或2000至3000個核苷酸。In some embodiments, the nucleic acid includes 200 to 3,000 nucleotides. For example, the nucleic acid can include 200 to 500, 200 to 1000, 200 to 1500, 200 to 3000, 500 to 1000, 500 to 1500, 500 to 2000, 500 to 3000, 1000 to 1500, 1000 to 2000, 1000 to 3000 , 1500 to 3000 or 2000 to 3000 nucleotides.

活體外轉錄系統通常包括轉錄緩衝液、核苷酸三磷酸(NTP)、RNA酶抑制劑及聚合酶。In vitro transcription systems typically include transcription buffers, nucleotide triphosphates (NTPs), RNase inhibitors, and polymerases.

NTP可在內部製造,可選自供應商,或可如本文所述合成。NTP可選自但不限於本文所述之彼等NTP,包括天然及非天然(經修飾之) NTP。NTPs can be manufactured in-house, can be selected from suppliers, or can be synthesized as described herein. NTPs can be selected from, but are not limited to, those NTPs described herein, including natural and non-natural (modified) NTPs.

任何數量之RNA聚合酶或變異體可用於本揭示案之方法中。聚合酶可選自但不限於噬菌體RNA聚合酶,例如T7 RNA聚合酶、T3 RNA聚合酶、SP6 RNA聚合酶及/或突變型聚合酶,諸如但不限於能够納入經修飾之核酸及/或經修飾之核苷酸(包括化學修飾之核酸及/或核苷酸)之聚合酶。一些實施例排除使用DNA酶。Any number of RNA polymerases or variants can be used in the methods of the present disclosure. The polymerase may be selected from, but not limited to, bacteriophage RNA polymerases, such as T7 RNA polymerase, T3 RNA polymerase, SP6 RNA polymerase, and/or mutant polymerases, such as, but not limited to, capable of incorporating modified nucleic acids and/or modified nucleic acids. Polymerases of modified nucleotides, including chemically modified nucleic acids and/or nucleotides. Some embodiments exclude the use of DNase.

在一些實施例中,RNA轉錄物藉由酶促加端帽來加端帽。在一些實施例中,RNA包含5'末端端帽,例如7mG(5')ppp(5')NlmpNp。 化學合成 In some embodiments, RNA transcripts are end-capped by enzymatic end-capping. In some embodiments, the RNA comprises a 5' end cap, eg, 7mG(5')ppp(5')NlmpNp. chemical synthesis

固相化學合成。本揭示案之核酸可使用固相技術全部或部分地製造。核酸之固相化學合成係一種自動化方法,其中分子固定於固體載體上,且在反應物溶液中逐步合成。固相合成可用於在核酸序列中位點特異性引入化學修飾。Solid-phase chemical synthesis. Nucleic acids of the present disclosure can be produced in whole or in part using solid phase techniques. Solid-phase chemical synthesis of nucleic acids is an automated method in which molecules are immobilized on solid supports and synthesized stepwise in reactant solutions. Solid phase synthesis can be used to site-specifically introduce chemical modifications in nucleic acid sequences.

液相化學合成。藉由順序添加單體結構單元合成本揭示案之核酸可在液相中實施。 Liquid phase chemical synthesis. Synthesis of nucleic acids of the present disclosure by sequential addition of monomeric building blocks can be performed in the liquid phase.

合成方法之組合。上文論述之合成方法各自具有其自身之優點及限制。已嘗試組合該等方法以克服該等限制。該等方法之組合在本揭示案之範圍內。固相或液相化學合成與酶促連接之組合使用提供產生不能藉由單獨化學合成獲得之長鏈核酸之有效方式。 核酸區或亞區之連接 A combination of synthetic methods. The synthetic methods discussed above each have their own advantages and limitations. Attempts have been made to combine these approaches to overcome these limitations. Combinations of these methods are within the scope of this disclosure. The combined use of solid-phase or liquid-phase chemical synthesis and enzymatic ligation provides an efficient way to generate long-chain nucleic acids that cannot be obtained by chemical synthesis alone. Linking nucleic acid regions or subregions

亦可使用藉由連接酶裝配核酸。DNA或RNA連接酶經由磷酸二酯鍵之形成促進聚核苷酸鏈之5'及3'端之分子間連接。核酸(諸如嵌合聚核苷酸及/或環狀核酸)可藉由一或多個區或亞區之連接來製備。DNA片段可藉由連接酶催化之反應連接,以產生具有不同功能之重組DNA。兩個寡去氧核苷酸(一個具有5'磷醯基,且另一個具有游離之3'羥基)用作DNA連接酶之受質。 純化 Assembly of nucleic acids by ligases can also be used. DNA or RNA ligase facilitates the intermolecular ligation of the 5' and 3' ends of polynucleotide chains through the formation of phosphodiester bonds. Nucleic acids, such as chimeric polynucleotides and/or circular nucleic acids, can be prepared by linking one or more regions or subregions. DNA fragments can be joined by ligase-catalyzed reactions to generate recombinant DNA with different functions. Two oligodeoxynucleotides (one with a 5' phosphonium group and the other with a free 3' hydroxyl group) were used as substrates for DNA ligase. purification

本文所述之核酸之純化可包括但不限於核酸淨化、品質保證及品質控制。淨化可藉由業內已知之方法實施,該等方法例如但不限於AGENCOURT®珠粒(Beckman Coulter Genomics, Danvers, MA)、聚-T珠粒、LNATM寡-T捕獲探針(EXIQON®公司, Vedbaek, Denmark)或基於HPLC之純化方法,諸如但不限於强陰離子交換HPLC、弱陰離子交換HPLC、反相HPLC (RP-HPLC)及疏水相互作用HPLC (HIC-HPLC)。術語「純化之」當關於核酸使用時,諸如「純化之核酸」,係指與至少一種污染物分離之核酸。「污染物」係使另一種物質不適合、不純或劣質之任何物質。因此,純化之核酸(例如,DNA及RNA)以不同於其在自然界中發現之形式或環境存在,或以不同於其在進行處理或純化方法之前存在之形式或環境存在。Purification of nucleic acids described herein can include, but is not limited to, nucleic acid purification, quality assurance, and quality control. Purification can be performed by methods known in the art such as, but not limited to, AGENCOURT® beads (Beckman Coulter Genomics, Danvers, MA), poly-T beads, LNATM oligo-T capture probes (EXIQON® Corporation, Vedbaek , Denmark) or HPLC-based purification methods such as but not limited to strong anion exchange HPLC, weak anion exchange HPLC, reverse phase HPLC (RP-HPLC) and hydrophobic interaction HPLC (HIC-HPLC). The term "purified" when used in reference to nucleic acid, such as "purified nucleic acid," refers to nucleic acid that has been separated from at least one contaminant. A "contaminant" is any substance that renders another substance unsuitable, impure or inferior. Thus, purified nucleic acids (eg, DNA and RNA) exist in a form or environment different from the form or environment in which they are found in nature, or in a form or environment different from the form or environment in which they existed prior to processing or purification methods.

品質保證及/或品質控制檢查可使用諸如但不限於凝膠電泳、UV吸光度或分析型HPLC之方法執行。Quality assurance and/or quality control checks can be performed using methods such as, but not limited to, gel electrophoresis, UV absorbance, or analytical HPLC.

在一些實施例中,核酸可藉由包括但不限於反轉錄酶-PCR之方法測序。 定量 In some embodiments, nucleic acids can be sequenced by methods including, but not limited to, reverse transcriptase-PCR. Quantitative

在一些實施例中,本揭示案之核酸可在外來體中或當衍生自一或多種體液時定量。體液包括外周血、血清、血漿、腹水、尿液、腦脊髓液(CSF)、痰、唾液、骨髓、滑液、房水、羊水、耳垢、母乳、支氣管肺泡灌洗液、精液、前列腺液、考珀液(cowper's fluid)或預射精液、汗液、排泄物、毛髮、淚液、囊腫液、胸膜液及腹膜液、心包液、淋巴、食糜、乳糜、膽汁、間質液、經血、膿液、皮脂、嘔吐物、陰道分泌物、黏膜分泌物、糞便水、胰液、竇腔灌洗液、支氣管肺抽吸物、囊胚腔液及臍帶血。或者,外來體可自選自由以下組成之群之器官回收:肺、心臟、胰臟、胃、腸、膀胱、腎、卵巢、睪丸、皮膚、結腸、乳房、前列腺、腦、食管、肝及胎盤。In some embodiments, nucleic acids of the present disclosure can be quantified in exosomes or when derived from one or more body fluids. Body fluids include peripheral blood, serum, plasma, ascites, urine, cerebrospinal fluid (CSF), sputum, saliva, bone marrow, synovial fluid, aqueous humor, amniotic fluid, ear wax, breast milk, bronchoalveolar lavage fluid, semen, prostatic fluid, Cowper's fluid or pre-ejaculate fluid, sweat, excrement, hair, tears, cystic fluid, pleural and peritoneal fluid, pericardial fluid, lymph, chyle, chyle, bile, interstitial fluid, menstrual blood, pus , sebum, vomit, vaginal secretions, mucosal secretions, fecal water, pancreatic juice, sinus lavage fluid, bronchopulmonary aspirates, blastocyst fluid and umbilical cord blood. Alternatively, the exosomes can be recovered from an organ selected from the group consisting of lung, heart, pancreas, stomach, intestine, bladder, kidney, ovary, testis, skin, colon, breast, prostate, brain, esophagus, liver and placenta.

可使用構築體特異性探針、細胞計量術、qRT-PCR、即時PCR、PCR、流式細胞術、電泳、質譜或其組合來實施分析,而可使用免疫組織化學方法(諸如酶聯免疫吸附分析(ELISA))方法分離外來體。外來體亦可藉由粒徑篩析層析、密度梯度離心、差速離心、奈米膜超濾、免疫吸附捕獲、親和純化、微流體分離或其組合來分離。Analysis can be performed using construct-specific probes, cytometry, qRT-PCR, real-time PCR, PCR, flow cytometry, electrophoresis, mass spectrometry, or a combination thereof, while immunohistochemical methods such as enzyme-linked immunosorbent assay can be used. assay (ELISA)) method to isolate exosomes. Exosomes can also be isolated by particle size chromatography, density gradient centrifugation, differential centrifugation, nanomembrane ultrafiltration, immunosorbent capture, affinity purification, microfluidic separation, or a combination thereof.

該等方法使研究者能够實時監測剩餘或遞送之核酸水準。由於在一些實施例中,本揭示案之核酸由於結構或化學修飾而不同於內源形式,故此係可能的。These methods allow researchers to monitor the level of nucleic acid remaining or delivered in real time. This is possible because, in some embodiments, the nucleic acids of the present disclosure differ from the endogenous form due to structural or chemical modifications.

在一些實施例中,核酸可使用諸如但不限於紫外可見光譜(UV/Vis)之方法來定量。UV/Vis光譜儀之非限制性實例係NANODROP®光譜儀(ThermoFisher, Waltham, MA)。可分析定量之核酸,以確定核酸是否具有正確大小、檢查核酸未發生降解。核酸之降解可藉由以下方法檢查,諸如但不限於瓊脂糖凝膠電泳;基於HPLC之純化方法,諸如但不限於强陰離子交換HPLC、弱陰離子交換HPLC、反相HPLC (RP-HPLC)及疏水相互作用HPLC (HIC-HPLC);液相層析-質譜(LCMS)、毛細管電泳(CE)及毛細管凝膠電泳(CGE)。 脂質奈米顆粒 (LNP) In some embodiments, nucleic acids can be quantified using methods such as, but not limited to, ultraviolet visible spectroscopy (UV/Vis). A non-limiting example of a UV/Vis spectrometer is the NANODROP® spectrometer (ThermoFisher, Waltham, MA). The quantified nucleic acid can be analyzed to determine if the nucleic acid is of the correct size and to check that the nucleic acid has not been degraded. Degradation of nucleic acids can be checked by methods such as but not limited to agarose gel electrophoresis; HPLC based purification methods such as but not limited to strong anion exchange HPLC, weak anion exchange HPLC, reverse phase HPLC (RP-HPLC) and hydrophobic Interaction HPLC (HIC-HPLC); Liquid Chromatography-Mass Spectrometry (LCMS), Capillary Electrophoresis (CE) and Capillary Gel Electrophoresis (CGE). Lipid Nanoparticles (LNPs)

在一些實施例中,本揭示案之mRNA調配於脂質奈米顆粒(LNP)中。脂質奈米顆粒通常包含可電離之陽離子脂質、非陽離子脂質、固醇及PEG脂質組分以及相關核酸貨物。本揭示案之脂質奈米顆粒可使用業內通常已知之組分、組合物及方法產生,例如,參見PCT/US2016/052352;PCT/US2016/068300;PCT/US2017/037551;PCT/US2015/027400;PCT/US2016/047406;PCT/US2016000129;PCT/US2016/014280;PCT/US2016/014280;PCT/US2017/038426;PCT/US2014/027077;PCT/US2014/055394;PCT/US2016/52117;PCT/US2012/069610;PCT/US2017/027492;PCT/US2016/059575及PCT/US2016/069491,所有該等專利之全文皆以引用方式併入本文中。In some embodiments, the mRNAs of the present disclosure are formulated in lipid nanoparticles (LNPs). Lipid nanoparticles typically contain ionizable cationic lipids, non-cationic lipids, sterol and PEG lipid components, and related nucleic acid cargoes. The lipid nanoparticles of the present disclosure can be produced using components, compositions and methods generally known in the art, eg, see PCT/US2016/052352; PCT/US2016/068300; PCT/US2017/037551; PCT/US2015/027400; PCT/US2016/047406; PCT/US2016000129; PCT/US2016/014280; PCT/US2016/014280; PCT/US2017/038426; 069610; PCT/US2017/027492; PCT/US2016/059575 and PCT/US2016/069491, all of which are incorporated herein by reference in their entirety.

本揭示案之疫苗通常調配於脂質奈米顆粒中。在一些實施例中,脂質奈米顆粒包含至少一種可電離之陽離子脂質、至少一種非陽離子脂質、至少一種固醇及/或至少一種聚乙二醇(PEG)改質之脂質。The vaccines of the present disclosure are typically formulated in lipid nanoparticles. In some embodiments, the lipid nanoparticles comprise at least one ionizable cationic lipid, at least one non-cationic lipid, at least one sterol, and/or at least one polyethylene glycol (PEG)-modified lipid.

在一些實施例中,脂質奈米顆粒包含40-50 mol%可電離之脂質,視情況地45-50 mol%,例如45-46 mol%、46-47 mol%、47-48 mol%、48-49 mol%或49-50 mol%,例如約45 mol%、45.5 mol%、46 mol%、46.5 mol%、47 mol%、47.5 mol%、48 mol%、48.5 mol%、49 mol%或49.5 mol%。In some embodiments, the lipid nanoparticle comprises 40-50 mol% ionizable lipid, optionally 45-50 mol%, eg, 45-46 mol%, 46-47 mol%, 47-48 mol%, 48 -49 mol% or 49-50 mol%, such as about 45 mol%, 45.5 mol%, 46 mol%, 46.5 mol%, 47 mol%, 47.5 mol%, 48 mol%, 48.5 mol%, 49 mol% or 49.5 mol%.

在一些實施例中,脂質奈米顆粒包含30-45 mol%固醇,視情況地35-40 mol%,例如30-31 mol%、31-32 mol%、32-33 mol%、33-34 mol%、35-35 mol%、35-36 mol%、36-37 mol%、38-38 mol%、38-39 mol%或39-40 mol%。In some embodiments, the lipid nanoparticle comprises 30-45 mol% sterol, optionally 35-40 mol%, eg, 30-31 mol%, 31-32 mol%, 32-33 mol%, 33-34 mol% mol%, 35-35 mol%, 35-36 mol%, 36-37 mol%, 38-38 mol%, 38-39 mol%, or 39-40 mol%.

在一些實施例中,脂質奈米顆粒包含5-15 mol%輔助脂質,視情況地10-12 mol%,例如5-6 mol%、6-7 mol%、7-8 mol%、8-9 mol%、9-10 mol%、10-11 mol%、11-12 mol%、12-13 mol%、13-14 mol%或14-15 mol%。In some embodiments, the lipid nanoparticle comprises 5-15 mol% helper lipid, optionally 10-12 mol%, eg, 5-6 mol%, 6-7 mol%, 7-8 mol%, 8-9 mol% mol%, 9-10 mol%, 10-11 mol%, 11-12 mol%, 12-13 mol%, 13-14 mol% or 14-15 mol%.

在一些實施例中,脂質奈米顆粒包含1-5% PEG脂質,視情況地1-3 mol%,例如1.5-2.5 mol%、1-2 mol%、2-3 mol%、3-4 mol%或4-5 mol%。In some embodiments, lipid nanoparticles comprise 1-5% PEG lipid, optionally 1-3 mol%, eg, 1.5-2.5 mol%, 1-2 mol%, 2-3 mol%, 3-4 mol% % or 4-5 mol%.

在一些實施例中,脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質。舉例而言,脂質奈米顆粒可包含20-50 mol%、20-40 mol%、20-30 mol%、30-60 mol%、30-50 mol%、30-40 mol%、40-60 mol%、40-50 mol%或50-60 mol%可電離之陽離子脂質。在一些實施例中,脂質奈米顆粒包含20 mol%、30 mol%、40 mol%、50 mol%或60 mol%可電離之陽離子脂質。在一些實施例中,脂質奈米顆粒包含 35 mol%、36 mol%、37 mol%、38 mol%、39 mol%、40 mol%、41 mol%、42 mol%、43 mol%、44 mol%、45 mol% %、46 mol%、47 mol%、48 mol%、49 mol%、50 mol%、51 mol%、52 mol%、53 mol%、54 mol%或55 mol%可電離之陽離子脂質。In some embodiments, the lipid nanoparticles comprise 20-60 mol% ionizable cationic lipids. For example, lipid nanoparticles can comprise 20-50 mol%, 20-40 mol%, 20-30 mol%, 30-60 mol%, 30-50 mol%, 30-40 mol%, 40-60 mol% %, 40-50 mol% or 50-60 mol% ionizable cationic lipids. In some embodiments, the lipid nanoparticles comprise 20 mol %, 30 mol %, 40 mol %, 50 mol %, or 60 mol % of ionizable cationic lipids. In some embodiments, the lipid nanoparticles comprise 35 mol%, 36 mol%, 37 mol%, 38 mol%, 39 mol%, 40 mol%, 41 mol%, 42 mol%, 43 mol%, 44 mol% , 45 mol%, 46 mol%, 47 mol%, 48 mol%, 49 mol%, 50 mol%, 51 mol%, 52 mol%, 53 mol%, 54 mol%, or 55 mol% of ionizable cationic lipids .

在一些實施例中,脂質奈米顆粒包含5-25 mol%非陽離子脂質。舉例而言,脂質奈米顆粒可包含 5-20 mol%、5-15 mol%、5-10 mol%、10-25 mol%、10-20 mol%、10-25 mol%、15-25 mol%、15-20 mol% 或 20-25 mol%非陽離子脂質。在一些實施例中,脂質奈米顆粒包含5 mol%、10 mol%、15 mol%、20 mol%或25 mol%非陽離子脂質。In some embodiments, the lipid nanoparticles comprise 5-25 mol% non-cationic lipids. For example, lipid nanoparticles can comprise 5-20 mol%, 5-15 mol%, 5-10 mol%, 10-25 mol%, 10-20 mol%, 10-25 mol%, 15-25 mol% %, 15-20 mol% or 20-25 mol% non-cationic lipids. In some embodiments, the lipid nanoparticle comprises 5 mol%, 10 mol%, 15 mol%, 20 mol%, or 25 mol% non-cationic lipid.

在一些實施例中,脂質奈米顆粒包含25-55 mol%固醇。舉例而言,脂質奈米顆粒可包含25-50 mol%、25-45 mol%、25-40 mol%、25-35 mol%、25-30 mol%、30-55 mol%、30-50 mol%、30-45 mol%、30-40 mol%、30-35 mol%、35-55 mol%、35-50 mol%、35-45 mol%、35-40 mol%、40-55 mol%、40-50 mol%、40-45 mol%、45-55 mol%、45-50 mol%或50-55 mol%固醇。在一些實施例中,脂質奈米顆粒包含25 mol%、30 mol%、35 mol%、40 mol%、45 mol%、50 mol%或55 mol%固醇。In some embodiments, the lipid nanoparticles comprise 25-55 mol% sterols. For example, lipid nanoparticles can comprise 25-50 mol%, 25-45 mol%, 25-40 mol%, 25-35 mol%, 25-30 mol%, 30-55 mol%, 30-50 mol% %, 30-45 mol%, 30-40 mol%, 30-35 mol%, 35-55 mol%, 35-50 mol%, 35-45 mol%, 35-40 mol%, 40-55 mol%, 40-50 mol%, 40-45 mol%, 45-55 mol%, 45-50 mol% or 50-55 mol% sterol. In some embodiments, the lipid nanoparticles comprise 25 mol%, 30 mol%, 35 mol%, 40 mol%, 45 mol%, 50 mol%, or 55 mol% sterols.

在一些實施例中,脂質奈米顆粒包含0.5-15 mol% PEG改質之脂質。舉例而言,脂質奈米顆粒可包含0.5-10 mol%、0.5-5 mol%、1-15 mol%、1-10 mol%、1-5 mol%、2-15 mol%、2-10 mol%、2-5 mol%、5-15 mol%、5-10 mol%或10-15 mol%。在一些實施例中,脂質奈米顆粒包含0.5 mol%、1 mol%、2 mol%、3 mol%、4 mol%、5 mol%、6 mol%、7 mol%、8 mol%、9 mol%、10 mol% %、11 mol%、12 mol%、13 mol%、14 mol%或15 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise 0.5-15 mol% PEG-modified lipids. For example, lipid nanoparticles can comprise 0.5-10 mol%, 0.5-5 mol%, 1-15 mol%, 1-10 mol%, 1-5 mol%, 2-15 mol%, 2-10 mol% %, 2-5 mol%, 5-15 mol%, 5-10 mol% or 10-15 mol%. In some embodiments, the lipid nanoparticles comprise 0.5 mol%, 1 mol%, 2 mol%, 3 mol%, 4 mol%, 5 mol%, 6 mol%, 7 mol%, 8 mol%, 9 mol% , 10 mol%, 11 mol%, 12 mol%, 13 mol%, 14 mol% or 15 mol% PEG-modified lipids.

在一些實施例中,脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%非陽離子脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。In some embodiments, the lipid nanoparticles comprise 20-60 mol% ionizable cationic lipids, 5-25 mol% non-cationic lipids, 25-55 mol% sterols, and 0.5-15 mol% PEG-modified lipids.

在一些實施例中,本揭示案之可電離之陽離子脂質包含式(I)化合物:

Figure 02_image001
, 或其鹽或異構物,其中: R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由H、C 1-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4選自由C 3-6碳環、-(CH 2) nQ、-(CH 2) nCHQR、-CHQR、-CQ(R) 2及未經取代之C 1-6烷基組成之群,其中Q選自碳環、雜環、-OR、-O(CH 2) nN(R) 2、-C(O)OR、-OC(O)R、-CX 3、-CX 2H、-CXH 2、-CN-、N(R) 2、-C(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)C(O)N(R) 2、-N(R)C(S)N(R) 2、-N(R)R 8、-O(CH 2) nOR、-N(R)C(=NR 9)N(R) 2、-N(R)C(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、-N(OR)C(O)R、-N(OR)S(O) 2R、-N(OR)C(O)OR、-N(OR)C(O)N(R) 2、-N(OR)C(S)N(R) 2、-N(OR)C(=NR 9)N(R) 2、-N(OR)C(=CHR 9)N(R) 2、-C(=NR 9)N(R) 2、-C(=NR 9)R、-C(O)N(R)OR及-C(R)N(R) 2C(O)OR,且每一n獨立地選自1、2、3、4及5; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; R 8選自由C 3-6碳環及雜環組成之群; R 9選自由H、CN、NO 2、C 1-6烷基、-OR、-S(O) 2R、-S(O) 2N(R) 2、C 2-6烯基、C 3-6碳環及雜環組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 2-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13。 In some embodiments, the ionizable cationic lipids of the present disclosure comprise a compound of formula (I):
Figure 02_image001
, or a salt or isomer thereof, wherein: R 1 is selected from the group consisting of C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', -YR'' and -R''M'R' the group; R 2 and R 3 are independently selected from the group consisting of H, C 1-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR'' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocycle or carbocycle; R 4 is selected from C 3-6 carbocycle, -(CH 2 ) n Q, -(CH 2 ) n CHQR, -CHQR, The group consisting of -CQ(R) 2 and unsubstituted C 1-6 alkyl, wherein Q is selected from carbocycle, heterocycle, -OR, -O(CH 2 ) n N(R) 2 , -C( O)OR, -OC(O) R , -CX3, -CX2H, -CXH2 , -CN-, N(R) 2 , -C(O)N(R )2 , -N(R) C(O)R, -N(R)S(O) 2 R, -N(R)C(O)N(R) 2 , -N(R)C(S)N(R) 2 , -N (R)R 8 , -O(CH 2 ) n OR, -N(R)C(=NR 9 )N(R) 2 , -N(R)C(=CHR 9 )N(R) 2 , - OC(O)N(R) 2 , -N(R)C(O)OR, -N(OR)C(O)R, -N(OR)S(O) 2 R, -N(OR)C (O)OR, -N(OR)C(O)N(R) 2 , -N(OR)C(S)N(R) 2 , -N(OR)C(=NR 9 )N(R) 2 , -N(OR)C(=CHR 9 )N(R) 2 , -C(=NR 9 )N(R) 2 , -C(=NR 9 )R, -C(O)N(R) OR and -C(R)N(R) 2C (O)OR, and each n is independently selected from 1, 2, 3, 4, and 5 ; each R5 is independently selected from C1-3 alkyl , C 2-3 alkenyl and the group consisting of H; each R 6 is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; M and M' are independently selected from -C (O)O-, -OC(O)-, -C(O)N(R')-, -N(R')C(O)-, -C(O)-, -C(S)- , -C(S)S-, -SC(S)-, -CH(OH)-, -P(O)(OR')O-, -S(O) 2 -, -SS-, aryl and Heteroaryl; R 7 is selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; R 8 is selected from the group consisting of C 3-6 carbocycle and heterocycle; R 9 is selected from H, CN, NO 2 , C 1-6 alkyl, -OR, -S(O) 2 R, -S(O) 2 N(R) 2 , C 2-6 alkenyl, C 3-6 carbocycle and The group consisting of heterocycles; each R is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; each R' is independently selected from C 1-18 alkyl, C 2- 18 The group consisting of alkenyl, -R*YR'', -YR'' and H; each R'' is independently selected from the group consisting of C 3-14 alkyl and C 3-14 alkenyl; each R *is independently selected from the group consisting of C1-12 alkyl and C2-12 alkenyl; each Y is independently a C3-6 carbocycle; each X is independently selected from F, Cl, Br, and I and m is selected from 5, 6, 7, 8, 9, 10, 11, 12 and 13.

在一些實施例中,式(I)化合物之亞組包括以下之彼等化合物:其中當R 4係-(CH 2) nQ、-(CH 2) nCHQR、-CHQR或-CQ(R) 2時,則(i) 當n為1、2、3、4或5時,Q不為-N(R) 2,或(ii) 當n為1或2時,Q不為5員、6員或7員雜環烷基。 In some embodiments, the subgroup of compounds of formula (I) includes those compounds wherein when R4 is - ( CH2 )nQ, -( CH2 ) nCHQR , -CHQR or -CQ(R) 2 , then (i) when n is 1, 2, 3, 4, or 5, Q is not -N(R) 2 , or (ii) when n is 1 or 2, Q is not 5-membered, 6-membered or 7-membered heterocycloalkyl.

在一些實施例中,式(I)化合物之另一亞組包括以下之彼等化合物:其中 R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由H、C 1-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4選自由C 3-6碳環、-(CH 2) nQ、-(CH 2) nCHQR、-CHQR、-CQ(R) 2及未經取代之C 1-6烷基組成之群,其中Q選自C 3-6碳環、具有一或多個選自N、O及S之雜原子之5至14員雜芳基、-OR、-O(CH 2) nN(R) 2、-C(O)OR、-OC(O)R、-CX 3、-CX 2H、-CXH 2、-CN、-C(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)C(O)N(R) 2、-N(R)C(S)N(R) 2、-CRN(R) 2C(O)OR、-N(R)R 8、-O(CH 2) nOR、-N(R)C(=NR 9)N(R) 2、-N(R)C(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、-N(OR)C(O)R、-N(OR)S(O) 2R、-N(OR)C(O)OR、-N(OR)C(O)N(R) 2、-N(OR)C(S)N(R) 2、-N(OR)C(=NR 9)N(R) 2、-N(OR)C(=CHR 9)N(R) 2、-C(=NR 9)N(R) 2、-C(=NR 9)R、-C(O)N(R)OR及具有一或多個選自N、O及S之雜原子之5至14員雜環烷基,經一或多個選自以下之取代基取代:側氧基(=O)、OH、胺基、單烷基胺基或二烷基胺基及C 1-3烷基,且每一n獨立地選自1、2、3、4及5; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; R 8選自由C 3-6碳環及雜環組成之群; R 9選自由H、CN、NO 2、C 1-6烷基、-OR、-S(O) 2R、-S(O) 2N(R) 2、C 2-6烯基、C 3-6碳環及雜環組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 2-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13, 或其鹽或異構物。 In some embodiments, another subset of compounds of formula (I) includes those compounds wherein R 1 is selected from C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', - The group consisting of YR'' and -R''M'R'; R 2 and R 3 are independently selected from H, C 1-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR '' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocycle or carbocycle; R 4 is selected from C 3-6 carbocycle, -(CH 2 ) n The group consisting of Q, -(CH 2 ) n CHQR, -CHQR, -CQ(R) 2 and unsubstituted C 1-6 alkyl, wherein Q is selected from C 3-6 carbocyclic ring, having one or more 5- to 14-membered heteroaryl selected from heteroatoms of N, O and S, -OR, -O( CH2 ) nN (R) 2 , -C(O)OR, -OC(O)R, - CX 3 , -CX 2 H, -CXH 2 , -CN, -C(O)N(R) 2 , -N(R)C(O)R, -N(R)S(O) 2 R, - N(R)C(O)N(R) 2 , -N(R)C(S)N(R) 2 , -CRN(R) 2 C(O)OR, -N(R)R 8 , - O(CH 2 ) n OR, -N(R)C(=NR 9 )N(R) 2 , -N(R)C(=CHR 9 )N(R) 2 , -OC(O)N(R ) 2 , -N(R)C(O)OR, -N(OR)C(O)R, -N(OR)S(O) 2 R, -N(OR)C(O)OR, -N (OR)C(O)N(R) 2 , -N(OR)C(S)N(R) 2 , -N(OR)C(=NR 9 )N(R) 2 , -N(OR) C(=CHR 9 )N(R) 2 , -C(=NR 9 )N(R) 2 , -C(=NR 9 )R, -C(O)N(R)OR and having one or more 5- to 14-membered heterocycloalkyl of heteroatoms selected from N, O and S, substituted with one or more substituents selected from: pendant oxy (=O), OH, amine, monoalkylamine group or dialkylamine group and C 1-3 alkyl group, and each n is independently selected from 1, 2, 3, 4 and 5; each R 5 is independently selected from C 1-3 alkyl group, C 2 The group consisting of -3 alkenyl and H ; each R is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; M and M' are independently selected from -C(O) O-, -OC(O)-, -C(O)N(R')-, -N(R')C(O)-, -C(O)-, -C(S)-, -C (S)S-, -SC(S)-, -CH(OH)-, -P(O)(OR')O-, -S(O) 2- , -SS-, aryl and heteroaryl ; R 7 is chosen from C The group consisting of 1-3 alkyl, C 2-3 alkenyl and H; R 8 is selected from the group consisting of C 3-6 carbocycle and heterocycle; R 9 is selected from H, CN, NO 2 , C 1-6 The group consisting of alkyl, -OR, -S(O) 2 R, -S(O) 2 N(R) 2 , C 2-6 alkenyl, C 3-6 carbocycle and heterocycle; each R independently is selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl, and H; each R' is independently selected from C 1-18 alkyl, C 2-18 alkenyl, -R*YR'' , -YR'' and H; each R'' is independently selected from the group consisting of C 3-14 alkyl and C 3-14 alkenyl; each R* is independently selected from C 1-12 alkane each Y is independently a C3-6 carbocycle; each X is independently selected from the group consisting of F, Cl, Br, and I; and m is selected from 5, 6 , 7, 8, 9, 10, 11, 12 and 13, or salts or isomers thereof.

在一些實施例中,式(I)化合物之另一亞組包括以下之彼等化合物:其中 R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由H、C 1-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4選自由C 3-6碳環、-(CH 2) nQ、-(CH 2) nCHQR、-CHQR、-CQ(R) 2及未經取代之C 1-6烷基組成之群,其中Q選自C 3-6碳環、具有一或多個選自N、O及S之雜原子之5至14員雜環、-OR、-O(CH 2) nN(R) 2、-C(O)OR、-OC(O)R、-CX 3、-CX 2H、-CXH 2、-CN、-C(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)C(O)N(R) 2、-N(R)C(S)N(R) 2、-CRN(R) 2C(O)OR、-N(R)R 8、-O(CH 2) nOR、-N(R)C(=NR 9)N(R) 2、-N(R)C(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、-N(OR)C(O)R、-N(OR)S(O) 2R、-N(OR)C(O)OR、-N(OR)C(O)N(R) 2、-N(OR)C(S)N(R) 2、-N(OR)C(=NR 9)N(R) 2、-N(OR)C(=CHR 9)N(R) 2、-C(=NR 9)R、-C(O)N(R)OR及-C(=NR 9)N(R) 2,且每一n獨立地選自1、2、3、4及5;且當Q係5至14員雜環且(i) R 4係-(CH 2) nQ,其中n為1或2,或(ii) R 4係-(CH 2) nCHQR,其中n為1,或(iii) R 4係-CHQR及-CQ(R) 2時,則Q係5至14員雜芳基或8至14員雜環烷基; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; R 8選自由C 3-6碳環及雜環組成之群; R 9選自由H、CN、NO 2、C 1-6烷基、-OR、-S(O) 2R、-S(O) 2N(R) 2、C 2-6烯基、C 3-6碳環及雜環組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 2-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13, 或其鹽或異構物。 In some embodiments, another subgroup of compounds of formula (I) includes those compounds wherein R 1 is selected from C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', - The group consisting of YR'' and -R''M'R'; R 2 and R 3 are independently selected from H, C 1-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR '' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocycle or carbocycle; R 4 is selected from C 3-6 carbocycle, -(CH 2 ) n The group consisting of Q, -(CH 2 ) n CHQR, -CHQR, -CQ(R) 2 and unsubstituted C 1-6 alkyl, wherein Q is selected from C 3-6 carbocyclic ring, having one or more 5- to 14-membered heterocycle of heteroatoms selected from N, O and S, -OR, -O( CH2 ) nN (R) 2 , -C(O)OR, -OC(O)R, -CX 3 , -CX 2 H, -CXH 2 , -CN, -C(O)N(R) 2 , -N(R)C(O)R, -N(R)S(O) 2 R, -N (R)C(O)N(R) 2 , -N(R)C(S)N(R) 2 , -CRN(R) 2 C(O)OR, -N(R)R 8 , -O (CH 2 ) n OR, -N(R)C(=NR 9 )N(R) 2 , -N(R)C(=CHR 9 )N(R) 2 , -OC(O)N(R) 2 , -N(R)C(O)OR, -N(OR)C(O)R, -N(OR)S(O) 2 R, -N(OR)C(O)OR, -N( OR)C(O)N(R) 2 , -N(OR)C(S)N(R) 2 , -N(OR)C(=NR 9 )N(R) 2 , -N(OR)C (=CHR 9 )N(R) 2 , -C(=NR 9 )R, -C(O)N(R)OR, and -C(=NR 9 )N(R) 2 , and each n independently is selected from 1, 2, 3, 4, and 5; and when Q is a 5- to 14-membered heterocycle and (i) R4 is - ( CH2 )nQ, wherein n is 1 or 2 , or (ii) R4 is -(CH 2 ) n CHQR, wherein n is 1, or (iii) R 4 is -CHQR and -CQ(R) 2 , then Q is a 5- to 14-membered heteroaryl or an 8- to 14-membered heterocycloalkane each R 5 is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; each R 6 is independently selected from C 1-3 alkyl, C 2-3 alkenyl and the group consisting of H; M and M' are independently selected from -C(O)O-, -OC(O)-, -C(O)N(R')-, -N(R')C(O )-, -C(O)-, -C(S)-, -C(S)S-, -SC(S)-, -CH(OH)-, -P(O)(OR')O-, -S(O) 2 -, -SS-, aryl and heteroaryl; R 7 is selected from C 1-3 alkyl, C 2-3 alkenyl and H The group consisting of; R 8 is selected from the group consisting of C 3-6 carbocycles and heterocycles; R 9 is selected from H, CN, NO 2 , C 1-6 alkyl, -OR, -S(O) 2 R, -S(O) 2 N(R) 2 , C 2-6 alkenyl, C 3-6 carbocycle and heterocycle; each R is independently selected from C 1-3 alkyl, C 2-3 the group consisting of alkenyl and H; each R' is independently selected from the group consisting of C 1-18 alkyl, C 2-18 alkenyl, -R*YR'', -YR'', and H; each R '' is independently selected from the group consisting of C 3-14 alkyl and C 3-14 alkenyl; each R* is independently selected from the group consisting of C 1-12 alkyl and C 2-12 alkenyl; each Y is independently a C3-6 carbocycle; each X is independently selected from the group consisting of F, Cl, Br, and I; and m is selected from 5, 6, 7, 8, 9, 10, 11, 12, and 13 , or its salts or isomers.

在一些實施例中,式(I)化合物之另一亞組包括以下之彼等化合物:其中 R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由H、C 1-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4選自由C 3-6碳環、-(CH 2) nQ、-(CH 2) nCHQR、-CHQR、-CQ(R) 2及未經取代之C 1-6烷基組成之群,其中Q選自C 3-6碳環、具有一或多個選自N、O及S之雜原子5至14員雜芳基、-OR、-O(CH 2) nN(R) 2、-C(O)OR、-OC(O)R、-CX 3、-CX 2H、-CXH 2、-CN、-C(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)C(O)N(R) 2、-N(R)C(S)N(R) 2、-CRN(R) 2C(O)OR、-N(R)R 8、-O(CH 2) nOR、-N(R)C(=NR 9)N(R) 2、-N(R)C(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、-N(OR)C(O)R、-N(OR)S(O) 2R、-N(OR)C(O)OR、-N(OR)C(O)N(R) 2、-N(OR)C(S)N(R) 2、-N(OR)C(=NR 9)N(R) 2、-N(OR)C(=CHR 9)N(R) 2、-C(=NR 9)R、-C(O)N(R)OR及-C(=NR 9)N(R) 2,且每一n獨立地選自1、2、3、4及5; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; R 8選自由C 3-6碳環及雜環組成之群; R 9選自由H、CN、NO 2、C 1-6烷基、-OR、-S(O) 2R、-S(O) 2N(R) 2、C 2-6烯基、C 3-6碳環及雜環組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 2-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13, 或其鹽或異構物。 In some embodiments, another subgroup of compounds of formula (I) includes those compounds wherein R 1 is selected from C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', - The group consisting of YR'' and -R''M'R'; R 2 and R 3 are independently selected from H, C 1-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR '' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocycle or carbocycle; R 4 is selected from C 3-6 carbocycle, -(CH 2 ) n The group consisting of Q, -(CH 2 ) n CHQR, -CHQR, -CQ(R) 2 and unsubstituted C 1-6 alkyl, wherein Q is selected from C 3-6 carbocyclic ring, having one or more Heteroatom 5- to 14-membered heteroaryl selected from N, O and S, -OR, -O(CH 2 ) n N(R) 2 , -C(O)OR, -OC(O)R, -CX 3 , -CX 2 H, -CXH 2 , -CN, -C(O)N(R) 2 , -N(R)C(O)R, -N(R)S(O) 2 R, -N (R)C(O)N(R) 2 , -N(R)C(S)N(R) 2 , -CRN(R) 2 C(O)OR, -N(R)R 8 , -O (CH 2 ) n OR, -N(R)C(=NR 9 )N(R) 2 , -N(R)C(=CHR 9 )N(R) 2 , -OC(O)N(R) 2 , -N(R)C(O)OR, -N(OR)C(O)R, -N(OR)S(O) 2 R, -N(OR)C(O)OR, -N( OR)C(O)N(R) 2 , -N(OR)C(S)N(R) 2 , -N(OR)C(=NR 9 )N(R) 2 , -N(OR)C (=CHR 9 )N(R) 2 , -C(=NR 9 )R, -C(O)N(R)OR, and -C(=NR 9 )N(R) 2 , and each n independently selected from 1, 2, 3, 4, and 5 ; each R is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl, and H ; each R is independently selected from C 1- The group consisting of 3 alkyl, C 2-3 alkenyl and H; M and M' are independently selected from -C(O)O-, -OC(O)-, -C(O)N(R')- , -N(R')C(O)-, -C(O)-, -C(S)-, -C(S)S-, -SC(S)-, -CH(OH)-, - P(O)(OR')O-, -S(O) 2 -, -SS-, aryl and heteroaryl; R 7 is selected from C 1-3 alkyl, C 2-3 alkenyl and H group; R 8 is selected from the group consisting of C 3-6 carbocycle and heterocycle; R 9 is selected from H, CN, NO 2 , C 1-6 alkyl, -OR, -S(O) 2 R, -S(O) 2 N(R) 2 , the group consisting of C 2-6 alkenyl, C 3-6 carbocycle and heterocycle; each R is independently selected from C 1-3 alkyl, C The group consisting of 2-3 alkenyl and H; each R' is independently selected from the group consisting of C 1-18 alkyl, C 2-18 alkenyl, -R*YR'', -YR'' and H; Each R'' is independently selected from the group consisting of C 3-14 alkyl and C 3-14 alkenyl; each R* is independently selected from the group consisting of C 1-12 alkyl and C 2-12 alkenyl each Y is independently a C3-6 carbocycle; each X is independently selected from the group consisting of F, Cl, Br, and I; and m is selected from 5, 6, 7, 8, 9, 10, 11, 12 and 13, or their salts or isomers.

在一些實施例中,式(I)化合物之另一亞組包括以下之彼等化合物:其中 R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由H、C 2-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4係-(CH 2) nQ或-(CH 2) nCHQR,其中Q係-N(R) 2,且n選自3、4及5; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 1-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13, 或其鹽或異構物。 In some embodiments, another subgroup of compounds of formula (I) includes those compounds wherein R 1 is selected from C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', - The group consisting of YR'' and -R''M'R'; R 2 and R 3 are independently selected from H, C 2-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR '' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocyclic or carbocyclic ring; R 4 is -(CH 2 ) n Q or -(CH 2 ) n CHQR, wherein Q is -N(R) 2 , and n is selected from 3, 4, and 5; each R 5 is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl, and H; each - R 6 is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; M and M' are independently selected from -C(O)O-, -OC(O)-, - C(O)N(R')-, -N(R')C(O)-, -C(O)-, -C(S)-, -C(S)S-, -SC(S) -, -CH(OH)-, -P(O)(OR')O-, -S(O) 2 -, -SS-, aryl and heteroaryl; R 7 is selected from C 1-3 alkyl , C 2-3 alkenyl and the group consisting of H; each R is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; each R' is independently selected from C 1- The group consisting of 18 alkyl, C 2-18 alkenyl, -R*YR'', -YR'' and H; each R'' is independently selected from C 3-14 alkyl and C 3-14 alkenyl the group consisting of; each R* is independently selected from the group consisting of C 1-12 alkyl and C 1-12 alkenyl; each Y is independently C 3-6 carbocycle; each X is independently selected from F , Cl, Br, and I; and m is selected from 5, 6, 7, 8, 9, 10, 11, 12, and 13, or a salt or isomer thereof.

在一些實施例中,式(I)化合物之另一亞組包括以下之彼等化合物:其中 R 1選自由C 5-30烷基、C 5-20烯基、-R*YR''、-YR''及-R''M'R'組成之群; R 2及R 3獨立地選自由C 1-14烷基、C 2-14烯基、-R*YR''、-YR''及-R*OR''組成之群,或R 2及R 3與其所附接之原子一起形成雜環或碳環; R 4選自由-(CH 2) nQ、-(CH 2) nCHQR、-CHQR及-CQ(R) 2組成之群,其中Q係-N(R) 2,且n選自1、2、3、4及5; 每一R 5獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R 6獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-N(R')C(O)-、-C(O)-、-C(S)-、-C(S)S-、-SC(S)-、-CH(OH)-、-P(O)(OR')O-、-S(O) 2-、-S-S-、芳基及雜芳基; R 7選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R獨立地選自由C 1-3烷基、C 2-3烯基及H組成之群; 每一R'獨立地選自由C 1-18烷基、C 2-18烯基、-R*YR''、-YR''及H組成之群; 每一R''獨立地選自由C 3-14烷基及C 3-14烯基組成之群; 每一R*獨立地選自由C 1-12烷基及C 1-12烯基組成之群; 每一Y獨立地係C 3-6碳環; 每一X獨立地選自由F、Cl、Br及I組成之群;且 m選自5、6、7、8、9、10、11、12及13, 或其鹽或異構物。 In some embodiments, another subgroup of compounds of formula (I) includes those compounds wherein R 1 is selected from C 5-30 alkyl, C 5-20 alkenyl, -R*YR'', - The group consisting of YR'' and -R''M'R'; R 2 and R 3 are independently selected from C 1-14 alkyl, C 2-14 alkenyl, -R*YR'', -YR'' and -R*OR'', or R 2 and R 3 together with the atoms to which they are attached form a heterocyclic or carbocyclic ring; R 4 is selected from -(CH 2 ) n Q, -(CH 2 ) n CHQR The group consisting of , -CHQR and -CQ(R) 2 , wherein Q is -N(R) 2 , and n is selected from 1, 2, 3, 4, and 5; each R 5 is independently selected from C 1-3 The group consisting of alkyl, C 2-3 alkenyl and H; each R 6 is independently selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; M and M' are independently selected from -C(O)O-, -OC(O)-, -C(O)N(R')-, -N(R')C(O)-, -C(O)-, -C(S )-, -C(S)S-, -SC(S)-, -CH(OH)-, -P(O)(OR')O-, -S(O) 2 -, -SS-, aromatic R is selected from the group consisting of C 1-3 alkyl, C 2-3 alkenyl and H; each R is independently selected from C 1-3 alkyl, C 2-3 alkenyl and The group consisting of H; each R' is independently selected from the group consisting of C 1-18 alkyl, C 2-18 alkenyl, -R*YR'', -YR'' and H; each R'' is independently is selected from the group consisting of C 3-14 alkyl and C 3-14 alkenyl; each R* is independently selected from the group consisting of C 1-12 alkyl and C 1-12 alkenyl; each Y is independently is a C3-6 carbocycle; each X is independently selected from the group consisting of F, Cl, Br, and I; and m is selected from 5, 6, 7, 8, 9, 10, 11, 12, and 13, or salt or isomer.

在一些實施例中,式(I)化合物之亞組包括式(IA)化合物:

Figure 02_image003
, 或其鹽或異構物,其中l選自1、2、3、4及5;m選自5、6、7、8及9;M 1係鍵或M';R 4係未經取代之C 1-3烷基,或-(CH 2) nQ,其中Q係OH、-NHC(S)N(R) 2、-NHC(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)R 8、-NHC(=NR 9)N(R) 2、-NHC(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、雜芳基或雜環烷基;M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-P(O)(OR')O-、-S-S-、芳基及雜芳基;且R 2及R 3獨立地選自由H、C 1-14烷基及C 2-14烯基組成之群。 In some embodiments, a subgroup of compounds of formula (I) includes compounds of formula (IA):
Figure 02_image003
, or a salt or isomer thereof, wherein l is selected from 1, 2, 3, 4 and 5; m is selected from 5, 6, 7, 8 and 9; M 1 is a bond or M'; R 4 is unsubstituted C 1-3 alkyl, or -(CH 2 ) n Q, wherein Q is OH, -NHC(S)N(R) 2 , -NHC(O)N(R) 2 , -N(R)C (O)R, -N(R)S(O) 2 R, -N(R)R 8 , -NHC(=NR 9 )N(R) 2 , -NHC(=CHR 9 )N(R) 2 , -OC(O)N(R) 2 , -N(R)C(O)OR, heteroaryl or heterocycloalkyl; M and M' are independently selected from -C(O)O-, -OC (O)-, -C(O)N(R')-, -P(O)(OR')O-, -SS-, aryl and heteroaryl; and R 2 and R 3 are independently selected from The group consisting of H, C 1-14 alkyl and C 2-14 alkenyl.

在一些實施例中,式(I)化合物之亞組包括式(II)化合物:

Figure 02_image005
或其鹽或異構物,其中l選自1、2、3、4及5;M 1係鍵或M';R 4係未經取代之C 1-3烷基,或-(CH 2) nQ,其中n為2、3或4,且Q為OH、-NHC(S)N(R) 2、-NHC(O)N(R) 2、-N(R)C(O)R、-N(R)S(O) 2R、-N(R)R 8、-NHC(=NR 9)N(R) 2、-NHC(=CHR 9)N(R) 2、-OC(O)N(R) 2、-N(R)C(O)OR、雜芳基或雜環烷基;M及M'獨立地選自-C(O)O-、-OC(O)-、-C(O)N(R')-、-P(O)(OR')O-、-S-S-、芳基及雜芳基;且R 2及R 3獨立地選自由H、C 1-14烷基及C 2-14烯基組成之群。 In some embodiments, a subgroup of compounds of formula (I) includes compounds of formula (II):
Figure 02_image005
or a salt or isomer thereof, wherein l is selected from 1, 2, 3, 4 and 5; M 1 is a bond or M'; R 4 is unsubstituted C 1-3 alkyl, or -(CH 2 ) n Q, where n is 2, 3, or 4, and Q is OH, -NHC(S)N(R) 2 , -NHC(O)N(R) 2 , -N(R)C(O)R, -N(R)S(O) 2 R, -N(R)R 8 , -NHC(=NR 9 )N(R) 2 , -NHC(=CHR 9 )N(R) 2 , -OC(O )N(R) 2 , -N(R)C(O)OR, heteroaryl or heterocycloalkyl; M and M' are independently selected from -C(O)O-, -OC(O)-, -C(O)N(R')-, -P(O)(OR')O-, -SS-, aryl and heteroaryl; and R 2 and R 3 are independently selected from H, C 1- A group consisting of 14 alkyl and C 2-14 alkenyl.

在一些實施例中,式(I)化合物之亞組包括式(IIa)、(IIb)、(IIc)或(IIe)之化合物:

Figure 02_image007
Figure 02_image009
, 或其鹽或異構物,其中R 4係如本文所述。 In some embodiments, a subgroup of compounds of formula (I) includes compounds of formula (IIa), (IIb), (IIc), or (IIe):
Figure 02_image007
Figure 02_image009
, or a salt or isomer thereof, wherein R 4 is as described herein.

在一些實施例中,式(I)化合物之亞組包括式(IId)化合物:

Figure 02_image011
, 或其鹽或異構物,其中n為2、3或4;且m、R'、R''及R 2至R 6係如本文所述。舉例而言,R 2及R 3可各自獨立地選自由C 5-14烷基及C 5-14烯基組成之群。 In some embodiments, a subgroup of compounds of formula (I) includes compounds of formula (IId):
Figure 02_image011
, or a salt or isomer thereof, wherein n is 2, 3, or 4; and m, R', R'', and R 2 to R 6 are as described herein. For example, R 2 and R 3 can each be independently selected from the group consisting of C 5-14 alkyl and C 5-14 alkenyl.

在一些實施例中,本揭示案之可電離之陽離子脂質包含具有以下結構之化合物:

Figure 02_image013
(化合物I)。 In some embodiments, the ionizable cationic lipids of the present disclosure comprise compounds having the following structure:
Figure 02_image013
(Compound I).

在一些實施例中,本揭示案之可電離之陽離子脂質包含具有以下結構之化合物:

Figure 02_image015
(化合物II)。 In some embodiments, the ionizable cationic lipids of the present disclosure comprise compounds having the following structure:
Figure 02_image015
(Compound II).

在一些實施例中,本揭示案之非陽離子脂質包含1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC)、1,2-二油醯基-sn-甘油-3-磷酸乙醇胺(DOPE)、1,2-二亞油醯基-sn-甘油-3-磷酸膽鹼(DLPC)、1,2-二肉豆蔻醯基-sn-甘油-磷酸膽鹼(DMPC)、1,2-二油醯基-sn-甘油-3-磷酸膽鹼(DOPC)、1,2-二棕櫚醯基-sn-甘油-3-磷酸膽鹼(DPPC)、1,2-二-十一烷醯基-sn-甘油-磷酸膽鹼(DUPC)、1-棕櫚醯基-2-油醯基-sn-甘油-3-磷酸膽鹼(POPC)、1,2-二-O-十八碳烯基-sn-甘油-3-磷酸膽鹼(18:0二醚PC)、1-油醯基-2 膽固醇基半琥珀醯基-sn-甘油-3-磷酸膽鹼(OChemsPC)、1-十六烷基-sn-甘油-3-磷酸膽鹼(C16 Lyso PC)、1,2-二亞麻醯基-sn-甘油-3-磷酸膽鹼、1,2-二花生四烯醯基-sn-甘油-3-磷酸膽鹼、1,2-二-二十二碳六烯醯基-sn-甘油-3-磷酸膽鹼、1,2-二植烷醯基-sn-甘油-3-磷酸乙醇胺(ME 16.0 PE)、1,2-二硬脂醯基-sn-甘油-3-磷酸乙醇胺、1,2-二亞油醯基-sn-甘油-3-磷酸乙醇胺、1,2-二亞麻醯基-sn-甘油-3-磷酸乙醇胺、1,2-二花生四烯醯基-sn-甘油-3-磷酸乙醇胺、1,2-二-二十二碳六烯醯基-sn-甘油-3-磷酸乙醇胺、1,2-二油醯基-sn-甘油-3-磷酸-外消旋-(1-甘油)鈉鹽(DOPG)、鞘磷脂及其混合物。In some embodiments, the non-cationic lipids of the present disclosure comprise 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC), 1,2-dioleyl-sn-glycero- 3-Phosphoethanolamine (DOPE), 1,2-Dilinoleyl-sn-glycero-3-phosphocholine (DLPC), 1,2-Dimyristyl-sn-glycero-phosphocholine (DMPC) ), 1,2-dioleyl-sn-glycero-3-phosphocholine (DOPC), 1,2-dipalmitoyl-sn-glycero-3-phosphocholine (DPPC), 1,2- Di-undecanyl-sn-glycero-phosphocholine (DUPC), 1-palmitoyl-2-oleyl-sn-glycero-3-phosphocholine (POPC), 1,2-di- O-octadecenyl-sn-glycero-3-phosphocholine (18:0 diether PC), 1-oleyl-2-cholesterylhemisuccinoyl-sn-glycero-3-phosphocholine ( OChemsPC), 1-hexadecyl-sn-glycero-3-phosphocholine (C16 Lyso PC), 1,2-dilinanoyl-sn-glycero-3-phosphocholine, 1,2-diarachis Tetraenyl-sn-glycero-3-phosphocholine, 1,2-di-docosahexaenyl-sn-glycero-3-phosphocholine, 1,2-diphytanyl- sn-glycero-3-phosphoethanolamine (ME 16.0 PE), 1,2-distearyl-sn-glycero-3-phosphoethanolamine, 1,2-dilinoleo-sn-glycero-3-phosphate Ethanolamine, 1,2-Dilinanoyl-sn-glycero-3-phosphoethanolamine, 1,2-Diarachidonyl-sn-glycero-3-phosphoethanolamine, 1,2-di-docosyl Hexenyl-sn-glycero-3-phosphoethanolamine, 1,2-dioleyl-sn-glycero-3-phosphate-racemic-(1-glycerol) sodium salt (DOPG), sphingomyelin and its mixture.

在一些實施例中,本揭示案之PEG改質之脂質包含PEG改質之磷脂醯乙醇胺、PEG改質之磷脂酸、PEG改質之神經醯胺、PEG改質之二烷基胺、PEG改質之二醯基甘油、PEG改質之二烷基甘油及其混合物。在一些實施例中,PEG改質之脂質係DMG-PEG、PEG-c-DOMG (亦稱為PEG-DOMG)、PEG-DSG及/或PEG-DPG。In some embodiments, the PEG-modified lipids of the present disclosure comprise PEG-modified phosphatidylethanolamine, PEG-modified phosphatidic acid, PEG-modified ceramide, PEG-modified dialkylamine, PEG-modified phosphatidic acid, Quality diacylglycerol, PEG-modified dialkylglycerol and mixtures thereof. In some embodiments, the PEG-modified lipids are DMG-PEG, PEG-c-DOMG (also known as PEG-DOMG), PEG-DSG, and/or PEG-DPG.

在一些實施例中,本揭示案之固醇包含膽固醇、糞生固醇、谷固醇、麥角固醇、菜油固醇、豆固醇、蕓苔固醇、番茄鹼、熊果酸、α-生育酚及其混合物。In some embodiments, the sterols of the present disclosure comprise cholesterol, coprosterol, sitosterol, ergosterol, campesterol, stigmasterol, brassosterol, tomatine, ursolic acid, alpha - Tocopherols and mixtures thereof.

在一些實施例中,本揭示案之LNP包含化合物1之可電離之陽離子脂質,其中非陽離子脂質係DSPC,結構脂質係膽固醇,且PEG脂質係DMG-PEG。In some embodiments, the LNPs of the present disclosure comprise the ionizable cationic lipid of Compound 1, wherein the non-cationic lipid is DSPC, the structural lipid is cholesterol, and the PEG lipid is DMG-PEG.

在一些實施例中,脂質奈米顆粒包含45-55莫耳百分比(mol%)之可電離之陽離子脂質。舉例而言,脂質奈米顆粒可包含45 mol%、46 mol%、47 mol%、48 mol%、49 mol%、50 mol%、51 mol%、52 mol%、53 mol%、54 mol%或55 mol%可電離之陽離子脂質。In some embodiments, the lipid nanoparticles comprise 45-55 mole percent (mol %) of ionizable cationic lipids. For example, the lipid nanoparticle may comprise 45 mol%, 46 mol%, 47 mol%, 48 mol%, 49 mol%, 50 mol%, 51 mol%, 52 mol%, 53 mol%, 54 mol%, or 55 mol% ionizable cationic lipid.

在一些實施例中,脂質奈米顆粒包含5-15 mol%、5-10 mol%或10-15 mol% DSPC。舉例而言,脂質奈米顆粒可包含5 mol%、6 mol%、7 mol%、8 mol%、9 mol%、10 mol%、11 mol%、12 mol%、13 mol%、14 mol%或15 mol% DSPC。In some embodiments, the lipid nanoparticles comprise 5-15 mol%, 5-10 mol%, or 10-15 mol% DSPC. For example, the lipid nanoparticle can comprise 5 mol%, 6 mol%, 7 mol%, 8 mol%, 9 mol%, 10 mol%, 11 mol%, 12 mol%, 13 mol%, 14 mol% or 15 mol% DSPC.

在一些實施例中,脂質奈米顆粒包含35-40 mol%膽固醇。舉例而言,脂質奈米顆粒可包含35 mol%、35.5 mol%、36 mol%、36.5 mol%、37 mol%、37.5 mol%、38 mol%、38.5 mol%、39 mol%、39.5 mol%或40 mol%膽固醇。In some embodiments, the lipid nanoparticles comprise 35-40 mol% cholesterol. For example, the lipid nanoparticle can comprise 35 mol%, 35.5 mol%, 36 mol%, 36.5 mol%, 37 mol%, 37.5 mol%, 38 mol%, 38.5 mol%, 39 mol%, 39.5 mol%, or 40 mol% cholesterol.

在一些實施例中,脂質奈米顆粒包含1-2 mol%、1-3 mol%、1-4 mol%或1-5 mol% DMG-PEG。舉例而言,脂質奈米顆粒可包含1 mol%、1.5 mol%、2 mol%、2.5 mol%、3 mol%或3.5 mol% DMG-PEG。In some embodiments, the lipid nanoparticles comprise 1-2 mol%, 1-3 mol%, 1-4 mol%, or 1-5 mol% DMG-PEG. For example, lipid nanoparticles can comprise 1 mol%, 1.5 mol%, 2 mol%, 2.5 mol%, 3 mol%, or 3.5 mol% DMG-PEG.

在一些實施例中,脂質奈米顆粒包含50 mol%可電離之陽離子脂質、10 mol% DSPC、38.5 mol%膽固醇及1.5 mol% DMG-PEG。In some embodiments, the lipid nanoparticles comprise 50 mol% ionizable cationic lipid, 10 mol% DSPC, 38.5 mol% cholesterol, and 1.5 mol% DMG-PEG.

在一些實施例中,脂質奈米顆粒包含49 mol%可電離之陽離子脂質、10 mol% DSPC、38.5 mol%膽固醇及2.5 mol% DMG-PEG。In some embodiments, the lipid nanoparticles comprise 49 mol% ionizable cationic lipids, 10 mol% DSPC, 38.5 mol% cholesterol, and 2.5 mol% DMG-PEG.

在一些實施例中,脂質奈米顆粒包含49 mol%可電離之陽離子脂質、11 mol% DSPC、38.5 mol%膽固醇及1.5 mol% DMG-PEG。In some embodiments, the lipid nanoparticles comprise 49 mol% ionizable cationic lipids, 11 mol% DSPC, 38.5 mol% cholesterol, and 1.5 mol% DMG-PEG.

在一些實施例中,脂質奈米顆粒包含48 mol%可電離之陽離子脂質、11 mol% DSPC、38.5 mol%膽固醇及2.5 mol% DMG-PEG。In some embodiments, the lipid nanoparticles comprise 48 mol% ionizable cationic lipids, 11 mol% DSPC, 38.5 mol% cholesterol, and 2.5 mol% DMG-PEG.

在一些實施例中,本揭示案之LNP包含約2:1至約30:1之N:P比。In some embodiments, the LNPs of the present disclosure comprise an N:P ratio of about 2:1 to about 30:1.

在一些實施例中,本揭示案之LNP包含約6:1之N:P比。In some embodiments, the LNPs of the present disclosure comprise an N:P ratio of about 6:1.

在一些實施例中,本揭示案之LNP包含約3:1之N:P比。In some embodiments, the LNPs of the present disclosure comprise an N:P ratio of about 3:1.

在一些實施例中,本揭示案之LNP包含約10:1至約100:1之可電離之陽離子脂質組分對RNA之重量/重量比。In some embodiments, the LNPs of the present disclosure comprise a weight/weight ratio of ionizable cationic lipid component to RNA of about 10:1 to about 100:1.

在一些實施例中,本揭示案之LNP包含約20:1之可電離之陽離子脂質組分對RNA之重量/重量比。In some embodiments, the LNPs of the present disclosure comprise a weight/weight ratio of ionizable cationic lipid component to RNA of about 20:1.

在一些實施例中,本揭示案之LNP包含約10:1之可電離之陽離子脂質組分對RNA之重量/重量比。In some embodiments, the LNPs of the present disclosure comprise a weight/weight ratio of ionizable cationic lipid component to RNA of about 10:1.

在一些實施例中,本揭示案之LNP具有約50 nm至約150 nm之平均直徑。In some embodiments, the LNPs of the present disclosure have an average diameter of about 50 nm to about 150 nm.

在一些實施例中,本揭示案之LNP具有約70 nm至約120 nm之平均直徑。 多價疫苗 In some embodiments, the LNPs of the present disclosure have an average diameter of about 70 nm to about 120 nm. polyvalent vaccine

如本文提供之組合物可包括編碼相同或不同種類之兩種或更多種抗原之RNA或多種RNA。在一些實施例中,組合物包括編碼兩種或更多種冠狀病毒抗原之mRNA或多種mRNA。在一些實施例中,RNA可編碼1種、2種、3種、4種、5種、6種、7種、8種、9種、10種、11種、12種或更多種冠狀病毒抗原。Compositions as provided herein can include RNA or RNAs encoding two or more antigens of the same or different species. In some embodiments, the composition includes mRNA or mRNAs encoding two or more coronavirus antigens. In some embodiments, the RNA can encode 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more coronaviruses antigen.

在一些實施例中,可將兩種或更多種不同的編碼抗原之mRNA調配於相同脂質奈米顆粒中。在其他實施例中,可將兩種或更多種不同之編碼抗原之RNA調配於單獨脂質奈米顆粒中(每一RNA調配於單一脂質奈米顆粒中)。接著,可組合脂質奈米顆粒且作為單一疫苗組合物(例如,包含編碼多種抗原之多種RNA)投與,或可單獨投與。 組合疫苗 In some embodiments, two or more different antigen-encoding mRNAs can be formulated in the same lipid nanoparticle. In other embodiments, two or more different antigen-encoding RNAs can be formulated into separate lipid nanoparticles (each RNA is formulated into a single lipid nanoparticle). The lipid nanoparticles can then be combined and administered as a single vaccine composition (eg, comprising multiple RNAs encoding multiple antigens), or can be administered separately. combination vaccine

如本文提供之組合物可包括編碼相同或不同病毒株之兩種或更多種抗原之mRNA或多種RNA。本文亦提供了包含編碼一或多種冠狀病毒及不同生物體之一或多種抗原之RNA的組合疫苗。因此,本揭示案之疫苗可為靶向相同株/種類之一或多種抗原或不同株/種類之一或多種抗原之組合疫苗,例如誘導對在冠狀病毒感染風險高之相同地理區域發現之生物體或當暴露於冠狀病毒時個體可能暴露於之生物體之免疫的抗原。 醫藥調配物 Compositions as provided herein can include mRNA or multiple RNAs encoding two or more antigens of the same or different virus strains. Also provided herein are combination vaccines comprising RNA encoding one or more coronaviruses and one or more antigens from different organisms. Thus, the vaccines of the present disclosure may be combination vaccines targeting one or more antigens of the same strain/species or one or more antigens of different strains/species, eg to induce response to organisms found in the same geographic area with a high risk of coronavirus infection Antigens of immunity of an organism or an organism to which an individual may be exposed when exposed to the coronavirus. Pharmaceutical formulations

本文提供用於預防或治療例如人類及其他哺乳動物之冠狀病毒之組合物(例如醫藥組合物)、方法、套組及試劑。本文提供之組合物可用作治療劑或預防劑。其可在醫藥中用於預防及/或治療冠狀病毒感染。Provided herein are compositions (eg, pharmaceutical compositions), methods, kits, and reagents for the prevention or treatment of, eg, human and other mammalian coronaviruses. The compositions provided herein can be used as therapeutic or prophylactic agents. It can be used in medicine to prevent and/or treat coronavirus infection.

在一些實施例中,可將如本文所述之含有RNA之冠狀病毒疫苗投與給個體(例如哺乳動物個體,諸如人類個體),且mRNA在活體內轉譯以產生抗原性多肽(抗原)。In some embodiments, an RNA-containing coronavirus vaccine as described herein can be administered to an individual (eg, a mammalian individual, such as a human individual), and the mRNA is translated in vivo to produce an antigenic polypeptide (antigen).

組合物(例如,包含RNA)之「有效量」至少部分基於靶組織、靶細胞類型、投與方式、RNA之物理特徵(例如,長度、核苷酸組成及/或修飾之核苷之程度)、疫苗之其他組分及其他决定因素,諸如個體之年齡、體重、身高、性別及一般健康狀况。通常,有效量之組合物提供誘導或增强之免疫反應,作為個體細胞中之抗原產生之函數。在一些實施例中,有效量之含有具有至少一個化學修飾之mRNA之組合物比含有編碼相同抗原或肽抗原之相應未經修飾之聚核苷酸之組合物更有效。抗原產生增加可藉由細胞轉染(用RNA疫苗轉染之細胞之百分數)增加、自聚核苷酸之蛋白質轉譯及/或表現增加、核酸降解减少(例如,如藉由自經修飾之聚核苷酸之蛋白質轉譯之持續時間增加)或宿主細胞之抗原特異性免疫反應之改變來證明。An "effective amount" of a composition (e.g., comprising RNA) is based, at least in part, on the target tissue, target cell type, mode of administration, physical characteristics of the RNA (e.g., length, nucleotide composition, and/or degree of modified nucleosides) , other components of the vaccine and other determinants such as the age, weight, height, sex and general health of the individual. Typically, an effective amount of the composition provides for induction or enhancement of an immune response as a function of antigen production in the cells of the individual. In some embodiments, an effective amount of a composition containing mRNA with at least one chemical modification is more effective than a composition containing the corresponding unmodified polynucleotide encoding the same antigen or peptide antigen. Increased antigen production can be achieved by increased cell transfection (percentage of cells transfected with RNA vaccine), increased protein translation and/or expression from polynucleotides, decreased nucleic acid degradation (eg, as by modified polynucleotides). This is evidenced by an increase in the duration of protein translation of nucleotides) or by changes in the antigen-specific immune response of the host cell.

術語「醫藥組合物」係指活性劑與惰性或活性載劑之組合,使得組合物特別適用於活體內或離體診斷或治療用途。「醫藥學上可接受之載劑」在投與給個體後或在投與給個體時不會引起不期望之生理效應。醫藥組合物中之載劑自其與活性成分相容且能够穩定活性成分之意義上必須亦係「可接受的」。一或多種增溶劑可用作遞送活性劑之醫藥載劑。醫藥學上可接受之載劑之實例包括(但不限於)生物相容性媒劑、佐劑、添加劑及稀釋劑,以獲得可用作劑型之組合物。其他載劑之實例包括膠態氧化矽、硬脂酸鎂、纖維素及月桂基硫酸鈉。額外適宜醫藥載劑及稀釋劑以及使用其之醫藥必需品闡述於Remington's Pharmaceutical Sciences中。The term "pharmaceutical composition" refers to a combination of an active agent and an inert or active carrier that makes the composition particularly suitable for in vivo or ex vivo diagnostic or therapeutic use. A "pharmaceutically acceptable carrier" does not cause undesired physiological effects after or when administered to a subject. A carrier in a pharmaceutical composition must also be "acceptable" in the sense that it is compatible with and capable of stabilizing the active ingredient. One or more solubilizers can be used as pharmaceutical carriers for delivery of active agents. Examples of pharmaceutically acceptable carriers include, but are not limited to, biocompatible vehicles, adjuvants, additives, and diluents to obtain compositions useful as dosage forms. Examples of other carriers include colloidal silica, magnesium stearate, cellulose and sodium lauryl sulfate. Additional suitable pharmaceutical carriers and diluents and the medical necessities for their use are described in Remington's Pharmaceutical Sciences.

在一些實施例中,根據本揭示案之組合物(包含聚核苷酸及其編碼之多肽)可用於治療或預防冠狀病毒感染。組合物可作為主動免疫方案之一部分預防性或治療性地投與給健康個體或在潜伏期(incubation phase)期間或在症狀發作後之活動性感染期間之感染早期投與。在一些實施例中,提供給細胞、組織或個體之RNA之量可為有效用於免疫預防之量。In some embodiments, compositions according to the present disclosure (including polynucleotides and their encoded polypeptides) can be used to treat or prevent coronavirus infection. The compositions may be administered prophylactically or therapeutically to healthy individuals as part of an active immunization regimen or early in infection during the incubation phase or during active infection following the onset of symptoms. In some embodiments, the amount of RNA provided to the cell, tissue or individual may be an amount effective for immunoprophylaxis.

組合物可與其他預防性或治療性化合物一起投與。作為非限制性實例,預防性或治療性化合物可為佐劑或加强劑。如本文所用,當提及預防性組合物(諸如疫苗)時,術語「加强劑」係指額外投與預防性(疫苗)組合物。加强劑(或加强疫苗)可在較早投與預防性組合物之後給予。初始投與預防性組合物與加强劑之間之投與時間可為(但不限於) 1分鐘、2分鐘、3分鐘、4分鐘、5分鐘、6分鐘、7分鐘、8分鐘、9分鐘、10分鐘、15分鐘、20分鐘、35分鐘、40分鐘、45分鐘、50分鐘、55分鐘、1小時、2小時、3小時、4小時、5小時、6小時、7小時、8小時、9小時、10小時、11小時、12小時、13小時、14小時、15小時、16小時、17小時、18小時、19小時、20小時、21小時、22小時、23小時、1天、36小時、2天、3天、4天、5天、6天、1週、10天、2週、3週、1個月、2個月、3個月、4個月、5個月、6個月、7個月、8個月、9個月、10個月、11個月、1年、18個月、2年、3年、4年、5年、6年、7年、8年、9年、10年、11年、12年、13年、14年、15年、16年、17年、18年、19年、20年、25年、30年、35年、40年、45年、50年、55年、60年、65年、70年、75年、80年、85年、90年、95年或超過99年。在示例性實施例中,初始投與預防性組合物與加强劑之間之投與時間可為(但不限於) 1週、2週、3週、1個月、2個月、3個月、6個月或1年。The compositions can be administered with other prophylactic or therapeutic compounds. By way of non-limiting example, a prophylactic or therapeutic compound can be an adjuvant or booster. As used herein, when referring to a prophylactic composition (such as a vaccine), the term "booster" refers to the additional administration of the prophylactic (vaccine) composition. A booster (or booster vaccine) can be administered after an earlier administration of the prophylactic composition. The administration time between the initial administration of the prophylactic composition and the booster can be, but is not limited to, 1 minute, 2 minutes, 3 minutes, 4 minutes, 5 minutes, 6 minutes, 7 minutes, 8 minutes, 9 minutes, 10 minutes, 15 minutes, 20 minutes, 35 minutes, 40 minutes, 45 minutes, 50 minutes, 55 minutes, 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours, 8 hours, 9 hours , 10 hours, 11 hours, 12 hours, 13 hours, 14 hours, 15 hours, 16 hours, 17 hours, 18 hours, 19 hours, 20 hours, 21 hours, 22 hours, 23 hours, 1 day, 36 hours, 2 days, 3 days, 4 days, 5 days, 6 days, 1 week, 10 days, 2 weeks, 3 weeks, 1 month, 2 months, 3 months, 4 months, 5 months, 6 months, 7 months, 8 months, 9 months, 10 months, 11 months, 1 year, 18 months, 2 years, 3 years, 4 years, 5 years, 6 years, 7 years, 8 years, 9 years , 10 years, 11 years, 12 years, 13 years, 14 years, 15 years, 16 years, 17 years, 18 years, 19 years, 20 years, 25 years, 30 years, 35 years, 40 years, 45 years, 50 years years, 55 years, 60 years, 65 years, 70 years, 75 years, 80 years, 85 years, 90 years, 95 years or more than 99 years. In exemplary embodiments, the administration time between the initial administration of the prophylactic composition and the booster can be, but is not limited to, 1 week, 2 weeks, 3 weeks, 1 month, 2 months, 3 months , 6 months or 1 year.

在一些實施例中,組合物可經肌內、鼻內或真皮內投與,類似於業內已知之滅活疫苗之投與。In some embodiments, the composition may be administered intramuscularly, intranasally, or intradermally, similar to the administration of inactivated vaccines known in the art.

根據感染之盛行率或未滿足之醫療需要之程度或水準,組合物可用於各種環境。作為非限制性實例,RNA疫苗可用於治療及/或預防多種傳染病。RNA疫苗具有優越的性質,此乃因其比可商購獲得之疫苗產生大得多之抗體效價、更佳之中和免疫性、產生更耐久之免疫反應及/或更早產生反應。The compositions can be used in a variety of settings depending on the prevalence of infection or the degree or level of unmet medical need. By way of non-limiting example, RNA vaccines can be used to treat and/or prevent a variety of infectious diseases. RNA vaccines have superior properties because they generate much larger antibody titers, better neutralize immunity, generate more durable immune responses, and/or generate responses earlier than commercially available vaccines.

本文提供醫藥組合物,其包括RNA及/或複合體,視情况地與一或多種醫藥學上可接受之賦形劑組合。Provided herein are pharmaceutical compositions comprising RNA and/or complexes, optionally in combination with one or more pharmaceutically acceptable excipients.

RNA可單獨調配或投與,或與一或多種其他組分聯合調配或投與。舉例而言,組合物可包含其他組分,包括但不限於佐劑。RNA can be formulated or administered alone, or in combination with one or more other components. For example, the composition may contain other components including, but not limited to, adjuvants.

在一些實施例中,組合物不包含佐劑(其不含佐劑)。In some embodiments, the composition does not contain an adjuvant (it does not contain an adjuvant).

RNA可與一或多種醫藥學上可接受之賦形劑組合調配或投與。在一些實施例中,疫苗組合物包含至少一種額外活性物質,例如治療活性物質、預防活性物質或兩者之組合。疫苗組合物可為無菌的、無熱原的,或無菌及無熱原的。在醫藥劑(諸如疫苗組合物)之調配及/或製造中之一般考慮因素可參見(例如) Remington: The Science and Practice of Pharmacy,第21版,Lippincott Williams & Wilkins, 2005 (其全部內容以引用方式併入本文中)。RNA can be formulated or administered in combination with one or more pharmaceutically acceptable excipients. In some embodiments, the vaccine composition comprises at least one additional active, eg, a therapeutic active, a prophylactic active, or a combination of the two. Vaccine compositions can be sterile, pyrogen-free, or sterile and pyrogen-free. General considerations in the formulation and/or manufacture of pharmaceutical agents such as vaccine compositions can be found, for example, in Remington: The Science and Practice of Pharmacy, 21st Ed. Lippincott Williams & Wilkins, 2005 (incorporated by reference in its entirety). manner is incorporated herein).

在一些實施例中,將組合物投與給人類、人類患者或個體。出於本揭示案之目的,片語「活性成分」通常係指RNA疫苗或其中含有之聚核苷酸,例如編碼抗原之mRNA。In some embodiments, the composition is administered to a human, human patient or individual. For the purposes of this disclosure, the phrase "active ingredient" generally refers to an RNA vaccine or a polynucleotide contained therein, such as mRNA encoding an antigen.

本文所述之疫苗組合物之調配物可藉由藥理學領域中已知或以後開發之任何方法來製備。一般而言,該等製備方法包括使活性成分(例如mRNA)與賦形劑及/或一或多種其他輔助成分締合且接著若必要及/或期望,將產品分割、成型及/或包裝成期望單劑量或多劑量單位之步驟。Formulations of the vaccine compositions described herein can be prepared by any method known or later developed in the art of pharmacology. In general, such methods of preparation include associating the active ingredient (eg, mRNA) with excipients and/or one or more other auxiliary ingredients and then, if necessary and/or desired, dividing, shaping and/or packaging the product into a Single-dose or multiple-dose unit procedures are desired.

本揭示案之醫藥組合物中之活性成分、醫藥學上可接受之賦形劑及/或任何額外成分之相對量將根據所治療之個體之身份、大小及/或狀况且進一步根據組合物欲投與之途徑而變化。舉例而言,組合物可包含介於0.1%與100%之間、例如介於0.5%與50%之間、介於1%與30%之間、介於5%與80%之間、至少80% (w/w)之活性成分。The relative amounts of active ingredients, pharmaceutically acceptable excipients and/or any additional ingredients in the pharmaceutical compositions of the present disclosure will depend on the identity, size and/or condition of the individual being treated and further on the composition to be administered change with the way. For example, the composition may comprise between 0.1% and 100%, such as between 0.5% and 50%, between 1% and 30%, between 5% and 80%, at least 80% (w/w) active ingredient.

在一些實施例中,使用一或多種賦形劑調配mRNA以:(1)增加穩定性;(2)增加細胞轉染;(3)允許持續或延遲釋放(例如,自貯庫調配物);(4)改變生物分佈(例如,靶向特定組織或細胞類型);(5)增加編碼之蛋白質在活體內之轉譯;及/或(6)改變編碼之蛋白質(抗原)在活體內之釋放曲綫。除了傳統賦形劑(諸如任何及所有溶劑、分散介質、稀釋劑或其他液體媒劑、分散或懸浮助劑、表面活性劑、等滲劑、增稠劑或乳化劑、防腐劑)之外,賦形劑亦可包括(但不限於)類脂質、脂質體、脂質奈米顆粒、聚合物、脂質複合體、核-殼奈米顆粒、肽、蛋白質、用RNA轉染之細胞(例如,用於移植至個體中)、玻尿酸酶、奈米顆粒模擬物及其組合。 投藥 / 投與 In some embodiments, mRNA is formulated with one or more excipients to: (1) increase stability; (2) increase cell transfection; (3) allow sustained or delayed release (eg, from a depot formulation); (4) alter biodistribution (eg, target specific tissues or cell types); (5) increase translation of the encoded protein in vivo; and/or (6) alter the release profile of the encoded protein (antigen) in vivo . In addition to conventional excipients such as any and all solvents, dispersion media, diluents or other liquid vehicles, dispersing or suspending aids, surfactants, isotonic, thickening or emulsifying agents, preservatives, Excipients may also include, but are not limited to, lipidoids, liposomes, lipid nanoparticles, polymers, lipid complexes, core-shell nanoparticles, peptides, proteins, cells transfected with RNA (e.g., with transplantation into an individual), hyaluronidase, nanoparticle mimics, and combinations thereof. dosing / administration

本文提供用於預防及/或治療人類及其他哺乳動物之冠狀病毒感染之組合物(例如,RNA疫苗)、方法、套組及試劑。免疫組合物可用作治療劑或預防劑。在一些實施例中,組合物用於提供對冠狀病毒感染之預防性保護。在一些實施例中,組合物用於治療冠狀病毒感染。在一些實施例中,組合物用於引發免疫效應細胞,例如離體活化外周血單核細胞(PBMC),接著將其輸注(再輸注)至個體中。Provided herein are compositions (eg, RNA vaccines), methods, kits and reagents for preventing and/or treating coronavirus infections in humans and other mammals. Immunological compositions can be used as therapeutic or prophylactic agents. In some embodiments, the compositions are used to provide prophylactic protection against coronavirus infection. In some embodiments, the composition is used to treat a coronavirus infection. In some embodiments, the compositions are used to elicit immune effector cells, eg, ex vivo activated peripheral blood mononuclear cells (PBMCs), which are then infused (re-infused) into an individual.

個體可為任何哺乳動物,包括非人靈長類及人類個體。通常,個體為人類個體。The subject can be any mammal, including non-human primates and human subjects. Typically, the individual is a human individual.

在一些實施例中,將組合物(例如,RNA疫苗)以有效誘導抗原特異性免疫反應之量投與給個體(例如,哺乳動物個體,諸如人類個體)。編碼冠狀病毒抗原之RNA在活體內表現及轉譯以產生抗原,此接著刺激個體之免疫反應。In some embodiments, a composition (eg, an RNA vaccine) is administered to an individual (eg, a mammalian individual, such as a human individual) in an amount effective to induce an antigen-specific immune response. The RNA encoding the coronavirus antigen is expressed and translated in vivo to produce the antigen, which in turn stimulates an immune response in the individual.

在投與本揭示案之組合物之後,可實現對冠狀病毒之預防性保護。免疫組合物可投與一次、兩次、三次、四次或更多次,但可能投與一次疫苗即足矣(視情况地隨後進行單次加强)。儘管不太期望,但可將組合物投與給感染之個體以實現治療反應。可能需要相應地調整投藥。Preventive protection against coronaviruses can be achieved following administration of the compositions of the present disclosure. The immunizing composition may be administered once, twice, three times, four times or more, although one administration of the vaccine may suffice (optionally followed by a single boost). Although less desirable, the composition can be administered to an infected individual to achieve a therapeutic response. Dosing may need to be adjusted accordingly.

在本揭示案之態樣中,提供在個體中引發針對冠狀病毒抗原(或多種抗原)之免疫反應之方法。在一些實施例中,方法包括向個體投與包含具有編碼冠狀病毒抗原之開放閱讀框之mRNA的組合物,從而在個體中誘導對冠狀病毒抗原特異性之免疫反應,其中相對於用預防有效劑量之針對該抗原之傳統疫苗接種之個體中之抗抗原抗體效價,疫苗接種後個體中之抗抗原抗體效價增加。「抗抗原抗體」係特異性結合至抗原之血清抗體。In aspects of the present disclosure, methods of eliciting an immune response against a coronavirus antigen (or antigens) in an individual are provided. In some embodiments, the method comprises administering to the individual a composition comprising mRNA having an open reading frame encoding a coronavirus antigen, thereby inducing in the individual an immune response specific for the coronavirus antigen, wherein relative to using a prophylactically effective dose The anti-antigen antibody titers in individuals traditionally vaccinated against the antigen increased after vaccination. "Anti-antigen antibodies" are serum antibodies that specifically bind to an antigen.

預防有效劑量係在臨床上可接受之水準上預防病毒感染之有效劑量。在一些實施例中,有效劑量係疫苗之包裝插頁中列出之劑量。如本文所用,傳統疫苗係指除本揭示案之mRNA疫苗之外之疫苗。例如,傳統疫苗包括(但不限於)活微生物疫苗、殺死之微生物疫苗、次單元疫苗、蛋白質抗原疫苗、DNA疫苗、類病毒顆粒(VLP)疫苗等。在示例性實施例中,傳統疫苗係已經獲得監管部門批准及/或在國家藥物監管機構(例如美國食品藥品管理局(FDA)或歐洲藥物管理局(EMA))註册之疫苗。A prophylactically effective dose is a dose effective at a clinically acceptable level to prevent viral infection. In some embodiments, the effective dose is the dose listed on the package insert of the vaccine. As used herein, traditional vaccines refer to vaccines other than the mRNA vaccines of the present disclosure. For example, conventional vaccines include, but are not limited to, live microbial vaccines, killed microbial vaccines, subunit vaccines, protein antigen vaccines, DNA vaccines, virus-like particle (VLP) vaccines, and the like. In exemplary embodiments, conventional vaccines are vaccines that have been approved by regulatory authorities and/or registered with a national drug regulatory agency, such as the US Food and Drug Administration (FDA) or the European Medicines Agency (EMA).

在一些實施例中,相對於用預防有效劑量之針對冠狀病毒之傳統疫苗接種之個體或未接種疫苗之個體中之抗抗原抗體效價,疫苗接種後之個體中之抗抗原抗體效價增加1 log至10 log。在一些實施例中,相對於用預防有效劑量之針對冠狀病毒之傳統疫苗接種之個體或未接種疫苗之個體中之抗抗原抗體效價,疫苗接種後之個體中之抗抗原抗體效價增加1 log、2 log、3 log、4 log、5 log或10 log。In some embodiments, the anti-antigen antibody titer in a vaccinated individual increases by 1 relative to the anti-antigen antibody titer in an individual vaccinated with a prophylactically effective dose of traditional vaccine against coronavirus or an unvaccinated individual log to 10 log. In some embodiments, the anti-antigen antibody titer in a vaccinated individual increases by 1 relative to the anti-antigen antibody titer in an individual vaccinated with a prophylactically effective dose of traditional vaccine against coronavirus or an unvaccinated individual log, 2 log, 3 log, 4 log, 5 log or 10 log.

在本揭示案之其他態樣中,提供在個體中引發針對冠狀病毒之免疫反應之方法。該方法包括向個體投與包含含有編碼冠狀病毒抗原之開放閱讀框之mRNA的組合物,從而在個體中誘導對冠狀病毒特異性之免疫反應,其中個體中之免疫反應等同於以相對於組合物2倍至100倍劑量水準用針對冠狀病毒之傳統疫苗接種之個體中之免疫反應。In other aspects of the present disclosure, methods of eliciting an immune response against a coronavirus in an individual are provided. The method comprises administering to an individual a composition comprising mRNA comprising an open reading frame encoding a coronavirus antigen, thereby inducing in the individual an immune response specific to the coronavirus, wherein the immune response in the individual is equivalent to that relative to the composition Immune responses in individuals vaccinated with conventional vaccines against coronavirus at 2-fold to 100-fold dose levels.

在一些實施例中,個體中之免疫反應等同於以相對於本揭示案之組合物兩倍之劑量水準用傳統疫苗接種之個體中之免疫反應。在一些實施例中,個體中之免疫反應等同於以相對於本揭示案之組合物三倍之劑量水準用傳統疫苗接種之個體中之免疫反應。在一些實施例中,個體中之免疫反應等同於以相對於本揭示案之組合物4倍、5倍、10倍、50倍或100倍之劑量水準用傳統疫苗接種之個體中之免疫反應。在一些實施例中,個體中之免疫反應等同於以相對於本揭示案之組合物10倍至1000倍之劑量水準用傳統疫苗接種之個體中之免疫反應。在一些實施例中,個體中之免疫反應等同於以相對於本揭示案之組合物100倍至1000倍之劑量水準用傳統疫苗接種之個體中之免疫反應。In some embodiments, the immune response in an individual is equivalent to that in an individual vaccinated with conventional vaccines at dose levels that are twice relative to the compositions of the present disclosure. In some embodiments, the immune response in an individual is equivalent to that in an individual vaccinated with conventional vaccines at dose levels three times relative to the compositions of the present disclosure. In some embodiments, the immune response in an individual is equivalent to that in an individual vaccinated with conventional vaccines at dose levels 4-, 5-, 10-, 50-, or 100-fold relative to the compositions of the present disclosure. In some embodiments, the immune response in an individual is equivalent to that in an individual vaccinated with conventional vaccines at dose levels 10- to 1000-fold relative to the compositions of the present disclosure. In some embodiments, the immune response in an individual is equivalent to that in an individual vaccinated with conventional vaccines at dose levels 100- to 1000-fold relative to the compositions of the present disclosure.

在其他實施例中,藉由測定個體中之[蛋白]抗體效價來評價免疫反應。在其他實施例中,測試來自經免疫個體之血清或抗體中和病毒攝取或减少人類B淋巴球之冠狀病毒轉型之能力。在其他實施例中,使用業內公認之技術來量測促進强T細胞反應之能力。In other embodiments, the immune response is assessed by measuring [protein] antibody titers in the individual. In other embodiments, sera or antibodies from immunized individuals are tested for their ability to neutralize viral uptake or reduce coronavirus transformation of human B lymphocytes. In other embodiments, the ability to promote a strong T cell response is measured using art-recognized techniques.

本揭示案之其他態樣提供藉由向個體投與包含具有編碼冠狀病毒抗原之開放閱讀框之mRNA之組合物,從而在個體中誘導對冠狀病毒抗原特異性之免疫反應,而在個體中引發針對冠狀病毒之免疫反應的方法,其中相對於在用預防有效劑量之針對冠狀病毒之傳統疫苗接種之個體中誘導之免疫反應,該個體中誘導之免疫反應早2天至10週。在一些實施例中,在以相對於本揭示案之組合物2倍至100倍之劑量水準用預防有效劑量之傳統疫苗接種之個體中,誘導該個體中之免疫反應。Other aspects of the present disclosure provide for inducing an immune response specific to a coronavirus antigen in an individual by administering to the individual a composition comprising mRNA having an open reading frame encoding a coronavirus antigen, eliciting in an individual A method of an immune response against a coronavirus wherein the immune response is induced in an individual 2 days to 10 weeks earlier than that induced in an individual vaccinated with a prophylactically effective dose of a conventional vaccine against the coronavirus. In some embodiments, an immune response in an individual is induced in an individual vaccinated with a prophylactically effective dose of a conventional vaccine at a dose level of 2- to 100-fold relative to a composition of the present disclosure.

在一些實施例中,相對於在用預防有效劑量之傳統疫苗接種之個體中誘導之免疫反應,該個體中誘導之免疫反應早2天、3天、1週、2週、3週、5週或10週。In some embodiments, the immune response is induced in an individual 2 days, 3 days, 1 week, 2 weeks, 3 weeks, 5 weeks earlier than the immune response induced in the individual vaccinated with a prophylactically effective dose of the traditional vaccine or 10 weeks.

本文亦提供藉由向個體投與具有編碼第一抗原之開放閱讀框之mRNA而在個體中引發針對冠狀病毒之免疫反應的方法,其中RNA不包括穩定元件,且其中佐劑不與疫苗共調配或共投與。Also provided herein are methods of eliciting an immune response against a coronavirus in an individual by administering to the individual mRNA having an open reading frame encoding a first antigen, wherein the RNA does not include stabilizing elements, and wherein the adjuvant is not co-formulated with the vaccine or co-investment.

組合物可藉由任何產生治療有效結果之途徑投與。該等途徑包括(但不限於)真皮內、肌內、鼻內及/或皮下投與。本揭示案提供包括向有需要之個體投與RNA疫苗之方法。所需之確切量將因個體而異,此取决於個體之物種、年齡及一般狀况、疾病之嚴重程度、特定組合物、其投與模式、其活性模式及諸如此類。RNA通常調配成劑量單位形式以便於投與及劑量之均勻性。然而,應理解,RNA之總日用量可由主治醫師在合理之醫學判斷範圍內决定。對於任何特定患者之具體治療有效、預防有效或適當之成像劑量水準將取决於多種因素,包括所治療之病症及病症之嚴重程度;所採用之具體化合物之活性;所採用之具體組合物;患者之年齡、體重、一般健康狀况、性別及飲食;投與時間、投與途徑及所採用之具體化合物之排泄速率;治療之持續時間;與所採用之具體化合物組合或同時使用之藥物;以及醫學領域熟知之類似因素。The compositions can be administered by any route that produces therapeutically effective results. Such routes include, but are not limited to, intradermal, intramuscular, intranasal and/or subcutaneous administration. The present disclosure provides methods comprising administering RNA vaccines to individuals in need. The exact amount required will vary from individual to individual, depending on the individual's species, age and general condition, the severity of the disease, the particular composition, its mode of administration, its mode of activity, and the like. RNA is usually formulated in dosage unit form for ease of administration and uniformity of dosage. It should be understood, however, that the total daily amount of RNA can be determined by the attending physician within the scope of sound medical judgment. The particular therapeutically, prophylactically, or appropriate imaging dose level for any particular patient will depend on a variety of factors, including the condition being treated and the severity of the condition; the activity of the particular compound employed; the particular composition employed; the patient age, weight, general health, sex, and diet; time of administration, route of administration, and rate of excretion of the particular compound employed; duration of treatment; drugs used in combination or concomitantly with the particular compound employed; and Similar factors are well known in the medical field.

如本文提供之RNA之有效量可低至20 μg,例如以單一劑量或兩個10 μg劑量投與。在一些實施例中,有效量為20 µg-300 µg或25 µg-300 µg之總劑量。舉例而言,有效量可為20 µg、25 µg、30 µg、35 µg、40 µg、45 µg、50 µg、55 µg、60 µg、65 µg、70 µg、75 µg、80 µg、85 µg、90 µg、95 µg、100 µg、110 µg、120 µg、130 µg、140 µg、150 µg、160 µg、170 µg、180 µg、190 µg、200 µg、250 µg或300 µg之總劑量。在一些實施例中,有效量為20 μg之總劑量。在一些實施例中,有效量為25 μg之總劑量。在一些實施例中,有效量為50 μg之總劑量。在一些實施例中,有效量為75 μg之總劑量。在一些實施例中,有效量為100 μg之總劑量。在一些實施例中,有效量為150 μg之總劑量。在一些實施例中,有效量為200 μg之總劑量。在一些實施例中,有效量為250 μg之總劑量。在一些實施例中,有效量為300 μg之總劑量。An effective amount of RNA as provided herein can be as low as 20 μg, eg, administered in a single dose or in two 10 μg doses. In some embodiments, the effective amount is a total dose of 20 μg-300 μg or 25 μg-300 μg. For example, an effective amount can be 20 µg, 25 µg, 30 µg, 35 µg, 40 µg, 45 µg, 50 µg, 55 µg, 60 µg, 65 µg, 70 µg, 75 µg, 80 µg, 85 µg, Total dose of 90 µg, 95 µg, 100 µg, 110 µg, 120 µg, 130 µg, 140 µg, 150 µg, 160 µg, 170 µg, 180 µg, 190 µg, 200 µg, 250 µg or 300 µg. In some embodiments, the effective amount is a total dose of 20 μg. In some embodiments, the effective amount is a total dose of 25 μg. In some embodiments, the effective amount is a total dose of 50 μg. In some embodiments, the effective amount is a total dose of 75 μg. In some embodiments, the effective amount is a total dose of 100 μg. In some embodiments, the effective amount is a total dose of 150 μg. In some embodiments, the effective amount is a total dose of 200 μg. In some embodiments, the effective amount is a total dose of 250 μg. In some embodiments, the effective amount is a total dose of 300 μg.

本文所述之RNA可調配成本文所述之劑型,諸如鼻內、氣管內或可注射(例如,靜脉內、眼內、玻璃體內、肌內、真皮內、心內、腹膜內及皮下)。 疫苗效能 The RNAs described herein can be formulated into dosage forms described herein, such as intranasal, intratracheal, or injectable (eg, intravenous, intraocular, intravitreal, intramuscular, intradermal, intracardiac, intraperitoneal, and subcutaneous). vaccine efficacy

本揭示案之一些態樣提供組合物(例如,RNA疫苗)之調配物,其中以有效量調配RNA以在個體中產生抗原特異性免疫反應(例如,產生對冠狀病毒抗原特異性之抗體)。「有效量」係有效產生抗原特異性免疫反應之RNA之劑量。本文亦提供在個體中誘導抗原特異性免疫反應之方法。Some aspects of the present disclosure provide formulations of compositions (eg, RNA vaccines) in which the RNA is formulated in an effective amount to generate an antigen-specific immune response (eg, to generate antibodies specific to a coronavirus antigen) in an individual. An "effective amount" is an amount of RNA effective to generate an antigen-specific immune response. Also provided herein are methods of inducing an antigen-specific immune response in an individual.

如本文所用,對本揭示案之疫苗或LNP之免疫反應係在個體中產生對疫苗中存在之(一或多種)冠狀病毒蛋白之體液及/或細胞免疫反應。出於本揭示案之目的,「體液」免疫反應係指由抗體分子(包括例如分泌性(IgA)或IgG分子)介導之免疫反應,而「細胞」免疫反應係由T淋巴球(例如,CD4 +輔助細胞及/或CD8 +T細胞(例如CTL))及/或其他白血球介導之免疫反應。細胞免疫之一個重要態樣包括由細胞溶解性T細胞(CTL)引起之抗原特異性反應。CTL對與主要組織相容性複合體(MHC)編碼之蛋白質締合呈遞且在細胞表面上表現之肽抗原具有特異性。CTL幫助誘導及促進細胞內微生物之破壞或感染該等微生物之細胞之溶解。細胞免疫之另一態樣包括由輔助T細胞引起之抗原特異性反應。輔助T細胞用於幫助刺激功能且使非特異性效應細胞之活性集中於在表面上展示與MHC分子締合之肽抗原的细胞。細胞免疫反應亦導致產生細胞介素、趨化介素及由活化T細胞及/或其他白血球產生之其他該等分子(包括源自CD4 +及CD8 +T細胞者)。 As used herein, an immune response to a vaccine or LNP of the present disclosure is the production of a humoral and/or cellular immune response in an individual to the coronavirus protein(s) present in the vaccine. For the purposes of this disclosure, a "humoral" immune response refers to an immune response mediated by antibody molecules, including, for example, secretory (IgA) or IgG molecules, while a "cellular" immune response is one mediated by T lymphocytes (eg, CD4 + helper cells and/or CD8 + T cells (eg CTL)) and/or other leukocyte-mediated immune responses. An important aspect of cellular immunity involves antigen-specific responses elicited by cytolytic T cells (CTLs). CTLs are specific for peptide antigens presented in association with proteins encoded by the major histocompatibility complex (MHC) and expressed on the cell surface. CTLs help to induce and promote the destruction of intracellular microorganisms or the lysis of cells infected with such microorganisms. Another aspect of cellular immunity involves antigen-specific responses elicited by helper T cells. Helper T cells are used to help stimulate function and focus the activity of nonspecific effector cells on cells displaying peptide antigens associated with MHC molecules on their surface. The cellular immune response also results in the production of interleukins, chemokines, and other such molecules (including those derived from CD4 + and CD8 + T cells) produced by activated T cells and/or other white blood cells.

在一些實施例中,抗原特異性免疫反應藉由量測在投與如本文提供之組合物之個體中產生之抗冠狀病毒抗原抗體效價來表徵。抗體效價係個體內抗體(例如,對特定抗原或抗原之抗原決定基特異性之抗體)之量之量度。抗體效價通常表示為提供陽性結果之最大稀釋度之倒數。例如,酶聯免疫吸附分析(ELISA)係用於測定抗體效價之常見分析。In some embodiments, an antigen-specific immune response is characterized by measuring the anti-coronavirus antigen antibody titers produced in an individual administered a composition as provided herein. Antibody titer is a measure of the amount of antibody (eg, antibody specific for a particular antigen or epitope of an antigen) in an individual. Antibody titers are usually expressed as the reciprocal of the largest dilution that gave a positive result. For example, enzyme-linked immunosorbent assay (ELISA) is a common assay used to determine antibody titers.

可使用多種血清學測試來量測針對編碼之相關抗原之抗體,例如SAR-CoV-2病毒或SAR-CoV-2病毒抗原,例如SAR-CoV-2刺突或S蛋白,或其結構域。該等測試包括血球凝集抑制測試、補體結合測試、螢光抗體測試、酶聯免疫吸附分析(ELISA)及噬斑减少中和測試(PRNT)。該等測試中之每一者皆量測不同抗體活性。在示例性實施例中,噬斑减少中和測試或PRNT (例如PRNT50或PRNT90)用作保護之血清學相關物。PRNT量測活體外病毒中和之生物學參數,且係某些類別之病毒中最具血清學病毒特異性之測試,與病毒感染之保護之血清水準密切相關。A variety of serological tests can be used to measure antibodies against encoded relevant antigens, such as SAR-CoV-2 virus or SAR-CoV-2 viral antigens, such as the SAR-CoV-2 spike or S protein, or domains thereof. Such tests include hemagglutination inhibition test, complement fixation test, fluorescent antibody test, enzyme-linked immunosorbent assay (ELISA) and plaque reduction neutralization test (PRNT). Each of these tests measures a different antibody activity. In exemplary embodiments, plaque reduction neutralization tests or PRNTs (eg, PRNT50 or PRNT90) are used as serological correlates of protection. PRNT measures the biological parameter of virus neutralization in vitro and is the most serological virus-specific test among certain classes of viruses, and is closely related to the serum level of protection against viral infection.

PRNT之基本設計允許在試管或微量滴定板中發生病毒-抗體相互作用,且接著藉由將混合物平鋪在病毒易感性細胞、較佳哺乳動物來源之細胞上來量測抗體對病毒感染性之效應。用限制後代病毒傳播之半固體培養基覆蓋細胞。引發生產性感染之每一病毒產生局部感染區域(噬斑),其可以多種方式偵測。對噬斑計數且與病毒之起始濃度進行比較以確定總病毒感染性之降低百分比。在PRNT中,通常在與標準量之病毒混合之前,將所測試之血清樣品進行連續稀釋。病毒之濃度保持恒定,使得當添加至易感細胞中且用半固體培養基覆蓋時,可辨別且計數個別噬斑。以此方式,可在任何選擇之病毒活性降低百分比下計算每一血清樣品之PRNT終點效價。The basic design of PRNT allows for virus-antibody interactions to take place in test tubes or microtiter plates, and then to measure the effect of the antibody on virus infectivity by plating the mixture on virus susceptible cells, preferably cells of mammalian origin . Cells were overlaid with semi-solid medium that would limit transmission of progeny virus. Each virus that causes a productive infection produces localized areas of infection (plaques) that can be detected in a variety of ways. Plaques were counted and compared to the starting concentration of virus to determine the percent reduction in total virus infectivity. In PRNT, serial dilutions of the tested serum samples are typically made before mixing with a standard amount of virus. The concentration of virus is kept constant so that when added to susceptible cells and covered with semi-solid medium, individual plaques can be identified and counted. In this manner, the PRNT endpoint titer for each serum sample can be calculated at any selected percent reduction in viral activity.

在意欲評價疫苗免疫原性之功能分析中,用於抗體滴定之血清樣品稀釋系列理想地應在低於「血清保護性」臨限值效價下開始。關於SARS-CoV-2中和抗體,「血清保護性」臨限值效價仍然未知;但在某些實施例中,1:10之血清陽性臨限值可被認為係血清保護臨限值。In functional assays intended to assess vaccine immunogenicity, dilution series of serum samples for antibody titration should ideally begin at titers below the "seroprotective" threshold value. With respect to SARS-CoV-2 neutralizing antibodies, the "seroprotective" threshold titer is still unknown; however, in certain embodiments, a seropositivity threshold of 1:10 may be considered a seroprotective threshold.

PRNT終點效價表示為顯示所需之噬斑計數减少百分比之最後血清稀釋度之倒數。PRNT效價可基於50%或更大之噬斑計數减少來計算(PRNT50)。對於疫苗血清,PRNT50效價比使用較高截止值(例如,PRNT90)之效價更佳,自滴定曲綫之綫性部分提供更準確之結果。PRNT endpoint titers are expressed as the reciprocal of the final serum dilution showing the desired percent reduction in plaque counts. PRNT titers can be calculated based on a 50% or greater reduction in plaque count (PRNT50). For vaccine sera, PRNT50 titers are better than those using higher cutoffs (eg, PRNT90), providing more accurate results from the linear portion of the titration curve.

存在用以計算PRNT效價之若干方式。計算效價之最簡單及最廣泛使用之方式係對噬斑計數,且將效價報告為基於輸入噬斑之反滴定,顯示輸入噬斑計數减少>50%之最後血清稀釋度之倒數。使用來自若干血清稀釋之曲綫擬合方法可允許計算更精確之結果。存在多種電腦分析程式可用於該目的(例如,SPSS或GraphPad Prism)。There are several ways to calculate PRNT titers. The simplest and most widely used way to calculate titers is to count plaques, and to report titers as the reciprocal of the final serum dilution based on a back-titration of input plaques showing a >50% reduction in input plaque counts. Using a curve fitting method from several serum dilutions may allow more accurate results to be calculated. Various computer analysis programs exist for this purpose (eg, SPSS or GraphPad Prism).

在一些實施例中,抗體效價用於評價個體是否已經感染或確定是否需要免疫。在一些實施例中,抗體效價用於確定自體免疫反應之强度,確定是否需要加强免疫,確定先前之疫苗是否有效,以及鑑別任何最近或先前之感染。根據本揭示案,抗體效價可用於確定由組合物(例如,RNA疫苗)在個體中誘導之免疫反應之强度。In some embodiments, antibody titers are used to assess whether an individual is already infected or to determine whether immunization is required. In some embodiments, antibody titers are used to determine the strength of an autoimmune response, to determine the need for booster immunization, to determine whether previous vaccines are effective, and to identify any recent or previous infection. According to the present disclosure, antibody titers can be used to determine the strength of an immune response induced in an individual by a composition (eg, an RNA vaccine).

在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加至少1 log。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價相對於對照可增加至少1.5、至少2、至少2.5或至少3 log。在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加1、1.5、2、2.5或3 log。在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加1-3 log。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價相對於對照可增加1-1.5、1-2、1-2.5、1-3、1.5-2、1.5-2.5、1.5-3、2-2.5、2-3或2.5-3 log。In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by at least 1 log relative to a control. For example, the anti-coronavirus antigen antibody titer produced in an individual can be increased by at least 1.5, at least 2, at least 2.5, or at least 3 logs relative to a control. In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by 1, 1.5, 2, 2.5, or 3 logs relative to a control. In some embodiments, the anti-coronavirus antigen antibody titers produced in the individual are increased by 1-3 logs relative to a control. For example, the anti-coronavirus antigen antibody titer produced in an individual can be increased by 1-1.5, 1-2, 1-2.5, 1-3, 1.5-2, 1.5-2.5, 1.5-3, 2- 2.5, 2-3 or 2.5-3 log.

在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加至少2倍。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價可相對於對照增加至少3倍、至少4倍、至少5倍、至少6倍、至少7倍、至少8倍、至少9倍或至少10倍。在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加2、3、4、5、6、7、8、9或10倍。在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加2-10倍。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價相對於對照可增加2-10、2-9、2-8、2-7、2-6、2-5、2-4、2-3、3-10、3-9、3-8、3-7、3-6、3-5、3-4、4-10、4-9、4-8、4-7、4-6、4-5、5-10、5-9、5-8、5-7、5-6、6-10、6-9、6-8、6-7、7-10、7-9、7-8、8-10、8-9或9-10倍。In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by at least 2-fold relative to a control. For example, the anti-coronavirus antigen antibody titer produced in the individual can be increased by at least 3-fold, at least 4-fold, at least 5-fold, at least 6-fold, at least 7-fold, at least 8-fold, at least 9-fold, or at least 10-fold relative to a control times. In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by 2, 3, 4, 5, 6, 7, 8, 9, or 10-fold relative to a control. In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by 2-10 fold relative to the control. For example, the anti-coronavirus antigen antibody titer produced in an individual can be increased by 2-10, 2-9, 2-8, 2-7, 2-6, 2-5, 2-4, 2- 3, 3-10, 3-9, 3-8, 3-7, 3-6, 3-5, 3-4, 4-10, 4-9, 4-8, 4-7, 4-6, 4-5, 5-10, 5-9, 5-8, 5-7, 5-6, 6-10, 6-9, 6-8, 6-7, 7-10, 7-9, 7- 8, 8-10, 8-9 or 9-10 times.

在一些實施例中,抗原特異性免疫反應量測為冠狀病毒血清中和抗體效價之幾何平均效價(GMT)之比率,稱為幾何平均比率(GMR)。幾何平均效價(GMT)係藉由將所有值相乘且取該數之 n次根來計算之一組個體之平均抗體效價,其中 n係具有可用數據之個體之數目。 In some embodiments, the antigen-specific immune response is measured as the ratio of the geometric mean titers (GMT) of the titers of coronavirus serum neutralizing antibodies, referred to as the geometric mean ratio (GMR). Geometric mean titers (GMT) are calculated by multiplying all values together and taking the n root of the number, where n is the number of individuals with available data.

在一些實施例中,對照係在未投與組合物(例如RNA疫苗)之個體中產生之抗冠狀病毒抗原抗體效價。在一些實施例中,對照係在投與重組或純化之蛋白疫苗之個體中產生之抗冠狀病毒抗原抗體效價。重組蛋白疫苗通常包括在異源表現系統(例如細菌或酵母)中產生或自大量病原生物體中純化之蛋白抗原。In some embodiments, the control is an anti-coronavirus antigen antibody titer produced in individuals not administered the composition (eg, RNA vaccine). In some embodiments, the control is an anti-coronavirus antigen antibody titer produced in an individual administered a recombinant or purified protein vaccine. Recombinant protein vaccines typically include protein antigens produced in heterologous expression systems (eg, bacteria or yeast) or purified from a number of pathogenic organisms.

在一些實施例中,在鼠類模型中量測組合物(例如RNA疫苗)有效之能力。舉例而言,可將組合物投與給鼠類模型,且分析鼠類模型之中和抗體效價之誘導。病毒攻擊研究亦可用於評價本揭示案之疫苗之效能。舉例而言,可將組合物投與給鼠類模型,用病毒攻擊鼠類模型,且分析鼠類模型之存活及/或免疫反應(例如中和抗體反應、T細胞反應(例如細胞介素反應))。In some embodiments, the ability of a composition (eg, an RNA vaccine) to be effective is measured in a murine model. For example, the composition can be administered to a murine model and assayed for the induction of neutralizing antibody titers in the murine model. Viral challenge studies can also be used to evaluate the efficacy of the vaccines of the present disclosure. For example, the composition can be administered to a murine model, challenged with a virus, and the murine model analyzed for survival and/or immune responses (eg, neutralizing antibody responses, T cell responses (eg, interleukin responses) )).

在一些實施例中,組合物(例如,RNA疫苗)之有效量係與重組蛋白疫苗之標準護理劑量相比减少之劑量。如本文所提供之「標準護理」係指醫學或心理治療指南,且可為一般性的或特異性的。「標準護理」基於科學證據及參與給定疾患之治療之醫學專業人士之間之合作來指定適當治療。其係醫師/臨床醫師對於某種類型之患者、疾病或臨床情况應遵循之診斷及治療過程。本文提供之「標準護理劑量」係指醫師/臨床醫師或其他醫學專業人士在遵循用於治療或預防冠狀病毒感染或相關疾患之標準護理指南的同時,將投與給個體以治療或預防冠狀病毒感染或相關疾患之重組或純化蛋白疫苗、或活的减毒或滅活疫苗、或VLP疫苗的劑量。In some embodiments, an effective amount of a composition (eg, an RNA vaccine) is a reduced dose compared to a standard-of-care dose of a recombinant protein vaccine. "Standard of care" as provided herein refers to medical or psychotherapy guidelines, and can be general or specific. "Standard Care" prescribes appropriate treatment based on scientific evidence and collaboration between medical professionals involved in the treatment of a given condition. It is the process of diagnosis and treatment that a physician/clinician should follow for a certain type of patient, disease or clinical condition. "Standard of Care Dosage" provided herein refers to the amount that a physician/clinician or other medical professional would administer to an individual to treat or prevent coronavirus while following standard of care guidelines for the treatment or prevention of coronavirus infection or related conditions Doses of recombinant or purified protein vaccines, or live attenuated or inactivated vaccines, or VLP vaccines for infections or related conditions.

在一些實施例中,在投與有效量之組合物之個體中產生之抗冠狀病毒抗原抗體效價等同於在投與標準護理劑量之重組或純化蛋白疫苗、或活的减毒或滅活疫苗、或VLP疫苗之對照個體中產生的抗冠狀病毒抗原抗體效價。In some embodiments, the anti-coronavirus antigen antibody titers produced in an individual administered an effective amount of the composition are equivalent to those administered a standard-of-care dose of recombinant or purified protein vaccines, or live attenuated or inactivated vaccines , or the anti-coronavirus antigen antibody titers produced in control individuals of VLP vaccine.

疫苗效能可使用標準分析來評價(例如,參見Weinberg等人,J Infect Dis. 2010年6月1日;201(11):1607-10)。舉例而言,疫苗效能可藉由雙盲、隨機化、臨床對照試驗來量測。疫苗效能可表示為未接種疫苗(ARU)及接種疫苗(ARV)之研究同類群組之間疾病罹患率(AR)之成比例降低,且可使用下式自接種疫苗組中疾病之相對風險(RR)計算: 效能 = (ARU - ARV)/ARU × 100;且 效能 = (1-RR) × 100。 Vaccine efficacy can be assessed using standard assays (see, eg, Weinberg et al., J Infect Dis. 2010 Jun 1;201(11):1607-10). For example, vaccine efficacy can be measured by double-blind, randomized, controlled clinical trials. Vaccine efficacy can be expressed as the proportional reduction in disease attack rate (AR) between unvaccinated (ARU) and vaccinated (ARV) study cohorts, and can be derived from the relative risk of disease in the vaccinated group using the following formula ( RR) calculation: Performance = (ARU - ARV)/ARU × 100; and Efficacy = (1-RR) × 100.

同樣,疫苗有效性可使用標準分析來評估(例如,參見Weinberg等人,J Infect Dis. 2010年6月1日;201(11):1607-10)。疫苗有效性係評價疫苗(其可能已經證明具有高疫苗效能)如何减少群體中之疾病。此量度可評價在自然現場條件下而非在受控臨床試驗中,疫苗接種程式而不僅僅疫苗本身之益處及不利效應之淨平衡。疫苗有效性與疫苗效能(功效)成正比,但亦受群體中之靶群體免疫之程度以及影響住院治療、流動就診或花費之「真實世界」結果之其他非疫苗相關因素影響。舉例而言,可使用回溯性病例對照分析,其中比較一組感染病例及適當對照之間之疫苗接種率。疫苗有效性可表示為比率差異,使用勝算比(OR)用於儘管接種但亦發生感染: 有效性 = (1 - OR) × 100。 Likewise, vaccine efficacy can be assessed using standard assays (eg, see Weinberg et al., J Infect Dis. 2010 Jun 1;201(11):1607-10). Vaccine efficacy is the evaluation of how well a vaccine, which may have demonstrated high vaccine efficacy, reduces disease in a population. This measure evaluates the net balance of benefits and adverse effects of a vaccination regimen, not just the vaccine itself, under natural field conditions rather than in controlled clinical trials. Vaccine effectiveness is proportional to vaccine potency (efficacy), but is also influenced by the extent of target herd immunity in the population and other non-vaccine-related factors that affect the "real-world" outcome of hospitalizations, ambulatory visits, or costs. For example, a retrospective case-control analysis can be used in which vaccination rates are compared between a group of infected cases and appropriate controls. Vaccine effectiveness can be expressed as a ratio difference, using the odds ratio (OR) for infection despite vaccination: Validity = (1 - OR) × 100.

在一些實施例中,相對於未接種疫苗之對照個體,組合物(例如,RNA疫苗)之效能為至少60%。舉例而言,相對於未接種疫苗之對照個體,組合物之效能可為至少65%、至少70%、至少75%、至少80%、至少85%、至少95%、至少98%或100%。In some embodiments, the efficacy of the composition (eg, RNA vaccine) is at least 60% relative to unvaccinated control individuals. For example, the efficacy of the composition can be at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 95%, at least 98%, or 100% relative to unvaccinated control individuals.

消除性免疫性 .消除性免疫性係指防止病原體有效感染至宿主中之獨特免疫狀態。在一些實施例中,有效量之本揭示案之組合物足以在個體中提供消除性免疫性至少1年。舉例而言,有效量之本揭示案之組合物足以在個體中提供消除性免疫性至少2年、至少3年、至少4年或至少5年。在一些實施例中,有效量之本揭示案之組合物足以在個體中以相對於對照低至少5倍之劑量提供消除性免疫性。舉例而言,有效量可足以在個體中以相對於對照低至少10倍、15倍或20倍之劑量提供消除性免疫性。 Eliminating immunity . Eliminating immunity refers to a unique immune state that prevents efficient infection of a pathogen into a host. In some embodiments, an effective amount of a composition of the present disclosure is sufficient to provide eliminating immunity in an individual for at least 1 year. For example, an effective amount of a composition of the present disclosure is sufficient to provide eliminating immunity in an individual for at least 2 years, at least 3 years, at least 4 years, or at least 5 years. In some embodiments, an effective amount of a composition of the present disclosure is sufficient to provide depleting immunity in an individual at a dose that is at least 5-fold lower relative to a control. For example, an effective amount can be sufficient to provide depleting immunity in an individual at a dose that is at least 10-fold, 15-fold or 20-fold lower relative to a control.

可偵測抗原 .在一些實施例中,有效量之本揭示案之組合物足以產生可偵測水準之冠狀病毒抗原,如在投與後1-72小時在個體之血清中所量測。 Detectable antigens . In some embodiments, an effective amount of a composition of the present disclosure is sufficient to produce detectable levels of coronavirus antigens, as measured in the serum of an individual 1-72 hours after administration.

效價 .抗體效價係個體內抗體(例如,對特定抗原(例如,抗冠狀病毒抗原)特異性之抗體)之數量之量度。抗體效價通常表示為提供陽性結果之最大稀釋度之倒數。例如,酶聯免疫吸附分析(ELISA)係用於測定抗體效價之常見分析。 Titer . Antibody titer is a measure of the number of antibodies (eg, antibodies specific for a particular antigen (eg, an anti-coronavirus antigen)) in an individual. Antibody titers are usually expressed as the reciprocal of the largest dilution that gave a positive result. For example, enzyme-linked immunosorbent assay (ELISA) is a common assay used to determine antibody titers.

在一些實施例中,有效量之本揭示案之組合物足以產生由針對冠狀病毒抗原之中和抗體產生之1,000-10,000中和抗體效價,如在投與後1-72小時在個體之血清中所量測。在一些實施例中,有效量之本揭示案之組合物足以產生由針對冠狀病毒抗原之中和抗體產生之1,000-5,000中和抗體效價,如在投與後1-72小時在個體之血清中所量測。在一些實施例中,有效量足以產生由針對冠狀病毒抗原之中和抗體產生之5,000-10,000中和抗體效價,如在投與後1-72小時在個體之血清中所量測。In some embodiments, an effective amount of a composition of the present disclosure is sufficient to produce 1,000-10,000 neutralizing antibody titers produced by neutralizing antibodies directed against a coronavirus antigen, such as in the serum of an individual 1-72 hours after administration Measured in. In some embodiments, an effective amount of a composition of the present disclosure is sufficient to generate 1,000-5,000 neutralizing antibody titers produced by neutralizing antibodies directed against a coronavirus antigen, such as in the serum of an individual 1-72 hours after administration Measured in. In some embodiments, the effective amount is sufficient to generate a titer of 5,000-10,000 neutralizing antibodies produced by neutralizing antibodies directed against a coronavirus antigen, as measured in the serum of the individual 1-72 hours after administration.

在一些實施例中,中和抗體效價係至少100 NT 50。舉例而言,中和抗體效價可為至少200、300、400、500、600、700、800、900或1000 NT 50。在一些實施例中,中和抗體效價係至少10,000 NT 50In some embodiments, the neutralizing antibody titer is at least 100 NT 50 . For example, the neutralizing antibody titer can be at least 200, 300, 400, 500 , 600, 700, 800, 900, or 1000 NT50. In some embodiments, the neutralizing antibody titer is at least 10,000 NT50 .

在一些實施例中,中和抗體效價係至少100個中和單位/毫升(NU/mL)。舉例而言,中和抗體效價可為至少200、300、400、500、600、700、800、900或1000 NU/mL。在一些實施例中,中和抗體效價係至少10,000 NU/mL。In some embodiments, the neutralizing antibody titer is at least 100 neutralizing units per milliliter (NU/mL). For example, the neutralizing antibody titer can be at least 200, 300, 400, 500, 600, 700, 800, 900, or 1000 NU/mL. In some embodiments, the neutralizing antibody titer is at least 10,000 NU/mL.

在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加至少1 log。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價相對於對照可增加至少2、3、4、5、6、7、8、9或10 log。In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by at least 1 log relative to a control. For example, the anti-coronavirus antigen antibody titer produced in an individual can be increased by at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 logs relative to a control.

在一些實施例中,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加至少2倍。舉例而言,個體中產生之抗冠狀病毒抗原抗體效價相對於對照增加至少3、4、5、6、7、8、9或10倍。In some embodiments, the anti-coronavirus antigen antibody titer produced in the individual is increased by at least 2-fold relative to a control. For example, the anti-coronavirus antigen antibody titer produced in the individual is increased by at least 3, 4, 5, 6, 7, 8, 9 or 10 fold relative to the control.

在一些實施例中,幾何平均值,即n個數之乘積之n次根,通常用於闡述成比例生長。在一些實施例中,幾何平均值用於描述個體中產生之抗體效價。In some embodiments, the geometric mean, the nth root of the product of n numbers, is often used to describe proportional growth. In some embodiments, the geometric mean is used to describe the antibody titer produced in an individual.

對照可為例如未接種疫苗之個體,或投與活的减毒病毒疫苗、滅活病毒疫苗或蛋白質次單元疫苗之個體。 額外實施例 Controls can be, for example, unvaccinated individuals, or individuals administered a live attenuated virus vaccine, an inactivated virus vaccine, or a protein subunit vaccine. Additional Embodiments

以下編號之段落涵蓋本揭示案之額外實施例: 1. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域。 2. 如段落1之mRNA,其中該蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。 3. 如段落2之mRNA,其中該融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。 4. 如段落3之mRNA,其中該融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。 5. 如段落4之mRNA,其中該融合蛋白包含SEQ ID NO: 77之胺基酸序列。 6. 如前述段落中任一者之mRNA,其中該開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。 7. 如段落6之mRNA,其中該開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。 8. 如段落7之mRNA,其中該開放閱讀框包含SEQ ID NO: 76之核苷酸序列。 9. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域及跨膜結構域。 10. 如段落9之mRNA,其中該跨膜結構域係流行性感冒血球凝集素跨膜結構域。 11. 如段落10之mRNA,其中該融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。 12. 如段落11之mRNA,其中該融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。 13. 如段落12之mRNA,其中該融合蛋白包含SEQ ID NO: 47之胺基酸序列。 14. 如前述段落中任一者之mRNA,其中該開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。 15. 如段落14之mRNA,其中該開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。 16. 如段落15之mRNA,其中該開放閱讀框包含SEQ ID NO: 46之核苷酸序列。 17. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含與SARS-CoV-2刺突蛋白的受體結合結構域連接之SARS-CoV-2刺突蛋白之胺基(N)末端結構域。 18. 如段落17之mRNA,其中該融合蛋白進一步包含跨膜結構域。 19. 如段落18之mRNA,其中該融合蛋白包含與SEQ ID NO: 92之胺基酸序列具有至少80%一致性的胺基酸序列。 20. 如段落18之mRNA,其中該融合蛋白包含與SEQ ID NO: 92之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。 21. 如段落20之mRNA,其中該融合蛋白包含SEQ ID NO: 92之胺基酸序列。 22. 如前述段落中任一者之mRNA,其中該開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少70%一致性的核苷酸序列。 23. 如段落22之mRNA,其中該開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。 24. 如段落23之mRNA,其中該開放閱讀框包含SEQ ID NO: 91之核苷酸序列。 25. 如前述段落中任一者之mRNA,進一步包含5'非轉譯區(UTR),該5'非轉譯區視情況地包含SEQ ID NO: 131或2之核苷酸序列。 26. 如前述段落中任一者之mRNA,進一步包含3'非轉譯區(UTR),該3'非轉譯區視情況地包含SEQ ID NO: 132或4之核苷酸序列。 27. 如前述段落中任一者之mRNA,進一步包含5'端帽,視情況地7mG(5')ppp(5')NlmpNp。 28.如前述段落中任一者之mRNA,進一步包含聚A尾,該聚A尾視情況地具有約100個核苷酸之長度。 29. 如前述段落中任一者之mRNA,其中該mRNA包含化學修飾,視情況地1-甲基假尿苷。 30. 一種組合物,其包含如段落1至29中任一者之mRNA。 31. 一種組合物,其包含如段落1至8中任一者之mRNA及如段落9至16中任一者之mRNA。 32. 一種組合物,其包含如段落17至29中任一者之mRNA。 33. 一種組合物,其包含: (a)    包含編碼融合蛋白之開放閱讀框的信使核糖核酸(mRNA),該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域;及 (b)    包含編碼融合蛋白之開放閱讀框之mRNA,該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域及跨膜結構域。 34. 如段落33之組合物,其中該蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。 35. 如段落34之組合物,其中(a)之融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。 36. 如段落35之組合物,其中(a)之融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。 37. 如段落36之組合物,其中(a)之融合蛋白包含SEQ ID NO: 77之胺基酸序列。 38. 如段落34至37中任一者之組合物,其中(a)之開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。 39. 如段落38之組合物,其中(a)之開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。 40. 如段落39之組合物,其中(a)之開放閱讀框包含SEQ ID NO: 76之核苷酸序列。 41. 如段落34至40中任一者之組合物,其中(b)之融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。 42. 如段落41之組合物,其中(b)之融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。 43. 如段落42之組合物,其中(b)之融合蛋白包含SEQ ID NO: 47之胺基酸序列。 44. 如段落34至43中任一者之組合物,其中(b)之開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。 45. 如段落44之組合物,其中(b)之開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。 46. 如段落45之組合物,其中(b)之開放閱讀框包含SEQ ID NO: 46之核苷酸序列。 47. 如段落33至46中任一者之組合物,其中(a)之mRNA對(b)之mRNA之比率為約1:1。 48. 如段落1至29中任一者之組合物,其調配於脂質奈米顆粒中。 49. 如段落30至47中任一者之組合物,其進一步包含脂質奈米顆粒。 50. 如段落49之組合物,其中該mRNA調配於脂質奈米顆粒中。 51. 如段落33至47中任一者之組合物,其中(a)之mRNA調配於脂質奈米顆粒中,且(b)之mRNA調配於脂質奈米顆粒中。 52. 如段落51之組合物,其中(a)及(b)之mRNA於相同脂質奈米顆粒中,或其中(a)及(b)之mRNA相對於彼此各自調配於單獨奈米顆粒中。 53. 如段落48之mRNA或如段落49至52中任一者之組合物,其中該脂質奈米顆粒包含陽離子脂質。 54. 如段落53之mRNA或組合物,其中該脂質奈米顆粒進一步包含中性脂質。 55. 如段落53或54之mRNA或組合物,其中該脂質奈米顆粒進一步包含固醇。 56. 如段落53至55中任一者之mRNA或組合物,其中該脂質奈米顆粒進一步包含聚乙二醇(PEG)改質之脂質。 57. 如段落53至56中任一者之mRNA或組合物,其中該脂質奈米顆粒包含可電離之陽離子脂質、中性脂質、固醇及PEG改質之脂質。 58. 如段落57之mRNA或組合物,其中該可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1)。 59. 如段落57或58之mRNA或組合物,其中該中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC)。 60. 如段落57至59中任一者之mRNA或組合物,其中該固醇係膽固醇。 61. 如段落57至60中任一者之mRNA或組合物,其中該PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。 62. 如段落57至61中任一者之mRNA或組合物,其中該脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。 63. 如段落62之mRNA或組合物,其中該脂質奈米顆粒包含: 47 mol%可電離之脂質;11.5 mol%中性脂質;38.5 mol%固醇;及3.0 mol% PEG改質之脂質; 48 mol%可電離之脂質;11 mol%中性脂質;38.5 mol%固醇;及2.5 mol% PEG改質之脂質; 49 mol%可電離之脂質;10.5 mol%中性脂質;38.5 mol%固醇;及2.0 mol% PEG改質之脂質; 50 mol%可電離之脂質;10 mol%中性脂質;38.5 mol%固醇;及1.5 mol% PEG改質之脂質;或 51 mol%可電離之脂質;9.5 mol%中性脂質;38.5 mol%固醇;及1.0 mol% PEG改質之脂質。 64. 如段落63之mRNA或組合物,其中該脂質奈米顆粒包含: 47 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及3.0% PEG2000DMG; 48 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及2.5% PEG2000DMG; 49 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及2.0% PEG2000DMG; 50 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及1.5% PEG2000DMG;或 51 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及1.0% PEG2000DMG。 65. 一種方法,該方法包括向個體投與有效地在該個體中誘導針對SARS-CoV-2之中和抗體反應之量的如前述段落中任一者之mRNA或組合物。 66. 一種方法,該方法包括向個體投與有效地在該個體中誘導針對SARS-CoV-2之T細胞免疫反應之量的如前述段落中任一者之mRNA或組合物。 67. 一種信使核糖核酸(mRNA),其包含編碼冠狀病毒抗原之開放閱讀框(ORF),該冠狀病毒抗原能夠誘導針對SARS-CoV-2之免疫反應(諸如中和抗體反應),其中該抗原包含SARS-CoV-2之蛋白質片段或功能性蛋白質結構域,視情況地其中RNA調配於脂質奈米顆粒中。 68. 如段落67之mRNA,其中該抗原為功能性蛋白質結構域。 69. 如段落68之mRNA,其中該蛋白質結構域係SARS-CoV-2刺突蛋白之N末端結構域(NTD)。 70. 如段落69之mRNA,其中該NTD與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。 71. 如段落70之mRNA,其中該抗原包含與SEQ ID NO: 47之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 47之胺基酸序列。 72. 如段落70或71之mRNA,其中該開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 46之核苷酸序列。 73. 如段落68之mRNA,其中該蛋白質結構域係SARS-CoV-2刺突蛋白之受體結合結構域(RBD)。 74. 如段落73之mRNA,其中該RBD係可溶的。 75. 如段落74之mRNA,其中該抗原包含與SEQ ID NO: 62之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 62之胺基酸序列。 76. 如段落74或75之mRNA,其中該開放閱讀框包含與SEQ ID NO: 61之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 61之核苷酸序列。 77. 如段落73之mRNA,其中該RBD與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。 78. 如段落77之mRNA,其中該抗原包含與SEQ ID NO: 77之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 77之胺基酸序列。 79. 如段落77或78之mRNA,其中該開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 76之核苷酸序列。 80. 如段落69之mRNA,其中該NTD與SARS-CoV-2刺突蛋白之RBD連接以形成NTD-RBD融合蛋白。 81. 如段落80之mRNA,其中該NTD-RBD融合體與跨膜結構域(TM)連接,視情況與流行性感冒血球凝集素跨膜結構域連接,以形成NTD-RBD-TM蛋白。 82. 如段落81之mRNA,其中該抗原包含與SEQ ID NO: 92之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 92之胺基酸序列。 83. 如段落81或82之mRNA,其中該開放閱讀框包含與SEQ ID NO: 91之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 91之核苷酸序列。 84. 如段落80之mRNA,其中該NTD-RBD融合體包含C末端截短。 85. 如段落84之mRNA,其中該抗原包含與SEQ ID NO: 107之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 107之胺基酸序列。 86. 如段落84或85之mRNA,其中該開放閱讀框包含與SEQ ID NO: 106之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 106之核苷酸序列。 87. 如前述段落中任一者之mRNA,其中該NTD及/或RBD包括延伸區。 88. 如段落87之mRNA,其中該抗原包含與SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列。 89. 如段落87或88之mRNA,其中該開放閱讀框包含與SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列。 90. 如段落68之mRNA,其中該蛋白質結構域係SARS-CoV-2刺突蛋白之S1次單元結構域。 91. 如段落90之mRNA,其中該S1次單元係可溶的。 92. 如段落91之mRNA,其中該抗原包含與SEQ ID NO: 5之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 5之胺基酸序列。 93. 如段落91或92之mRNA,其中該開放閱讀框包含與SEQ ID NO: 3之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 3之核苷酸序列。 94. 如段落90之mRNA,其中該S1次單元與跨膜結構域、視情况地流行性感冒血球凝集素跨膜結構域連接。 95. 如段落94之mRNA,其中該抗原包含與SEQ ID NO: 17之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 17之胺基酸序列。 96. 如段落94或95之mRNA,其中該開放閱讀框包含與SEQ ID NO: 16之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 16之核苷酸序列。 97. 如段落90之mRNA,其中該S1次單元經修飾以去除S蛋白之RBD或RBD之一部分。 98. 如段落97之mRNA,其中該抗原包含與SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列。 99. 如段落97或98之mRNA,其中該開放閱讀框包含與SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列。 100. 如段落90之mRNA,其中該S1次單元與S蛋白之S2次單元連接。 101. 如段落100之mRNA,其中該S2次單元來自SARS-CoV-2 S蛋白,且在一些實施例中其中該S2次單元包含開放閱讀框,該開放閱讀框包含與SEQ ID NO:145之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中該開放閱讀框包含SEQ ID NO: 145之核苷酸序列。 102. 如段落101之mRNA,其中該S1次單元來自HKU1 S蛋白。 103. 如段落102之mRNA,其中該抗原包含與SEQ ID NO: 38之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 38之胺基酸序列。 104. 如段落102或103之mRNA,其中該開放閱讀框包含與SEQ ID NO: 37之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 37之核苷酸序列。 105. 如段落101之mRNA,其中該S1次單元來自OC43 S蛋白。 106. 如段落105之mRNA,其中該抗原包含與SEQ ID NO: 41之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 41之胺基酸序列。 107. 如段落105或106之mRNA,其中該開放閱讀框包含與SEQ ID NO: 40之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 40之核苷酸序列。 108. 如前述段落中任一者之mRNA,其中該抗原進一步包含支架結構域,視情況地選自鐵蛋白、二氧四氫蝶啶(lumazine)合成酶及摺疊子。 109. 如段落108之mRNA,其中該支架結構域係鐵蛋白。 110. 如段落109之mRNA,其中該抗原包含與SEQ ID NO: 8或65之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 8或65之胺基酸序列。 111. 如段落109或110之mRNA,其中該開放閱讀框包含與SEQ ID NO: 7或64之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 7或64之核苷酸序列。 112. 如段落108之mRNA,其中該支架結構域係二氧四氫蝶啶(lumazine)合成酶。 113. 如段落112之mRNA,其中該抗原包含與SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列。 114. 如段落112或113之mRNA,其中該開放閱讀框包含與SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列。 115. 如段落108之mRNA,其中該支架結構域係摺疊子。 116. 如段落115之mRNA,其中該抗原包含與SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列。 117. 如段落115或116之mRNA,其中該開放閱讀框包含與SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列。 118. 如前述段落中任一者之mRNA,其中該抗原進一步包含運輸信號,視情況地選自巨噬細胞標記物,視情況地CD86、CD11B及/或VSVGct。 119. 如段落118之mRNA,其中該抗原包含與SEQ ID NO: 95、98或110中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中抗原包含SEQ ID NO: 95、98或110中之任一者之胺基酸序列。 120. 如段落118或119之mRNA,其中該開放閱讀框包含與SEQ ID NO: 94、97或109中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中開放閱讀框包含SEQ ID NO: 94、97或109中之任一者之核苷酸序列。 121. 如段落67至120中任一者之mRNA,其調配於脂質奈米顆粒中。 122. 如段落121之mRNA,其中該脂質奈米顆粒包含陽離子脂質(視情況地可電離之陽離子脂質)、中性脂質、固醇及/或聚乙二醇(PEG)改質之脂質。 123. 如段落108之mRNA或組合物,其中該可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1),中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC),固醇係膽固醇,及/或PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。 124. 如段落121至123中任一者之mRNA或組合物,其中該脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。 125. 如段落124之mRNA或組合物,其中該脂質奈米顆粒包含: 47 mol%可電離之陽離子脂質;11.5 mol%中性脂質;38.5 mol%固醇;及3.0 mol% PEG改質之脂質; 48 mol%可電離之陽離子脂質;11 mol%中性脂質;38.5 mol%固醇;及2.5 mol% PEG改質之脂質; 49 mol%可電離之陽離子脂質;10.5 mol%中性脂質;38.5 mol%固醇;及2.0 mol% PEG改質之脂質; 50 mol%可電離之陽離子脂質;10 mol%中性脂質;38.5 mol%固醇;及1.5 mol% PEG改質之脂質;或 51 mol%可電離之陽離子脂質;9.5 mol%中性脂質;38.5 mol%固醇;及1.0 mol% PEG改質之脂質。 126. 如段落125之mRNA或組合物,其中該脂質奈米顆粒包含: 47 mol%化合物1;11.5 mol% DSPC;38.5 mol%膽固醇;及3.0 mol% PEG2000 DMG; 48 mol%化合物1;11 mol% DSPC;38.5 mol%膽固醇;及2.5 mol% PEG2000 DMG; 49 mol%化合物1;10.5 mol% DSPC;38.5 mol%膽固醇;及2.0 mol% PEG2000 DMG; 50 mol%化合物1;10 mol% DSPC;38.5 mol%膽固醇;及1.5 mol% PEG2000 DMG;或 51 mol%化合物1;9.5 mol% DSPC;38.5 mol%膽固醇;及1.0 mol% PEG2000 DMG。 127. 一種方法,該方法包括向個體投與有效地在該個體中誘導針對SARS-CoV-2之中和抗體反應之量的如段落67至126中任一者之mRNA。 128. 一種方法,該方法包括向個體投與有效地在該個體中誘導針對SARS-CoV-2之T細胞免疫反應之量的如段落67至126中任一者之mRNA或組合物。 實例 實例 1. 表現數據 The following numbered paragraphs cover additional embodiments of the present disclosure: 1. A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the receptor binding structure of the SARS-CoV-2 spike protein domain (RBD) and protein transmembrane domain. 2. The mRNA of paragraph 1, wherein the protein transmembrane domain is an influenza hemagglutinin transmembrane domain. 3. The mRNA of paragraph 2, wherein the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77. 4. The mRNA of paragraph 3, wherein the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least the amino acid sequence of SEQ ID NO: 77 99% identical amino acid sequence. 5. The mRNA of paragraph 4, wherein the fusion protein comprises the amino acid sequence of SEQ ID NO:77. 6. The mRNA of any of the preceding paragraphs, wherein the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 76. 7. The mRNA of paragraph 6, wherein the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, with the nucleotide sequence of SEQ ID NO: 76. Nucleotide sequences that are at least 98% or at least 99% identical. 8. The mRNA of paragraph 7, wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:76. 9. A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising an amino (N) terminal domain and a transmembrane domain of a SARS-CoV-2 spike protein. 10. The mRNA of paragraph 9, wherein the transmembrane domain is an influenza hemagglutinin transmembrane domain. 11. The mRNA of paragraph 10, wherein the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47. 12. The mRNA of paragraph 11, wherein the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least the amino acid sequence of SEQ ID NO: 47 99% identical amino acid sequence. 13. The mRNA of paragraph 12, wherein the fusion protein comprises the amino acid sequence of SEQ ID NO:47. 14. The mRNA of any of the preceding paragraphs, wherein the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46. 15. The mRNA of paragraph 14, wherein the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, with the nucleotide sequence of SEQ ID NO: 46. Nucleotide sequences that are at least 98% or at least 99% identical. 16. The mRNA of paragraph 15, wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:46. 17. A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising a SARS-CoV-2 spike protein linked to the receptor binding domain of the SARS-CoV-2 spike protein. Amine (N) terminal domain. 18. The mRNA of paragraph 17, wherein the fusion protein further comprises a transmembrane domain. 19. The mRNA of paragraph 18, wherein the fusion protein comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 92. 20. The mRNA of paragraph 18, wherein the fusion protein comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least the amino acid sequence of SEQ ID NO: 92 99% identical amino acid sequence. 21. The mRNA of paragraph 20, wherein the fusion protein comprises the amino acid sequence of SEQ ID NO:92. 22. The mRNA of any of the preceding paragraphs, wherein the open reading frame comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 91. 23. The mRNA of paragraph 22, wherein the open reading frame comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, with the nucleotide sequence of SEQ ID NO: 91. Nucleotide sequences that are at least 98% or at least 99% identical. 24. The mRNA of paragraph 23, wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:91. 25. The mRNA of any of the preceding paragraphs, further comprising a 5' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 131 or 2 as appropriate. 26. The mRNA of any of the preceding paragraphs, further comprising a 3' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 132 or 4 as appropriate. 27. The mRNA of any of the preceding paragraphs, further comprising a 5' end cap, optionally 7mG(5')ppp(5')NlmpNp. 28. The mRNA of any of the preceding paragraphs, further comprising a poly-A tail, the poly-A tail optionally having a length of about 100 nucleotides. 29. The mRNA of any preceding paragraph, wherein the mRNA comprises a chemical modification, optionally 1-methylpseudouridine. 30. A composition comprising the mRNA of any one of paragraphs 1-29. 31. A composition comprising the mRNA of any of paragraphs 1-8 and the mRNA of any of paragraphs 9-16. 32. A composition comprising the mRNA of any one of paragraphs 17-29. 33. A composition comprising: (a) a messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the receptor binding domain (RBD) of the SARS-CoV-2 spike protein and a protein transmembrane domain; and (b) an mRNA comprising an open reading frame encoding a fusion protein comprising the amino (N) terminal domain and the transmembrane domain of the SARS-CoV-2 spike protein. 34. The composition of paragraph 33, wherein the protein transmembrane domain is an influenza hemagglutinin transmembrane domain. 35. The composition of paragraph 34, wherein the fusion protein of (a) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77. 36. The composition of paragraph 35, wherein the fusion protein of (a) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97% with the amino acid sequence of SEQ ID NO: 77 Amino acid sequences that are 98% or at least 99% identical. 37. The composition of paragraph 36, wherein the fusion protein of (a) comprises the amino acid sequence of SEQ ID NO:77. 38. The composition of any of paragraphs 34 to 37, wherein the open reading frame of (a) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 76. 39. The composition of paragraph 38, wherein the open reading frame of (a) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, with the nucleotide sequence of SEQ ID NO: 76. Nucleotide sequences that are at least 97%, at least 98%, or at least 99% identical. 40. The composition of paragraph 39, wherein the open reading frame of (a) comprises the nucleotide sequence of SEQ ID NO:76. 41. The composition of any one of paragraphs 34 to 40, wherein the fusion protein of (b) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47. 42. The composition of paragraph 41, wherein the fusion protein of (b) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least Amino acid sequences that are 98% or at least 99% identical. 43. The composition of paragraph 42, wherein the fusion protein of (b) comprises the amino acid sequence of SEQ ID NO:47. 44. The composition of any one of paragraphs 34 to 43, wherein the open reading frame of (b) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46. 45. The composition of paragraph 44, wherein the open reading frame of (b) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, with the nucleotide sequence of SEQ ID NO:46. Nucleotide sequences that are at least 97%, at least 98%, or at least 99% identical. 46. The composition of paragraph 45, wherein the open reading frame of (b) comprises the nucleotide sequence of SEQ ID NO:46. 47. The composition of any of paragraphs 33-46, wherein the ratio of mRNA of (a) to mRNA of (b) is about 1:1. 48. The composition of any of paragraphs 1-29, formulated in lipid nanoparticles. 49. The composition of any of paragraphs 30-47, further comprising lipid nanoparticles. 50. The composition of paragraph 49, wherein the mRNA is formulated in lipid nanoparticles. 51. The composition of any of paragraphs 33-47, wherein the mRNA of (a) is formulated in lipid nanoparticles, and the mRNA of (b) is formulated in lipid nanoparticles. 52. The composition of paragraph 51, wherein the mRNAs of (a) and (b) are in the same lipid nanoparticle, or wherein the mRNAs of (a) and (b) are each formulated in separate nanoparticles relative to each other. 53. The mRNA of paragraph 48 or the composition of any of paragraphs 49-52, wherein the lipid nanoparticle comprises a cationic lipid. 54. The mRNA or composition of paragraph 53, wherein the lipid nanoparticle further comprises neutral lipids. 55. The mRNA or composition of paragraph 53 or 54, wherein the lipid nanoparticle further comprises a sterol. 56. The mRNA or composition of any of paragraphs 53-55, wherein the lipid nanoparticle further comprises polyethylene glycol (PEG) modified lipids. 57. The mRNA or composition of any of paragraphs 53-56, wherein the lipid nanoparticle comprises ionizable cationic lipids, neutral lipids, sterols, and PEG-modified lipids. 58. The mRNA or composition of paragraph 57, wherein the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxygen-6-(undecyloxy)hexyl)amino ) heptadecan-9-yl octanoate (compound 1). 59. The mRNA or composition of paragraph 57 or 58, wherein the neutral lipid is 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC). 60. The mRNA or composition of any of paragraphs 57-59, wherein the sterol is cholesterol. 61. The mRNA or composition of any of paragraphs 57-60, wherein the PEG-modified lipid is 1,2 dimyristyl-sn-glycerol-methoxypolyethylene glycol (PEG2000 DMG). 62. The mRNA or composition of any one of paragraphs 57 to 61, wherein the lipid nanoparticle comprises 20-60 mol% ionizable cationic lipids, 5-25 mol% neutral lipids, 25-55 mol% solid lipids Lipids modified with alcohol and 0.5-15 mol% PEG. 63. The mRNA or composition of paragraph 62, wherein the lipid nanoparticle comprises: 47 mol% ionizable lipids; 11.5 mol% neutral lipids; 38.5 mol% sterols; and 3.0 mol% PEG-modified lipids; 48 mol% ionizable lipids; 11 mol% neutral lipids; 38.5 mol% sterols; and 2.5 mol% PEG-modified lipids; 49 mol% ionizable lipids; 10.5 mol% neutral lipids; 38.5 mol% solids alcohol; and 2.0 mol% PEG-modified lipid; 50 mol% ionizable lipid; 10 mol% neutral lipid; 38.5 mol% sterol; and 1.5 mol% PEG-modified lipid; or 51 mol% ionizable lipid Lipids; 9.5 mol% neutral lipids; 38.5 mol% sterols; and 1.0 mol% PEG-modified lipids. 64. The mRNA or composition of paragraph 63, wherein the lipid nanoparticle comprises: 47 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 3.0% PEG2000DMG; 48 mol% Compound 1; 11.5 mol% DSPC 38.5 mol% cholesterol; and 2.5% PEG2000DMG; 49 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 2.0% PEG2000DMG; 50 mol% Compound 1; 11.5 mol% DSPC; % PEG2000DMG; or 51 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 1.0% PEG2000DMG. 65. A method comprising administering to an individual the mRNA or composition of any of the preceding paragraphs in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual. 66. A method comprising administering to an individual the mRNA or composition of any of the preceding paragraphs in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual. 67. A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding a coronavirus antigen capable of inducing an immune response against SARS-CoV-2 (such as a neutralizing antibody response), wherein the antigen A protein fragment or functional protein domain comprising SARS-CoV-2, optionally in which RNA is formulated in lipid nanoparticles. 68. The mRNA of paragraph 67, wherein the antigen is a functional protein domain. 69. The mRNA of paragraph 68, wherein the protein domain is the N-terminal domain (NTD) of the SARS-CoV-2 spike protein. 70. The mRNA of paragraph 69, wherein the NTD is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain. 71. The mRNA of paragraph 70, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 47 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 47. 72. The mRNA of paragraph 70 or 71, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 46 %, at least 97%, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 46. 73. The mRNA of paragraph 68, wherein the protein domain is the receptor binding domain (RBD) of the SARS-CoV-2 spike protein. 74. The mRNA of paragraph 73, wherein the RBD is soluble. 75. The mRNA of paragraph 74, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 62 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 62. 76. The mRNA of paragraph 74 or 75, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 61 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 61. 77. The mRNA of paragraph 73, wherein the RBD is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain. 78. The mRNA of paragraph 77, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 77 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:77. 79. The mRNA of paragraph 77 or 78, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 76 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 76. 80. The mRNA of paragraph 69, wherein the NTD is linked to the RBD of the SARS-CoV-2 spike protein to form an NTD-RBD fusion protein. 81. The mRNA of paragraph 80, wherein the NTD-RBD fusion is linked to a transmembrane domain (TM), optionally linked to an influenza hemagglutinin transmembrane domain, to form an NTD-RBD-TM protein. 82. The mRNA of paragraph 81, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 92 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 92. 83. The mRNA of paragraph 81 or 82, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 91 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 91. 84. The mRNA of paragraph 80, wherein the NTD-RBD fusion comprises a C-terminal truncation. 85. The mRNA of paragraph 84, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 107 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 107. 86. The mRNA of paragraph 84 or 85, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 106 %, at least 97%, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 106. 87. The mRNA of any of the preceding paragraphs, wherein the NTD and/or RBD comprises an extension. 88. The mRNA of paragraph 87, wherein the antigen comprises at least 80%, at least 85%, at least 90% with the amino acid sequence of any one of SEQ ID NOs: 59, 86, 89, 116, 119 or 122 , at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical amino acid sequences, optionally wherein the antigen comprises in SEQ ID NO: 59, 86, 89, 116, 119 or 122 The amino acid sequence of any one. 89. The mRNA of paragraph 87 or 88, wherein the open reading frame comprises at least 70%, at least 75% with the nucleotide sequence of any one of SEQ ID NOs: 58, 85, 88, 115, 118 or 121 , at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises SEQ ID NO: 58, The nucleotide sequence of any of 85, 88, 115, 118, or 121. 90. The mRNA of paragraph 68, wherein the protein domain is the S1 subunit domain of the SARS-CoV-2 spike protein. 91. The mRNA of paragraph 90, wherein the S1 subunit is soluble. 92. The mRNA of paragraph 91, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 5 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:5. 93. The mRNA of paragraph 91 or 92, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 3 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 3. 94. The mRNA of paragraph 90, wherein the S1 subunit is linked to a transmembrane domain, optionally an influenza hemagglutinin transmembrane domain. 95. The mRNA of paragraph 94, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 17 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 17. 96. The mRNA of paragraph 94 or 95, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 16 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 16. 97. The mRNA of paragraph 90, wherein the S1 subunit is modified to remove the RBD or a portion of the RBD of the S protein. 98. The mRNA of paragraph 97, wherein the antigen comprises at least 80%, at least 85%, at least 90% with the amino acid sequence of any one of SEQ ID NOs: 20, 23, 26, 29, 32 or 35 , at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical amino acid sequences, optionally wherein the antigen comprises in SEQ ID NO: 20, 23, 26, 29, 32 or 35 The amino acid sequence of any one. 99. The mRNA of paragraph 97 or 98, wherein the open reading frame comprises at least 70%, at least 75% with the nucleotide sequence of any one of SEQ ID NOs: 19, 22, 25, 28, 31 or 34 , at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises SEQ ID NO: 19, The nucleotide sequence of any of 22, 25, 28, 31 or 34. 100. The mRNA of paragraph 90, wherein the S1 subunit is linked to the S2 subunit of the S protein. 101. The mRNA of paragraph 100, wherein the S2 subunit is derived from the SARS-CoV-2 S protein, and in some embodiments wherein the S2 subunit comprises an open reading frame comprising the SEQ ID NO: 145 Nucleotide sequences having at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity, as appropriate wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:145. 102. The mRNA of paragraph 101, wherein the S1 subunit is from the HKU1 S protein. 103. The mRNA of paragraph 102, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 38 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:38. 104. The mRNA of paragraph 102 or 103, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 37 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 37. 105. The mRNA of paragraph 101, wherein the S1 subunit is from an OC43 S protein. 106. The mRNA of paragraph 105, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% with the amino acid sequence of SEQ ID NO: 41 % or at least 99% identical amino acid sequence, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 41. 107. The mRNA of paragraph 105 or 106, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 40 %, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 40. 108. The mRNA of any of the preceding paragraphs, wherein the antigen further comprises a scaffold domain, optionally selected from the group consisting of ferritin, lumazine synthetase, and foldon. 109. The mRNA of paragraph 108, wherein the scaffold domain is ferritin. 110. The mRNA of paragraph 109, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, with the amino acid sequence of SEQ ID NO: 8 or 65, An amino acid sequence that is at least 98% or at least 99% identical, as appropriate, wherein the antigen comprises the amino acid sequence of SEQ ID NO: 8 or 65. 111. The mRNA of paragraph 109 or 110, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, with the nucleotide sequence of SEQ ID NO:7 or 64. A nucleotide sequence that is at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 7 or 64. 112. The mRNA of paragraph 108, wherein the scaffold domain is a lumazine synthase. 113. The mRNA of paragraph 112, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95% with the amino acid sequence of any one of SEQ ID NOs: 11, 14, 68 or 71 , at least 96%, at least 97%, at least 98% or at least 99% identical amino acid sequences, optionally wherein the antigen comprises an amino acid of any one of SEQ ID NOs: 11, 14, 68 or 71 sequence. 114. The mRNA of paragraph 112 or 113, wherein the open reading frame comprises at least 70%, at least 75%, at least 80% with the nucleotide sequence of any one of SEQ ID NOs: 10, 13, 67 or 70 , at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises SEQ ID NO: 10, 13, 67 or The nucleotide sequence of any of 70. 115. The mRNA of paragraph 108, wherein the scaffold domain is a foldon. 116. The mRNA of paragraph 115, wherein the antigen comprises at least 80%, at least 85% with the amino acid sequence of any one of SEQ ID NOs: 44, 50, 74, 80, 83, 101, 104 or 113 , at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises SEQ ID NOs: 44, 50, 74, 80, The amino acid sequence of any of 83, 101, 104, or 113. 117. The mRNA of paragraph 115 or 116, wherein the open reading frame comprises at least 70% with the nucleotide sequence of any one of SEQ ID NOs: 43, 49, 73, 79, 82, 100, 103 or 112 , at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises SEQ ID The nucleotide sequence of any of NO: 43, 49, 73, 79, 82, 100, 103 or 112. 118. The mRNA of any of the preceding paragraphs, wherein the antigen further comprises a trafficking signal, optionally selected from macrophage markers, optionally CD86, CD11B and/or VSVGct. 119. The mRNA of paragraph 118, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least the amino acid sequence of any one of SEQ ID NOs: 95, 98 or 110 An amino acid sequence that is 96%, at least 97%, at least 98%, or at least 99% identical, as appropriate, wherein the antigen comprises the amino acid sequence of any one of SEQ ID NOs: 95, 98, or 110. 120. The mRNA of paragraph 118 or 119, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least the nucleotide sequence of any one of SEQ ID NOs: 94, 97 or 109 A nucleotide sequence that is 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises any of SEQ ID NOs: 94, 97 or 109 The nucleotide sequence of one. 121. The mRNA of any of paragraphs 67-120, formulated in a lipid nanoparticle. 122. The mRNA of paragraph 121, wherein the lipid nanoparticles comprise cationic lipids (optionally ionizable cationic lipids), neutral lipids, sterols, and/or polyethylene glycol (PEG) modified lipids. 123. The mRNA or composition of paragraph 108, wherein the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxygen-6-(undecyloxy)hexyl)amino ) heptadecan-9-yl caprylate (compound 1), neutral lipids 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC), sterols cholesterol, and/or The PEG-modified lipid is 1,2 dimyristoyl-sn-glycerol-methoxypolyethylene glycol (PEG2000 DMG). 124. The mRNA or composition of any one of paragraphs 121 to 123, wherein the lipid nanoparticle comprises 20-60 mol% ionizable cationic lipids, 5-25 mol% neutral lipids, 25-55 mol% solid lipids Lipids modified with alcohol and 0.5-15 mol% PEG. 125. The mRNA or composition of paragraph 124, wherein the lipid nanoparticle comprises: 47 mol% ionizable cationic lipids; 11.5 mol% neutral lipids; 38.5 mol% sterols; and 3.0 mol% PEG-modified lipids ; 48 mol% ionizable cationic lipids; 11 mol% neutral lipids; 38.5 mol% sterols; and 2.5 mol% PEG-modified lipids; 49 mol% ionizable cationic lipids; 10.5 mol% neutral lipids; 38.5 mol% mol% sterols; and 2.0 mol% PEG-modified lipids; 50 mol% ionizable cationic lipids; 10 mol% neutral lipids; 38.5 mol% sterols; and 1.5 mol% PEG-modified lipids; or 51 mol% % ionizable cationic lipids; 9.5 mol% neutral lipids; 38.5 mol% sterols; and 1.0 mol% PEG-modified lipids. 126. The mRNA or composition of paragraph 125, wherein the lipid nanoparticle comprises: 47 mol% Compound 1; 11.5 mol% DSPC; 38.5 mol% cholesterol; and 3.0 mol% PEG2000 DMG; 48 mol% Compound 1; 11 mol % DSPC; 38.5 mol % cholesterol; and 2.5 mol % PEG2000 DMG; 49 mol % Compound 1; 10.5 mol % DSPC; 38.5 mol % cholesterol; and 2.0 mol % PEG2000 DMG; 50 mol % Compound 1; 10 mol % DSPC; mol% cholesterol; and 1.5 mol% PEG2000 DMG; or 51 mol% Compound 1; 9.5 mol% DSPC; 38.5 mol% cholesterol; and 1.0 mol% PEG2000 DMG. 127. A method comprising administering to an individual the mRNA of any of paragraphs 67-126 in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual. 128. A method comprising administering to an individual an mRNA or composition of any of paragraphs 67-126 in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual. Example Example 1. Performance Data

本研究中使用之mRNA用於表現SARS-CoV-2冠狀病毒刺突(S)蛋白之關鍵中和結構域,且評價該等中和蛋白結構域在個別地或組合用作免疫原性組合物或疫苗以保護人免受活的及傳播之天然病毒感染時是否可更有效地誘導保護性免疫性。由mRNA編碼之蛋白質之綫性設計如 2中所示。蛋白質亦皆含有源自流行性感冒之血球凝集素(HA)之羧基(C)末端跨膜結構域(TM)。 The mRNA used in this study was used to express the key neutralizing domains of the SARS-CoV-2 coronavirus spike (S) protein, and these neutralizing protein domains were evaluated individually or in combination as immunogenic compositions Or vaccines to protect people from live and circulating natural viruses are more effective in inducing protective immunity. The linear design of the protein encoded by the mRNA is shown in Figure 2 . The proteins also all contain the carboxy (C) terminal transmembrane domain (TM) of the influenza-derived hemagglutinin (HA).

已知NTD及RBD二者皆係結合表現中和病毒活性之抗體之位點。在SARS-CoV-2之情形下,RBD係刺突蛋白之受體結合位點,且結合血管收縮肽轉化酶2 (ACE2)。胺基(N)末端結構域NTD之功能尚未完全瞭解,似乎在結合糖部分及促進刺突蛋白自融合前構象向融合後構象之構象轉變中起作用。參見Zhou H, Chen Y, Zhang S等人, Nat Commun.2019;10(1): 3068。無論如何,NTD及RBD結構域二者皆誘導如下文論述之高結合抗體及中和抗體效價。 Both NTD and RBD are known to bind to the site of antibodies that exhibit virus-neutralizing activity. In the case of SARS-CoV-2, RBD is the receptor binding site for the spike protein and binds vasoconstrictor peptide converting enzyme 2 (ACE2). The function of the amine (N) terminal domain, NTD, is not fully understood and appears to play a role in binding the carbohydrate moiety and promoting the conformational transition of the spike protein from the prefusion to the postfusion conformation. See Zhou H, Chen Y, Zhang S et al., Nat Commun. 2019;10(1):3068. Regardless, both the NTD and RBD domains induce high binding and neutralizing antibody titers as discussed below.

使用對SARS-CoV-2刺突蛋白之受體結合結構域(RBD) (mAb1)及SARS-CoV-2刺突蛋白之N末端結構域(NTD) (Ab2)特異性之抗體,在 16 17中顯示mRNA RBD-TM疫苗(「SARS-CoV-2 RBD-TM」;SEQ ID NO: 75-77)、mRNA NTD-TM疫苗(「SARS-CoV-2 NTD-TM」;SEQ ID NO: 45-47)及mRNA NTD-RBD-TM (「SARS-CoV-1 NTD-RBD-TM」;SEQ ID NO: 90-92)疫苗之表現數據。 16顯示在24小時(hr)、48 hr及72 hr時,與WT SARS-CoV-2刺突蛋白mRNA相比,MFI * Freq之平均倍數差異(在稀釋範圍內) ( 16)。 16.與S2P蛋白相比之總抗原表現(MFI * Freq)之倍數變化 單株抗體 (mAb) 野生型 (WT) S mRNA RBD-TM mRNA NTD-TM mRNA NTD-RBD-TM mRNA 空白 24 小時 (hr) 1 – 特異於RBD 1.8 30.6 0.0 10.4 0.0 24 hr 2 – 特異於NTD 0.7 0.0 9.7 2.5 0.0 48 hr 1 1.2 40.1 0.0 7.3 0.0 48 hr 2 0.8 0.0 19.0 6.0 0.0 72 hr 1 0.8 32.5 0.1 11.6 0.0 72 hr 2 0.7 0 14.4 4.1 0.0 RBD =受體結合結構域 NTD =N末端結構域 TM =跨膜結構域 實例 2. 單一劑量後第 21 天之免疫原性數據及中和數據 Antibodies specific for the receptor binding domain (RBD) (mAb1) of the SARS-CoV-2 spike protein and the N-terminal domain (NTD) (Ab2) of the SARS-CoV-2 spike protein were used, in Table 16 mRNA RBD-TM vaccine ("SARS-CoV-2 RBD-TM"; SEQ ID NO: 75-77), mRNA NTD-TM vaccine ("SARS-CoV-2 NTD-TM"; SEQ ID NO: 75-77) shown in and 17 : 45-47) and mRNA NTD-RBD-TM ("SARS-CoV-1 NTD-RBD-TM"; SEQ ID NO: 90-92) vaccine performance data. Table 16 shows the mean fold difference in MFI*Freq (over the dilution range) compared to WT SARS-CoV-2 spike protein mRNA at 24 hours (hr), 48 hr and 72 hr ( Table 16 ). Table 16. Fold change in total antigen expression (MFI*Freq) compared to S2P protein Monoclonal Antibody (mAb) Wild-type (WT) S mRNA RBD-TM mRNA NTD-TM mRNA NTD-RBD-TM mRNA blank 24 hours (hr) 1 – Specific to RBD 1.8 30.6 0.0 10.4 0.0 24 hours 2 – Specific to NTD 0.7 0.0 9.7 2.5 0.0 48 hours 1 1.2 40.1 0.0 7.3 0.0 48 hours 2 0.8 0.0 19.0 6.0 0.0 72 hours 1 0.8 32.5 0.1 11.6 0.0 72 hours 2 0.7 0 14.4 4.1 0.0 RBD = receptor binding domain NTD = N-terminal domain TM = transmembrane domain Example 2. Immunogenicity data and neutralization data at day 21 post single dose

以如下劑量向小鼠投與mRNA NTD-TM及mRNA RBD-TM (闡述於 實例 1中):0.001 μg、0.01 μg、0.1 μg或1 μg (N=8)。以如下劑量向小鼠投與mRNA NTD-RBD-TM (闡述於 實例 1中):0.1 μg或1 μg (N=8)。向小鼠投與mRNA NTD-TM及mRNA RBD-TM之50:50混合物,其含有0.1 μg每一mRNA,總共0.2 μg mRNA,或1 μg每一mRNA,總共2 μg mRNA (N=8)。接著在疫苗接種後第21天藉由ELISA量測SARS-CoV-2刺突蛋白特異性IgG效價( 17)、SARS-CoV-2 RBD特異性IgG效價( 18)及SARS-CoV-2 NTD特異性IgG效價( 19)。數據提供於 17-19中。0.1 μg劑量之mRNA NTD-RBD-TM及0.2 μg劑量之mRNA NTD-TM及mRNA RBD-TM組合物之50:50混合物引發可觀察到之NTD特異性及RBD特異性IgG效價,且0.1 μg劑量之RBD-TM及NTD-TM分別引發可量測之針對RBD及NTD抗原之IgG效價。 17.SARS-CoV-2 S1/S2刺突蛋白特異性IgG效價 - 平均值 微克(每種mRNA構築體) RBD-TM NTD-TM NTD-RBD-TM 50:50混合物 0.001 13 13 - - 0.01 13 13 - - 0.1 13 13 13 19 1 225 13 379 707 N=8之平均值;僅PBS = 1.1 18.RBD結構域特異性IgG效價 - 平均值 微克(每種mRNA構築體) RBD-TM NTD-RBD-TM 50:50混合物 0.001 15 - - 0.01 14 - - 0.1 75 209 266 1 1947 7710 15530 N=8之平均值;僅PBS = 1.1 19.NTD結構域特異性IgG效價 - 平均值 微克(每種mRNA構築體) NTD-TM NTD-RBD-TM 50:50混合物 0.001 13 - - 0.01 14 - - 0.1 30 20 38 1 620 848 1075 N=8之平均值;僅PBS = 1.1 Mice were administered mRNA NTD-TM and mRNA RBD-TM (described in Example 1 ) at the following doses: 0.001 μg, 0.01 μg, 0.1 μg or 1 μg (N=8). Mice were administered mRNA NTD-RBD-TM (described in Example 1 ) at the following doses: 0.1 μg or 1 μg (N=8). Mice were administered a 50:50 mixture of mRNA NTD-TM and mRNA RBD-TM containing 0.1 μg of each mRNA, 0.2 μg of mRNA in total, or 1 μg of each mRNA, 2 μg of mRNA in total (N=8). SARS-CoV-2 spike protein-specific IgG titers ( Table 17 ), SARS-CoV-2 RBD-specific IgG titers ( Table 18 ) and SARS-CoV were then measured by ELISA on day 21 post-vaccination -2 NTD-specific IgG titers ( Table 19 ). Data are provided in Tables 17-19 . mRNA NTD-RBD-TM at a dose of 0.1 μg and a 50:50 mixture of the composition of mRNA NTD-TM and mRNA RBD-TM at a dose of 0.2 μg elicited observable NTD-specific and RBD-specific IgG titers, and 0.1 μg Doses of RBD-TM and NTD-TM elicited measurable IgG titers against RBD and NTD antigens, respectively. Table 17. SARS-CoV-2 S1/S2 Spike Protein-Specific IgG Titers - Mean μg (per mRNA construct) RBD-TM NTD-TM NTD-RBD-TM 50:50 mix 0.001 13 13 - - 0.01 13 13 - - 0.1 13 13 13 19 1 225 13 379 707 Mean of N=8; PBS only = 1.1 Table 18. RBD domain specific IgG titers - mean μg (per mRNA construct) RBD-TM NTD-RBD-TM 50:50 mix 0.001 15 - - 0.01 14 - - 0.1 75 209 266 1 1947 7710 15530 Mean of N=8; PBS only = 1.1 Table 19. NTD domain specific IgG titers - mean μg (per mRNA construct) NTD-TM NTD-RBD-TM 50:50 mix 0.001 13 - - 0.01 14 - - 0.1 30 20 38 1 620 848 1075 Average of N=8; PBS only = 1.1

量測來自用1 μg劑量之RBD-TM及NTD-RBD-TM組合物及2 μg劑量之NTD-TM及RBD-TM組合物之50:50混合物接種之小鼠之血清之中和效價,且分析ELISA效價與中和效價之間之相關性( 7)。 Neutralization titers of sera from mice vaccinated with a 1 μg dose of the RBD-TM and NTD-RBD-TM composition and a 50:50 mixture of the NTD-TM and RBD-TM composition at a 2 μg dose were measured, And the correlation between ELISA titer and neutralization titer was analyzed ( Fig. 7 ).

由1 μg劑量之NTD-RBD-TM組合物或2 μg之NTD-TM及RBD-TM組合物之50:50混合物引發之效價大於由1 μg劑量之RBD-TM組合物引發之彼等效價( 20)。在刺突特異性IgG、RBD特異性IgG及NTD特異性IgG之中和效價及ELISA效價之間存在顯著之相關性( 7)。 20.中和效價 - 平均值 微克(每種mRNA構築體) RBD-TM NTD-RBD-TM 50:50混合物 1 238 504 - 2 - - 421 N=8之平均值;僅PBS = 1.1 基於重組 VSVΔG SARS-CoV-2 假病毒中和分析 The titers elicited by a 1 μg dose of the NTD-RBD-TM composition or a 50:50 mixture of the NTD-TM and RBD-TM composition at 2 μg were greater than those elicited by a 1 μg dose of the RBD-TM composition price ( Table 20 ). There was a significant correlation between neutralizing titers and ELISA titers for spike-specific IgG, RBD-specific IgG, and NTD-specific IgG ( Figure 7 ). Table 20. Neutralizing potency - mean μg (per mRNA construct) RBD-TM NTD-RBD-TM 50:50 mix 1 238 504 - 2 - - 421 Average of N=8; PBS only = 1.1 SARS-CoV-2 pseudovirus neutralization assay based on recombinant VSVΔG

將密碼子最佳化之野生型或D614G刺突基因(Wuhan-Hu-1株;NC_045512.2)選殖至pCAGGS載體中。為了產生基於VSVΔG之SARS-CoV-2假病毒,如先前所述(Whitt, 2010),用刺突表現質體及感染之VSVΔG-螢火蟲-螢光素酶轉染BHK-21/WI-2細胞。使用A549-hACE2-TMPRSS2細胞作為基於VSV∆G之SARS-CoV-2假病毒中和分析之靶細胞。製備編碼hACE2-P2A-TMPRSS2之慢病毒屬,以產生A549-hACE2-TMPRSS2細胞,將其維持在補充有10%胎牛血清及1 μg/mL嘌呤黴素(puromycin)之DMEM中。在37℃下用假病毒感染A549-hACE2-TMPRSS2細胞1小時。感染後除去接種病毒或病毒-抗體混合物。18小時後,將等體積之One-Glo試劑(Promega;E6120)添加至培養基中,使用BMG PHERastar-FS讀板儀讀數。中和程序及數據分析與上述基於慢病毒屬之假病毒中和分析中之相同。參見Whitt, M.A. (2010). Journal of Virological Methods 169, 365-374。 實例 3. 兩個劑量後第 36 天之免疫原性數據 The codon-optimized wild-type or D614G spike gene (strain Wuhan-Hu-1; NC_045512.2) was cloned into the pCAGGS vector. To generate VSVΔG-based SARS-CoV-2 pseudoviruses, BHK-21/WI-2 cells were transfected with spike-expressing plastids and infected VSVΔG-firefly-luciferase as previously described (Whitt, 2010). . A549-hACE2-TMPRSS2 cells were used as target cells for the VSVΔG-based SARS-CoV-2 pseudovirus neutralization assay. A lentivirus encoding hACE2-P2A-TMPRSS2 was prepared to generate A549-hACE2-TMPRSS2 cells, which were maintained in DMEM supplemented with 10% fetal bovine serum and 1 μg/mL puromycin. A549-hACE2-TMPRSS2 cells were infected with pseudovirus for 1 hour at 37°C. The inoculum virus or virus-antibody mixture is removed after infection. After 18 hours, an equal volume of One-Glo reagent (Promega; E6120) was added to the medium and read using a BMG PHERastar-FS plate reader. The neutralization procedure and data analysis were the same as in the above-described lentivirus-based pseudovirus neutralization analysis. See Whitt, MA (2010). Journal of Virological Methods 169 , 365-374. Example 3. Immunogenicity data at day 36 after two doses

接種疫苗第一劑量後第22天,再次將 實例 2中所述之相同劑量之mRNA疫苗作為加强劑量投與給小鼠。自第36天血清藉由ELISA量測在加强劑量後產生之針對RBD抗原、NTD抗原、野生型(WT)刺突(S)蛋白及S2P蛋白(具有雙脯胺酸突變以穩定融合前構象之S蛋白)中之每一者之抗體之效價,且顯示如下。將由LNP中之mRNA編碼之RBD-TM及NTD-TM之兩種免疫原性組合物之50:50混合物在第22天以2 μg或0.2 μg總mRNA作為加强劑量投與給小鼠,且在第36天測定效價。參見 21On day 22 after the first dose of the vaccine, the same dose of mRNA vaccine described in Example 2 was again administered to the mice as a booster dose. Serum from day 36 was measured by ELISA against RBD antigen, NTD antigen, wild-type (WT) Spike (S) protein and S2P protein (with bisproline mutation to stabilize the prefusion conformation against RBD antigen, NTD antigen) by ELISA The titers of antibodies to each of the S protein) are shown below. A 50:50 mixture of two immunogenic compositions of RBD-TM and NTD-TM encoded by mRNA in LNP was administered to mice on day 22 as a booster dose of 2 μg or 0.2 μg total mRNA, and on day 22. The titers were determined on day 36. See Table 21 .

21中顯示之用由LNP中之mRNA編碼之RBD-TM、NTD-TM或NTD-RBD-TM免疫之小鼠之WT S蛋白效價指示,在誘導可識別且結合至SARS-CoV-2 WT S蛋白之抗體方面,兩個劑量在所有測試之劑量下皆係優異的。 21.SARS-CoV-2 WT S特異性IgG效價 - 幾何平均值 微克(mRNA構築體) RBD-TM NTD-TM NTD-RBD-TM    第21天(GMT)效價 0.001 13 13 - 0.01 13 13 - 0.1 13 13 14 1 225 13 379    第36天(GMT)效價 0.001 13 13 - 0.01 295 13 - 0.1 1,017 39 3879 1 27,674 6317 24,413 N=8之平均值;僅PBS = 1.1 An indication of WT S protein titer in mice immunized with RBD-TM, NTD-TM or NTD-RBD-TM encoded by mRNA in LNP, shown in Table 21 , recognizes and binds to SARS-CoV-2 upon induction For antibodies to the WT S protein, both doses were excellent at all doses tested. Table 21. SARS-CoV-2 WT S-specific IgG titers - geometric mean μg (mRNA construct) RBD-TM NTD-TM NTD-RBD-TM Day 21 (GMT) titer 0.001 13 13 - 0.01 13 13 - 0.1 13 13 14 1 225 13 379 Day 36 (GMT) titer 0.001 13 13 - 0.01 295 13 - 0.1 1,017 39 3879 1 27,674 6317 24,413 Average of N=8; PBS only = 1.1

進一步分析來自用兩個劑量之由LNP中之mRNA編碼之RBD-TM、NTD-TM或NTD-RBD-TM免疫之小鼠之血清的抗體識別及結合SARS-CoV-2 S2P蛋白之能力。藉由ELISA使用S2P作為板上之抗原測定SARS-CoV-2 S2P蛋白之效價,且其示於下 22中。當S2P為抗原時,該等免疫原中之每一者誘導比當WT S蛋白為ELISA抗原時高得多之抗體效價。比較 21 22 22.SARS-CoV-2 S2P特異性IgG效價 - 幾何平均值 微克(mRNA構築體) RBD-TM NTD-TM NTD-RBD-TM    第36天(GMT)效價 0.001 88 16 - 0.01 1,416 280 - 0.1 3,892 11,687 25,298 1 107,611 192,040 223,140 N=8之平均值;僅PBS = 1.1 Sera from mice immunized with two doses of RBD-TM, NTD-TM or NTD-RBD-TM encoded by mRNA in LNP were further analyzed for their ability to recognize and bind to the SARS-CoV-2 S2P protein. The titer of the SARS-CoV-2 S2P protein was determined by ELISA using S2P as the antigen on the plate and is shown in Table 22 below. When S2P was the antigen, each of these immunogens induced much higher antibody titers than when the WT S protein was the ELISA antigen. Compare Table 21 and Table 22 . Table 22. SARS-CoV-2 S2P-specific IgG titers - geometric mean μg (mRNA construct) RBD-TM NTD-TM NTD-RBD-TM Day 36 (GMT) titer 0.001 88 16 - 0.01 1,416 280 - 0.1 3,892 11,687 25,298 1 107,611 192,040 223,140 Average of N=8; PBS only = 1.1

23中,免疫原係由LNP中之mRNA編碼之RBD-TM及NTD-TM之50:50混合物,且在一個劑量(第21天)及兩個劑量(第36天)後測定對WT S、RBD、NTD及S2P之效價。該等結果顯示,當免疫原係RBD-TM及NTD-TM以50:50混合之組合時,與相同劑量之個別抗原誘導之抗體效價相比,效價顯著增加。50:50混合物誘導對免疫抗原之良好效價,但令人驚訝的是,對WT S蛋白之效價甚至更佳且對S2P蛋白之效價極高。參見 23 23.SARS-CoV-2抗原特異性IgG效價 - 幾何平均值 免疫原 ELISA板上偵測抗原 微克(mRNA) 50:50混合物 RBD-TM LNP NTD-TM LNP 抗刺突蛋白IgG 抗RBD IgG 抗NTD IgG 抗S2P IgG 第21天(GMT)效價          0.2 19 266 38 4,869 2 707 15,530 1,075 52,936 第36天(GMT)效價          0.2 7,259 15,957 4,869 74,718 2 75,871 186,740 52,936 2,726,948 N=8之平均值;僅PBS = 1.1 In Table 23 , the immunogen was a 50:50 mixture of RBD-TM and NTD-TM encoded by mRNA in LNP, and was tested against WT after one dose (day 21) and two doses (day 36) The titers of S, RBD, NTD and S2P. These results show that when the immunogens are a combination of RBD-TM and NTD-TM in a 50:50 mix, the titers are significantly increased compared to the antibody titers induced by the same doses of the individual antigens. The 50:50 mixture induced good titers against the immunizing antigen, but surprisingly, the titers against the WT S protein were even better and the titers against the S2P protein were extremely high. See Table 23 . Table 23. SARS-CoV-2 antigen-specific IgG titers - geometric mean immunogen Antigen detection on ELISA plate μg (mRNA) 50:50 mixture RBD-TM LNP NTD-TM LNP anti-spike IgG anti-RBD IgG anti-NTD IgG Anti-S2P IgG Day 21 (GMT) titer 0.2 19 266 38 4,869 2 707 15,530 1,075 52,936 Day 36 (GMT) titer 0.2 7,259 15,957 4,869 74,718 2 75,871 186,740 52,936 2,726,948 Average of N=8; PBS only = 1.1

24顯示用RBD-TM及NTD-TM中之每一者作為編碼彼等抗原之mRNA進行免疫之結果。使用由mRNA免疫原編碼之蛋白質作為ELISA板上之抗原,量測每組8隻小鼠之幾何平均效價。同樣,當抗原作為調配於LNP中之mRNA投與時,兩種免疫原性組合物皆誘導對免疫抗原之高效價。在此情形中,兩個劑量在所有濃度下皆產生優異之抗體反應。然而,以每微克(μg)計,該等抗原之50:50混合物誘導之抗體反應係單獨投與抗原時之約10倍。比較 23 24 24.SARS-CoV-2 RBD及NTD結構域特異性IgG效價 - 幾何平均值 微克(mRNA構築體) RBD-TM 板上抗原 NTD-TM 板上抗原    第21天(GMT)效價 0.001 15 13 0.01 14 14 0.1 75 30 1 1,947 620    第36天(GMT)效價 0.001 25 13 0.01 1,357 123 0.1 4,678 6898 1 84,385 46,516 N=8之平均值;僅PBS = 1.1 Table 24 shows the results of immunization with each of RBD-TM and NTD-TM as mRNA encoding their antigens. The geometric mean titers of 8 mice per group were measured using the protein encoded by the mRNA immunogen as the antigen on the ELISA plate. Likewise, both immunogenic compositions induced high titers to the immunizing antigen when the antigen was administered as mRNA formulated in LNP. In this case, both doses produced excellent antibody responses at all concentrations. However, on a per microgram (μg) basis, the 50:50 mixture of these antigens induced an antibody response that was about 10 times greater than when the antigen was administered alone. Compare Table 23 and Table 24 . Table 24. SARS-CoV-2 RBD and NTD domain-specific IgG titers - geometric mean μg (mRNA construct) Antigen on RBD-TM plate Antigen on NTD-TM plate Day 21 (GMT) titer 0.001 15 13 0.01 14 14 0.1 75 30 1 1,947 620 Day 36 (GMT) titer 0.001 25 13 0.01 1,357 123 0.1 4,678 6898 1 84,385 46,516 Average of N=8; PBS only = 1.1

在第1天及第21天,以0.1及1 μg劑量將包含與RBD連接且由LNP中之mRNA編碼之NTD之融合蛋白作為免疫原性組合物投與給8隻小鼠之組。參見下 25。甚至編碼NTD-RBD-TM之融合蛋白形式之mRNA對個別結構域亦誘導極佳效價,該等效價高於當單一結構域為免疫抗原時之效價。對S2P蛋白之效價係對WT S蛋白之效價之約8倍。參見 25 25.SARS-CoV-2 NTD-RBD-TM結構域特異性IgG 效價 - 幾何平均值 微克免疫原 NTD-RBD-TM (mRNA構築體) RBD-TM 板上抗原 NTD-TM 板上抗原 野生型刺突 板上抗原 S2P 板上抗原       第21天(GMT)效價 0.1 209 20 13 NA 1 7,710 848 379 NA       第36天(GMT)效價 0.1 5,769 2,875 3,878 25,298 1 111,747 41,876 24,413 223,140 N=8之平均值;僅PBS = 1.1 On days 1 and 21, a fusion protein comprising NTD linked to RBD and encoded by mRNA in LNP was administered as an immunogenic composition to groups of 8 mice at 0.1 and 1 μg doses. See Table 25 below. Even the mRNA encoding the fusion protein form of NTD-RBD-TM induced excellent titers for the individual domains, which were higher than when the single domain was the immunizing antigen. The potency against the S2P protein was about 8 times the potency against the WT S protein. See Table 25 . Table 25. SARS-CoV-2 NTD-RBD-TM domain-specific IgG titers - geometric mean μg immunogen NTD-RBD-TM (mRNA construct) Antigen on RBD-TM plate Antigen on NTD-TM plate Antigen on wild-type spike plate Antigen on S2P Plate Day 21 (GMT) titer 0.1 209 20 13 NA 1 7,710 848 379 NA Day 36 (GMT) titer 0.1 5,769 2,875 3,878 25,298 1 111,747 41,876 24,413 223,140 Average of N=8; PBS only = 1.1

中和數據示於 26中。由mRNA編碼之S1-666-TM係使用S1子結構域之抗原,特別是SARS-CoV-2刺突蛋白之附接至跨膜結構域之殘基1-666。 26.第1天及第22天兩次免疫後第36天之平均中和效價. 微克 (每種mRNA構築體) S1-666-TM NTD-TM及RBD-TM之50:50混合物 RBD-TM NTD-TM NTD-RBD-TM 0.001 - - 40 40 - 0.01 508 - 70.5 40 - 0.1 11,561 1,336 260 205 865 1.0 - 25,208 7,402 4,669 57,891 N=8之平均值;僅PBS = 1.1 實例 4. S1-666-TM 之免疫原性 Neutralization data are shown in Table 26 . The S1-666-TM encoded by the mRNA uses the antigens of the S1 subdomain, specifically residues 1-666 of the SARS-CoV-2 spike protein attached to the transmembrane domain. Table 26. Mean neutralizing titers on day 36 after two immunizations on day 1 and day 22. μg (per mRNA construct) S1-666-TM 50:50 mixture of NTD-TM and RBD-TM RBD-TM NTD-TM NTD-RBD-TM 0.001 - - 40 40 - 0.01 508 - 70.5 40 - 0.1 11,561 1,336 260 205 865 1.0 - 25,208 7,402 4,669 57,891 Average of N=8; PBS only=1.1 Example 4. Immunogenicity of S1-666-TM

將由LNP中之mRNA編碼之S1-666-TM (或刺突蛋白S之S1殘基1-666)在第1天作為初免免疫投與給小鼠,且在第22天作為加强劑量投與0.01 μg及0.1 μg (N =8)組。自第21天(加强前)及第36天(加强後)血清藉由ELISA量測加强劑量後產生之針對mRNA RBD、mRNA NTD及mRNA野生型(WT)刺突(S)蛋白( 1)中每一者之抗體效價,且示於下 27中。 S1-666-TM (or S1 residues 1-666 of Spike protein S) encoded by mRNA in LNP was administered to mice as a prime immunization on day 1 and as a booster dose on day 22 0.01 μg and 0.1 μg (N = 8) groups. Serum from day 21 (pre-boost) and day 36 (post-boost) was measured by ELISA against mRNA RBD, mRNA NTD and mRNA wild-type (WT) Spike (S) protein by ELISA ( Figure 1 ) The antibody titers for each of these are shown in Table 27 below.

27中顯示之用由LNP中之mRNA編碼之S1-666-TM免疫之小鼠之WT S蛋白效價指示,在誘導可識別且結合至SARS-CoV-2 WT S蛋白之抗體方面,兩個劑量在所有測試之劑量下皆係優異的。令人驚訝的是,即使在S1中未發現2P突變(這是因為2P突變發生在S2中,而免疫原中不存在S2),當針對刺突蛋白之S2P形式量測時,誘導之效價亦最高。與其他構築體相似,NTD效價需要第二劑量升高。 27.SARS-CoV-2抗原特異性IgG效價 - 幾何平均值 免疫原 ELISA板上偵測抗原 微克(mRNA) S1-666-TM 抗刺突蛋白IgG 抗RBD IgG 抗NTD IgG 抗S2P IgG 第21天(GMT)效價          0.01 13 30 13 NA 0.1 175 868 49 NA 第36天(GMT)效價          0.01 2,450 12,952 112 186,561 0.1 17,066 69,969 9,015 874,609 N=8之平均值;僅PBS = 1.1 實例 5. RBD-TM NTD-TM NTD-RBD-TM NTD-TM/RBD-TM 組合物之 50:50 混合物在兩個劑量後在第 36 天之免疫原性 The WT S protein titer indications of mice immunized with S1-666-TM encoded by the mRNA in LNP, shown in Table 27 , were both effective in inducing antibodies that could recognize and bind to the SARS-CoV-2 WT S protein. This dose was excellent at all doses tested. Surprisingly, even though no 2P mutation was found in S1 (this is because the 2P mutation occurs in S2, which is not present in the immunogen), when measured against the S2P form of the spike protein, the titer of induction Also the highest. Similar to the other constructs, NTD titers required a second dose increase. Table 27. SARS-CoV-2 antigen-specific IgG titers - geometric mean immunogen Antigen detection on ELISA plate Micrograms (mRNA) S1-666-TM anti-spike IgG anti-RBD IgG anti-NTD IgG Anti-S2P IgG Day 21 (GMT) titer 0.01 13 30 13 NA 0.1 175 868 49 NA Day 36 (GMT) titer 0.01 2,450 12,952 112 186,561 0.1 17,066 69,969 9,015 874,609 Average of N=8; PBS only=1.1 Example 5. 50:50 mixture of RBD-TM , NTD-TM , NTD-RBD-TM and NTD-TM/RBD-TM composition at 36 after two doses day immunogenicity

在該重複實驗中,在接種第一劑量後第22天,再次向小鼠投與上述實例中所述相同劑量之mRNA疫苗作為加强劑量。接著在接種第一劑量後第36天藉由ELISA量測SARS-CoV-2刺突蛋白特異性IgG效價、SARS-CoV-2 S2P蛋白特異性IgG效價、SARS-CoV-2 RBD特異性IgG效價及SARS-CoV-2 NTD特異性IgG效價。In this replicate experiment, on day 22 after the first dose, mice were again administered the same dose of mRNA vaccine as described in the above example as a booster dose. The SARS-CoV-2 spike protein-specific IgG titers, SARS-CoV-2 S2P protein-specific IgG titers, SARS-CoV-2 RBD-specific IgG titers were then measured by ELISA on day 36 after the first dose of inoculation IgG titers and SARS-CoV-2 NTD-specific IgG titers.

結果顯示1 μg及0.1 μg劑量之mRNA RBD-TM、mRNA NTD-TM、mRNA NTD-RBD-TM組合物及含有1 μg或0.1 μg mRNA RBD-TM及mRNA NTD-TM組合物中之每一者之50:50混合物引發針對SARS-CoV-2刺突或SARS-CoV-2 S2P蛋白之高ELISA效價。 實例 6. 免疫原性研究 The results show 1 μg and 0.1 μg doses of mRNA RBD-TM, mRNA NTD-TM, mRNA NTD-RBD-TM compositions and compositions containing 1 μg or 0.1 μg of each of mRNA RBD-TM and mRNA NTD-TM The 50:50 mixture elicited high ELISA titers against the SARS-CoV-2 spike or the SARS-CoV-2 S2P protein. Example 6. Immunogenicity studies

將mRNA NTD-TM及mRNA RBD-TM之50:50混合物之免疫原性以如下劑量投與給小鼠:0.2 μg或2 μg總mRNA (0.1 μg或1 μg每一mRNA) (N=8)。在第1天投與初免劑量,且在第22天投與加强劑量。在第36天,使用ELISA來評價抗體與SARS-CoV-2穩定之融合前刺突蛋白(SARS-CoV-2 pre-S)之結合。將包括mRNA NTD-RBD-TM、mRNA RBD-TM及mRNA NTD-TM之以下疫苗組合物以如下劑量投與給小鼠:0.1 μg及0.01 μg (N=8)。測定GMT數據且示於下 28中。 28.S蛋白結構域融合及組合誘導之SARS-CoV-2刺突效價 疫苗免疫原 劑量 µg mRNA 21 GMT 效價 劑量 1 3 S2P 包被在板上 36 GMT 效價 劑量 2 3 S2P 包被在板上 50:50混合物 RBD-TM NTD-TM 1 5,913 301,881 50:50混合物 RBD-TM NTD-TM 0.1 71 13,423 NTD-RBD-TM 1.0 3,329 177,499 NTD-RBD-TM 0.1 80 48947 RBD-TM 1.0 3198 593,730 RBD-TM 0.1 73 31,090 NTD-TM 1.0 593 74,777 NTD-TM 0.1 21 23,916 實例 7. NTD-RBD-TM IgG2a IgG1 之比率之測定 The immunogenicity of a 50:50 mixture of mRNA NTD-TM and mRNA RBD-TM was administered to mice at the following doses: 0.2 μg or 2 μg total mRNA (0.1 μg or 1 μg each mRNA) (N=8) . The prime dose was administered on day 1 and the booster dose was administered on day 22. On day 36, antibody binding to SARS-CoV-2 stable prefusion spike protein (SARS-CoV-2 pre-S) was assessed using ELISA. The following vaccine compositions including mRNA NTD-RBD-TM, mRNA RBD-TM and mRNA NTD-TM were administered to mice at the following doses: 0.1 μg and 0.01 μg (N=8). GMT data were determined and are shown in Table 28 below. Table 28. SARS-CoV-2 spike titers induced by S protein domain fusions and combinations vaccine immunogen Dose µg mRNA Day 21 GMT titer dose 1 after 3 weeks of S2P coating on plates Day 36 GMT titer dose 2 post 3 weeks S2P coating on plates 50:50 Mixture RBD-TM NTD-TM 1 5,913 301,881 50:50 Mixture RBD-TM NTD-TM 0.1 71 13,423 NTD-RBD-TM 1.0 3,329 177,499 NTD-RBD-TM 0.1 80 48947 RBD-TM 1.0 3198 593,730 RBD-TM 0.1 73 31,090 NTD-TM 1.0 593 74,777 NTD-TM 0.1 twenty one 23,916 Example 7. Determination of the ratio of IgG2a and IgG1 of NTD-RBD-TM

將NTD-RBD-TM、mRNA NTD-RBD-TM之組合物以如下劑量投與給小鼠:0.1 μg及1 μg。在第1天投與初免劑量,且在第22天投與加强劑量。在第36天,評價S2P特異性IgG1及IgG2a之效價。參見 8A-8C。截至第36天,在兩個劑量水準下,IgG2a之效價皆高於IgG1之量。參見 8A。為了確定T細胞反應是否偏向Th1或Th2型反應,吾人繪製在第36天時間點之IgG2a/IgG1之比率。如 8B中所示,NTD-RBD-TM組合物誘導抗體免疫反應,該抗體免疫反應明確地在Th1型反應中。Th2型反應在疫苗開發中係不利的,此乃因其與驅動疾病增强有關。 實例 8. 免疫原性研究 The composition of NTD-RBD-TM, mRNA NTD-RBD-TM was administered to mice at the following doses: 0.1 μg and 1 μg. The prime dose was administered on day 1 and the booster dose was administered on day 22. On day 36, S2P-specific IgG1 and IgG2a titers were assessed. See Figures 8A-8C . By day 36, the titer of IgG2a was higher than that of IgG1 at both dose levels. See Figure 8A . To determine whether T cell responses were biased towards Th1 or Th2 type responses, we plotted the ratio of IgG2a/IgG1 at the day 36 time point. As shown in Figure 8B , the NTD-RBD-TM composition induced an antibody immune response that was clearly in a Th1-type response. Th2-type responses are disadvantageous in vaccine development as they are associated with driving disease enhancement. Example 8. Immunogenicity studies

29中列出之mRNA以如下劑量投與給小鼠:0.1 μg及1 μg (N =8)。在第1天投與初免劑量,且在第22天投與加强劑量。第21天及第36天在S2P包被之板上分析血清IgG效價。結果示於 29中。 29. 調配物 / 材料 劑量 (μg) 21 36 PBS 1.097 1.097 NTD-RBD-TM 1 4.237 6.036 0.1 2.711 5.106 S1-666-TM (SEQ ID NO: 16) 1 4.224 5.704 0.1 1.944 4.633 混合(50:50) NTD-EXT-F43C-TM (SEQ ID NO: 55) + RBD-Q563D-EXT-TM (SEQ ID NO: 88) 1 3.939 5.442 0.1 1.867 4.342 NTD-EXT-RBD-EXT-TM (SEQ ID NO: 121) 1 4.541 6.001 0.1 3.694 4.885 S1-594-TM (SEQ ID NO: 22) 1 4.027 5.517 0.1 3.313 4.780 S1-594-聚G-DS-TM (SEQ ID NO: 28) 1 3.974 5.531 0.1 2.551 4.425 S2-TM (SEQ ID NO: 145) 1 1.989 4.058 0.1 1.097 2.015 RBD-NTD-TM (SEQ ID NO: 139) 1 4.523 6.062 0.1 3.453 5.222 NTD-PADRE-RBD-TM (SEQ ID NO: 142) 1 4.489 6.040 0.1 3.386 5.301 額外序列 The mRNAs listed in Table 29 were administered to mice at the following doses: 0.1 μg and 1 μg (N = 8). The prime dose was administered on day 1 and the booster dose was administered on day 22. Serum IgG titers were analyzed on days 21 and 36 on S2P-coated plates. The results are shown in Table 29 . Table 29. Formulations / Materials Dose (μg) Day 21 _ Day 36 _ PBS 1.097 1.097 NTD-RBD-TM 1 4.237 6.036 0.1 2.711 5.106 S1-666-TM (SEQ ID NO: 16) 1 4.224 5.704 0.1 1.944 4.633 Mixed (50:50) NTD-EXT-F43C-TM (SEQ ID NO: 55) + RBD-Q563D-EXT-TM (SEQ ID NO: 88) 1 3.939 5.442 0.1 1.867 4.342 NTD-EXT-RBD-EXT-TM (SEQ ID NO: 121) 1 4.541 6.001 0.1 3.694 4.885 S1-594-TM (SEQ ID NO: 22) 1 4.027 5.517 0.1 3.313 4.780 S1-594-polyG-DS-TM (SEQ ID NO: 28) 1 3.974 5.531 0.1 2.551 4.425 S2-TM (SEQ ID NO: 145) 1 1.989 4.058 0.1 1.097 2.015 RBD-NTD-TM (SEQ ID NO: 139) 1 4.523 6.062 0.1 3.453 5.222 NTD-PADRE-RBD-TM (SEQ ID NO: 142) 1 4.489 6.040 0.1 3.386 5.301 extra sequence

應理解,本文所述之任何mRNA序列可包括5' UTR及/或3' UTR。UTR序列可選自以下序列,或可使用其他已知UTR序列。亦應理解,本文所述之任何mRNA構築體可進一步包含聚(A)尾及/或端帽(例如,7mG(5')ppp(5')NlmpNp)。此外,儘管本文所述之許多mRNA及編碼之抗原序列包括信號肽及/或肽標籤(例如,C末端His標籤),但應理解,所指示之信號肽及/或肽標籤可用不同信號肽及/或肽標籤取代,或者可省略信號肽及/或肽標籤。

Figure 02_image017
SARS-CoV-2 野生型刺突 (S) 蛋白 SEQ ID NO: 123自5'端至3'端由以下組成:5' UTR SEQ ID NO: 2、mRNA ORF SEQ ID NO: 124及3' UTR SEQ ID NO: 4。 123 化學 1-甲基假尿苷    端帽 7mG(5')ppp(5')NlmpNp    5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 mRNA構築體之ORF (不包括終止密碼子) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCAACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGGCGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACAAGGUGGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGGCUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 124 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 相應之胺基酸序列 MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 125 聚A尾 100 nt    It is understood that any of the mRNA sequences described herein may include the 5' UTR and/or the 3' UTR. The UTR sequence can be selected from the following sequences, or other known UTR sequences can be used. It is also understood that any of the mRNA constructs described herein may further comprise a poly(A) tail and/or end cap (eg, 7mG(5')ppp(5')NlmpNp). In addition, although many of the mRNA and encoded antigen sequences described herein include signal peptides and/or peptide tags (eg, a C-terminal His tag), it should be understood that the indicated signal peptides and/or peptide tags may be used with different signal peptides and/or peptide tags. /or peptide tag substitution, or the signal peptide and/or peptide tag may be omitted.
Figure 02_image017
SARS-CoV-2 wild-type spike (S) protein SEQ ID NO: 123 consists of the following from the 5' end to the 3' end: 5' UTR SEQ ID NO: 2, mRNA ORF SEQ ID NO: 124 and 3' UTR SEQ ID NO: 4. 123 Chemical 1-Methylpseudouridine end cap 7mG(5')ppp(5')NlmpNp 5' UTR GGGAAAUAAGAGAGAAAAGAAGAGUAAGAAGAAAUAUAAGACCCCGGCGCCGCCACC 2 ORF of mRNA construct (excluding stop codon) AUGUUCGUGUUCCUGGUGCUGCUGCCCCUGGUGAGCAGCCAGUGCGUGAACCUGACCACCCGGACCCAGCUGCCACCAGCCUACACCAACAGCUUCACCCGGGGCGUCUACUACCCCGACAAGGUGUUCCGGAGCAGCGUCCUGCACAGCACCCAGGACCUGUUCCUGCCCUUCUUCAGCAACGUGACCUGGUUCCACGCCAUCCACGUGAGCGGCACCAACGGCACCAAGCGGUUCGACAACCCCGUGCUGCCCUUCAACGACGGCGUGUACUUCGCCAGCACCGAGAAGAGCAACAUCAUCCGGGGCUGGAUCUUCGGCACCACCCUGGACAGCAAGACCCAGAGCCUGCUGAUCGUGAAUAACGCCACCAACGUGGUGAUCAAGGUGUGCGAGUUCCAGUUCUGCAACGACCCCUUCCUGGGCGUGUACUACCACAAGAACAACAAGAGCUGGAUGGAGAGCGAGUUCCGGGUGUACAGCAGCGCCAACAACUGCACCUUCGAGUACGUGAGCCAGCCCUUCCUGAUGGACCUGGAGGGCAAGCAGGGCAACUUCAAGAACCUGCGGGAGUUCGUGUUCAAGAACAUCGACGGCUACUUCAAGAUCUACAGCAAGCACACCCCAAUCAACCUGGUGCGGGAUCUGCCCCAGGGCUUCUCAGCCCUGGAGCCCCUGGUGGACCUGCCCAUCGGCAUCAACAUCACCCGGUUCCAGACCCUGCUGGCCCUGCACCGGAGCUACCUGACCCCAGGCGACAGCAGCAGCGGGUGGACAGCAGGCGCGGCUGCUUACUACGUGGGCUACCUGCAGCCCCGGACCUUCCUGCUGAAGUACAACGAGAACGGCACCAUCACCGACGCCGUGGACUGCGCCCUGGACCCUCUGAGCGAGACCAAGUGCACCCUGAAGAGCUUCACCGUGGAGAAGGGCAUCUACCAGACCAGCAACUUCCGGGUGCAGCCCACCGAGAGCAUCGUGCGGUUCCCCAACAUCACCA ACCUGUGCCCCUUCGGCGAGGUGUUCAACGCCACCCGGUUCGCCAGCGUGUACGCCUGGAACCGGAAGCGGAUCAGCAACUGCGUGGCCGACUACAGCGUGCUGUACAACAGCGCCAGCUUCAGCACCUUCAAGUGCUACGGCGUGAGCCCCACCAAGCUGAACGACCUGUGCUUCACCAACGUGUACGCCGACAGCUUCGUGAUCCGUGGCGACGAGGUGCGGCAGAUCGCACCCGGCCAGACAGGCAAGAUCGCCGACUACAACUACAAGCUGCCCGACGACUUCACCGGCUGCGUGAUCGCCUGGAACAGCAACAACCUCGACAGCAAGGUGGGCGGCAACUACAACUACCUGUACCGGCUGUUCCGGAAGAGCAACCUGAAGCCCUUCGAGCGGGACAUCAGCACCGAGAUCUACCAAGCCGGCUCCACCCCUUGCAACGGCGUGGAGGGCUUCAACUGCUACUUCCCUCUGCAGAGCUACGGCUUCCAGCCCACCAACGGCGUGGGCUACCAGCCCUACCGGGUGGUGGUGCUGAGCUUCGAGCUGCUGCACGCCCCAGCCACCGUGUGUGGCCCCAAGAAGAGCACCAACCUGGUGAAGAACAAGUGCGUGAACUUCAACUUCAACGGCCUUACCGGCACCGGCGUGCUGACCGAGAGCAACAAGAAAUUCCUGCCCUUUCAGCAGUUCGGCCGGGACAUCGCCGACACCACCGACGCUGUGCGGGAUCCCCAGACCCUGGAGAUCCUGGACAUCACCCCUUGCAGCUUCGGCGGCGUGAGCGUGAUCACCCCAGGCACCAACACCAGCAACCAGGUGGCCGUGCUGUACCAGGACGUGAACUGCACCGAGGUGCCCGUGGCCAUCCACGCCGACCAGCUGACACCCACCUGGCGGGUCUACAGCACCGGCAGCAACGUGUUCCAGACCCGGGCCGGUUGCCUGAUCGGCGCCGAGCACGUGAACAACAGCUACGAGUGCGACAUCCCCAUCGG CGCCGGCAUCUGUGCCAGCUACCAGACCCAGACCAAUUCACCCCGGAGGGCAAGGAGCGUGGCCAGCCAGAGCAUCAUCGCCUACACCAUGAGCCUGGGCGCCGAGAACAGCGUGGCCUACAGCAACAACAGCAUCGCCAUCCCCACCAACUUCACCAUCAGCGUGACCACCGAGAUUCUGCCCGUGAGCAUGACCAAGACCAGCGUGGACUGCACCAUGUACAUCUGCGGCGACAGCACCGAGUGCAGCAACCUGCUGCUGCAGUACGGCAGCUUCUGCACCCAGCUGAACCGGGCCCUGACCGGCAUCGCCGUGGAGCAGGACAAGAACACCCAGGAGGUGUUCGCCCAGGUGAAGCAGAUCUACAAGACCCCUCCCAUCAAGGACUUCGGCGGCUUCAACUUCAGCCAGAUCCUGCCCGACCCCAGCAAGCCCAGCAAGCGGAGCUUCAUCGAGGACCUGCUGUUCAACAAGGUGACCCUAGCCGACGCCGGCUUCAUCAAGCAGUACGGCGACUGCCUCGGCGACAUAGCCGCCCGGGACCUGAUCUGCGCCCAGAAGUUCAACGGCCUGACCGUGCUGCCUCCCCUGCUGACCGACGAGAUGAUCGCCCAGUACACCAGCGCCCUGUUAGCCGGAACCAUCACCAGCGGCUGGACUUUCGGCGCUGGAGCCGCUCUGCAGAUCCCCUUCGCCAUGCAGAUGGCCUACCGGUUCAACGGCAUCGGCGUGACCCAGAACGUGCUGUACGAGAACCAGAAGCUGAUCGCCAACCAGUUCAACAGCGCCAUCGGCAAGAUCCAGGACAGCCUGAGCAGCACCGCUAGCGCCCUGGGCAAGCUGCAGGACGUGGUGAACCAGAACGCCCAGGCCCUGAACACCCUGGUGAAGCAGCUGAGCAGCAACUUCGGCGCCAUCAGCAGCGUGCUGAACGACAUCCUGAGCCGGCUGGACAAGGUGGAGGCCGAGGUGCAGAUCGACCGGCUGAUCACUGGCCGG CUGCAGAGCCUGCAGACCUACGUGACCCAGCAGCUGAUCCGGGCCGCCGAGAUUCGGGCCAGCGCCAACCUGGCCGCCACCAAGAUGAGCGAGUGCGUGCUGGGCCAGAGCAAGCGGGUGGACUUCUGCGGCAAGGGCUACCACCUGAUGAGCUUUCCCCAGAGCGCACCCCACGGAGUGGUGUUCCUGCACGUGACCUACGUGCCCGCCCAGGAGAAGAACUUCACCACCGCCCCAGCCAUCUGCCACGACGGCAAGGCCCACUUUCCCCGGGAGGGCGUGUUCGUGAGCAACGGCACCCACUGGUUCGUGACCCAGCGGAACUUCUACGAGCCCCAGAUCAUCACCACCGACAACACCUUCGUGAGCGGCAACUGCGACGUGGUGAUCGGCAUCGUGAACAACACCGUGUACGAUCCCCUGCAGCCCGAGCUGGACAGCUUCAAGGAGGAGCUGGACAAGUACUUCAAGAAUCACACCAGCCCCGACGUGGACCUGGGCGACAUCAGCGGCAUCAACGCCAGCGUGGUGAACAUCCAGAAGGAGAUCGAUCGGCUGAACGAGGUGGCCAAGAACCUGAACGAGAGCCUGAUCGACCUGCAGGAGCUGGGCAAGUACGAGCAGUACAUCAAGUGGCCCUGGUACAUCUGGCUGGGCUUCAUCGCCGGCCUGAUCGCCAUCGUGAUGGUGACCAUCAUGCUGUGCUGCAUGACCAGCUGCUGCAGCUGCCUGAAGGGCUGUUGCAGCUGCGGCAGCUGCUGCAAGUUCGACGAGGACGACAGCGAGCCCGUGCUGAAGGGCGUGAAGCUGCACUACACC 124 3' UTR UGAUAAUAGGCUGGAGCCUCGGUGGCCUAGCUUCUUGCCCCUUGGGCCUCCCCCCAGCCCCUCCUCCCCUUCCUGCACCCGUACCCCCGUGGUCUUUGAAUAAAGUCUGAGUGGGCGGC 4 Corresponding amino acid sequence MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGR LQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 125 Poly A tail 100nt

亦應理解,本文所述之開放閱讀框及/或相應胺基酸序列中之任一者可包括或排除信號序列。It is also understood that any of the open reading frames and/or corresponding amino acid sequences described herein may include or exclude signal sequences.

本文揭示之所有參考文獻、專利及專利申請皆關於其每一者所列舉之標的物以引用方式併入,其在一些情形下可涵蓋整個文件。All references, patents, and patent applications disclosed herein are incorporated by reference, which in some cases may cover the entire document, with respect to the subject matter recited in each of them.

除非明確指示相反含義,否則如本說明書及申請專利範圍中所用之不定冠詞「一(a及an)」應理解為意指「至少一個」。The indefinite articles "a (a and an)" as used in this specification and the scope of the claims should be understood to mean "at least one" unless the contrary meaning is expressly indicated.

亦應理解,除非明確指示相反含義,否則在本文所主張之包括一個以上步驟或動作之任何方法中,該方法之步驟或動作之順序不必受限於列舉該方法之步驟或動作之順序。It should also be understood that in any method claimed herein that includes more than one step or action, the order of the steps or actions of the method is not necessarily limited to the order in which the steps or actions of the method are listed, unless expressly indicated to the contrary.

在申請專利範圍中以及在上文說明書中,所有過渡片語(諸如「包含」、「包括」、「携載」、「具有」、「含有」、「涉及」、「容納」、「由...構成」及諸如此類)應理解爲係開放式的,亦即,意指包括但不限於。如美國專利局專利審查規程手册(United States Patent Office Manual of Patent Examining Procedures)第2111.03節所述,僅過渡片語「由……組成」及「基本上由……組成」分別應當係封閉式或半封閉式過渡片語。In the scope of the patent application and in the above specification, all transitional phrases (such as "comprising", "including", "carrying", "having", "containing", "involving", "containing", "by. .. constitutes" and the like) shall be construed as open ended, that is, meant to include, but not be limited to. As stated in Section 2111.03 of the United States Patent Office Manual of Patent Examining Procedures, only the transitional phrases "consisting of" and "consisting essentially of" should be closed or closed, respectively. Semi-closed transitional phrase.

在數值之前之術語「約」及「實質上」意指所列舉之數值之±10%。The terms "about" and "substantially" preceding a numerical value mean ±10% of the recited numerical value.

在提供值之範圍之情况下,本文具體考慮及闡述該範圍之上端及下端之間之每一值。Where a range of values is provided, each value between the upper and lower end of the range is specifically considered and set forth herein.

國際申請案號PCT/US2015/02740、PCT/US2016/043348、PCT/US2016/043332、PCT/US2016/058327、PCT/US2016/058324、PCT/US2016/058314、PCT/US2016/058310、PCT/US2016/058321、PCT/US2016/058297、PCT/US2016/058319及PCT/US2016/058314之整個內容以引用方式併入本文中。International application case numbers PCT/US2015/02740, PCT/US2016/043348, PCT/US2016/043332, PCT/US2016/058327, PCT/US2016/058324, PCT/US2016/058314, PCT/US2016/058310, PCT/US2016/ 058321, PCT/US2016/058297, PCT/US2016/058319, and PCT/US2016/058314 are incorporated herein by reference in their entirety.

1由本發明之mRNA編碼之野生型及2P刺突蛋白抗原之示意圖;信號肽(SP),無填充;N末端結構域(NTD),有點的;受體結合結構域(RBD),向下對角條帶;子結構域1 (SD1),水平條帶;子結構域2 (SD2),波形;融合肽(FP),向上對角條帶;七肽重複區1 (HR1),編織形;七肽重複區2 (HR2),對角磚形;(TM),垂直條帶;及胞質尾區(CT),磚形。 2A- 2I顯示由實例1-3中所述之mRNA編碼之抗原之示例性綫性設計。 3顯示圖2中繪示之抗原之序列比對。 4顯示由實例4-6中所述之mRNA編碼之抗原之示例性綫性設計。 5顯示本文所述之各種S1次單元抗原之序列比對。 6A- 6D顯示由實例7及8中所述之mRNA編碼之抗原之示例性綫性設計。 7顯示中和與ELISA效價之相關性。 8A- 8C顯示用LNP中之編碼NTD-RBD-TM之mRNA對小鼠進行第1天初免及第21天加强劑量後第36天之血清IgG1效價及IgG2a效價。 Figure 1 Schematic representation of wild-type and 2P spike protein antigens encoded by mRNAs of the invention; signal peptide (SP), unfilled; N-terminal domain (NTD), dotted; receptor binding domain (RBD), down Diagonal band; subdomain 1 (SD1), horizontal band; subdomain 2 (SD2), vimentum; fusion peptide (FP), upward diagonal band; heptapeptide repeat 1 (HR1), braided ; heptapeptide repeat 2 (HR2), diagonal brick; (TM), vertical bars; and cytoplasmic tail (CT), brick. Figures 2A - 2I show exemplary linear designs of antigens encoded by mRNAs described in Examples 1-3. FIG. 3 shows a sequence alignment of the antigens depicted in FIG. 2 . Figure 4 shows an exemplary linear design of the antigens encoded by the mRNAs described in Examples 4-6. Figure 5 shows a sequence alignment of the various S1 subunit antigens described herein. 6A - 6D show exemplary linear designs of antigens encoded by mRNAs described in Examples 7 and 8. Figure 7 shows the correlation of neutralization with ELISA titers. Figures 8A - 8C show serum IgGl titers and IgG2a titers at day 36 after day 1 prime and day 21 booster doses in mice with mRNA encoding NTD-RBD-TM in LNP.

                         
          <![CDATA[<110>  美商現代公司(ModernaTX, Inc.)]]>
          <![CDATA[<120>  SARS-COV-2 MRNA結構域疫苗]]>
          <![CDATA[<130>  M1378.70157]]>
          <![CDATA[<140>  TW 110123289]]>
          <![CDATA[<141>  2021-06-25]]>
          <![CDATA[<150>  US 63/063,137]]>
          <![CDATA[<151>  2020-08-07]]>
          <![CDATA[<150>  US 63/044,330]]>
          <![CDATA[<151>  2020-06-25]]>
          <![CDATA[<160>  152]]>
          <![CDATA[<170>  PatentIn 3.5版]]>
          <![CDATA[<210>  1]]>
          <![CDATA[<211>  2216]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  1]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucauga     2100
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc     2160
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc         2216
          <![CDATA[<210>  2]]>
          <![CDATA[<211>  57]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  2]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccacc          57
          <![CDATA[<210>  3]]>
          <![CDATA[<211>  2040]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  3]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          <![CDATA[<210>  4]]>
          <![CDATA[<211>  119]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  4]]>
          ugauaauagg cuggagccuc gguggccuag cuucuugccc cuugggccuc cccccagccc       60
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc       119
          <![CDATA[<210>  5]]>
          <![CDATA[<211>  680]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  5]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser 
                  675                 680 
          <![CDATA[<210>  6]]>
          <![CDATA[<211>  2729]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  6]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucagga     2100
          ggaggcagcg gcggcgauau caucaagcuu cugaacgagc aaguuaacaa ggaaaugcag     2160
          agcaguaauc ucuacaugag caugagcagc uggugcuaca cccacucccu ggacggagca     2220
          ggccucuucc uguucgacca cgcagccgag gaguacgagc acgcuaagaa guugaucauu     2280
          uucuugaacg agaacaacgu gcccgugcag cuaacgucaa ucagcgcacc ugagcacaag     2340
          uucgagggcc ugacccagau cuuccagaag gccuacgaac acgaacagca caucuccgag     2400
          agcaucaaca auauugugga ucacgcuauc aaguccaagg accacgcuac cuucaacuuc     2460
          cugcaguggu acguggccga gcaacaugag gaggaggugc uguucaagga cauccuggac     2520
          aagaucgagc ugaucgguaa ugagaaucac ggccuguacc uggccgacca guacgugaag     2580
          ggcaucgcca agagccggaa gucaggcuca ugauaauagg cuggagccuc gguggccuag     2640
          cuucuugccc cuugggccuc cccccagccc cuccuccccu uccugcaccc guacccccgu     2700
          ggucuuugaa uaaagucuga gugggcggc                                       2729
          <![CDATA[<210>  7]]>
          <![CDATA[<211>  2553]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  7]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          ggaggaggca gcggcggcga uaucaucaag cuucugaacg agcaaguuaa caaggaaaug     2100
          cagagcagua aucucuacau gagcaugagc agcuggugcu acacccacuc ccuggacgga     2160
          gcaggccucu uccuguucga ccacgcagcc gaggaguacg agcacgcuaa gaaguugauc     2220
          auuuucuuga acgagaacaa cgugcccgug cagcuaacgu caaucagcgc accugagcac     2280
          aaguucgagg gccugaccca gaucuuccag aaggccuacg aacacgaaca gcacaucucc     2340
          gagagcauca acaauauugu ggaucacgcu aucaagucca aggaccacgc uaccuucaac     2400
          uuccugcagu gguacguggc cgagcaacau gaggaggagg ugcuguucaa ggacauccug     2460
          gacaagaucg agcugaucgg uaaugagaau cacggccugu accuggccga ccaguacgug     2520
          aagggcaucg ccaagagccg gaagucaggc uca                                  2553
          <![CDATA[<210>  8]]>
          <![CDATA[<211>  851]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  8]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Gly Gly Asp Ile 
                  675                 680                 685             
          Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn 
              690                 695                 700                 
          Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly 
          705                 710                 715                 720 
          Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala 
                          725                 730                 735     
          Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu 
                      740                 745                 750         
          Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile 
                  755                 760                 765             
          Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn 
              770                 775                 780                 
          Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn 
          785                 790                 795                 800 
          Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe 
                          805                 810                 815     
          Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly 
                      820                 825                 830         
          Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys 
                  835                 840                 845             
          Ser Gly Ser 
              850     
          <![CDATA[<210>  9]]>
          <![CDATA[<211>  2762]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  9]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          ggcauccugc ccagcccugg caugcccgcu cugcugagcc uggugagccu gcugagcgug      120
          cugcugaugg gcugcguggc ugagaccggc augcagaucu acgagggcaa gcugaccgca      180
          gagggccugc gguucggcau cguggccagc cgcgccaacc acgcucuggu ggaccggcuu      240
          guggagggcg cuaucgacgc caucgugaga cacggcggcc gggaagagga caucacccug      300
          gugcgggugu gcggcagcug ggagauuccc gucgccgccg gagaacuggc ccggaaggag      360
          gacaucgacg ccgugaucgc caucggcgug cugugcagag gcgccacgcc cagcuucgac      420
          uacaucgcca gcgaggugag caagggccug gccgaccuga gccuggagcu gcggaagccc      480
          aucaccuucg gcgugaucac cgccgacacc cuggagcagg ccaucgaggc cgcaggcacc      540
          ugccacggca acaagggcug ggaagccgcc cugugcgcca ucgagauggc caaccuguuc      600
          aagagccugc ggggcggaag uggaggcucu gguggcagcg gaggaucugg cggcggcacc      660
          acccggaccc agcugccacc agccuacacc aacagcuuca cccggggcgu cuacuacccc      720
          gacaaggugu uccggagcag cguccugcac agcacccagg accuguuccu gcccuucuuc      780
          agcaacguga ccugguucca cgccauccac gugagcggca ccaacggcac caagcgguuc      840
          gacaaccccg ugcugcccuu caacgacggc guguacuucg ccagcaccga gaagagcaac      900
          aucauccggg gcuggaucuu cggcaccacc cuggacagca agacccagag ccugcugauc      960
          gugaauaacg ccaccaacgu ggugaucaag gugugcgagu uccaguucug caacgacccc     1020
          uuccugggcg uguacuacca caagaacaac aagagcugga uggagagcga guuccgggug     1080
          uacagcagcg ccaacaacug caccuucgag uacgugagcc agcccuuccu gauggaccug     1140
          gagggcaagc agggcaacuu caagaaccug cgggaguucg uguucaagaa caucgacggc     1200
          uacuucaaga ucuacagcaa gcacacccca aucaaccugg ugcgggaucu gccccagggc     1260
          uucucagccc uggagccccu gguggaccug cccaucggca ucaacaucac ccgguuccag     1320
          acccugcugg cccugcaccg gagcuaccug accccaggcg acagcagcag cggguggaca     1380
          gcaggcgcgg cugcuuacua cgugggcuac cugcagcccc ggaccuuccu gcugaaguac     1440
          aacgagaacg gcaccaucac cgacgccgug gacugcgccc uggacccucu gagcgagacc     1500
          aagugcaccc ugaagagcuu caccguggag aagggcaucu accagaccag caacuuccgg     1560
          gugcagccca ccgagagcau cgugcgguuc cccaacauca ccaaccugug ccccuucggc     1620
          gagguguuca acgccacccg guucgccagc guguacgccu ggaaccggaa gcggaucagc     1680
          aacugcgugg ccgacuacag cgugcuguac aacagcgcca gcuucagcac cuucaagugc     1740
          uacggcguga gccccaccaa gcugaacgac cugugcuuca ccaacgugua cgccgacagc     1800
          uucgugaucc guggcgacga ggugcggcag aucgcacccg gccagacagg caagaucgcc     1860
          gacuacaacu acaagcugcc cgacgacuuc accggcugcg ugaucgccug gaacagcaac     1920
          aaccucgaca gcaagguggg cggcaacuac aacuaccugu accggcuguu ccggaagagc     1980
          aaccugaagc ccuucgagcg ggacaucagc accgagaucu accaagccgg cuccaccccu     2040
          ugcaacggcg uggagggcuu caacugcuac uucccucugc agagcuacgg cuuccagccc     2100
          accaacggcg ugggcuacca gcccuaccgg gugguggugc ugagcuucga gcugcugcac     2160
          gccccagcca ccgugugugg ccccaagaag agcaccaacc uggugaagaa caagugcgug     2220
          aacuucaacu ucaacggccu uaccggcacc ggcgugcuga ccgagagcaa caagaaauuc     2280
          cugcccuuuc agcaguucgg ccgggacauc gccgacacca ccgacgcugu gcgggauccc     2340
          cagacccugg agauccugga caucaccccu ugcagcuucg gcggcgugag cgugaucacc     2400
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag     2460
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc     2520
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc     2580
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau     2640
          ucaugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag     2700
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg     2760
          gc                                                                    2762
          <![CDATA[<210>  10]]>
          <![CDATA[<211>  2586]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  10]]>
          augggcaucc ugcccagccc uggcaugccc gcucugcuga gccuggugag ccugcugagc       60
          gugcugcuga ugggcugcgu ggcugagacc ggcaugcaga ucuacgaggg caagcugacc      120
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg      180
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc      240
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag      300
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc      360
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag      420
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc      480
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug      540
          uucaagagcc ugcggggcgg aaguggaggc ucugguggca gcggaggauc uggcggcggc      600
          accacccgga cccagcugcc accagccuac accaacagcu ucacccgggg cgucuacuac      660
          cccgacaagg uguuccggag cagcguccug cacagcaccc aggaccuguu ccugcccuuc      720
          uucagcaacg ugaccugguu ccacgccauc cacgugagcg gcaccaacgg caccaagcgg      780
          uucgacaacc ccgugcugcc cuucaacgac ggcguguacu ucgccagcac cgagaagagc      840
          aacaucaucc ggggcuggau cuucggcacc acccuggaca gcaagaccca gagccugcug      900
          aucgugaaua acgccaccaa cguggugauc aaggugugcg aguuccaguu cugcaacgac      960
          cccuuccugg gcguguacua ccacaagaac aacaagagcu ggauggagag cgaguuccgg     1020
          guguacagca gcgccaacaa cugcaccuuc gaguacguga gccagcccuu ccugauggac     1080
          cuggagggca agcagggcaa cuucaagaac cugcgggagu ucguguucaa gaacaucgac     1140
          ggcuacuuca agaucuacag caagcacacc ccaaucaacc uggugcggga ucugccccag     1200
          ggcuucucag cccuggagcc ccugguggac cugcccaucg gcaucaacau cacccgguuc     1260
          cagacccugc uggcccugca ccggagcuac cugaccccag gcgacagcag cagcgggugg     1320
          acagcaggcg cggcugcuua cuacgugggc uaccugcagc cccggaccuu ccugcugaag     1380
          uacaacgaga acggcaccau caccgacgcc guggacugcg cccuggaccc ucugagcgag     1440
          accaagugca cccugaagag cuucaccgug gagaagggca ucuaccagac cagcaacuuc     1500
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc     1560
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc     1620
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag     1680
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac     1740
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc     1800
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc     1860
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag     1920
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc     1980
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag     2040
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug     2100
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc     2160
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa     2220
          uuccugcccu uucagcaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau     2280
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu ucggcggcgu gagcgugauc     2340
          accccaggca ccaacaccag caaccaggug gccgugcugu accaggacgu gaacugcacc     2400
          gaggugcccg uggccaucca cgccgaccag cugacaccca ccuggcgggu cuacagcacc     2460
          ggcagcaacg uguuccagac ccgggccggu ugccugaucg gcgccgagca cgugaacaac     2520
          agcuacgagu gcgacauccc caucggcgcc ggcaucugug ccagcuacca gacccagacc     2580
          aauuca                                                                2586
          <![CDATA[<210>  11]]>
          <![CDATA[<211>  862]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  11]]>
          Met Gly Ile Leu Pro Ser Pro Gly Met Pro Ala Leu Leu Ser Leu Val 
          1               5                   10                  15      
          Ser Leu Leu Ser Val Leu Leu Met Gly Cys Val Ala Glu Thr Gly Met 
                      20                  25                  30          
          Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly Ile 
                  35                  40                  45              
          Val Ala Ser Arg Ala Asn His Ala Leu Val Asp Arg Leu Val Glu Gly 
              50                  55                  60                  
          Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile Thr 
          65                  70                  75                  80  
          Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro Val Ala Ala Gly Glu 
                          85                  90                  95      
          Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val Leu 
                      100                 105                 110         
          Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile Ala Ser Glu Val Ser 
                  115                 120                 125             
          Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr Phe 
              130                 135                 140                 
          Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Ala Ala Gly 
          145                 150                 155                 160 
          Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Cys Ala Ile Glu 
                          165                 170                 175     
          Met Ala Asn Leu Phe Lys Ser Leu Arg Gly Gly Ser Gly Gly Ser Gly 
                      180                 185                 190         
          Gly Ser Gly Gly Ser Gly Gly Gly Thr Thr Arg Thr Gln Leu Pro Pro 
                  195                 200                 205             
          Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val 
              210                 215                 220                 
          Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe 
          225                 230                 235                 240 
          Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn 
                          245                 250                 255     
          Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val 
                      260                 265                 270         
          Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe 
                  275                 280                 285             
          Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn 
              290                 295                 300                 
          Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp 
          305                 310                 315                 320 
          Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu 
                          325                 330                 335     
          Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr 
                      340                 345                 350         
          Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe 
                  355                 360                 365             
          Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys 
              370                 375                 380                 
          Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln 
          385                 390                 395                 400 
          Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn 
                          405                 410                 415     
          Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr 
                      420                 425                 430         
          Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr 
                  435                 440                 445             
          Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn 
              450                 455                 460                 
          Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu 
          465                 470                 475                 480 
          Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln 
                          485                 490                 495     
          Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro 
                      500                 505                 510         
          Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg 
                  515                 520                 525             
          Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val 
              530                 535                 540                 
          Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys 
          545                 550                 555                 560 
          Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn 
                          565                 570                 575     
          Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile 
                      580                 585                 590         
          Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro 
                  595                 600                 605             
          Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp 
              610                 615                 620                 
          Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys 
          625                 630                 635                 640 
          Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln 
                          645                 650                 655     
          Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe 
                      660                 665                 670         
          Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln 
                  675                 680                 685             
          Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala 
              690                 695                 700                 
          Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys 
          705                 710                 715                 720 
          Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu 
                          725                 730                 735     
          Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala 
                      740                 745                 750         
          Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp 
                  755                 760                 765             
          Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr 
              770                 775                 780                 
          Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr 
          785                 790                 795                 800 
          Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg 
                          805                 810                 815     
          Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu 
                      820                 825                 830         
          Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile 
                  835                 840                 845             
          Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser 
              850                 855                 860         
          <![CDATA[<210>  12]]>
          <![CDATA[<211>  2714]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  12]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucagga     2100
          ggaggcuccg gaggcgguag cgcugagacc ggcaugcaga ucuacgaggg caagcugacc     2160
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg     2220
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc     2280
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag     2340
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc     2400
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag     2460
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc     2520
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug     2580
          uucaagagcc ugcggugaua auaggcugga gccucggugg ccuagcuucu ugccccuugg     2640
          gccucccccc agccccuccu ccccuuccug cacccguacc cccguggucu uugaauaaag     2700
          ucugaguggg cggc                                                       2714
          <![CDATA[<210>  13]]>
          <![CDATA[<211>  2538]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  13]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          ggaggaggcu ccggaggcgg uagcgcugag accggcaugc agaucuacga gggcaagcug     2100
          accgcagagg gccugcgguu cggcaucgug gccagccgcg ccaaccacgc ucugguggac     2160
          cggcuugugg agggcgcuau cgacgccauc gugagacacg gcggccggga agaggacauc     2220
          acccuggugc gggugugcgg cagcugggag auucccgucg ccgccggaga acuggcccgg     2280
          aaggaggaca ucgacgccgu gaucgccauc ggcgugcugu gcagaggcgc cacgcccagc     2340
          uucgacuaca ucgccagcga ggugagcaag ggccuggccg accugagccu ggagcugcgg     2400
          aagcccauca ccuucggcgu gaucaccgcc gacacccugg agcaggccau cgaggccgca     2460
          ggcaccugcc acggcaacaa gggcugggaa gccgcccugu gcgccaucga gauggccaac     2520
          cuguucaaga gccugcgg                                                   2538
          <![CDATA[<210>  14]]>
          <![CDATA[<211>  846]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  14]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Gly Gly Gly Ser 
                  675                 680                 685             
          Ala Glu Thr Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly 
              690                 695                 700                 
          Leu Arg Phe Gly Ile Val Ala Ser Arg Ala Asn His Ala Leu Val Asp 
          705                 710                 715                 720 
          Arg Leu Val Glu Gly Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg 
                          725                 730                 735     
          Glu Glu Asp Ile Thr Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro 
                      740                 745                 750         
          Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile 
                  755                 760                 765             
          Ala Ile Gly Val Leu Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile 
              770                 775                 780                 
          Ala Ser Glu Val Ser Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg 
          785                 790                 795                 800 
          Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala 
                          805                 810                 815     
          Ile Glu Ala Ala Gly Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala 
                      820                 825                 830         
          Leu Cys Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg 
                  835                 840                 845     
          <![CDATA[<210>  15]]>
          <![CDATA[<211>  2300]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  15]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucaucu     2100
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug     2160
          agccugggcg ccaucagcuu cugauaauag gcuggagccu cgguggccua gcuucuugcc     2220
          ccuugggccu ccccccagcc ccuccucccc uuccugcacc cguacccccg uggucuuuga     2280
          auaaagucug agugggcggc                                                 2300
          <![CDATA[<210>  16]]>
          <![CDATA[<211>  2124]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  16]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug     2100
          gugagccugg gcgccaucag cuuc                                            2124
          <![CDATA[<210>  17]]>
          <![CDATA[<211>  707]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  17]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Ile Leu Ala Ile 
                  675                 680                 685             
          Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala 
              690                 695                 700                 
          Ile Ser Phe 
          705         
          <![CDATA[<210>  18]]>
          <![CDATA[<211>  1853]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  18]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc ucuggcggag gcagcauccu ggccaucuac     1680
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa     1740
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccuccuc     1800
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc            1853
          <![CDATA[<210>  19]]>
          <![CDATA[<211>  1677]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  19]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accucuggcg gaggcagcau ccuggccauc     1620
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc        1677
          <![CDATA[<210>  20]]>
          <![CDATA[<211>  559]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  20]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val 
              530                 535                 540                 
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
          545                 550                 555                 
          <![CDATA[<210>  21]]>
          <![CDATA[<211>  2042]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  21]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcu cuggcggagg cagcauccug     1860
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc     1920
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag     1980
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg     2040
          gc                                                                    2042
          <![CDATA[<210>  22]]>
          <![CDATA[<211>  1866]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  22]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcucuggcgg aggcagcauc     1800
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc     1860
          agcuuc                                                                1866
          <![CDATA[<210>  23]]>
          <![CDATA[<211>  622]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  23]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala 
                  595                 600                 605             
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
              610                 615                 620         
          <![CDATA[<210>  24]]>
          <![CDATA[<211>  2066]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  24]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu gggcagcggc ggcggcagcg gcggaggcag cggaggaggc     1020
          agcggcggag gcaguggagg ccagcccacc gagagcaucg ugcgguuccc caacaucacc     1080
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1140
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1200
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1260
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1320
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1380
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1440
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1500
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1560
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1620
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagaagag caccaaccug     1680
          gugaagaaca agugcgugaa cuucaacuuc aacggccuua ccggcaccgg cgugcugacc     1740
          gagagcaaca agaaauuccu gcccuuucag caguucggcc gggacaucgc cgacaccacc     1800
          gacgcugugc gggaucccca gacccuggag auccuggaca ucaccccuug cagcuucggc     1860
          ggcucuggcg gaggcagcau ccuggccauc uacagcaccg uggccagcag ccuggugcug     1920
          cuggugagcc ugggcgccau cagcuucuga uaauaggcug gagccucggu ggccuagcuu     1980
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu     2040
          cuuugaauaa agucugagug ggcggc                                          2066
          <![CDATA[<210>  25]]>
          <![CDATA[<211>  1890]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  25]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cgugggcagc ggcggcggca gcggcggagg cagcggagga      960
          ggcagcggcg gaggcagugg aggccagccc accgagagca ucgugcgguu ccccaacauc     1020
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc     1080
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1140
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1200
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1260
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1320
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1380
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1440
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1500
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1560
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaagaa gagcaccaac     1620
          cuggugaaga acaagugcgu gaacuucaac uucaacggcc uuaccggcac cggcgugcug     1680
          accgagagca acaagaaauu ccugcccuuu cagcaguucg gccgggacau cgccgacacc     1740
          accgacgcug ugcgggaucc ccagacccug gagauccugg acaucacccc uugcagcuuc     1800
          ggcggcucug gcggaggcag cauccuggcc aucuacagca ccguggccag cagccuggug     1860
          cugcugguga gccugggcgc caucagcuuc                                      1890
          <![CDATA[<210>  26]]>
          <![CDATA[<211>  630]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  26]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
          305                 310                 315                 320 
          Gly Ser Gly Gly Gly Ser Gly Gly Gln Pro Thr Glu Ser Ile Val Arg 
                          325                 330                 335     
          Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala 
                      340                 345                 350         
          Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn 
                  355                 360                 365             
          Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr 
              370                 375                 380                 
          Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe 
          385                 390                 395                 400 
          Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg 
                          405                 410                 415     
          Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys 
                      420                 425                 430         
          Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn 
                  435                 440                 445             
          Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe 
              450                 455                 460                 
          Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile 
          465                 470                 475                 480 
          Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys 
                          485                 490                 495     
          Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly 
                      500                 505                 510         
          Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala 
                  515                 520                 525             
          Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn 
              530                 535                 540                 
          Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu 
          545                 550                 555                 560 
          Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp 
                          565                 570                 575     
          Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile 
                      580                 585                 590         
          Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Ser Gly Gly Gly Ser Ile 
                  595                 600                 605             
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser 
              610                 615                 620                 
          Leu Gly Ala Ile Ser Phe 
          625                 630 
          <![CDATA[<210>  27]]>
          <![CDATA[<211>  2066]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  27]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu gggcagcggc ggcggcagcg gcggaggcag cggaggaggc     1020
          agcggcggag gcaguggagg ccagcccacc gagagcaucg ugcgguuccc caacaucacc     1080
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1140
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1200
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1260
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1320
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1380
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1440
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1500
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1560
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1620
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagaagag caccaaccug     1680
          gugaagaaca agugcgugaa cuucaacuuc aacggccuua ccggcaccgg cgugcugacc     1740
          gagagcaaca agaaauuccu gcccuuuugc caguucggcc gggacaucgc cgacaccacc     1800
          gacgcugugc gggaucccca gacccuggag auccuggaca ucaccccuug cagcuucggc     1860
          ggcucuggcg gaggcagcau ccuggccauc uacagcaccg uggccagcag ccuggugcug     1920
          cuggugagcc ugggcgccau cagcuucuga uaauaggcug gagccucggu ggccuagcuu     1980
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu     2040
          cuuugaauaa agucugagug ggcggc                                          2066
          <![CDATA[<210>  28]]>
          <![CDATA[<211>  1890]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  28]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cgugggcagc ggcggcggca gcggcggagg cagcggagga      960
          ggcagcggcg gaggcagugg aggccagccc accgagagca ucgugcgguu ccccaacauc     1020
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc     1080
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1140
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1200
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1260
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1320
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1380
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1440
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1500
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1560
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaagaa gagcaccaac     1620
          cuggugaaga acaagugcgu gaacuucaac uucaacggcc uuaccggcac cggcgugcug     1680
          accgagagca acaagaaauu ccugcccuuu ugccaguucg gccgggacau cgccgacacc     1740
          accgacgcug ugcgggaucc ccagacccug gagauccugg acaucacccc uugcagcuuc     1800
          ggcggcucug gcggaggcag cauccuggcc aucuacagca ccguggccag cagccuggug     1860
          cugcugguga gccugggcgc caucagcuuc                                      1890
          <![CDATA[<210>  29]]>
          <![CDATA[<211>  630]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  29]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
          305                 310                 315                 320 
          Gly Ser Gly Gly Gly Ser Gly Gly Gln Pro Thr Glu Ser Ile Val Arg 
                          325                 330                 335     
          Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala 
                      340                 345                 350         
          Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn 
                  355                 360                 365             
          Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr 
              370                 375                 380                 
          Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe 
          385                 390                 395                 400 
          Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg 
                          405                 410                 415     
          Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys 
                      420                 425                 430         
          Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn 
                  435                 440                 445             
          Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe 
              450                 455                 460                 
          Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile 
          465                 470                 475                 480 
          Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys 
                          485                 490                 495     
          Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly 
                      500                 505                 510         
          Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala 
                  515                 520                 525             
          Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn 
              530                 535                 540                 
          Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu 
          545                 550                 555                 560 
          Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Cys Gln Phe Gly Arg Asp 
                          565                 570                 575     
          Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile 
                      580                 585                 590         
          Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Ser Gly Gly Gly Ser Ile 
                  595                 600                 605             
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser 
              610                 615                 620                 
          Leu Gly Ala Ile Ser Phe 
          625                 630 
          <![CDATA[<210>  30]]>
          <![CDATA[<211>  3182]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  30]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu cggcggcagc     1020
          ggcggcguga gcgugaucac cccaggcacc aacaccagca accagguggc cgugcuguac     1080
          caggacguga acugcaccga ggugcccgug gccauccacg ccgaccagcu gacacccacc     1140
          uggcgggucu acagcaccgg cagcaacgug uuccagaccc gggccgguug ccugaucggc     1200
          gccgagcacg ugaacaacag cuacgagugc gacaucccca ucggcgccgg caucugugcc     1260
          agcuaccaga cccagaccaa uucaccccgg agggcaagga gcguggccag ccagagcauc     1320
          aucgccuaca ccaugagccu gggcgccgag aacagcgugg ccuacagcaa caacagcauc     1380
          gccaucccca ccaacuucac caucagcgug accaccgaga uucugcccgu gagcaugacc     1440
          aagaccagcg uggacugcac cauguacauc ugcggcgaca gcaccgagug cagcaaccug     1500
          cugcugcagu acggcagcuu cugcacccag cugaaccggg cccugaccgg caucgccgug     1560
          gagcaggaca agaacaccca ggagguguuc gcccagguga agcagaucua caagaccccu     1620
          cccaucaagg acuucggcgg cuucaacuuc agccagaucc ugcccgaccc cagcaagccc     1680
          agcaagcgga gcuucaucga ggaccugcug uucaacaagg ugacccuagc cgacgccggc     1740
          uucaucaagc aguacggcga cugccucggc gacauagccg cccgggaccu gaucugcgcc     1800
          cagaaguuca acggccugac cgugcugccu ccccugcuga ccgacgagau gaucgcccag     1860
          uacaccagcg cccuguuagc cggaaccauc accagcggcu ggacuuucgg cgcuggagcc     1920
          gcucugcaga uccccuucgc caugcagaug gccuaccggu ucaacggcau cggcgugacc     1980
          cagaacgugc uguacgagaa ccagaagcug aucgccaacc aguucaacag cgccaucggc     2040
          aagauccagg acagccugag cagcaccgcu agcgcccugg gcaagcugca ggacguggug     2100
          aaccagaacg cccaggcccu gaacacccug gugaagcagc ugagcagcaa cuucggcgcc     2160
          aucagcagcg ugcugaacga cauccugagc cggcuggacc cucccaacgc caccgugcag     2220
          aucgaccggc ugaucacugg ccggcugcag agccugcaga ccuacgugac ccagcagcug     2280
          auccgggccg ccgagauucg ggccagcgcc aaccuggccg ccaccaagau gagcgagugc     2340
          gugcugggcc agagcaagcg gguggacuuc ugcggcaagg gcuaccaccu gaugagcuuu     2400
          ccccagagcg caccccacgg agugguguuc cugcacguga ccuacgugcc cgcccaggag     2460
          aagaacuuca ccaccgcccc agccaucugc cacgacggca aggcccacuu uccccgggag     2520
          ggcguguucg ugagcaacgg cacccacugg uucgugaccc agcggaacuu cuacgagccc     2580
          cagaucauca ccaccgacaa caccuucgug agcggcaacu gcgacguggu gaucggcauc     2640
          gugaacaaca ccguguacga uccccugcag cccgagcugg acagcuucaa ggaggagcug     2700
          gacaaguacu ucaagaauca caccagcccc gacguggacc ugggcgacau cagcggcauc     2760
          aacgccagcg uggugaacau ccagaaggag aucgaucggc ugaacgaggu ggccaagaac     2820
          cugaacgaga gccugaucga ccugcaggag cugggcaagu acgagcagua caucaagugg     2880
          cccugguaca ucuggcuggg cuucaucgcc ggccugaucg ccaucgugau ggugaccauc     2940
          augcugugcu gcaugaccag cugcugcagc ugccugaagg gcuguugcag cugcggcagc     3000
          ugcugcaagu ucgacgagga cgacagcgag cccgugcuga agggcgugaa gcugcacuac     3060
          accugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag     3120
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg     3180
          gc                                                                    3182
          <![CDATA[<210>  31]]>
          <![CDATA[<211>  3006]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  31]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuucggcggc      960
          agcggcggcg ugagcgugau caccccaggc accaacacca gcaaccaggu ggccgugcug     1020
          uaccaggacg ugaacugcac cgaggugccc guggccaucc acgccgacca gcugacaccc     1080
          accuggcggg ucuacagcac cggcagcaac guguuccaga cccgggccgg uugccugauc     1140
          ggcgccgagc acgugaacaa cagcuacgag ugcgacaucc ccaucggcgc cggcaucugu     1200
          gccagcuacc agacccagac caauucaccc cggagggcaa ggagcguggc cagccagagc     1260
          aucaucgccu acaccaugag ccugggcgcc gagaacagcg uggccuacag caacaacagc     1320
          aucgccaucc ccaccaacuu caccaucagc gugaccaccg agauucugcc cgugagcaug     1380
          accaagacca gcguggacug caccauguac aucugcggcg acagcaccga gugcagcaac     1440
          cugcugcugc aguacggcag cuucugcacc cagcugaacc gggcccugac cggcaucgcc     1500
          guggagcagg acaagaacac ccaggaggug uucgcccagg ugaagcagau cuacaagacc     1560
          ccucccauca aggacuucgg cggcuucaac uucagccaga uccugcccga ccccagcaag     1620
          cccagcaagc ggagcuucau cgaggaccug cuguucaaca aggugacccu agccgacgcc     1680
          ggcuucauca agcaguacgg cgacugccuc ggcgacauag ccgcccggga ccugaucugc     1740
          gcccagaagu ucaacggccu gaccgugcug ccuccccugc ugaccgacga gaugaucgcc     1800
          caguacacca gcgcccuguu agccggaacc aucaccagcg gcuggacuuu cggcgcugga     1860
          gccgcucugc agauccccuu cgccaugcag auggccuacc gguucaacgg caucggcgug     1920
          acccagaacg ugcuguacga gaaccagaag cugaucgcca accaguucaa cagcgccauc     1980
          ggcaagaucc aggacagccu gagcagcacc gcuagcgccc ugggcaagcu gcaggacgug     2040
          gugaaccaga acgcccaggc ccugaacacc cuggugaagc agcugagcag caacuucggc     2100
          gccaucagca gcgugcugaa cgacauccug agccggcugg acccucccaa cgccaccgug     2160
          cagaucgacc ggcugaucac uggccggcug cagagccugc agaccuacgu gacccagcag     2220
          cugauccggg ccgccgagau ucgggccagc gccaaccugg ccgccaccaa gaugagcgag     2280
          ugcgugcugg gccagagcaa gcggguggac uucugcggca agggcuacca ccugaugagc     2340
          uuuccccaga gcgcacccca cggaguggug uuccugcacg ugaccuacgu gcccgcccag     2400
          gagaagaacu ucaccaccgc cccagccauc ugccacgacg gcaaggccca cuuuccccgg     2460
          gagggcgugu ucgugagcaa cggcacccac ugguucguga cccagcggaa cuucuacgag     2520
          ccccagauca ucaccaccga caacaccuuc gugagcggca acugcgacgu ggugaucggc     2580
          aucgugaaca acaccgugua cgauccccug cagcccgagc uggacagcuu caaggaggag     2640
          cuggacaagu acuucaagaa ucacaccagc cccgacgugg accugggcga caucagcggc     2700
          aucaacgcca gcguggugaa cauccagaag gagaucgauc ggcugaacga gguggccaag     2760
          aaccugaacg agagccugau cgaccugcag gagcugggca aguacgagca guacaucaag     2820
          uggcccuggu acaucuggcu gggcuucauc gccggccuga ucgccaucgu gauggugacc     2880
          aucaugcugu gcugcaugac cagcugcugc agcugccuga agggcuguug cagcugcggc     2940
          agcugcugca aguucgacga ggacgacagc gagcccgugc ugaagggcgu gaagcugcac     3000
          uacacc                                                                3006
          <![CDATA[<210>  32]]>
          <![CDATA[<211>  1002]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  32]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Gly Gly 
          305                 310                 315                 320 
          Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln 
                          325                 330                 335     
          Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala 
                      340                 345                 350         
          Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly 
                  355                 360                 365             
          Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His 
              370                 375                 380                 
          Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys 
          385                 390                 395                 400 
          Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val 
                          405                 410                 415     
          Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn 
                      420                 425                 430         
          Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr 
                  435                 440                 445             
          Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser 
              450                 455                 460                 
          Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn 
          465                 470                 475                 480 
          Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu 
                          485                 490                 495     
          Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala 
                      500                 505                 510         
          Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly 
                  515                 520                 525             
          Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg 
              530                 535                 540                 
          Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala 
          545                 550                 555                 560 
          Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg 
                          565                 570                 575     
          Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro 
                      580                 585                 590         
          Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala 
                  595                 600                 605             
          Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln 
              610                 615                 620                 
          Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val 
          625                 630                 635                 640 
          Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe 
                          645                 650                 655     
          Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser 
                      660                 665                 670         
          Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu 
                  675                 680                 685             
          Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser 
              690                 695                 700                 
          Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro Asn Ala Thr Val 
          705                 710                 715                 720 
          Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr 
                          725                 730                 735     
          Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn 
                      740                 745                 750         
          Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg 
                  755                 760                 765             
          Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser 
              770                 775                 780                 
          Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln 
          785                 790                 795                 800 
          Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala 
                          805                 810                 815     
          His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe 
                      820                 825                 830         
          Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn 
                  835                 840                 845             
          Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn 
              850                 855                 860                 
          Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu 
          865                 870                 875                 880 
          Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly 
                          885                 890                 895     
          Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile 
                      900                 905                 910         
          Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp 
                  915                 920                 925             
          Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr 
              930                 935                 940                 
          Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr 
          945                 950                 955                 960 
          Ile Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys 
                          965                 970                 975     
          Cys Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro 
                      980                 985                 990         
          Val Leu Lys Gly Val Lys Leu His  Tyr Thr 
                  995                 1000         
          <![CDATA[<210>  33]]>
          <![CDATA[<211>  2378]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  33]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagg gcaccaucac cgacgccgug      120
          gacugcgccc uggacccucu gagcgagacc aagugcaccc ugaagagcuu caccguggag      180
          aagggcaucu accagaccag caacuucggc ggcagcggcg gcgugagcgu gaucacccca      240
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug      300
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc      360
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac      420
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca      480
          ccccggaggg caaggagcgu ggccagccag agcaucaucg ccuacaccau gagccugggc      540
          gccgagaaca gcguggccua cagcaacaac agcaucgcca uccccaccaa cuucaccauc      600
          agcgugacca ccgagauucu gcccgugagc augaccaaga ccagcgugga cugcaccaug      660
          uacaucugcg gcgacagcac cgagugcagc aaccugcugc ugaacuacac cagcuucugc      720
          acccagcuga accgggcccu gaccggcauc gccguggagc aggacaagaa cacccaggag      780
          guguucgccc aggugaagca gaucuacaag accccuccca ucaaggacuu cggcggcuuc      840
          aacuucagcc agauccugcc cgaccccagc aagcccagca agcggagcuu caucgaggac      900
          cugcuguuca acaaggugac ccuagccgac gccggcuuca ucaagcagua cggcgacugc      960
          cucggcgaca uagccgcccg ggaccugauc ugcgcccaga aguucaacgg ccugaccgug     1020
          cugccucccc ugcugaccga cgagaugauc gcccaguaca ccagcgcccu guuagccgga     1080
          accaucacca gcggcuggac uuucggcgcu ggagccgcuc ugcagauccc cuucgccaug     1140
          cagauggccu accgguucaa cggcaucggc gugacccaga acgugcugua cgagaaccag     1200
          aagcugaucg ccaaccaguu caacagcgcc aucggcaaga uccaggacag ccugagcagc     1260
          accgcuagcg cccugggcaa gcugcaggac guggugaacc agaacgccca ggcccugaac     1320
          acccugguga agcagcugag cagcaacuuc ggcgccauca gcagcgugcu gaacgacauc     1380
          cugagccggc uggacccucc caacgccacc gugcagaucg accggcugau cacuggccgg     1440
          cugcagagcc ugcagaccua cgugacccag cagcugaucc gggccgccga gauucgggcc     1500
          agcgccaacc uggccgccac caagaugagc gagugcgugc ugggccagag caagcgggug     1560
          gacuucugcg gcaagggcua ccaccugaug agcuuucccc agagcgcacc ccacggagug     1620
          guguuccugc acgugaccua cgugcccgcc caggagaaga acuucaccac cgccccagcc     1680
          aucugccacg acggcaaggc ccacuuuccc cgggagggcg uguucgugag caacggcacc     1740
          cacugguucg ugacccagcg gaacuucuac gagccccaga ucaucaccac cgacaacacc     1800
          uucgugagcg gcaacugcga cguggugauc ggcaucguga acaacaccgu guacgauccc     1860
          cugcagcccg agcuggacag cuucaaggag gagcuggaca aguacuucaa gaaucacacc     1920
          agccccgacg uggaccuggg cgacaucagc ggcaucaacg ccagcguggu gaacauccag     1980
          aaggagaucg aucggcugaa cgagguggcc aagaaccuga acgagagccu gaucgaccug     2040
          caggagcugg gcaaguacga gcaguacauc aaguggcccu gguacaucug gcugggcuuc     2100
          aucgccggcc ugaucgccau cgugauggug accaucaugc ugugcugcau gaccagcugc     2160
          ugcagcugcc ugaagggcug uugcagcugc ggcagcugcu gcaaguucga cgaggacgac     2220
          agcgagcccg ugcugaaggg cgugaagcug cacuacaccu gauaauaggc uggagccucg     2280
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg     2340
          uacccccgug gucuuugaau aaagucugag ugggcggc                             2378
          <![CDATA[<210>  34]]>
          <![CDATA[<211>  2202]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  34]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agggcaccau caccgacgcc       60
          guggacugcg cccuggaccc ucugagcgag accaagugca cccugaagag cuucaccgug      120
          gagaagggca ucuaccagac cagcaacuuc ggcggcagcg gcggcgugag cgugaucacc      180
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag      240
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc      300
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc      360
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau      420
          ucaccccgga gggcaaggag cguggccagc cagagcauca ucgccuacac caugagccug      480
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc      540
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc      600
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugaacua caccagcuuc      660
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag      720
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc      780
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag      840
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac      900
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc      960
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc     1020
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc     1080
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac     1140
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc     1200
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug     1260
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac     1320
          auccugagcc ggcuggaccc ucccaacgcc accgugcaga ucgaccggcu gaucacuggc     1380
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg     1440
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg     1500
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga     1560
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca     1620
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc     1680
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac     1740
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau     1800
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac     1860
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc     1920
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac     1980
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc     2040
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc     2100
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac     2160
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca cc                        2202
          <![CDATA[<210>  35]]>
          <![CDATA[<211>  734]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  35]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Gly Thr 
          1               5                   10                  15      
          Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys 
                      20                  25                  30          
          Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser 
                  35                  40                  45              
          Asn Phe Gly Gly Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn 
              50                  55                  60                  
          Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu 
          65                  70                  75                  80  
          Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val 
                          85                  90                  95      
          Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile 
                      100                 105                 110         
          Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly 
                  115                 120                 125             
          Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg 
              130                 135                 140                 
          Ala Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu 
          145                 150                 155                 160 
          Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro 
                          165                 170                 175     
          Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met 
                      180                 185                 190         
          Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr 
                  195                 200                 205             
          Glu Cys Ser Asn Leu Leu Leu Asn Tyr Thr Ser Phe Cys Thr Gln Leu 
              210                 215                 220                 
          Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln 
          225                 230                 235                 240 
          Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys 
                          245                 250                 255     
          Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys 
                      260                 265                 270         
          Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr 
                  275                 280                 285             
          Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp 
              290                 295                 300                 
          Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr 
          305                 310                 315                 320 
          Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser 
                          325                 330                 335     
          Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly 
                      340                 345                 350         
          Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn 
                  355                 360                 365             
          Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile 
              370                 375                 380                 
          Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser 
          385                 390                 395                 400 
          Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn 
                          405                 410                 415     
          Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly 
                      420                 425                 430         
          Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro 
                  435                 440                 445             
          Asn Ala Thr Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser 
              450                 455                 460                 
          Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg 
          465                 470                 475                 480 
          Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly 
                          485                 490                 495     
          Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser 
                      500                 505                 510         
          Phe Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr 
                  515                 520                 525             
          Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His 
              530                 535                 540                 
          Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly 
          545                 550                 555                 560 
          Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile 
                          565                 570                 575     
          Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly 
                      580                 585                 590         
          Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser 
                  595                 600                 605             
          Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp 
              610                 615                 620                 
          Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile 
          625                 630                 635                 640 
          Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu 
                          645                 650                 655     
          Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys 
                      660                 665                 670         
          Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile 
                  675                 680                 685             
          Val Met Val Thr Ile Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys 
              690                 695                 700                 
          Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp 
          705                 710                 715                 720 
          Asp Ser Glu Pro Val Leu Lys Gly Val Lys Leu His Tyr Thr 
                          725                 730                 
          <![CDATA[<210>  36]]>
          <![CDATA[<211>  4241]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  36]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uuccugauca ucuucauccu gcccaccacc cuggccguga ucggcgacuu caacugcacc      120
          aacagcuuca ucaacgacua caacaagacc aucccucgga ucagcgaaga cgugguggac      180
          gugucccugg gccugggcac cuacuacgug cugaaccggg uguaccugaa caccacacug      240
          cuguucaccg gcuacuuccc caagagcggc gccaacuucc gggaccuggc ccugaagggc      300
          agcaucuacc ugagcaccuu gugguacaag ccucccuucc ugagcgacuu caauaacggc      360
          aucuucucua aggugaagaa caccaagcug uacguaaaca acacccugua cagcgaguuc      420
          agcaccaucg ugaucggcag cguguucguc aacaccagcu acaccaucgu ggugcagccc      480
          cacaacggca uccuggagau caccgccugc caguacacca ugugcgagua cccucacacc      540
          gugugcaaga gcaagggcuc cauccggaac gagagcuggc acaucgacag cagcgagccg      600
          cugugccugu ucaagaagaa cuucaccuac aacgugagcg ccgacuggcu guacuuccac      660
          uucuaccagg agcggggcgu guucuacgcc uacuacgccg acgugggcau gccaaccacc      720
          uuccuguuca gccuguaccu gggcaccauc cugagccacu acuacgugau gccccugacc      780
          ugcaacgcca ucagcucaaa caccgacaac gagacccugg aguacugggu gacuccacug      840
          agccggcggc aguaccugcu gaacuucgac gagcacggcg ugaucaccaa cgccguggac      900
          ugcgcccugg acccucugag cgagaccaag ugcacccuga agagcuucac cguggagaag      960
          ggcaucuacc agaccagcgg cuucaccgug aagcccguag ccaccgugua ccggcggauc     1020
          cccaaccugc ccgacugcga caucgacaac uggcugaaca acgucagcgu gcccagccca     1080
          cugaacuggg agcggcggau cuucagcaac ugcaacuuca aucugagcac ccugcugcgg     1140
          cuggugcacg uggacagcuu cagcugcaac aaccuggaca agagcaagau cuucgguagc     1200
          ugcuucaaca gcaucaccgu ggacaaguuc gccaucccua accggcggcg ggacgaucug     1260
          cagcugggca gcagcggcuu ccugcagagc agcaacuaca agaucgacau cagcagcuca     1320
          agcugccagc uguacuacag ccugccccug gugaacguga ccaucaacaa cuucaacccc     1380
          agcagcugga accggcggua cggcuucggc agcuucaacc ugagcagcua cgacguggug     1440
          uacagcgacc acugcuucag cgugaacagc gacuucugcc ccugugccga cccuagcgug     1500
          gugaacagcu gcgccaagag caagccuccc agcgccauuu gccccgccgg caccaaguac     1560
          cggcacugcg accuggacac cacccuguac gugaagaacu ggugccggug cagcugccug     1620
          cccgacccca ucagcaccua cagccccaac accugucccc agaagaaggu gguggugggu     1680
          aucggcgagc acugucccgg ccugggcauc aacgaggaga agugcggcac ccagcugaac     1740
          cacagcagcu gcuucuguag ccccgacgcc uuccugggcu ggagcuucga cagcugcauc     1800
          agcaacaacc ggugcaacau cuuuagcaac uucaucuuca acggaaucaa cagcggcacc     1860
          accugcagca acgaccugcu guauagcaac accgagauca gcaccggcgu gugcgugaac     1920
          uacgaccugu acggcaucac cggccagggc aucuucaagg aggugagcgc cgccuacuac     1980
          aacaacuggc agaaccugcu guacgacagc aacggcaaca ucaucggcuu caaggacuuu     2040
          cugaccaaca agaccuacac cauccugccc ugcuacagcg gcggcgugag cgugaucacc     2100
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag     2160
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc     2220
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc     2280
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau     2340
          ucaccccgga gggcaaggag cguggccagc cagagcauca ucgccuacac caugagccug     2400
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc     2460
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc     2520
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugcagua cggcagcuuc     2580
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag     2640
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc     2700
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag     2760
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac     2820
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc     2880
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc     2940
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc     3000
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac     3060
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc     3120
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug     3180
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac     3240
          auccugagcc ggcuggaccc ucccgaggcc gaggugcaga ucgaccggcu gaucacuggc     3300
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg     3360
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg     3420
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga     3480
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca     3540
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc     3600
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac     3660
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau     3720
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac     3780
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc     3840
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac     3900
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc     3960
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc     4020
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac     4080
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca ccugauaaua ggcuggagcc     4140
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac     4200
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c                         4241
          <![CDATA[<210>  37]]>
          <![CDATA[<211>  4065]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  37]]>
          auguuccuga ucaucuucau ccugcccacc acccuggccg ugaucggcga cuucaacugc       60
          accaacagcu ucaucaacga cuacaacaag accaucccuc ggaucagcga agacguggug      120
          gacguguccc ugggccuggg caccuacuac gugcugaacc ggguguaccu gaacaccaca      180
          cugcuguuca ccggcuacuu ccccaagagc ggcgccaacu uccgggaccu ggcccugaag      240
          ggcagcaucu accugagcac cuugugguac aagccucccu uccugagcga cuucaauaac      300
          ggcaucuucu cuaaggugaa gaacaccaag cuguacguaa acaacacccu guacagcgag      360
          uucagcacca ucgugaucgg cagcguguuc gucaacacca gcuacaccau cguggugcag      420
          ccccacaacg gcauccugga gaucaccgcc ugccaguaca ccaugugcga guacccucac      480
          accgugugca agagcaaggg cuccauccgg aacgagagcu ggcacaucga cagcagcgag      540
          ccgcugugcc uguucaagaa gaacuucacc uacaacguga gcgccgacug gcuguacuuc      600
          cacuucuacc aggagcgggg cguguucuac gccuacuacg ccgacguggg caugccaacc      660
          accuuccugu ucagccugua ccugggcacc auccugagcc acuacuacgu gaugccccug      720
          accugcaacg ccaucagcuc aaacaccgac aacgagaccc uggaguacug ggugacucca      780
          cugagccggc ggcaguaccu gcugaacuuc gacgagcacg gcgugaucac caacgccgug      840
          gacugcgccc uggacccucu gagcgagacc aagugcaccc ugaagagcuu caccguggag      900
          aagggcaucu accagaccag cggcuucacc gugaagcccg uagccaccgu guaccggcgg      960
          auccccaacc ugcccgacug cgacaucgac aacuggcuga acaacgucag cgugcccagc     1020
          ccacugaacu gggagcggcg gaucuucagc aacugcaacu ucaaucugag cacccugcug     1080
          cggcuggugc acguggacag cuucagcugc aacaaccugg acaagagcaa gaucuucggu     1140
          agcugcuuca acagcaucac cguggacaag uucgccaucc cuaaccggcg gcgggacgau     1200
          cugcagcugg gcagcagcgg cuuccugcag agcagcaacu acaagaucga caucagcagc     1260
          ucaagcugcc agcuguacua cagccugccc cuggugaacg ugaccaucaa caacuucaac     1320
          cccagcagcu ggaaccggcg guacggcuuc ggcagcuuca accugagcag cuacgacgug     1380
          guguacagcg accacugcuu cagcgugaac agcgacuucu gccccugugc cgacccuagc     1440
          guggugaaca gcugcgccaa gagcaagccu cccagcgcca uuugccccgc cggcaccaag     1500
          uaccggcacu gcgaccugga caccacccug uacgugaaga acuggugccg gugcagcugc     1560
          cugcccgacc ccaucagcac cuacagcccc aacaccuguc cccagaagaa ggugguggug     1620
          gguaucggcg agcacugucc cggccugggc aucaacgagg agaagugcgg cacccagcug     1680
          aaccacagca gcugcuucug uagccccgac gccuuccugg gcuggagcuu cgacagcugc     1740
          aucagcaaca accggugcaa caucuuuagc aacuucaucu ucaacggaau caacagcggc     1800
          accaccugca gcaacgaccu gcuguauagc aacaccgaga ucagcaccgg cgugugcgug     1860
          aacuacgacc uguacggcau caccggccag ggcaucuuca aggaggugag cgccgccuac     1920
          uacaacaacu ggcagaaccu gcuguacgac agcaacggca acaucaucgg cuucaaggac     1980
          uuucugacca acaagaccua caccauccug cccugcuaca gcggcggcgu gagcgugauc     2040
          accccaggca ccaacaccag caaccaggug gccgugcugu accaggacgu gaacugcacc     2100
          gaggugcccg uggccaucca cgccgaccag cugacaccca ccuggcgggu cuacagcacc     2160
          ggcagcaacg uguuccagac ccgggccggu ugccugaucg gcgccgagca cgugaacaac     2220
          agcuacgagu gcgacauccc caucggcgcc ggcaucugug ccagcuacca gacccagacc     2280
          aauucacccc ggagggcaag gagcguggcc agccagagca ucaucgccua caccaugagc     2340
          cugggcgccg agaacagcgu ggccuacagc aacaacagca ucgccauccc caccaacuuc     2400
          accaucagcg ugaccaccga gauucugccc gugagcauga ccaagaccag cguggacugc     2460
          accauguaca ucugcggcga cagcaccgag ugcagcaacc ugcugcugca guacggcagc     2520
          uucugcaccc agcugaaccg ggcccugacc ggcaucgccg uggagcagga caagaacacc     2580
          caggaggugu ucgcccaggu gaagcagauc uacaagaccc cucccaucaa ggacuucggc     2640
          ggcuucaacu ucagccagau ccugcccgac cccagcaagc ccagcaagcg gagcuucauc     2700
          gaggaccugc uguucaacaa ggugacccua gccgacgccg gcuucaucaa gcaguacggc     2760
          gacugccucg gcgacauagc cgcccgggac cugaucugcg cccagaaguu caacggccug     2820
          accgugcugc cuccccugcu gaccgacgag augaucgccc aguacaccag cgcccuguua     2880
          gccggaacca ucaccagcgg cuggacuuuc ggcgcuggag ccgcucugca gauccccuuc     2940
          gccaugcaga uggccuaccg guucaacggc aucggcguga cccagaacgu gcuguacgag     3000
          aaccagaagc ugaucgccaa ccaguucaac agcgccaucg gcaagaucca ggacagccug     3060
          agcagcaccg cuagcgcccu gggcaagcug caggacgugg ugaaccagaa cgcccaggcc     3120
          cugaacaccc uggugaagca gcugagcagc aacuucggcg ccaucagcag cgugcugaac     3180
          gacauccuga gccggcugga cccucccgag gccgaggugc agaucgaccg gcugaucacu     3240
          ggccggcugc agagccugca gaccuacgug acccagcagc ugauccgggc cgccgagauu     3300
          cgggccagcg ccaaccuggc cgccaccaag augagcgagu gcgugcuggg ccagagcaag     3360
          cggguggacu ucugcggcaa gggcuaccac cugaugagcu uuccccagag cgcaccccac     3420
          ggaguggugu uccugcacgu gaccuacgug cccgcccagg agaagaacuu caccaccgcc     3480
          ccagccaucu gccacgacgg caaggcccac uuuccccggg agggcguguu cgugagcaac     3540
          ggcacccacu gguucgugac ccagcggaac uucuacgagc cccagaucau caccaccgac     3600
          aacaccuucg ugagcggcaa cugcgacgug gugaucggca ucgugaacaa caccguguac     3660
          gauccccugc agcccgagcu ggacagcuuc aaggaggagc uggacaagua cuucaagaau     3720
          cacaccagcc ccgacgugga ccugggcgac aucagcggca ucaacgccag cguggugaac     3780
          auccagaagg agaucgaucg gcugaacgag guggccaaga accugaacga gagccugauc     3840
          gaccugcagg agcugggcaa guacgagcag uacaucaagu ggcccuggua caucuggcug     3900
          ggcuucaucg ccggccugau cgccaucgug auggugacca ucaugcugug cugcaugacc     3960
          agcugcugca gcugccugaa gggcuguugc agcugcggca gcugcugcaa guucgacgag     4020
          gacgacagcg agcccgugcu gaagggcgug aagcugcacu acacc                     4065
          <![CDATA[<210>  38]]>
          <![CDATA[<211>  1355]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  38]]>
          Met Phe Leu Ile Ile Phe Ile Leu Pro Thr Thr Leu Ala Val Ile Gly 
          1               5                   10                  15      
          Asp Phe Asn Cys Thr Asn Ser Phe Ile Asn Asp Tyr Asn Lys Thr Ile 
                      20                  25                  30          
          Pro Arg Ile Ser Glu Asp Val Val Asp Val Ser Leu Gly Leu Gly Thr 
                  35                  40                  45              
          Tyr Tyr Val Leu Asn Arg Val Tyr Leu Asn Thr Thr Leu Leu Phe Thr 
              50                  55                  60                  
          Gly Tyr Phe Pro Lys Ser Gly Ala Asn Phe Arg Asp Leu Ala Leu Lys 
          65                  70                  75                  80  
          Gly Ser Ile Tyr Leu Ser Thr Leu Trp Tyr Lys Pro Pro Phe Leu Ser 
                          85                  90                  95      
          Asp Phe Asn Asn Gly Ile Phe Ser Lys Val Lys Asn Thr Lys Leu Tyr 
                      100                 105                 110         
          Val Asn Asn Thr Leu Tyr Ser Glu Phe Ser Thr Ile Val Ile Gly Ser 
                  115                 120                 125             
          Val Phe Val Asn Thr Ser Tyr Thr Ile Val Val Gln Pro His Asn Gly 
              130                 135                 140                 
          Ile Leu Glu Ile Thr Ala Cys Gln Tyr Thr Met Cys Glu Tyr Pro His 
          145                 150                 155                 160 
          Thr Val Cys Lys Ser Lys Gly Ser Ile Arg Asn Glu Ser Trp His Ile 
                          165                 170                 175     
          Asp Ser Ser Glu Pro Leu Cys Leu Phe Lys Lys Asn Phe Thr Tyr Asn 
                      180                 185                 190         
          Val Ser Ala Asp Trp Leu Tyr Phe His Phe Tyr Gln Glu Arg Gly Val 
                  195                 200                 205             
          Phe Tyr Ala Tyr Tyr Ala Asp Val Gly Met Pro Thr Thr Phe Leu Phe 
              210                 215                 220                 
          Ser Leu Tyr Leu Gly Thr Ile Leu Ser His Tyr Tyr Val Met Pro Leu 
          225                 230                 235                 240 
          Thr Cys Asn Ala Ile Ser Ser Asn Thr Asp Asn Glu Thr Leu Glu Tyr 
                          245                 250                 255     
          Trp Val Thr Pro Leu Ser Arg Arg Gln Tyr Leu Leu Asn Phe Asp Glu 
                      260                 265                 270         
          His Gly Val Ile Thr Asn Ala Val Asp Cys Ala Leu Asp Pro Leu Ser 
                  275                 280                 285             
          Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr 
              290                 295                 300                 
          Gln Thr Ser Gly Phe Thr Val Lys Pro Val Ala Thr Val Tyr Arg Arg 
          305                 310                 315                 320 
          Ile Pro Asn Leu Pro Asp Cys Asp Ile Asp Asn Trp Leu Asn Asn Val 
                          325                 330                 335     
          Ser Val Pro Ser Pro Leu Asn Trp Glu Arg Arg Ile Phe Ser Asn Cys 
                      340                 345                 350         
          Asn Phe Asn Leu Ser Thr Leu Leu Arg Leu Val His Val Asp Ser Phe 
                  355                 360                 365             
          Ser Cys Asn Asn Leu Asp Lys Ser Lys Ile Phe Gly Ser Cys Phe Asn 
              370                 375                 380                 
          Ser Ile Thr Val Asp Lys Phe Ala Ile Pro Asn Arg Arg Arg Asp Asp 
          385                 390                 395                 400 
          Leu Gln Leu Gly Ser Ser Gly Phe Leu Gln Ser Ser Asn Tyr Lys Ile 
                          405                 410                 415     
          Asp Ile Ser Ser Ser Ser Cys Gln Leu Tyr Tyr Ser Leu Pro Leu Val 
                      420                 425                 430         
          Asn Val Thr Ile Asn Asn Phe Asn Pro Ser Ser Trp Asn Arg Arg Tyr 
                  435                 440                 445             
          Gly Phe Gly Ser Phe Asn Leu Ser Ser Tyr Asp Val Val Tyr Ser Asp 
              450                 455                 460                 
          His Cys Phe Ser Val Asn Ser Asp Phe Cys Pro Cys Ala Asp Pro Ser 
          465                 470                 475                 480 
          Val Val Asn Ser Cys Ala Lys Ser Lys Pro Pro Ser Ala Ile Cys Pro 
                          485                 490                 495     
          Ala Gly Thr Lys Tyr Arg His Cys Asp Leu Asp Thr Thr Leu Tyr Val 
                      500                 505                 510         
          Lys Asn Trp Cys Arg Cys Ser Cys Leu Pro Asp Pro Ile Ser Thr Tyr 
                  515                 520                 525             
          Ser Pro Asn Thr Cys Pro Gln Lys Lys Val Val Val Gly Ile Gly Glu 
              530                 535                 540                 
          His Cys Pro Gly Leu Gly Ile Asn Glu Glu Lys Cys Gly Thr Gln Leu 
          545                 550                 555                 560 
          Asn His Ser Ser Cys Phe Cys Ser Pro Asp Ala Phe Leu Gly Trp Ser 
                          565                 570                 575     
          Phe Asp Ser Cys Ile Ser Asn Asn Arg Cys Asn Ile Phe Ser Asn Phe 
                      580                 585                 590         
          Ile Phe Asn Gly Ile Asn Ser Gly Thr Thr Cys Ser Asn Asp Leu Leu 
                  595                 600                 605             
          Tyr Ser Asn Thr Glu Ile Ser Thr Gly Val Cys Val Asn Tyr Asp Leu 
              610                 615                 620                 
          Tyr Gly Ile Thr Gly Gln Gly Ile Phe Lys Glu Val Ser Ala Ala Tyr 
          625                 630                 635                 640 
          Tyr Asn Asn Trp Gln Asn Leu Leu Tyr Asp Ser Asn Gly Asn Ile Ile 
                          645                 650                 655     
          Gly Phe Lys Asp Phe Leu Thr Asn Lys Thr Tyr Thr Ile Leu Pro Cys 
                      660                 665                 670         
          Tyr Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn 
                  675                 680                 685             
          Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val 
              690                 695                 700                 
          Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr 
          705                 710                 715                 720 
          Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu 
                          725                 730                 735     
          His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile 
                      740                 745                 750         
          Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser 
                  755                 760                 765             
          Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu 
              770                 775                 780                 
          Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe 
          785                 790                 795                 800 
          Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr 
                          805                 810                 815     
          Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser 
                      820                 825                 830         
          Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala 
                  835                 840                 845             
          Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe 
              850                 855                 860                 
          Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly 
          865                 870                 875                 880 
          Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys 
                          885                 890                 895     
          Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp 
                      900                 905                 910         
          Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala 
                  915                 920                 925             
          Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro 
              930                 935                 940                 
          Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu 
          945                 950                 955                 960 
          Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu 
                          965                 970                 975     
          Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly 
                      980                 985                 990         
          Val Thr Gln Asn Val Leu Tyr Glu  Asn Gln Lys Leu Ile  Ala Asn Gln 
                  995                 1000                 1005             
          Phe Asn  Ser Ala Ile Gly Lys  Ile Gln Asp Ser Leu  Ser Ser Thr 
              1010                 1015                 1020             
          Ala Ser  Ala Leu Gly Lys Leu  Gln Asp Val Val Asn  Gln Asn Ala 
              1025                 1030                 1035             
          Gln Ala  Leu Asn Thr Leu Val  Lys Gln Leu Ser Ser  Asn Phe Gly 
              1040                 1045                 1050             
          Ala Ile  Ser Ser Val Leu Asn  Asp Ile Leu Ser Arg  Leu Asp Pro 
              1055                 1060                 1065             
          Pro Glu  Ala Glu Val Gln Ile  Asp Arg Leu Ile Thr  Gly Arg Leu 
              1070                 1075                 1080             
          Gln Ser  Leu Gln Thr Tyr Val  Thr Gln Gln Leu Ile  Arg Ala Ala 
              1085                 1090                 1095             
          Glu Ile  Arg Ala Ser Ala Asn  Leu Ala Ala Thr Lys  Met Ser Glu 
              1100                 1105                 1110             
          Cys Val  Leu Gly Gln Ser Lys  Arg Val Asp Phe Cys  Gly Lys Gly 
              1115                 1120                 1125             
          Tyr His  Leu Met Ser Phe Pro  Gln Ser Ala Pro His  Gly Val Val 
              1130                 1135                 1140             
          Phe Leu  His Val Thr Tyr Val  Pro Ala Gln Glu Lys  Asn Phe Thr 
              1145                 1150                 1155             
          Thr Ala  Pro Ala Ile Cys His  Asp Gly Lys Ala His  Phe Pro Arg 
              1160                 1165                 1170             
          Glu Gly  Val Phe Val Ser Asn  Gly Thr His Trp Phe  Val Thr Gln 
              1175                 1180                 1185             
          Arg Asn  Phe Tyr Glu Pro Gln  Ile Ile Thr Thr Asp  Asn Thr Phe 
              1190                 1195                 1200             
          Val Ser  Gly Asn Cys Asp Val  Val Ile Gly Ile Val  Asn Asn Thr 
              1205                 1210                 1215             
          Val Tyr  Asp Pro Leu Gln Pro  Glu Leu Asp Ser Phe  Lys Glu Glu 
              1220                 1225                 1230             
          Leu Asp  Lys Tyr Phe Lys Asn  His Thr Ser Pro Asp  Val Asp Leu 
              1235                 1240                 1245             
          Gly Asp  Ile Ser Gly Ile Asn  Ala Ser Val Val Asn  Ile Gln Lys 
              1250                 1255                 1260             
          Glu Ile  Asp Arg Leu Asn Glu  Val Ala Lys Asn Leu  Asn Glu Ser 
              1265                 1270                 1275             
          Leu Ile  Asp Leu Gln Glu Leu  Gly Lys Tyr Glu Gln  Tyr Ile Lys 
              1280                 1285                 1290             
          Trp Pro  Trp Tyr Ile Trp Leu  Gly Phe Ile Ala Gly  Leu Ile Ala 
              1295                 1300                 1305             
          Ile Val  Met Val Thr Ile Met  Leu Cys Cys Met Thr  Ser Cys Cys 
              1310                 1315                 1320             
          Ser Cys  Leu Lys Gly Cys Cys  Ser Cys Gly Ser Cys  Cys Lys Phe 
              1325                 1330                 1335             
          Asp Glu  Asp Asp Ser Glu Pro  Val Leu Lys Gly Val  Lys Leu His 
              1340                 1345                 1350             
          Tyr Thr  
              1355 
          <![CDATA[<210>  39]]>
          <![CDATA[<211>  4247]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  39]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uuccugaucc ugcugaucag ccugcccacc gccuucgccg ugaucggcga ccugaagugc      120
          accagcgaca acaucaacga caaggacacc ggcccaccac ccaucagcac cgacaccgug      180
          gacgugacca acggccuggg caccuacuac gugcuggacc ggguguaccu gaacaccacc      240
          cuguuccuga acggcuacua ccccaccagc ggcagcaccu accggaauau ggcccugaag      300
          ggcagcgugc ugcugagccg gcugugguuc aagccaccau uccugagcga cuucaucaac      360
          ggaaucuucg ccaaggugaa gaacaccaag gugaucaagg accgggugau guacagcgag      420
          uuccccgcca ucaccauugg caguaccuuc gugaacacca gcuacagcgu gguggugcag      480
          ccccggacca ucaacagcac ccaggacggc gacaacaagc ugcagggccu gcuggaggug      540
          agcgugugcc aguacaacau gugcgaguac ccucagacca ucugccaccc caaccugggc      600
          aaccaccgga aggagcugug gcaccuggac accggcgugg ugagcugccu guacaagcgg      660
          aacuucaccu acgacguaaa cgccgacuac cuguacuucc acuucuacca ggagggcggc      720
          accuucuacg ccuacuucac cgacacgggc guggugacca aguuccuguu caacguguac      780
          cugggcaugg cccugagcca cuacuacgug augccccuga ccuguaacag caagcugacc      840
          cuggaguacu gggugacccc ucugaccagc cggcaguacc ugcuggccuu caaccaggac      900
          ggcaucaucu ucaacgccgu ggacugcgcc cuggacccuc ugagcgagac caagugcacc      960
          cugaagagcu ucaccgugga gaagggcauc uaccagacca acggcuacac cgugcagccc     1020
          aucgccgacg uguaccggcg gaagcccaac cugcccaacu gcaacaucga ggccuggcug     1080
          aacgacaaga gcgugcccuc gccccugaac ugggagcgga agaccuucag caacugcaac     1140
          uucaacauga gcagucugau gagcuucauc caggccgaca gcuucaccug caacaacauc     1200
          gacgccgcca agaucuacgg caugugcuuc agcagcauca ccaucgacaa guuugccauc     1260
          cccaacggcc ggaaggugga ccugcagcug ggcaaccugg gcuaccugca gagcuucaac     1320
          uaccggaucg acaccaccgc caccucuugc cagcuguacu acaaccugcc cgccgccaac     1380
          gugagcguga gccgguucaa ccccagcacc uggaacaagc gguucggcuu cauugaggac     1440
          agcguguuca aaccccggcc cgcaggagua cugaccaacc acgacguggu guacgcccag     1500
          cacugcuuca aggcacccaa gaacuucugc cccugcaagc ugaacggcag cugugugggc     1560
          ucuggccccg guaagaacaa cggcauaggg acuugcccgg cagggaccaa cuaccugacc     1620
          ugcgacaacc ugugcacacc cgaccccauc accuucaccg gcaccuacaa guguccccag     1680
          accaagagcc uggugggcau cggcgagcac ugcagcggcc uggccgugaa gagcgacuac     1740
          ugcggcggca acagcugcac cugucggccc caggccuucc ugggcuggag cgccgacagc     1800
          ugccugcagg gcgacaagug caacaucuuu gccaacuuca uccugcacga cgugaacagc     1860
          ggccugaccu gcagcaccga ccugcagaag gccaacaccg acaucauccu gggcgugugc     1920
          gugaacuacg acuuguacgg cauccugggc cagggcaucu ucguggaggu gaacgccacc     1980
          uacuacaaca gcuggcagaa ccugcuguac gacagcaacg gcaaccugua cggcuuccgg     2040
          gacuacauca ucaaccggac cuucaugauc cggagcugcu acagcggcgg cgugagcgug     2100
          aucaccccag gcaccaacac cagcaaccag guggccgugc uguaccagga cgugaacugc     2160
          accgaggugc ccguggccau ccacgccgac cagcugacac ccaccuggcg ggucuacagc     2220
          accggcagca acguguucca gacccgggcc gguugccuga ucggcgccga gcacgugaac     2280
          aacagcuacg agugcgacau ccccaucggc gccggcaucu gugccagcua ccagacccag     2340
          accaauucac cccggagggc aaggagcgug gccagccaga gcaucaucgc cuacaccaug     2400
          agccugggcg ccgagaacag cguggccuac agcaacaaca gcaucgccau ccccaccaac     2460
          uucaccauca gcgugaccac cgagauucug cccgugagca ugaccaagac cagcguggac     2520
          ugcaccaugu acaucugcgg cgacagcacc gagugcagca accugcugcu gcaguacggc     2580
          agcuucugca cccagcugaa ccgggcccug accggcaucg ccguggagca ggacaagaac     2640
          acccaggagg uguucgccca ggugaagcag aucuacaaga ccccucccau caaggacuuc     2700
          ggcggcuuca acuucagcca gauccugccc gaccccagca agcccagcaa gcggagcuuc     2760
          aucgaggacc ugcuguucaa caaggugacc cuagccgacg ccggcuucau caagcaguac     2820
          ggcgacugcc ucggcgacau agccgcccgg gaccugaucu gcgcccagaa guucaacggc     2880
          cugaccgugc ugccuccccu gcugaccgac gagaugaucg cccaguacac cagcgcccug     2940
          uuagccggaa ccaucaccag cggcuggacu uucggcgcug gagccgcucu gcagaucccc     3000
          uucgccaugc agauggccua ccgguucaac ggcaucggcg ugacccagaa cgugcuguac     3060
          gagaaccaga agcugaucgc caaccaguuc aacagcgcca ucggcaagau ccaggacagc     3120
          cugagcagca ccgcuagcgc ccugggcaag cugcaggacg uggugaacca gaacgcccag     3180
          gcccugaaca cccuggugaa gcagcugagc agcaacuucg gcgccaucag cagcgugcug     3240
          aacgacaucc ugagccggcu ggacccuccc gaggccgagg ugcagaucga ccggcugauc     3300
          acuggccggc ugcagagccu gcagaccuac gugacccagc agcugauccg ggccgccgag     3360
          auucgggcca gcgccaaccu ggccgccacc aagaugagcg agugcgugcu gggccagagc     3420
          aagcgggugg acuucugcgg caagggcuac caccugauga gcuuucccca gagcgcaccc     3480
          cacggagugg uguuccugca cgugaccuac gugcccgccc aggagaagaa cuucaccacc     3540
          gccccagcca ucugccacga cggcaaggcc cacuuucccc gggagggcgu guucgugagc     3600
          aacggcaccc acugguucgu gacccagcgg aacuucuacg agccccagau caucaccacc     3660
          gacaacaccu ucgugagcgg caacugcgac guggugaucg gcaucgugaa caacaccgug     3720
          uacgaucccc ugcagcccga gcuggacagc uucaaggagg agcuggacaa guacuucaag     3780
          aaucacacca gccccgacgu ggaccugggc gacaucagcg gcaucaacgc cagcguggug     3840
          aacauccaga aggagaucga ucggcugaac gagguggcca agaaccugaa cgagagccug     3900
          aucgaccugc aggagcuggg caaguacgag caguacauca aguggcccug guacaucugg     3960
          cugggcuuca ucgccggccu gaucgccauc gugaugguga ccaucaugcu gugcugcaug     4020
          accagcugcu gcagcugccu gaagggcugu ugcagcugcg gcagcugcug caaguucgac     4080
          gaggacgaca gcgagcccgu gcugaagggc gugaagcugc acuacaccug auaauaggcu     4140
          ggagccucgg uggccuagcu ucuugccccu ugggccuccc cccagccccu ccuccccuuc     4200
          cugcacccgu acccccgugg ucuuugaaua aagucugagu gggcggc                   4247
          <![CDATA[<210>  40]]>
          <![CDATA[<211>  4071]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  40]]>
          auguuccuga uccugcugau cagccugccc accgccuucg ccgugaucgg cgaccugaag       60
          ugcaccagcg acaacaucaa cgacaaggac accggcccac cacccaucag caccgacacc      120
          guggacguga ccaacggccu gggcaccuac uacgugcugg accgggugua ccugaacacc      180
          acccuguucc ugaacggcua cuaccccacc agcggcagca ccuaccggaa uauggcccug      240
          aagggcagcg ugcugcugag ccggcugugg uucaagccac cauuccugag cgacuucauc      300
          aacggaaucu ucgccaaggu gaagaacacc aaggugauca aggaccgggu gauguacagc      360
          gaguuccccg ccaucaccau uggcaguacc uucgugaaca ccagcuacag cgugguggug      420
          cagccccgga ccaucaacag cacccaggac ggcgacaaca agcugcaggg ccugcuggag      480
          gugagcgugu gccaguacaa caugugcgag uacccucaga ccaucugcca ccccaaccug      540
          ggcaaccacc ggaaggagcu guggcaccug gacaccggcg uggugagcug ccuguacaag      600
          cggaacuuca ccuacgacgu aaacgccgac uaccuguacu uccacuucua ccaggagggc      660
          ggcaccuucu acgccuacuu caccgacacg ggcgugguga ccaaguuccu guucaacgug      720
          uaccugggca uggcccugag ccacuacuac gugaugcccc ugaccuguaa cagcaagcug      780
          acccuggagu acugggugac cccucugacc agccggcagu accugcuggc cuucaaccag      840
          gacggcauca ucuucaacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      900
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccaacggcua caccgugcag      960
          cccaucgccg acguguaccg gcggaagccc aaccugccca acugcaacau cgaggccugg     1020
          cugaacgaca agagcgugcc cucgccccug aacugggagc ggaagaccuu cagcaacugc     1080
          aacuucaaca ugagcagucu gaugagcuuc auccaggccg acagcuucac cugcaacaac     1140
          aucgacgccg ccaagaucua cggcaugugc uucagcagca ucaccaucga caaguuugcc     1200
          auccccaacg gccggaaggu ggaccugcag cugggcaacc ugggcuaccu gcagagcuuc     1260
          aacuaccgga ucgacaccac cgccaccucu ugccagcugu acuacaaccu gcccgccgcc     1320
          aacgugagcg ugagccgguu caaccccagc accuggaaca agcgguucgg cuucauugag     1380
          gacagcgugu ucaaaccccg gcccgcagga guacugacca accacgacgu gguguacgcc     1440
          cagcacugcu ucaaggcacc caagaacuuc ugccccugca agcugaacgg cagcugugug     1500
          ggcucuggcc ccgguaagaa caacggcaua gggacuugcc cggcagggac caacuaccug     1560
          accugcgaca accugugcac acccgacccc aucaccuuca ccggcaccua caaguguccc     1620
          cagaccaaga gccugguggg caucggcgag cacugcagcg gccuggccgu gaagagcgac     1680
          uacugcggcg gcaacagcug caccugucgg ccccaggccu uccugggcug gagcgccgac     1740
          agcugccugc agggcgacaa gugcaacauc uuugccaacu ucauccugca cgacgugaac     1800
          agcggccuga ccugcagcac cgaccugcag aaggccaaca ccgacaucau ccugggcgug     1860
          ugcgugaacu acgacuugua cggcauccug ggccagggca ucuucgugga ggugaacgcc     1920
          accuacuaca acagcuggca gaaccugcug uacgacagca acggcaaccu guacggcuuc     1980
          cgggacuaca ucaucaaccg gaccuucaug auccggagcu gcuacagcgg cggcgugagc     2040
          gugaucaccc caggcaccaa caccagcaac cagguggccg ugcuguacca ggacgugaac     2100
          ugcaccgagg ugcccguggc cauccacgcc gaccagcuga cacccaccug gcgggucuac     2160
          agcaccggca gcaacguguu ccagacccgg gccgguugcc ugaucggcgc cgagcacgug     2220
          aacaacagcu acgagugcga cauccccauc ggcgccggca ucugugccag cuaccagacc     2280
          cagaccaauu caccccggag ggcaaggagc guggccagcc agagcaucau cgccuacacc     2340
          augagccugg gcgccgagaa cagcguggcc uacagcaaca acagcaucgc cauccccacc     2400
          aacuucacca ucagcgugac caccgagauu cugcccguga gcaugaccaa gaccagcgug     2460
          gacugcacca uguacaucug cggcgacagc accgagugca gcaaccugcu gcugcaguac     2520
          ggcagcuucu gcacccagcu gaaccgggcc cugaccggca ucgccgugga gcaggacaag     2580
          aacacccagg agguguucgc ccaggugaag cagaucuaca agaccccucc caucaaggac     2640
          uucggcggcu ucaacuucag ccagauccug cccgacccca gcaagcccag caagcggagc     2700
          uucaucgagg accugcuguu caacaaggug acccuagccg acgccggcuu caucaagcag     2760
          uacggcgacu gccucggcga cauagccgcc cgggaccuga ucugcgccca gaaguucaac     2820
          ggccugaccg ugcugccucc ccugcugacc gacgagauga ucgcccagua caccagcgcc     2880
          cuguuagccg gaaccaucac cagcggcugg acuuucggcg cuggagccgc ucugcagauc     2940
          cccuucgcca ugcagauggc cuaccgguuc aacggcaucg gcgugaccca gaacgugcug     3000
          uacgagaacc agaagcugau cgccaaccag uucaacagcg ccaucggcaa gauccaggac     3060
          agccugagca gcaccgcuag cgcccugggc aagcugcagg acguggugaa ccagaacgcc     3120
          caggcccuga acacccuggu gaagcagcug agcagcaacu ucggcgccau cagcagcgug     3180
          cugaacgaca uccugagccg gcuggacccu cccgaggccg aggugcagau cgaccggcug     3240
          aucacuggcc ggcugcagag ccugcagacc uacgugaccc agcagcugau ccgggccgcc     3300
          gagauucggg ccagcgccaa ccuggccgcc accaagauga gcgagugcgu gcugggccag     3360
          agcaagcggg uggacuucug cggcaagggc uaccaccuga ugagcuuucc ccagagcgca     3420
          ccccacggag ugguguuccu gcacgugacc uacgugcccg cccaggagaa gaacuucacc     3480
          accgccccag ccaucugcca cgacggcaag gcccacuuuc cccgggaggg cguguucgug     3540
          agcaacggca cccacugguu cgugacccag cggaacuucu acgagcccca gaucaucacc     3600
          accgacaaca ccuucgugag cggcaacugc gacgugguga ucggcaucgu gaacaacacc     3660
          guguacgauc cccugcagcc cgagcuggac agcuucaagg aggagcugga caaguacuuc     3720
          aagaaucaca ccagccccga cguggaccug ggcgacauca gcggcaucaa cgccagcgug     3780
          gugaacaucc agaaggagau cgaucggcug aacgaggugg ccaagaaccu gaacgagagc     3840
          cugaucgacc ugcaggagcu gggcaaguac gagcaguaca ucaaguggcc cugguacauc     3900
          uggcugggcu ucaucgccgg ccugaucgcc aucgugaugg ugaccaucau gcugugcugc     3960
          augaccagcu gcugcagcug ccugaagggc uguugcagcu gcggcagcug cugcaaguuc     4020
          gacgaggacg acagcgagcc cgugcugaag ggcgugaagc ugcacuacac c              4071
          <![CDATA[<210>  41]]>
          <![CDATA[<211>  1357]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  41]]>
          Met Phe Leu Ile Leu Leu Ile Ser Leu Pro Thr Ala Phe Ala Val Ile 
          1               5                   10                  15      
          Gly Asp Leu Lys Cys Thr Ser Asp Asn Ile Asn Asp Lys Asp Thr Gly 
                      20                  25                  30          
          Pro Pro Pro Ile Ser Thr Asp Thr Val Asp Val Thr Asn Gly Leu Gly 
                  35                  40                  45              
          Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Thr Thr Leu Phe Leu 
              50                  55                  60                  
          Asn Gly Tyr Tyr Pro Thr Ser Gly Ser Thr Tyr Arg Asn Met Ala Leu 
          65                  70                  75                  80  
          Lys Gly Ser Val Leu Leu Ser Arg Leu Trp Phe Lys Pro Pro Phe Leu 
                          85                  90                  95      
          Ser Asp Phe Ile Asn Gly Ile Phe Ala Lys Val Lys Asn Thr Lys Val 
                      100                 105                 110         
          Ile Lys Asp Arg Val Met Tyr Ser Glu Phe Pro Ala Ile Thr Ile Gly 
                  115                 120                 125             
          Ser Thr Phe Val Asn Thr Ser Tyr Ser Val Val Val Gln Pro Arg Thr 
              130                 135                 140                 
          Ile Asn Ser Thr Gln Asp Gly Asp Asn Lys Leu Gln Gly Leu Leu Glu 
          145                 150                 155                 160 
          Val Ser Val Cys Gln Tyr Asn Met Cys Glu Tyr Pro Gln Thr Ile Cys 
                          165                 170                 175     
          His Pro Asn Leu Gly Asn His Arg Lys Glu Leu Trp His Leu Asp Thr 
                      180                 185                 190         
          Gly Val Val Ser Cys Leu Tyr Lys Arg Asn Phe Thr Tyr Asp Val Asn 
                  195                 200                 205             
          Ala Asp Tyr Leu Tyr Phe His Phe Tyr Gln Glu Gly Gly Thr Phe Tyr 
              210                 215                 220                 
          Ala Tyr Phe Thr Asp Thr Gly Val Val Thr Lys Phe Leu Phe Asn Val 
          225                 230                 235                 240 
          Tyr Leu Gly Met Ala Leu Ser His Tyr Tyr Val Met Pro Leu Thr Cys 
                          245                 250                 255     
          Asn Ser Lys Leu Thr Leu Glu Tyr Trp Val Thr Pro Leu Thr Ser Arg 
                      260                 265                 270         
          Gln Tyr Leu Leu Ala Phe Asn Gln Asp Gly Ile Ile Phe Asn Ala Val 
                  275                 280                 285             
          Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser 
              290                 295                 300                 
          Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Asn Gly Tyr Thr Val Gln 
          305                 310                 315                 320 
          Pro Ile Ala Asp Val Tyr Arg Arg Lys Pro Asn Leu Pro Asn Cys Asn 
                          325                 330                 335     
          Ile Glu Ala Trp Leu Asn Asp Lys Ser Val Pro Ser Pro Leu Asn Trp 
                      340                 345                 350         
          Glu Arg Lys Thr Phe Ser Asn Cys Asn Phe Asn Met Ser Ser Leu Met 
                  355                 360                 365             
          Ser Phe Ile Gln Ala Asp Ser Phe Thr Cys Asn Asn Ile Asp Ala Ala 
              370                 375                 380                 
          Lys Ile Tyr Gly Met Cys Phe Ser Ser Ile Thr Ile Asp Lys Phe Ala 
          385                 390                 395                 400 
          Ile Pro Asn Gly Arg Lys Val Asp Leu Gln Leu Gly Asn Leu Gly Tyr 
                          405                 410                 415     
          Leu Gln Ser Phe Asn Tyr Arg Ile Asp Thr Thr Ala Thr Ser Cys Gln 
                      420                 425                 430         
          Leu Tyr Tyr Asn Leu Pro Ala Ala Asn Val Ser Val Ser Arg Phe Asn 
                  435                 440                 445             
          Pro Ser Thr Trp Asn Lys Arg Phe Gly Phe Ile Glu Asp Ser Val Phe 
              450                 455                 460                 
          Lys Pro Arg Pro Ala Gly Val Leu Thr Asn His Asp Val Val Tyr Ala 
          465                 470                 475                 480 
          Gln His Cys Phe Lys Ala Pro Lys Asn Phe Cys Pro Cys Lys Leu Asn 
                          485                 490                 495     
          Gly Ser Cys Val Gly Ser Gly Pro Gly Lys Asn Asn Gly Ile Gly Thr 
                      500                 505                 510         
          Cys Pro Ala Gly Thr Asn Tyr Leu Thr Cys Asp Asn Leu Cys Thr Pro 
                  515                 520                 525             
          Asp Pro Ile Thr Phe Thr Gly Thr Tyr Lys Cys Pro Gln Thr Lys Ser 
              530                 535                 540                 
          Leu Val Gly Ile Gly Glu His Cys Ser Gly Leu Ala Val Lys Ser Asp 
          545                 550                 555                 560 
          Tyr Cys Gly Gly Asn Ser Cys Thr Cys Arg Pro Gln Ala Phe Leu Gly 
                          565                 570                 575     
          Trp Ser Ala Asp Ser Cys Leu Gln Gly Asp Lys Cys Asn Ile Phe Ala 
                      580                 585                 590         
          Asn Phe Ile Leu His Asp Val Asn Ser Gly Leu Thr Cys Ser Thr Asp 
                  595                 600                 605             
          Leu Gln Lys Ala Asn Thr Asp Ile Ile Leu Gly Val Cys Val Asn Tyr 
              610                 615                 620                 
          Asp Leu Tyr Gly Ile Leu Gly Gln Gly Ile Phe Val Glu Val Asn Ala 
          625                 630                 635                 640 
          Thr Tyr Tyr Asn Ser Trp Gln Asn Leu Leu Tyr Asp Ser Asn Gly Asn 
                          645                 650                 655     
          Leu Tyr Gly Phe Arg Asp Tyr Ile Ile Asn Arg Thr Phe Met Ile Arg 
                      660                 665                 670         
          Ser Cys Tyr Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr 
                  675                 680                 685             
          Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val 
              690                 695                 700                 
          Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr 
          705                 710                 715                 720 
          Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly 
                          725                 730                 735     
          Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala 
                      740                 745                 750         
          Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala 
                  755                 760                 765             
          Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly 
              770                 775                 780                 
          Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr 
          785                 790                 795                 800 
          Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr 
                          805                 810                 815     
          Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu 
                      820                 825                 830         
          Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn 
                  835                 840                 845             
          Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu 
              850                 855                 860                 
          Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp 
          865                 870                 875                 880 
          Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro 
                          885                 890                 895     
          Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu 
                      900                 905                 910         
          Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile 
                  915                 920                 925             
          Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val 
              930                 935                 940                 
          Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala 
          945                 950                 955                 960 
          Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala 
                          965                 970                 975     
          Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly 
                      980                 985                 990         
          Ile Gly Val Thr Gln Asn Val Leu  Tyr Glu Asn Gln Lys  Leu Ile Ala 
                  995                 1000                 1005             
          Asn Gln  Phe Asn Ser Ala Ile  Gly Lys Ile Gln Asp  Ser Leu Ser 
              1010                 1015                 1020             
          Ser Thr  Ala Ser Ala Leu Gly  Lys Leu Gln Asp Val  Val Asn Gln 
              1025                 1030                 1035             
          Asn Ala  Gln Ala Leu Asn Thr  Leu Val Lys Gln Leu  Ser Ser Asn 
              1040                 1045                 1050             
          Phe Gly  Ala Ile Ser Ser Val  Leu Asn Asp Ile Leu  Ser Arg Leu 
              1055                 1060                 1065             
          Asp Pro  Pro Glu Ala Glu Val  Gln Ile Asp Arg Leu  Ile Thr Gly 
              1070                 1075                 1080             
          Arg Leu  Gln Ser Leu Gln Thr  Tyr Val Thr Gln Gln  Leu Ile Arg 
              1085                 1090                 1095             
          Ala Ala  Glu Ile Arg Ala Ser  Ala Asn Leu Ala Ala  Thr Lys Met 
              1100                 1105                 1110             
          Ser Glu  Cys Val Leu Gly Gln  Ser Lys Arg Val Asp  Phe Cys Gly 
              1115                 1120                 1125             
          Lys Gly  Tyr His Leu Met Ser  Phe Pro Gln Ser Ala  Pro His Gly 
              1130                 1135                 1140             
          Val Val  Phe Leu His Val Thr  Tyr Val Pro Ala Gln  Glu Lys Asn 
              1145                 1150                 1155             
          Phe Thr  Thr Ala Pro Ala Ile  Cys His Asp Gly Lys  Ala His Phe 
              1160                 1165                 1170             
          Pro Arg  Glu Gly Val Phe Val  Ser Asn Gly Thr His  Trp Phe Val 
              1175                 1180                 1185             
          Thr Gln  Arg Asn Phe Tyr Glu  Pro Gln Ile Ile Thr  Thr Asp Asn 
              1190                 1195                 1200             
          Thr Phe  Val Ser Gly Asn Cys  Asp Val Val Ile Gly  Ile Val Asn 
              1205                 1210                 1215             
          Asn Thr  Val Tyr Asp Pro Leu  Gln Pro Glu Leu Asp  Ser Phe Lys 
              1220                 1225                 1230             
          Glu Glu  Leu Asp Lys Tyr Phe  Lys Asn His Thr Ser  Pro Asp Val 
              1235                 1240                 1245             
          Asp Leu  Gly Asp Ile Ser Gly  Ile Asn Ala Ser Val  Val Asn Ile 
              1250                 1255                 1260             
          Gln Lys  Glu Ile Asp Arg Leu  Asn Glu Val Ala Lys  Asn Leu Asn 
              1265                 1270                 1275             
          Glu Ser  Leu Ile Asp Leu Gln  Glu Leu Gly Lys Tyr  Glu Gln Tyr 
              1280                 1285                 1290             
          Ile Lys  Trp Pro Trp Tyr Ile  Trp Leu Gly Phe Ile  Ala Gly Leu 
              1295                 1300                 1305             
          Ile Ala  Ile Val Met Val Thr  Ile Met Leu Cys Cys  Met Thr Ser 
              1310                 1315                 1320             
          Cys Cys  Ser Cys Leu Lys Gly  Cys Cys Ser Cys Gly  Ser Cys Cys 
              1325                 1330                 1335             
          Lys Phe  Asp Glu Asp Asp Ser  Glu Pro Val Leu Lys  Gly Val Lys 
              1340                 1345                 1350             
          Leu His  Tyr Thr 
              1355         
          <![CDATA[<210>  42]]>
          <![CDATA[<211>  1157]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  42]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacucu ggcggaggcg gcagcgccau cggcggcuac      960
          auccccgagg ccccuagaga cggccaggcc uacgugcgga aggacggcga gugggugcug     1020
          cugagcaccu uccugggcug auaauaggcu ggagccucgg uggccuagcu ucuugccccu     1080
          ugggccuccc cccagccccu ccuccccuuc cugcacccgu acccccgugg ucuuugaaua     1140
          aagucugagu gggcggc                                                    1157
          <![CDATA[<210>  43]]>
          <![CDATA[<211>  981]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  43]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcggcagcgc caucggcggc      900
          uacauccccg aggccccuag agacggccag gccuacgugc ggaaggacgg cgagugggug      960
          cugcugagca ccuuccuggg c                                                981
          <![CDATA[<210>  44]]>
          <![CDATA[<211>  327]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  44]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu 
              290                 295                 300                 
          Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val 
          305                 310                 315                 320 
          Leu Leu Ser Thr Phe Leu Gly 
                          325         
          <![CDATA[<210>  45]]>
          <![CDATA[<211>  1130]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  45]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc      960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cugauaauag     1020
          gcuggagccu cgguggccua gcuucuugcc ccuugggccu ccccccagcc ccuccucccc     1080
          uuccugcacc cguacccccg uggucuuuga auaaagucug agugggcggc                1130
          <![CDATA[<210>  46]]>
          <![CDATA[<211>  954]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  46]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac      900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuuc            954
          <![CDATA[<210>  47]]>
          <![CDATA[<211>  318]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  47]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala 
              290                 295                 300                 
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
          305                 310                 315             
          <![CDATA[<210>  48]]>
          <![CDATA[<211>  1238]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  48]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc      960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cggcggaggc     1020
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag     1080
          gacggcgagu gggugcugcu gagcaccuuc cugggcaagu gauaauaggc uggagccucg     1140
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg     1200
          uacccccgug gucuuugaau aaagucugag ugggcggc                             1238
          <![CDATA[<210>  49]]>
          <![CDATA[<211>  1062]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  49]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac      900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucggcgga      960
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg     1020
          aaggacggcg agugggugcu gcugagcacc uuccugggca ag                        1062
          <![CDATA[<210>  50]]>
          <![CDATA[<211>  354]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  50]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala 
              290                 295                 300                 
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Gly Gly 
          305                 310                 315                 320 
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln 
                          325                 330                 335     
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
                      340                 345                 350         
          Gly Lys 
          <![CDATA[<210>  51]]>
          <![CDATA[<211>  1130]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  51]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc      960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cugauaauag     1020
          gcuggagccu cgguggccua gcuucuugcc ccuugggccu ccccccagcc ccuccucccc     1080
          uuccugcacc cguacccccg uggucuuuga auaaagucug agugggcggc                1130
          <![CDATA[<210>  52]]>
          <![CDATA[<211>  954]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  52]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac      900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuuc            954
          <![CDATA[<210>  53]]>
          <![CDATA[<211>  318]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  53]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala 
              290                 295                 300                 
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
          305                 310                 315             
          <![CDATA[<210>  54]]>
          <![CDATA[<211>  1181]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  54]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccuc uggcggaggc agcauccugg ccaucuacag caccguggcc     1020
          agcagccugg ugcugcuggu gagccugggc gccaucagcu ucugauaaua ggcuggagcc     1080
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac     1140
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c                         1181
          <![CDATA[<210>  55]]>
          <![CDATA[<211>  1005]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  55]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cucuggcgga ggcagcaucc uggccaucua cagcaccgug      960
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuuc                     1005
          <![CDATA[<210>  56]]>
          <![CDATA[<211>  335]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  56]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val 
          305                 310                 315                 320 
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
                          325                 330                 335 
          <![CDATA[<210>  57]]>
          <![CDATA[<211>  1181]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  57]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccuc uggcggaggc agcauccugg ccaucuacag caccguggcc     1020
          agcagccugg ugcugcuggu gagccugggc gccaucagcu ucugauaaua ggcuggagcc     1080
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac     1140
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c                         1181
          <![CDATA[<210>  58]]>
          <![CDATA[<211>  1005]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  58]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cucuggcgga ggcagcaucc uggccaucua cagcaccgug      960
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuuc                     1005
          <![CDATA[<210>  59]]>
          <![CDATA[<211>  335]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  59]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val 
          305                 310                 315                 320 
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
                          325                 330                 335 
          <![CDATA[<210>  60]]>
          <![CDATA[<211>  836]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  60]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaguga      720
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc      780
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc          836
          <![CDATA[<210>  61]]>
          <![CDATA[<211>  660]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  61]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          <![CDATA[<210>  62]]>
          <![CDATA[<211>  220]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  62]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
              210                 215                 220 
          <![CDATA[<210>  63]]>
          <![CDATA[<211>  1349]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  63]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaggga      720
          ggaggcagcg gcggcgauau caucaagcuu cugaacgagc aaguuaacaa ggaaaugcag      780
          agcaguaauc ucuacaugag caugagcagc uggugcuaca cccacucccu ggacggagca      840
          ggccucuucc uguucgacca cgcagccgag gaguacgagc acgcuaagaa guugaucauu      900
          uucuugaacg agaacaacgu gcccgugcag cuaacgucaa ucagcgcacc ugagcacaag      960
          uucgagggcc ugacccagau cuuccagaag gccuacgaac acgaacagca caucuccgag     1020
          agcaucaaca auauugugga ucacgcuauc aaguccaagg accacgcuac cuucaacuuc     1080
          cugcaguggu acguggccga gcaacaugag gaggaggugc uguucaagga cauccuggac     1140
          aagaucgagc ugaucgguaa ugagaaucac ggccuguacc uggccgacca guacgugaag     1200
          ggcaucgcca agagccggaa gucaggcuca ugauaauagg cuggagccuc gguggccuag     1260
          cuucuugccc cuugggccuc cccccagccc cuccuccccu uccugcaccc guacccccgu     1320
          ggucuuugaa uaaagucuga gugggcggc                                       1349
          <![CDATA[<210>  64]]>
          <![CDATA[<211>  1173]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  64]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ggaggaggca gcggcggcga uaucaucaag cuucugaacg agcaaguuaa caaggaaaug      720
          cagagcagua aucucuacau gagcaugagc agcuggugcu acacccacuc ccuggacgga      780
          gcaggccucu uccuguucga ccacgcagcc gaggaguacg agcacgcuaa gaaguugauc      840
          auuuucuuga acgagaacaa cgugcccgug cagcuaacgu caaucagcgc accugagcac      900
          aaguucgagg gccugaccca gaucuuccag aaggccuacg aacacgaaca gcacaucucc      960
          gagagcauca acaauauugu ggaucacgcu aucaagucca aggaccacgc uaccuucaac     1020
          uuccugcagu gguacguggc cgagcaacau gaggaggagg ugcuguucaa ggacauccug     1080
          gacaagaucg agcugaucgg uaaugagaau cacggccugu accuggccga ccaguacgug     1140
          aagggcaucg ccaagagccg gaagucaggc uca                                  1173
          <![CDATA[<210>  65]]>
          <![CDATA[<211>  391]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  65]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Gly Gly Gly Ser 
              210                 215                 220                 
          Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met 
          225                 230                 235                 240 
          Gln Ser Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His 
                          245                 250                 255     
          Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu 
                      260                 265                 270         
          Tyr Glu His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val 
                  275                 280                 285             
          Pro Val Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly 
              290                 295                 300                 
          Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser 
          305                 310                 315                 320 
          Glu Ser Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His 
                          325                 330                 335     
          Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu 
                      340                 345                 350         
          Glu Val Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn 
                  355                 360                 365             
          Glu Asn His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala 
              370                 375                 380                 
          Lys Ser Arg Lys Ser Gly Ser 
          385                 390     
          <![CDATA[<210>  66]]>
          <![CDATA[<211>  1376]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  66]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          ggcauccugc ccagcccugg caugcccgcu cugcugagcc uggugagccu gcugagcgug      120
          cugcugaugg gcugcguggc ugagaccggc augcagaucu acgagggcaa gcugaccgca      180
          gagggccugc gguucggcau cguggccagc cgcgccaacc acgcucuggu ggaccggcuu      240
          guggagggcg cuaucgacgc caucgugaga cacggcggcc gggaagagga caucacccug      300
          gugcgggugu gcggcagcug ggagauuccc gucgccgccg gagaacuggc ccggaaggag      360
          gacaucgacg ccgugaucgc caucggcgug cugugcagag gcgccacgcc cagcuucgac      420
          uacaucgcca gcgaggugag caagggccug gccgaccuga gccuggagcu gcggaagccc      480
          aucaccuucg gcgugaucac cgccgacacc cuggagcagg ccaucgaggc cgcaggcacc      540
          ugccacggca acaagggcug ggaagccgcc cugugcgcca ucgagauggc caaccuguuc      600
          aagagccugc ggggcggaag uggaggcucu gguggcagcg gaggaucugg cggcggccag      660
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      720
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      780
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      840
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      900
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      960
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac     1020
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc     1080
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac     1140
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg     1200
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaguga     1260
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc     1320
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc         1376
          <![CDATA[<210>  67]]>
          <![CDATA[<211>  1200]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  67]]>
          augggcaucc ugcccagccc uggcaugccc gcucugcuga gccuggugag ccugcugagc       60
          gugcugcuga ugggcugcgu ggcugagacc ggcaugcaga ucuacgaggg caagcugacc      120
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg      180
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc      240
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag      300
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc      360
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag      420
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc      480
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug      540
          uucaagagcc ugcggggcgg aaguggaggc ucugguggca gcggaggauc uggcggcggc      600
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      660
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      720
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      780
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      840
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      900
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      960
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc     1020
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc     1080
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac     1140
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag     1200
          <![CDATA[<210>  68]]>
          <![CDATA[<211>  400]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  68]]>
          Met Gly Ile Leu Pro Ser Pro Gly Met Pro Ala Leu Leu Ser Leu Val 
          1               5                   10                  15      
          Ser Leu Leu Ser Val Leu Leu Met Gly Cys Val Ala Glu Thr Gly Met 
                      20                  25                  30          
          Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly Ile 
                  35                  40                  45              
          Val Ala Ser Arg Ala Asn His Ala Leu Val Asp Arg Leu Val Glu Gly 
              50                  55                  60                  
          Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile Thr 
          65                  70                  75                  80  
          Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro Val Ala Ala Gly Glu 
                          85                  90                  95      
          Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val Leu 
                      100                 105                 110         
          Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile Ala Ser Glu Val Ser 
                  115                 120                 125             
          Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr Phe 
              130                 135                 140                 
          Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Ala Ala Gly 
          145                 150                 155                 160 
          Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Cys Ala Ile Glu 
                          165                 170                 175     
          Met Ala Asn Leu Phe Lys Ser Leu Arg Gly Gly Ser Gly Gly Ser Gly 
                      180                 185                 190         
          Gly Ser Gly Gly Ser Gly Gly Gly Gln Pro Asn Ile Thr Asn Leu Cys 
                  195                 200                 205             
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
              210                 215                 220                 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
          225                 230                 235                 240 
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                          245                 250                 255     
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                      260                 265                 270         
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                  275                 280                 285             
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
              290                 295                 300                 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
          305                 310                 315                 320 
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                          325                 330                 335     
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                      340                 345                 350         
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                  355                 360                 365             
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
              370                 375                 380                 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
          385                 390                 395                 400 
          <![CDATA[<210>  69]]>
          <![CDATA[<211>  1334]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  69]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaggga      720
          ggaggcuccg gaggcgguag cgcugagacc ggcaugcaga ucuacgaggg caagcugacc      780
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg      840
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc      900
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag      960
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc     1020
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag     1080
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc     1140
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug     1200
          uucaagagcc ugcggugaua auaggcugga gccucggugg ccuagcuucu ugccccuugg     1260
          gccucccccc agccccuccu ccccuuccug cacccguacc cccguggucu uugaauaaag     1320
          ucugaguggg cggc                                                       1334
          <![CDATA[<210>  70]]>
          <![CDATA[<211>  1158]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  70]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ggaggaggcu ccggaggcgg uagcgcugag accggcaugc agaucuacga gggcaagcug      720
          accgcagagg gccugcgguu cggcaucgug gccagccgcg ccaaccacgc ucugguggac      780
          cggcuugugg agggcgcuau cgacgccauc gugagacacg gcggccggga agaggacauc      840
          acccuggugc gggugugcgg cagcugggag auucccgucg ccgccggaga acuggcccgg      900
          aaggaggaca ucgacgccgu gaucgccauc ggcgugcugu gcagaggcgc cacgcccagc      960
          uucgacuaca ucgccagcga ggugagcaag ggccuggccg accugagccu ggagcugcgg     1020
          aagcccauca ccuucggcgu gaucaccgcc gacacccugg agcaggccau cgaggccgca     1080
          ggcaccugcc acggcaacaa gggcugggaa gccgcccugu gcgccaucga gauggccaac     1140
          cuguucaaga gccugcgg                                                   1158
          <![CDATA[<210>  71]]>
          <![CDATA[<211>  386]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  71]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Gly Gly Gly Ser 
              210                 215                 220                 
          Gly Gly Gly Ser Ala Glu Thr Gly Met Gln Ile Tyr Glu Gly Lys Leu 
          225                 230                 235                 240 
          Thr Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Ala Asn His 
                          245                 250                 255     
          Ala Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Ala Ile Val Arg 
                      260                 265                 270         
          His Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Cys Gly Ser 
                  275                 280                 285             
          Trp Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile 
              290                 295                 300                 
          Asp Ala Val Ile Ala Ile Gly Val Leu Cys Arg Gly Ala Thr Pro Ser 
          305                 310                 315                 320 
          Phe Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asp Leu Ser 
                          325                 330                 335     
          Leu Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr 
                      340                 345                 350         
          Leu Glu Gln Ala Ile Glu Ala Ala Gly Thr Cys His Gly Asn Lys Gly 
                  355                 360                 365             
          Trp Glu Ala Ala Leu Cys Ala Ile Glu Met Ala Asn Leu Phe Lys Ser 
              370                 375                 380                 
          Leu Arg 
          385     
          <![CDATA[<210>  72]]>
          <![CDATA[<211>  947]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  72]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu      720
          ggcggaggcg gcagcgccau cggcggcuac auccccgagg ccccuagaga cggccaggcc      780
          uacgugcgga aggacggcga gugggugcug cugagcaccu uccugggcug auaauaggcu      840
          ggagccucgg uggccuagcu ucuugccccu ugggccuccc cccagccccu ccuccccuuc      900
          cugcacccgu acccccgugg ucuuugaaua aagucugagu gggcggc                    947
          <![CDATA[<210>  73]]>
          <![CDATA[<211>  771]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  73]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ucuggcggag gcggcagcgc caucggcggc uacauccccg aggccccuag agacggccag      720
          gccuacgugc ggaaggacgg cgagugggug cugcugagca ccuuccuggg c               771
          <![CDATA[<210>  74]]>
          <![CDATA[<211>  257]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  74]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly 
              210                 215                 220                 
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln 
          225                 230                 235                 240 
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
                          245                 250                 255     
          Gly 
          <![CDATA[<210>  75]]>
          <![CDATA[<211>  920]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  75]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu      720
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug      780
          agccugggcg ccaucagcuu cugauaauag gcuggagccu cgguggccua gcuucuugcc      840
          ccuugggccu ccccccagcc ccuccucccc uuccugcacc cguacccccg uggucuuuga      900
          auaaagucug agugggcggc                                                  920
          <![CDATA[<210>  76]]>
          <![CDATA[<211>  744]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  76]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug      720
          gugagccugg gcgccaucag cuuc                                             744
          <![CDATA[<210>  77]]>
          <![CDATA[<211>  248]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  77]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly 
              210                 215                 220                 
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu 
          225                 230                 235                 240 
          Val Ser Leu Gly Ala Ile Ser Phe 
                          245             
          <![CDATA[<210>  78]]>
          <![CDATA[<211>  1025]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  78]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu      720
          ggcggaggcg gcagcgccau cggcggcuac auccccgagg ccccuagaga cggccaggcc      780
          uacgugcgga aggacggcga gugggugcug cugagcaccu uccugggcgg aggcagcauc      840
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc      900
          agcuucugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc      960
          cagccccucc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg     1020
          gcggc                                                                 1025
          <![CDATA[<210>  79]]>
          <![CDATA[<211>  849]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  79]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ucuggcggag gcggcagcgc caucggcggc uacauccccg aggccccuag agacggccag      720
          gccuacgugc ggaaggacgg cgagugggug cugcugagca ccuuccuggg cggaggcagc      780
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc      840
          aucagcuuc                                                              849
          <![CDATA[<210>  80]]>
          <![CDATA[<211>  283]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  80]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly 
              210                 215                 220                 
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln 
          225                 230                 235                 240 
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 
                          245                 250                 255     
          Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu 
                      260                 265                 270         
          Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
                  275                 280             
          <![CDATA[<210>  81]]>
          <![CDATA[<211>  1028]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  81]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu      720
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug      780
          agccugggcg ccaucagcuu cggcggaggc agcgccaucg gcggcuacau ccccgaggcc      840
          ccuagagacg gccaggccua cgugcggaag gacggcgagu gggugcugcu gagcaccuuc      900
          cugggcaagu gauaauaggc uggagccucg guggccuagc uucuugcccc uugggccucc      960
          ccccagcccc uccuccccuu ccugcacccg uacccccgug gucuuugaau aaagucugag     1020
          ugggcggc                                                              1028
          <![CDATA[<210>  82]]>
          <![CDATA[<211>  852]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  82]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug      720
          gugagccugg gcgccaucag cuucggcgga ggcagcgcca ucggcggcua cauccccgag      780
          gccccuagag acggccaggc cuacgugcgg aaggacggcg agugggugcu gcugagcacc      840
          uuccugggca ag                                                          852
          <![CDATA[<210>  83]]>
          <![CDATA[<211>  284]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  83]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly 
              210                 215                 220                 
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu 
          225                 230                 235                 240 
          Val Ser Leu Gly Ala Ile Ser Phe Gly Gly Gly Ser Ala Ile Gly Gly 
                          245                 250                 255     
          Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp 
                      260                 265                 270         
          Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Lys 
                  275                 280                 
          <![CDATA[<210>  84]]>
          <![CDATA[<211>  1142]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  84]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc      180
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc      240
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag      300
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac      360
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc      420
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc      480
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag      540
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc      600
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag      660
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug      720
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc      780
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa      840
          uuccugcccu uucagcaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau      900
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu cuggcggagg cagcauccug      960
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc     1020
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag     1080
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg     1140
          gc                                                                    1142
          <![CDATA[<210>  85]]>
          <![CDATA[<211>  966]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  85]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcgggugc agcccaccga gagcaucgug cgguucccca acaucaccaa ccugugcccc      120
          uucggcgagg uguucaacgc cacccgguuc gccagcgugu acgccuggaa ccggaagcgg      180
          aucagcaacu gcguggccga cuacagcgug cuguacaaca gcgccagcuu cagcaccuuc      240
          aagugcuacg gcgugagccc caccaagcug aacgaccugu gcuucaccaa cguguacgcc      300
          gacagcuucg ugauccgugg cgacgaggug cggcagaucg cacccggcca gacaggcaag      360
          aucgccgacu acaacuacaa gcugcccgac gacuucaccg gcugcgugau cgccuggaac      420
          agcaacaacc ucgacagcaa ggugggcggc aacuacaacu accuguaccg gcuguuccgg      480
          aagagcaacc ugaagcccuu cgagcgggac aucagcaccg agaucuacca agccggcucc      540
          accccuugca acggcgugga gggcuucaac ugcuacuucc cucugcagag cuacggcuuc      600
          cagcccacca acggcguggg cuaccagccc uaccgggugg uggugcugag cuucgagcug      660
          cugcacgccc cagccaccgu guguggcccc aagaagagca ccaaccuggu gaagaacaag      720
          ugcgugaacu ucaacuucaa cggccuuacc ggcaccggcg ugcugaccga gagcaacaag      780
          aaauuccugc ccuuucagca guucggccgg gacaucgccg acaccaccga cgcugugcgg      840
          gauccccaga cccuggagau ccuggacauc accccuugca gcucuggcgg aggcagcauc      900
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc      960
          agcuuc                                                                 966
          <![CDATA[<210>  86]]>
          <![CDATA[<211>  322]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  86]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe 
                      20                  25                  30          
          Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr 
                  35                  40                  45              
          Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys 
              50                  55                  60                  
          Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe 
          65                  70                  75                  80  
          Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr 
                          85                  90                  95      
          Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln 
                      100                 105                 110         
          Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu 
                  115                 120                 125             
          Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu 
              130                 135                 140                 
          Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg 
          145                 150                 155                 160 
          Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr 
                          165                 170                 175     
          Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr 
                      180                 185                 190         
          Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr 
                  195                 200                 205             
          Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro 
              210                 215                 220                 
          Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys 
          225                 230                 235                 240 
          Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr 
                          245                 250                 255     
          Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile 
                      260                 265                 270         
          Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu 
                  275                 280                 285             
          Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr 
              290                 295                 300                 
          Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile 
          305                 310                 315                 320 
          Ser Phe 
          <![CDATA[<210>  87]]>
          <![CDATA[<211>  1142]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  87]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc      180
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc      240
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag      300
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac      360
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc      420
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc      480
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag      540
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc      600
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag      660
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug      720
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc      780
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa      840
          uuccugcccu uuugccaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau      900
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu cuggcggagg cagcauccug      960
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc     1020
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag     1080
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg     1140
          gc                                                                    1142
          <![CDATA[<210>  88]]>
          <![CDATA[<211>  966]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  88]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcgggugc agcccaccga gagcaucgug cgguucccca acaucaccaa ccugugcccc      120
          uucggcgagg uguucaacgc cacccgguuc gccagcgugu acgccuggaa ccggaagcgg      180
          aucagcaacu gcguggccga cuacagcgug cuguacaaca gcgccagcuu cagcaccuuc      240
          aagugcuacg gcgugagccc caccaagcug aacgaccugu gcuucaccaa cguguacgcc      300
          gacagcuucg ugauccgugg cgacgaggug cggcagaucg cacccggcca gacaggcaag      360
          aucgccgacu acaacuacaa gcugcccgac gacuucaccg gcugcgugau cgccuggaac      420
          agcaacaacc ucgacagcaa ggugggcggc aacuacaacu accuguaccg gcuguuccgg      480
          aagagcaacc ugaagcccuu cgagcgggac aucagcaccg agaucuacca agccggcucc      540
          accccuugca acggcgugga gggcuucaac ugcuacuucc cucugcagag cuacggcuuc      600
          cagcccacca acggcguggg cuaccagccc uaccgggugg uggugcugag cuucgagcug      660
          cugcacgccc cagccaccgu guguggcccc aagaagagca ccaaccuggu gaagaacaag      720
          ugcgugaacu ucaacuucaa cggccuuacc ggcaccggcg ugcugaccga gagcaacaag      780
          aaauuccugc ccuuuugcca guucggccgg gacaucgccg acaccaccga cgcugugcgg      840
          gauccccaga cccuggagau ccuggacauc accccuugca gcucuggcgg aggcagcauc      900
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc      960
          agcuuc                                                                 966
          <![CDATA[<210>  89]]>
          <![CDATA[<211>  322]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  89]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe 
                      20                  25                  30          
          Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr 
                  35                  40                  45              
          Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys 
              50                  55                  60                  
          Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe 
          65                  70                  75                  80  
          Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr 
                          85                  90                  95      
          Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln 
                      100                 105                 110         
          Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu 
                  115                 120                 125             
          Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu 
              130                 135                 140                 
          Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg 
          145                 150                 155                 160 
          Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr 
                          165                 170                 175     
          Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr 
                      180                 185                 190         
          Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr 
                  195                 200                 205             
          Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro 
              210                 215                 220                 
          Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys 
          225                 230                 235                 240 
          Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr 
                          245                 250                 255     
          Glu Ser Asn Lys Lys Phe Leu Pro Phe Cys Gln Phe Gly Arg Asp Ile 
                      260                 265                 270         
          Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu 
                  275                 280                 285             
          Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr 
              290                 295                 300                 
          Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile 
          305                 310                 315                 320 
          Ser Phe 
          <![CDATA[<210>  90]]>
          <![CDATA[<211>  1748]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  90]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc     1620
          aucagcuucu gauaauaggc uggagccucg guggccuagc uucuugcccc uugggccucc     1680
          ccccagcccc uccuccccuu ccugcacccg uacccccgug gucuuugaau aaagucugag     1740
          ugggcggc                                                              1748
          <![CDATA[<210>  91]]>
          <![CDATA[<211>  1572]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  91]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc     1560
          gccaucagcu uc                                                         1572
          <![CDATA[<210>  92]]>
          <![CDATA[<211>  524]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  92]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser 
                      500                 505                 510         
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
                  515                 520                 
          <![CDATA[<210>  93]]>
          <![CDATA[<211>  1925]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  93]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc     1620
          aucagcuuca agaagaagaa gcggccacgg aacuccuaca agugcggcac caacaccaug     1680
          gagcgggagg agagcgagca gaccaagaag cgggagaaga uccacauucc ugaacggucc     1740
          gacgaagccc agcggguguu caagagcagc aagaccagca gcugcgacaa gagcgacacc     1800
          ugcuucugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc     1860
          cagccccucc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg     1920
          gcggc                                                                 1925
          <![CDATA[<210>  94]]>
          <![CDATA[<211>  1749]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  94]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc     1560
          gccaucagcu ucaagaagaa gaagcggcca cggaacuccu acaagugcgg caccaacacc     1620
          auggagcggg aggagagcga gcagaccaag aagcgggaga agauccacau uccugaacgg     1680
          uccgacgaag cccagcgggu guucaagagc agcaagacca gcagcugcga caagagcgac     1740
          accugcuuc                                                             1749
          <![CDATA[<210>  95]]>
          <![CDATA[<211>  583]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  95]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser 
                      500                 505                 510         
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Lys Lys Lys Lys 
                  515                 520                 525             
          Arg Pro Arg Asn Ser Tyr Lys Cys Gly Thr Asn Thr Met Glu Arg Glu 
              530                 535                 540                 
          Glu Ser Glu Gln Thr Lys Lys Arg Glu Lys Ile His Ile Pro Glu Arg 
          545                 550                 555                 560 
          Ser Asp Glu Ala Gln Arg Val Phe Lys Ser Ser Lys Thr Ser Ser Cys 
                          565                 570                 575     
          Asp Lys Ser Asp Thr Cys Phe 
                      580             
          <![CDATA[<210>  96]]>
          <![CDATA[<211>  1805]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  96]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc     1620
          aucagcuuca agcggcagua caaggacaug augagcgagg gaggaccacc uggcgcugag     1680
          ccacagugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc     1740
          cagccccucc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg     1800
          gcggc                                                                 1805
          <![CDATA[<210>  97]]>
          <![CDATA[<211>  1629]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  97]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc     1560
          gccaucagcu ucaagcggca guacaaggac augaugagcg agggaggacc accuggcgcu     1620
          gagccacag                                                             1629
          <![CDATA[<210>  98]]>
          <![CDATA[<211>  543]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  98]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser 
                      500                 505                 510         
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Lys Arg Gln Tyr 
                  515                 520                 525             
          Lys Asp Met Met Ser Glu Gly Gly Pro Pro Gly Ala Glu Pro Gln 
              530                 535                 540             
          <![CDATA[<210>  99]]>
          <![CDATA[<211>  1853]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  99]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcggc     1560
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag     1620
          gacggcgagu gggugcugcu gagcaccuuc cugggcggag gcagcauccu ggccaucuac     1680
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa     1740
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccuccuc     1800
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc            1853
          <![CDATA[<210>  100]]>
          <![CDATA[<211>  1677]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  100]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg     1560
          aaggacggcg agugggugcu gcugagcacc uuccugggcg gaggcagcau ccuggccauc     1620
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc        1677
          <![CDATA[<210>  101]]>
          <![CDATA[<211>  559]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  101]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro 
                      500                 505                 510         
          Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu 
                  515                 520                 525             
          Ser Thr Phe Leu Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val 
              530                 535                 540                 
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
          545                 550                 555                 
          <![CDATA[<210>  102]]>
          <![CDATA[<211>  1856]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  102]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc     1620
          aucagcuucg gcggaggcag cgccaucggc ggcuacaucc ccgaggcccc uagagacggc     1680
          caggccuacg ugcggaagga cggcgagugg gugcugcuga gcaccuuccu gggcaaguga     1740
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc     1800
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc         1856
          <![CDATA[<210>  103]]>
          <![CDATA[<211>  1680]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  103]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc     1560
          gccaucagcu ucggcggagg cagcgccauc ggcggcuaca uccccgaggc cccuagagac     1620
          ggccaggccu acgugcggaa ggacggcgag ugggugcugc ugagcaccuu ccugggcaag     1680
          <![CDATA[<210>  104]]>
          <![CDATA[<211>  560]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  104]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser 
                      500                 505                 510         
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Gly Gly Gly Ser 
                  515                 520                 525             
          Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr 
              530                 535                 540                 
          Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Lys 
          545                 550                 555                 560 
          <![CDATA[<210>  105]]>
          <![CDATA[<211>  1826]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  105]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          auccuggcca ucuacagccu gggcuucauc gccggccuga ucgccaucgu gauggugacc     1620
          aucaugcugu gcugcaugac cagcugcugc agcugccuga agggcuguug cagcugcggc     1680
          agcugcugca aguucgacga ggacgacuga uaauaggcug gagccucggu ggccuagcuu     1740
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu     1800
          cuuugaauaa agucugagug ggcggc                                          1826
          <![CDATA[<210>  106]]>
          <![CDATA[<211>  1650]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  106]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcauccugg ccaucuacag ccugggcuuc aucgccggcc ugaucgccau cgugauggug     1560
          accaucaugc ugugcugcau gaccagcugc ugcagcugcc ugaagggcug uugcagcugc     1620
          ggcagcugcu gcaaguucga cgaggacgac                                      1650
          <![CDATA[<210>  107]]>
          <![CDATA[<211>  550]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  107]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Leu Gly Phe Ile Ala 
                      500                 505                 510         
          Gly Leu Ile Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr 
                  515                 520                 525             
          Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys 
              530                 535                 540                 
          Lys Phe Asp Glu Asp Asp 
          545                 550 
          <![CDATA[<210>  108]]>
          <![CDATA[<211>  1826]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  108]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc     1560
          agcagcaucg ccagcuucuu cuucaucauc gggcugauca ucggccucuu ccuggugcug     1620
          cgggugggca uccaccugug caucaagcug aagcacacca agaagagaca gaucuacacc     1680
          gacaucgaga ugaaccggcu gggcaaguga uaauaggcug gagccucggu ggccuagcuu     1740
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu     1800
          cuuugaauaa agucugagug ggcggc                                          1826
          <![CDATA[<210>  109]]>
          <![CDATA[<211>  1650]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  109]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          agcagcagca ucgccagcuu cuucuucauc aucgggcuga ucaucggccu cuuccuggug     1560
          cugcgggugg gcauccaccu gugcaucaag cugaagcaca ccaagaagag acagaucuac     1620
          accgacaucg agaugaaccg gcugggcaag                                      1650
          <![CDATA[<210>  110]]>
          <![CDATA[<211>  550]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  110]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Ser Ser Ser Ile Ala Ser Phe Phe Phe Ile Ile Gly 
                      500                 505                 510         
          Leu Ile Ile Gly Leu Phe Leu Val Leu Arg Val Gly Ile His Leu Cys 
                  515                 520                 525             
          Ile Lys Leu Lys His Thr Lys Lys Arg Gln Ile Tyr Thr Asp Ile Glu 
              530                 535                 540                 
          Met Asn Arg Leu Gly Lys 
          545                 550 
          <![CDATA[<210>  111]]>
          <![CDATA[<211>  1775]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  111]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc      960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg     1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc     1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc     1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc     1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug     1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac     1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac     1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag     1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug     1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcggc     1560
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag     1620
          gacggcgagu gggugcugcu gagcaccuuc cugggcugau aauaggcugg agccucggug     1680
          gccuagcuuc uugccccuug ggccuccccc cagccccucc uccccuuccu gcacccguac     1740
          ccccgugguc uuugaauaaa gucugagugg gcggc                                1775
          <![CDATA[<210>  112]]>
          <![CDATA[<211>  1599]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  112]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc      900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc      960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc     1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc     1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc     1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc     1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug     1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc     1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug     1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug     1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc     1500
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg     1560
          aaggacggcg agugggugcu gcugagcacc uuccugggc                            1599
          <![CDATA[<210>  113]]>
          <![CDATA[<211>  533]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  113]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys 
              290                 295                 300                 
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
          305                 310                 315                 320 
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                          325                 330                 335     
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
                      340                 345                 350         
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
                  355                 360                 365             
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
              370                 375                 380                 
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
          385                 390                 395                 400 
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                          405                 410                 415     
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
                      420                 425                 430         
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
                  435                 440                 445             
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
              450                 455                 460                 
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
          465                 470                 475                 480 
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                          485                 490                 495     
          Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro 
                      500                 505                 510         
          Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu 
                  515                 520                 525             
          Ser Thr Phe Leu Gly 
              530             
          <![CDATA[<210>  114]]>
          <![CDATA[<211>  1973]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  114]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggaca gcgggugcag      960
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1020
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1080
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1140
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1200
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1260
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1320
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1380
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1440
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1500
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1560
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1620
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1680
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1740
          cuggagaucc uggacaucac cccuugcagc ucuggcggag gcagcauccu ggccaucuac     1800
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa     1860
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccuccuc     1920
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc            1973
          <![CDATA[<210>  115]]>
          <![CDATA[<211>  1797]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  115]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acagcgggug      900
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag      960
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1020
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1080
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1140
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1200
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1260
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1320
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1380
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1440
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1500
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1560
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1620
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1680
          acccuggaga uccuggacau caccccuugc agcucuggcg gaggcagcau ccuggccauc     1740
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc        1797
          <![CDATA[<210>  116]]>
          <![CDATA[<211>  599]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  116]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Gly Ser Gly Gly Gly Gln Arg Val Gln Pro Thr Glu 
              290                 295                 300                 
          Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
          305                 310                 315                 320 
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                          325                 330                 335     
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
                      340                 345                 350         
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
                  355                 360                 365             
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
              370                 375                 380                 
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
          385                 390                 395                 400 
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                          405                 410                 415     
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
                      420                 425                 430         
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
                  435                 440                 445             
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
              450                 455                 460                 
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
          465                 470                 475                 480 
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                          485                 490                 495     
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn 
                      500                 505                 510         
          Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly 
                  515                 520                 525             
          Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln 
              530                 535                 540                 
          Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln 
          545                 550                 555                 560 
          Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser 
                          565                 570                 575     
          Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val 
                      580                 585                 590         
          Ser Leu Gly Ala Ile Ser Phe 
                  595                 
          <![CDATA[<210>  117]]>
          <![CDATA[<211>  1799]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  117]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgg aggcggaucg ggaggcggac ccaacaucac caaccugugc     1020
          cccuucggcg agguguucaa cgccacccgg uucgccagcg uguacgccug gaaccggaag     1080
          cggaucagca acugcguggc cgacuacagc gugcuguaca acagcgccag cuucagcacc     1140
          uucaagugcu acggcgugag ccccaccaag cugaacgacc ugugcuucac caacguguac     1200
          gccgacagcu ucgugauccg uggcgacgag gugcggcaga ucgcacccgg ccagacaggc     1260
          aagaucgccg acuacaacua caagcugccc gacgacuuca ccggcugcgu gaucgccugg     1320
          aacagcaaca accucgacag caaggugggc ggcaacuaca acuaccugua ccggcuguuc     1380
          cggaagagca accugaagcc cuucgagcgg gacaucagca ccgagaucua ccaagccggc     1440
          uccaccccuu gcaacggcgu ggagggcuuc aacugcuacu ucccucugca gagcuacggc     1500
          uuccagccca ccaacggcgu gggcuaccag cccuaccggg ugguggugcu gagcuucgag     1560
          cugcugcacg ccccagccac cguguguggc cccaagucug gcggaggcag cauccuggcc     1620
          aucuacagca ccguggccag cagccuggug cugcugguga gccugggcgc caucagcuuc     1680
          ugauaauagg cuggagccuc gguggccuag cuucuugccc cuugggccuc cccccagccc     1740
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc      1799
          <![CDATA[<210>  118]]>
          <![CDATA[<211>  1623]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  118]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cggaggcgga ucgggaggcg gacccaacau caccaaccug      960
          ugccccuucg gcgagguguu caacgccacc cgguucgcca gcguguacgc cuggaaccgg     1020
          aagcggauca gcaacugcgu ggccgacuac agcgugcugu acaacagcgc cagcuucagc     1080
          accuucaagu gcuacggcgu gagccccacc aagcugaacg accugugcuu caccaacgug     1140
          uacgccgaca gcuucgugau ccguggcgac gaggugcggc agaucgcacc cggccagaca     1200
          ggcaagaucg ccgacuacaa cuacaagcug cccgacgacu ucaccggcug cgugaucgcc     1260
          uggaacagca acaaccucga cagcaaggug ggcggcaacu acaacuaccu guaccggcug     1320
          uuccggaaga gcaaccugaa gcccuucgag cgggacauca gcaccgagau cuaccaagcc     1380
          ggcuccaccc cuugcaacgg cguggagggc uucaacugcu acuucccucu gcagagcuac     1440
          ggcuuccagc ccaccaacgg cgugggcuac cagcccuacc gggugguggu gcugagcuuc     1500
          gagcugcugc acgccccagc caccgugugu ggccccaagu cuggcggagg cagcauccug     1560
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc     1620
          uuc                                                                   1623
          <![CDATA[<210>  119]]>
          <![CDATA[<211>  541]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  119]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu 
          305                 310                 315                 320 
          Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr 
                          325                 330                 335     
          Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val 
                      340                 345                 350         
          Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser 
                  355                 360                 365             
          Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser 
              370                 375                 380                 
          Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr 
          385                 390                 395                 400 
          Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly 
                          405                 410                 415     
          Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly 
                      420                 425                 430         
          Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro 
                  435                 440                 445             
          Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro 
              450                 455                 460                 
          Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr 
          465                 470                 475                 480 
          Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val 
                          485                 490                 495     
          Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro 
                      500                 505                 510         
          Lys Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser 
                  515                 520                 525             
          Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe 
              530                 535                 540     
          <![CDATA[<210>  120]]>
          <![CDATA[<211>  2024]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  120]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgg aggcggaucg ggaggcggac agcgggugca gcccaccgag     1020
          agcaucgugc gguuccccaa caucaccaac cugugccccu ucggcgaggu guucaacgcc     1080
          acccgguucg ccagcgugua cgccuggaac cggaagcgga ucagcaacug cguggccgac     1140
          uacagcgugc uguacaacag cgccagcuuc agcaccuuca agugcuacgg cgugagcccc     1200
          accaagcuga acgaccugug cuucaccaac guguacgccg acagcuucgu gauccguggc     1260
          gacgaggugc ggcagaucgc acccggccag acaggcaaga ucgccgacua caacuacaag     1320
          cugcccgacg acuucaccgg cugcgugauc gccuggaaca gcaacaaccu cgacagcaag     1380
          gugggcggca acuacaacua ccuguaccgg cuguuccgga agagcaaccu gaagcccuuc     1440
          gagcgggaca ucagcaccga gaucuaccaa gccggcucca ccccuugcaa cggcguggag     1500
          ggcuucaacu gcuacuuccc ucugcagagc uacggcuucc agcccaccaa cggcgugggc     1560
          uaccagcccu accggguggu ggugcugagc uucgagcugc ugcacgcccc agccaccgug     1620
          uguggcccca agaagagcac caaccuggug aagaacaagu gcgugaacuu caacuucaac     1680
          ggccuuaccg gcaccggcgu gcugaccgag agcaacaaga aauuccugcc cuuucagcag     1740
          uucggccggg acaucgccga caccaccgac gcugugcggg auccccagac ccuggagauc     1800
          cuggacauca ccccuugcag cucuggcgga ggcagcaucc uggccaucua cagcaccgug     1860
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuucugaua auaggcugga     1920
          gccucggugg ccuagcuucu ugccccuugg gccucccccc agccccuccu ccccuuccug     1980
          cacccguacc cccguggucu uugaauaaag ucugaguggg cggc                      2024
          <![CDATA[<210>  121]]>
          <![CDATA[<211>  1848]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  121]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cggaggcgga ucgggaggcg gacagcgggu gcagcccacc      960
          gagagcaucg ugcgguuccc caacaucacc aaccugugcc ccuucggcga gguguucaac     1020
          gccacccggu ucgccagcgu guacgccugg aaccggaagc ggaucagcaa cugcguggcc     1080
          gacuacagcg ugcuguacaa cagcgccagc uucagcaccu ucaagugcua cggcgugagc     1140
          cccaccaagc ugaacgaccu gugcuucacc aacguguacg ccgacagcuu cgugauccgu     1200
          ggcgacgagg ugcggcagau cgcacccggc cagacaggca agaucgccga cuacaacuac     1260
          aagcugcccg acgacuucac cggcugcgug aucgccugga acagcaacaa ccucgacagc     1320
          aaggugggcg gcaacuacaa cuaccuguac cggcuguucc ggaagagcaa ccugaagccc     1380
          uucgagcggg acaucagcac cgagaucuac caagccggcu ccaccccuug caacggcgug     1440
          gagggcuuca acugcuacuu cccucugcag agcuacggcu uccagcccac caacggcgug     1500
          ggcuaccagc ccuaccgggu gguggugcug agcuucgagc ugcugcacgc cccagccacc     1560
          guguguggcc ccaagaagag caccaaccug gugaagaaca agugcgugaa cuucaacuuc     1620
          aacggccuua ccggcaccgg cgugcugacc gagagcaaca agaaauuccu gcccuuucag     1680
          caguucggcc gggacaucgc cgacaccacc gacgcugugc gggaucccca gacccuggag     1740
          auccuggaca ucaccccuug cagcucuggc ggaggcagca uccuggccau cuacagcacc     1800
          guggccagca gccuggugcu gcuggugagc cugggcgcca ucagcuuc                  1848
          <![CDATA[<210>  122]]>
          <![CDATA[<211>  616]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  122]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Gly Gly Gly Ser Gly Gly Gly Gln Arg Val Gln Pro Thr 
          305                 310                 315                 320 
          Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly 
                          325                 330                 335     
          Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg 
                      340                 345                 350         
          Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser 
                  355                 360                 365             
          Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu 
              370                 375                 380                 
          Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg 
          385                 390                 395                 400 
          Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala 
                          405                 410                 415     
          Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala 
                      420                 425                 430         
          Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr 
                  435                 440                 445             
          Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp 
              450                 455                 460                 
          Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val 
          465                 470                 475                 480 
          Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro 
                          485                 490                 495     
          Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe 
                      500                 505                 510         
          Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr 
                  515                 520                 525             
          Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr 
              530                 535                 540                 
          Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln 
          545                 550                 555                 560 
          Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro 
                          565                 570                 575     
          Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly 
                      580                 585                 590         
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu 
                  595                 600                 605             
          Val Ser Leu Gly Ala Ile Ser Phe 
              610                 615     
          <![CDATA[<210>  123]]>
          <![CDATA[<211>  3995]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  123]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucaccc     2100
          cggagggcaa ggagcguggc cagccagagc aucaucgccu acaccaugag ccugggcgcc     2160
          gagaacagcg uggccuacag caacaacagc aucgccaucc ccaccaacuu caccaucagc     2220
          gugaccaccg agauucugcc cgugagcaug accaagacca gcguggacug caccauguac     2280
          aucugcggcg acagcaccga gugcagcaac cugcugcugc aguacggcag cuucugcacc     2340
          cagcugaacc gggcccugac cggcaucgcc guggagcagg acaagaacac ccaggaggug     2400
          uucgcccagg ugaagcagau cuacaagacc ccucccauca aggacuucgg cggcuucaac     2460
          uucagccaga uccugcccga ccccagcaag cccagcaagc ggagcuucau cgaggaccug     2520
          cuguucaaca aggugacccu agccgacgcc ggcuucauca agcaguacgg cgacugccuc     2580
          ggcgacauag ccgcccggga ccugaucugc gcccagaagu ucaacggccu gaccgugcug     2640
          ccuccccugc ugaccgacga gaugaucgcc caguacacca gcgcccuguu agccggaacc     2700
          aucaccagcg gcuggacuuu cggcgcugga gccgcucugc agauccccuu cgccaugcag     2760
          auggccuacc gguucaacgg caucggcgug acccagaacg ugcuguacga gaaccagaag     2820
          cugaucgcca accaguucaa cagcgccauc ggcaagaucc aggacagccu gagcagcacc     2880
          gcuagcgccc ugggcaagcu gcaggacgug gugaaccaga acgcccaggc ccugaacacc     2940
          cuggugaagc agcugagcag caacuucggc gccaucagca gcgugcugaa cgacauccug     3000
          agccggcugg acaaggugga ggccgaggug cagaucgacc ggcugaucac uggccggcug     3060
          cagagccugc agaccuacgu gacccagcag cugauccggg ccgccgagau ucgggccagc     3120
          gccaaccugg ccgccaccaa gaugagcgag ugcgugcugg gccagagcaa gcggguggac     3180
          uucugcggca agggcuacca ccugaugagc uuuccccaga gcgcacccca cggaguggug     3240
          uuccugcacg ugaccuacgu gcccgcccag gagaagaacu ucaccaccgc cccagccauc     3300
          ugccacgacg gcaaggccca cuuuccccgg gagggcgugu ucgugagcaa cggcacccac     3360
          ugguucguga cccagcggaa cuucuacgag ccccagauca ucaccaccga caacaccuuc     3420
          gugagcggca acugcgacgu ggugaucggc aucgugaaca acaccgugua cgauccccug     3480
          cagcccgagc uggacagcuu caaggaggag cuggacaagu acuucaagaa ucacaccagc     3540
          cccgacgugg accugggcga caucagcggc aucaacgcca gcguggugaa cauccagaag     3600
          gagaucgauc ggcugaacga gguggccaag aaccugaacg agagccugau cgaccugcag     3660
          gagcugggca aguacgagca guacaucaag uggcccuggu acaucuggcu gggcuucauc     3720
          gccggccuga ucgccaucgu gauggugacc aucaugcugu gcugcaugac cagcugcugc     3780
          agcugccuga agggcuguug cagcugcggc agcugcugca aguucgacga ggacgacagc     3840
          gagcccgugc ugaagggcgu gaagcugcac uacaccugau aauaggcugg agccucggug     3900
          gccuagcuuc uugccccuug ggccuccccc cagccccucc uccccuuccu gcacccguac     3960
          ccccgugguc uuugaauaaa gucugagugg gcggc                                3995
          <![CDATA[<210>  124]]>
          <![CDATA[<211>  3819]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  124]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          ccccggaggg caaggagcgu ggccagccag agcaucaucg ccuacaccau gagccugggc     2100
          gccgagaaca gcguggccua cagcaacaac agcaucgcca uccccaccaa cuucaccauc     2160
          agcgugacca ccgagauucu gcccgugagc augaccaaga ccagcgugga cugcaccaug     2220
          uacaucugcg gcgacagcac cgagugcagc aaccugcugc ugcaguacgg cagcuucugc     2280
          acccagcuga accgggcccu gaccggcauc gccguggagc aggacaagaa cacccaggag     2340
          guguucgccc aggugaagca gaucuacaag accccuccca ucaaggacuu cggcggcuuc     2400
          aacuucagcc agauccugcc cgaccccagc aagcccagca agcggagcuu caucgaggac     2460
          cugcuguuca acaaggugac ccuagccgac gccggcuuca ucaagcagua cggcgacugc     2520
          cucggcgaca uagccgcccg ggaccugauc ugcgcccaga aguucaacgg ccugaccgug     2580
          cugccucccc ugcugaccga cgagaugauc gcccaguaca ccagcgcccu guuagccgga     2640
          accaucacca gcggcuggac uuucggcgcu ggagccgcuc ugcagauccc cuucgccaug     2700
          cagauggccu accgguucaa cggcaucggc gugacccaga acgugcugua cgagaaccag     2760
          aagcugaucg ccaaccaguu caacagcgcc aucggcaaga uccaggacag ccugagcagc     2820
          accgcuagcg cccugggcaa gcugcaggac guggugaacc agaacgccca ggcccugaac     2880
          acccugguga agcagcugag cagcaacuuc ggcgccauca gcagcgugcu gaacgacauc     2940
          cugagccggc uggacaaggu ggaggccgag gugcagaucg accggcugau cacuggccgg     3000
          cugcagagcc ugcagaccua cgugacccag cagcugaucc gggccgccga gauucgggcc     3060
          agcgccaacc uggccgccac caagaugagc gagugcgugc ugggccagag caagcgggug     3120
          gacuucugcg gcaagggcua ccaccugaug agcuuucccc agagcgcacc ccacggagug     3180
          guguuccugc acgugaccua cgugcccgcc caggagaaga acuucaccac cgccccagcc     3240
          aucugccacg acggcaaggc ccacuuuccc cgggagggcg uguucgugag caacggcacc     3300
          cacugguucg ugacccagcg gaacuucuac gagccccaga ucaucaccac cgacaacacc     3360
          uucgugagcg gcaacugcga cguggugauc ggcaucguga acaacaccgu guacgauccc     3420
          cugcagcccg agcuggacag cuucaaggag gagcuggaca aguacuucaa gaaucacacc     3480
          agccccgacg uggaccuggg cgacaucagc ggcaucaacg ccagcguggu gaacauccag     3540
          aaggagaucg aucggcugaa cgagguggcc aagaaccuga acgagagccu gaucgaccug     3600
          caggagcugg gcaaguacga gcaguacauc aaguggcccu gguacaucug gcugggcuuc     3660
          aucgccggcc ugaucgccau cgugauggug accaucaugc ugugcugcau gaccagcugc     3720
          ugcagcugcc ugaagggcug uugcagcugc ggcagcugcu gcaaguucga cgaggacgac     3780
          agcgagcccg ugcugaaggg cgugaagcug cacuacacc                            3819
          <![CDATA[<210>  125]]>
          <![CDATA[<211>  1273]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  125]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
                  675                 680                 685             
          Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser 
              690                 695                 700                 
          Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile 
          705                 710                 715                 720 
          Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val 
                          725                 730                 735     
          Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu 
                      740                 745                 750         
          Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr 
                  755                 760                 765             
          Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln 
              770                 775                 780                 
          Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe 
          785                 790                 795                 800 
          Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser 
                          805                 810                 815     
          Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly 
                      820                 825                 830         
          Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp 
                  835                 840                 845             
          Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu 
              850                 855                 860                 
          Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly 
          865                 870                 875                 880 
          Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile 
                          885                 890                 895     
          Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr 
                      900                 905                 910         
          Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn 
                  915                 920                 925             
          Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala 
              930                 935                 940                 
          Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn 
          945                 950                 955                 960 
          Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val 
                          965                 970                 975     
          Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln 
                      980                 985                 990         
          Ile Asp Arg Leu Ile Thr Gly Arg  Leu Gln Ser Leu Gln  Thr Tyr Val 
                  995                 1000                 1005             
          Thr Gln  Gln Leu Ile Arg Ala  Ala Glu Ile Arg Ala  Ser Ala Asn 
              1010                 1015                 1020             
          Leu Ala  Ala Thr Lys Met Ser  Glu Cys Val Leu Gly  Gln Ser Lys 
              1025                 1030                 1035             
          Arg Val  Asp Phe Cys Gly Lys  Gly Tyr His Leu Met  Ser Phe Pro 
              1040                 1045                 1050             
          Gln Ser  Ala Pro His Gly Val  Val Phe Leu His Val  Thr Tyr Val 
              1055                 1060                 1065             
          Pro Ala  Gln Glu Lys Asn Phe  Thr Thr Ala Pro Ala  Ile Cys His 
              1070                 1075                 1080             
          Asp Gly  Lys Ala His Phe Pro  Arg Glu Gly Val Phe  Val Ser Asn 
              1085                 1090                 1095             
          Gly Thr  His Trp Phe Val Thr  Gln Arg Asn Phe Tyr  Glu Pro Gln 
              1100                 1105                 1110             
          Ile Ile  Thr Thr Asp Asn Thr  Phe Val Ser Gly Asn  Cys Asp Val 
              1115                 1120                 1125             
          Val Ile  Gly Ile Val Asn Asn  Thr Val Tyr Asp Pro  Leu Gln Pro 
              1130                 1135                 1140             
          Glu Leu  Asp Ser Phe Lys Glu  Glu Leu Asp Lys Tyr  Phe Lys Asn 
              1145                 1150                 1155             
          His Thr  Ser Pro Asp Val Asp  Leu Gly Asp Ile Ser  Gly Ile Asn 
              1160                 1165                 1170             
          Ala Ser  Val Val Asn Ile Gln  Lys Glu Ile Asp Arg  Leu Asn Glu 
              1175                 1180                 1185             
          Val Ala  Lys Asn Leu Asn Glu  Ser Leu Ile Asp Leu  Gln Glu Leu 
              1190                 1195                 1200             
          Gly Lys  Tyr Glu Gln Tyr Ile  Lys Trp Pro Trp Tyr  Ile Trp Leu 
              1205                 1210                 1215             
          Gly Phe  Ile Ala Gly Leu Ile  Ala Ile Val Met Val  Thr Ile Met 
              1220                 1225                 1230             
          Leu Cys  Cys Met Thr Ser Cys  Cys Ser Cys Leu Lys  Gly Cys Cys 
              1235                 1240                 1245             
          Ser Cys  Gly Ser Cys Cys Lys  Phe Asp Glu Asp Asp  Ser Glu Pro 
              1250                 1255                 1260             
          Val Leu  Lys Gly Val Lys Leu  His Tyr Thr 
              1265                 1270             
          <![CDATA[<210>  126]]>
          <![CDATA[<211>  8]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  126]]>
          Lys Pro Ser Lys Arg Ser Phe Ile 
          1               5               
          <![CDATA[<210>  127]]>
          <![CDATA[<211>  13]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  127]]>
          Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala 
          1               5                   10              
          <![CDATA[<210>  128]]>
          <![CDATA[<211>  9]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  128]]>
          ccrccaugg                                                                9
          <![CDATA[<210>  129]]>
          <![CDATA[<211>  11]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  129]]>
          gggauccuac c                                                            11
          <![CDATA[<210>  130]]>
          <![CDATA[<211>  9]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  130]]>
          uuauuuaww                                                                9
          <![CDATA[<210>  131]]>
          <![CDATA[<211>  47]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  131]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag agccacc                     47
          <![CDATA[<210>  132]]>
          <![CDATA[<211>  119]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  132]]>
          ugauaauagg cuggagccuc gguggccaug cuucuugccc cuugggccuc cccccagccc       60
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc       119
          <![CDATA[<210>  133]]>
          <![CDATA[<211>  15]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  133]]>
          Gly Gly Ser Gly Gly Ser Gly Gly Ser Gly Gly Ser Gly Gly Gly 
          1               5                   10                  15  
          <![CDATA[<210>  134]]>
          <![CDATA[<211>  8]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  134]]>
          Gly Gly Gly Ser Gly Gly Gly Ser 
          1               5               
          <![CDATA[<210>  135]]>
          <![CDATA[<211>  7]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  135]]>
          Gly Gly Gly Ser Gly Gly Gly 
          1               5           
          <![CDATA[<210>  136]]>
          <![CDATA[<211>  4]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  136]]>
          Gly Gly Gly Ser 
          1               
          <![CDATA[<210>  137]]>
          <![CDATA[<211>  4]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  137]]>
          Ser Gly Gly Ser 
          1               
          <![CDATA[<210>  138]]>
          <![CDATA[<211>  5]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  138]]>
          Ser Gly Gly Gly Ser 
          1               5   
          <![CDATA[<210>  139]]>
          <![CDATA[<211>  1602]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  139]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc      120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug      180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac      240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg      300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac      360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac      420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc      480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc      540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac      600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag      660
          ucuggcggag gcagcggagg cggaagccag ugcgugaacc ugaccacccg gacccagcug      720
          ccaccagccu acaccaacag cuucacccgg ggcgucuacu accccgacaa gguguuccgg      780
          agcagcgucc ugcacagcac ccaggaccug uuccugcccu ucuucagcaa cgugaccugg      840
          uuccacgcca uccacgugag cggcaccaac ggcaccaagc gguucgacaa ccccgugcug      900
          cccuucaacg acggcgugua cuucgccagc accgagaaga gcaacaucau ccggggcugg      960
          aucuucggca ccacccugga cagcaagacc cagagccugc ugaucgugaa uaacgccacc     1020
          aacgugguga ucaaggugug cgaguuccag uucugcaacg accccuuccu gggcguguac     1080
          uaccacaaga acaacaagag cuggauggag agcgaguucc ggguguacag cagcgccaac     1140
          aacugcaccu ucgaguacgu gagccagccc uuccugaugg accuggaggg caagcagggc     1200
          aacuucaaga accugcggga guucguguuc aagaacaucg acggcuacuu caagaucuac     1260
          agcaagcaca ccccaaucaa ccuggugcgg gaucugcccc agggcuucuc agcccuggag     1320
          ccccuggugg accugcccau cggcaucaac aucacccggu uccagacccu gcuggcccug     1380
          caccggagcu accugacccc aggcgacagc agcagcgggu ggacagcagg cgcggcugcu     1440
          uacuacgugg gcuaccugca gccccggacc uuccugcuga aguacaacga gaacggcacc     1500
          aucaccgacg ccguggacuc uggcggaggc agcauccugg ccaucuacag caccguggcc     1560
          agcagccugg ugcugcuggu gagccugggc gccaucagcu uc                        1602
          <![CDATA[<210>  140]]>
          <![CDATA[<211>  534]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  140]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu 
                      20                  25                  30          
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys 
                  35                  40                  45              
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala 
              50                  55                  60                  
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn 
          65                  70                  75                  80  
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly 
                          85                  90                  95      
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp 
                      100                 105                 110         
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp 
                  115                 120                 125             
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu 
              130                 135                 140                 
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile 
          145                 150                 155                 160 
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu 
                          165                 170                 175     
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr 
                      180                 185                 190         
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu 
                  195                 200                 205             
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly 
              210                 215                 220                 
          Ser Gly Gly Gly Ser Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu 
          225                 230                 235                 240 
          Pro Pro Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp 
                          245                 250                 255     
          Lys Val Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu 
                      260                 265                 270         
          Pro Phe Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly 
                  275                 280                 285             
          Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp 
              290                 295                 300                 
          Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp 
          305                 310                 315                 320 
          Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val 
                          325                 330                 335     
          Asn Asn Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys 
                      340                 345                 350         
          Asn Asp Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp 
                  355                 360                 365             
          Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe 
              370                 375                 380                 
          Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly 
          385                 390                 395                 400 
          Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr 
                          405                 410                 415     
          Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu 
                      420                 425                 430         
          Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly 
                  435                 440                 445             
          Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr 
              450                 455                 460                 
          Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala 
          465                 470                 475                 480 
          Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn 
                          485                 490                 495     
          Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Ser Gly Gly Gly Ser Ile 
                      500                 505                 510         
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser 
                  515                 520                 525             
          Leu Gly Ala Ile Ser Phe 
              530                 
          <![CDATA[<210>  141]]>
          <![CDATA[<211>  1778]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  141]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc      180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac      240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac      300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag      360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc      420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac      480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc      540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac      600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg      660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu      720
          ggcggaggca gcggaggcgg aagccagugc gugaaccuga ccacccggac ccagcugcca      780
          ccagccuaca ccaacagcuu cacccggggc gucuacuacc ccgacaaggu guuccggagc      840
          agcguccugc acagcaccca ggaccuguuc cugcccuucu ucagcaacgu gaccugguuc      900
          cacgccaucc acgugagcgg caccaacggc accaagcggu ucgacaaccc cgugcugccc      960
          uucaacgacg gcguguacuu cgccagcacc gagaagagca acaucauccg gggcuggauc     1020
          uucggcacca cccuggacag caagacccag agccugcuga ucgugaauaa cgccaccaac     1080
          guggugauca aggugugcga guuccaguuc ugcaacgacc ccuuccuggg cguguacuac     1140
          cacaagaaca acaagagcug gauggagagc gaguuccggg uguacagcag cgccaacaac     1200
          ugcaccuucg aguacgugag ccagcccuuc cugauggacc uggagggcaa gcagggcaac     1260
          uucaagaacc ugcgggaguu cguguucaag aacaucgacg gcuacuucaa gaucuacagc     1320
          aagcacaccc caaucaaccu ggugcgggau cugccccagg gcuucucagc ccuggagccc     1380
          cugguggacc ugcccaucgg caucaacauc acccgguucc agacccugcu ggcccugcac     1440
          cggagcuacc ugaccccagg cgacagcagc agcgggugga cagcaggcgc ggcugcuuac     1500
          uacgugggcu accugcagcc ccggaccuuc cugcugaagu acaacgagaa cggcaccauc     1560
          accgacgccg uggacucugg cggaggcagc auccuggcca ucuacagcac cguggccagc     1620
          agccuggugc ugcuggugag ccugggcgcc aucagcuucu gauaauaggc uggagccucg     1680
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg     1740
          uacccccgug gucuuugaau aaagucugag ugggcggc                             1778
          <![CDATA[<210>  142]]>
          <![CDATA[<211>  1602]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  142]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ggaggcgcca aguucguggc cgccuggacu      900
          cugaaggccg cagccggcgg acccaacauc accaaccugu gccccuucgg cgagguguuc      960
          aacgccaccc gguucgccag cguguacgcc uggaaccgga agcggaucag caacugcgug     1020
          gccgacuaca gcgugcugua caacagcgcc agcuucagca ccuucaagug cuacggcgug     1080
          agccccacca agcugaacga ccugugcuuc accaacgugu acgccgacag cuucgugauc     1140
          cguggcgacg aggugcggca gaucgcaccc ggccagacag gcaagaucgc cgacuacaac     1200
          uacaagcugc ccgacgacuu caccggcugc gugaucgccu ggaacagcaa caaccucgac     1260
          agcaaggugg gcggcaacua caacuaccug uaccggcugu uccggaagag caaccugaag     1320
          cccuucgagc gggacaucag caccgagauc uaccaagccg gcuccacccc uugcaacggc     1380
          guggagggcu ucaacugcua cuucccucug cagagcuacg gcuuccagcc caccaacggc     1440
          gugggcuacc agcccuaccg ggugguggug cugagcuucg agcugcugca cgccccagcc     1500
          accgugugug gccccaaguc uggcggaggc agcauccugg ccaucuacag caccguggcc     1560
          agcagccugg ugcugcuggu gagccugggc gccaucagcu uc                        1602
          <![CDATA[<210>  143]]>
          <![CDATA[<211>  534]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  143]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Gly Gly Ala Lys Phe Val Ala Ala Trp Thr Leu Lys Ala Ala 
              290                 295                 300                 
          Ala Gly Gly Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe 
          305                 310                 315                 320 
          Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile 
                          325                 330                 335     
          Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe 
                      340                 345                 350         
          Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu 
                  355                 360                 365             
          Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu 
              370                 375                 380                 
          Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn 
          385                 390                 395                 400 
          Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser 
                          405                 410                 415     
          Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg 
                      420                 425                 430         
          Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr 
                  435                 440                 445             
          Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe 
              450                 455                 460                 
          Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly 
          465                 470                 475                 480 
          Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu 
                          485                 490                 495     
          His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly Ser Ile 
                      500                 505                 510         
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser 
                  515                 520                 525             
          Leu Gly Ala Ile Ser Phe 
              530                 
          <![CDATA[<210>  144]]>
          <![CDATA[<211>  1778]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  144]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacgga ggcgccaagu ucguggccgc cuggacucug      960
          aaggccgcag ccggcggacc caacaucacc aaccugugcc ccuucggcga gguguucaac     1020
          gccacccggu ucgccagcgu guacgccugg aaccggaagc ggaucagcaa cugcguggcc     1080
          gacuacagcg ugcuguacaa cagcgccagc uucagcaccu ucaagugcua cggcgugagc     1140
          cccaccaagc ugaacgaccu gugcuucacc aacguguacg ccgacagcuu cgugauccgu     1200
          ggcgacgagg ugcggcagau cgcacccggc cagacaggca agaucgccga cuacaacuac     1260
          aagcugcccg acgacuucac cggcugcgug aucgccugga acagcaacaa ccucgacagc     1320
          aaggugggcg gcaacuacaa cuaccuguac cggcuguucc ggaagagcaa ccugaagccc     1380
          uucgagcggg acaucagcac cgagaucuac caagccggcu ccaccccuug caacggcgug     1440
          gagggcuuca acugcuacuu cccucugcag agcuacggcu uccagcccac caacggcgug     1500
          ggcuaccagc ccuaccgggu gguggugcug agcuucgagc ugcugcacgc cccagccacc     1560
          guguguggcc ccaagucugg cggaggcagc auccuggcca ucuacagcac cguggccagc     1620
          agccuggugc ugcuggugag ccugggcgcc aucagcuucu gauaauaggc uggagccucg     1680
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg     1740
          uacccccgug gucuuugaau aaagucugag ugggcggc                             1778
          <![CDATA[<210>  145]]>
          <![CDATA[<211>  1785]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  145]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc       60
          cagggcgccg agaacagcgu ggccuacagc aacaacagca ucgccauccc caccaacuuc      120
          accaucagcg ugaccaccga gauucugccc gugagcauga ccaagaccag cguggacugc      180
          accauguaca ucugcggcga cagcaccgag ugcagcaacc ugcugcugca guacggcagc      240
          uucugcaccc agcugaaccg ggcccugacc ggcaucgccg uggagcagga caagaacacc      300
          caggaggugu ucgcccaggu gaagcagauc uacaagaccc cucccaucaa ggacuucggc      360
          ggcuucaacu ucagccagau ccugcccgac cccagcaagc ccagcaagcg gagcuucauc      420
          gaggaccugc uguucaacaa ggugacccua gccgacgccg gcuucaucaa gcaguacggc      480
          gacugccucg gcgacauagc cgcccgggac cugaucugcg cccagaaguu caacggccug      540
          accgugcugc cuccccugcu gaccgacgag augaucgccc aguacaccag cgcccuguua      600
          gccggaacca ucaccagcgg cuggacuuuc ggcgcuggag ccgcucugca gauccccuuc      660
          gccaugcaga uggccuaccg guucaacggc aucggcguga cccagaacgu gcuguacgag      720
          aaccagaagc ugaucgccaa ccaguucaac agcgccaucg gcaagaucca ggacagccug      780
          agcagcaccg cuagcgcccu gggcaagcug caggacgugg ugaaccagaa cgcccaggcc      840
          cugaacaccc uggugaagca gcugagcagc aacuucggcg ccaucagcag cgugcugaac      900
          gacauccuga gccggcugga cccucccgag gccgaggugc agaucgaccg gcugaucacu      960
          ggccggcugc agagccugca gaccuacgug acccagcagc ugauccgggc cgccgagauu     1020
          cgggccagcg ccaaccuggc cgccaccaag augagcgagu gcgugcuggg ccagagcaag     1080
          cggguggacu ucugcggcaa gggcuaccac cugaugagcu uuccccagag cgcaccccac     1140
          ggaguggugu uccugcacgu gaccuacgug cccgcccagg agaagaacuu caccaccgcc     1200
          ccagccaucu gccacgacgg caaggcccac uuuccccggg agggcguguu cgugagcaac     1260
          ggcacccacu gguucgugac ccagcggaac uucuacgagc cccagaucau caccaccgac     1320
          aacaccuucg ugagcggcaa cugcgacgug gugaucggca ucgugaacaa caccguguac     1380
          gauccccugc agcccgagcu ggacagcuuc aaggaggagc uggacaagua cuucaagaau     1440
          cacaccagcc ccgacgugga ccugggcgac aucagcggca ucaacgccag cguggugaac     1500
          auccagaagg agaucgaucg gcugaacgag guggccaaga accugaacga gagccugauc     1560
          gaccugcagg agcugggcaa guacgagcag uacaucaagu ggcccuggua caucuggcug     1620
          ggcuucaucg ccggccugau cgccaucgug auggugacca ucaugcugug cugcaugacc     1680
          agcugcugca gcugccugaa gggcuguugc agcugcggca gcugcugcaa guucgacgag     1740
          gacgacagcg agcccgugcu gaagggcgug aagcugcacu acacc                     1785
          <![CDATA[<210>  146]]>
          <![CDATA[<211>  595]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  146]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu 
          1               5                   10                  15      
          Leu Val Asn Ser Gln Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn 
                      20                  25                  30          
          Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile 
                  35                  40                  45              
          Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile 
              50                  55                  60                  
          Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser 
          65                  70                  75                  80  
          Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln 
                          85                  90                  95      
          Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys 
                      100                 105                 110         
          Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu 
                  115                 120                 125             
          Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu 
              130                 135                 140                 
          Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly 
          145                 150                 155                 160 
          Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys 
                          165                 170                 175     
          Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile 
                      180                 185                 190         
          Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp 
                  195                 200                 205             
          Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met 
              210                 215                 220                 
          Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu 
          225                 230                 235                 240 
          Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile 
                          245                 250                 255     
          Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp 
                      260                 265                 270         
          Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu 
                  275                 280                 285             
          Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser 
              290                 295                 300                 
          Arg Leu Asp Pro Pro Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr 
          305                 310                 315                 320 
          Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg 
                          325                 330                 335     
          Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser 
                      340                 345                 350         
          Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly 
                  355                 360                 365             
          Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe 
              370                 375                 380                 
          Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala 
          385                 390                 395                 400 
          Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val 
                          405                 410                 415     
          Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr 
                      420                 425                 430         
          Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys 
                  435                 440                 445             
          Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln 
              450                 455                 460                 
          Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn 
          465                 470                 475                 480 
          His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala 
                          485                 490                 495     
          Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala 
                      500                 505                 510         
          Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr 
                  515                 520                 525             
          Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala 
              530                 535                 540                 
          Gly Leu Ile Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr 
          545                 550                 555                 560 
          Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys 
                          565                 570                 575     
          Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys Gly Val Lys Leu 
                      580                 585                 590         
          His Tyr Thr 
                  595 
          <![CDATA[<210>  147]]>
          <![CDATA[<211>  1961]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  147]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag      120
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc      180
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc      240
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugcagua cggcagcuuc      300
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag      360
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc      420
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag      480
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac      540
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc      600
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc      660
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc      720
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac      780
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc      840
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug      900
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac      960
          auccugagcc ggcuggaccc ucccgaggcc gaggugcaga ucgaccggcu gaucacuggc     1020
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg     1080
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg     1140
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga     1200
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca     1260
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc     1320
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac     1380
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau     1440
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac     1500
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc     1560
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac     1620
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc     1680
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc     1740
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac     1800
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca ccugauaaua ggcuggagcc     1860
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac     1920
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c                         1961
          <![CDATA[<210>  148]]>
          <![CDATA[<211>  13]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  148]]>
          Ala Lys Phe Val Ala Ala Trp Thr Leu Lys Ala Ala Ala 
          1               5                   10              
          <![CDATA[<210>  149]]>
          <![CDATA[<211>  2216]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  149]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug       60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg      120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag      180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac      240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac      300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc      360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau      420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug      480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc      540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc      600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc      660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca      720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug      780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc      840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag      900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc      960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag     1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug     1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc     1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc     1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug     1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac     1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc     1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug     1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac     1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac     1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca     1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc     1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc     1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc     1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc     1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc     1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac     1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag     2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucauga     2100
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc     2160
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc         2216
          <![CDATA[<210>  150]]>
          <![CDATA[<211>  2040]]>
          <![CDATA[<212>  RNA]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  150]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc       60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac      120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc      180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac      240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc      300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug      360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc      420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac      480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag      540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac      600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc      660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc      720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca      780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac      840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag      900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug      960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag     1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac     1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac     1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc     1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac     1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac     1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac     1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc     1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc     1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc     1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac     1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug     1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag     1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca     1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug     1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc     1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac     1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca     2040
          <![CDATA[<210>  151]]>
          <![CDATA[<211>  680]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  151]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val 
          1               5                   10                  15      
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe 
                      20                  25                  30          
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu 
                  35                  40                  45              
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp 
              50                  55                  60                  
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp 
          65                  70                  75                  80  
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu 
                          85                  90                  95      
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser 
                      100                 105                 110         
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile 
                  115                 120                 125             
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr 
              130                 135                 140                 
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr 
          145                 150                 155                 160 
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu 
                          165                 170                 175     
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe 
                      180                 185                 190         
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr 
                  195                 200                 205             
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu 
              210                 215                 220                 
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr 
          225                 230                 235                 240 
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser 
                          245                 250                 255     
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro 
                      260                 265                 270         
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala 
                  275                 280                 285             
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys 
              290                 295                 300                 
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val 
          305                 310                 315                 320 
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys 
                          325                 330                 335     
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala 
                      340                 345                 350         
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu 
                  355                 360                 365             
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro 
              370                 375                 380                 
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe 
          385                 390                 395                 400 
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly 
                          405                 410                 415     
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys 
                      420                 425                 430         
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn 
                  435                 440                 445             
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe 
              450                 455                 460                 
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys 
          465                 470                 475                 480 
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly 
                          485                 490                 495     
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val 
                      500                 505                 510         
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys 
                  515                 520                 525             
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn 
              530                 535                 540                 
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu 
          545                 550                 555                 560 
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val 
                          565                 570                 575     
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe 
                      580                 585                 590         
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val 
                  595                 600                 605             
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile 
              610                 615                 620                 
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser 
          625                 630                 635                 640 
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val 
                          645                 650                 655     
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala 
                      660                 665                 670         
          Ser Tyr Gln Thr Gln Thr Asn Ser 
                  675                 680 
          <![CDATA[<210>  152]]>
          <![CDATA[<211>  4]]>
          <![CDATA[<212>  PRT]]>
          <![CDATA[<213>  人工序列(Artificial Sequence)]]>
          <![CDATA[<220>]]>
          <![CDATA[<223>  合成]]>
          <![CDATA[<400>  152]]>
          Gly Ser Gly Gly
          1               
                          
           <![CDATA[ <110> ModernaTX, Inc.]]>
           <![CDATA[ <120> SARS-COV-2 mRNA domain vaccine]]>
           <![CDATA[ <130> M1378.70157]]>
           <![CDATA[ <140>TW 110123289]]>
           <![CDATA[ <141> 2021-06-25]]>
           <![CDATA[ <150> US 63/063,137]]>
           <![CDATA[ <151> 2020-08-07]]>
           <![CDATA[ <150> US 63/044,330]]>
           <![CDATA[ <151> 2020-06-25]]>
           <![CDATA[ <160> 152]]>
           <![CDATA[ <170> PatentIn Version 3.5]]>
           <![CDATA[ <210> 1]]>
           <![CDATA[ <211> 2216]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 1]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucauga 2100
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc 2160
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc 2216
           <![CDATA[ <210> 2]]>
           <![CDATA[ <211> 57]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 2]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccacc 57
           <![CDATA[ <210> 3]]>
           <![CDATA[ <211> 2040]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 3]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
           <![CDATA[ <210> 4]]>
           <![CDATA[ <211> 119]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 4]]>
          ugauaauagg cuggagccuc gguggccuag cuucuugccc cuugggccuc cccccagccc 60
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc 119
           <![CDATA[ <210> 5]]>
           <![CDATA[ <211> 680]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 5]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser
                  675 680
           <![CDATA[ <210> 6]]>
           <![CDATA[ <211> 2729]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 6]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucagga 2100
          ggaggcagcg gcggcgauau caucaagcuu cugaacgagc aaguuaacaa ggaaaugcag 2160
          agcaguaauc ucuacaugag caugagcagc uggugcuaca cccacucccu ggacggagca 2220
          ggccucuucc uguucgacca cgcagccgag gaguacgagc acgcuaagaa guugaucauu 2280
          uucuugaacg agaacaacgu gcccgugcag cuaacgucaa ucagcgcacc ugagcacaag 2340
          uucgagggcc ugacccagau cuuccagaag gccuacgaac acgaacagca caucuccgag 2400
          agcaucaaca auauugugga ucacgcuauc aaguccaagg accacgcuac cuucaacuuc 2460
          cugcaguggu acguggccga gcaacaugag gaggaggugc uguucaagga cauccuggac 2520
          aagaucgagc ugaucgguaa ugagaaucac ggccuguacc uggccgacca guacgugaag 2580
          ggcaucgcca agagccggaa gucaggcuca ugauaauagg cuggagccuc gguggccuag 2640
          cuucuugccc cuugggccuc cccccagccc cuccuccccu uccugcaccc guacccccgu 2700
          ggucuuugaa uaaagucuga gugggcggc 2729
           <![CDATA[ <210> 7]]>
           <![CDATA[ <211> 2553]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 7]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
          ggaggaggca gcggcggcga uaucaucaag cuucugaacg agcaaguuaa caaggaaaug 2100
          cagagcagua aucucuacau gagcaugagc agcuggugcu acacccacuc ccuggacgga 2160
          gcaggccucu uccuguucga ccacgcagcc gaggaguacg agcacgcuaa gaaguugauc 2220
          auuuucuuga acgagaacaa cgugcccgug cagcuaacgu caaucagcgc accugagcac 2280
          aaguucgagg gccugaccca gaucuuccag aaggccuacg aacacgaaca gcacaucucc 2340
          gagagcauca acaauauugu ggaucacgcu aucaagucca aggaccacgc uaccuucaac 2400
          uuccugcagu gguacguggc cgagcaacau gaggaggagg ugcuguucaa ggacauccug 2460
          gacaagaucg agcugaucgg uaaugagaau cacggccugu accuggccga ccaguacgug 2520
          aagggcaucg ccaagagccg gaagucaggc uca 2553
           <![CDATA[ <210> 8]]>
           <![CDATA[ <211> 851]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 8]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Gly Gly Asp Ile
                  675 680 685
          Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn
              690 695 700
          Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly
          705 710 715 720
          Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala
                          725 730 735
          Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu
                      740 745 750
          Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile
                  755 760 765
          Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn
              770 775 780
          Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn
          785 790 795 800
          Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe
                          805 810 815
          Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly
                      820 825 830
          Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys
                  835 840 845
          Ser Gly Ser
              850
           <![CDATA[ <210> 9]]>
           <![CDATA[ <211> 2762]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 9]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          ggcauccugc ccagcccugg caugcccgcu cugcugagcc uggugagccu gcugagcgug 120
          cugcugaugg gcugcguggc ugagaccggc augcagaucu acgagggcaa gcugaccgca 180
          gagggccugc gguucggcau cguggccagc cgcgccaacc acgcucuggu ggaccggcuu 240
          guggagggcg cuaucgacgc caucgugaga cacggcggcc gggaagagga caucacccug 300
          gugcgggugu gcggcagcug ggagauuccc gucgccgccg gagaacuggc ccggaaggag 360
          gacaucgacg ccgugaucgc caucggcgug cugugcagag gcgccacgcc cagcuucgac 420
          uacaucgcca gcgaggugag caagggccug gccgaccuga gccuggagcu gcggaagccc 480
          aucaccuucg gcgugaucac cgccgacacc cuggagcagg ccaucgaggc cgcaggcacc 540
          ugccacggca acaagggcug ggaagccgcc cugugcgcca ucgagauggc caaccuguuc 600
          aagagccugc ggggcggaag uggaggcucu gguggcagcg gaggaucugg cggcggcacc 660
          acccggaccc agcugccacc agccuacacc aacagcuuca cccggggcgu cuacuacccc 720
          gacaaggugu uccggagcag cguccugcac agcacccagg accuguuccu gcccuucuuc 780
          agcaacguga ccugguucca cgccauccac gugagcggca ccaacggcac caagcgguuc 840
          gacaaccccg ugcugcccuu caacgacggc guguacuucg ccagcaccga gaagagcaac 900
          aucauccggg gcuggaucuu cggcaccacc cuggacagca agacccagag ccugcugauc 960
          gugaauaacg ccaccaacgu ggugaucaag gugugcgagu uccaguucug caacgacccc 1020
          uuccugggcg uguacuacca caagaacaac aagagcugga uggagagcga guuccgggug 1080
          uacagcagcg ccaacaacug caccuucgag uacgugagcc agcccuuccu gauggaccug 1140
          gagggcaagc agggcaacuu caagaaccug cgggaguucg uguucaagaa caucgacggc 1200
          uacuucaaga ucuacagcaa gcacacccca aucaaccugg ugcgggaucu gccccagggc 1260
          uucucagccc uggagccccu gguggaccug cccaucggca ucaacaucac ccgguuccag 1320
          acccugcugg cccugcaccg gagcuaccug accccaggcg acagcagcag cggguggaca 1380
          gcaggcgcgg cugcuuacua cgugggcuac cugcagcccc ggaccuuccu gcugaaguac 1440
          aacgagaacg gcaccaucac cgacgccgug gacugcgccc uggacccucu gagcgagacc 1500
          aagugcaccc ugaagagcuu caccguggag aagggcaucu accagaccag caacuuccgg 1560
          gugcagccca ccgagagcau cgugcgguuc cccaacauca ccaaccugug ccccuucggc 1620
          gagguguuca acgccacccg guucgccagc guguacgccu ggaaccggaa gcggaucagc 1680
          aacugcgugg ccgacuacag cgugcuguac aacagcgcca gcuucagcac cuucaagugc 1740
          uacggcguga gccccaccaa gcugaacgac cugugcuuca ccaacgugua cgccgacagc 1800
          uucgugaucc guggcgacga ggugcggcag aucgcacccg gccagacagg caagaucgcc 1860
          gacuacaacu acaagcugcc cgacgacuuc accggcugcg ugaucgccug gaacagcaac 1920
          aaccucgaca gcaagguggg cggcaacuac aacuaccugu accggcuguu ccggaagagc 1980
          aaccugaagc ccuucgagcg ggacaucagc accgagaucu accaagccgg cuccaccccu 2040
          ugcaacggcg uggagggcuu caacugcuac uucccucugc agagcuacgg cuuccagccc 2100
          accaacggcg ugggcuacca gcccuaccgg gugguggugc ugagcuucga gcugcugcac 2160
          gccccagcca ccgugugugg ccccaagaag agcaccaacc uggugaagaa caagugcgug 2220
          aacuucaacu ucaacggccu uaccggcacc ggcgugcuga ccgagagcaa caagaaauuc 2280
          cugcccuuuc agcaguucgg ccgggacauc gccgacacca ccgacgcugu gcgggauccc 2340
          cagacccugg agauccugga caucaccccu ugcagcuucg gcggcgugag cgugaucacc 2400
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag 2460
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc 2520
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc 2580
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau 2640
          ucaugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag 2700
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg 2760
          gc 2762
           <![CDATA[ <210> 10]]>
           <![CDATA[ <211> 2586]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 10]]>
          augggcaucc ugcccagccc uggcaugccc gcucugcuga gccuggugag ccugcugagc 60
          gugcugcuga ugggcugcgu ggcugagacc ggcaugcaga ucuacgaggg caagcugacc 120
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg 180
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc 240
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag 300
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc 360
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag 420
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc 480
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug 540
          uucaagagcc ugcggggcgg aaguggaggc ucugguggca gcggaggauc uggcggcggc 600
          accacccgga cccagcugcc accagccuac accaacagcu ucacccgggg cgucuacuac 660
          cccgacaagg uguuccggag cagcguccug cacagcaccc aggaccuguu ccugcccuuc 720
          uucagcaacg ugaccugguu ccacgccauc cacgugagcg gcaccaacgg caccaagcgg 780
          uucgacaacc ccgugcugcc cuucaacgac ggcguguacu ucgccagcac cgagaagagc 840
          aacaucaucc ggggcuggau cuucggcacc acccuggaca gcaagaccca gagccugcug 900
          aucgugaaua acgccaccaa cguggugauc aaggugugcg aguuccaguu cugcaacgac 960
          cccuuccugg gcguguacua ccacaagaac aacaagagcu ggauggagag cgaguuccgg 1020
          guguacagca gcgccaacaa cugcaccuuc gaguacguga gccagcccuu ccugauggac 1080
          cuggagggca agcagggcaa cuucaagaac cugcgggagu ucguguucaa gaacaucgac 1140
          ggcuacuuca agaucuacag caagcacacc ccaaucaacc uggugcggga ucugccccag 1200
          ggcuucucag cccuggagcc ccugguggac cugcccaucg gcaucaacau cacccgguuc 1260
          cagacccugc uggcccugca ccggagcuac cugaccccag gcgacagcag cagcgggugg 1320
          acagcaggcg cggcugcuua cuacgugggc uaccugcagc cccggaccuu ccugcugaag 1380
          uacaacgaga acggcaccau caccgacgcc guggacugcg cccuggaccc ucugagcgag 1440
          accaagugca cccugaagag cuucaccgug gagaagggca ucuaccagac cagcaacuuc 1500
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc 1560
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc 1620
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag 1680
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac 1740
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc 1800
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc 1860
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag 1920
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc 1980
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag 2040
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug 2100
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc 2160
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa 2220
          uuccugcccu uucagcaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau 2280
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu ucggcggcgu gagcgugauc 2340
          accccaggca ccaacaccag caaccaggug gccgugcugu accaggacgu gaacugcacc 2400
          gaggugcccg uggccaucca cgccgaccag cugacaccca ccuggcgggu cuacagcacc 2460
          ggcagcaacg uguuccagac ccgggccggu ugccugaucg gcgccgagca cgugaacaac 2520
          agcuacgagu gcgacauccc caucggcgcc ggcaucugug ccagcuacca gacccagacc 2580
          aauuca 2586
           <![CDATA[ <210> 11]]>
           <![CDATA[ <211> 862]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 11]]>
          Met Gly Ile Leu Pro Ser Pro Gly Met Pro Ala Leu Leu Ser Leu Val
          1 5 10 15
          Ser Leu Leu Ser Val Leu Leu Met Gly Cys Val Ala Glu Thr Gly Met
                      20 25 30
          Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly Ile
                  35 40 45
          Val Ala Ser Arg Ala Asn His Ala Leu Val Asp Arg Leu Val Glu Gly
              50 55 60
          Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile Thr
          65 70 75 80
          Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro Val Ala Ala Gly Glu
                          85 90 95
          Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val Leu
                      100 105 110
          Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile Ala Ser Glu Val Ser
                  115 120 125
          Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr Phe
              130 135 140
          Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Ala Ala Gly
          145 150 155 160
          Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Cys Ala Ile Glu
                          165 170 175
          Met Ala Asn Leu Phe Lys Ser Leu Arg Gly Gly Ser Gly Gly Ser Gly
                      180 185 190
          Gly Ser Gly Gly Ser Gly Gly Gly Thr Thr Arg Thr Gln Leu Pro Pro
                  195 200 205
          Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val
              210 215 220
          Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu Pro Phe
          225 230 235 240
          Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly Thr Asn
                          245 250 255
          Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp Gly Val
                      260 265 270
          Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe
                  275 280 285
          Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val Asn Asn
              290 295 300
          Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys Asn Asp
          305 310 315 320
          Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp Met Glu
                          325 330 335
          Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr
                      340 345 350
          Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly Asn Phe
                  355 360 365
          Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys
              370 375 380
          Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu Pro Gln
          385 390 395 400
          Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly Ile Asn
                          405 410 415
          Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr Leu Thr
                      420 425 430
          Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr
                  435 440 445
          Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn
              450 455 460
          Gly Thr Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu
          465 470 475 480
          Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln
                          485 490 495
          Thr Ser Asn Phe Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe Pro
                      500 505 510
          Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg
                  515 520 525
          Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val
              530 535 540
          Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys
          545 550 555 560
          Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn
                          565 570 575
          Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile
                      580 585 590
          Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro
                  595 600 605
          Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp
              610 615 620
          Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys
          625 630 635 640
          Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln
                          645 650 655
          Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe
                      660 665 670
          Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln
                  675 680 685
          Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala
              690 695 700
          Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys Cys
          705 710 715 720
          Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr Glu
                          725 730 735
          Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala
                      740 745 750
          Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp
                  755 760 765
          Ile Thr Pro Cys Ser Phe Gly Gly Val Ser Val Ile Thr Pro Gly Thr
              770 775 780
          Asn Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr
          785 790 795 800
          Glu Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg
                          805 810 815
          Val Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu
                      820 825 830
          Ile Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile
                  835 840 845
          Gly Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser
              850 855 860
           <![CDATA[ <210> 12]]>
           <![CDATA[ <211> 2714]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 12]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucagga 2100
          ggaggcuccg gaggcgguag cgcugagacc ggcaugcaga ucuacgaggg caagcugacc 2160
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg 2220
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc 2280
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag 2340
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc 2400
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag 2460
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc 2520
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug 2580
          uucaagagcc ugcggugaua auaggcugga gccucggugg ccuagcuucu ugccccuugg 2640
          gccuccccccc agccccuccu ccccuuccug cacccguacc cccguggucu uugaauaaag 2700
          ucugaguggg cggc 2714
           <![CDATA[ <210> 13]]>
           <![CDATA[ <211> 2538]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 13]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
          ggaggaggcu ccggaggcgg uagcgcugag accggcaugc agaucuacga gggcaagcug 2100
          accgcagagg gccugcgguu cggcaucgug gccagccgcg ccaaccacgc ucugguggac 2160
          cggcuugugg agggcgcuau cgacgccauc gugagacacg gcggccggga agaggacauc 2220
          acccuggugc gggugugcgg cagcugggag auucccgucg ccgccggaga acuggcccgg 2280
          aaggaggaca ucgacgccgu gaucgccauc ggcgugcugu gcagaggcgc cacgcccagc 2340
          uucgacuaca ucgccagcga ggugagcaag ggccuggccg accugagccu ggagcugcgg 2400
          aagcccauca ccuucggcgu gaucaccgcc gacacccugg agcaggccau cgaggccgca 2460
          ggcaccugcc acggcaacaa gggcugggaa gccgcccugu gcgccaucga gauggccaac 2520
          cuguucaaga gccugcgg 2538
           <![CDATA[ <210> 14]]>
           <![CDATA[ <211> 846]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 14]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Gly Gly Gly Ser
                  675 680 685
          Ala Glu Thr Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly
              690 695 700
          Leu Arg Phe Gly Ile Val Ala Ser Arg Ala Asn His Ala Leu Val Asp
          705 710 715 720
          Arg Leu Val Glu Gly Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg
                          725 730 735
          Glu Glu Asp Ile Thr Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro
                      740 745 750
          Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile
                  755 760 765
          Ala Ile Gly Val Leu Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile
              770 775 780
          Ala Ser Glu Val Ser Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg
          785 790 795 800
          Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala
                          805 810 815
          Ile Glu Ala Ala Gly Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala
                      820 825 830
          Leu Cys Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
                  835 840 845
           <![CDATA[ <210> 15]]>
           <![CDATA[ <211> 2300]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 15]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucaucu 2100
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug 2160
          agccugggcg ccaucagcuu cugauaauag gcuggagccu cgguggccua gcuucuugcc 2220
          ccuugggccu ccccccagcc ccuccucccc uuccugcacc cguacccccg uggucuuuga 2280
          auaaagucug agugggcggc 2300
           <![CDATA[ <210> 16]]>
           <![CDATA[ <211> 2124]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 16]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug 2100
          gugagccugg gcgccaucag cuuc 2124
           <![CDATA[ <210> 17]]>
           <![CDATA[ <211> 707]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 17]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser Gly Gly Gly Ser Ile Leu Ala Ile
                  675 680 685
          Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala
              690 695 700
          Ile Ser Phe
          705
           <![CDATA[ <210> 18]]>
           <![CDATA[ <211> 1853]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 18]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc ucuggcggag gcagcauccu ggccaucuac 1680
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa 1740
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccucccuc 1800
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc 1853
           <![CDATA[ <210> 19]]>
           <![CDATA[ <211> 1677]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 19]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accucuggcg gaggcagcau ccuggccauc 1620
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc 1677
           <![CDATA[ <210> 20]]>
           <![CDATA[ <211> 559]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 20]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val
              530 535 540
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
          545 550 555
           <![CDATA[ <210> 21]]>
           <![CDATA[ <211> 2042]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 21]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcu cuggcggagg cagcauccug 1860
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc 1920
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag 1980
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg 2040
          gc 2042
           <![CDATA[ <210> 22]]>
           <![CDATA[ <211> 1866]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 22]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcucuggcgg aggcagcauc 1800
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc 1860
          agcuuc 1866
           <![CDATA[ <210> 23]]>
           <![CDATA[ <211> 622]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 23]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala
                  595 600 605
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
              610 615 620
           <![CDATA[ <210> 24]]>
           <![CDATA[ <211> 2066]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 24]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu gggcagcggc ggcggcagcg gcggaggcag cggaggaggc 1020
          agcggcggag gcaguggagg ccagcccacc gagagcaucg ugcgguuccc caacaucacc 1080
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1140
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1200
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1260
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1320
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1380
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1440
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1500
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1560
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1620
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagaagag caccaaccug 1680
          gugaagaaca agugcgugaa cuucaacuuc aacggccuua ccggcaccgg cgugcugacc 1740
          gagagcaaca agaaauuccu gcccuuucag caguucggcc gggacaucgc cgacaccacc 1800
          gacgcugugc gggaucccca gacccuggag auccuggaca ucaccccuug cagcuucggc 1860
          ggcucuggcg gaggcagcau ccuggccauc uacagcaccg uggccagcag ccuggugcug 1920
          cuggugagcc ugggcgccau cagcuucuga uaauaggcug gagccucggu ggccuagcuu 1980
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu 2040
          cuuugaauaa agucugagug ggcggc 2066
           <![CDATA[ <210> 25]]>
           <![CDATA[ <211> 1890]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 25]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cgugggcagc ggcggcggca gcggcggagg cagcggagga 960
          ggcagcggcg gaggcagugg aggccagccc accgagagca ucgugcgguu ccccaacauc 1020
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 1080
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1140
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1200
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1260
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1320
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1380
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1440
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1500
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1560
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaagaa gagcaccaac 1620
          cuggugaaga acaagugcgu gaacuucaac uucaacggcc uuaccggcac cggcgugcug 1680
          accgagagca acaagaaauu ccugcccuuu cagcaguucg gccgggacau cgccgacacc 1740
          accgacgcug ugcgggaucc ccagacccug gagauccugg acaucacccc uugcagcuuc 1800
          ggcggcucug gcggaggcag cauccuggcc aucuacagca ccguggccag cagccuggug 1860
          cugcugguga gccugggcgc caucagcuuc 1890
           <![CDATA[ <210> 26]]>
           <![CDATA[ <211> 630]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 26]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly
          305 310 315 320
          Gly Ser Gly Gly Gly Ser Gly Gly Gln Pro Thr Glu Ser Ile Val Arg
                          325 330 335
          Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala
                      340 345 350
          Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn
                  355 360 365
          Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr
              370 375 380
          Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe
          385 390 395 400
          Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg
                          405 410 415
          Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys
                      420 425 430
          Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn
                  435 440 445
          Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe
              450 455 460
          Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile
          465 470 475 480
          Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys
                          485 490 495
          Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly
                      500 505 510
          Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala
                  515 520 525
          Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn
              530 535 540
          Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu
          545 550 555 560
          Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp
                          565 570 575
          Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile
                      580 585 590
          Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Ser Gly Gly Gly Ser Ile
                  595 600 605
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser
              610 615 620
          Leu Gly Ala Ile Ser Phe
          625 630
           <![CDATA[ <210> 27]]>
           <![CDATA[ <211> 2066]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 27]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu gggcagcggc ggcggcagcg gcggaggcag cggaggaggc 1020
          agcggcggag gcaguggagg ccagcccacc gagagcaucg ugcgguuccc caacaucacc 1080
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1140
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1200
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1260
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1320
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1380
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1440
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1500
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1560
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1620
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagaagag caccaaccug 1680
          gugaagaaca agugcgugaa cuucaacuuc aacggccuua ccggcaccgg cgugcugacc 1740
          gagagcaaca agaaauuccu gcccuuuugc caguucggcc gggacaucgc cgacaccacc 1800
          gacgcugugc gggaucccca gacccuggag auccuggaca ucaccccuug cagcuucggc 1860
          ggcucuggcg gaggcagcau ccuggccauc uacagcaccg uggccagcag ccuggugcug 1920
          cuggugagcc ugggcgccau cagcuucuga uaauaggcug gagccucggu ggccuagcuu 1980
          cuugccccuu gggccucccc ccagccccuc cuccccuucc ugcacccgua cccccguggu 2040
          cuuugaauaa agucugagug ggcggc 2066
           <![CDATA[ <210> 28]]>
           <![CDATA[ <211> 1890]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 28]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cgugggcagc ggcggcggca gcggcggagg cagcggagga 960
          ggcagcggcg gaggcagugg aggccagccc accgagagca ucgugcgguu ccccaacauc 1020
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 1080
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1140
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1200
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1260
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1320
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1380
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1440
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1500
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1560
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaagaa gagcaccaac 1620
          cuggugaaga acaagugcgu gaacuucaac uucaacggcc uuaccggcac cggcgugcug 1680
          accgagagca acaagaaauu ccugcccuuu ugccaguucg gccgggacau cgccgacacc 1740
          accgacgcug ugcgggaucc ccagacccug gagauccugg acaucacccc uugcagcuuc 1800
          ggcggcucug gcggaggcag cauccuggcc aucuacagca ccguggccag cagccuggug 1860
          cugcugguga gccugggcgc caucagcuuc 1890
           <![CDATA[ <210> 29]]>
           <![CDATA[ <211> 630]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 29]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly
          305 310 315 320
          Gly Ser Gly Gly Gly Ser Gly Gly Gln Pro Thr Glu Ser Ile Val Arg
                          325 330 335
          Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala
                      340 345 350
          Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn
                  355 360 365
          Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr
              370 375 380
          Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe
          385 390 395 400
          Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg
                          405 410 415
          Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys
                      420 425 430
          Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn
                  435 440 445
          Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe
              450 455 460
          Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile
          465 470 475 480
          Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys
                          485 490 495
          Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly
                      500 505 510
          Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala
                  515 520 525
          Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn
              530 535 540
          Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu
          545 550 555 560
          Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Cys Gln Phe Gly Arg Asp
                          565 570 575
          Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile
                      580 585 590
          Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly Ser Gly Gly Gly Ser Ile
                  595 600 605
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser
              610 615 620
          Leu Gly Ala Ile Ser Phe
          625 630
           <![CDATA[ <210> 30]]>
           <![CDATA[ <211> 3182]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 30]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu cggcggcagc 1020
          ggcggcguga gcgugaucac cccaggcacc aacaccagca accagguggc cgugcuguac 1080
          caggacguga acugcaccga ggugcccgug gccauccacg ccgaccagcu gacacccacc 1140
          uggcgggucu acagcaccgg cagcaacgug uuccagaccc gggccgguug ccugaucggc 1200
          gccgagcacg ugaacaacag cuacgagugc gacaucccca ucggcgccgg caucugugcc 1260
          agcuaccaga cccagaccaa uucaccccgg agggcaagga gcguggccag ccagagcauc 1320
          aucgccuaca ccaugagccu gggcgccgag aacagcgugg ccuacagcaa caacagcauc 1380
          gccaucccca ccaacuucac caucagcgug accaccgaga uucugcccgu gagcaugacc 1440
          aagaccagcg uggacugcac cauguacauc ugcggcgaca gcaccgagug cagcaaccug 1500
          cugcugcagu acggcagcuu cugcacccag cugaaccggg cccugaccgg caucgccgug 1560
          gagcaggaca agaacaccca ggagguguuc gcccagguga agcagaucua caagaccccu 1620
          cccaucaagg acuucggcgg cuucaacuuc agccagaucc ugcccgaccc cagcaagccc 1680
          agcaagcgga gcuucaucga ggaccugcug uucaacaagg ugacccuagc cgacgccggc 1740
          uucaucaagc aguacggcga cugccucggc gacauagccg cccgggaccu gaucugcgcc 1800
          cagaaguuca acggccugac cgugcugccu ccccugcuga ccgacgagau gaucgcccag 1860
          uacaccagcg cccuguuagc cggaaccauc accagcggcu ggacuuucgg cgcuggagcc 1920
          gcucugcaga uccccuucgc caugcagaug gccuaccggu ucaacggcau cggcgugacc 1980
          cagaacgugc uguacgagaa ccagaagcug aucgccaacc aguucaacag cgccaucggc 2040
          aagauccagg acagccugag cagcaccgcu agcgcccugg gcaagcugca ggacguggug 2100
          aaccagaacg cccaggcccu gaacacccug gugaagcagc ugagcagcaa cuucggcgcc 2160
          aucagcagcg ugcugaacga cauccugagc cggcuggacc cucccaacgc caccgugcag 2220
          aucgaccggc ugaucacugg ccggcugcag agccugcaga ccuacgugac ccagcagcug 2280
          auccgggccg ccgagauucg ggccagcgcc aaccuggccg ccaccaagau gagcgagugc 2340
          gugcugggcc agagcaagcg gguggacuuc ugcggcaagg gcuaccaccu gaugagcuuu 2400
          ccccagagcg caccccacgg agugguguuc cugcacguga ccuacgugcc cgcccaggag 2460
          aagaacuuca ccaccgcccc agccaucugc cacgacggca aggcccacuu uccccgggag 2520
          ggcguguucg ugagcaacgg cacccacugg uucgugaccc agcggaacuu cuacgagccc 2580
          cagaucauca ccaccgacaa caccuucgug agcggcaacu gcgacguggu gaucggcauc 2640
          gugaacaaca ccguguacga uccccugcag cccgagcugg acagcuucaa ggaggagcug 2700
          gacaaguacu ucaagaauca caccagcccc gacguggacc ugggcgacau cagcggcauc 2760
          aacgccagcg uggugaacau ccagaaggag aucgaucggc ugaacgaggu ggccaagaac 2820
          cugaacgaga gccugaucga ccugcaggag cugggcaagu acgagcagua caucaagugg 2880
          cccugguaca ucuggcuggg cuucaucgcc ggccugaucg ccaucgugau ggugaccauc 2940
          augcugugcu gcaugaccag cugcugcagc ugccugaagg gcuguugcag cugcggcagc 3000
          ugcugcaagu ucgacgagga cgacagcgag cccgugcuga agggcgugaa gcugcacuac 3060
          accugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag 3120
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg 3180
          gc 3182
           <![CDATA[ <210> 31]]>
           <![CDATA[ <211> 3006]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 31]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuucggcggc 960
          agcggcggcg ugagcgugau caccccaggc accaacacca gcaaccaggu ggccgugcug 1020
          uaccaggacg ugaacugcac cgaggugccc guggccaucc acgccgacca gcugacaccc 1080
          accuggcggg ucuacagcac cggcagcaac guguuccaga cccgggccgg uugccugauc 1140
          ggcgccgagc acgugaacaa cagcuacgag ugcgacaucc ccaucggcgc cggcaucugu 1200
          gccagcuacc agacccagac caauucaccc cggagggcaa ggagcguggc cagccagagc 1260
          aucaucgccu acaccaugag ccugggcgcc gagaacagcg uggccuacag caacaacagc 1320
          aucgccaucc ccaccaacuu caccaucagc gugaccaccg agauucugcc cgugagcaug 1380
          accaagacca gcguggacug caccauguac aucugcggcg acagcaccga gugcagcaac 1440
          cugcugcugc aguacggcag cuucugcacc cagcugaacc gggcccugac cggcaucgcc 1500
          guggagcagg acaagaacac ccaggaggug uucgcccagg ugaagcagau cuacaagacc 1560
          ccucccauca aggacuucgg cggcuucaac uucagccaga uccugcccga ccccagcaag 1620
          cccagcaagc ggagcuucau cgaggaccug cuguucaaca aggugacccu agccgacgcc 1680
          ggcuucauca agcaguacgg cgacugccuc ggcgacauag ccgcccggga ccugaucugc 1740
          gcccagaagu ucaacggccu gaccgugcug ccuccccugc ugaccgacga gaugaucgcc 1800
          caguacacca gcgcccuguu agccggaacc aucaccagcg gcuggacuuu cggcgcugga 1860
          gccgcucugc agauccccuu cgccaugcag auggccuacc gguucaacgg caucggcgug 1920
          acccagaacg ugcuguacga gaaccagaag cugaucgcca accaguucaa cagcgccauc 1980
          ggcaagaucc aggacagccu gagcagcacc gcuagcgccc ugggcaagcu gcaggacgug 2040
          gugaaccaga acgcccaggc ccugaacacc cuggugaagc agcugagcag caacuucggc 2100
          gccaucagca gcgugcugaa cgacauccug agccggcugg acccucccaa cgccaccgug 2160
          cagaucgacc ggcugaucac uggccggcug cagagccugc agaccuacgu gacccagcag 2220
          cugauccggg ccgccgagau ucgggccagc gccaaccugg ccgccaccaa gaugagcgag 2280
          ugcgugcugg gccagagcaa gcggguggac uucugcggca agggcuacca ccugaugagc 2340
          uuuccccaga gcgcacccca cggaguggug uuccugcacg ugaccuacgu gcccgcccag 2400
          gagaagaacu ucaccaccgc cccagccauc ugccacgacg gcaaggccca cuuuccccgg 2460
          gagggcgugu ucgugagcaa cggcacccac ugguucguga cccagcggaa cuucuacgag 2520
          ccccagauca ucaccaccga caacaccuuc gugagcggca acugcgacgu ggugaucggc 2580
          aucgugaaca acaccgugua cgauccccug cagcccgagc uggacagcuu caaggaggag 2640
          cuggacaagu acuucaagaa ucacaccagc cccgacgugg accugggcga caucagcggc 2700
          aucaacgcca gcguggugaa cauccagaag gagaucgauc ggcugaacga gguggccaag 2760
          aaccugaacg agagccugau cgaccugcag gagcugggca aguacgagca guacaucaag 2820
          uggcccuggu acaucuggcu gggcuucauc gccggccuga ucgccaucgu gauggugacc 2880
          aucaugcugu gcugcaugac cagcugcugc agcugccuga agggcuguug cagcugcggc 2940
          agcugcugca aguucgacga ggacgacagc gagcccgugc ugaagggcgu gaagcugcac 3000
          uacacc 3006
           <![CDATA[ <210> 32]]>
           <![CDATA[ <211> 1002]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 32]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Gly Gly
          305 310 315 320
          Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln
                          325 330 335
          Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala
                      340 345 350
          Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly
                  355 360 365
          Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His
              370 375 380
          Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys
          385 390 395 400
          Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val
                          405 410 415
          Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn
                      420 425 430
          Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr
                  435 440 445
          Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser
              450 455 460
          Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn
          465 470 475 480
          Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu
                          485 490 495
          Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala
                      500 505 510
          Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly
                  515 520 525
          Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg
              530 535 540
          Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala
          545 550 555 560
          Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg
                          565 570 575
          Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro
                      580 585 590
          Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala
                  595 600 605
          Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln
              610 615 620
          Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val
          625 630 635 640
          Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe
                          645 650 655
          Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser
                      660 665 670
          Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu
                  675 680 685
          Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser
              690 695 700
          Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro Asn Ala Thr Val
          705 710 715 720
          Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr
                          725 730 735
          Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
                      740 745 750
          Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg
                  755 760 765
          Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser
              770 775 780
          Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln
          785 790 795 800
          Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala
                          805 810 815
          His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe
                      820 825 830
          Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn
                  835 840 845
          Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn
              850 855 860
          Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu
          865 870 875 880
          Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu Gly
                          885 890 895
          Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys Glu Ile
                      900 905 910
          Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp
                  915 920 925
          Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr
              930 935 940
          Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr
          945 950 955 960
          Ile Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys
                          965 970 975
          Cys Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro
                      980 985 990
          Val Leu Lys Gly Val Lys Leu His Tyr Thr
                  995 1000
           <![CDATA[ <210> 33]]>
           <![CDATA[ <211> 2378]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 33]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagg gcaccaucac cgacgccgug 120
          gacugcgccc uggacccucu gagcgagacc aagugcaccc ugaagagcuu caccguggag 180
          aagggcaucu accagaccag caacuucggc ggcagcggcg gcgugagcgu gaucacccca 240
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 300
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 360
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 420
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 480
          ccccggaggg caaggagcgu ggccagccag agcaucaucg ccuacaccau gagccugggc 540
          gccgagaaca gcguggccua cagcaacaac agcaucgcca uccccaccaa cuucaccauc 600
          agcgugacca ccgagauucu gcccgugagc augaccaaga ccagcgugga cugcaccaug 660
          uacaucugcg gcgacagcac cgagugcagc aaccugcugc ugaacuacac cagcuucugc 720
          acccagcuga accgggcccu gaccggcauc gccguggagc aggacaagaa cacccaggag 780
          guguucgccc aggugaagca gaucuacaag accccuccca ucaaggacuu cggcggcuuc 840
          aacuucagcc agauccugcc cgaccccagc aagcccagca agcggagcuu caucgaggac 900
          cugcuguuca acaaggugac ccuagccgac gccggcuuca ucaagcagua cggcgacugc 960
          cucggcgaca uagccgcccg ggaccugauc ugcgcccaga aguucaacgg ccugaccgug 1020
          cugccuccccc ugcugaccga cgagaugauc gcccaguaca ccagcgcccu guuagccgga 1080
          accaucacca gcggcuggac uuucggcgcu ggagccgcuc ugcagauccc cuucgccaug 1140
          cagauggccu accgguucaa cggcaucggc gugacccaga acgugcugua cgagaaccag 1200
          aagcugaucg ccaaccaguu caacagcgcc aucggcaaga uccaggacag ccugagcagc 1260
          accgcuagcg cccugggcaa gcugcaggac guggugaacc agaacgccca ggcccugaac 1320
          acccugguga agcagcugag cagcaacuuc ggcgccauca gcagcgugcu gaacgacauc 1380
          cugagccggc uggacccucc caacgccacc gugcagaucg accggcugau cacuggccgg 1440
          cugcagagcc ugcagaccua cgugacccag cagcugaucc gggccgccga gauucgggcc 1500
          agcgccaacc uggccgccac caagaugagc gagugcgugc ugggccagag caagcgggug 1560
          gacuucugcg gcaagggcua ccaccugaug agcuuucccc agagcgcacc ccacggagug 1620
          guguuccugc acgugaccua cgugcccgcc caggagaaga acuucaccac cgccccagcc 1680
          aucugccacg acggcaaggc ccacuuuccc cgggagggcg uguucgugag caacggcacc 1740
          cacugguucg ugacccagcg gaacuucuac gagccccaga ucaucaccac cgacaacacc 1800
          uucgugagcg gcaacugcga cguggugauc ggcaucguga acaacaccgu guacgauccc 1860
          cugcagcccg agcuggacag cuucaaggag gagcuggaca aguacuucaa gaaucacacc 1920
          agccccgacg uggaccuggg cgacaucagc ggcaucaacg ccagcguggu gaacauccag 1980
          aaggagaucg aucggcugaa cgagguggcc aagaaccuga acgagagccu gaucgaccug 2040
          caggagcugg gcaaguacga gcaguacauc aaguggcccu gguacaucug gcugggcuuc 2100
          aucgccggcc ugaucgccau cgugauggug accaucaugc ugugcugcau gaccagcugc 2160
          ugcagcugcc ugaagggcug uugcagcugc ggcagcugcu gcaaguucga cgaggacgac 2220
          agcgagcccg ugcugaaggg cgugaagcug cacuacaccu gauaauaggc uggagccucg 2280
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg 2340
          uacccccgug gucuuugaau aaagucugag ugggcggc 2378
           <![CDATA[ <210> 34]]>
           <![CDATA[ <211> 2202]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 34]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agggcaccau caccgacgcc 60
          guggacugcg cccuggaccc ucugagcgag accaagugca cccugaagag cuucaccgug 120
          gagaagggca ucuaccagac cagcaacuuc ggcggcagcg gcggcgugag cgugaucacc 180
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag 240
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc 300
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc 360
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau 420
          ucaccccgga gggcaaggag cguggccagc cagagcauca ucgccuacac caugagccug 480
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc 540
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc 600
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugaacua caccagcuuc 660
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag 720
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc 780
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag 840
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac 900
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc 960
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc 1020
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc 1080
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac 1140
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc 1200
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug 1260
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac 1320
          auccugagcc ggcuggaccc ucccaacgcc accgugcaga ucgaccggcu gaucacuggc 1380
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg 1440
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg 1500
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga 1560
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca 1620
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc 1680
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac 1740
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau 1800
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac 1860
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc 1920
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac 1980
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc 2040
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc 2100
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac 2160
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca cc 2202
           <![CDATA[ <210> 35]]>
           <![CDATA[ <211> 734]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 35]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Gly Thr
          1 5 10 15
          Ile Thr Asp Ala Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys
                      20 25 30
          Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser
                  35 40 45
          Asn Phe Gly Gly Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn
              50 55 60
          Thr Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu
          65 70 75 80
          Val Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val
                          85 90 95
          Tyr Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile
                      100 105 110
          Gly Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly
                  115 120 125
          Ala Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg
              130 135 140
          Ala Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu
          145 150 155 160
          Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro
                          165 170 175
          Thr Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met
                      180 185 190
          Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr
                  195 200 205
          Glu Cys Ser Asn Leu Leu Leu Asn Tyr Thr Ser Phe Cys Thr Gln Leu
              210 215 220
          Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln
          225 230 235 240
          Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys
                          245 250 255
          Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys
                      260 265 270
          Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr
                  275 280 285
          Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp
              290 295 300
          Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr
          305 310 315 320
          Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser
                          325 330 335
          Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly
                      340 345 350
          Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn
                  355 360 365
          Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile
              370 375 380
          Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser
          385 390 395 400
          Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn
                          405 410 415
          Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly
                      420 425 430
          Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro Pro
                  435 440 445
          Asn Ala Thr Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser
              450 455 460
          Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg
          465 470 475 480
          Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly
                          485 490 495
          Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser
                      500 505 510
          Phe Pro Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr
                  515 520 525
          Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His
              530 535 540
          Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly
          545 550 555 560
          Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile
                          565 570 575
          Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly
                      580 585 590
          Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser
                  595 600 605
          Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp
              610 615 620
          Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile
          625 630 635 640
          Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu
                          645 650 655
          Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys
                      660 665 670
          Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala Ile
                  675 680 685
          Val Met Val Thr Ile Met Leu Cys Cys Met Thr Ser Cys Cys Ser Cys
              690 695 700
          Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp
          705 710 715 720
          Asp Ser Glu Pro Val Leu Lys Gly Val Lys Leu His Tyr Thr
                          725 730
           <![CDATA[ <210> 36]]>
           <![CDATA[ <211> 4241]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 36]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uuccugauca ucuucauccu gcccaccacc cuggccguga ucggcgacuu caacugcacc 120
          aacagcuuca ucaacgacua caacaagacc aucccucgga ucagcgaaga cgugguggac 180
          gugucccugg gccugggcac cuacuacgug cugaaccggg uguaccugaa caccacacug 240
          cuguucaccg gcuacuuccc caagagcggc gccaacuucc gggaccuggc ccugaagggc 300
          agcaucuacc ugagcaccuu gugguacaag ccucccuucc ugagcgacuu caauaacggc 360
          aucuucucua aggugaagaa caccaagcug uacguaaaca acacccugua cagcgaguuc 420
          agcaccaucg ugaucggcag cguguucguc aacaccagcu acaccaucgu ggugcagccc 480
          cacaacggca uccuggagau caccgccugc caguacacca ugugcgagua cccucacacc 540
          gugugcaaga gcaagggcuc cauccggaac gagagcuggc acaucgacag cagcgagccg 600
          cugugccugu ucaagaagaa cuucaccuac aacgugagcg ccgacuggcu guacuuccac 660
          uucuaccagg agcggggcgu guucuacgcc uacuacgccg acgugggcau gccaaccacc 720
          uuccuguuca gccuguaccu gggcaccauc cugagccacu acuacgugau gccccugacc 780
          ugcaacgcca ucagcucaaa caccgacaac gagacccugg aguacugggu gacuccacug 840
          agccggcggc aguaccugcu gaacuucgac gagcacggcg ugaucaccaa cgccguggac 900
          ugcgcccugg acccucugag cgagaccaag ugcacccuga agagcuucac cguggagaag 960
          ggcaucuacc agaccagcgg cuucaccgug aagcccguag ccaccgugua ccggcggauc 1020
          cccaaccugc ccgacugcga caucgacaac uggcugaaca acgucagcgu gcccagccca 1080
          cugaacuggg agcggcggau cuucagcaac ugcaacuuca aucugagcac ccugcugcgg 1140
          cuggugcacg uggacagcuu cagcugcaac aaccuggaca agagcaagau cuucgguagc 1200
          ugcuucaaca gcaucaccgu ggacaaguuc gccaucccua accggcggcg ggacgaucug 1260
          cagcugggca gcagcggcuu ccugcagagc agcaacuaca agaucgacau cagcagcuca 1320
          agcugccagc uguacuacag ccugccccug gugaacguga ccaucaacaa cuucaacccc 1380
          agcagcugga accggcggua cggcuucggc agcuucaacc ugagcagcua cgacguggug 1440
          uacagcgacc acugcuucag cgugaacagc gacuucugcc ccugugccga cccuagcgug 1500
          gugaacagcu gcgccaagag caagccuccc agcgccauuu gccccgccgg caccaaguac 1560
          cggcacugcg accuggacac cacccuguac gugaagaacu ggugccggug cagcugccug 1620
          cccgacccca ucagcaccua cagccccaac accugucccc agaagaaggu gguggugggu 1680
          aucggcgagc acugucccgg ccugggcauc aacgaggaga agugcggcac ccagcugaac 1740
          cacagcagcu gcuucuguag ccccgacgcc uuccugggcu ggagcuucga cagcugcauc 1800
          agcaacaacc ggugcaacau cuuuagcaac uucaucuuca acggaaucaa cagcggcacc 1860
          accugcagca acgaccugcu guauagcaac accgagauca gcaccggcgu gugcgugaac 1920
          uacgaccugu acggcaucac cggccagggc aucuucaagg aggugagcgc cgccuacuac 1980
          aacaacuggc agaaccugcu guacgacagc aacggcaaca ucaucggcuu caaggacuuu 2040
          cugaccaaca agaccuacac cauccugccc ugcuacagcg gcggcgugag cgugaucacc 2100
          ccaggcacca acaccagcaa ccagguggcc gugcuguacc aggacgugaa cugcaccgag 2160
          gugcccgugg ccauccacgc cgaccagcug acacccaccu ggcgggucua cagcaccggc 2220
          agcaacgugu uccagacccg ggccgguugc cugaucggcg ccgagcacgu gaacaacagc 2280
          uacgagugcg acauccccau cggcgccggc aucugugcca gcuaccagac ccagaccaau 2340
          ucaccccgga gggcaaggag cguggccagc cagagcauca ucgccuacac caugagccug 2400
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc 2460
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc 2520
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugcagua cggcagcuuc 2580
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag 2640
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc 2700
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag 2760
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac 2820
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc 2880
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc 2940
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc 3000
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac 3060
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc 3120
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug 3180
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac 3240
          auccugagcc ggcuggaccc ucccgaggcc gaggugcaga ucgaccggcu gaucacuggc 3300
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg 3360
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg 3420
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga 3480
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca 3540
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc 3600
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac 3660
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau 3720
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac 3780
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc 3840
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac 3900
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc 3960
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc 4020
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac 4080
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca ccugauaaua ggcuggagcc 4140
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac 4200
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c 4241
           <![CDATA[ <210> 37]]>
           <![CDATA[ <211> 4065]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 37]]>
          auguuccuga ucaucuucau ccugcccacc acccuggccg ugaucggcga cuucaacugc 60
          accaacagcu ucaucaacga cuacaacaag accaucccuc ggaucagcga agacguggug 120
          gacguguccc ugggccuggg caccuacuac gugcugaacc ggguguaccu gaacaccaca 180
          cugcuguuca ccggcuacuu ccccaagagc ggcgccaacu uccgggaccu ggcccugaag 240
          ggcagcaucu accugagcac cuugugguac aagccucccu uccugagcga cuucaauaac 300
          ggcaucuucu cuaaggugaa gaacaccaag cuguacguaa acaacacccu guacagcgag 360
          uucagcacca ucgugaucgg cagcguguuc gucaacacca gcuacaccau cguggugcag 420
          ccccacaacg gcauccugga gaucaccgcc ugccaguaca ccaugugcga guacccucac 480
          accgugugca agagcaaggg cuccauccgg aacgagagcu ggcacaucga cagcagcgag 540
          ccgcugugcc uguucaagaa gaacuucacc uacaacguga gcgccgacug gcuguacuuc 600
          cacuucuacc aggagcgggg cguguucuac gccuacuacg ccgacguggg caugccaacc 660
          accuuccugu ucagccugua ccugggcacc auccugagcc acuacuacgu gaugccccug 720
          accugcaacg ccaucagcuc aaacaccgac aacgagaccc uggaguacug ggugacucca 780
          cugagccggc ggcaguaccu gcugaacuuc gacgagcacg gcgugaucac caacgccgug 840
          gacugcgccc uggacccucu gagcgagacc aagugcaccc ugaagagcuu caccguggag 900
          aagggcaucu accagaccag cggcuucacc gugaagcccg uagccaccgu guaccggcgg 960
          auccccaacc ugcccgacug cgacaucgac aacuggcuga acaacgucag cgugcccagc 1020
          ccacugaacu gggagcggcg gaucuucagc aacugcaacu ucaaucugag cacccugcug 1080
          cggcuggugc acguggacag cuucagcugc aacaaccugg acaagagcaa gaucuucggu 1140
          agcugcuuca acagcaucac cguggacaag uucgccaucc cuaaccggcg gcgggacgau 1200
          cugcagcugg gcagcagcgg cuuccugcag agcagcaacu acaagaucga caucagcagc 1260
          ucaagcugcc agcuguacua cagccugccc cuggugaacg ugaccaucaa caacuucaac 1320
          cccagcagcu ggaaccggcg guacggcuuc ggcagcuuca accugagcag cuacgacgug 1380
          guguacagcg accacugcuu cagcgugaac agcgacuucu gccccugugc cgacccuagc 1440
          guggugaaca gcugcgccaa gagcaagccu cccagcgcca uuugccccgc cggcaccaag 1500
          uaccggcacu gcgaccugga caccacccug uacgugaaga acuggugccg gugcagcugc 1560
          cugcccgacc ccaucagcac cuacagcccc aacaccuguc cccagaagaa ggugguggug 1620
          gguaucggcg agcacugucc cggccugggc aucaacgagg agaagugcgg cacccagcug 1680
          aaccacagca gcugcuucug uagccccgac gccuuccugg gcuggagcuu cgacagcugc 1740
          aucagcaaca accggugcaa caucuuuagc aacuucaucu ucaacggaau caacagcggc 1800
          accaccugca gcaacgaccu gcuguauagc aacaccgaga ucagcaccgg cgugugcgug 1860
          aacuacgacc uguacggcau caccggccag ggcaucuuca aggaggugag cgccgccuac 1920
          uacaacaacu ggcagaaccu gcuguacgac agcaacggca acaucaucgg cuucaaggac 1980
          uuucugacca acaagaccua caccauccug cccugcuaca gcggcggcgu gagcgugauc 2040
          accccaggca ccaacaccag caaccaggug gccgugcugu accaggacgu gaacugcacc 2100
          gaggugcccg uggccaucca cgccgaccag cugacaccca ccuggcgggu cuacagcacc 2160
          ggcagcaacg uguuccagac ccgggccggu ugccugaucg gcgccgagca cgugaacaac 2220
          agcuacgagu gcgacauccc caucggcgcc ggcaucugug ccagcuacca gacccagacc 2280
          aauucacccc ggagggcaag gagcguggcc agccagagca ucaucgccua caccaugagc 2340
          cugggcgccg agaacagcgu ggccuacagc aacaacagca ucgccauccc caccaacuuc 2400
          accaucagcg ugaccaccga gauucugccc gugagcauga ccaagaccag cguggacugc 2460
          accauguaca ucugcggcga cagcaccgag ugcagcaacc ugcugcugca guacggcagc 2520
          uucugcaccc agcugaaccg ggcccugacc ggcaucgccg uggagcagga caagaacacc 2580
          caggaggugu ucgcccaggu gaagcagauc uacaagaccc cucccaucaa ggacuucggc 2640
          ggcuucaacu ucagccagau ccugcccgac cccagcaagc ccagcaagcg gagcuucauc 2700
          gaggaccugc uguucaacaa ggugacccua gccgacgccg gcuucaucaa gcaguacggc 2760
          gacugccucg gcgacauagc cgcccgggac cugaucugcg cccagaaguu caacggccug 2820
          accgugcugc cuccccugcu gaccgacgag augaucgccc aguacaccag cgcccuguua 2880
          gccggaacca ucaccagcgg cuggacuuuc ggcgcuggag ccgcucugca gauccccuuc 2940
          gccaugcaga uggccuaccg guucaacggc aucggcguga cccagaacgu gcuguacgag 3000
          aaccagaagc ugaucgccaa ccaguucaac agcgccaucg gcaagaucca ggacagccug 3060
          agcagcaccg cuagcgcccu gggcaagcug caggacgugg ugaaccagaa cgcccaggcc 3120
          cugaacaccc uggugaagca gcugagcagc aacuucggcg ccaucagcag cgugcugaac 3180
          gacauccuga gccggcugga cccucccgag gccgaggugc agaucgaccg gcugaucacu 3240
          ggccggcugc agagccugca gaccuacgug acccagcagc ugauccgggc cgccgagauu 3300
          cgggccagcg ccaaccuggc cgccaccaag augagcgagu gcgugcuggg ccagagcaag 3360
          cgggguggacu ucugcggcaa gggcuaccac cugaugagcu uuccccagag cgcaccccac 3420
          ggaguggugu uccugcacgu gaccuacgug cccgcccagg agaagaacuu caccaccgcc 3480
          ccagccaucu gccacgacgg caaggcccac uuuccccggg agggcguguu cgugagcaac 3540
          ggcacccacu gguucgugac ccagcggaac uucuacgagc cccagaucau caccaccgac 3600
          aacaccuucg ugagcggcaa cugcgacgug gugaucggca ucgugaacaa caccguguac 3660
          gauccccugc agcccgagcu ggacagcuuc aaggaggagc uggacaagua cuucaagaau 3720
          cacaccagcc ccgacgugga ccugggcgac aucagcggca ucaacgccag cguggugaac 3780
          auccagaagg agaucgaucg gcugaacgag guggccaaga accugaacga gagccugauc 3840
          gaccugcagg agcugggcaa guacgagcag uacaucaagu ggcccuggua caucuggcug 3900
          ggcuucaucg ccggccugau cgccaucgug auggugacca ucaugcugug cugcaugacc 3960
          agcugcugca gcugccugaa gggcuguugc agcugcggca gcugcugcaa guucgacgag 4020
          gacgacagcg agcccgugcu gaagggcgug aagcugcacu acacc 4065
           <![CDATA[ <210> 38]]>
           <![CDATA[ <211> 1355]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 38]]>
          Met Phe Leu Ile Ile Phe Ile Leu Pro Thr Thr Leu Ala Val Ile Gly
          1 5 10 15
          Asp Phe Asn Cys Thr Asn Ser Phe Ile Asn Asp Tyr Asn Lys Thr Ile
                      20 25 30
          Pro Arg Ile Ser Glu Asp Val Val Asp Val Ser Leu Gly Leu Gly Thr
                  35 40 45
          Tyr Tyr Val Leu Asn Arg Val Tyr Leu Asn Thr Thr Leu Leu Phe Thr
              50 55 60
          Gly Tyr Phe Pro Lys Ser Gly Ala Asn Phe Arg Asp Leu Ala Leu Lys
          65 70 75 80
          Gly Ser Ile Tyr Leu Ser Thr Leu Trp Tyr Lys Pro Pro Phe Leu Ser
                          85 90 95
          Asp Phe Asn Asn Gly Ile Phe Ser Lys Val Lys Asn Thr Lys Leu Tyr
                      100 105 110
          Val Asn Asn Thr Leu Tyr Ser Glu Phe Ser Thr Ile Val Ile Gly Ser
                  115 120 125
          Val Phe Val Asn Thr Ser Tyr Thr Ile Val Val Gln Pro His Asn Gly
              130 135 140
          Ile Leu Glu Ile Thr Ala Cys Gln Tyr Thr Met Cys Glu Tyr Pro His
          145 150 155 160
          Thr Val Cys Lys Ser Lys Gly Ser Ile Arg Asn Glu Ser Trp His Ile
                          165 170 175
          Asp Ser Ser Glu Pro Leu Cys Leu Phe Lys Lys Asn Phe Thr Tyr Asn
                      180 185 190
          Val Ser Ala Asp Trp Leu Tyr Phe His Phe Tyr Gln Glu Arg Gly Val
                  195 200 205
          Phe Tyr Ala Tyr Tyr Ala Asp Val Gly Met Pro Thr Thr Phe Leu Phe
              210 215 220
          Ser Leu Tyr Leu Gly Thr Ile Leu Ser His Tyr Tyr Val Met Pro Leu
          225 230 235 240
          Thr Cys Asn Ala Ile Ser Ser Asn Thr Asp Asn Glu Thr Leu Glu Tyr
                          245 250 255
          Trp Val Thr Pro Leu Ser Arg Arg Gln Tyr Leu Leu Asn Phe Asp Glu
                      260 265 270
          His Gly Val Ile Thr Asn Ala Val Asp Cys Ala Leu Asp Pro Leu Ser
                  275 280 285
          Glu Thr Lys Cys Thr Leu Lys Ser Phe Thr Val Glu Lys Gly Ile Tyr
              290 295 300
          Gln Thr Ser Gly Phe Thr Val Lys Pro Val Ala Thr Val Tyr Arg Arg
          305 310 315 320
          Ile Pro Asn Leu Pro Asp Cys Asp Ile Asp Asn Trp Leu Asn Asn Val
                          325 330 335
          Ser Val Pro Ser Pro Leu Asn Trp Glu Arg Arg Ile Phe Ser Asn Cys
                      340 345 350
          Asn Phe Asn Leu Ser Thr Leu Leu Arg Leu Val His Val Asp Ser Phe
                  355 360 365
          Ser Cys Asn Asn Leu Asp Lys Ser Lys Ile Phe Gly Ser Cys Phe Asn
              370 375 380
          Ser Ile Thr Val Asp Lys Phe Ala Ile Pro Asn Arg Arg Arg Asp Asp
          385 390 395 400
          Leu Gln Leu Gly Ser Ser Gly Phe Leu Gln Ser Ser Asn Tyr Lys Ile
                          405 410 415
          Asp Ile Ser Ser Ser Ser Cys Gln Leu Tyr Tyr Ser Leu Pro Leu Val
                      420 425 430
          Asn Val Thr Ile Asn Asn Phe Asn Pro Ser Ser Trp Asn Arg Arg Tyr
                  435 440 445
          Gly Phe Gly Ser Phe Asn Leu Ser Ser Tyr Asp Val Val Tyr Ser Asp
              450 455 460
          His Cys Phe Ser Val Asn Ser Asp Phe Cys Pro Cys Ala Asp Pro Ser
          465 470 475 480
          Val Val Asn Ser Cys Ala Lys Ser Lys Pro Pro Ser Ala Ile Cys Pro
                          485 490 495
          Ala Gly Thr Lys Tyr Arg His Cys Asp Leu Asp Thr Thr Leu Tyr Val
                      500 505 510
          Lys Asn Trp Cys Arg Cys Ser Cys Leu Pro Asp Pro Ile Ser Thr Tyr
                  515 520 525
          Ser Pro Asn Thr Cys Pro Gln Lys Lys Val Val Val Gly Ile Gly Glu
              530 535 540
          His Cys Pro Gly Leu Gly Ile Asn Glu Glu Lys Cys Gly Thr Gln Leu
          545 550 555 560
          Asn His Ser Ser Cys Phe Cys Ser Pro Asp Ala Phe Leu Gly Trp Ser
                          565 570 575
          Phe Asp Ser Cys Ile Ser Asn Asn Arg Cys Asn Ile Phe Ser Asn Phe
                      580 585 590
          Ile Phe Asn Gly Ile Asn Ser Gly Thr Thr Cys Ser Asn Asp Leu Leu
                  595 600 605
          Tyr Ser Asn Thr Glu Ile Ser Thr Gly Val Cys Val Asn Tyr Asp Leu
              610 615 620
          Tyr Gly Ile Thr Gly Gln Gly Ile Phe Lys Glu Val Ser Ala Ala Tyr
          625 630 635 640
          Tyr Asn Asn Trp Gln Asn Leu Leu Tyr Asp Ser Asn Gly Asn Ile Ile
                          645 650 655
          Gly Phe Lys Asp Phe Leu Thr Asn Lys Thr Tyr Thr Ile Leu Pro Cys
                      660 665 670
          Tyr Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn
                  675 680 685
          Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val
              690 695 700
          Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr
          705 710 715 720
          Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu
                          725 730 735
          His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile
                      740 745 750
          Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser
                  755 760 765
          Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu
              770 775 780
          Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe
          785 790 795 800
          Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr
                          805 810 815
          Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser
                      820 825 830
          Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala
                  835 840 845
          Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe
              850 855 860
          Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly
          865 870 875 880
          Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys
                          885 890 895
          Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp
                      900 905 910
          Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala
                  915 920 925
          Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro
              930 935 940
          Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu
          945 950 955 960
          Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu
                          965 970 975
          Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly
                      980 985 990
          Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln
                  995 1000 1005
          Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr
              1010 1015 1020
          Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala
              1025 1030 1035
          Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly
              1040 1045 1050
          Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu Asp Pro
              1055 1060 1065
          Pro Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly Arg Leu
              1070 1075 1080
          Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg Ala Ala
              1085 1090 1095
          Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser Glu
              1100 1105 1110
          Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly
              1115 1120 1125
          Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val
              1130 1135 1140
          Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr
              1145 1150 1155
          Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg
              1160 1165 1170
          Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln
              1175 1180 1185
          Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe
              1190 1195 1200
          Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn Asn Thr
              1205 1210 1215
          Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys Glu Glu
              1220 1225 1230
          Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val Asp Leu
              1235 1240 1245
          Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile Gln Lys
              1250 1255 1260
          Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn Glu Ser
              1265 1270 1275
          Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr Ile Lys
              1280 1285 1290
          Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu Ile Ala
              1295 1300 1305
          Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr Ser Cys Cys
              1310 1315 1320
          Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys Lys Phe
              1325 1330 1335
          Asp Glu Asp Asp Ser Glu Pro Val Leu Lys Gly Val Lys Leu His
              1340 1345 1350
          Tyr Thr
              1355
           <![CDATA[ <210> 39]]>
           <![CDATA[ <211> 4247]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 39]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uuccugaucc ugcugaucag ccugcccacc gccuucgccg ugaucggcga ccugaagugc 120
          accagcgaca acaucaacga caaggacacc ggcccaccac ccaucagcac cgacaccgug 180
          gacgugacca acggccuggg caccuacuac gugcuggacc ggguguaccu gaacaccacc 240
          cuguuccuga acggcuacua ccccaccagc ggcagcaccu accggaauau ggcccugaag 300
          ggcagcgugc ugcugagccg gcugugguuc aagccaccau uccugagcga cuucaucaac 360
          ggaaucuucg ccaaggugaa gaacaccaag gugaucaagg accgggugau guacagcgag 420
          uuccccgcca ucaccauugg caguaccuuc gugaacacca gcuacagcgu gguggugcag 480
          ccccggacca ucaacagcac ccaggacggc gacaacaagc ugcagggccu gcuggaggug 540
          agcgugugcc aguacaacau gugcgaguac ccucagacca ucugccaccc caaccugggc 600
          aaccaccgga aggagcugug gcaccuggac accggcgugg ugagcugccu guacaagcgg 660
          aacuucaccu acgacguaaa cgccgacuac cuguacuucc acuucuacca ggagggcggc 720
          accuucuacg ccuacuucac cgacacgggc guggugacca aguuccuguu caacguguac 780
          cugggcaugg cccugagcca cuacuacgug augccccuga ccuguaacag caagcugacc 840
          cuggaguacu gggugacccc ucugaccagc cggcaguacc ugcuggccuu caaccaggac 900
          ggcaucaucu ucaacgccgu ggacugcgcc cuggacccuc ugagcgagac caagugcacc 960
          cugaagagcu ucaccgugga gaagggcauc uaccagacca acggcuacac cgugcagccc 1020
          aucgccgacg uguaccggcg gaagcccaac cugcccaacu gcaacaucga ggccuggcug 1080
          aacgacaaga gcgugcccuc gccccugaac ugggagcgga agaccuucag caacugcaac 1140
          uucaacauga gcagucugau gagcuucauc caggccgaca gcuucaccug caacaacauc 1200
          gacgccgcca agaucuacgg caugugcuuc agcagcauca ccaucgacaa guuugccauc 1260
          cccaacggcc ggaaggugga ccugcagcug ggcaaccugg gcuaccugca gagcuucaac 1320
          uaccggaucg acaccaccgc caccucuugc cagcuguacu acaaccugcc cgccgccaac 1380
          gugagcguga gccgguucaa ccccagcacc uggaacaagc gguucggcuu cauugaggac 1440
          agcguguuca aaccccggcc cgcaggagua cugaccaacc acgacguggu guacgcccag 1500
          cacugcuuca aggcacccaa gaacuucugc cccugcaagc ugaacggcag cugugugggc 1560
          ucuggccccg guaagaacaa cggcauaggg acuugcccgg cagggaccaa cuaccugacc 1620
          ugcgacaacc ugugcacacc cgaccccauc accuucaccg gcaccuacaa guguccccag 1680
          accaagagcc uggugggcau cggcgagcac ugcagcggcc uggccgugaa gagcgacuac 1740
          ugcggcggca acagcugcac cugucggccc caggccuucc ugggcuggag cgccgacagc 1800
          ugccugcagg gcgacaagug caacaucuuu gccaacuuca uccugcacga cgugaacagc 1860
          ggccugaccu gcagcaccga ccugcagaag gccaacaccg acaucauccu gggcgugugc 1920
          gugaacuacg acuuguacgg cauccugggc cagggcaucu ucguggaggu gaacgccacc 1980
          uacuacaaca gcuggcagaa ccugcuguac gacagcaacg gcaaccugua cggcuuccgg 2040
          gacuacauca ucaaccggac cuucaugauc cggagcugcu acagcggcgg cgugagcgug 2100
          aucaccccag gcaccaacac cagcaaccag guggccgugc uguaccagga cgugaacugc 2160
          accgaggugc ccguggccau ccacgccgac cagcugacac ccaccuggcg ggucuacagc 2220
          accggcagca acguguucca gacccgggcc gguugccuga ucggcgccga gcacgugaac 2280
          aacagcuacg agugcgacau ccccaucggc gccggcaucu gugccagcua ccagacccag 2340
          accaauucac cccggagggc aaggagcgug gccagccaga gcaucaucgc cuacaccaug 2400
          agccugggcg ccgagaacag cguggccuac agcaacaaca gcaucgccau ccccaccaac 2460
          uucaccauca gcgugaccac cgagauucug cccgugagca ugaccaagac cagcguggac 2520
          ugcaccaugu acaucugcgg cgacagcacc gagugcagca accugcugcu gcaguacggc 2580
          agcuucugca cccagcugaa ccgggcccug accggcaucg ccguggagca ggacaagaac 2640
          acccaggagg uguucgccca ggugaagcag aucuacaaga ccccucccau caaggacuuc 2700
          ggcggcuuca acuucagcca gauccugccc gaccccagca agcccagcaa gcggagcuuc 2760
          aucgaggacc ugcuguucaa caaggugacc cuagccgacg ccggcuucau caagcaguac 2820
          ggcgacugcc ucggcgacau agccgcccgg gaccugaucu gcgcccagaa guucaacggc 2880
          cugaccgugc ugccuccccu gcugaccgac gagaugaucg cccaguacac cagcgcccug 2940
          uuagccggaa ccaucaccag cggcuggacu uucggcgcug gagccgcucu gcagaucccc 3000
          uucgccaugc agauggccua ccgguucaac ggcaucggcg ugacccagaa cgugcuguac 3060
          gagaaccaga agcugaucgc caaccaguuc aacagcgcca ucggcaagau ccaggacagc 3120
          cugagcagca ccgcuagcgc ccugggcaag cugcaggacg uggugaacca gaacgcccag 3180
          gcccugaaca cccuggugaa gcagcugagc agcaacuucg gcgccaucag cagcgugcug 3240
          aacgacaucc ugagccggcu ggacccuccc gaggccgagg ugcagaucga ccggcugauc 3300
          acuggccggc ugcagagccu gcagaccuac gugacccagc agcugauccg ggccgccgag 3360
          auucgggcca gcgccaaccu ggccgccacc aagaugagcg agugcgugcu gggccagagc 3420
          aagcgggugg acuucugcgg caagggcuac caccugauga gcuuucccca gagcgcaccc 3480
          cacggagugg uguuccugca cgugaccuac gugcccgccc aggagaagaa cuucaccacc 3540
          gccccagcca ucugccacga cggcaaggcc cacuuucccc gggagggcgu guucgugagc 3600
          aacggcaccc acugguucgu gacccagcgg aacuucuacg agccccagau caucaccacc 3660
          gacaacaccu ucgugagcgg caacugcgac guggugaucg gcaucgugaa caacaccgug 3720
          uacgaucccc ugcagcccga gcuggacagc uucaaggagg agcuggacaa guacuucaag 3780
          aaucacacca gccccgacgu ggaccugggc gacaucagcg gcaucaacgc cagcguggug 3840
          aacauccaga aggagaucga ucggcugaac gagguggcca agaaccugaa cgagagccug 3900
          aucgaccugc aggagcuggg caaguacgag caguacauca aguggcccug guacaucugg 3960
          cugggcuuca ucgccggccu gaucgccauc gugaugguga ccaucaugcu gugcugcaug 4020
          accagcugcu gcagcugccu gaagggcugu ugcagcugcg gcagcugcug caaguucgac 4080
          gaggacgaca gcgagcccgu gcugaagggc gugaagcugc acuacaccug auaauaggcu 4140
          ggagccucgg uggccuagcu ucuugccccu ugggccuccc cccagccccu ccuccccuuc 4200
          cugcacccgu acccccgugg ucuuugaaua aagucugagu gggcggc 4247
           <![CDATA[ <210> 40]]>
           <![CDATA[ <211> 4071]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 40]]>
          auguuccuga uccugcugau cagccugccc accgccuucg ccgugaucgg cgaccugaag 60
          ugcaccagcg acaacaucaa cgacaaggac accggcccac cacccaucag caccgacacc 120
          guggacguga ccaacggccu gggcaccuac uacgugcugg accgggugua ccugaacacc 180
          acccuguucc ugaacggcua cuaccccacc agcggcagca ccuaccggaa uauggcccug 240
          aagggcagcg ugcugcugag ccggcugugg uucaagccac cauuccugag cgacuucauc 300
          aacggaaucu ucgccaaggu gaagaacacc aagugauca aggaccgggu gauguacagc 360
          gaguuccccg ccaucaccau uggcaguacc uucgugaaca ccagcuacag cgugguggug 420
          cagccccgga ccaucaacag cacccaggac ggcgacaaca agcugcaggg ccugcuggag 480
          gugagcgugu gccaguacaa caugugcgag uacccucaga ccaucugcca ccccaaccug 540
          ggcaaccacc ggaaggagcu guggcaccug gacaccggcg uggugagcug ccuguacaag 600
          cggaacuuca ccuacgacgu aaacgccgac uaccuguacu uccacuucua ccaggagggc 660
          ggcaccuucu acgccuacuu caccgacacg ggcgugguga ccaaguuccu guucaacgug 720
          uaccugggca uggcccugag ccacuacuac gugaugcccc ugaccuguaa cagcaagcug 780
          acccuggagu acugggugac cccucugacc agccggcagu accugcuggc cuucaaccag 840
          gacggcauca ucuucaacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 900
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccaacggcua caccgugcag 960
          cccaucgccg acguguaccg gcggaagccc aaccugccca acugcaacau cgaggccugg 1020
          cugaacgaca agagcgugcc cucgccccug aacugggagc ggaagaccuu cagcaacugc 1080
          aacuucaaca ugagcagucu gaugagcuuc auccaggccg acagcuucac cugcaacaac 1140
          aucgacgccg ccaagaucua cggcaugugc uucagcagca ucaccaucga caaguuugcc 1200
          auccccaacg gccggaaggu ggaccugcag cugggcaacc ugggcuaccu gcagagcuuc 1260
          aacuaccgga ucgacaccac cgccaccucu ugccagcugu acuacaaccu gcccgccgcc 1320
          aacgugagcg ugagccgguu caaccccagc accuggaaca agcgguucgg cuucauugag 1380
          gacagcgugu ucaaaccccg gcccgcagga guacugacca accacgacgu gguguacgcc 1440
          cagcacugcu ucaaggcacc caagaacuuc ugccccugca agcugaacgg cagcugugug 1500
          ggcucuggcc ccgguaagaa caacggcaua gggacuugcc cggcagggac caacuaccug 1560
          accugcgaca accugugcac acccgacccc aucaccuuca ccggcaccua caaguguccc 1620
          cagaccaaga gccugguggg caucggcgag cacugcagcg gccuggccgu gaagagcgac 1680
          uacugcggcg gcaacagcug caccugucgg ccccaggccu uccugggcug gagcgccgac 1740
          agcugccugc agggcgacaa gugcaacauc uuugccaacu ucauccugca cgacgugaac 1800
          agcggccuga ccugcagcac cgaccugcag aaggccaaca ccgacaucau ccugggcgug 1860
          ugcgugaacu acgacuugua cggcauccug ggccagggca ucuucgugga ggugaacgcc 1920
          accuacuaca acagcuggca gaaccugcug uacgacagca acggcaaccu guacggcuuc 1980
          cgggacuaca ucaucaaccg gaccuucaug auccggagcu gcuacagcgg cggcgugagc 2040
          gugaucaccc caggcaccaa caccagcaac cagguggccg ugcuguacca ggacgugaac 2100
          ugcaccgagg ugcccguggc cauccacgcc gaccagcuga cacccaccug gcgggucuac 2160
          agcaccggca gcaacguguu ccagacccgg gccgguugcc ugaucggcgc cgagcacgug 2220
          aacaacagcu acgagugcga cauccccauc ggcgccggca ucugugccag cuaccagacc 2280
          cagaccaauu caccccggag ggcaaggagc guggccagcc agagcaucau cgccuacacc 2340
          augagccugg gcgccgagaa cagcguggcc uacagcaaca acagcaucgc cauccccacc 2400
          aacuucacca ucagcgugac caccgagauu cugcccgug gcaugaccaa gaccagcgug 2460
          gacugcacca uguacaucug cggcgacagc accgagugca gcaaccugcu gcugcaguac 2520
          ggcagcuucu gcacccagcu gaaccgggcc cugaccggca ucgccgugga gcaggacaag 2580
          aacacccagg agguguucgc ccaggugaag cagaucuaca agaccccucc caucaaggac 2640
          uucggcggcu ucaacuucag ccagauccug cccgacccca gcaagcccag caagcggagc 2700
          uucaucgagg accugcuguu caacaaggug acccuagccg acgccggcuu caucaagcag 2760
          uacggcgacu gccucggcga cauagccgcc cgggaccuga ucugcgccca gaaguucaac 2820
          ggccugaccg ugcugccucc ccugcugacc gacgagauga ucgcccagua caccagcgcc 2880
          cuguuagccg gaaccaucac cagcggcugg acuuucggcg cuggagccgc ucugcagauc 2940
          cccuucgcca ugcagauggc cuaccgguuc aacggcaucg gcgugaccca gaacgugcug 3000
          uacgagaacc agaagcugau cgccaaccag uucaacagcg ccaucggcaa gauccaggac 3060
          agccugagca gcaccgcuag cgcccugggc aagcugcagg acguggugaa ccagaacgcc 3120
          caggcccuga acacccuggu gaagcagcug agcagcaacu ucggcgccau cagcagcgug 3180
          cugaacgaca uccugagccg gcuggacccu cccgaggccg aggugcagau cgaccggcug 3240
          aucacuggcc ggcugcagag ccugcagacc uacgugaccc agcagcugau ccgggccgcc 3300
          gagauucggg ccagcgccaa ccuggccgcc accaagauga gcgagugcgu gcugggccag 3360
          agcaagcggg uggacuucug cggcaagggc uaccaccuga ugagcuuucc ccagagcgca 3420
          ccccacggag ugguguuccu gcacgugacc uacgugcccg cccaggagaa gaacuucacc 3480
          accgccccag ccaucugcca cgacggcaag gcccacuuuc cccgggaggg cguguucgug 3540
          agcaacggca cccacugguu cgugacccag cggaacuucu acgagcccca gaucaucacc 3600
          accgacaaca ccuucgugag cggcaacugc gacgugguga ucggcaucgu gaacaacacc 3660
          guguacgauc cccugcagcc cgagcuggac agcuucaagg aggagcugga caaguacuuc 3720
          aagaaucaca ccagccccga cguggaccug ggcgacauca gcggcaucaa cgccagcgug 3780
          gugaacaucc agaaggagau cgaucggcug aacgaggugg ccaagaaccu gaacgagagc 3840
          cugaucgacc ugcaggagcu gggcaaguac gagcaguaca ucaaguggcc cugguacauc 3900
          uggcugggcu ucaucgccgg ccugaucgcc aucgugaugg ugaccaucau gcugugcugc 3960
          augaccagcu gcugcagcug ccugaagggc uguugcagcu gcggcagcug cugcaaguuc 4020
          gacgaggacg acagcgagcc cgugcugaag ggcgugaagc ugcacuacac c 4071
           <![CDATA[ <210> 41]]>
           <![CDATA[ <211> 1357]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 41]]>
          Met Phe Leu Ile Leu Leu Ile Ser Leu Pro Thr Ala Phe Ala Val Ile
          1 5 10 15
          Gly Asp Leu Lys Cys Thr Ser Asp Asn Ile Asn Asp Lys Asp Thr Gly
                      20 25 30
          Pro Pro Pro Ile Ser Thr Asp Thr Val Asp Val Thr Asn Gly Leu Gly
                  35 40 45
          Thr Tyr Tyr Val Leu Asp Arg Val Tyr Leu Asn Thr Thr Leu Phe Leu
              50 55 60
          Asn Gly Tyr Tyr Pro Thr Ser Gly Ser Thr Tyr Arg Asn Met Ala Leu
          65 70 75 80
          Lys Gly Ser Val Leu Leu Ser Arg Leu Trp Phe Lys Pro Pro Phe Leu
                          85 90 95
          Ser Asp Phe Ile Asn Gly Ile Phe Ala Lys Val Lys Asn Thr Lys Val
                      100 105 110
          Ile Lys Asp Arg Val Met Tyr Ser Glu Phe Pro Ala Ile Thr Ile Gly
                  115 120 125
          Ser Thr Phe Val Asn Thr Ser Tyr Ser Val Val Val Gln Pro Arg Thr
              130 135 140
          Ile Asn Ser Thr Gln Asp Gly Asp Asn Lys Leu Gln Gly Leu Leu Glu
          145 150 155 160
          Val Ser Val Cys Gln Tyr Asn Met Cys Glu Tyr Pro Gln Thr Ile Cys
                          165 170 175
          His Pro Asn Leu Gly Asn His Arg Lys Glu Leu Trp His Leu Asp Thr
                      180 185 190
          Gly Val Val Ser Cys Leu Tyr Lys Arg Asn Phe Thr Tyr Asp Val Asn
                  195 200 205
          Ala Asp Tyr Leu Tyr Phe His Phe Tyr Gln Glu Gly Gly Thr Phe Tyr
              210 215 220
          Ala Tyr Phe Thr Asp Thr Gly Val Val Thr Lys Phe Leu Phe Asn Val
          225 230 235 240
          Tyr Leu Gly Met Ala Leu Ser His Tyr Tyr Val Met Pro Leu Thr Cys
                          245 250 255
          Asn Ser Lys Leu Thr Leu Glu Tyr Trp Val Thr Pro Leu Thr Ser Arg
                      260 265 270
          Gln Tyr Leu Leu Ala Phe Asn Gln Asp Gly Ile Ile Phe Asn Ala Val
                  275 280 285
          Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser
              290 295 300
          Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Asn Gly Tyr Thr Val Gln
          305 310 315 320
          Pro Ile Ala Asp Val Tyr Arg Arg Lys Pro Asn Leu Pro Asn Cys Asn
                          325 330 335
          Ile Glu Ala Trp Leu Asn Asp Lys Ser Val Pro Ser Pro Leu Asn Trp
                      340 345 350
          Glu Arg Lys Thr Phe Ser Asn Cys Asn Phe Asn Met Ser Ser Leu Met
                  355 360 365
          Ser Phe Ile Gln Ala Asp Ser Phe Thr Cys Asn Asn Ile Asp Ala Ala
              370 375 380
          Lys Ile Tyr Gly Met Cys Phe Ser Ser Ile Thr Ile Asp Lys Phe Ala
          385 390 395 400
          Ile Pro Asn Gly Arg Lys Val Asp Leu Gln Leu Gly Asn Leu Gly Tyr
                          405 410 415
          Leu Gln Ser Phe Asn Tyr Arg Ile Asp Thr Thr Ala Thr Ser Cys Gln
                      420 425 430
          Leu Tyr Tyr Asn Leu Pro Ala Ala Asn Val Ser Val Ser Arg Phe Asn
                  435 440 445
          Pro Ser Thr Trp Asn Lys Arg Phe Gly Phe Ile Glu Asp Ser Val Phe
              450 455 460
          Lys Pro Arg Pro Ala Gly Val Leu Thr Asn His Asp Val Val Tyr Ala
          465 470 475 480
          Gln His Cys Phe Lys Ala Pro Lys Asn Phe Cys Pro Cys Lys Leu Asn
                          485 490 495
          Gly Ser Cys Val Gly Ser Gly Pro Gly Lys Asn Asn Gly Ile Gly Thr
                      500 505 510
          Cys Pro Ala Gly Thr Asn Tyr Leu Thr Cys Asp Asn Leu Cys Thr Pro
                  515 520 525
          Asp Pro Ile Thr Phe Thr Gly Thr Tyr Lys Cys Pro Gln Thr Lys Ser
              530 535 540
          Leu Val Gly Ile Gly Glu His Cys Ser Gly Leu Ala Val Lys Ser Asp
          545 550 555 560
          Tyr Cys Gly Gly Asn Ser Cys Thr Cys Arg Pro Gln Ala Phe Leu Gly
                          565 570 575
          Trp Ser Ala Asp Ser Cys Leu Gln Gly Asp Lys Cys Asn Ile Phe Ala
                      580 585 590
          Asn Phe Ile Leu His Asp Val Asn Ser Gly Leu Thr Cys Ser Thr Asp
                  595 600 605
          Leu Gln Lys Ala Asn Thr Asp Ile Ile Leu Gly Val Cys Val Asn Tyr
              610 615 620
          Asp Leu Tyr Gly Ile Leu Gly Gln Gly Ile Phe Val Glu Val Asn Ala
          625 630 635 640
          Thr Tyr Tyr Asn Ser Trp Gln Asn Leu Leu Tyr Asp Ser Asn Gly Asn
                          645 650 655
          Leu Tyr Gly Phe Arg Asp Tyr Ile Ile Asn Arg Thr Phe Met Ile Arg
                      660 665 670
          Ser Cys Tyr Ser Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr
                  675 680 685
          Ser Asn Gln Val Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val
              690 695 700
          Pro Val Ala Ile His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr
          705 710 715 720
          Ser Thr Gly Ser Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly
                          725 730 735
          Ala Glu His Val Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala
                      740 745 750
          Gly Ile Cys Ala Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala
                  755 760 765
          Arg Ser Val Ala Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly
              770 775 780
          Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr
          785 790 795 800
          Asn Phe Thr Ile Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr
                          805 810 815
          Lys Thr Ser Val Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu
                      820 825 830
          Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn
                  835 840 845
          Arg Ala Leu Thr Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu
              850 855 860
          Val Phe Ala Gln Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp
          865 870 875 880
          Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro
                          885 890 895
          Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu
                      900 905 910
          Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile
                  915 920 925
          Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val
              930 935 940
          Leu Pro Pro Leu Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala
          945 950 955 960
          Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala
                          965 970 975
          Ala Leu Gln Ile Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly
                      980 985 990
          Ile Gly Val Thr Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala
                  995 1000 1005
          Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser
              1010 1015 1020
          Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp Val Val Asn Gln
              1025 1030 1035
          Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu Ser Ser Asn
              1040 1045 1050
          Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser Arg Leu
              1055 1060 1065
          Asp Pro Pro Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr Gly
              1070 1075 1080
          Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg
              1085 1090 1095
          Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met
              1100 1105 1110
          Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly
              1115 1120 1125
          Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly
              1130 1135 1140
          Val Val Phe Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn
              1145 1150 1155
          Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly Lys Ala His Phe
              1160 1165 1170
          Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr His Trp Phe Val
              1175 1180 1185
          Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile Thr Thr Asp Asn
              1190 1195 1200
          Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile Gly Ile Val Asn
              1205 1210 1215
          Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu Asp Ser Phe Lys
              1220 1225 1230
          Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr Ser Pro Asp Val
              1235 1240 1245
          Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser Val Val Asn Ile
              1250 1255 1260
          Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala Lys Asn Leu Asn
              1265 1270 1275
          Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr Glu Gln Tyr
              1280 1285 1290
          Ile Lys Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala Gly Leu
              1295 1300 1305
          Ile Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr Ser
              1310 1315 1320
          Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys
              1325 1330 1335
          Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys Gly Val Lys
              1340 1345 1350
          Leu His Tyr Thr
              1355
           <![CDATA[ <210> 42]]>
           <![CDATA[ <211> 1157]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 42]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacucu ggcggaggcg gcagcgccau cggcggcuac 960
          auccccgagg ccccuagaga cggccaggcc uacgugcgga aggacggcga gugggugcug 1020
          cugagcaccu uccugggcug auaauaggcu ggagccucgg uggccuagcu ucuugccccu 1080
          ugggccuccc cccagccccu ccuccccuuc cugcacccgu acccccgugg ucuuugaaua 1140
          aagucugagu gggcggc 1157
           <![CDATA[ <210> 43]]>
           <![CDATA[ <211> 981]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 43]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcggcagcgc caucggcggc 900
          uacauccccg aggccccuag agacggccag gccuacgugc ggaaggacgg cgagugggug 960
          cugcugagca ccuuccuggg c 981
           <![CDATA[ <210> 44]]>
           <![CDATA[ <211> 327]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 44]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu
              290 295 300
          Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val
          305 310 315 320
          Leu Leu Ser Thr Phe Leu Gly
                          325
           <![CDATA[ <210> 45]]>
           <![CDATA[ <211> 1130]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 45]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuucuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc 960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cugauaauag 1020
          gcuggagccu cgguggccua gcuucuugcc ccuugggccu ccccccagcc ccuccucccc 1080
          uuccugcacc cguacccccg uggucuuuga auaaagucug agugggcggc 1130
           <![CDATA[ <210> 46]]>
           <![CDATA[ <211> 954]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 46]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac 900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuuc 954
           <![CDATA[ <210> 47]]>
           <![CDATA[ <211> 318]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 47]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala
              290 295 300
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
          305 310 315
           <![CDATA[ <210> 48]]>
           <![CDATA[ <211> 1238]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 48]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc 960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cggcggaggc 1020
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag 1080
          gacggcgagu gggugcugcu gagcaccuuc cugggcaagu gauaauaggc uggagccucg 1140
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg 1200
          uacccccgug gucuuugaau aaagucugag ugggcggc 1238
           <![CDATA[ <210> 49]]>
           <![CDATA[ <211> 1062]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 49]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac 900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucggcgga 960
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg 1020
          aaggacggcg agugggugcu gcugagcacc uuccugggca ag 1062
           <![CDATA[ <210> 50]]>
           <![CDATA[ <211> 354]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 50]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala
              290 295 300
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Gly Gly
          305 310 315 320
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln
                          325 330 335
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu
                      340 345 350
          Gly Lys
           <![CDATA[ <210> 51]]>
           <![CDATA[ <211> 1130]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 51]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacucu ggcggaggca gcauccuggc caucuacagc 960
          accguggcca gcagccuggu gcugcuggug agccugggcg ccaucagcuu cugauaauag 1020
          gcuggagccu cgguggccua gcuucuugcc ccuugggccu ccccccagcc ccuccucccc 1080
          uuccugcacc cguacccccg uggucuuuga auaaagucug agugggcggc 1130
           <![CDATA[ <210> 52]]>
           <![CDATA[ <211> 954]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 52]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ucuggcggag gcagcauccu ggccaucuac 900
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuuc 954
           <![CDATA[ <210> 53]]>
           <![CDATA[ <211> 318]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 53]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala
              290 295 300
          Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
          305 310 315
           <![CDATA[ <210> 54]]>
           <![CDATA[ <211> 1181]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 54]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          gugugccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccuc uggcggaggc agcauccugg ccaucuacag caccguggcc 1020
          agcagccugg ugcugcuggu gagccugggc gccaucagcu ucugauaaua ggcuggagcc 1080
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac 1140
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c 1181
           <![CDATA[ <210> 55]]>
           <![CDATA[ <211> 1005]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 55]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aaggugugcc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cucuggcgga ggcagcaucc uggccaucua cagcaccgug 960
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuuc 1005
           <![CDATA[ <210> 56]]>
           <![CDATA[ <211> 335]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 56]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Cys Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val
          305 310 315 320
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
                          325 330 335
           <![CDATA[ <210> 57]]>
           <![CDATA[ <211> 1181]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 57]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccuc uggcggaggc agcauccugg ccaucuacag caccguggcc 1020
          agcagccugg ugcugcuggu gagccugggc gccaucagcu ucugauaaua ggcuggagcc 1080
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac 1140
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c 1181
           <![CDATA[ <210> 58]]>
           <![CDATA[ <211> 1005]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 58]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cucuggcgga ggcagcaucc uggccaucua cagcaccgug 960
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuuc 1005
           <![CDATA[ <210> 59]]>
           <![CDATA[ <211> 335]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 59]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val
          305 310 315 320
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
                          325 330 335
           <![CDATA[ <210> 60]]>
           <![CDATA[ <211> 836]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 60]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaguga 720
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccuccccc ccagccccuc 780
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc 836
           <![CDATA[ <210> 61]]>
           <![CDATA[ <211> 660]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 61]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
           <![CDATA[ <210> 62]]>
           <![CDATA[ <211> 220]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 62]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
              210 215 220
           <![CDATA[ <210> 63]]>
           <![CDATA[ <211> 1349]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 63]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaggga 720
          ggaggcagcg gcggcgauau caucaagcuu cugaacgagc aaguuaacaa ggaaaugcag 780
          agcaguaauc ucuacaugag caugagcagc uggugcuaca cccacucccu ggacggagca 840
          ggccucuucc uguucgacca cgcagccgag gaguacgagc acgcuaagaa guugaucauu 900
          uucuugaacg agaacaacgu gcccgugcag cuaacgucaa ucagcgcacc ugagcacaag 960
          uucgagggcc ugacccagau cuuccagaag gccuacgaac acgaacagca caucuccgag 1020
          agcaucaaca auauugugga ucacgcuauc aaguccaagg accacgcuac cuucaacuuc 1080
          cugcaguggu acguggccga gcaacaugag gaggaggugc uguucaagga cauccuggac 1140
          aagaucgagc ugaucgguaa ugagaaucac ggccuguacc uggccgacca guacgugaag 1200
          ggcaucgcca agagccggaa gucaggcuca ugauaauagg cuggagccuc gguggccuag 1260
          cuucuugccc cuugggccuc cccccagccc cuccuccccu uccugcaccc guacccccgu 1320
          ggucuuugaa uaaagucuga gugggcggc 1349
           <![CDATA[ <210> 64]]>
           <![CDATA[ <211> 1173]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 64]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ggaggaggca gcggcggcga uaucaucaag cuucugaacg agcaaguuaa caaggaaaug 720
          cagagcagua aucucuacau gagcaugagc agcuggugcu acacccacuc ccuggacgga 780
          gcaggccucu uccuguucga ccacgcagcc gaggaguacg agcacgcuaa gaaguugauc 840
          auuuucuuga acgagaacaa cgugcccgug cagcuaacgu caaucagcgc accugagcac 900
          aaguucgagg gccugaccca gaucuuccag aaggccuacg aacacgaaca gcacaucucc 960
          gagagcauca acaauauugu ggaucacgcu aucaagucca aggaccacgc uaccuucaac 1020
          uuccugcagu gguacguggc cgagcaacau gaggaggagg ugcuguucaa ggacauccug 1080
          gacaagaucg agcugaucgg uaaugagaau cacggccugu accuggccga ccaguacgug 1140
          aagggcaucg ccaagagccg gaaguccaggc uca 1173
           <![CDATA[ <210> 65]]>
           <![CDATA[ <211> 391]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 65]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Gly Gly Gly Ser
              210 215 220
          Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met
          225 230 235 240
          Gln Ser Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His
                          245 250 255
          Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu
                      260 265 270
          Tyr Glu His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val
                  275 280 285
          Pro Val Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly
              290 295 300
          Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser
          305 310 315 320
          Glu Ser Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His
                          325 330 335
          Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu
                      340 345 350
          Glu Val Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn
                  355 360 365
          Glu Asn His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala
              370 375 380
          Lys Ser Arg Lys Ser Gly Ser
          385 390
           <![CDATA[ <210> 66]]>
           <![CDATA[ <211> 1376]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 66]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          ggcauccugc ccagcccugg caugcccgcu cugcugagcc uggugagccu gcugagcgug 120
          cugcugaugg gcugcguggc ugagaccggc augcagaucu acgagggcaa gcugaccgca 180
          gagggccugc gguucggcau cguggccagc cgcgccaacc acgcucuggu ggaccggcuu 240
          guggagggcg cuaucgacgc caucgugaga cacggcggcc gggaagagga caucacccug 300
          gugcgggugu gcggcagcug ggagauuccc gucgccgccg gagaacuggc ccggaaggag 360
          gacaucgacg ccgugaucgc caucggcgug cugugcagag gcgccacgcc cagcuucgac 420
          uacaucgcca gcgaggugag caagggccug gccgaccuga gccuggagcu gcggaagccc 480
          aucaccuucg gcgugaucac cgccgacacc cuggagcagg ccaucgaggc cgcaggcacc 540
          ugccacggca acaagggcug ggaagccgcc cugugcgcca ucgagauggc caaccuguuc 600
          aagagccugc ggggcggaag uggaggcucu gguggcagcg gaggaucugg cggcggccag 660
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 720
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 780
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 840
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 900
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 960
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 1020
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 1080
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 1140
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 1200
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaguga 1260
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc 1320
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc 1376
           <![CDATA[ <210> 67]]>
           <![CDATA[ <211> 1200]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 67]]>
          augggcaucc ugcccagccc uggcaugccc gcucugcuga gccuggugag ccugcugagc 60
          gugcugcuga ugggcugcgu ggcugagacc ggcaugcaga ucuacgaggg caagcugacc 120
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg 180
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc 240
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag 300
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc 360
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag 420
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc 480
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug 540
          uucaagagcc ugcggggcgg aaguggaggc ucugguggca gcggaggauc uggcggcggc 600
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 660
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 720
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 780
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 840
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 900
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 960
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 1020
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 1080
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 1140
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 1200
           <![CDATA[ <210> 68]]>
           <![CDATA[ <211> 400]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 68]]>
          Met Gly Ile Leu Pro Ser Pro Gly Met Pro Ala Leu Leu Ser Leu Val
          1 5 10 15
          Ser Leu Leu Ser Val Leu Leu Met Gly Cys Val Ala Glu Thr Gly Met
                      20 25 30
          Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly Ile
                  35 40 45
          Val Ala Ser Arg Ala Asn His Ala Leu Val Asp Arg Leu Val Glu Gly
              50 55 60
          Ala Ile Asp Ala Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile Thr
          65 70 75 80
          Leu Val Arg Val Cys Gly Ser Trp Glu Ile Pro Val Ala Ala Gly Glu
                          85 90 95
          Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val Leu
                      100 105 110
          Cys Arg Gly Ala Thr Pro Ser Phe Asp Tyr Ile Ala Ser Glu Val Ser
                  115 120 125
          Lys Gly Leu Ala Asp Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr Phe
              130 135 140
          Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Ala Ala Gly
          145 150 155 160
          Thr Cys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Cys Ala Ile Glu
                          165 170 175
          Met Ala Asn Leu Phe Lys Ser Leu Arg Gly Gly Ser Gly Gly Ser Gly
                      180 185 190
          Gly Ser Gly Gly Ser Gly Gly Gly Gln Pro Asn Ile Thr Asn Leu Cys
                  195 200 205
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
              210 215 220
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
          225 230 235 240
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                          245 250 255
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                      260 265 270
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                  275 280 285
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
              290 295 300
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
          305 310 315 320
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                          325 330 335
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                      340 345 350
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                  355 360 365
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
              370 375 380
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
          385 390 395 400
           <![CDATA[ <210> 69]]>
           <![CDATA[ <211> 1334]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 69]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaaggga 720
          ggaggcuccg gaggcgguag cgcugagacc ggcaugcaga ucuacgaggg caagcugacc 780
          gcagagggcc ugcgguucgg caucguggcc agccgcgcca accacgcucu gguggaccgg 840
          cuuguggagg gcgcuaucga cgccaucgug agacacggcg gccgggaaga ggacaucacc 900
          cuggugcggg ugugcggcag cugggagauu cccgucgccg ccggagaacu ggcccggaag 960
          gaggacaucg acgccgugau cgccaucggc gugcugugca gaggcgccac gcccagcuuc 1020
          gacuacaucg ccagcgaggu gagcaagggc cuggccgacc ugagccugga gcugcggaag 1080
          cccaucaccu ucggcgugau caccgccgac acccuggagc aggccaucga ggccgcaggc 1140
          accugccacg gcaacaaggg cugggaagcc gcccugugcg ccaucgagau ggccaaccug 1200
          uucaagagcc ugcggugaua auaggcugga gccucggugg ccuagcuucu ugccccuugg 1260
          gccuccicccc agccccuccu ccccuuccug cacccguacc cccguggucu uugaauaaag 1320
          ucugaguggg cggc 1334
           <![CDATA[ <210> 70]]>
           <![CDATA[ <211> 1158]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 70]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ggaggaggcu ccggaggcgg uagcgcugag accggcaugc agaucuacga gggcaagcug 720
          accgcagagg gccugcgguu cggcaucgug gccagccgcg ccaaccacgc ucugguggac 780
          cggcuugugg agggcgcuau cgacgccauc gugagacacg gcggccggga agaggacauc 840
          acccuggugc gggugugcgg cagcugggag auucccgucg ccgccggaga acuggcccgg 900
          aaggaggaca ucgacgccgu gaucgccauc ggcgugcugu gcagaggcgc cacgcccagc 960
          uucgacuaca ucgccagcga ggugagcaag ggccuggccg accugagccu ggagcugcgg 1020
          aagcccauca ccuucggcgu gaucaccgcc gacacccugg agcaggccau cgaggccgca 1080
          ggcaccugcc acggcaacaa gggcugggaa gccgcccugu gcgccaucga gauggccaac 1140
          cuguucaaga gccugcgg 1158
           <![CDATA[ <210> 71]]>
           <![CDATA[ <211> 386]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 71]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Gly Gly Gly Ser
              210 215 220
          Gly Gly Gly Ser Ala Glu Thr Gly Met Gln Ile Tyr Glu Gly Lys Leu
          225 230 235 240
          Thr Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Ala Asn His
                          245 250 255
          Ala Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Ala Ile Val Arg
                      260 265 270
          His Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Cys Gly Ser
                  275 280 285
          Trp Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile
              290 295 300
          Asp Ala Val Ile Ala Ile Gly Val Leu Cys Arg Gly Ala Thr Pro Ser
          305 310 315 320
          Phe Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asp Leu Ser
                          325 330 335
          Leu Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr
                      340 345 350
          Leu Glu Gln Ala Ile Glu Ala Ala Gly Thr Cys His Gly Asn Lys Gly
                  355 360 365
          Trp Glu Ala Ala Leu Cys Ala Ile Glu Met Ala Asn Leu Phe Lys Ser
              370 375 380
          Leu Arg
          385
           <![CDATA[ <210> 72]]>
           <![CDATA[ <211> 947]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 72]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu 720
          ggcggaggcg gcagcgccau cggcggcuac auccccgagg ccccuagaga cggccaggcc 780
          uacgugcgga aggacggcga gugggugcug cugagcaccu uccugggcug auaauaggcu 840
          ggagccucgg uggccuagcu ucuugccccu ugggccuccc cccagccccu ccuccccuuc 900
          cugcacccgu acccccgugg ucuuugaaua aagucugagu gggcggc 947
           <![CDATA[ <210> 73]]>
           <![CDATA[ <211> 771]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 73]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ucuggcggag gcggcagcgc caucggcggc uacauccccg aggccccuag agacggccag 720
          gccuacgugc ggaaggacgg cgagugggug cugcugagca ccuuccuggg c 771
           <![CDATA[ <210> 74]]>
           <![CDATA[ <211> 257]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 74]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly
              210 215 220
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln
          225 230 235 240
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu
                          245 250 255
          Gly
           <![CDATA[ <210> 75]]>
           <![CDATA[ <211> 920]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 75]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu 720
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug 780
          agccugggcg ccaucagcuu cugauaauag gcuggagccu cgguggccua gcuucuugcc 840
          ccuugggccu ccccccagcc ccuccucccc uuccugcacc cguacccccg uggucuuuga 900
          auaaagucug agugggcggc 920
           <![CDATA[ <210> 76]]>
           <![CDATA[ <211> 744]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 76]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug 720
          gugagccugg gcgccaucag cuuc 744
           <![CDATA[ <210> 77]]>
           <![CDATA[ <211> 248]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 77]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly
              210 215 220
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu
          225 230 235 240
          Val Ser Leu Gly Ala Ile Ser Phe
                          245
           <![CDATA[ <210> 78]]>
           <![CDATA[ <211> 1025]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 78]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu 720
          ggcggaggcg gcagcgccau cggcggcuac auccccgagg ccccuagaga cggccaggcc 780
          uacgugcgga aggacggcga gugggugcug cugagcaccu uccugggcgg aggcagcauc 840
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc 900
          agcuucugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc 960
          cagccccuccc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg 1020
          gcggc 1025
           <![CDATA[ <210> 79]]>
           <![CDATA[ <211> 849]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 79]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ucuggcggag gcggcagcgc caucggcggc uacauccccg aggccccuag agacggccag 720
          gccuacgugc ggaaggacgg cgagugggug cugcugagca ccuuccuggg cggaggcagc 780
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc 840
          aucagcuuc 849
           <![CDATA[ <210> 80]]>
           <![CDATA[ <211> 283]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 80]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly
              210 215 220
          Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln
          225 230 235 240
          Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu
                          245 250 255
          Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu
                      260 265 270
          Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
                  275 280
           <![CDATA[ <210> 81]]>
           <![CDATA[ <211> 1028]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 81]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu 720
          ggcggaggca gcauccuggc caucuacagc accguggcca gcagccuggu gcugcuggug 780
          agccugggcg ccaucagcuu cggcggaggc agcgccaucg gcggcuacau ccccgaggcc 840
          ccuagagacg gccaggccua cgugcggaag gacggcgagu gggugcugcu gagcaccuuc 900
          cugggcaagu gauaauaggc uggagccucg guggccuagc uucuugcccc uugggccucc 960
          ccccagcccc uccuccccuu ccugcacccg uacccccgug gucuuugaau aaagucugag 1020
          ugggcggc 1028
           <![CDATA[ <210> 82]]>
           <![CDATA[ <211> 852]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 82]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ucuggcggag gcagcauccu ggccaucuac agcaccgugg ccagcagccu ggugcugcug 720
          gugagccugg gcgccaucag cuucggcgga ggcagcgcca ucggcggcua cauccccgag 780
          gccccuagag acggccaggc cuacgugcgg aaggacggcg agugggugcu gcugagcacc 840
          uuccugggca ag 852
           <![CDATA[ <210> 83]]>
           <![CDATA[ <211> 284]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 83]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly
              210 215 220
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu
          225 230 235 240
          Val Ser Leu Gly Ala Ile Ser Phe Gly Gly Gly Ser Ala Ile Gly Gly
                          245 250 255
          Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp
                      260 265 270
          Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Lys
                  275 280
           <![CDATA[ <210> 84]]>
           <![CDATA[ <211> 1142]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 84]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc 180
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc 240
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag 300
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac 360
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc 420
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc 480
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag 540
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc 600
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag 660
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug 720
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc 780
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa 840
          uuccugcccu uucagcaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau 900
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu cuggcggagg cagcauccug 960
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc 1020
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag 1080
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg 1140
          gc 1142
           <![CDATA[ <210> 85]]>
           <![CDATA[ <211> 966]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 85]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcgggugc agcccaccga gagcaucgug cgguucccca acaucaccaa ccugugcccc 120
          uucggcgagg uguucaacgc cacccgguuc gccagcgugu acgccuggaa ccggaagcgg 180
          aucagcaacu gcguggccga cuacagcgug cuguacaaca gcgccagcuu cagcaccuuc 240
          aagugcuacg gcgugagccc caccaagcug aacgaccugu gcuucaccaa cguguacgcc 300
          gacagcuucg ugauccgugg cgacgaggug cggcagaucg cacccggcca gacaggcaag 360
          aucgccgacu acaacuacaa gcugcccgac gacuucaccg gcugcgugau cgccuggaac 420
          agcaacaacc ucgacagcaa ggugggcggc aacuacaacu accuguaccg gcuguuccgg 480
          aagagcaacc ugaagcccuu cgagcgggac aucagcaccg agaucuacca agccggcucc 540
          accccuugca acggcgugga gggcuucaac ugcuacuucc cucugcagag cuacggcuuc 600
          cagcccacca acggcguggg cuaccagccc uaccgggugg uggugcugag cuucgagcug 660
          cugcacgccc cagccaccgu guguggcccc aagaagagca ccaaccuggu gaagaacaag 720
          ugcgugaacu ucaacuucaa cggccuuacc ggcaccggcg ugcugaccga gagcaacaag 780
          aaauuccugc ccuuucagca guucggccgg gacaucgccg acaccaccga cgcugugcgg 840
          gauccccaga cccuggagau ccuggacauc accccuugca gcucuggcgg aggcagcauc 900
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc 960
          agcuuc 966
           <![CDATA[ <210> 86]]>
           <![CDATA[ <211> 322]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 86]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe
                      20 25 30
          Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr
                  35 40 45
          Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys
              50 55 60
          Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe
          65 70 75 80
          Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr
                          85 90 95
          Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln
                      100 105 110
          Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu
                  115 120 125
          Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu
              130 135 140
          Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg
          145 150 155 160
          Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr
                          165 170 175
          Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr
                      180 185 190
          Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr
                  195 200 205
          Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro
              210 215 220
          Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys
          225 230 235 240
          Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr
                          245 250 255
          Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln Phe Gly Arg Asp Ile
                      260 265 270
          Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu
                  275 280 285
          Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr
              290 295 300
          Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile
          305 310 315 320
          Ser Phe
           <![CDATA[ <210> 87]]>
           <![CDATA[ <211> 1142]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 87]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cgggugcagc ccaccgagag caucgugcgg uuccccaaca ucaccaaccu gugccccuuc 180
          ggcgaggugu ucaacgccac ccgguucgcc agcguguacg ccuggaaccg gaagcggauc 240
          agcaacugcg uggccgacua cagcgugcug uacaacagcg ccagcuucag caccuucaag 300
          ugcuacggcg ugagccccac caagcugaac gaccugugcu ucaccaacgu guacgccgac 360
          agcuucguga uccguggcga cgaggugcgg cagaucgcac ccggccagac aggcaagauc 420
          gccgacuaca acuacaagcu gcccgacgac uucaccggcu gcgugaucgc cuggaacagc 480
          aacaaccucg acagcaaggu gggcggcaac uacaacuacc uguaccggcu guuccggaag 540
          agcaaccuga agcccuucga gcgggacauc agcaccgaga ucuaccaagc cggcuccacc 600
          ccuugcaacg gcguggaggg cuucaacugc uacuucccuc ugcagagcua cggcuuccag 660
          cccaccaacg gcgugggcua ccagcccuac cggguggugg ugcugagcuu cgagcugcug 720
          cacgccccag ccaccgugug uggccccaag aagagcacca accuggugaa gaacaagugc 780
          gugaacuuca acuucaacgg ccuuaccggc accggcgugc ugaccgagag caacaagaaa 840
          uuccugcccu uuugccaguu cggccgggac aucgccgaca ccaccgacgc ugugcgggau 900
          ccccagaccc uggagauccu ggacaucacc ccuugcagcu cuggcggagg cagcauccug 960
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc 1020
          uucugauaau aggcuggagc cucgguggcc uagcuucuug ccccuugggc cuccccccag 1080
          ccccuccucc ccuuccugca cccguacccc cguggucuuu gaauaaaguc ugagugggcg 1140
          gc 1142
           <![CDATA[ <210> 88]]>
           <![CDATA[ <211> 966]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 88]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcgggugc agcccaccga gagcaucgug cgguucccca acaucaccaa ccugugcccc 120
          uucggcgagg uguucaacgc cacccgguuc gccagcgugu acgccuggaa ccggaagcgg 180
          aucagcaacu gcguggccga cuacagcgug cuguacaaca gcgccagcuu cagcaccuuc 240
          aagugcuacg gcgugagccc caccaagcug aacgaccugu gcuucaccaa cguguacgcc 300
          gacagcuucg ugauccgugg cgacgaggug cggcagaucg cacccggcca gacaggcaag 360
          aucgccgacu acaacuacaa gcugcccgac gacuucaccg gcugcgugau cgccuggaac 420
          agcaacaacc ucgacagcaa ggugggcggc aacuacaacu accuguaccg gcuguuccgg 480
          aagagcaacc ugaagcccuu cgagcgggac aucagcaccg agaucuacca agccggcucc 540
          accccuugca acggcgugga gggcuucaac ugcuacuucc cucugcagag cuacggcuuc 600
          cagcccacca acggcguggg cuaccagccc uaccgggugg uggugcugag cuucgagcug 660
          cugcacgccc cagccaccgu guguggcccc aagaagagca ccaaccuggu gaagaacaag 720
          ugcgugaacu ucaacuucaa cggccuuacc ggcaccggcg ugcugaccga gagcaacaag 780
          aaauuccugc ccuuuugcca guucggccgg gacaucgccg acaccaccga cgcugugcgg 840
          gauccccaga cccuggagau ccuggacauc accccuugca gcucuggcgg aggcagcauc 900
          cuggccaucu acagcaccgu ggccagcagc cuggugcugc uggugagccu gggcgccauc 960
          agcuuc 966
           <![CDATA[ <210> 89]]>
           <![CDATA[ <211> 322]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 89]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Arg Val Gln Pro Thr Glu Ser Ile Val Arg Phe
                      20 25 30
          Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe Asn Ala Thr
                  35 40 45
          Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys
              50 55 60
          Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe
          65 70 75 80
          Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr
                          85 90 95
          Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu Val Arg Gln
                      100 105 110
          Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu
                  115 120 125
          Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser Asn Asn Leu
              130 135 140
          Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg
          145 150 155 160
          Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr
                          165 170 175
          Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr
                      180 185 190
          Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr
                  195 200 205
          Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu His Ala Pro
              210 215 220
          Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn Leu Val Lys Asn Lys
          225 230 235 240
          Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly Thr Gly Val Leu Thr
                          245 250 255
          Glu Ser Asn Lys Lys Phe Leu Pro Phe Cys Gln Phe Gly Arg Asp Ile
                      260 265 270
          Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln Thr Leu Glu Ile Leu
                  275 280 285
          Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr
              290 295 300
          Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile
          305 310 315 320
          Ser Phe
           <![CDATA[ <210> 90]]>
           <![CDATA[ <211> 1748]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 90]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc 1620
          aucagcuucu gauaauaggc uggagccucg guggccuagc uucuugcccc uugggccucc 1680
          ccccagcccc uccuccccuu ccugcacccg uacccccgug gucuuugaau aaagucugag 1740
          ugggcggc 1748
           <![CDATA[ <210> 91]]>
           <![CDATA[ <211> 1572]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 91]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc 1560
          gccaucagcu uc 1572
           <![CDATA[ <210> 92]]>
           <![CDATA[ <211> 524]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 92]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser
                      500 505 510
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
                  515 520
           <![CDATA[ <210> 93]]>
           <![CDATA[ <211> 1925]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 93]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc 1620
          aucagcuuca agaagaagaa gcggccacgg aacuccuaca agugcggcac caacaccaug 1680
          gagcgggagg agagcgagca gaccaagaag cgggagaaga uccacauucc ugaacggucc 1740
          gacgaagccc agcggguguu caagagcagc aagaccagca gcugcgacaa gagcgacacc 1800
          ugcuucugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc 1860
          cagccccuccc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg 1920
          gcggc 1925
           <![CDATA[ <210> 94]]>
           <![CDATA[ <211> 1749]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 94]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc 1560
          gccaucagcu ucaagaagaa gaagcggcca cggaacuccu acaagugcgg caccaacacc 1620
          auggagcggg aggagagcga gcagaccaag aagcgggaga agauccacau uccugaacgg 1680
          uccgacgaag cccagcgggu guucaagagc agcaagacca gcagcugcga caagagcgac 1740
          accugcuuc 1749
           <![CDATA[ <210> 95]]>
           <![CDATA[ <211> 583]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 95]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser
                      500 505 510
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Lys Lys Lys Lys
                  515 520 525
          Arg Pro Arg Asn Ser Tyr Lys Cys Gly Thr Asn Thr Met Glu Arg Glu
              530 535 540
          Glu Ser Glu Gln Thr Lys Lys Arg Glu Lys Ile His Ile Pro Glu Arg
          545 550 555 560
          Ser Asp Glu Ala Gln Arg Val Phe Lys Ser Ser Lys Thr Ser Ser Cys
                          565 570 575
          Asp Lys Ser Asp Thr Cys Phe
                      580
           <![CDATA[ <210> 96]]>
           <![CDATA[ <211> 1805]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 96]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc 1620
          aucagcuuca agcggcagua caaggacaug augagcgagg gaggaccacc uggcgcugag 1680
          ccacagugau aauaggcugg agccucggug gccuagcuuc uugccccuug ggccuccccc 1740
          cagccccuccc uccccuuccu gcacccguac ccccgugguc uuugaauaaa gucugagugg 1800
          gcggc 1805
           <![CDATA[ <210> 97]]>
           <![CDATA[ <211> 1629]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 97]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc 1560
          gccaucagcu ucaagcggca guacaaggac augaugagcg agggaggacc accuggcgcu 1620
          gagccacag 1629
           <![CDATA[ <210> 98]]>
           <![CDATA[ <211> 543]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 98]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser
                      500 505 510
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Lys Arg Gln Tyr
                  515 520 525
          Lys Asp Met Met Ser Glu Gly Gly Pro Gly Ala Glu Pro Gln
              530 535 540
           <![CDATA[ <210> 99]]>
           <![CDATA[ <211> 1853]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 99]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcggc 1560
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag 1620
          gacggcgagu gggugcugcu gagcaccuuc cugggcggag gcagcauccu ggccaucuac 1680
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa 1740
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccucccuc 1800
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc 1853
           <![CDATA[ <210> 100]]>
           <![CDATA[ <211> 1677]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 100]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg 1560
          aaggacggcg agugggugcu gcugagcacc uuccugggcg gaggcagcau ccuggccauc 1620
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc 1677
           <![CDATA[ <210> 101]]>
           <![CDATA[ <211> 559]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 101]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro
                      500 505 510
          Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
                  515 520 525
          Ser Thr Phe Leu Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val
              530 535 540
          Ala Ser Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
          545 550 555
           <![CDATA[ <210> 102]]>
           <![CDATA[ <211> 1856]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 102]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          auccuggcca ucuacagcac cguggccagc agccuggugc ugcuggugag ccugggcgcc 1620
          aucagcuucg gcggaggcag cgccaucggc ggcuacaucc ccgaggcccc uagagacggc 1680
          caggccuacg ugcggaagga cggcgagugg gugcugcuga gcaccuuccu gggcaaguga 1740
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccuccccc ccagccccuc 1800
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc 1856
           <![CDATA[ <210> 103]]>
           <![CDATA[ <211> 1680]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 103]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcauccugg ccaucuacag caccguggcc agcagccugg ugcugcuggu gagccugggc 1560
          gccaucagcu ucggcggagg cagcgccauc ggcggcuaca uccccgaggc cccuagagac 1620
          ggccaggccu acgugcggaa ggacggcgag ugggugcugc ugagcaccuu ccugggcaag 1680
           <![CDATA[ <210> 104]]>
           <![CDATA[ <211> 560]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 104]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser
                      500 505 510
          Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe Gly Gly Gly Ser
                  515 520 525
          Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr
              530 535 540
          Val Arg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly Lys
          545 550 555 560
           <![CDATA[ <210> 105]]>
           <![CDATA[ <211> 1826]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 105]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          auccuggcca ucuacagccu gggcuucauc gccggccuga ucgccaucgu gauggugacc 1620
          aucaugcugu gcugcaugac cagcugcugc agcugccuga agggcuguug cagcugcggc 1680
          agcugcugca aguucgacga ggacgacuga uaauaggcug gagccucggu ggccuagcuu 1740
          cuugccccuu gggccuccccc ccagccccuc cuccccuucc ugcacccgua cccccguggu 1800
          cuuugaauaa agucugagug ggcggc 1826
           <![CDATA[ <210> 106]]>
           <![CDATA[ <211> 1650]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 106]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcauccugg ccaucuacag ccugggcuuc aucgccggcc ugaucgccau cgugauggug 1560
          accaucaugc ugugcugcau gaccagcugc ugcagcugcc ugaagggcug uugcagcugc 1620
          ggcagcugcu gcaaguucga cgaggacgac 1650
           <![CDATA[ <210> 107]]>
           <![CDATA[ <211> 550]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 107]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Leu Gly Phe Ile Ala
                      500 505 510
          Gly Leu Ile Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr
                  515 520 525
          Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys
              530 535 540
          Lys Phe Asp Glu Asp Asp
          545 550
           <![CDATA[ <210> 108]]>
           <![CDATA[ <211> 1826]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 108]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcagc 1560
          agcagcaucg ccagcuucuu cuucaucauc gggcugauca ucggccucuu ccuggugcug 1620
          cgggugggca uccaccugug caucaagcug aagcacacca agaagagaca gaucuacacc 1680
          gacaucgaga ugaaccggcu gggcaaguga uaauaggcug gagccucggu ggccuagcuu 1740
          cuugccccuu gggccuccccc ccagccccuc cuccccuucc ugcacccgua cccccguggu 1800
          cuuugaauaa agucugagug ggcggc 1826
           <![CDATA[ <210> 109]]>
           <![CDATA[ <211> 1650]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 109]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          agcagcagca ucgccagcuu cuucuucauc aucgggcuga ucaucggccu cuuccuggug 1560
          cugcgggugg gcauccaccu gugcaucaag cugaagcaca ccaagaagag acagaucuac 1620
          accgacaucg agaugaaccg gcugggcaag 1650
           <![CDATA[ <210> 110]]>
           <![CDATA[ <211> 550]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 110]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Ser Ser Ser Ile Ala Ser Phe Phe Phe Ile Ile Gly
                      500 505 510
          Leu Ile Ile Gly Leu Phe Leu Val Leu Arg Val Gly Ile His Leu Cys
                  515 520 525
          Ile Lys Leu Lys His Thr Lys Lys Arg Gln Ile Tyr Thr Asp Ile Glu
              530 535 540
          Met Asn Arg Leu Gly Lys
          545 550
           <![CDATA[ <210> 111]]>
           <![CDATA[ <211> 1775]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 111]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggacc caacaucacc 960
          aaccugugcc ccuucggcga gguguucaac gccacccggu ucgccagcgu guacgccugg 1020
          aaccggaagc ggaucagcaa cugcguggcc gacuacagcg ugcuguacaa cagcgccagc 1080
          uucagcaccu ucaagugcua cggcgugagc cccaccaagc ugaacgaccu gugcuucacc 1140
          aacguguacg ccgacagcuu cgugauccgu ggcgacgagg ugcggcagau cgcacccggc 1200
          cagacaggca agaucgccga cuacaacuac aagcugcccg acgacuucac cggcugcgug 1260
          aucgccugga acagcaacaa ccucgacagc aaggugggcg gcaacuacaa cuaccuguac 1320
          cggcuguucc ggaagagcaa ccugaagccc uucgagcggg acaucagcac cgagaucuac 1380
          caagccggcu ccaccccuug caacggcgug gagggcuuca acugcuacuu cccucugcag 1440
          agcuacggcu uccagcccac caacggcgug ggcuaccagc ccuaccgggu gguggugcug 1500
          agcuucgagc ugcugcacgc cccagccacc guguguggcc ccaagucugg cggaggcggc 1560
          agcgccaucg gcggcuacau ccccgaggcc ccuagagacg gccaggccua cgugcggaag 1620
          gacggcgagu gggugcugcu gagcaccuuc cugggcugau aauaggcugg agccucggug 1680
          gccuagcuuc uugccccuug ggccuccccc cagccccuccc uccccuuccu gcacccguac 1740
          ccccgugguc uuugaauaaa gucugagugg gcggc 1775
           <![CDATA[ <210> 112]]>
           <![CDATA[ <211> 1599]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 112]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acccaacauc 900
          accaaccugu gccccuucgg cgagguguuc aacgccaccc gguucgccag cguguacgcc 960
          uggaaccgga agcggaucag caacugcgug gccgacuaca gcgugcugua caacagcgcc 1020
          agcuucagca ccuucaagug cuacggcgug agccccacca agcugaacga ccugugcuuc 1080
          accaacgugu acgccgacag cuucgugauc cguggcgacg aggugcggca gaucgcaccc 1140
          ggccagacag gcaagaucgc cgacuacaac uacaagcugc ccgacgacuu caccggcugc 1200
          gugaucgccu ggaacagcaa caaccucgac agcaaggugg gcggcaacua caacuaccug 1260
          uaccggcugu uccggaagag caaccugaag cccuucgagc gggacaucag caccgagauc 1320
          uaccaagccg gcuccacccc uugcaacggc guggagggcu ucaacugcua cuucccucug 1380
          cagagcuacg gcuuccagcc caccaacggc gugggcuacc agcccuaccg ggugguggug 1440
          cugagcuucg agcugcugca cgccccagcc accgugugug gccccaaguc uggcggaggc 1500
          ggcagcgcca ucggcggcua cauccccgag gccccuagag acggccaggc cuacgugcgg 1560
          aaggacggcg agugggugcu gcugagcacc uuccugggc 1599
           <![CDATA[ <210> 113]]>
           <![CDATA[ <211> 533]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 113]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu Cys
              290 295 300
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
          305 310 315 320
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                          325 330 335
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
                      340 345 350
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
                  355 360 365
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
              370 375 380
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
          385 390 395 400
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                          405 410 415
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
                      420 425 430
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
                  435 440 445
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
              450 455 460
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
          465 470 475 480
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                          485 490 495
          Ser Gly Gly Gly Gly Ser Ala Ile Gly Gly Tyr Ile Pro Glu Ala Pro
                      500 505 510
          Arg Asp Gly Gln Ala Tyr Val Arg Lys Asp Gly Glu Trp Val Leu Leu
                  515 520 525
          Ser Thr Phe Leu Gly
              530
           <![CDATA[ <210> 114]]>
           <![CDATA[ <211> 1973]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 114]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcggaucgg gaggcggaca gcgggugcag 960
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1020
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1080
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1140
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1200
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1260
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1320
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1380
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1440
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1500
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1560
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1620
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1680
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1740
          cuggagaucc uggacaucac cccuugcagc ucuggcggag gcagcauccu ggccaucuac 1800
          agcaccgugg ccagcagccu ggugcugcug gugagccugg gcgccaucag cuucugauaa 1860
          uaggcuggag ccucgguggc cuagcuucuu gccccuuggg ccucccccca gccccuccuc 1920
          cccuuccugc acccguaccc ccguggucuu ugaauaaagu cugagugggc ggc 1973
           <![CDATA[ <210> 115]]>
           <![CDATA[ <211> 1797]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 115]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcggau cgggaggcgg acagcgggug 900
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 960
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1020
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1080
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1140
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1200
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1260
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1320
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1380
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1440
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1500
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1560
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1620
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1680
          acccuggaga uccuggacau caccccuugc agcucuggcg gaggcagcau ccuggccauc 1740
          uacagcaccg uggccagcag ccuggugcug cuggugagcc ugggcgccau cagcuuc 1797
           <![CDATA[ <210> 116]]>
           <![CDATA[ <211> 599]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 116]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Gly Ser Gly Gly Gly Gln Arg Val Gln Pro Thr Glu
              290 295 300
          Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
          305 310 315 320
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                          325 330 335
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
                      340 345 350
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
                  355 360 365
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
              370 375 380
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
          385 390 395 400
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                          405 410 415
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
                      420 425 430
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
                  435 440 445
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
              450 455 460
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
          465 470 475 480
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                          485 490 495
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr Asn
                      500 505 510
          Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr Gly
                  515 520 525
          Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln Gln
              530 535 540
          Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg Asp Pro Gln
          545 550 555 560
          Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly Ser
                          565 570 575
          Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val
                      580 585 590
          Ser Leu Gly Ala Ile Ser Phe
                  595
           <![CDATA[ <210> 117]]>
           <![CDATA[ <211> 1799]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 117]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgg aggcggaucg ggaggcggac ccaacaucac caaccugugc 1020
          cccuucggcg agguguucaa cgccacccgg uucgccagcg uguacgccug gaaccggaag 1080
          cggaucagca acugcguggc cgacuacagc gugcuguaca acagcgccag cuucagcacc 1140
          uucaagugcu acggcgugag ccccaccaag cugaacgacc ugugcuucac caacguguac 1200
          gccgacagcu ucgugauccg uggcgacgag gugcggcaga ucgcacccgg ccagacaggc 1260
          aagaucgccg acuacaacua caagcugccc gacgacuuca ccggcugcgu gaucgccugg 1320
          aacagcaaca accucgacag caaggugggc ggcaacuaca acuaccugua ccggcuguuc 1380
          cggaagagca accugaagcc cuucgagcgg gacaucagca ccgagaucua ccaagccggc 1440
          uccaccccuu gcaacggcgu ggagggcuuc aacugcuacu ucccucugca gagcuacggc 1500
          uuccagccca ccaacggcgu gggcuaccag cccuaccggg ugguggugcu gagcuucgag 1560
          cugcugcacg ccccagccac cguguguggc cccaagucug gcggaggcag cauccuggcc 1620
          aucuacagca ccguggccag cagccuggug cugcugguga gccugggcgc caucagcuuc 1680
          ugauaauagg cuggagccuc gguggccuag cuucuugccc cuugggccuc cccccagccc 1740
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc 1799
           <![CDATA[ <210> 118]]>
           <![CDATA[ <211> 1623]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 118]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cggaggcgga ucgggaggcg gacccaacau caccaaccug 960
          ugccccuucg gcgagguguu caacgccacc cgguucgcca gcguguacgc cuggaaccgg 1020
          aagcggauca gcaacugcgu ggccgacuac agcgugcugu acaacagcgc cagcuucagc 1080
          accuucaagu gcuacggcgu gagccccacc aagcugaacg accugugcuu caccaacgug 1140
          uacgccgaca gcuucgugau ccguggcgac gaggugcggc agaucgcacc cggccagaca 1200
          ggcaagaucg ccgacuacaa cuacaagcug cccgacgacu ucaccggcug cgugaucgcc 1260
          uggaacagca acaaccucga cagcaaggug ggcggcaacu acaacuaccu guaccggcug 1320
          uuccggaaga gcaaccugaa gcccuucgag cgggacauca gcaccgagau cuaccaagcc 1380
          ggcuccaccc cuugcaacgg cguggagggc uucaacugcu acuucccucu gcagagcuac 1440
          ggcuuccagc ccaccaacgg cgugggcuac cagcccuacc gggugguggu gcugagcuuc 1500
          gagcugcugc acgccccagc caccgugugu ggccccaagu cuggcggagg cagcauccug 1560
          gccaucuaca gcaccguggc cagcagccug gugcugcugg ugagccuggg cgccaucagc 1620
          uuc 1623
           <![CDATA[ <210> 119]]>
           <![CDATA[ <211> 541]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 119]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Gly Gly Gly Ser Gly Gly Gly Pro Asn Ile Thr Asn Leu
          305 310 315 320
          Cys Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr
                          325 330 335
          Ala Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val
                      340 345 350
          Leu Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser
                  355 360 365
          Pro Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser
              370 375 380
          Phe Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr
          385 390 395 400
          Gly Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly
                          405 410 415
          Cys Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly
                      420 425 430
          Asn Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro
                  435 440 445
          Phe Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro
              450 455 460
          Cys Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr
          465 470 475 480
          Gly Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val
                          485 490 495
          Val Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro
                      500 505 510
          Lys Ser Gly Gly Gly Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser
                  515 520 525
          Ser Leu Val Leu Leu Val Ser Leu Gly Ala Ile Ser Phe
              530 535 540
           <![CDATA[ <210> 120]]>
           <![CDATA[ <211> 2024]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 120]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgg aggcggaucg ggaggcggac agcgggugca gcccaccgag 1020
          agcaucgugc gguuccccaa caucaccaac cugugccccu ucggcgaggu guucaacgcc 1080
          acccgguucg ccagcgugua cgccuggaac cggaagcgga ucagcaacug cguggccgac 1140
          uacagcgugc uguacaacag cgccagcuuc agcaccuuca agugcuacgg cgugagcccc 1200
          accaagcuga acgaccugug cuucaccaac guguacgccg acagcuucgu gauccguggc 1260
          gacgaggugc ggcagaucgc acccggccag acaggcaaga ucgccgacua caacuacaag 1320
          cugcccgacg acuucaccgg cugcgugauc gccuggaaca gcaacaaccu cgacagcaag 1380
          gugggcggca acuacaacua ccuguaccgg cuguuccgga agagcaaccu gaagcccuuc 1440
          gagcgggaca ucagcaccga gaucuaccaa gccggcucca ccccuugcaa cggcguggag 1500
          ggcuucaacu gcuacuuccc ucugcagagc uacggcuucc agcccaccaa cggcgugggc 1560
          uaccagcccu accggguggu ggugcugagc uucgagcugc ugcacgcccc agccaccgug 1620
          uguggcccca agaagagcac caaccuggug aagaacaagu gcgugaacuu caacuucaac 1680
          ggccuuaccg gcaccggcgu gcugaccgag agcaacaaga aauuccugcc cuuucagcag 1740
          uucggccggg acaucgccga caccaccgac gcugugcggg auccccagac ccuggagauc 1800
          cuggacauca ccccuugcag cucuggcgga ggcagcaucc uggccaucua cagcaccgug 1860
          gccagcagcc uggugcugcu ggugagccug ggcgccauca gcuucugaua auaggcugga 1920
          gccucggugg ccuagcuucu ugccccuugg gccucccccc agccccuccu ccccuuccug 1980
          cacccguacc cccguggucu uugaauaaag ucugaguggg cggc 2024
           <![CDATA[ <210> 121]]>
           <![CDATA[ <211> 1848]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 121]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cggaggcgga ucgggaggcg gacagcgggu gcagcccacc 960
          gagagcaucg ugcgguuccc caacaucacc aaccugugcc ccuucggcga gguguucaac 1020
          gccacccggu ucgccagcgu guacgccugg aaccggaagc ggaucagcaa cugcguggcc 1080
          gacuacagcg ugcuguacaa cagcgccagc uucagcaccu ucaagugcua cggcgugagc 1140
          cccaccaagc ugaacgaccu gugcuucacc aacguguacg ccgacagcuu cgugauccgu 1200
          ggcgacgagg ugcggcagau cgcacccggc cagacaggca agaucgccga cuacaacuac 1260
          aagcugcccg acgacuucac cggcugcgug aucgccugga acagcaacaa ccucgacagc 1320
          aaggugggcg gcaacuacaa cuaccuguac cggcuguucc ggaagagcaa ccugaagccc 1380
          uucgagcggg acaucagcac cgagaucuac caagccggcu ccaccccuug caacggcgug 1440
          gagggcuuca acugcuacuu cccucugcag agcuacggcu uccagcccac caacggcgug 1500
          ggcuaccagc ccuaccgggu gguggugcug agcuucgagc ugcugcacgc cccagccacc 1560
          guguguggcc ccaagaagag caccaaccug gugaagaaca agugcgugaa cuucaacuuc 1620
          aacggccuua ccggcaccgg cgugcugacc gagagcaaca agaaauuccu gcccuuucag 1680
          caguucggcc gggacaucgc cgacaccacc gacgcugugc gggaucccca gacccuggag 1740
          auccuggaca ucaccccuug cagcucuggc ggaggcagca uccuggccau cuacagcacc 1800
          guggccagca gccuggugcu gcuggugagc cugggcgcca ucagcuuc 1848
           <![CDATA[ <210> 122]]>
           <![CDATA[ <211> 616]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 122]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Gly Gly Gly Ser Gly Gly Gly Gln Arg Val Gln Pro Thr
          305 310 315 320
          Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly
                          325 330 335
          Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg
                      340 345 350
          Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser
                  355 360 365
          Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu
              370 375 380
          Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg
          385 390 395 400
          Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala
                          405 410 415
          Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala
                      420 425 430
          Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr
                  435 440 445
          Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp
              450 455 460
          Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val
          465 470 475 480
          Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro
                          485 490 495
          Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe
                      500 505 510
          Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser Thr
                  515 520 525
          Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu Thr
              530 535 540
          Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe Gln
          545 550 555 560
          Gln Phe Gly Arg Asp Ile Ala Asp Thr Asp Ala Val Arg Asp Pro
                          565 570 575
          Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Ser Gly Gly Gly
                      580 585 590
          Ser Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu
                  595 600 605
          Val Ser Leu Gly Ala Ile Ser Phe
              610 615
           <![CDATA[ <210> 123]]>
           <![CDATA[ <211> 3995]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 123]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucaccc 2100
          cggagggcaa ggagcguggc cagccagagc aucaucgccu acaccaugag ccugggcgcc 2160
          gagaacagcg uggccuacag caacaacagc aucgccaucc ccaccaacuu caccaucagc 2220
          gugaccaccg agauucugcc cgugagcaug accaagacca gcguggacug caccauguac 2280
          aucugcggcg acagcaccga gugcagcaac cugcugcugc aguacggcag cuucugcacc 2340
          cagcugaacc gggcccugac cggcaucgcc guggagcagg acaagaacac ccaggaggug 2400
          uucgcccagg ugaagcagau cuacaagacc ccucccauca aggacuucgg cggcuucaac 2460
          uucagccaga uccugcccga ccccagcaag cccagcaagc ggagcuucau cgaggaccug 2520
          cuguucaaca aggugacccu agccgacgcc ggcuucauca agcaguacgg cgacugccuc 2580
          ggcgacauag ccgcccggga ccugaucugc gcccagaagu ucaacggccu gaccgugcug 2640
          ccuccccugc ugaccgacga gaugaucgcc caguacacca gcgcccuguu agccggaacc 2700
          aucaccagcg gcuggacuuu cggcgcugga gccgcucugc agauccccuu cgccaugcag 2760
          auggccuacc gguucaacgg caucggcgug acccagaacg ugcuguacga gaaccagaag 2820
          cugaucgcca accaguucaa cagcgccauc ggcaagaucc aggacagccu gagcagcacc 2880
          gcuagcgccc ugggcaagcu gcaggacgug gugaaccaga acgcccaggc ccugaacacc 2940
          cuggugaagc agcugagcag caacuucggc gccaucagca gcgugcugaa cgacauccug 3000
          agccggcugg acaaggugga ggccgaggug cagaucgacc ggcugaucac uggccggcug 3060
          cagagccugc agaccuacgu gacccagcag cugauccggg ccgccgagau ucgggccagc 3120
          gccaaccugg ccgccaccaa gaugagcgag ugcgugcugg gccagagcaa gcggguggac 3180
          uucugcggca agggcuacca ccugaugagc uuuccccaga gcgcacccca cggaguggug 3240
          uuccugcacg ugaccuacgu gcccgcccag gagaagaacu ucaccaccgc cccagccauc 3300
          ugccacgacg gcaaggccca cuuuccccgg gagggcgugu ucgugagcaa cggcacccac 3360
          ugguucguga cccagcggaa cuucuacgag ccccagauca ucaccaccga caacaccuuc 3420
          gugagcggca acugcgacgu ggugaucggc aucgugaaca acaccgugua cgauccccug 3480
          cagcccgagc uggacagcuu caaggaggag cuggacaagu acuucaagaa ucacaccagc 3540
          cccgacgugg accugggcga caucagcggc aucaacgcca gcguggugaa cauccagaag 3600
          gagaucgauc ggcugaacga gguggccaag aaccugaacg agagccugau cgaccugcag 3660
          gagcugggca aguacgagca guacaucaag uggcccuggu acaucuggcu gggcuucauc 3720
          gccggccuga ucgccaucgu gauggugacc aucaugcugu gcugcaugac cagcugcugc 3780
          agcugccuga agggcuguug cagcugcggc agcugcugca aguucgacga ggacgacagc 3840
          gagcccgugc ugaagggcgu gaagcugcac uacaccugau aauaggcugg agccucggug 3900
          gccuagcuuc uugccccuug ggccuccccc cagccccuccc uccccuuccu gcacccguac 3960
          ccccgugguc uuugaauaaa gucugagugg gcggc 3995
           <![CDATA[ <210> 124]]>
           <![CDATA[ <211> 3819]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 124]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
          ccccggaggg caaggagcgu ggccagccag agcaucaucg ccuacaccau gagccugggc 2100
          gccgagaaca gcguggccua cagcaacaac agcaucgcca uccccaccaa cuucaccauc 2160
          agcgugacca ccgagauucu gcccgugagc augaccaaga ccagcgugga cugcaccaug 2220
          uacaucugcg gcgacagcac cgagugcagc aaccugcugc ugcaguacgg cagcuucugc 2280
          acccagcuga accgggcccu gaccggcauc gccguggagc aggacaagaa cacccaggag 2340
          guguucgccc aggugaagca gaucuacaag accccuccca ucaaggacuu cggcggcuuc 2400
          aacuucagcc agauccugcc cgaccccagc aagcccagca agcggagcuu caucgaggac 2460
          cugcuguuca acaaggugac ccuagccgac gccggcuuca ucaagcagua cggcgacugc 2520
          cucggcgaca uagccgcccg ggaccugauc ugcgcccaga aguucaacgg ccugaccgug 2580
          cugccuccccc ugcugaccga cgagaugauc gcccaguaca ccagcgcccu guuagccgga 2640
          accaucacca gcggcuggac uuucggcgcu ggagccgcuc ugcagauccc cuucgccaug 2700
          cagauggccu accgguucaa cggcaucggc gugacccaga acgugcugua cgagaaccag 2760
          aagcugaucg ccaaccaguu caacagcgcc aucggcaaga uccaggacag ccugagcagc 2820
          accgcuagcg cccugggcaa gcugcaggac guggugaacc agaacgccca ggcccugaac 2880
          acccugguga agcagcugag cagcaacuuc ggcgccauca gcagcgugcu gaacgacauc 2940
          cugagccggc uggacaaggu ggaggccgag gugcagaucg accggcugau cacuggccgg 3000
          cugcagagcc ugcagaccua cgugacccag cagcugaucc gggccgccga gauucgggcc 3060
          agcgccaacc uggccgccac caagaugagc gagugcgugc ugggccagag caagcgggug 3120
          gacuucugcg gcaagggcua ccaccugaug agcuuucccc agagcgcacc ccacggagug 3180
          guguuccugc acgugaccua cgugcccgcc caggagaaga acuucaccac cgccccagcc 3240
          aucugccacg acggcaaggc ccacuuuccc cgggagggcg uguucgugag caacggcacc 3300
          cacugguucg ugacccagcg gaacuucuac gagccccaga ucaucaccac cgacaacacc 3360
          uucgugagcg gcaacugcga cguggugauc ggcaucguga acaacaccgu guacgauccc 3420
          cugcagcccg agcuggacag cuucaaggag gagcuggaca aguacuucaa gaaucacacc 3480
          agccccgacg uggaccuggg cgacaucagc ggcaucaacg ccagcguggu gaacauccag 3540
          aaggagaucg aucggcugaa cgagguggcc aagaaccuga acgagagccu gaucgaccug 3600
          caggagcugg gcaaguacga gcaguacauc aaguggcccu gguacaucug gcugggcuuc 3660
          aucgccggcc ugaucgccau cgugauggug accaucaugc ugugcugcau gaccagcugc 3720
          ugcagcugcc ugaagggcug uugcagcugc ggcagcugcu gcaaguucga cgaggacgac 3780
          agcgagcccg ugcugaaggg cgugaagcug cacuacacc 3819
           <![CDATA[ <210> 125]]>
           <![CDATA[ <211> 1273]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 125]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
                  675 680 685
          Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
              690 695 700
          Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
          705 710 715 720
          Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
                          725 730 735
          Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
                      740 745 750
          Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
                  755 760 765
          Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
              770 775 780
          Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
          785 790 795 800
          Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
                          805 810 815
          Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
                      820 825 830
          Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
                  835 840 845
          Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
              850 855 860
          Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
          865 870 875 880
          Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
                          885 890 895
          Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
                      900 905 910
          Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
                  915 920 925
          Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
              930 935 940
          Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
          945 950 955 960
          Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
                          965 970 975
          Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
                      980 985 990
          Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
                  995 1000 1005
          Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
              1010 1015 1020
          Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys
              1025 1030 1035
          Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro
              1040 1045 1050
          Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val
              1055 1060 1065
          Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His
              1070 1075 1080
          Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn
              1085 1090 1095
          Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln
              1100 1105 1110
          Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val
              1115 1120 1125
          Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro
              1130 1135 1140
          Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
              1145 1150 1155
          His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn
              1160 1165 1170
          Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu
              1175 1180 1185
          Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu
              1190 1195 1200
          Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu
              1205 1210 1215
          Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met
              1220 1225 1230
          Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
              1235 1240 1245
          Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro
              1250 1255 1260
          Val Leu Lys Gly Val Lys Leu His Tyr Thr
              1265 1270
           <![CDATA[ <210> 126]]>
           <![CDATA[ <211> 8]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 126]]>
          Lys Pro Ser Lys Arg Ser Phe Ile
          1 5
           <![CDATA[ <210> 127]]>
           <![CDATA[ <211> 13]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 127]]>
          Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
          1 5 10
           <![CDATA[ <210> 128]]>
           <![CDATA[ <211> 9]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 128]]>
          ccrccaugg 9
           <![CDATA[ <210> 129]]>
           <![CDATA[ <211> 11]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 129]]>
          gggauccuac c 11
           <![CDATA[ <210> 130]]>
           <![CDATA[ <211> 9]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 130]]>
          uuauuuaww 9
           <![CDATA[ <210> 131]]>
           <![CDATA[ <211> 47]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 131]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag agccacc 47
           <![CDATA[ <210> 132]]>
           <![CDATA[ <211> 119]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 132]]>
          ugauaauagg cuggagccuc gguggccaug cuucuugccc cuugggccuc cccccagccc 60
          cuccuccccu uccugcaccc guacccccgu ggucuuugaa uaaagucuga gugggcggc 119
           <![CDATA[ <210> 133]]>
           <![CDATA[ <211> 15]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 133]]>
          Gly Gly Ser Gly Gly Ser Gly Gly Ser Gly Gly Ser Gly Gly Gly
          1 5 10 15
           <![CDATA[ <210> 134]]>
           <![CDATA[ <211> 8]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 134]]>
          Gly Gly Gly Ser Gly Gly Gly Ser
          1 5
           <![CDATA[ <210> 135]]>
           <![CDATA[ <211> 7]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 135]]>
          Gly Gly Gly Ser Gly Gly Gly
          1 5
           <![CDATA[ <210> 136]]>
           <![CDATA[ <211> 4]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 136]]>
          Gly Gly Gly Ser
          1               
           <![CDATA[ <210> 137]]>
           <![CDATA[ <211> 4]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 137]]>
          Ser Gly Gly Ser
          1               
           <![CDATA[ <210> 138]]>
           <![CDATA[ <211> 5]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 138]]>
          Ser Gly Gly Gly Ser
          1 5
           <![CDATA[ <210> 139]]>
           <![CDATA[ <211> 1602]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 139]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagcccaaca ucaccaaccu gugccccuuc ggcgaggugu ucaacgccac ccgguucgcc 120
          agcguguacg ccuggaaccg gaagcggauc agcaacugcg uggccgacua cagcgugcug 180
          uacaacagcg ccagcuucag caccuucaag ugcuacggcg ugagccccac caagcugaac 240
          gaccugugcu ucaccaacgu guacgccgac agcuucguga uccguggcga cgaggugcgg 300
          cagaucgcac ccggccagac aggcaagauc gccgacuaca acuacaagcu gcccgacgac 360
          uucaccggcu gcgugaucgc cuggaacagc aacaaccucg acagcaaggu gggcggcaac 420
          uacaacuacc uguaccggcu guuccggaag agcaaccuga agcccuucga gcgggacauc 480
          agcaccgaga ucuaccaagc cggcuccacc ccuugcaacg gcguggaggg cuucaacugc 540
          uacuucccuc ugcagagcua cggcuuccag cccaccaacg gcgugggcua ccagcccuac 600
          cggguggugg ugcugagcuu cgagcugcug cacgccccag ccaccgugug uggccccaag 660
          ucuggcggag gcagcggagg cggaagccag ugcgugaacc ugaccacccg gacccagcug 720
          ccaccagccu acaccaacag cuucacccgg ggcgucuacu accccgacaa gguguuccgg 780
          agcagcgucc ugcacagcac ccaggaccug uuccugcccu ucuucagcaa cgugaccugg 840
          uuccacgcca uccacgugag cggcaccaac ggcaccaagc gguucgacaa ccccgugcug 900
          cccuucaacg acggcgugua cuucgccagc accgagaaga gcaacaucau ccggggcugg 960
          aucuucggca ccacccugga cagcaagacc cagagccugc ugaucgugaa uaacgccacc 1020
          aacgugguga ucaaggugug cgaguuccag uucugcaacg accccuuccu gggcguguac 1080
          uaccacaaga acaacaagag cuggauggag agcgaguucc ggguguacag cagcgccaac 1140
          aacugcaccu ucgaguacgu gagccagccc uuccugaugg accuggaggg caagcagggc 1200
          aacuucaaga accugcggga guucguguuc aagaacaucg acggcuacuu caagaucuac 1260
          agcaagcaca ccccaaucaa ccuggugcgg gaucugcccc agggcuucuc agcccuggag 1320
          ccccuggugg accugcccau cggcaucaac aucacccggu uccagacccu gcuggcccug 1380
          caccggagcu accugacccc aggcgacagc agcagcgggu ggacagcagg cgcggcugcu 1440
          uacuacgugg gcuaccugca gccccggacc uuccugcuga aguacaacga gaacggcacc 1500
          aucaccgacg ccguggacuc uggcggaggc agcauccugg ccaucuacag caccguggcc 1560
          agcagccugg ugcugcuggu gagccugggc gccaucagcu uc 1602
           <![CDATA[ <210> 140]]>
           <![CDATA[ <211> 534]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 140]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu
                      20 25 30
          Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys
                  35 40 45
          Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala
              50 55 60
          Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn
          65 70 75 80
          Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly
                          85 90 95
          Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp
                      100 105 110
          Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp
                  115 120 125
          Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu
              130 135 140
          Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile
          145 150 155 160
          Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu
                          165 170 175
          Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr
                      180 185 190
          Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu
                  195 200 205
          Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly
              210 215 220
          Ser Gly Gly Gly Ser Gln Cys Val Asn Leu Thr Thr Arg Thr Gln Leu
          225 230 235 240
          Pro Pro Ala Tyr Thr Asn Ser Phe Thr Arg Gly Val Tyr Tyr Pro Asp
                          245 250 255
          Lys Val Phe Arg Ser Ser Val Leu His Ser Thr Gln Asp Leu Phe Leu
                      260 265 270
          Pro Phe Phe Ser Asn Val Thr Trp Phe His Ala Ile His Val Ser Gly
                  275 280 285
          Thr Asn Gly Thr Lys Arg Phe Asp Asn Pro Val Leu Pro Phe Asn Asp
              290 295 300
          Gly Val Tyr Phe Ala Ser Thr Glu Lys Ser Asn Ile Ile Arg Gly Trp
          305 310 315 320
          Ile Phe Gly Thr Thr Leu Asp Ser Lys Thr Gln Ser Leu Leu Ile Val
                          325 330 335
          Asn Asn Ala Thr Asn Val Val Ile Lys Val Cys Glu Phe Gln Phe Cys
                      340 345 350
          Asn Asp Pro Phe Leu Gly Val Tyr Tyr His Lys Asn Asn Lys Ser Trp
                  355 360 365
          Met Glu Ser Glu Phe Arg Val Tyr Ser Ser Ala Asn Asn Cys Thr Phe
              370 375 380
          Glu Tyr Val Ser Gln Pro Phe Leu Met Asp Leu Glu Gly Lys Gln Gly
          385 390 395 400
          Asn Phe Lys Asn Leu Arg Glu Phe Val Phe Lys Asn Ile Asp Gly Tyr
                          405 410 415
          Phe Lys Ile Tyr Ser Lys His Thr Pro Ile Asn Leu Val Arg Asp Leu
                      420 425 430
          Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu Val Asp Leu Pro Ile Gly
                  435 440 445
          Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu Ala Leu His Arg Ser Tyr
              450 455 460
          Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp Thr Ala Gly Ala Ala Ala
          465 470 475 480
          Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr Phe Leu Leu Lys Tyr Asn
                          485 490 495
          Glu Asn Gly Thr Ile Thr Asp Ala Val Asp Ser Gly Gly Gly Ser Ile
                      500 505 510
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser
                  515 520 525
          Leu Gly Ala Ile Ser Phe
              530
           <![CDATA[ <210> 141]]>
           <![CDATA[ <211> 1778]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 141]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          cccaacauca ccaaccugug ccccuucggc gagguguuca acgccacccg guucgccagc 180
          guguacgccu ggaaccggaa gcggaucagc aacugcgugg ccgacuacag cgugcuguac 240
          aacagcgcca gcuucagcac cuucaagugc uacggcguga gccccaccaa gcugaacgac 300
          cugugcuuca ccaacgugua cgccgacagc uucgugaucc guggcgacga ggugcggcag 360
          aucgcacccg gccagacagg caagaucgcc gacuacaacu acaagcugcc cgacgacuuc 420
          accggcugcg ugaucgccug gaacagcaac aaccucgaca gcaagguggg cggcaacuac 480
          aacuaccugu accggcuguu ccggaagagc aaccugaagc ccuucgagcg ggacaucagc 540
          accgagaucu accaagccgg cuccaccccu ugcaacggcg uggagggcuu caacugcuac 600
          uucccucugc agagcuacgg cuuccagccc accaacggcg ugggcuacca gcccuaccgg 660
          gugguggugc ugagcuucga gcugcugcac gccccagcca ccgugugugg ccccaagucu 720
          ggcggaggca gcggaggcgg aagccagugc gugaaccuga ccacccggac ccagcugcca 780
          ccagccuaca ccaacagcuu cacccggggc gucuacuacc ccgacaaggu guuccggagc 840
          agcguccugc acagcaccca ggaccuguuc cugcccuucu ucagcaacgu gaccugguuc 900
          cacgccaucc acgugagcgg caccaacggc accaagcggu ucgacaaccc cgugcugccc 960
          uucaacgacg gcguguacuu cgccagcacc gagaagagca acaucauccg gggcuggauc 1020
          uucggcacca cccuggacag caagacccag agccugcuga ucgugaauaa cgccaccaac 1080
          guggugauca aggugugcga guuccaguuc ugcaacgacc ccuuccuggg cguguacuac 1140
          cacaagaaca acaagagcug gauggagagc gaguuccggg uguacagcag cgccaacaac 1200
          ugcaccuucg aguacgugag ccagcccuuc cugauggacc uggagggcaa gcagggcaac 1260
          uucaagaacc ugcgggaguu cguguucaag aacaucgacg gcuacuucaa gaucuacagc 1320
          aagcacaccc caaucaaccu ggugcgggau cugccccagg gcuucucagc ccuggagccc 1380
          cugguggacc ugcccaucgg caucaacauc acccgguucc agacccugcu ggcccugcac 1440
          cggagcuacc ugaccccagg cgacagcagc agcgggugga cagcaggcgc ggcugcuuac 1500
          uacgugggcu accugcagcc ccggaccuuc cugcugaagu acaacgagaa cggcaccauc 1560
          accgacgccg uggacucugg cggaggcagc auccuggcca ucuacagcac cguggccagc 1620
          agccuggugc ugcuggugag ccugggcgcc aucagcuucu gauaauaggc uggagccucg 1680
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg 1740
          uacccccgug gucuuugaau aaagucugag ugggcggc 1778
           <![CDATA[ <210> 142]]>
           <![CDATA[ <211> 1602]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 142]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ggaggcgcca aguucguggc cgccuggacu 900
          cugaaggccg cagccggcgg acccaacauc accaaccugu gccccuucgg cgagguguuc 960
          aacgccaccc gguucgccag cguguacgcc uggaaccgga agcggaucag caacugcgug 1020
          gccgacuaca gcgugcugua caacagcgcc agcuucagca ccuucaagug cuacggcgug 1080
          agccccacca agcugaacga ccugugcuuc accaacgugu acgccgacag cuucgugauc 1140
          cguggcgacg aggugcggca gaucgcaccc ggccagacag gcaagaucgc cgacuacaac 1200
          uacaagcugc ccgacgacuu caccggcugc gugaucgccu ggaacagcaa caaccucgac 1260
          agcaaggugg gcggcaacua caacuaccug uaccggcugu uccggaagag caaccugaag 1320
          cccuucgagc gggacaucag caccgagauc uaccaagccg gcuccacccc uugcaacggc 1380
          guggagggcu ucaacugcua cuucccucug cagagcuacg gcuuccagcc caccaacggc 1440
          gugggcuacc agcccuaccg ggugguggug cugagcuucg agcugcugca cgccccagcc 1500
          accgugugug gccccaaguc uggcggaggc agcauccugg ccaucuacag caccguggcc 1560
          agcagccugg ugcugcuggu gagccugggc gccaucagcu uc 1602
           <![CDATA[ <210> 143]]>
           <![CDATA[ <211> 534]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 143]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Gly Gly Ala Lys Phe Val Ala Ala Trp Thr Leu Lys Ala Ala
              290 295 300
          Ala Gly Gly Pro Asn Ile Thr Asn Leu Cys Pro Phe Gly Glu Val Phe
          305 310 315 320
          Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn Arg Lys Arg Ile
                          325 330 335
          Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn Ser Ala Ser Phe
                      340 345 350
          Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys Leu Asn Asp Leu
                  355 360 365
          Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile Arg Gly Asp Glu
              370 375 380
          Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile Ala Asp Tyr Asn
          385 390 395 400
          Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile Ala Trp Asn Ser
                          405 410 415
          Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn Tyr Leu Tyr Arg
                      420 425 430
          Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg Asp Ile Ser Thr
                  435 440 445
          Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys Asn Gly Val Glu Gly Phe
              450 455 460
          Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln Pro Thr Asn Gly
          465 470 475 480
          Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser Phe Glu Leu Leu
                          485 490 495
          His Ala Pro Ala Thr Val Cys Gly Pro Lys Ser Gly Gly Gly Gly Ser Ile
                      500 505 510
          Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val Ser
                  515 520 525
          Leu Gly Ala Ile Ser Phe
              530
           <![CDATA[ <210> 144]]>
           <![CDATA[ <211> 1778]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 144]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacgga ggcgccaagu ucguggccgc cuggacucug 960
          aaggccgcag ccggcggacc caacaucacc aaccugugcc ccuucggcga gguguucaac 1020
          gccacccggu ucgccagcgu guacgccugg aaccggaagc ggaucagcaa cugcguggcc 1080
          gacuacagcg ugcuguacaa cagcgccagc uucagcaccu ucaagugcua cggcgugagc 1140
          cccaccaagc ugaacgaccu gugcuucacc aacguguacg ccgacagcuu cgugauccgu 1200
          ggcgacgagg ugcggcagau cgcacccggc cagacaggca agaucgccga cuacaacuac 1260
          aagcugcccg acgacuucac cggcugcgug aucgccugga acagcaacaa ccucgacagc 1320
          aaggugggcg gcaacuacaa cuaccuguac cggcuguucc ggaagagcaa ccugaagccc 1380
          uucgagcggg acaucagcac cgagaucuac caagccggcu ccaccccuug caacggcgug 1440
          gagggcuuca acugcuacuu cccucugcag agcuacggcu uccagcccac caacggcgug 1500
          ggcuaccagc ccuaccgggu gguggugcug agcuucgagc ugcugcacgc cccagccacc 1560
          guguguggcc ccaagucugg cggaggcagc auccuggcca ucuacagcac cguggccagc 1620
          agccuggugc ugcuggugag ccugggcgcc aucagcuucu gauaauaggc uggagccucg 1680
          guggccuagc uucuugcccc uugggccucc ccccagcccc uccuccccuu ccugcacccg 1740
          uacccccgug gucuuugaau aaagucugag ugggcggc 1778
           <![CDATA[ <210> 145]]>
           <![CDATA[ <211> 1785]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 145]]>
          auguacagca ugcagcuggc uagcugcgug acccugaccc uggugcugcu ggugaacagc 60
          cagggcgccg agaacagcgu ggccuacagc aacaacagca ucgccauccc caccaacuuc 120
          accaucagcg ugaccaccga gauucugccc gugagcauga ccaagaccag cguggacugc 180
          accauguaca ucugcggcga cagcaccgag ugcagcaacc ugcugcugca guacggcagc 240
          uucugcaccc agcugaaccg ggcccugacc ggcaucgccg uggagcagga caagaacacc 300
          caggaggugu ucgcccaggu gaagcagauc uacaagaccc cucccaucaa ggacuucggc 360
          ggcuucaacu ucagccagau ccugcccgac cccagcaagc ccagcaagcg gagcuucauc 420
          gaggaccugc uguucaacaa ggugacccua gccgacgccg gcuucaucaa gcaguacggc 480
          gacugccucg gcgacauagc cgcccgggac cugaucugcg cccagaaguu caacggccug 540
          accgugcugc cuccccugcu gaccgacgag augaucgccc aguacaccag cgcccuguua 600
          gccggaacca ucaccagcgg cuggacuuuc ggcgcuggag ccgcucugca gauccccuuc 660
          gccaugcaga uggccuaccg guucaacggc aucggcguga cccagaacgu gcuguacgag 720
          aaccagaagc ugaucgccaa ccaguucaac agcgccaucg gcaagaucca ggacagccug 780
          agcagcaccg cuagcgcccu gggcaagcug caggacgugg ugaaccagaa cgcccaggcc 840
          cugaacaccc uggugaagca gcugagcagc aacuucggcg ccaucagcag cgugcugaac 900
          gacauccuga gccggcugga cccucccgag gccgaggugc agaucgaccg gcugaucacu 960
          ggccggcugc agagccugca gaccuacgug acccagcagc ugauccgggc cgccgagauu 1020
          cgggccagcg ccaaccuggc cgccaccaag augagcgagu gcgugcuggg ccagagcaag 1080
          cgggguggacu ucugcggcaa gggcuaccac cugaugagcu uuccccagag cgcaccccac 1140
          ggaguggugu uccugcacgu gaccuacgug cccgcccagg agaagaacuu caccaccgcc 1200
          ccagccaucu gccacgacgg caaggcccac uuuccccggg agggcguguu cgugagcaac 1260
          ggcacccacu gguucgugac ccagcggaac uucuacgagc cccagaucau caccaccgac 1320
          aacaccuucg ugagcggcaa cugcgacgug gugaucggca ucgugaacaa caccguguac 1380
          gauccccugc agcccgagcu ggacagcuuc aaggaggagc uggacaagua cuucaagaau 1440
          cacaccagcc ccgacgugga ccugggcgac aucagcggca ucaacgccag cguggugaac 1500
          auccagaagg agaucgaucg gcugaacgag guggccaaga accugaacga gagccugauc 1560
          gaccugcagg agcugggcaa guacgagcag uacaucaagu ggcccuggua caucuggcug 1620
          ggcuucaucg ccggccugau cgccaucgug auggugacca ucaugcugug cugcaugacc 1680
          agcugcugca gcugccugaa gggcuguugc agcugcggca gcugcugcaa guucgacgag 1740
          gacgacagcg agcccgugcu gaagggcgug aagcugcacu acacc 1785
           <![CDATA[ <210> 146]]>
           <![CDATA[ <211> 595]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 146]]>
          Met Tyr Ser Met Gln Leu Ala Ser Cys Val Thr Leu Thr Leu Val Leu
          1 5 10 15
          Leu Val Asn Ser Gln Gly Ala Glu Asn Ser Val Ala Tyr Ser Asn Asn
                      20 25 30
          Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val Thr Glu Ile
                  35 40 45
          Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys Thr Met Tyr Ile
              50 55 60
          Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu Gln Tyr Gly Ser
          65 70 75 80
          Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile Ala Val Glu Gln
                          85 90 95
          Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys Gln Ile Tyr Lys
                      100 105 110
          Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe Ser Gln Ile Leu
                  115 120 125
          Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Phe Ile Glu Asp Leu Leu
              130 135 140
          Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile Lys Gln Tyr Gly
          145 150 155 160
          Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile Cys Ala Gln Lys
                          165 170 175
          Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr Asp Glu Met Ile
                      180 185 190
          Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile Thr Ser Gly Trp
                  195 200 205
          Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile Pro Phe Ala Met Gln Met
              210 215 220
          Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn Val Leu Tyr Glu
          225 230 235 240
          Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala Ile Gly Lys Ile
                          245 250 255
          Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala Leu Gly Lys Leu Gln Asp
                      260 265 270
          Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu Val Lys Gln Leu
                  275 280 285
          Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn Asp Ile Leu Ser
              290 295 300
          Arg Leu Asp Pro Pro Glu Ala Glu Val Gln Ile Asp Arg Leu Ile Thr
          305 310 315 320
          Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln Gln Leu Ile Arg
                          325 330 335
          Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala Thr Lys Met Ser
                      340 345 350
          Glu Cys Val Leu Gly Gln Ser Lys Arg Val Asp Phe Cys Gly Lys Gly
                  355 360 365
          Tyr His Leu Met Ser Phe Pro Gln Ser Ala Pro His Gly Val Val Phe
              370 375 380
          Leu His Val Thr Tyr Val Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala
          385 390 395 400
          Pro Ala Ile Cys His Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val
                          405 410 415
          Phe Val Ser Asn Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr
                      420 425 430
          Glu Pro Gln Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys
                  435 440 445
          Asp Val Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln
              450 455 460
          Pro Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
          465 470 475 480
          His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala
                          485 490 495
          Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala
                      500 505 510
          Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys Tyr
                  515 520 525
          Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu Gly Phe Ile Ala
              530 535 540
          Gly Leu Ile Ala Ile Val Met Val Thr Ile Met Leu Cys Cys Met Thr
          545 550 555 560
          Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys Ser Cys Gly Ser Cys Cys
                          565 570 575
          Lys Phe Asp Glu Asp Asp Ser Glu Pro Val Leu Lys Gly Val Lys Leu
                      580 585 590
          His Tyr Thr
                  595
           <![CDATA[ <210> 147]]>
           <![CDATA[ <211> 1961]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 147]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uacagcaugc agcuggcuag cugcgugacc cugacccugg ugcugcuggu gaacagccag 120
          ggcgccgaga acagcguggc cuacagcaac aacagcaucg ccauccccac caacuucacc 180
          aucagcguga ccaccgagau ucugcccgug agcaugacca agaccagcgu ggacugcacc 240
          auguacaucu gcggcgacag caccgagugc agcaaccugc ugcugcagua cggcagcuuc 300
          ugcacccagc ugaaccgggc ccugaccggc aucgccgugg agcaggacaa gaacacccag 360
          gagguguucg cccaggugaa gcagaucuac aagaccccuc ccaucaagga cuucggcggc 420
          uucaacuuca gccagauccu gcccgacccc agcaagccca gcaagcggag cuucaucgag 480
          gaccugcugu ucaacaaggu gacccuagcc gacgccggcu ucaucaagca guacggcgac 540
          ugccucggcg acauagccgc ccgggaccug aucugcgccc agaaguucaa cggccugacc 600
          gugcugccuc cccugcugac cgacgagaug aucgcccagu acaccagcgc ccuguuagcc 660
          ggaaccauca ccagcggcug gacuuucggc gcuggagccg cucugcagau ccccuucgcc 720
          augcagaugg ccuaccgguu caacggcauc ggcgugaccc agaacgugcu guacgagaac 780
          cagaagcuga ucgccaacca guucaacagc gccaucggca agauccagga cagccugagc 840
          agcaccgcua gcgcccuggg caagcugcag gacgugguga accagaacgc ccaggcccug 900
          aacacccugg ugaagcagcu gagcagcaac uucggcgcca ucagcagcgu gcugaacgac 960
          auccugagcc ggcuggaccc ucccgaggcc gaggugcaga ucgaccggcu gaucacuggc 1020
          cggcugcaga gccugcagac cuacgugacc cagcagcuga uccgggccgc cgagauucgg 1080
          gccagcgcca accuggccgc caccaagaug agcgagugcg ugcugggcca gagcaagcgg 1140
          guggacuucu gcggcaaggg cuaccaccug augagcuuuc cccagagcgc accccacgga 1200
          gugguguucc ugcacgugac cuacgugccc gcccaggaga agaacuucac caccgcccca 1260
          gccaucugcc acgacggcaa ggcccacuuu ccccgggagg gcguguucgu gagcaacggc 1320
          acccacuggu ucgugaccca gcggaacuuc uacgagcccc agaucaucac caccgacaac 1380
          accuucguga gcggcaacug cgacguggug aucggcaucg ugaacaacac cguguacgau 1440
          ccccugcagc ccgagcugga cagcuucaag gaggagcugg acaaguacuu caagaaucac 1500
          accagccccg acguggaccu gggcgacauc agcggcauca acgccagcgu ggugaacauc 1560
          cagaaggaga ucgaucggcu gaacgaggug gccaagaacc ugaacgagag ccugaucgac 1620
          cugcaggagc ugggcaagua cgagcaguac aucaaguggc ccugguacau cuggcugggc 1680
          uucaucgccg gccugaucgc caucgugaug gugaccauca ugcugugcug caugaccagc 1740
          ugcugcagcu gccugaaggg cuguugcagc ugcggcagcu gcugcaaguu cgacgaggac 1800
          gacagcgagc ccgugcugaa gggcgugaag cugcacuaca ccugauaaua ggcuggagcc 1860
          ucgguggccu agcuucuugc cccuugggcc uccccccagc cccuccuccc cuuccugcac 1920
          ccguaccccc guggucuuug aauaaagucu gagugggcgg c 1961
           <![CDATA[ <210> 148]]>
           <![CDATA[ <211> 13]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 148]]>
          Ala Lys Phe Val Ala Ala Trp Thr Leu Lys Ala Ala Ala
          1 5 10
           <![CDATA[ <210> 149]]>
           <![CDATA[ <211> 2216]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 149]]>
          gggaaauaag agagaaaaga agaguaagaa gaaauauaag accccggcgc cgccaccaug 60
          uucguguucc uggugcugcu gccccuggug agcagccagu gcgugaaccu gaccacccgg 120
          acccagcugc caccagccua caccaacagc uucacccggg gcgucuacua ccccgacaag 180
          guguuccgga gcagcguccu gcacagcacc caggaccugu uccugcccuu cuucagcaac 240
          gugaccuggu uccacgccau ccacgugagc ggcaccaacg gcaccaagcg guucgacaac 300
          cccgugcugc ccuucaacga cggcguguac uucgccagca ccgagaagag caacaucauc 360
          cggggcugga ucuucggcac cacccuggac agcaagaccc agagccugcu gaucgugaau 420
          aacgccacca acguggugau caaggugugc gaguuccagu ucugcaacga ccccuuccug 480
          ggcguguacu accacaagaa caacaagagc uggauggaga gcgaguuccg gguguacagc 540
          agcgccaaca acugcaccuu cgaguacgug agccagcccu uccugaugga ccuggagggc 600
          aagcagggca acuucaagaa ccugcgggag uucguguuca agaacaucga cggcuacuuc 660
          aagaucuaca gcaagcacac cccaaucaac cuggugcggg aucugcccca gggcuuca 720
          gcccuggagc cccuggugga ccugcccauc ggcaucaaca ucacccgguu ccagacccug 780
          cuggcccugc accggagcua ccugacccca ggcgacagca gcagcgggug gacagcaggc 840
          gcggcugcuu acuacguggg cuaccugcag ccccggaccu uccugcugaa guacaacgag 900
          aacggcacca ucaccgacgc cguggacugc gcccuggacc cucugagcga gaccaagugc 960
          acccugaaga gcuucaccgu ggagaagggc aucuaccaga ccagcaacuu ccgggugcag 1020
          cccaccgaga gcaucgugcg guuccccaac aucaccaacc ugugccccuu cggcgaggug 1080
          uucaacgcca cccgguucgc cagcguguac gccuggaacc ggaagcggau cagcaacugc 1140
          guggccgacu acagcgugcu guacaacagc gccagcuuca gcaccuucaa gugcuacggc 1200
          gugagcccca ccaagcugaa cgaccugugc uucaccaacg uguacgccga cagcuucgug 1260
          auccguggcg acgaggugcg gcagaucgca cccggccaga caggcaagau cgccgacuac 1320
          aacuacaagc ugcccgacga cuucaccggc ugcgugaucg ccuggaacag caacaaccuc 1380
          gacagcaagg ugggcggcaa cuacaacuac cuguaccggc uguuccggaa gagcaaccug 1440
          aagcccuucg agcgggacau cagcaccgag aucuaccaag ccggcuccac cccuugcaac 1500
          ggcguggagg gcuucaacug cuacuucccu cugcagagcu acggcuucca gcccaccaac 1560
          ggcgugggcu accagcccua ccggguggug gugcugagcu ucgagcugcu gcacgcccca 1620
          gccaccgugu guggccccaa gaagagcacc aaccugguga agaacaagug cgugaacuuc 1680
          aacuucaacg gccuuaccgg caccggcgug cugaccgaga gcaacaagaa auuccugccc 1740
          uuucagcagu ucggccggga caucgccgac accaccgacg cugugcggga uccccagacc 1800
          cuggagaucc uggacaucac cccuugcagc uucggcggcg ugagcgugau caccccaggc 1860
          accaacacca gcaaccaggu ggccgugcug uaccaggacg ugaacugcac cgaggugccc 1920
          guggccaucc acgccgacca gcugacaccc accuggcggg ucuacagcac cggcagcaac 1980
          guguuccaga cccgggccgg uugccugauc ggcgccgagc acgugaacaa cagcuacgag 2040
          ugcgacaucc ccaucggcgc cggcaucugu gccagcuacc agacccagac caauucauga 2100
          uaauaggcug gagccucggu ggccuagcuu cuugccccuu gggccucccc ccagccccuc 2160
          cuccccuucc ugcacccgua cccccguggu cuuugaauaa agucugagug ggcggc 2216
           <![CDATA[ <210> 150]]>
           <![CDATA[ <211> 2040]]>
           <![CDATA[ <212> RNA]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 150]]>
          auguucgugu uccuggugcu gcugccccug gugagcagcc agugcgugaa ccugaccacc 60
          cggacccagc ugccaccagc cuacaccaac agcuucaccc ggggcgucua cuaccccgac 120
          aagguguucc ggagcagcgu ccugcacagc acccaggacc uguuccugcc cuucuucagc 180
          aacgugaccu gguuccacgc cauccacgug agcggcacca acggcaccaa gcgguucgac 240
          aaccccgugc ugcccuucaa cgacggcgug uacuucgcca gcaccgagaa gagcaacauc 300
          auccggggcu ggaucuucgg caccacccug gacagcaaga cccagagccu gcugaucgug 360
          aauaacgcca ccaacguggu gaucaaggug ugcgaguucc aguucugcaa cgaccccuuc 420
          cugggcgugu acuaccacaa gaacaacaag agcuggaugg agagcgaguu ccggguguac 480
          agcagcgcca acaacugcac cuucgaguac gugagccagc ccuuccugau ggaccuggag 540
          ggcaagcagg gcaacuucaa gaaccugcgg gaguucgugu ucaagaacau cgacggcuac 600
          uucaagaucu acagcaagca caccccaauc aaccuggugc gggaucugcc ccagggcuuc 660
          ucagcccugg agccccuggu ggaccugccc aucggcauca acaucacccg guuccagacc 720
          cugcuggccc ugcaccggag cuaccugacc ccaggcgaca gcagcagcgg guggacagca 780
          ggcgcggcug cuuacuacgu gggcuaccug cagccccgga ccuuccugcu gaaguacaac 840
          gagaacggca ccaucaccga cgccguggac ugcgcccugg acccucugag cgagaccaag 900
          ugcacccuga agagcuucac cguggagaag ggcaucuacc agaccagcaa cuuccgggug 960
          cagcccaccg agagcaucgu gcgguucccc aacaucacca accugugccc cuucggcgag 1020
          guguucaacg ccacccgguu cgccagcgug uacgccugga accggaagcg gaucagcaac 1080
          ugcguggccg acuacagcgu gcuguacaac agcgccagcu ucagcaccuu caagugcuac 1140
          ggcgugagcc ccaccaagcu gaacgaccug ugcuucacca acguguacgc cgacagcuuc 1200
          gugauccgug gcgacgaggu gcggcagauc gcacccggcc agacaggcaa gaucgccgac 1260
          uacaacuaca agcugcccga cgacuucacc ggcugcguga ucgccuggaa cagcaacaac 1320
          cucgacagca aggugggcgg caacuacaac uaccuguacc ggcuguuccg gaagagcaac 1380
          cugaagcccu ucgagcggga caucagcacc gagaucuacc aagccggcuc caccccuugc 1440
          aacggcgugg agggcuucaa cugcuacuuc ccucugcaga gcuacggcuu ccagcccacc 1500
          aacggcgugg gcuaccagcc cuaccgggug guggugcuga gcuucgagcu gcugcacgcc 1560
          ccagccaccg uguguggccc caagaagagc accaaccugg ugaagaacaa gugcgugaac 1620
          uucaacuuca acggccuuac cggcaccggc gugcugaccg agagcaacaa gaaauuccug 1680
          cccuuucagc aguucggccg ggacaucgcc gacaccaccg acgcugugcg ggauccccag 1740
          acccuggaga uccuggacau caccccuugc agcuucggcg gcgugagcgu gaucacccca 1800
          ggcaccaaca ccagcaacca gguggccgug cuguaccagg acgugaacug caccgaggug 1860
          cccguggcca uccacgccga ccagcugaca cccaccuggc gggucuacag caccggcagc 1920
          aacguguucc agacccgggc cgguugccug aucggcgccg agcacgugaa caacagcuac 1980
          gagugcgaca uccccaucgg cgccggcauc ugugccagcu accagaccca gaccaauuca 2040
           <![CDATA[ <210> 151]]>
           <![CDATA[ <211> 680]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 151]]>
          Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
          1 5 10 15
          Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
                      20 25 30
          Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
                  35 40 45
          His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
              50 55 60
          Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
          65 70 75 80
          Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
                          85 90 95
          Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
                      100 105 110
          Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
                  115 120 125
          Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
              130 135 140
          Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
          145 150 155 160
          Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
                          165 170 175
          Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
                      180 185 190
          Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
                  195 200 205
          Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
              210 215 220
          Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
          225 230 235 240
          Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
                          245 250 255
          Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
                      260 265 270
          Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
                  275 280 285
          Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
              290 295 300
          Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
          305 310 315 320
          Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
                          325 330 335
          Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
                      340 345 350
          Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
                  355 360 365
          Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
              370 375 380
          Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
          385 390 395 400
          Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
                          405 410 415
          Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
                      420 425 430
          Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
                  435 440 445
          Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
              450 455 460
          Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
          465 470 475 480
          Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
                          485 490 495
          Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
                      500 505 510
          Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
                  515 520 525
          Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
              530 535 540
          Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
          545 550 555 560
          Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
                          565 570 575
          Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
                      580 585 590
          Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
                  595 600 605
          Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
              610 615 620
          His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
          625 630 635 640
          Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
                          645 650 655
          Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
                      660 665 670
          Ser Tyr Gln Thr Gln Thr Asn Ser
                  675 680
           <![CDATA[ <210> 152]]>
           <![CDATA[ <211> 4]]>
           <![CDATA[ <212> PRT]]>
           <![CDATA[ <213> Artificial Sequence]]>
           <![CDATA[ <220>]]>
           <![CDATA[ <223> Synthesis]]>
           <![CDATA[ <400> 152]]>
          Gly Ser Gly Gly
          1               
          
      

Figure 12_A0101_SEQ_0001
Figure 12_A0101_SEQ_0001

Figure 12_A0101_SEQ_0002
Figure 12_A0101_SEQ_0002

Figure 12_A0101_SEQ_0003
Figure 12_A0101_SEQ_0003

Figure 12_A0101_SEQ_0004
Figure 12_A0101_SEQ_0004

Figure 12_A0101_SEQ_0005
Figure 12_A0101_SEQ_0005

Figure 12_A0101_SEQ_0006
Figure 12_A0101_SEQ_0006

Figure 12_A0101_SEQ_0007
Figure 12_A0101_SEQ_0007

Figure 12_A0101_SEQ_0008
Figure 12_A0101_SEQ_0008

Figure 12_A0101_SEQ_0009
Figure 12_A0101_SEQ_0009

Figure 12_A0101_SEQ_0010
Figure 12_A0101_SEQ_0010

Figure 12_A0101_SEQ_0011
Figure 12_A0101_SEQ_0011

Figure 12_A0101_SEQ_0012
Figure 12_A0101_SEQ_0012

Figure 12_A0101_SEQ_0013
Figure 12_A0101_SEQ_0013

Figure 12_A0101_SEQ_0014
Figure 12_A0101_SEQ_0014

Figure 12_A0101_SEQ_0015
Figure 12_A0101_SEQ_0015

Figure 12_A0101_SEQ_0016
Figure 12_A0101_SEQ_0016

Figure 12_A0101_SEQ_0017
Figure 12_A0101_SEQ_0017

Figure 12_A0101_SEQ_0018
Figure 12_A0101_SEQ_0018

Figure 12_A0101_SEQ_0019
Figure 12_A0101_SEQ_0019

Figure 12_A0101_SEQ_0020
Figure 12_A0101_SEQ_0020

Figure 12_A0101_SEQ_0021
Figure 12_A0101_SEQ_0021

Figure 12_A0101_SEQ_0022
Figure 12_A0101_SEQ_0022

Figure 12_A0101_SEQ_0023
Figure 12_A0101_SEQ_0023

Figure 12_A0101_SEQ_0024
Figure 12_A0101_SEQ_0024

Figure 12_A0101_SEQ_0025
Figure 12_A0101_SEQ_0025

Figure 12_A0101_SEQ_0026
Figure 12_A0101_SEQ_0026

Figure 12_A0101_SEQ_0027
Figure 12_A0101_SEQ_0027

Figure 12_A0101_SEQ_0028
Figure 12_A0101_SEQ_0028

Figure 12_A0101_SEQ_0029
Figure 12_A0101_SEQ_0029

Figure 12_A0101_SEQ_0030
Figure 12_A0101_SEQ_0030

Figure 12_A0101_SEQ_0031
Figure 12_A0101_SEQ_0031

Figure 12_A0101_SEQ_0032
Figure 12_A0101_SEQ_0032

Figure 12_A0101_SEQ_0033
Figure 12_A0101_SEQ_0033

Figure 12_A0101_SEQ_0034
Figure 12_A0101_SEQ_0034

Figure 12_A0101_SEQ_0035
Figure 12_A0101_SEQ_0035

Figure 12_A0101_SEQ_0036
Figure 12_A0101_SEQ_0036

Figure 12_A0101_SEQ_0037
Figure 12_A0101_SEQ_0037

Figure 12_A0101_SEQ_0038
Figure 12_A0101_SEQ_0038

Figure 12_A0101_SEQ_0039
Figure 12_A0101_SEQ_0039

Figure 12_A0101_SEQ_0040
Figure 12_A0101_SEQ_0040

Figure 12_A0101_SEQ_0041
Figure 12_A0101_SEQ_0041

Figure 12_A0101_SEQ_0042
Figure 12_A0101_SEQ_0042

Figure 12_A0101_SEQ_0043
Figure 12_A0101_SEQ_0043

Figure 12_A0101_SEQ_0044
Figure 12_A0101_SEQ_0044

Figure 12_A0101_SEQ_0045
Figure 12_A0101_SEQ_0045

Figure 12_A0101_SEQ_0046
Figure 12_A0101_SEQ_0046

Figure 12_A0101_SEQ_0047
Figure 12_A0101_SEQ_0047

Figure 12_A0101_SEQ_0048
Figure 12_A0101_SEQ_0048

Figure 12_A0101_SEQ_0049
Figure 12_A0101_SEQ_0049

Figure 12_A0101_SEQ_0050
Figure 12_A0101_SEQ_0050

Figure 12_A0101_SEQ_0051
Figure 12_A0101_SEQ_0051

Figure 12_A0101_SEQ_0052
Figure 12_A0101_SEQ_0052

Figure 12_A0101_SEQ_0053
Figure 12_A0101_SEQ_0053

Figure 12_A0101_SEQ_0054
Figure 12_A0101_SEQ_0054

Figure 12_A0101_SEQ_0055
Figure 12_A0101_SEQ_0055

Figure 12_A0101_SEQ_0056
Figure 12_A0101_SEQ_0056

Figure 12_A0101_SEQ_0057
Figure 12_A0101_SEQ_0057

Figure 12_A0101_SEQ_0058
Figure 12_A0101_SEQ_0058

Figure 12_A0101_SEQ_0059
Figure 12_A0101_SEQ_0059

Figure 12_A0101_SEQ_0060
Figure 12_A0101_SEQ_0060

Figure 12_A0101_SEQ_0061
Figure 12_A0101_SEQ_0061

Figure 12_A0101_SEQ_0062
Figure 12_A0101_SEQ_0062

Figure 12_A0101_SEQ_0063
Figure 12_A0101_SEQ_0063

Figure 12_A0101_SEQ_0064
Figure 12_A0101_SEQ_0064

Figure 12_A0101_SEQ_0065
Figure 12_A0101_SEQ_0065

Figure 12_A0101_SEQ_0066
Figure 12_A0101_SEQ_0066

Figure 12_A0101_SEQ_0067
Figure 12_A0101_SEQ_0067

Figure 12_A0101_SEQ_0068
Figure 12_A0101_SEQ_0068

Figure 12_A0101_SEQ_0069
Figure 12_A0101_SEQ_0069

Figure 12_A0101_SEQ_0070
Figure 12_A0101_SEQ_0070

Figure 12_A0101_SEQ_0071
Figure 12_A0101_SEQ_0071

Figure 12_A0101_SEQ_0072
Figure 12_A0101_SEQ_0072

Figure 12_A0101_SEQ_0073
Figure 12_A0101_SEQ_0073

Figure 12_A0101_SEQ_0074
Figure 12_A0101_SEQ_0074

Figure 12_A0101_SEQ_0075
Figure 12_A0101_SEQ_0075

Figure 12_A0101_SEQ_0076
Figure 12_A0101_SEQ_0076

Figure 12_A0101_SEQ_0077
Figure 12_A0101_SEQ_0077

Figure 12_A0101_SEQ_0078
Figure 12_A0101_SEQ_0078

Figure 12_A0101_SEQ_0079
Figure 12_A0101_SEQ_0079

Figure 12_A0101_SEQ_0080
Figure 12_A0101_SEQ_0080

Figure 12_A0101_SEQ_0081
Figure 12_A0101_SEQ_0081

Figure 12_A0101_SEQ_0082
Figure 12_A0101_SEQ_0082

Figure 12_A0101_SEQ_0083
Figure 12_A0101_SEQ_0083

Figure 12_A0101_SEQ_0084
Figure 12_A0101_SEQ_0084

Figure 12_A0101_SEQ_0085
Figure 12_A0101_SEQ_0085

Figure 12_A0101_SEQ_0086
Figure 12_A0101_SEQ_0086

Figure 12_A0101_SEQ_0087
Figure 12_A0101_SEQ_0087

Figure 12_A0101_SEQ_0088
Figure 12_A0101_SEQ_0088

Figure 12_A0101_SEQ_0089
Figure 12_A0101_SEQ_0089

Figure 12_A0101_SEQ_0090
Figure 12_A0101_SEQ_0090

Figure 12_A0101_SEQ_0091
Figure 12_A0101_SEQ_0091

Figure 12_A0101_SEQ_0092
Figure 12_A0101_SEQ_0092

Figure 12_A0101_SEQ_0093
Figure 12_A0101_SEQ_0093

Figure 12_A0101_SEQ_0094
Figure 12_A0101_SEQ_0094

Figure 12_A0101_SEQ_0095
Figure 12_A0101_SEQ_0095

Figure 12_A0101_SEQ_0096
Figure 12_A0101_SEQ_0096

Figure 12_A0101_SEQ_0097
Figure 12_A0101_SEQ_0097

Figure 12_A0101_SEQ_0098
Figure 12_A0101_SEQ_0098

Figure 12_A0101_SEQ_0099
Figure 12_A0101_SEQ_0099

Figure 12_A0101_SEQ_0100
Figure 12_A0101_SEQ_0100

Figure 12_A0101_SEQ_0101
Figure 12_A0101_SEQ_0101

Figure 12_A0101_SEQ_0102
Figure 12_A0101_SEQ_0102

Figure 12_A0101_SEQ_0103
Figure 12_A0101_SEQ_0103

Figure 12_A0101_SEQ_0104
Figure 12_A0101_SEQ_0104

Figure 12_A0101_SEQ_0105
Figure 12_A0101_SEQ_0105

Figure 12_A0101_SEQ_0106
Figure 12_A0101_SEQ_0106

Figure 12_A0101_SEQ_0107
Figure 12_A0101_SEQ_0107

Figure 12_A0101_SEQ_0108
Figure 12_A0101_SEQ_0108

Figure 12_A0101_SEQ_0109
Figure 12_A0101_SEQ_0109

Figure 12_A0101_SEQ_0110
Figure 12_A0101_SEQ_0110

Figure 12_A0101_SEQ_0111
Figure 12_A0101_SEQ_0111

Figure 12_A0101_SEQ_0112
Figure 12_A0101_SEQ_0112

Figure 12_A0101_SEQ_0113
Figure 12_A0101_SEQ_0113

Figure 12_A0101_SEQ_0114
Figure 12_A0101_SEQ_0114

Figure 12_A0101_SEQ_0115
Figure 12_A0101_SEQ_0115

Figure 12_A0101_SEQ_0116
Figure 12_A0101_SEQ_0116

Figure 12_A0101_SEQ_0117
Figure 12_A0101_SEQ_0117

Figure 12_A0101_SEQ_0118
Figure 12_A0101_SEQ_0118

Figure 12_A0101_SEQ_0119
Figure 12_A0101_SEQ_0119

Figure 12_A0101_SEQ_0120
Figure 12_A0101_SEQ_0120

Figure 12_A0101_SEQ_0121
Figure 12_A0101_SEQ_0121

Figure 12_A0101_SEQ_0122
Figure 12_A0101_SEQ_0122

Figure 12_A0101_SEQ_0123
Figure 12_A0101_SEQ_0123

Figure 12_A0101_SEQ_0124
Figure 12_A0101_SEQ_0124

Figure 12_A0101_SEQ_0125
Figure 12_A0101_SEQ_0125

Figure 12_A0101_SEQ_0126
Figure 12_A0101_SEQ_0126

Figure 12_A0101_SEQ_0127
Figure 12_A0101_SEQ_0127

Figure 12_A0101_SEQ_0128
Figure 12_A0101_SEQ_0128

Figure 12_A0101_SEQ_0129
Figure 12_A0101_SEQ_0129

Figure 12_A0101_SEQ_0130
Figure 12_A0101_SEQ_0130

Figure 12_A0101_SEQ_0131
Figure 12_A0101_SEQ_0131

Figure 12_A0101_SEQ_0132
Figure 12_A0101_SEQ_0132

Figure 12_A0101_SEQ_0133
Figure 12_A0101_SEQ_0133

Figure 12_A0101_SEQ_0134
Figure 12_A0101_SEQ_0134

Figure 12_A0101_SEQ_0135
Figure 12_A0101_SEQ_0135

Figure 12_A0101_SEQ_0136
Figure 12_A0101_SEQ_0136

Figure 12_A0101_SEQ_0137
Figure 12_A0101_SEQ_0137

Figure 12_A0101_SEQ_0138
Figure 12_A0101_SEQ_0138

Figure 12_A0101_SEQ_0139
Figure 12_A0101_SEQ_0139

Figure 12_A0101_SEQ_0140
Figure 12_A0101_SEQ_0140

Figure 12_A0101_SEQ_0141
Figure 12_A0101_SEQ_0141

Figure 12_A0101_SEQ_0142
Figure 12_A0101_SEQ_0142

Figure 12_A0101_SEQ_0143
Figure 12_A0101_SEQ_0143

Figure 12_A0101_SEQ_0144
Figure 12_A0101_SEQ_0144

Figure 12_A0101_SEQ_0145
Figure 12_A0101_SEQ_0145

Figure 12_A0101_SEQ_0146
Figure 12_A0101_SEQ_0146

Figure 12_A0101_SEQ_0147
Figure 12_A0101_SEQ_0147

Figure 12_A0101_SEQ_0148
Figure 12_A0101_SEQ_0148

Figure 12_A0101_SEQ_0149
Figure 12_A0101_SEQ_0149

Figure 12_A0101_SEQ_0150
Figure 12_A0101_SEQ_0150

Figure 12_A0101_SEQ_0151
Figure 12_A0101_SEQ_0151

Figure 12_A0101_SEQ_0152
Figure 12_A0101_SEQ_0152

Figure 12_A0101_SEQ_0153
Figure 12_A0101_SEQ_0153

Figure 12_A0101_SEQ_0154
Figure 12_A0101_SEQ_0154

Figure 12_A0101_SEQ_0155
Figure 12_A0101_SEQ_0155

Figure 12_A0101_SEQ_0156
Figure 12_A0101_SEQ_0156

Figure 12_A0101_SEQ_0157
Figure 12_A0101_SEQ_0157

Figure 12_A0101_SEQ_0158
Figure 12_A0101_SEQ_0158

Figure 12_A0101_SEQ_0159
Figure 12_A0101_SEQ_0159

Figure 12_A0101_SEQ_0160
Figure 12_A0101_SEQ_0160

Figure 12_A0101_SEQ_0161
Figure 12_A0101_SEQ_0161

Figure 12_A0101_SEQ_0162
Figure 12_A0101_SEQ_0162

Figure 12_A0101_SEQ_0163
Figure 12_A0101_SEQ_0163

Figure 12_A0101_SEQ_0164
Figure 12_A0101_SEQ_0164

Figure 12_A0101_SEQ_0165
Figure 12_A0101_SEQ_0165

Figure 12_A0101_SEQ_0166
Figure 12_A0101_SEQ_0166

Figure 12_A0101_SEQ_0167
Figure 12_A0101_SEQ_0167

Figure 12_A0101_SEQ_0168
Figure 12_A0101_SEQ_0168

Figure 12_A0101_SEQ_0169
Figure 12_A0101_SEQ_0169

Figure 12_A0101_SEQ_0170
Figure 12_A0101_SEQ_0170

Figure 12_A0101_SEQ_0171
Figure 12_A0101_SEQ_0171

Figure 12_A0101_SEQ_0172
Figure 12_A0101_SEQ_0172

Figure 12_A0101_SEQ_0173
Figure 12_A0101_SEQ_0173

Figure 12_A0101_SEQ_0174
Figure 12_A0101_SEQ_0174

Figure 12_A0101_SEQ_0175
Figure 12_A0101_SEQ_0175

Figure 12_A0101_SEQ_0176
Figure 12_A0101_SEQ_0176

Figure 12_A0101_SEQ_0177
Figure 12_A0101_SEQ_0177

Figure 12_A0101_SEQ_0178
Figure 12_A0101_SEQ_0178

Figure 12_A0101_SEQ_0179
Figure 12_A0101_SEQ_0179

Figure 12_A0101_SEQ_0180
Figure 12_A0101_SEQ_0180

Figure 12_A0101_SEQ_0181
Figure 12_A0101_SEQ_0181

Figure 12_A0101_SEQ_0182
Figure 12_A0101_SEQ_0182

Figure 12_A0101_SEQ_0183
Figure 12_A0101_SEQ_0183

Figure 12_A0101_SEQ_0184
Figure 12_A0101_SEQ_0184

Figure 12_A0101_SEQ_0185
Figure 12_A0101_SEQ_0185

Figure 12_A0101_SEQ_0186
Figure 12_A0101_SEQ_0186

Figure 12_A0101_SEQ_0187
Figure 12_A0101_SEQ_0187

Figure 12_A0101_SEQ_0188
Figure 12_A0101_SEQ_0188

Figure 12_A0101_SEQ_0189
Figure 12_A0101_SEQ_0189

Figure 12_A0101_SEQ_0190
Figure 12_A0101_SEQ_0190

Figure 12_A0101_SEQ_0191
Figure 12_A0101_SEQ_0191

Figure 12_A0101_SEQ_0192
Figure 12_A0101_SEQ_0192

Figure 12_A0101_SEQ_0193
Figure 12_A0101_SEQ_0193

Figure 12_A0101_SEQ_0194
Figure 12_A0101_SEQ_0194

Figure 12_A0101_SEQ_0195
Figure 12_A0101_SEQ_0195

Figure 12_A0101_SEQ_0196
Figure 12_A0101_SEQ_0196

Figure 12_A0101_SEQ_0197
Figure 12_A0101_SEQ_0197

Figure 12_A0101_SEQ_0198
Figure 12_A0101_SEQ_0198

Figure 12_A0101_SEQ_0199
Figure 12_A0101_SEQ_0199

Figure 12_A0101_SEQ_0200
Figure 12_A0101_SEQ_0200

Figure 12_A0101_SEQ_0201
Figure 12_A0101_SEQ_0201

Figure 12_A0101_SEQ_0202
Figure 12_A0101_SEQ_0202

Figure 12_A0101_SEQ_0203
Figure 12_A0101_SEQ_0203

Figure 12_A0101_SEQ_0204
Figure 12_A0101_SEQ_0204

Figure 12_A0101_SEQ_0205
Figure 12_A0101_SEQ_0205

Figure 12_A0101_SEQ_0206
Figure 12_A0101_SEQ_0206

Figure 12_A0101_SEQ_0207
Figure 12_A0101_SEQ_0207

Figure 12_A0101_SEQ_0208
Figure 12_A0101_SEQ_0208

Figure 12_A0101_SEQ_0209
Figure 12_A0101_SEQ_0209

Figure 12_A0101_SEQ_0210
Figure 12_A0101_SEQ_0210

Figure 12_A0101_SEQ_0211
Figure 12_A0101_SEQ_0211

Figure 12_A0101_SEQ_0212
Figure 12_A0101_SEQ_0212

Figure 12_A0101_SEQ_0213
Figure 12_A0101_SEQ_0213

Figure 12_A0101_SEQ_0214
Figure 12_A0101_SEQ_0214

Figure 12_A0101_SEQ_0215
Figure 12_A0101_SEQ_0215

Figure 12_A0101_SEQ_0216
Figure 12_A0101_SEQ_0216

Figure 12_A0101_SEQ_0217
Figure 12_A0101_SEQ_0217

Figure 12_A0101_SEQ_0218
Figure 12_A0101_SEQ_0218

Figure 12_A0101_SEQ_0219
Figure 12_A0101_SEQ_0219

Figure 12_A0101_SEQ_0220
Figure 12_A0101_SEQ_0220

Figure 12_A0101_SEQ_0221
Figure 12_A0101_SEQ_0221

Figure 12_A0101_SEQ_0222
Figure 12_A0101_SEQ_0222

Figure 12_A0101_SEQ_0223
Figure 12_A0101_SEQ_0223

Figure 12_A0101_SEQ_0224
Figure 12_A0101_SEQ_0224

Figure 12_A0101_SEQ_0225
Figure 12_A0101_SEQ_0225

Figure 12_A0101_SEQ_0226
Figure 12_A0101_SEQ_0226

Figure 12_A0101_SEQ_0227
Figure 12_A0101_SEQ_0227

Figure 12_A0101_SEQ_0228
Figure 12_A0101_SEQ_0228

Figure 12_A0101_SEQ_0229
Figure 12_A0101_SEQ_0229

Figure 12_A0101_SEQ_0230
Figure 12_A0101_SEQ_0230

Figure 12_A0101_SEQ_0231
Figure 12_A0101_SEQ_0231

Figure 12_A0101_SEQ_0232
Figure 12_A0101_SEQ_0232

Figure 12_A0101_SEQ_0233
Figure 12_A0101_SEQ_0233

Figure 12_A0101_SEQ_0234
Figure 12_A0101_SEQ_0234

Figure 12_A0101_SEQ_0235
Figure 12_A0101_SEQ_0235

Figure 12_A0101_SEQ_0236
Figure 12_A0101_SEQ_0236

Figure 12_A0101_SEQ_0237
Figure 12_A0101_SEQ_0237

Figure 12_A0101_SEQ_0238
Figure 12_A0101_SEQ_0238

Figure 12_A0101_SEQ_0239
Figure 12_A0101_SEQ_0239

Figure 12_A0101_SEQ_0240
Figure 12_A0101_SEQ_0240

Figure 12_A0101_SEQ_0241
Figure 12_A0101_SEQ_0241

Figure 12_A0101_SEQ_0242
Figure 12_A0101_SEQ_0242

Figure 12_A0101_SEQ_0243
Figure 12_A0101_SEQ_0243

Figure 12_A0101_SEQ_0244
Figure 12_A0101_SEQ_0244

Figure 12_A0101_SEQ_0245
Figure 12_A0101_SEQ_0245

Figure 12_A0101_SEQ_0246
Figure 12_A0101_SEQ_0246

Figure 12_A0101_SEQ_0247
Figure 12_A0101_SEQ_0247

Figure 12_A0101_SEQ_0248
Figure 12_A0101_SEQ_0248

Figure 12_A0101_SEQ_0249
Figure 12_A0101_SEQ_0249

Figure 12_A0101_SEQ_0250
Figure 12_A0101_SEQ_0250

Figure 12_A0101_SEQ_0251
Figure 12_A0101_SEQ_0251

Figure 12_A0101_SEQ_0252
Figure 12_A0101_SEQ_0252

Figure 12_A0101_SEQ_0253
Figure 12_A0101_SEQ_0253

Figure 12_A0101_SEQ_0254
Figure 12_A0101_SEQ_0254

Figure 12_A0101_SEQ_0255
Figure 12_A0101_SEQ_0255

Figure 12_A0101_SEQ_0256
Figure 12_A0101_SEQ_0256

Figure 12_A0101_SEQ_0257
Figure 12_A0101_SEQ_0257

Figure 12_A0101_SEQ_0258
Figure 12_A0101_SEQ_0258

Figure 12_A0101_SEQ_0259
Figure 12_A0101_SEQ_0259

Figure 12_A0101_SEQ_0260
Figure 12_A0101_SEQ_0260

Figure 12_A0101_SEQ_0261
Figure 12_A0101_SEQ_0261

Figure 12_A0101_SEQ_0262
Figure 12_A0101_SEQ_0262

Figure 12_A0101_SEQ_0263
Figure 12_A0101_SEQ_0263

Figure 12_A0101_SEQ_0264
Figure 12_A0101_SEQ_0264

Figure 12_A0101_SEQ_0265
Figure 12_A0101_SEQ_0265

Figure 12_A0101_SEQ_0266
Figure 12_A0101_SEQ_0266

Figure 12_A0101_SEQ_0267
Figure 12_A0101_SEQ_0267

Figure 12_A0101_SEQ_0268
Figure 12_A0101_SEQ_0268

Figure 12_A0101_SEQ_0269
Figure 12_A0101_SEQ_0269

Figure 12_A0101_SEQ_0270
Figure 12_A0101_SEQ_0270

Figure 12_A0101_SEQ_0271
Figure 12_A0101_SEQ_0271

Figure 12_A0101_SEQ_0272
Figure 12_A0101_SEQ_0272

Figure 12_A0101_SEQ_0273
Figure 12_A0101_SEQ_0273

Claims (154)

一種信使核糖核酸(mRNA),其包含編碼SARS-CoV-2刺突蛋白之至少兩個結構域且小於全長刺突蛋白之開放閱讀框(ORF)。A messenger ribonucleic acid (mRNA) comprising at least two domains encoding the SARS-CoV-2 spike protein and an open reading frame (ORF) smaller than the full-length spike protein. 如請求項1之mRNA,其中該兩個結構域中之一者係SARS-CoV-2刺突蛋白之N末端結構域(NTD)。The mRNA of claim 1, wherein one of the two domains is the N-terminal domain (NTD) of the SARS-CoV-2 spike protein. 如請求項1或2之mRNA,其中該兩個結構域中之一者係SARS-CoV-2刺突蛋白之受體結合結構域(RBD)。The mRNA of claim 1 or 2, wherein one of the two domains is the receptor binding domain (RBD) of the SARS-CoV-2 spike protein. 如請求項2或3之mRNA,其中該ORF編碼與該NTD及/或該RBD連接之跨膜結構域(TD)。The mRNA of claim 2 or 3, wherein the ORF encodes a transmembrane domain (TD) linked to the NTD and/or the RBD. 如請求項4之mRNA,其中該TD係流行性感冒血球凝集素跨膜結構域。The mRNA of claim 4, wherein the TD is an influenza hemagglutinin transmembrane domain. 如請求項3或4之mRNA,其中該ORF包含NTD - RBD - TM。The mRNA of claim 3 or 4, wherein the ORF comprises NTD-RBD-TM. 如請求項1至6中任一項之mRNA,其中該至少兩個結構域經由可裂解之連接體或不可裂解之連接體連接。The mRNA of any one of claims 1 to 6, wherein the at least two domains are linked via a cleavable linker or a non-cleavable linker. 如請求項7之mRNA,其中該不可裂解之連接體係甘胺酸-絲胺酸(GS)連接體。The mRNA of claim 7, wherein the non-cleavable linker is a glycine-serine (GS) linker. 如請求項8之mRNA,其中該GS連接體包含4至15個胺基酸。The mRNA of claim 8, wherein the GS linker comprises 4 to 15 amino acids. 如請求項6之mRNA,其中該連接體係泛HLA DR結合抗原决定基(pan HLA DR-binding epitope)(PADRE)。The mRNA of claim 6, wherein the linker system is a pan HLA DR-binding epitope (PADRE). 如請求項1至10中任一項之mRNA,其中該ORF編碼信號肽。The mRNA of any one of claims 1 to 10, wherein the ORF encodes a signal peptide. 如請求項11之mRNA,其中該信號肽與該NTD連接。The mRNA of claim 11, wherein the signal peptide is linked to the NTD. 如請求項11之mRNA,其中該信號肽與該RBD連接。The mRNA of claim 11, wherein the signal peptide is linked to the RBD. 如請求項11至13中任一項之mRNA,其中該信號肽對SARS-CoV-2係異源性。The mRNA of any one of claims 11 to 13, wherein the signal peptide is heterologous to SARS-CoV-2. 如請求項1至3中任一項之mRNA,其中該至少兩個結構域係可溶性。The mRNA of any one of claims 1 to 3, wherein the at least two domains are soluble. 如請求項1至14中任一項之mRNA,其中該ORF編碼運輸信號結構域。The mRNA of any one of claims 1 to 14, wherein the ORF encodes a trafficking signaling domain. 如請求項16之mRNA,其中該運輸信號結構域係巨噬細胞標記物。The mRNA of claim 16, wherein the trafficking signaling domain is a macrophage marker. 如請求項16之mRNA,其中該巨噬細胞標記物係CD86及/或CD11b。The mRNA of claim 16, wherein the macrophage marker is CD86 and/or CD11b. 如請求項16之mRNA,其中該運輸信號結構域係VSV-G胞質尾(VSVGct)。The mRNA of claim 16, wherein the trafficking signaling domain is a VSV-G cytoplasmic tail (VSVGct). 如請求項1之mRNA,其中該兩個結構域中之一者係SARS-CoV-2刺突蛋白之第一重複七肽:HPPHCPC (HR1)。The mRNA of claim 1, wherein one of the two domains is the first repeat heptapeptide of the SARS-CoV-2 spike protein: HPPHCPC (HR1). 如請求項1之mRNA,其中該兩個結構域中之一者係SARS-CoV-2刺突蛋白之第二重複七肽:HPPHCPC (HR2)。The mRNA of claim 1, wherein one of the two domains is the second repeat heptapeptide of the SARS-CoV-2 spike protein: HPPHCPC (HR2). 如請求項20或21之mRNA,其中該ORF編碼與該HR1及/或該HR2連接之跨膜結構域(TD)。The mRNA of claim 20 or 21, wherein the ORF encodes a transmembrane domain (TD) linked to the HR1 and/or the HR2. 如請求項22之mRNA,其中該TD係流行性感冒血球凝集素跨膜結構域。The mRNA of claim 22, wherein the TD is an influenza hemagglutinin transmembrane domain. 如請求項20至23中任一項之mRNA,其中該ORF編碼融合肽(FP)。The mRNA of any one of claims 20 to 23, wherein the ORF encodes a fusion peptide (FP). 如請求項20至23中任一項之mRNA,其中該ORF編碼CT尾。 The mRNA of any one of claims 20 to 23, wherein the ORF encodes a CT tail. 一種信使核糖核酸(mRNA),其包含編碼SARS-CoV-2刺突蛋白之受體結合結構域(RBD)之開放閱讀框(ORF)。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding the receptor binding domain (RBD) of the SARS-CoV-2 spike protein. 如請求項26之mRNA,其中該RBD係可溶性。 The mRNA of claim 26, wherein the RBD is soluble. 如請求項26之mRNA,其中該RBD與跨膜結構域連接,視情況與流行性感冒血球凝集素跨膜結構域連接。The mRNA of claim 26, wherein the RBD is linked to a transmembrane domain, optionally linked to an influenza hemagglutinin transmembrane domain. 一種信使核糖核酸(mRNA),其包含編碼SARS-CoV-2刺突蛋白之N末端結構域(NTD)之開放閱讀框(ORF)。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding the N-terminal domain (NTD) of the SARS-CoV-2 spike protein. 如請求項29之mRNA,其中該NTD與SARS-CoV-2刺突蛋白之RBD連接以形成NTD-RBD融合蛋白。The mRNA of claim 29, wherein the NTD is linked to the RBD of the SARS-CoV-2 spike protein to form an NTD-RBD fusion protein. 如請求項30之mRNA,其中該NTD-RBD融合體與跨膜結構域(TM) 連接,視情況與流行性感冒血球凝集素跨膜結構域連接,以形成NTD-RBD-TM蛋白。The mRNA of claim 30, wherein the NTD-RBD fusion is linked to a transmembrane domain (TM), optionally linked to an influenza hemagglutinin transmembrane domain, to form an NTD-RBD-TM protein. 如請求項30之mRNA,其中該NTD-RBD融合體包含C末端截短。The mRNA of claim 30, wherein the NTD-RBD fusion comprises a C-terminal truncation. 如請求項26至32中任一項之mRNA,其中該NTD及/或該RBD包括延伸區。The mRNA of any one of claims 26 to 32, wherein the NTD and/or the RBD comprises an extension. 一種信使核糖核酸(mRNA),其包含編碼S1次單元之開放閱讀框(ORF),其中該S1次單元與跨膜結構域連接,視情況與流行性感冒血球凝集素跨膜結構域連接。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an S1 subunit, wherein the S1 subunit is linked to a transmembrane domain, optionally linked to an influenza hemagglutinin transmembrane domain. 一種信使核糖核酸(mRNA),其包含編碼S1次單元之開放閱讀框(ORF),其中該S1次單元經修飾以去除S蛋白之RBD或RBD之一部分。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an S1 subunit, wherein the S1 subunit is modified to remove the RBD or a portion of the RBD of the S protein. 一種信使核糖核酸(mRNA),其包含編碼與S2次單元連接之S1次單元之開放閱讀框(ORF),其中該S1次單元來自HKU1 S蛋白。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an S1 subunit linked to an S2 subunit, wherein the S1 subunit is derived from the HKU1 S protein. 一種信使核糖核酸(mRNA),其包含編碼與S2次單元連接之S1次單元之開放閱讀框(ORF),其中該S1次單元來自OC43 S蛋白。 A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an S1 subunit linked to an S2 subunit, wherein the S1 subunit is derived from an OC43 S protein. 如請求項34至37中任一項之mRNA,其中進一步包含運輸信號,視情況地選自巨噬細胞標記物,視情況為CD86、CD11B及/或VSVGct。The mRNA of any one of claims 34 to 37, further comprising a trafficking signal, optionally selected from macrophage markers, optionally CD86, CD11B and/or VSVGct. 一種信使核糖核酸(mRNA),其包含編碼SARS-CoV-2刺突蛋白之S2次單元之開放閱讀框(ORF)。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding the S2 subunit of the SARS-CoV-2 spike protein. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域。A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the receptor binding domain (RBD) and protein transmembrane domain of the SARS-CoV-2 spike protein. 如請求項40之mRNA,其中該蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。The mRNA of claim 40, wherein the protein transmembrane domain is an influenza hemagglutinin transmembrane domain. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域(NTD)及跨膜結構域。A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the amino (N) terminal domain (NTD) and transmembrane domain of the SARS-CoV-2 spike protein. 如請求項42之mRNA,其中該跨膜結構域係流行性感冒血球凝集素跨膜結構域。The mRNA of claim 42, wherein the transmembrane domain is an influenza hemagglutinin transmembrane domain. 一種信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含與SARS-CoV-2刺突蛋白之受體結合結構域連接之SARS-CoV-2刺突蛋白之胺基(N)末端結構域。A messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the amine group of the SARS-CoV-2 spike protein linked to the receptor binding domain of the SARS-CoV-2 spike protein (N) Terminal domain. 如請求項44之mRNA,其中該融合蛋白進一步包含跨膜結構域。The mRNA of claim 44, wherein the fusion protein further comprises a transmembrane domain. 如前述請求項中任一項之mRNA,其進一步包含5'端帽,視情況為7mG(5')ppp(5')NlmpNp。The mRNA of any preceding claim, further comprising a 5' end cap, optionally 7mG(5')ppp(5')NlmpNp. 如請求項1至46中任一項之mRNA,其進一步包含視情況地長度為約100個核苷酸之聚A尾。The mRNA of any one of claims 1 to 46, further comprising a poly-A tail optionally about 100 nucleotides in length. 如請求項1至47中任一項之mRNA,其中該mRNA包含化學修飾,視情況為1-甲基假尿苷。The mRNA of any one of claims 1 to 47, wherein the mRNA comprises a chemical modification, optionally 1-methylpseudouridine. 如請求項1至48中任一項之mRNA,其中該mRNA中之每一尿苷係1-甲基假尿苷。The mRNA of any one of claims 1 to 48, wherein each uridine in the mRNA is 1-methylpseudouridine. 一種組合物,其包含: (a) 信使核糖核酸(mRNA),其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之受體結合結構域(RBD)及蛋白質跨膜結構域;及 (b) mRNA,其包含編碼融合蛋白之開放閱讀框,該融合蛋白包含SARS-CoV-2刺突蛋白之胺基(N)末端結構域(NTD)及跨膜結構域。 A composition comprising: (a) messenger ribonucleic acid (mRNA) comprising an open reading frame encoding a fusion protein comprising the receptor binding domain (RBD) and protein transmembrane domain of the SARS-CoV-2 spike protein; and (b) mRNA comprising an open reading frame encoding a fusion protein comprising the amino (N) terminal domain (NTD) and transmembrane domain of the SARS-CoV-2 spike protein. 如請求項50之組合物,其中該蛋白質跨膜結構域係流行性感冒血球凝集素跨膜結構域。The composition of claim 50, wherein the protein transmembrane domain is an influenza hemagglutinin transmembrane domain. 如請求項51之組合物,其中(a)之該融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。The composition of claim 51, wherein the fusion protein of (a) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77. 如請求項52之組合物,其中(a)之該融合蛋白包含與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。The composition of claim 52, wherein the fusion protein of (a) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least the amino acid sequence of SEQ ID NO: 77 Amino acid sequences that are 98% or at least 99% identical. 如請求項53之組合物,其中(a)之該融合蛋白包含SEQ ID NO: 77之胺基酸序列。The composition of claim 53, wherein the fusion protein of (a) comprises the amino acid sequence of SEQ ID NO:77. 如請求項50至54中任一項之組合物,其中(a)之該開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。The composition of any one of claims 50 to 54, wherein the open reading frame of (a) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 76. 如請求項55之組合物,其中(a)之該開放閱讀框包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。The composition of claim 55, wherein the open reading frame of (a) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, and the nucleotide sequence of SEQ ID NO: 76. Nucleotide sequences that are at least 97%, at least 98%, or at least 99% identical. 如請求項56之組合物,其中(a)之該開放閱讀框包含SEQ ID NO: 76之核苷酸序列。The composition of claim 56, wherein the open reading frame of (a) comprises the nucleotide sequence of SEQ ID NO:76. 如請求項50至57中任一項之組合物,其中(b)之該融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。The composition of any one of claims 50 to 57, wherein the fusion protein of (b) comprises an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47. 如請求項58之組合物,其中(b)之該融合蛋白包含與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。The composition of claim 58, wherein the fusion protein of (b) comprises at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least Amino acid sequences that are 98% or at least 99% identical. 如請求項59之組合物,其中(b)之該融合蛋白包含SEQ ID NO: 47之胺基酸序列。The composition of claim 59, wherein the fusion protein of (b) comprises the amino acid sequence of SEQ ID NO:47. 如請求項50至60中任一項之組合物,其中(b)之該開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。The composition of any one of claims 50 to 60, wherein the open reading frame of (b) comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46. 如請求項61之組合物,其中(b)之該開放閱讀框包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。The composition of claim 61, wherein the open reading frame of (b) comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, with the nucleotide sequence of SEQ ID NO: 46. Nucleotide sequences that are at least 97%, at least 98%, or at least 99% identical. 如請求項62之組合物,其中(b)之該開放閱讀框包含SEQ ID NO: 46之核苷酸序列。The composition of claim 62, wherein the open reading frame of (b) comprises the nucleotide sequence of SEQ ID NO:46. 如請求項50至63中任一項之組合物,其中(a)之該mRNA對(b)之該mRNA之比率為約1:1。The composition of any one of claims 50 to 63, wherein the ratio of the mRNA of (a) to the mRNA of (b) is about 1:1. 如請求項1至49中任一項之mRNA,其在組合物中進一步包含脂質奈米顆粒。The mRNA of any one of claims 1 to 49, further comprising lipid nanoparticles in the composition. 如請求項50至63中任一項之組合物,其進一步包含脂質奈米顆粒。The composition of any one of claims 50 to 63, further comprising lipid nanoparticles. 如請求項65或66之組合物,其中該mRNA在該脂質奈米顆粒中。The composition of claim 65 or 66, wherein the mRNA is in the lipid nanoparticle. 如請求項50至63中任一項之組合物,其中(a)之該mRNA在脂質奈米顆粒中且(b)之該mRNA在脂質奈米顆粒中。The composition of any one of claims 50 to 63, wherein the mRNA of (a) is in lipid nanoparticles and the mRNA of (b) is in lipid nanoparticles. 如請求項68之組合物,其中(a)之該mRNA及(b)之該mRNA在相同脂質奈米顆粒中,或其中(a)之該mRNA及(b)之該mRNA相對於彼此各自調配於單獨奈米顆粒中。The composition of claim 68, wherein the mRNA of (a) and the mRNA of (b) are in the same lipid nanoparticle, or wherein the mRNA of (a) and the mRNA of (b) are each formulated relative to each other in individual nanoparticles. 如請求項65至69中任一項之組合物,其中該脂質奈米顆粒包含陽離子脂質。The composition of any one of claims 65 to 69, wherein the lipid nanoparticles comprise cationic lipids. 如請求項70之組合物,其中該脂質奈米顆粒進一步包含中性脂質。The composition of claim 70, wherein the lipid nanoparticles further comprise neutral lipids. 如請求項70或71之組合物,其中該脂質奈米顆粒進一步包含固醇。The composition of claim 70 or 71, wherein the lipid nanoparticle further comprises a sterol. 如請求項70至72中任一項之組合物,其中該脂質奈米顆粒進一步包含聚乙二醇(PEG)改質之脂質。The composition of any one of claims 70 to 72, wherein the lipid nanoparticles further comprise polyethylene glycol (PEG) modified lipids. 如請求項70至73中任一項之組合物,其中該脂質奈米顆粒包含可電離之陽離子脂質、中性脂質、固醇及PEG改質之脂質。The composition of any one of claims 70 to 73, wherein the lipid nanoparticles comprise ionizable cationic lipids, neutral lipids, sterols, and PEG-modified lipids. 如請求項74之組合物,其中該可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1)。The composition of claim 74, wherein the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxy-6-(undecyloxy)hexyl)amino)octanoic acid Heptan-9-yl ester (Compound 1). 如請求項74或75之組合物,其中該中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC)。The composition of claim 74 or 75, wherein the neutral lipid is 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC). 如請求項70至76中任一項之組合物,其中該固醇係膽固醇。The composition of any one of claims 70 to 76, wherein the sterol is cholesterol. 如請求項70至77中任一項之組合物,其中該PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。The composition of any one of claims 70 to 77, wherein the PEG-modified lipid is 1,2 dimyristyl-sn-glycerol-methoxypolyethylene glycol (PEG2000 DMG). 如請求項70至78中任一項之組合物,其中該脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。The composition of any one of claims 70 to 78, wherein the lipid nanoparticles comprise 20-60 mol % ionizable cationic lipids, 5-25 mol % neutral lipids, 25-55 mol % sterols, and 0.5 mol % -15 mol% PEG-modified lipids. 如請求項79之組合物,其中該脂質奈米顆粒包含: 40-50 mol%可電離之脂質,視情況地45-50 mol%;30-45 mol%固醇,視情況地35-40 mol%;5-15 mol%輔助脂質,視情況地10-12 mol%;1-5% PEG脂質,視情況地1-3 mol%或1.5-2.5 mol%。 The composition of claim 79, wherein the lipid nanoparticles comprise: 40-50 mol% ionizable lipids, optionally 45-50 mol%; 30-45 mol% sterols, optionally 35-40 mol%; 5-15 mol% helper lipids, optionally 10-12 mol%; 1-5% PEG lipid, 1-3 mol% or 1.5-2.5 mol% as appropriate. 如請求項80之組合物,其中該脂質奈米顆粒包含: 40-50 mol%化合物1,視情況地45-50 mol%;30-45 mol%膽固醇,視情況地35-40 mol%;5-15 mol% DSPC,視情況地10-12 mol%;1-5% PEG2000DMG,視情況地1-3 mol%或1.5-2.5 mol%。 The composition of claim 80, wherein the lipid nanoparticle comprises: 40-50 mol% compound 1, optionally 45-50 mol%; 30-45 mol% cholesterol, optionally 35-40 mol%; 5-15 mol% DSPC, optionally 10-12 mol%; 1 -5% PEG2000DMG, 1-3 mol% or 1.5-2.5 mol% as appropriate. 一種方法,其包括向個體投與如前述請求項中任一項之mRNA或組合物,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應。A method comprising administering to an individual the mRNA or composition of any preceding claim in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如前述請求項中任一項之mRNA或組合物,其投與量可在該個體中有效誘導針對SARS-CoV-2之T細胞免疫反應。A method comprising administering to an individual the mRNA or composition of any preceding claim in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項1至49中任一項之mRNA,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應及T細胞免疫反應。A method comprising administering to an individual the mRNA of any one of claims 1 to 49 in an amount effective to induce neutralizing antibody responses and T cell immune responses against SARS-CoV-2 in the individual. 一種信使核糖核酸(mRNA),其包含編碼SARS-CoV-2刺突蛋白之至少一個結構域的開放閱讀框(ORF)。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding at least one domain of the SARS-CoV-2 spike protein. 如請求項85之mRNA,其中該mRNA係如請求項1至49中之任一項之mRNA。The mRNA of claim 85, wherein the mRNA is the mRNA of any one of claims 1 to 49. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 77之胺基酸序列具有至少80%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 77. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 77之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least the amino acid sequence of SEQ ID NO: 77 99% identical amino acid sequence. 如請求項85或86之mRNA,其中該ORF編碼SEQ ID NO: 77之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes the amino acid sequence of SEQ ID NO:77. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 76之核苷酸序列具有至少70%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 76. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 76之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least Nucleotide sequences that are 98% or at least 99% identical. 如請求項85或86之mRNA,其中該ORF包含SEQ ID NO: 76之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises the nucleotide sequence of SEQ ID NO:76. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 47之胺基酸序列具有至少80%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 47. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 47之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least the amino acid sequence of SEQ ID NO: 47 99% identical amino acid sequence. 如請求項85或86之mRNA,其中該ORF編碼SEQ ID NO: 47之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes the amino acid sequence of SEQ ID NO:47. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 46之核苷酸序列具有至少70%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 46. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 46之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least the Nucleotide sequences that are 98% or at least 99% identical. 如請求項85或86之mRNA,其中該ORF包含SEQ ID NO: 46之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises the nucleotide sequence of SEQ ID NO:46. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 62之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中該抗原包含SEQ ID NO: 62之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least the amino acid sequence of SEQ ID NO: 62 An amino acid sequence that is 98% or at least 99% identical, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:62. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 61之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中該開放閱讀框包含SEQ ID NO: 61之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least the nucleotide sequence of SEQ ID NO: 61 A nucleotide sequence that is 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:61. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 92、140或143之胺基酸序列具有至少80%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes an amino acid sequence that is at least 80% identical to the amino acid sequence of SEQ ID NO: 92, 140 or 143. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 92、140或143之胺基酸序列具有至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 97% of the amino acid sequence of SEQ ID NO: 92, 140 or 143 Amino acid sequences that are 98% or at least 99% identical. 如請求項85或86之mRNA,其中該ORF編碼SEQ ID NO: 92、140或143之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes the amino acid sequence of SEQ ID NO: 92, 140 or 143. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 91、139或142之核苷酸序列具有至少70%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises a nucleotide sequence that is at least 70% identical to the nucleotide sequence of SEQ ID NO: 91, 139 or 142. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 91、139或142之核苷酸序列具有至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least the nucleotide sequence of SEQ ID NO: 91, 139 or 142 Nucleotide sequences that are 97%, at least 98% or at least 99% identical. 如請求項85或86之mRNA,其中該ORF包含SEQ ID NO: 91、139或142之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises the nucleotide sequence of SEQ ID NO: 91, 139 or 142. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 107之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中該抗原包含SEQ ID NO: 107之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least the amino acid sequence of SEQ ID NO: 107 An amino acid sequence that is 98% or at least 99% identical, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 107. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 106之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中該開放閱讀框包含SEQ ID NO: 106之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least A nucleotide sequence that is 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 106. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況地其中該抗原包含SEQ ID NO: 59、86、89、116、119或122中之任一者之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90% of the amino acid sequence of any one of SEQ ID NOs: 59, 86, 89, 116, 119 or 122 %, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises SEQ ID NO: 59, 86, 89, 116, 119 or The amino acid sequence of any of 122. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況地其中該開放閱讀框包含SEQ ID NO: 58、85、88、115、118或121中之任一者之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80% with the nucleotide sequence of any one of SEQ ID NOs: 58, 85, 88, 115, 118 or 121 %, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises SEQ ID NOs: 58, 85, The nucleotide sequence of any of 88, 115, 118, or 121. 一種信使核糖核酸(mRNA),其包含編碼S1次單元抗原的開放閱讀框 (ORF),該抗原包含與SEQ ID NO: 5的胺基酸序列具有至少 80%、至少 85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 5的胺基酸序列。A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an S1 subunit antigen comprising at least 80%, at least 85%, at least 90%, at least 80%, at least 85%, at least 90%, with the amino acid sequence of SEQ ID NO: 5 An amino acid sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:5. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 3之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 3之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least A nucleotide sequence that is 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 3. 如請求項85或86之mRNA,其中該抗原包含與SEQ ID NO: 17或146之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 17或146之胺基酸序列。The mRNA of claim 85 or 86, wherein the antigen comprises at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97% with the amino acid sequence of SEQ ID NO: 17 or 146 , an amino acid sequence of at least 98% or at least 99% identity, as appropriate wherein the antigen comprises the amino acid sequence of SEQ ID NO: 17 or 146. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 16或145之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 16或145之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96% with the nucleotide sequence of SEQ ID NO: 16 or 145 , a nucleotide sequence of at least 97%, at least 98%, or at least 99% identity, as the case may be, wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 16 or 145. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 20、23、26、29、32或35中之任一者之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90% of the amino acid sequence with any one of SEQ ID NOs: 20, 23, 26, 29, 32 or 35 %, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises SEQ ID NO: 20, 23, 26, 29, 32 or 35 The amino acid sequence of any of them. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 19、22、25、28、31或34中之任一者之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80% with the nucleotide sequence of any one of SEQ ID NOs: 19, 22, 25, 28, 31 or 34 %, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises SEQ ID NOs: 19, 22, 25 The nucleotide sequence of any of , 28, 31 or 34. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 38之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 38之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least the amino acid sequence of SEQ ID NO: 38 An amino acid sequence that is 98% or at least 99% identical, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO:38. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 37之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 37之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least A nucleotide sequence that is 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO:37. 如請求項85或86之mRNA,其中該ORF編碼與SEQ ID NO: 41之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 41之胺基酸序列。The mRNA of claim 85 or 86, wherein the ORF encodes at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least the amino acid sequence of SEQ ID NO: 41 An amino acid sequence that is 98% or at least 99% identical, optionally wherein the antigen comprises the amino acid sequence of SEQ ID NO: 41. 如請求項85或86之mRNA,其中該ORF包含與SEQ ID NO: 40之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 40之核苷酸序列。The mRNA of claim 85 or 86, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least A nucleotide sequence that is 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 40. 如前述請求項中任一項之mRNA,其進一步包含5'非轉譯區(UTR),該5'非轉譯區視情況包含SEQ ID NO: 131或2之核苷酸序列。The mRNA of any preceding claim, further comprising a 5' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 131 or 2, as appropriate. 如前述請求項中任一項之mRNA,其進一步包含3'非轉譯區(UTR),該3'非轉譯區視情況包含SEQ ID NO: 132或4之核苷酸序列。 The mRNA of any of the preceding claims, further comprising a 3' untranslated region (UTR) comprising the nucleotide sequence of SEQ ID NO: 132 or 4, as appropriate. 一種信使核糖核酸(mRNA),其包含編碼抗原之開放閱讀框(ORF),該抗原包含與SEQ ID NO: 8或65之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 8或65之胺基酸序列。 A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an antigen comprising at least 80%, at least 85%, at least 90%, at least 80% with the amino acid sequence of SEQ ID NO: 8 or 65 An amino acid sequence that is 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, as appropriate wherein the antigen comprises the amino acid sequence of SEQ ID NO: 8 or 65. 如請求項123之mRNA,其中該ORF包含與SEQ ID NO: 7或64之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 7或64之核苷酸序列。The mRNA of claim 123, wherein the ORF comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 96%, at least A nucleotide sequence that is 97%, at least 98%, or at least 99% identical, optionally wherein the open reading frame comprises the nucleotide sequence of SEQ ID NO: 7 or 64. 一種信使核糖核酸(mRNA),其包含編碼抗原之開放閱讀框(ORF),該抗原包含與SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 11、14、68或71中之任一者之胺基酸序列。 A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an antigen comprising at least 80% amino acid sequence with any one of SEQ ID NOs: 11, 14, 68 or 71, An amino acid sequence that is at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the antigen comprises SEQ ID NOs: 11, 14, 68 or the amino acid sequence of any of 71. 如請求項125之mRNA,其中該開放閱讀框包含與SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 10、13、67或70中之任一者之核苷酸序列。The mRNA of claim 125, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90% with the nucleotide sequence of any one of SEQ ID NOs: 10, 13, 67 or 70 %, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical nucleotide sequences, optionally wherein the open reading frame comprises one of SEQ ID NOs: 10, 13, 67 or 70 The nucleotide sequence of any one. 一種信使核糖核酸(mRNA),其包含編碼抗原之開放閱讀框(ORF),該抗原包含與SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 44、50、74、80、83、101、104或113中之任一者之胺基酸序列。 A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an antigen comprising an amine with any one of SEQ ID NOs: 44, 50, 74, 80, 83, 101, 104 or 113 an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity to an amino acid sequence, optionally wherein the antigen comprises SEQ The amino acid sequence of any of ID NO: 44, 50, 74, 80, 83, 101, 104 or 113. 如請求項127之mRNA,其中該開放閱讀框包含與SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 43、49、73、79、82、100、103或112中之任一者之核苷酸序列。The mRNA of claim 127, wherein the open reading frame comprises at least 70%, at least 75% with the nucleotide sequence of any one of SEQ ID NOs: 43, 49, 73, 79, 82, 100, 103 or 112 %, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to a nucleotide sequence, optionally wherein the open reading frame comprises SEQ ID NO: 43 The nucleotide sequence of any of , 49, 73, 79, 82, 100, 103, or 112. 一種信使核糖核酸(mRNA),其包含編碼抗原之開放閱讀框(ORF),該抗原包含與SEQ ID NO: 95、98或110中之任一者之胺基酸序列具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的胺基酸序列,視情況其中該抗原包含SEQ ID NO: 95、98或110中之任一者之胺基酸序列。 A messenger ribonucleic acid (mRNA) comprising an open reading frame (ORF) encoding an antigen comprising at least 80%, at least 85%, at least 85%, at least 85%, at least 80%, at least 85%, at least 80%, at least 85%, at least 80%, at least 85%, at least 80%, at least 80%, 85%, %, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical amino acid sequences, optionally wherein the antigen comprises one of SEQ ID NOs: 95, 98 or 110 The amino acid sequence of any one. 如請求項129之mRNA,其中該開放閱讀框包含與SEQ ID NO: 94、97或109中之任一者之核苷酸序列具有至少70%、至少75%、至少80%、至少90%、至少95%、至少96%、至少97%、至少98%或至少99%一致性的核苷酸序列,視情況其中該開放閱讀框包含SEQ ID NO: 94、97或109中之任一者之核苷酸序列。The mRNA of claim 129, wherein the open reading frame comprises at least 70%, at least 75%, at least 80%, at least 90%, at least 70%, at least 75%, at least 80%, at least 90% with the nucleotide sequence of any one of SEQ ID NOs: 94, 97 or 109, A nucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical, optionally wherein the open reading frame comprises any one of SEQ ID NOs: 94, 97 or 109 Nucleotide sequence. 如請求項85至130中任一項之mRNA,其在組合物中進一步包含脂質奈米顆粒。The mRNA of any one of claims 85 to 130, further comprising lipid nanoparticles in the composition. 如請求項131之組合物,其中該mRNA在該脂質奈米顆粒中。The composition of claim 131, wherein the mRNA is in the lipid nanoparticle. 如請求項131或132之組合物,其中該脂質奈米顆粒包含陽離子脂質。The composition of claim 131 or 132, wherein the lipid nanoparticles comprise cationic lipids. 如請求項133之組合物,其中該脂質奈米顆粒進一步包含中性脂質。The composition of claim 133, wherein the lipid nanoparticles further comprise neutral lipids. 如請求項131至134中任一項之組合物,其中該脂質奈米顆粒進一步包含固醇。The composition of any one of claims 131 to 134, wherein the lipid nanoparticle further comprises a sterol. 如請求項131至135中任一項之組合物,其中該脂質奈米顆粒進一步包含聚乙二醇(PEG)改質之脂質。The composition of any one of claims 131 to 135, wherein the lipid nanoparticles further comprise polyethylene glycol (PEG) modified lipids. 如請求項131至136中任一項之組合物,其中該脂質奈米顆粒包含可電離之陽離子脂質、中性脂質、固醇及PEG改質之脂質。The composition of any one of claims 131 to 136, wherein the lipid nanoparticles comprise ionizable cationic lipids, neutral lipids, sterols, and PEG-modified lipids. 如請求項137之組合物,其中該可電離之陽離子脂質係8-((2-羥基乙基)(6-側氧基-6-(十一烷基氧基)己基)胺基)辛酸十七烷-9-基酯(化合物1)。The composition of claim 137, wherein the ionizable cationic lipid is 8-((2-hydroxyethyl)(6-oxy-6-(undecyloxy)hexyl)amino)octanoic acid Heptan-9-yl ester (Compound 1). 如請求項137或138之組合物,其中該中性脂質係1,2-二硬脂醯基-sn-甘油-3-磷酸膽鹼(DSPC)。The composition of claim 137 or 138, wherein the neutral lipid is 1,2-distearyl-sn-glycero-3-phosphocholine (DSPC). 如請求項131至139中任一項之組合物,其中該固醇係膽固醇。The composition of any one of claims 131 to 139, wherein the sterol is cholesterol. 如請求項131至140中任一項之組合物,其中該PEG改質之脂質係1,2二肉豆蔻醯基-sn-甘油-甲氧基聚乙二醇(PEG2000 DMG)。The composition of any one of claims 131 to 140, wherein the PEG-modified lipid is 1,2 dimyristyl-sn-glycerol-methoxypolyethylene glycol (PEG2000 DMG). 如請求項131至141中任一項之組合物,其中該脂質奈米顆粒包含20-60 mol%可電離之陽離子脂質、5-25 mol%中性脂質、25-55 mol%固醇及0.5-15 mol% PEG改質之脂質。The composition of any one of claims 131 to 141, wherein the lipid nanoparticles comprise 20-60 mol % ionizable cationic lipids, 5-25 mol % neutral lipids, 25-55 mol % sterols, and 0.5 mol % -15 mol% PEG-modified lipids. 如請求項142之組合物,其中該脂質奈米顆粒包含: 40-50 mol%可電離之脂質,視情況地45-50 mol%;30-45 mol%固醇,視情況地35-40 mol%;5-15 mol%輔助脂質,視情況地10-12 mol%;1-5% PEG脂質,視情況地1-3 mol%或1.5-2.5 mol%。 The composition of claim 142, wherein the lipid nanoparticle comprises: 40-50 mol% ionizable lipids, optionally 45-50 mol%; 30-45 mol% sterols, optionally 35-40 mol%; 5-15 mol% helper lipids, optionally 10-12 mol%; 1-5% PEG lipid, 1-3 mol% or 1.5-2.5 mol% as appropriate. 如請求項143之組合物,其中該脂質奈米顆粒包含: 40-50 mol%化合物1,視情況地45-50 mol%;30-45 mol%膽固醇,視情況地35-40 mol%;5-15 mol% DSPC,視情況地10-12 mol%;1-5% PEG2000DMG,視情況地1-3 mol%或1.5-2.5 mol%。 The composition of claim 143, wherein the lipid nanoparticle comprises: 40-50 mol% compound 1, optionally 45-50 mol%; 30-45 mol% cholesterol, optionally 35-40 mol%; 5-15 mol% DSPC, optionally 10-12 mol%; 1 -5% PEG2000DMG, 1-3 mol% or 1.5-2.5 mol% as appropriate. 如請求項85至130中任一項之mRNA,其中該mRNA包含化學修飾,視情況為1-甲基假尿苷。The mRNA of any one of claims 85 to 130, wherein the mRNA comprises a chemical modification, optionally 1-methylpseudouridine. 如請求項85至130中任一項之mRNA,其中該mRNA中之每一尿苷係1-甲基假尿苷。The mRNA of any one of claims 85 to 130, wherein each uridine in the mRNA is 1-methylpseudouridine. 如請求項131至144中任一項之組合物,其中該mRNA包含化學修飾,視情況為1-甲基假尿苷。The composition of any one of claims 131 to 144, wherein the mRNA comprises a chemical modification, optionally 1-methylpseudouridine. 如請求項131至144中任一項之組合物,其中該mRNA中之每一尿苷係1-甲基假尿苷。The composition of any one of claims 131 to 144, wherein each uridine in the mRNA is 1-methylpseudouridine. 一種方法,其包括向個體投與如請求項85至130中任一項之mRNA,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應。A method comprising administering to an individual the mRNA of any one of claims 85 to 130 in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項85至130中任一項之mRNA,其投與量可在該個體中有效誘導針對SARS-CoV-2之T細胞免疫反應。A method comprising administering to an individual the mRNA of any one of claims 85 to 130 in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項85至130中任一項之mRNA,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應及T細胞免疫反應。A method comprising administering to an individual the mRNA of any one of claims 85 to 130 in an amount effective to induce neutralizing antibody responses and T cell immune responses against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項131至144中任一項之組合物,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應。A method comprising administering to an individual a composition of any one of claims 131 to 144 in an amount effective to induce a neutralizing antibody response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項131至144中任一項之組合物,其投與量可在該個體中有效誘導針對SARS-CoV-2之T細胞免疫反應。A method comprising administering to an individual a composition of any one of claims 131 to 144 in an amount effective to induce a T cell immune response against SARS-CoV-2 in the individual. 一種方法,其包括向個體投與如請求項131至144中任一項之組合物,其投與量可在該個體中有效誘導針對SARS-CoV-2之中和抗體反應及T細胞免疫反應。A method comprising administering to an individual the composition of any one of claims 131 to 144 in an amount effective to induce neutralizing antibody responses and T-cell immune responses against SARS-CoV-2 in the individual .
TW110123289A 2020-06-25 2021-06-25 Sars-cov-2 mrna domain vaccines TW202217000A (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US202063044330P 2020-06-25 2020-06-25
US63/044,330 2020-06-25
US202063063137P 2020-08-07 2020-08-07
US63/063,137 2020-08-07

Publications (1)

Publication Number Publication Date
TW202217000A true TW202217000A (en) 2022-05-01

Family

ID=82558760

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110123289A TW202217000A (en) 2020-06-25 2021-06-25 Sars-cov-2 mrna domain vaccines

Country Status (1)

Country Link
TW (1) TW202217000A (en)

Similar Documents

Publication Publication Date Title
JP7443608B2 (en) SARS-COV-2 mRNA domain vaccine
US20240299531A1 (en) Therapeutic use of sars-cov-2 mrna domain vaccines
US20240293534A1 (en) Coronavirus glycosylation variant vaccines
US20240358819A1 (en) Pan-human coronavirus domain vaccines
US20240285754A1 (en) Mrna vaccines encoding flexible coronavirus spike proteins
US20240382581A1 (en) Pan-human coronavirus vaccines
US20230355743A1 (en) Multi-proline-substituted coronavirus spike protein vaccines
US20230108894A1 (en) Coronavirus rna vaccines
JP2024503699A (en) Variant strain-based coronavirus vaccines
JP2024503698A (en) Mutant strain-based coronavirus vaccine
WO2023283642A2 (en) Pan-human coronavirus concatemeric vaccines
WO2021159130A2 (en) Coronavirus rna vaccines and methods of use
CA3216490A1 (en) Epstein-barr virus mrna vaccines
WO2023092069A1 (en) Sars-cov-2 mrna domain vaccines and methods of use
TW202217000A (en) Sars-cov-2 mrna domain vaccines
WO2025149032A1 (en) Respiratory syncytial virus vaccine compositions and their use