[go: up one dir, main page]

CN121532411A - Systems and methods for regulating target genes - Google Patents

Systems and methods for regulating target genes

Info

Publication number
CN121532411A
CN121532411A CN202480047487.0A CN202480047487A CN121532411A CN 121532411 A CN121532411 A CN 121532411A CN 202480047487 A CN202480047487 A CN 202480047487A CN 121532411 A CN121532411 A CN 121532411A
Authority
CN
China
Prior art keywords
peptide
sequence
engineered gene
seq
optionally
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202480047487.0A
Other languages
Chinese (zh)
Inventor
丹尼尔·O·哈特
亓磊
托马斯·布莱尔·盖诺斯
乔瓦尼·卡罗索
腾宇·高
罗宾·W·约
穆罕默德·扎基·贾瓦德
布兰登·W·廖
杨潇
刘艳霞
蒂莫西·戴利
克里斯托弗·达里尔·斯蒂尔二世
梅兰妮·R·西尔维斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Epicrispil Biotechnology Co
Original Assignee
Epicrispil Biotechnology Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Epicrispil Biotechnology Co filed Critical Epicrispil Biotechnology Co
Publication of CN121532411A publication Critical patent/CN121532411A/en
Pending legal-status Critical Current

Links

Abstract

Provided herein are compositions, methods, and systems for modulating expression of a target gene (e.g., a target endogenous gene). In some embodiments, an engineered gene effector is provided that comprises a first peptide that is 75-95 amino acids in length and a second peptide that is 75-95 amino acids in length. Engineered gene effectors may facilitate modulation of the expression level or activity level of a target gene when brought into proximity with the target gene or target gene regulatory sequence as a complex with a targeting moiety (e.g., a heterologous endonuclease). Also provided are computer-implemented methods for producing functional biological sequences, as well as functional biological sequences, such as engineered gene effectors made by the methods.

Description

Systems and methods for modulating a target gene
Reference to priority applications
The present application claims priority from U.S. provisional application number 63/504661 filed on month 5 of 2023, U.S. provisional application number 63/520561 filed on month 8 of 2023, U.S. provisional application number 63/502891 filed on month 5 of 2023, U.S. provisional application number 63/504660 filed on month 5 of 2023, and U.S. provisional application number 63/504663 filed on month 5 of 2023. The contents of each of these above-mentioned applications are expressly incorporated herein by reference in their entirety.
Reference to sequence Listing
The present application is presented with a sequence listing in electronic format. The sequence listing is provided in a file named EPICR019WOSequenceListing. Xml, created at 2024, month 5, 14, and having a size of 2506752 bytes. The electronically formatted information of the sequence listing is incorporated by reference in its entirety.
Background
Various effector agents (e.g., transcriptional regulatory factors) may be utilized to regulate expression or activity of a target gene in a cell. For example, a heterologous gene effector can be introduced (e.g., delivered, expressed, etc.) into a cell, and the heterologous gene effector can affect such modulation of a target gene, alone or in combination with additional agents. In some examples, the additional agent may include Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated protein (Cas) for specific binding to a target gene (e.g., a target deoxyribonucleic acid (DNA) sequence or ribonucleic acid (RNA) sequence (e.g., an exogenous DNA sequence or RNA sequence) of the target gene), while the heterologous gene effector may regulate expression or activity level of the target gene. Such gene effectors may be used, for example, as gene therapy to treat or ameliorate a condition (e.g., a disease) in a subject.
Disclosure of Invention
Provided herein are engineered gene effectors comprising a polypeptide comprising a first peptide of 75-110 (or 75-95) amino acids in length, wherein the first peptide comprises any one of SEQ ID nos. 3-100, or a sequence at least 85% identical thereto, and a second peptide of 75-110 (or 75-95) amino acids in length and heterologous to the first peptide, wherein the second peptide comprises any one of SEQ ID nos. 3-100, or a sequence at least 85% identical thereto, optionally wherein the first peptide is different from the second peptide.
Also provided are engineered gene effectors comprising a polypeptide comprising a first peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator and a second peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator, wherein the second peptide is heterologous to the first peptide, wherein the engineered gene effectors are capable of activating a target gene in a cell when expressed in the cell and effectively targeted to a site of the target gene.
Provided herein are fusion proteins comprising an engineered gene effector of the present disclosure, and a heterologous endonuclease coupled to a polypeptide, optionally wherein the heterologous endonuclease is a Cas protein.
Also provided are systems comprising an engineered gene effector of the present disclosure, a heterologous endonuclease coupled to a polypeptide of the engineered gene effector, optionally wherein the heterologous endonuclease is a Cas protein, and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell.
Also provided are combinations of polynucleotides encoding the systems of the present disclosure, wherein the combinations of polynucleotides are configured to express in a cell a heterologous endonuclease coupled to an engineered gene effector, and a guide nucleic acid.
Further provided are kits comprising any of the engineered gene effectors, fusion proteins, combinations, systems, polynucleotides, vectors, and/or cells of the disclosure.
Also provided are methods of controlling a target gene in a cell comprising contacting the cell with an engineered gene effector, fusion protein, polynucleotide, vector, system, or combination of polynucleotides of the present disclosure.
Further provided is a computer-implemented method of generating a functional biological sequence, the method comprising (a) providing an fitness function trained on a biological dataset comprising functionally determined biological sequences having a fixed length; providing in a computer a plurality of different sequences comprising a fixed length, each sequence being associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of a temperature gradient, (c) by the computer, in parallel, (1) selecting one or more random positions for introducing substitutions in one or more sequences of the plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and for each of the one or more sequences, evaluating a first fitness change due to introducing substitutions at one or more randomly selected positions, and accepting or rejecting substitutions based on the evaluated first fitness change, and optionally further based on the temperature associated with the sequence, and/or (2) selecting one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a consecutive temperature of the temperature gradient, optionally selecting 3 of the plurality of different sequences, optionally selecting one or more pairs of selected pairs, and for producing a domain between the selected pairs of the plurality of selected pairs, and the domain is evaluated due to the selected pairs, and accepting or rejecting one or more domain exchanges between the selected pairs based on the fitness differences and the temperatures associated with each sequence of the selected pairs, and (d) iteratively performing (c), wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domain exchanges of a previous iteration are incorporated into a plurality of different sequences, thereby producing one or more functional sequences having fitness that meets or exceeds a desired fitness threshold.
In one aspect, a computer-implemented method of generating a functional biological sequence is provided, the computer-implemented method comprising (a) evaluating, by a computer, a sequence of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functional biological sequences having the fixed length, (b) replacing, by the computer, one or more random residues in the sequence to generate a mutant sequence, (c) evaluating, by the computer, the mutant sequence based on the fitness function, and (d) collecting, by the computer, the functional sequence accepted by the fitness function. In some cases, the functional biological sequence comprises an amino acid sequence or a nucleotide sequence of a protein or peptide. In some cases, the protein or peptide is an epigenetic regulator, transcription factor, enzyme, nuclease, agonist, antagonist, regulatory factor, or inhibitor. In some cases, the functional biological sequence comprises an amino acid sequence or a nucleotide sequence. In some cases, the functional biological sequence comprises an amino acid sequence, and further wherein the fixed length is at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 105, at least 110, at least 115, at least 120, at least 125, at least 130, at least 140, at least 150, or at least 200 amino acids. In some cases, the fitness function is based on one or more machine learning models, wherein the machine learning model is selected from the group consisting of a supervised machine learning model, an unsupervised machine learning model, a reinforcement learning model, a deep learning model, a transfer learning model, and any combination thereof. In some cases, the one or more machine learning models are selected from the group consisting of classification models, regression models, convolutional Neural Networks (CNNs), recurrent Neural Networks (RNNs), limit gradient boosting (XGBoost), long-short term memory networks, generation of countermeasure networks (GANs), automatic encoders, transformer networks, evolutionary Monte Carlo (evolutionary Monte Carlo), and any combination thereof. In some cases, the computer-implemented method further comprises randomly exchanging, by the computer, one or more subsequences from the mutated sequence and a different sequence of the plurality of sequences. In some cases, the fitness function includes a threshold selected from the group consisting of a binary threshold, a numeric threshold, a multi-class threshold, a confidence threshold, a decision threshold, and any combination thereof. In some cases, the fitness function accepts the functional sequence when the fitness score assigned to the functional sequence by the fitness function meets or exceeds a threshold.
In another aspect, a computer-implemented system is provided that includes a computing device including at least one processor and instructions executable by the at least one processor to provide an application program including (a) a software module configured to evaluate, by a computer, sequences in a plurality of different sequences including a fixed length based on an fitness function trained on a biological dataset including functional biological sequences having the fixed length, (b) a software module configured to replace, by the computer, one or more random residues in the sequences to generate a mutant sequence, (c) a software module configured to evaluate, by the computer, the mutant sequence based on the fitness function, and (d) a software module configured to collect, by the computer, the functional sequences accepted by the fitness function.
In another aspect, a non-transitory computer readable medium having stored thereon computer readable instructions that, when executed by a processor, cause the processor to perform a method comprising (a) evaluating sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functional biological sequences having the fixed length, (b) replacing one or more random residues in the sequences to generate a mutant sequence, (c) evaluating the mutant sequence based on the fitness function, and (d) collecting the functional sequences accepted by the fitness function is provided.
Provided herein are engineered gene effectors comprising a polypeptide of 85 amino acids in length comprising any one of SEQ ID NOs 1495, 1592, 1595, 1634, 1654, 1665, 1677, 1686, 1689, 1716, or a sequence at least 85% identical thereto.
There is also provided a computer-implemented system comprising a computing device comprising at least one processor and instructions executable by the at least one processor to provide an application comprising (a) a software module configured to provide, by the computer, a fitness function trained on a biological dataset comprising functionally determined biological sequences having a fixed length, (b) a software module configured to provide, by the computer, a plurality of different sequences comprising a fixed length, each sequence being associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of a temperature gradient, (c) a software module in parallel with the plurality of different sequences, (1) the software module being configured to select, by the computer, one or more random positions for introducing substitutions in one or more sequences of the plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and for one or more sequences, each sequence being associated with a temperature gradient based on the one or more random sequences, and further evaluating, by the one or more random sequences and/or more random positions being included in the plurality of sequences, the software module being further associated with the temperature gradient based on the one or more random sequences being selected and the fitness being evaluated based on the one or more random sequences, optionally selecting up to 3 pairs of the plurality of different sequences, optionally selecting 1 pair of the plurality of different sequences, and for each selected pair selecting one or more domains for exchange between sequences of the selected pair, and evaluating the fitness differences of sequences of the selected pair due to the exchange of one or more domains, and accepting or rejecting one or more domain exchanges between the selected pair based on the fitness differences and a temperature associated with each sequence of the selected pair, and (d) a software module configured to iteratively execute (c) by the computer, wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domain exchanges of a previous iteration are incorporated into the plurality of different sequences, thereby producing one or more functional sequences having a fitness that is at or above a desired fitness threshold.
Also provided are engineered gene effectors comprising a polypeptide comprising any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
Provided herein are engineered gene effectors comprising a polypeptide comprising any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, or sequences at least 85% identical thereto, or sequences having 0-3 amino acid residue mutations, optionally, wherein any mutation thereof is a conservative substitution.
Drawings
The novel features of the disclosure are set forth with particularity in the appended claims. The features and advantages of the present disclosure will be better understood by reference to the following detailed description, which sets forth exemplary embodiments utilizing the principles of the present invention, and the accompanying drawings.
FIG. 1A shows activation of GFP reporter genes by single gene regulators or by combination gene regulators.
FIG. 1B shows the synergistic effect of activation of the combinatorial gene modulators.
FIG. 2A shows the distribution of selected candidate gene regulators.
FIG. 2B shows the activation levels of the combination gene modulators (weak, medium, and strong).
FIG. 2C shows homologous clustering of candidate modulators.
FIG. 2D shows predicted protein domains of candidate modulators.
FIG. 3A shows cloning strategies for screening for combinatorial gene modulators.
FIG. 3B shows the GFP reporter design for evaluation of gene regulation by the combination gene regulators.
FIG. 3C shows the validation of the combined gene modulator reversible recruitment by the reporting system.
FIG. 4 shows GFP expression of the combination gene regulators.
FIG. 5A shows DESEQ analysis of combinatorial gene modulators.
FIG. 5B shows the level of activation of a combined gene regulator source (e.g., human-human vs virus).
Fig. 5C shows a heat map indicating the activation states of all the combination modulators.
Fig. 6A shows an analysis of biochemical and biophysical characteristics of a combination modulator by evaluating electrostatic potential.
FIG. 6B shows an analysis of biochemical and biophysical characteristics of the combination modulator by evaluating the B_beta factor.
FIGS. 7A-7B depict the activation of epigenetic silenced loci (CD 45) by engineered gene effectors in HEK 293. Some of the engineered gene effectors detected comprise a first peptide and a second peptide. Some of the engineered gene effectors detected are encoded by functional biological sequences generated using computer-implemented methods.
FIG. 8 depicts reactivation of epigenetic silenced loci by engineered gene effectors in human cells. Some of the engineered gene effectors detected comprise a first peptide and a second peptide. Some of the engineered gene effectors detected were generated using computer-implemented methods that produced functional biological sequences.
Fig. 9A-9D provide non-limiting examples of exemplary methods of generating biofunctional sequences in accordance with embodiments of the present disclosure.
FIG. 10 depicts a graph of predicting minimum edit distances of a biofunctional sequence relative to sequences in an original training dataset using the methods and systems provided herein.
FIG. 11 depicts a representation of predicting sequences having biological functions in two-dimensional space using the methods and systems provided herein.
FIG. 12A depicts a non-limiting example of a migration learning method using LPLM inserts to predict gene activators from sequences.
Fig. 12B depicts a non-limiting example of a fitness view representing the effective search space of MHMCS (pink) and EMCS (blue). MHMCS search locally for regions of high fitness near the starting molecule, while EMCS interpolates between starting points and changes the search speed to optimize the search.
Fig. 13 depicts Principal Component Analysis (PCA) of the original training set with new sequences designed by using OneHot-encoded EMCS and MHMCS.
Fig. 14A depicts the entropy change distribution for 107 MHMCS and EMCS initial iterations using default parameters. From an informative perspective, EMCS explores a larger region of the fitness space per iteration than MHMCS.
Fig. 14B depicts i) iterations from random predetermined sequences to convergence (f≥0.95) for 2361 sequences obtained by running 2571 times MHMCS at t=2.5x10 -3、10-4, ii) iterations from 1171 sequences obtained by running 1171 times EMCS at default parameters to convergence, iii) the number of iterations to convergence and/or positive hits (f≥0.5) for 2571 sequences of 2571 times MHMCS running at t=2.5x10 -3、10-4. 210 of 2571 sequences failed to converge (but successfully produced a positive hit (f > 0.5)), iv) 2720 sequences obtained by running 1171 EMCS under default parameters reached an iteration of converging and/or positive hits, yielding an average of 2.32 positive hits per EMCS run of 4 chains.
Fig. 15 depicts a schematic of fitness graph exploration using novel EMCS, which includes domain exchange between peptide chains. The EMCS algorithm comprises the following components of 1) parallel Metropolis-Hastings Monte Carlo (MHMCS) operation, 2) temperature gradient implementation (parallel tempering), and 3) domain exchange between peptide chains (EMCS) parallel operation.
FIGS. 16A-16B depict the entropy change distribution and convergence time of MHMCS and EMCS iterations. For ablation studies, PTP (parallel tempering) and EMC-NPT (EMCs without parallel tempering) were run.
Fig. 17 depicts Fluorescence Activated Cell Sorting (FACS) histograms from engineered gene effector validation experiments. The ability of the engineered gene effectors encoded by 4600 new sequences to activate synthetic gene loci was tested. 357 out of 4600 engineered gene effectors (hit rate 7.51%) significantly activated the gene reporter compared to background fluorescence.
Fig. 18A-18C depict biochemical and structural characterization analysis of experimentally verified functional biological sequences using ESMFold.
FIGS. 19A-19B depict engineered gene effector validation assays performed in HEK293 cells. 10 engineered gene effectors (SEQ ID NOS: 1495, 1592, 1595, 1634, 1654, 1665, 1677, 1686, 1689, 1716) were screened at the synthesis site (TRE 3G) and the endogenous site (CD 45), respectively. The activation potency of the engineered gene effector was compared to the activation potency of the standard activators VP64 and vCD.
FIG. 20 depicts a non-limiting computer system programmed or otherwise configured to implement the methods provided herein.
Fig. 21A and 21B are schematic diagrams illustrating a non-limiting embodiment of a computer-implemented method of the present disclosure.
Detailed Description
SUMMARY
Aspects of the disclosure may provide engineered effectors (or engineered gene effectors, used interchangeably herein) capable of modulating (e.g., activating or reducing) the expression or activity level of a target gene (e.g., an endogenous target gene, a heterologous target gene, etc.) in a cell, compositions, combinations, systems, and methods of using the same. Such engineered effectors may work with a heterologous endonuclease (e.g., an engineered CRISPR/Cas nuclease or inactive variant thereof), e.g., to affect manipulation of expression or activity levels of a target gene in a cell, e.g., for treating or ameliorating a disorder (e.g., a disease) in a subject. Gene expression can support a variety of physiological and pathological effects in cells and tissues, leading to a number of diseases and disorders, and thus compositions, combinations, systems and methods utilizing engineered gene effectors of the present disclosure can modulate expression of specific genes in a desired manner, thereby having therapeutic benefits.
CRISPR-mediated transcriptional regulation has a wide range of potential applications in synthetic biology and gene therapy. Transcriptional activation in eukaryotes depends on complex interactions of a variety of factors including DNA binding transcription factors, coactivators, chromatin remodeling factors, and underlying transcription mechanisms. These multiple inputs typically exhibit a synergistic relationship whereby the transcriptional output driven by two or more factors is greater than the sum of the outputs driven by each factor individually. Preliminary screening has found hundreds of small peptides (85 aa) derived from endogenous human, viral and archaeal proteins that, when fused to programmable DNA in combination with dCas protein, activate transcription to varying degrees.
Described herein are platforms for screening pairwise combinations of these activation domains in an inducible and reversible manner for increasing activity determined by both the magnitude and duration of transcriptional output. The screen identified about 1400 new combinations and the activating factors of these combinations showed stronger activity than their constituent parts. These combined activators are strongest when one of the chaperone blocks has a viral origin. Machine learning methods are further described herein to identify a set of chaperone blocks that best predicts strong combined activators, including classical viral activators (e.g., VP 64), novel viral activators (e.g., vIRF2/vIRF 4), and human activators (e.g., LEUTX).
Analysis of biochemical and biophysical characteristics is provided herein, indicating that strong combinations have a high negative electrostatic potential and tend to have structural flexibility. Computer-simulated structural predictions of previous hits show that stable intramolecular interactions between helices can stabilize strongly combined interaction interfaces and can be functionally related to transcriptional activation. Further provided herein are new kits of tools for combining transcription regulatory factors with potential that have not yet been developed, as well as platforms for discovering additional tools for use in both basic research and therapy.
In some embodiments, the engineered gene effectors of the present disclosure include a fusion of two different gene effectors, e.g., when used with a heterologous endonuclease, the fusion provides for regulation of gene expression. In some embodiments, modulation of gene expression by a fusion of two different gene effectors provides a superior effect (e.g., a prolonged duration of modulated expression) compared to the effect provided by each component alone
Designing new protein sequences is still a slow and expensive process due to various protein engineering challenges, and in particular, the number of protein variants that can be experimentally tested in a given assay is dwarfed compared to the broad overall sequence space (which results in low hit rates and expensive wet laboratory test cycles). Provided herein are computer-implemented methods and systems for producing functional biological sequences (e.g., engineered gene effectors). In some embodiments, computer-implemented methods and systems accelerate the discovery of new functional sequences (e.g., new functional amino acid sequences). In some embodiments, computer-implemented methods and systems generate new biofunctional sequences in a time-and cost-effective manner.
Terminology
As used herein, the term "heterologous" when used in reference to a polypeptide sequence or nucleic acid sequence means that the polypeptide sequence or nucleic acid sequence (1) is placed (e.g., in the context of a cell, virus, or fusion polypeptide molecule, or fusion polynucleotide molecule, etc.) where it is not normally present (e.g., not normally present in nature), or (2) includes two or more subsequences that have a relationship different from that normally present in nature. For example, a polypeptide may comprise a first polypeptide sequence and a second polypeptide sequence that do not occur together in a single polypeptide in nature, and thus the first polypeptide sequence and the second polypeptide sequence may be heterologous to each other. In another example, a polynucleotide may comprise a first polynucleotide sequence and a second polynucleotide sequence that do not occur together in a single polynucleotide in nature, and thus the first polynucleotide sequence and the second polynucleotide sequence may be heterologous to each other.
The term "cell" generally refers to a biological cell. The cells may be the basic structural, functional and/or biological units of an organism. The cells may be derived from any organism having one or more cells. Some non-limiting examples include prokaryotic cells, eukaryotic cells, bacterial cells, archaebacterial cells, unicellular eukaryotic cells, protozoal cells, cells from plants (e.g., cells from crops, fruits, vegetables, grains, soybeans, corn, maize, wheat, seeds, tomatoes, rice, tapioca, sugarcane, pumpkin, hay, potatoes, cotton, hemp, tobacco, flowering plants, conifers, gymnosperms, ferns, pinus, goldfish algae, liverwort, moss cells), algal cells (e.g., bronzes (Botryococcus braunii), chlamydomonas reinharderia (Chlamydomonas reinhardtii), nannochloropsis gaditana, pyrenoids (Chlorella pyrenoidosa), sargassum (Sargassum) c. Agardh, etc.), seaweed (e.g., kelp), fungal cells (e.g., yeast cells, cells from mushrooms, animal cells, cells from invertebrates (e.g., flies, spines, echinoderms, nematodes, etc.), vertebrate cells (e.g., fish, reptiles, animal cells, rodent, rat, mouse, etc.), non-human, etc. Sometimes, the cells are not derived from a natural organism (e.g., the cells may be synthetically manufactured, sometimes referred to as artificial cells).
As used herein, the term "nucleotide" generally refers to a base-sugar-phosphate composition. The nucleotides may comprise synthetic nucleotides. The nucleotide may comprise a synthetic nucleotide analogue. Nucleotides may be monomeric units of nucleic acid sequences such as deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The term nucleotide may include ribonucleoside triphosphates (adenosine triphosphate (ATP), uridine Triphosphate (UTP), cytidine Triphosphate (CTP), guanosine Triphosphate (GTP)) and deoxyribonucleoside triphosphates (e.g., dATP, dCTP, dITP, dUTP, dGTP, dTTP) or derivatives thereof. Such derivatives may include, for example, [ αS ] dATP, 7-deaza-dGTP, and 7-deaza-dATP, as well as nucleotide derivatives that confer nuclease resistance on nucleic acid molecules containing them. The term nucleotide as used herein may refer to dideoxyribonucleoside triphosphates (ddntps) and their derivatives. Illustrative examples of dideoxyribonucleoside triphosphates may include, but are not limited to ddATP, ddCTP, ddGTP, ddITP and ddTTP. The nucleotides may be unlabeled or may be detectably labeled by known techniques. Marking may also be implemented with quantum dots. Detectable labels may include, for example, radioisotopes, fluorescent labels, chemiluminescent labels, bioluminescent labels, and enzyme labels. Fluorescent labels for nucleotides may include, but are not limited to, fluorescein, 5-carboxyfluorescein (FAM), 2',7' -dimethoxy-4 ',5' -dichloro-6-carboxyfluorescein (JOE), rhodamine, 6-carboxyrhodamine (R6G), N, N, N ', N' -tetramethyl-6-carboxyrhodamine (TAMRA), 6-carboxy-X-Rhodamine (ROX), 4- (4 '-dimethylaminophenylazo) benzoic acid (DABCYL), cascade Blue, oregon Green, texas Red, cyanine, and 5- (2' -aminoethyl) aminonaphthalene-1-sulfonic acid (EDANS). Specific examples of fluorescent-labeled nucleotides may include [R6G]dUTP、[TAMRA]dUTP、[R110]dCTP、[R6G]dCTP、[TAMRA]dCTP、[JOE]ddATP、[R6G]ddATP、[FAM]ddCTP、[R110]ddCTP、[TAMRA]ddGTP、[ROX]ddTTP、[dR6G]ddATP、[dR110]ddCTP、[dTAMRA]ddGTP and [ dROX ] ddTTP available from PERKIN ELMER (Foster City, calif.), fluoroLink deoxynucleotides available from Amersham (Arlington Heights, ill.), fluoroLink Cy3-dCTP, fluoroLink Cy5-dCTP, FluoroLink Fluor X-dCTP, fluoroLink Cy-dUTP and FluoroLink Cy-dUTP, fluorescein-15-dATP, fluorescein-12-dUTP, tetramethyl-rhodamine-6-dUTP, IR770-9-dATP, fluorescein-12-ddUTP, fluorescein-12-UTP and fluorescein-15-2' -dATP obtainable from Boehringer Mannheim (Indianapolis, ind.), and the chromosome-tagged nucleotide 、BODIPY-FL-14-UTP、BODIPY-FL-4-UTP、BODIPY-TMR-14-UTP、BODIPY-TMR-14-dUTP、BODIPY-TR-14-UTP、BODIPY-TR-14-dUTP、Cascade Blue-7-UTP、Cascade Blue-7-dUTP、 fluorescein-12-UTP, available from Molecular Probes (Eugene, oreg.), fluorescein-12-dUTP, oregon Green 488-5-dUTP, rhodamine Green-5-dUTP, tetramethyl rhodamine-6-UTP, tetramethyl rhodamine-6-dUTP, texas Red-5-dUTP, and Texas Red-12-dUTP. nucleotides may also be labeled or tagged by chemical modification. The chemically modified mononucleotide may be biotin-dNTP. Some non-limiting examples of biotinylated dNTPs may include biotin-dATP (e.g., bio-N6-ddATP, biotin-14-dATP), biotin-dCTP (e.g., biotin-11-dCTP, biotin-14-dCTP), and biotin-dUTP (e.g., biotin-11-dUTP, biotin-16-dUTP, biotin-20-dUTP).
The terms "polynucleotide", "oligonucleotide" or "nucleic acid" are used interchangeably herein to refer generally to a polymeric form of nucleotides of any length, whether deoxyribonucleotides or ribonucleotides, or analogs thereof, whether in single-stranded, double-stranded or multi-stranded form. Polynucleotides may be exogenous or endogenous to the cell. The polynucleotide may be present in a cell-free environment. The polynucleotide may be a gene or fragment thereof. The polynucleotide may be DNA. The polynucleotide may be RNA. Polynucleotides may have any three-dimensional structure and may perform any function, known or unknown. Polynucleotides may comprise one or more analogs (e.g., engineered backbones, sugars, or nucleobases). Modification (if present) of the nucleotide structure may be performed before or after assembly of the polymer. Some non-limiting examples of analogs include 5-bromouracil, peptide nucleic acids, xenogenic nucleic acids (xeno nucleic acid), morpholinos, locked nucleic acids, ethylene glycol nucleic acids, threose nucleic acids, dideoxynucleotides, cordycepin, 7-deaza-GTP, fluorophores (e.g., sugar linked rhodamine or fluorescein), sulfhydryl-containing nucleotides, biotin linked nucleotides, fluorescent base analogs, cpG islands, methyl-7-guanosine, methylated nucleotides, inosine, thiouridine (thiouridine), pseudouridine, dihydrouridine, queuosine, and wyosine. Non-limiting examples of polynucleotides include coding or non-coding regions of genes or gene fragments, loci defined by linkage analysis (locus), exons, introns, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), short interfering RNA (siRNA), short hairpin RNA (shRNA), microRNA (miRNA), ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, cell-free polynucleotides including cell-free DNA (cfDNA) and cell-free RNA (cfRNA), nucleic acid probes, and primers. The nucleotide sequence may be interrupted by non-nucleotide components.
The term "sequence identity" generally refers to the exact nucleotide-to-nucleotide, or amino acid-to-amino acid correspondence of two polynucleotide sequences or polypeptide sequences, respectively. In general, techniques for determining sequence identity include determining the nucleotide sequence of a polynucleotide and/or determining the amino acid sequence encoded thereby, and comparing these sequences to a second nucleotide sequence or amino acid sequence. By determining their "percent identity", two or more sequences (polynucleotides or amino acids) can be compared. The percent identity of two sequences, whether nucleic acid sequences or amino acid sequences, is the number of exact matches between the two aligned sequences divided by the length of the longer sequence and multiplied by 100. The percent identity can also be determined, for example, by comparing sequence information using an advanced BLAST computer program available from the national institutes of health, including version 2.2.9. The BLAST program is based on the alignment of Karlin and Altschul, proc. Natl. Acad. Sci. USA, 87:2264-2268 (1990), and is discussed in Altschul et al, J. Mol. Biol., 215:403-410 (1990), karlin and Altschul, proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993), and Altschul et al, nucleic Acids Res., 25:3389-3402 (1997). This procedure can be used to determine the percent identity over the full length of the proteins being compared. Default parameters are provided, for example, in the blastp program to optimize retrieval of the short query sequence. The program also allows the use of SEG filters to mask fragments of query sequences determined by the SEG program of Wootton and Federhen, computers AND CHEMISTRY 17:149-163 (1993). The desired degree of sequence identity ranges from about 50% to 100% and integer values therebetween. In general, the present disclosure encompasses sequences having at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 98% sequence identity to any one of the sequences provided herein.
The term "gene" generally refers to a nucleic acid (e.g., DNA, such as genomic DNA and cDNA) and its corresponding nucleotide sequence that is involved in encoding an RNA transcript. The term used herein to refer to genomic DNA includes non-coding regions in the middle as well as regulatory regions, and may include the 5 'and 3' ends. In some uses, the term comprises transcribed sequences, including the 5 'untranslated region and the 3' untranslated region (5 '-UTR and 3' -UTR), exons, and introns. In some genes, the transcribed region will comprise an "open reading frame" encoding the polypeptide. In some uses of this term, a "gene" comprises only the coding sequences (e.g., an "open reading frame" or "coding region") necessary to encode a polypeptide. In some cases, the gene does not encode a polypeptide, such as a ribosomal RNA gene (rRNA) and a transfer RNA (tRNA) gene. In some cases, the term "gene" includes not only transcribed sequences, but also additionally non-transcribed regions comprising upstream and downstream regulatory regions, enhancers and promoters. A gene may refer to an "endogenous gene" or a native gene in a native location in the genome of an organism. Genes may be referred to as "exogenous genes" or non-native genes. Non-native genes may refer to genes that are not normally found in the host organism, but are introduced into the host organism by gene transfer. Non-native genes may also refer to genes that are not naturally located in the genome of an organism. Non-native genes may also refer to naturally occurring nucleic acid or polypeptide sequences (e.g., non-native sequences) that comprise mutations, insertions, and/or deletions.
The term "deletion" generally refers to the removal (or loss) of one or more (or a specified number of) amino acids (e.g., contiguous or non-contiguous amino acids) from a polypeptide sequence, or the removal (or loss) of one or more (or a specified number of) nucleobases (e.g., contiguous or non-contiguous nucleobases) from a polynucleotide sequence (e.g., a polynucleotide sequence encoding a polypeptide sequence). The term "internal deletion" generally refers to a deletion that does not include the N-or C-terminus of a polypeptide or the 5 'or 3' end of a polynucleotide. Deletions (e.g., internal deletions) may be identified by comparison to a reference sequence, such as by specifying the start and end positions of the deletion relative to the reference sequence. Deletions (e.g., internal deletions) are different and distinct from substitutions. For example, a deletion of at least one amino acid does not be followed by insertion of at least one different amino acid at the same position as the at least one amino acid as compared to a reference polypeptide sequence, such that the size (e.g., number of amino acid residues) of a modified (or engineered) polypeptide sequence comprising the deletion of at least one amino acid is smaller than the size of the at least one amino acid that has been deleted compared to the reference polypeptide sequence.
The term "expression" generally refers to one or more processes of transcription from a DNA template into a polynucleotide (e.g., into mRNA or other RNA transcript), and/or to the subsequent translation of the transcribed mRNA into a peptide, polypeptide, or protein. Transcripts and encoded polypeptides may be collectively referred to as "gene products". If the polynucleotide is derived from genomic DNA, expression may include splicing of mRNA in eukaryotic cells. In terms of expression, "up-regulated" generally refers to an increase in the level of expression of a polynucleotide (e.g., RNA, such as mRNA) and/or polypeptide sequence relative to its level of expression in a wild-type state, and "down-regulated" generally refers to a decrease in the level of expression of a polynucleotide (e.g., RNA, such as mRNA) and/or polypeptide sequence relative to its level of expression in a wild-type state. Expression of the transfected gene may occur transiently or stably in the cell. During "transient expression", the transfected gene is not transferred to daughter cells during cell division. Since its expression is limited to transfected cells, the expression of the gene disappears over time. In contrast, stable expression of the transfected gene can occur when the gene is co-transfected with another gene that confers selective advantage on the transfected cell. Such a selection advantage may be resistance to a certain toxin presented to the cell.
The term "expression profile" generally refers to the quantitative (e.g., abundance) and qualitative expression of one or more genes in a sample (e.g., a cell). The one or more genes may be expressed and identified in the form of a nucleic acid molecule (e.g., mRNA or other RNA transcript). Alternatively or additionally, the one or more genes may be expressed and determined in the form of a polypeptide (e.g., a protein determined by western blotting). The expression profile of a gene may be defined as the shape of the level of expression of the gene over a period of time (e.g., at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 3 hours, at least or up to about 4 hours, at least or up to about 5 hours, at least or up to about 6 hours, at least or up to about 7 hours, at least or up to about 8 hours, at least or up to about 9 hours, at least or up to about 10 hours, at least or up to about 11 hours, at least or up to about 12 hours, at least or up to about 16 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 36 hours, at least or up to about 48 hours, at least or up to about 3 days, at least or up to about 4 days, at least or up to about 5 days, at least or up to about 6 days, at least or up to about 7 days, at least or up to about 8 days, at least or up to about 9 days, at least or up to about 10 days, at least or up to about 11 days, at least or up to about 12 days, at least or up to about 13 days, at least or up to about 14 days, etc.). Or the expression profile of a gene may be defined as the expression level of the gene at a time point of interest (e.g., the expression level of the gene measured after treatment of a cell to induce such expression level for at least or at most about 1 hour, at least or at most about 2 hours, at least or at most about 3 hours, at least or at most about 4 hours, at least or at most about 5 hours, at least or at most about 6 hours, at least or at most about 7 hours, at least or at most about 8 hours, at least or at most about 9 hours, at least or at most about 10 hours, at least or at most about 11 hours, at least or at most about 12 hours, at least or at most about 16 hours, at least or at most about 18 hours, at least or at most about 24 hours, at least or at most about 36 hours, at least or at most about 48 hours, at least or at most about 3 days, at least or at most about 4 days, at least or at most about 5 days, at least or at most about 6 days, at least or at most about 7 days, at least or at most about 8 days, at least or at least about 9 days, at least or at least about 10 days, at least or at least about 11 days, or at least about 13 days).
The terms "peptide", "polypeptide" or "protein" are used interchangeably herein to generally refer to a polymer of at least two amino acid residues joined by peptide bonds. The term does not imply a particular length of polymer nor is it intended to imply or distinguish whether the peptide is produced using recombinant techniques, chemical synthesis or enzymatic synthesis, or is naturally occurring. These terms apply to naturally occurring amino acid polymers and amino acid polymers comprising at least one modified amino acid. In some cases, the polymer may be interrupted by non-amino acids. These terms include amino acid chains of any length, including full-length proteins, as well as proteins with or without secondary and/or tertiary structures (e.g., domains). These terms also encompass modified amino acid polymers, for example, modified by disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, oxidation, and any other manipulation (e.g., conjugation to a labeling element). As used herein, the terms "amino acid" and "amino acids" generally refer to natural and unnatural amino acids, including, but not limited to, modified amino acids and amino acid analogs. Modified amino acids may include natural amino acids and unnatural amino acids that have been chemically modified to include groups or chemical moieties on the amino acid that do not occur naturally. Amino acid analogs may refer to amino acid derivatives. The term "amino acid" includes D-amino acids and L-amino acids.
With respect to polypeptides, the terms "derivative," "variant," or "fragment" as used herein generally refer to polypeptides that are related to wild-type polypeptides, e.g., by amino acid sequence, structure (e.g., secondary structure and/or tertiary structure), activity (e.g., enzymatic activity), and/or function. Derivatives, variants, and fragments of the polypeptides may comprise one or more amino acid variations (e.g., mutations, insertions, and deletions), truncations, modifications, or combinations thereof, as compared to the wild-type polypeptide.
With respect to polypeptide molecules (e.g., proteins), the term "engineered," "chimeric," or "recombinant" as used herein generally refers to polypeptide molecules having a heterologous amino acid sequence or an altered amino acid sequence as a result of the application of genetic engineering techniques to the nucleic acid encoding the polypeptide molecule, as well as to the cell or organism in which the polypeptide molecule is expressed. With respect to polynucleotide molecules (e.g., DNA or RNA molecules), the term "engineered" or "recombinant" as used herein generally refers to polynucleotide molecules having a heterologous nucleic acid sequence or altered nucleic acid sequence as a result of the application of genetic engineering techniques. Genetic engineering techniques include, but are not limited to, PCR and DNA cloning techniques, transfection, transformation, and other gene transfer techniques, homologous recombination, site-directed mutagenesis, and gene fusion. In some cases, an engineered or recombinant polynucleotide (e.g., a genomic DNA sequence) may be modified or altered by a gene editing portion. For example, a heterologous endonuclease disclosed herein (e.g., an engineered Cas protein) is not a naturally occurring nuclease (e.g., is not a naturally occurring Cas protein). In another example, the engineered gene effector disclosed herein is not a naturally occurring gene effector.
For example, the engineered nucleases (e.g., engineered Cas proteins) disclosed herein are not naturally occurring nucleases (e.g., are not naturally occurring Cas proteins). The terms "engineered nuclease" and "engineered nuclease variant" are used interchangeably herein.
The terms "engineered" and "modified" are used interchangeably herein. The terms "engineering" and "modification" are used interchangeably herein. The terms "engineered cell" and "modified cell" are used interchangeably herein. The terms "engineered feature" and "modified feature" are used interchangeably herein.
The terms "enhancing expression," "increasing expression," or "up-regulating expression" generally refer to the production of a portion of interest (e.g., a polynucleotide or polypeptide) to a level that is higher than the normal expression level of the portion of interest in a host strain (e.g., host cell). The normal expression level may be substantially zero (or null) or above zero. The moiety of interest may comprise an endogenous gene or polypeptide construct of the host strain. The moiety of interest may comprise a heterologous gene or polypeptide construct introduced into or into the host strain. For example, a heterologous gene encoding a polypeptide of interest may be knocked-in (KI) to the genome of a host strain to enhance expression of the polypeptide of interest in the host strain.
The terms "enhancing activity", "increasing activity" or "upregulating activity" generally refer to the modification of the activity of a moiety of interest (e.g., a polynucleotide or polypeptide) to a level that is higher than the normal activity level of the moiety of interest in a host strain (e.g., host cell). The normal activity level may be substantially zero (or null) or above zero. The moiety of interest may comprise a polypeptide construct of the host strain. The moiety of interest may comprise a heterologous polypeptide construct introduced into or into the host strain. For example, a heterologous gene encoding a polypeptide of interest may be knocked-in (KI) to the genome of a host strain to enhance the activity of the polypeptide of interest in the host strain.
The terms "reduce expression," "reduce expression," or "downregulate expression" generally refer to the production of a portion of interest (e.g., a polynucleotide or polypeptide) to a level below the normal expression level of the portion of interest in a host strain (e.g., host cell). The normal expression level is above zero. The moiety of interest may comprise an endogenous gene or polypeptide construct of the host strain. In some cases, the moiety of interest may be knocked out or knocked down in the host strain. In some examples, reducing expression of the moiety of interest may include completely inhibiting such expression in the host strain.
The terms "decreasing activity", "reducing activity" or "down-regulating activity" generally refer to the modification of the activity of a moiety of interest (e.g., a polynucleotide or polypeptide) to a level below the normal activity level of the moiety of interest in a host strain (e.g., host cell). The normal activity level is above zero. The moiety of interest may comprise an endogenous gene or polypeptide construct of the host strain. In some cases, the moiety of interest may be knocked out or knocked down in the host strain. In some examples, the reduced activity of the moiety of interest may include completely inhibiting such activity in the host strain.
The terms "subject", "individual" or "patient" are used interchangeably herein to refer generally to a vertebrate, preferably a mammal, such as a human. Mammals include, but are not limited to, murine animals, apes, humans, domestic animals, sports animals, and pets. Tissues, cells, and their progeny of biological entities obtained in vivo or cultured in vitro are also contemplated.
The term "treatment" or "treatment" generally refers to a method of achieving a beneficial or desired result, including but not limited to a therapeutic benefit and/or a prophylactic benefit. For example, treatment may include administration of a system or cell population as disclosed herein. Therapeutic benefit refers to any treatment-related improvement or effect on one or more diseases, disorders or symptoms in treatment. For a prophylactic benefit, the composition may be administered to a subject at risk of developing a particular disease, disorder, or symptom, or to a subject reporting one or more physiological symptoms of the disease, even though the disease, disorder, or symptom may not have been manifested.
The term "effective amount" or "therapeutically effective amount" generally refers to an amount of a composition (e.g., a composition comprising a heterologous polypeptide, a heterologous polynucleotide, and/or a modified cell (e.g., a modified stem cell)) sufficient to produce a desired activity when administered to a subject in need thereof. In the context of the present disclosure, the term "therapeutically effective" generally refers to an amount of a composition sufficient to delay manifestation, prevent progression, alleviate or mitigate at least one symptom of a disorder treated by the methods of the present disclosure.
The term "at least," "greater than," or "greater than or equal to" when used in reference to a first value in a series of two or more values applies to each value in the series of values. For example, 1, 2, or 3 or more is equivalent to 1 or more, 2 or more, or 3 or more.
When the term "no more," "less than," or "less than or equal to" precedes the first value in a series of two or more values, the term "no more," "less than," or "less than or equal to" applies to each value in the series of values. For example, less than or equal to 3, 2, or 1 is equivalent to less than or equal to 3, less than or equal to 2, or less than or equal to 1.
The term "about" or "approximately" generally means within an acceptable error range for a particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, "about" may mean within 1 or more than 1 standard deviation, as is conventional in the art. Or "about" may mean a range of no more than 20%, no more than 10%, no more than 5%, or no more than 1% of a given value. Or in particular for biological systems or processes, the term may mean within an order of magnitude of the value, preferably within a factor of 5, and more preferably within a factor of 2. In describing particular values in the present application and claims, the term "about" should be construed to mean within an acceptable error range for the particular value unless otherwise indicated.
The use of alternatives (e.g., "or") is understood to mean one, two, or any combination thereof. The term "and/or" is understood to mean one or both of the alternatives.
Engineered gene effector
In some aspects, the present disclosure provides compositions, combinations, systems, and methods that utilize engineered gene effectors. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-95 amino acids in length and a second peptide that is 75-95 amino acids in length, the second peptide being heterologous to the first peptide. In some embodiments, the engineered gene effector comprises a first peptide that is 75-95 amino acids in length and a second peptide that is 75-95 amino acids in length, the second peptide being heterologous to the first peptide, wherein the first peptide is different from the second peptide. In some embodiments, the first peptide and/or the second peptide is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length, or optionally, the first peptide and/or the second peptide is in a range defined by any two of the above lengths (e.g., 76-94, 77-93, 78-92, 79-91, 80-90, 82-98, etc.).
In some aspects, the present disclosure provides compositions, combinations, systems, and methods that utilize engineered gene effectors. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length, the second peptide being heterologous to the first peptide. In some embodiments, the engineered gene effector comprises a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length, the second peptide being heterologous to the first peptide, wherein the first peptide is different from the second peptide. In some embodiments, the first peptide and/or the second peptide is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109 amino acids in length, or optionally, the length is within a range defined by any two of the above lengths (e.g., 76-94, 77-93, 78-92, 79-91, 80-90, 82-98, 80-109, 85-108, etc.).
In some embodiments, the engineered gene effector of the present disclosure is capable of activating a target gene in a cell when expressed and effectively targeted to a site of the target gene in the cell. In some aspects, the present disclosure provides compositions, combinations, systems, and methods that utilize engineered gene effectors. In some embodiments, the engineered gene effector of the present disclosure comprises a polypeptide comprising a first peptide 75-95 (or 75-110) amino acids in length and a second peptide 75-95 (or 75-110) amino acids in length, the second peptide being heterologous to the first peptide, wherein the engineered gene effector is capable of activating a target gene in a cell when expressed in the cell and effectively targeted to a site of the target gene. In some embodiments, the first peptide and/or the second peptide is based on a human transcriptional regulator. In some embodiments, the first peptide and the second peptide are based on a human transcriptional regulator. In some embodiments, the first peptide and/or the second peptide is based on a viral transcriptional regulator. In some embodiments, the first peptide and the second peptide are based on a viral transcriptional regulator. In some embodiments, the first peptide is based on a human transcriptional regulator and the second peptide is based on a viral transcriptional regulator. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide is different from the second peptide in length. In some embodiments, the first peptide is the same length as the second peptide.
In some embodiments, the first peptide and/or the second peptide of the engineered gene effector has a beta factor of about 30 to about 65. In some embodiments, the first peptide has a beta factor of about 30, about 35, about 40, about 45, about 50, about 55, about 60, or about 65, optionally the beta factor of the first peptide is within a range defined by any two of the foregoing values (e.g., about 30 to about 40, about 35 to about 55, about 40 to about 65). In some embodiments, the second peptide has a beta factor of about 30, about 35, about 40, about 45, about 50, about 55, about 60, or about 65, optionally the beta factor of the second peptide is within a range defined by any two of the foregoing values (e.g., about 30 to about 40, about 35 to about 55, about 40 to about 65). In some embodiments, the engineered gene effector of the present disclosure comprises a first peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and a second peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length, wherein the first peptide and/or the second peptide has a beta factor of about 30, about 35, about 40, about 45, about 50, about 55, about 60, or about 65. In some embodiments, the engineered gene effectors of the present disclosure comprise a first peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and a second peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 94, or 95 amino acids in length, wherein the beta factor of the first peptide and/or the second peptide is within a range defined by any two of the foregoing values (e.g., about 30 to about 40, about 35 to about 55, about 40 to about 65).
In some embodiments, the first peptide and/or the second peptide of the engineered gene effector is enriched in a negative electrostatic potential. In some embodiments, the first peptide and the second peptide are enriched in a negative electrostatic potential. In some embodiments, the engineered gene effector comprises a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and a second peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, or 95 amino acids in length, wherein the first peptide and/or the second peptide is enriched in negative electrostatic potential. In some embodiments, the engineered gene effector comprises a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length and a second peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 88, 89, 90, 91 or 95 amino acids in length, wherein the first peptide is enriched in negative electrostatic potential and the second peptide is enriched in negative electrostatic potential.
In some embodiments, the first peptide and/or the second peptide of the engineered gene effector has a negative net charge. In some embodiments, the first peptide and the second peptide of the engineered gene effector have a negative net charge. In some embodiments, the engineered gene effector comprises a first peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and a second peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 94, or 95 amino acids in length, wherein the first peptide and/or the second peptide has a negative net charge. In some embodiments, the engineered gene effector comprises a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length and a second peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91 or 95 amino acids in length, wherein the first peptide has a negative net charge and the second peptide has a negative net charge.
TABLE 3 amino acid sequences of individual peptides
SEQ ID NO Amino acid sequence
1 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK
2 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE
3 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT
4 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP
5 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF
6 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN
7 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ
8 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV
9 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF
10 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS
11 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL
12 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA
13 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP
14 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP
15 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL
16 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI
17 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV
18 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN
19 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG
20 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM
21 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV
22 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR
23 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF
24 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH
25 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY
26 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY
27 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG
28 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV
29 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL
30 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ
31 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN
32 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY
33 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS
34 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL
35 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK
36 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD
37 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA
38 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL
39 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK
40 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL
41 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI
42 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS
43 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP
44 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML
45 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL
46 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ
47 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS
48 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM
49 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY
50 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH
51 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV
52 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL
53 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI
54 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL
55 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE
56 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS
57 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES
58 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER
59 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN
60 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK
61 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ
62 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP
63 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV
64 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP
65 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL
66 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE
67 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT
68 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL
69 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC
70 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD
71 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET
72 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR
73 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF
74 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ
75 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG
76 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL
77 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR
78 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS
79 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE
80 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD
81 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN
82 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK
83 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT
84 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS
85 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL
86 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL
87 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG
88 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL
89 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL
90 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ
91 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT
92 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD
93 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV
94 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK
95 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS
96 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA
97 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS
98 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH
99 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA
100 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising any one of the sequences provided in table 3 and a second peptide that is heterologous to the first peptide and comprises any one of the sequences provided in table 3. In some embodiments, the first peptide is 75-110 amino acids in length. In some embodiments, the second peptide is 75-110 amino acids in length. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising any one of SEQ ID NOS: 3-100 in length of 75-110 amino acids and a second peptide comprising any one of SEQ ID NOS: 3-100 in length of 75-110 amino acids, the second peptide being heterologous to the first peptide. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising any one of SEQ ID NOS 3-33 and 35-100 in length of 75-95 amino acids and a second peptide comprising any one of SEQ ID NOS 3-33 and 35-100 in length of 75-95 amino acids, the second peptide being heterologous to the first peptide. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length and a second peptide, the first peptide comprising any one of SEQ ID NOs 3-33 and 35-100 and the second peptide being 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length, the second peptide being heterologous to the first peptide and the second peptide comprising any one of SEQ ID NOs 3-33 and 35-100. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide is different from the second peptide in length. In some embodiments, the first peptide is the same length as the second peptide. In some embodiments, the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising a sequence that is, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any of the sequences provided in table 3 and a second peptide that is heterologous to the first peptide and comprises a sequence that is, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any of the sequences provided in table 3. In some embodiments, the first peptide comprises a sequence that is identical to any one of SEQ ID NOs 3-100 by, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%, or optionally, the first peptide comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 3-100. In some embodiments, the second peptide comprises a sequence that is identical to any one of SEQ ID NOs 3-100 by, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%, or optionally, the second peptide comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 3-100. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide has the above percentage of sequence identity to any one of SEQ ID NOS.3-100, which is different from the sequence of any one of SEQ ID NOS.3-100 having the above percentage of sequence identity to the second peptide. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
In some embodiments, the engineered gene effector comprises a first peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length and comprises a sequence that is about, or at most 85%, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95% identical to any of SEQ ID NOs 3-33 and 35-100, and a second peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length and comprises a sequence that is about, or at most 85%, 86, 87, 88, 89, 90, 96, 97, 98, 99% or 100% identical to any of SEQ ID NOs 3-33, 89, 90, 96, 97, 98%, 99% or at least one of the second peptide in length and at least about 85, 86, 96, 95% or at least 95% identical to any of SEQ ID NOs. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide has the above percentage of sequence identity to any one of SEQ ID NOS 3-33 and SEQ ID NOS 35-100, which is different from the sequence of any one of SEQ ID NOS 3-33 and SEQ ID NOS 35-100 having the above percentage of sequence identity to the second peptide. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide is different from the second peptide in length. In some embodiments, the first peptide is the same length as the second peptide. In some embodiments, the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
In some embodiments, the first peptide and/or the second peptide is 85 amino acids in length. In some embodiments, the first peptide and the second peptide are each 85 amino acids in length. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising any one of SEQ ID NOS 3-33 and 35-100 in length and a second peptide comprising any one of SEQ ID NOS 3-33 and 35-100 in length that is heterologous to the first peptide. In some embodiments, the first peptide is different from the second peptide. In some embodiments, the first peptide is 85 amino acids in length and comprises a sequence that is about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NOS: 3-33, SEQ ID NOS: 35-100, or optionally the first peptide comprises a percentage of identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98% -100%, etc.). In some embodiments, the second peptide is 85 amino acids in length and comprises a sequence that is about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NOs: 3-100, SEQ ID NOs: 35-100, or optionally the second peptide comprises a percentage identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98% -100%, etc.). In some embodiments, the engineered gene effector comprises a first peptide of 85 amino acids in length comprising a sequence that is about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any one of SEQ ID NOS 3-33, 35-100, and a second peptide of 85 amino acids in length, the second peptide being heterologous to the first peptide, the second peptide comprising a sequence that is 3-33, Any of SEQ ID NOs 35-100 have, about, at least or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical sequences. In some embodiments, the first peptide has a percentage of sequence identity as described above with any one of SEQ ID NOS 3-33, 35-100 that is different from the sequence of any one of SEQ ID NOS 3-33, 35-100 that has a percentage of sequence identity as described above with the second peptide. In some embodiments, the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
In some embodiments, the engineered gene effector comprises a first peptide that is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and comprises a sequence that is about, at least or up to 85%, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, or 100% identical to any of SEQ ID NOs 3-33 and 35-100, and a second peptide that is heterologous to the first peptide and comprises a sequence that is about, at least or up to 85%, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% identical to SEQ ID NO 34. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide 75-95 amino acids in length and having the sequence of any one of SEQ ID NOS: 3-33, SEQ ID NOS: 35-100 (which has 0, 1,2, or 3 amino acid residue mutations), and a second peptide heterologous to the first peptide and having the sequence of SEQ ID NO: 34 (which has 0, 1,2, or 3 amino acid residue mutations). In some embodiments, the amino acid residue is mutated to a conservative substitution. In some embodiments, the engineered gene effector comprises a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and comprises the sequence of any one of SEQ ID NOs 3-33 and 35-100, and a second peptide heterologous to the first peptide and comprising the sequence of SEQ ID NO 34. In some embodiments, the engineered gene effector comprises a first peptide that is 85 amino acids in length and comprises the sequence of any one of SEQ ID NOS: 3-33 and SEQ ID NOS: 35-100, and a second peptide that is heterologous to the first peptide and comprises the sequence of SEQ ID NO: 34. In some embodiments, the first peptide is located at the N-terminus of the second peptide. In some embodiments, the second peptide is located at the N-terminus of the first peptide.
In some embodiments, the first peptide comprises the sequence of any one of SEQ ID NOs 3-100, having 0, 1,2 or 3 amino acid residue mutations. In some embodiments, the second peptide comprises the sequence of any one of SEQ ID NOs 3-100, having 0, 1,2 or 3 amino acid residue mutations. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide of 75-110 amino acids in length and having the sequence of any one of SEQ ID NOS: 3-33, 35-100 (having 0, 1, 2, or 3 amino acid residue mutations), and a second peptide of 75-110 amino acids in length, the second peptide being heterologous to the first peptide and having any one of SEQ ID NOS: 3-100 (having 0, 1, 2, or 3 amino acid residue mutations). In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and having the sequence of any one of SEQ ID NOs 3-33, 35-100 (having 0, 1,2, or 3 amino acid residue mutations), and a second peptide 75, a polypeptide comprising a sequence of any one of SEQ ID NOs 3-33, 35-100, and a polypeptide comprising a sequence of any one of SEQ ID NOs, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids, said second peptide being heterologous to the first peptide and having any one of SEQ ID NOs 3-33, 35-100 (which has 0, 1,2 or 3 amino acid residue mutations). in some embodiments, the amino acid residue is mutated to a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and comprising any one of SEQ ID NOs 3-33, 35-100 (having a mutation of 0,1, 2, or 3 amino acid residues), and a second peptide, wherein any mutation thereof is a conservative substitution, and the second peptide is 75, in length, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids, said second peptide being heterologous to the first peptide and comprising any one of SEQ ID NOs 3-33, 35-100 (having 0,1, 2 or 3 amino acid residue mutations), wherein any mutation thereof is a conservative substitution. In some embodiments, the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
The first polypeptide and the second polypeptide may be in any suitable position relative to each other in the polypeptides. In some embodiments, the first peptide is located N-terminal to the second peptide in the polypeptide. In some embodiments, the second peptide is located N-terminal to the first peptide in the polypeptide. In some embodiments, the first peptide at the N-terminus of the second peptide comprises any one of SEQ ID NOS: 3-100, and the second peptide comprises any one of SEQ ID NOS: 3-100. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide comprises any one of SEQ ID NOs 3-100 and the second peptide is 75-110 amino acids in length, the second peptide being heterologous to the first peptide and the second peptide comprising any one of SEQ ID NOs 3-100. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide comprising any one of SEQ ID NOs 3-33, 35-100 and a second peptide comprising any one of SEQ ID NOs 3-35-100, the second peptide being 75, 76, 77, 78, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length, wherein the first peptide is 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94 or 95 amino acids in length, and the second peptide is heterologous to the first peptide, wherein the second peptide comprises any one of SEQ ID NOs 3-33, 35-100. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, the first peptide being N-terminal to the second peptide, wherein the first peptide is 85 amino acids in length, the first peptide comprises any one of SEQ ID NOS: 3-33, SEQ ID NOS: 35-100, the second peptide is 85 amino acids in length, and the second peptide is heterologous to the first peptide and comprises any one of SEQ ID NOS: 3-33, SEQ ID NOS: 35-100.
In some embodiments, the first peptide is located N-terminal to the second peptide in the polypeptide, and the first peptide and the second peptide are selected according to any one of the pair-wise arrangements of SEQ ID NOs of the first peptide and the second peptide listed in table 4. In some embodiments, the engineered gene effector comprises a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length, and wherein the sequences of the first peptide and the second peptide are selected according to any one of the SEQ ID NOs listed in table 4 in a paired arrangement of the SEQ ID NOs of the first peptide and the second peptide. In some embodiments, the engineered gene effector comprises a first peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, or 95 amino acids in length and a second peptide 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 94, or 95 amino acids in length, and wherein the sequences of the first peptide and the second peptide are selected according to any of the SEQ ID NOs in the pairwise arrangement of SEQ ID NOs of the first peptide and the second peptide listed in table 4. In some embodiments, the engineered gene effector comprises a first peptide that is 85 or 108 amino acids in length and a second peptide that is 85 or 108 amino acids in length, and wherein the sequences of the first peptide and the second peptide are selected according to any one of the pairwise permutations of SEQ ID NOs of the first peptide and the second peptide listed in table 4.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, the first peptide being located N-terminally to the second peptide, wherein the first peptide comprises a sequence that is identical to any one of SEQ ID No. 75, SEQ ID No. 4, SEQ ID No. 40, and SEQ ID No. 77, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally the first peptide comprises a sequence that is identical to any one of SEQ ID No. 75, SEQ ID No. 4, SEQ ID No. 40, and SEQ ID No. 77, having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), and the second peptide comprises a sequence that is identical to any one of SEQ ID No. 64, SEQ ID No. 44, SEQ ID No. 77, SEQ ID No. 17, 96%, at least about, 86%, 98%, 99%, or 100%, or any one of SEQ ID No. 75, 4, 40, and 77, having a percent identity within a range defined by any two of the foregoing values (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), and the second peptide comprises a sequence that is identical to any one of SEQ ID No. 64, SEQ ID No. 17, 96%, at least about, at least one of the sequence that is identical to any one of the two of the foregoing values, or at least one of the sequence, which is defined by any one of SEQ ID No. 95%, or by any of the sequence 98% -100%, etc.). In some embodiments, the first peptide comprises the sequence of any one of SEQ ID NO. 75, SEQ ID NO. 4, SEQ ID NO. 40, and SEQ ID NO. 77, and the second peptide comprises the sequence of any one of SEQ ID NO. 64, SEQ ID NO. 44, SEQ ID NO. 77, SEQ ID NO. 17, and SEQ ID NO. 96.
In some embodiments, the engineered gene effector comprises a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide and the second peptide are selected according to any one of the pair-wise arrangements of SEQ ID NOs of the first peptide and the second peptide listed in table a. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
TABLE A non-limiting examples of engineered Gene effector peptide combinations
A ID B ID Bar code The combined peptide SEQ ID NO: the first peptide SEQ ID NO: A second peptide SEQ ID NO:
hvTR_P10541 XV1.2 TGCCAGACGGTC _A_Concatenation GAGATTGTGTCC _B 1085 75 64
hvTR_G3G929 XV1.32 CTCACGACAAGA _A_Concatenation ATAACTCCACGC _B 122 4 44
hvTR_P10541 hvTR_Q2HR73 TGCCAGACGGTC _A_Concatenation ACTTGATGGTTT _B 1084 75 77
cds_NC_025350.1_cds_YP_009094074.1_1 LEUTX ATGACAAAACGA _A_Concatenation ATGACCCCTTGT _B 653 40 17
hvTR_Q2HR73 PRDM10 ACTTGATGGTTT _A_Concatenation AATCGATCCTAC _B 1099 77 96
hvTR_Q2HR73 XV1.2 ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 1107 77 64
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 75, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, a first peptide comprising a sequence that is identical to SEQ ID NO: 75 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values, and a second peptide comprising a sequence that is identical to SEQ ID NO: 64 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), or alternatively, a second peptide comprising a sequence that is identical to SEQ ID NO: 64 by a percentage (e.g., 85% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values. In some embodiments, the first peptide comprises the sequence of SEQ ID NO. 75 and the second peptide comprises the sequence of SEQ ID NO. 64. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 4, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%, or optionally, the first peptide comprises a sequence that is identical to SEQ ID NO: 4 by a percentage of identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values, and a second peptide comprising a sequence that is identical to SEQ ID NO: 44 by a percentage of identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), said second peptide comprising a sequence that is identical to SEQ ID NO: 44 by a percentage of identity (e.g., 85% -100%, 95% -100%, 98% -100%, etc.), or optionally, the second peptide comprises a sequence that is identical to SEQ ID NO: 44 by a percentage of identity within a range defined by any two of the foregoing values. In some embodiments, the first peptide comprises the sequence of SEQ ID NO. 4 and the second peptide comprises the sequence of SEQ ID NO. 44. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 75, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, a first peptide comprising a sequence that is identical to SEQ ID NO: 75 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values, and a second peptide comprising a sequence that is identical to SEQ ID NO: 77 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), or alternatively, a second peptide comprising a sequence that is identical to SEQ ID NO: 77 by a percentage (e.g., 85% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values. In some embodiments, the first peptide comprises the sequence of SEQ ID NO. 75 and the second peptide comprises the sequence of SEQ ID NO. 77. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 40, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%, or optionally, a first peptide comprising a sequence that is identical to SEQ ID NO: 40 by a percentage of identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range of any two of the foregoing values, and a second peptide comprising a sequence that is identical to SEQ ID NO: 17 by a percentage of identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), said second peptide comprising a sequence that is identical to SEQ ID NO: 17, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100%, or optionally, a second peptide comprising a sequence that is identical to SEQ ID NO: 17 within a range of any two of the foregoing values. In some embodiments, the first peptide comprises the sequence of SEQ ID NO. 40 and the second peptide comprises the sequence of SEQ ID NO. 17. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 77, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, a first peptide comprising a sequence that is identical to SEQ ID NO: 77 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values, and a second peptide comprising a sequence that is identical to SEQ ID NO: 96 by a percentage (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.), about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the second peptide comprises a percentage (e.g., 98% -100% identical to SEQ ID NO: 96 within a range defined by any two of the foregoing values. In some embodiments, the first peptide comprises the sequence of SEQ ID NO: 77 and the second peptide comprises the sequence of SEQ ID NO: 96. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
In some embodiments, the engineered gene effector comprises a first peptide comprising a sequence that is identical to SEQ ID NO: 77, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, a first peptide comprising a sequence that is identical to SEQ ID NO: 77, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 97%, 98%, 99%, or 100%, or optionally, a second peptide comprising a sequence that is identical to SEQ ID NO: 64, about, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, a second peptide comprising a sequence that is identical to SEQ ID NO: 64, within a range defined by any two of the foregoing values (e.g., 85% -100%, 90% -100%, 98% -100%, etc.). In some embodiments, the first peptide comprises the sequence of SEQ ID NO. 77 and the second peptide comprises the sequence of SEQ ID NO. 64. In some embodiments, the first peptide is linked to the second peptide through a linker. In some embodiments, the linker comprises the sequence of any one of SEQ ID NOS 2211-2221. In some embodiments, the linker comprises the sequence of SEQ ID NO 2211.
The first peptide and the second peptide in the engineered gene effector polypeptide may be linked to each other directly or indirectly (e.g., through a spacer or linker). In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, wherein the first peptide and the second peptide are linked by a spacer (e.g., a peptide spacer). Any suitable spacer may be used. In some embodiments, the spacer (e.g., peptide spacer) is a flexible linker having a sequence comprising fragments of glycine and serine residues. The small size of glycine and serine residues provides flexibility and allows fluidity of the linked functional domains. In some embodiments, the introduction of serine or threonine can maintain the stability of the spacer (e.g., peptide spacer) or linker in aqueous solution by forming hydrogen bonds with water molecules, thereby reducing adverse interactions between the spacer or linker and the attached moiety. In some embodiments, the flexible spacer or linker may also contain additional amino acids (e.g., threonine and alanine) to maintain flexibility, as well as polar amino acids (e.g., lysine and glutamine) to increase solubility. The rigid spacer or linker may have, for example, an alpha-helical structure. The alpha-helical stiff spacer or linker may act as a spacer between protein domains. Non-limiting examples of spacers or linkers include the sequences in table 5 and repeats thereof, e.g., 1, 2, 3, 4,5, 6, 7, 8, 9, or 10 repeats. SEQ ID NOS 2211-2217 provide flexible spacers or linkers or subunits thereof. SEQ ID NO 2218-2221 provides a rigid spacer or linker or subunit thereof. In some embodiments, the spacer (e.g., peptide spacer) or linker is SEQ ID NO: 2211 or comprises SEQ ID NO: 2211.
TABLE 5 linker amino acid sequence
SEQ ID NO: Sequence(s)
2211 GSGGSGGSGGSG
2212 GGGGS
2213 GGGS
2214 GG
2215 KESGSVSSEQLAQFRSLD
2216 EGKSSGSGSESKST
2217 GSAGSAAGSGEF
2218 EAAAK
2219 EAAAR
2220 PAPAP
2221 AEAAAKEAAAKA
In some embodiments, the length of a spacer (e.g., a peptide spacer) or linker disclosed herein can be, for example, 1, 2, 3, 4, 5, 6,7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 amino acid residues.
In some embodiments, a spacer (e.g., a peptide spacer) or linker disclosed herein can comprise at least 1, at least 2, at least 3, at least 5, at least 7, at least 9, at least 11, at least 13, at least 15, or at least 20 amino acids. In some embodiments, a linker may comprise up to 5, up to 7, up to 9, up to 11, up to 13, up to 15, up to 20, up to 25, up to 30, up to 40, or up to 50 amino acids.
In some embodiments, the spacer (e.g., peptide spacer) or linker comprises up to about 20, up to about 15, up to about 14, up to about 13, up to about 12, up to about 11, up to about 10, up to about 9, up to about 8, up to about 7, up to about 6, up to about 5, up to about 4, up to about 3, up to about 2, or about 1 glycine-serine (GS) linkers. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising up to about 20, up to about 15, up to about 14, up to about 13, up to about 12, up to about 11, up to about 10, up to about 9, up to about 8, up to about 7, up to about 6, up to about 5, up to about 4, up to about 3, up to about 2, or about 1 glycine-serine (GS) linkers. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising at least about 1, at least about 2, at least about 3, at least about 4, at least about 5, at least about 6, at least about 7, at least about 9, at least about 10, at least about 11, at least about 12, at least about 13, at least about 14, at least about 15, at least about 20, or more GS linkers.
In some embodiments, the spacer (e.g., peptide spacer) or linker comprises up to about 20, up to about 15, up to about 14, up to about 13, up to about 12, up to about 11, up to about 10, up to about 9, up to about 8, up to about 7, up to about 6, up to about 5, up to about 4, up to about 3, up to about 2, or about 1 glycine (G) linkers. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising at least about 1, at least about 2, at least about 3, at least about 4, at least about 5, at least about 6, at least about 7, at least about 9, at least about 10, at least about 11, at least about 12, at least about 13, at least about 14, at least about 15, at least about 20, or more G linkers. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising up to about 20, up to about 15, up to about 14, up to about 13, up to about 12, up to about 11, up to about 10, up to about 9, up to about 8, up to about 7, up to about 6, up to about 5, up to about 4, up to about 3, up to about 2, or about 1 glycine (G) linkers. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising at least about 1, at least about 2, at least about 3, at least about 4, at least about 5, at least about 6, at least about 7, at least about 9, at least about 10, at least about 11, at least about 12, at least about 13, at least about 14, at least about 15, at least about 20, or more G linkers.
In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising any one of SEQ ID NOs 2211-2221, or a sequence thereof having 1-3 mutations. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide linked by a spacer (e.g., a peptide spacer) or linker comprising the sequence of SEQ ID NO 2211, or a sequence having 1-3 mutations thereof.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, the first peptide being N-terminal to the second peptide, wherein the first peptide comprises any one of SEQ ID NOs 3-100 in length and the second peptide is 85 or 108 amino acids in length, the second peptide being heterologous to the first peptide and comprising any one of SEQ ID NOs 3-100, wherein the SEQ ID NOs of the first peptide and the second peptide are selected according to any one of the pairwise arrangements of SEQ ID NOs of the first peptide and the second peptide listed in table 4, wherein the first peptide and the second peptide are linked by a spacer (e.g., a peptide spacer) or linker (e.g., a peptide linker). In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide and a second peptide, the first peptide being N-terminal to the second peptide, wherein the first peptide comprises any one of SEQ ID NOs 3-33, 35-100 in length and the second peptide is 85 amino acids in length, the second peptide being heterologous to the first peptide and comprising any one of SEQ ID NOs 3-33, 35-100, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of table 4 in a pairwise arrangement, wherein the first and second peptides are linked by a spacer (e.g., a peptide spacer) or linker (e.g., a peptide linker). In some embodiments, the linker is selected from any one of SEQ ID NOS 2211-2221. In some embodiments, the spacer (e.g., peptide spacer) or linker is SEQ ID NO: 2211. In some embodiments, a non-peptide spacer or linker is used. The non-peptide spacer or linker may be, for example, a chemical linker. The two parts of the complex of the present disclosure may be joined by a chemical linker. Each chemical linker of the present disclosure may be alkylene, alkenylene, alkynylene, heteroalkylene, cycloalkylene, heterocycloalkylene, arylene, or heteroarylene, any of which may be optionally substituted. In some embodiments, the chemical linker of the present disclosure may be an ester, ether, amide, thioether, or polyethylene glycol (PEG). In some embodiments, the spacer or linker may reverse the order of amino acid sequences in the compound, e.g., such that the amino acid sequences linked by the linker are head-to-head, rather than head-to-tail. Non-limiting examples of such spacers or linkers include dicarboxylic acid diesters, such as oxalyl diester, malonyl diester, succinyl diester glutaryl diester, adipoyl diester, pimeloyl diester fumaroyl diester, maleoyl diester, phthaloyl diester, isophthaloyl diester, or terephthaloyl diester. Non-limiting examples of such spacers or linkers include dicarboxylic acid diamides such as oxalamide, malonyl diamide, succinyl diamide, glutaryl diamide, adipoyl diamide, pimeloyl diamide, fumaryl diamide, maleyl diamide, phthaloyl diamide, isophthaloyl diamide, or terephthaloyl diamide. Non-limiting examples of such spacers or linkers include diamides of diamino linkers such as ethylenediamine, 1, 2-di (methylamino) ethane, 1, 3-diaminopropane, 1, 3-di (methylamino) propane, 1, 4-di (methylamino) butane, 1, 5-di (methylamino) pentane, 1, 6-di (methylamino) hexane, and piperazine. Non-limiting examples of optional substituents include hydroxy, mercapto, halogen, amino, nitro, nitroso, cyano, azido, sulfoxide, sulfone, sulfonamide, carboxyl, carboxyaldehyde, imine, alkyl, haloalkyl, alkenyl, haloalkenyl, alkynyl, haloalkynyl, alkoxy, aryl, aryloxy, arylalkyl, arylalkoxy, heterocyclyl, acyl, acyloxy, carbamate, amide, ureido, epoxy, or ester.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or, optionally, a polypeptide comprising a sequence that has a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence having a mutation of 0,1, 2, or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, or optionally, the first peptide comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, having a mutation of 0, 1, 2, or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107. In some embodiments, the engineered gene effector consists of a polypeptide having any one of the sequences of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, or optionally, the first peptide comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequences of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, having 0, 1,2, or 3 amino acid residue mutations. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequences of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107. In some embodiments, the engineered gene effector consists of a polypeptide having the sequences of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107. In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is identical to SEQ ID No. 1085, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the first peptide comprises a sequence that is within a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) to any two of the foregoing defined values of SEQ ID No. 1085. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO: 1085 with a mutation of 0, 1,2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1085. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO: 1085.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID No. 122, or optionally, the first peptide comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to SEQ ID No. 122. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 122 having a mutation of 0, 1,2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 122. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO. 122.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is identical to SEQ ID No. 1084, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the first peptide comprises a sequence that is within a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) to any two of the foregoing defined values of SEQ ID No. 1084. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO: 1085 with a mutation of 0, 1, 2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1084. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO: 1084.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is identical to SEQ ID No. 653, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the first peptide comprises a sequence that is within a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to SEQ ID No. 653. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 653 with a mutation of 0, 1, 2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 653. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO. 653.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is identical to, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% of SEQ ID No. 1099, or optionally, the first peptide comprises a sequence that is within a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) to any two of the foregoing values of SEQ ID No. 1099. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1099 having a mutation of 0, 1, 2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1099. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO. 1099.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a sequence that is identical to, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% of SEQ ID No. 1107, or optionally, the first peptide comprises a sequence that has a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to SEQ ID No. 1107. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1107 with mutations of 0, 1, 2 or 3 amino acid residues. In some embodiments, the mutation is a conservative substitution. In some embodiments, the engineered gene effector comprises a polypeptide comprising the sequence of SEQ ID NO. 1107. In some embodiments, the engineered gene effector consists of a polypeptide having the sequence of SEQ ID NO: 1107.
In some embodiments, the engineered gene effector is capable of activating a target gene in a cell when expressed in the cell and effectively targeted to a site of the target gene (e.g., along with a heterologous endonuclease). In some embodiments, the target gene is endogenous to the cell. In some embodiments, the target gene is a silenced gene, optionally, wherein the silenced gene is a methylated gene. In some embodiments, the engineered gene effector is capable of repressing a target gene in a cell when expressed in the cell and effectively targeted to a site of the target gene.
The target gene may be any suitable gene. For example, without limitation, the target gene may be any one of the genes listed in table 6.
In some embodiments, the engineered gene effector (e.g., together with the heterologous endonuclease) is capable of increasing, about or increasing by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 250%, 300%, 400%, 500% or more the expression level of the target gene, or optionally, the engineered gene effector is capable of increasing the expression level of the target gene by a percentage (e.g., 10% -100%, 100% -200%, 200% -400%, 250% -500%, 10% -50% or 50% -100%, etc.) within a range defined by any two of the foregoing values. In some embodiments, the engineered gene effector (e.g., together with the heterologous endonuclease) is capable of increasing the expression level of the target gene by at least or about 10%, at least or about 20%, at least or about 30%, at least or about 40%, at least or about 50%, at least or about 60%, at least or about 70%, at least or about 80%, at least or about 90%, at least or about 100%, at least or about 200%, at least or about 250%, at least or about 300%, at least or about 400%, or at least or about 500%.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length, the second peptide being heterologous to the first peptide, wherein the engineered gene effector is capable of activating a synthetic reporter gene in a cell (e.g., along with a heterologous endonuclease described herein) at least or about 1, At least or about 2, at least or about 3, at least or about 4, at least or about 5, at least or about 6, at least or about 7, at least or about 8, at least or about 9, at least or about 10, at least or about 11, at least or about 12, at least or about 13, at least or about 14, at least or about 15, or at least or about 16, optionally, activating (e.g., as a log2 fold change) within a range defined by any two of the foregoing values (e.g., about 1 to about 5, about 1 to about 3, about 3 to about 5, about 2 to about 4, about 1 to about 15, about 1 to about 10, about 5 to about 10, about 10 to about 15). In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-95 amino acids in length and a second peptide that is 75-95 amino acids in length, the second peptide being heterologous to the first peptide, wherein the engineered gene effector is capable of activating a synthetic reporter gene in a cell (e.g., along with a heterologous endonuclease described herein) at least or about 1, At least or about 2, at least or about 3, at least or about 4, at least or about 5, at least or about 6, at least or about 7, at least or about 8, at least or about 9, at least or about 10, at least or about 11, at least or about 12, at least or about 13, at least or about 14, at least or about 15, or at least or about 16, optionally, activating (e.g., as a log2 fold change) within a range defined by any two of the foregoing values (e.g., about 1 to about 5, about 1 to about 3, about 3 to about 5, about 2 to about 4, about 1 to about 15, about 1 to about 10, about 5 to about 10, about 10 to about 15). In some embodiments, the engineered gene effector comprises a first peptide that is 85 or 108 amino acids in length and a second peptide that is 85 or 108 amino acids in length, wherein the sequences of the first peptide and the second peptide are selected according to any one of the pairwise permutations of SEQ ID NOs of the first peptide and the second peptide listed in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 1 (e.g., expressed as a log2 fold change), wherein the combination of the SEQ ID NOs of the engineered gene effector first peptide and the second peptide in table 9 is determined in table 4 based on the barcodes, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcodes. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 2 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 3 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 4 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 5 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 6 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 7 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 8 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 9 that activates a synthetic reporter gene (e.g., with a heterologous endonuclease described herein) in a cell at an activation level of at least 9 (e.g., expressed as a log2 fold change), wherein the combination of the first peptide and the second peptide of the engineered gene effector in table 9 is determined in table 4 based on a barcode, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on the combination determined in table 4 based on the barcode. In some embodiments, the first peptide and the second peptide are linked by a spacer (e.g., a peptide spacer such as, but not limited to, a spacer of any of SEQ ID NOS: 2211-2221, optionally a spacer of SEQ ID NO: 2211). In some embodiments, the engineered gene effector having any of the above-described activation levels of a synthetic reporter gene in a cell is selected from the group consisting of :SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451.
In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length, the second peptide being heterologous to the first peptide, wherein the engineered gene effector is capable of activating an endogenous CD45 gene in a cell (e.g., along with a heterologous endonuclease described herein) at least or about 1, At least or about 2, at least or about 3, at least or about 4, at least or about 5, at least or about 6, at least or about 7, at least or about 8, at least or about 9, at least or about 10, at least or about 11, at least or about 12, at least or about 13, at least or about 14, at least or about 15, or at least or about 16, optionally, activating (e.g., as a log2 fold change) within a range defined by any two of the foregoing values (e.g., about 1 to about 5, about 1 to about 3, about 3 to about 5, about 2 to about 4, about 1 to about 15, about 1 to about 10, about 5 to about 10, about 10 to about 15). In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-95 amino acids in length and a second peptide that is 75-95 amino acids in length, the second peptide being heterologous to the first peptide, wherein the engineered gene effector is capable of activating an endogenous CD45 gene in a cell (e.g., along with a heterologous endonuclease described herein) at least or about 1, At least or about 2, at least or about 3, at least or about 4, at least or about 5, at least or about 6, at least or about 7, at least or about 8, at least or about 9, at least or about 10, at least or about 11, at least or about 12, at least or about 13, at least or about 14, at least or about 15, or at least or about 16, optionally, activating (e.g., as a log2 fold change) within a range defined by any two of the foregoing values (e.g., about 1 to about 5, about 1 to about 3, about 3 to about 5, about 2 to about 4, about 1 to about 15, about 1 to about 10, about 5 to about 10, about 10 to about 15). In some embodiments, the engineered gene effector comprises a first peptide that is 85 or 108 amino acids in length and a second peptide that is 85 or 108 amino acids in length, wherein the sequences of the first peptide and the second peptide are selected according to any one of the pairwise permutations of SEQ ID NOs of the first peptide and the second peptide listed in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 1 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 2 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 3 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 4 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 5 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 6 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 7 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 8 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level (e.g., expressed as a log2 fold change) of at least 9, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level (e.g., expressed as a log2 fold change) of at least 10, optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the engineered gene effector comprises a polypeptide comprising a first peptide that is 75-110 amino acids in length and a second peptide that is 75-110 amino acids in length and that is heterologous to the first peptide, wherein the first peptide and the second peptide are any one of the engineered gene effectors provided in table 10 that activates an endogenous CD45 gene (e.g., along with a heterologous endonuclease described herein) in a cell at an activation level of at least 11 (e.g., expressed as a log2 fold change), optionally wherein the engineered gene effector comprises a peptide amino acid sequence based on a combination of barcodes as determined in table 4. In some embodiments, the first peptide and the second peptide are linked by a spacer (e.g., a peptide spacer such as, but not limited to, a spacer of any of SEQ ID NOS: 2211-2221, optionally a spacer of SEQ ID NO: 2211). In some embodiments, the engineered gene effector having any of the above-described activation levels of the endogenous CD45 gene in the cell is selected from the group consisting of :SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451.
TABLE 4 combined peptide amino acid sequences
Bar code Combination peptide SEQ ID NO Combined peptide amino acid sequences First peptide SEQ ID NO Linker SEQ ID NO Second peptide SEQ ID NO
Stop-1_A/u splice_ GAGATTGTGTCC _b 101 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 1 2211 64
Stop-1_A/u splice_ ACTTGATGGTTT _b 102 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 1 2211 77
Stop-1_A/u splice_ CTACGTGGCCCC _b 103 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 1 2211 86
Stop-1_A/u splice_ ATGACCCCTTGT _b 104 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 1 2211 17
Stop-1_A/u splice_ AGAGTGCGCTCT _b 105 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 1 2211 14
Stop-1_A/u splice_ ATAACTCCACGC _b 106 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 1 2211 44
Stop-1_A/u splice_ CCCTATGTTCTA _b 107 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 1 2211 15
Stop 2_A splice ATGACCCCTTGT B 108 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 2 2211 17
Stop 2_A splice ATAACTCCACGC B 109 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 2 2211 44
Stop 2_A splice ACTTGATGGTTT B 110 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 2 2211 77
Stop 2_A splice AGAGTGCGCTCT B 111 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 2 2211 14
Stop 2_A splice CTACGTGGCCCC B 112 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 2 2211 86
Stop 2_A splice GAGATTGTGTCC B 113 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 2 2211 64
Stop 2_A splice CCCTATGTTCTA B 114 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLEGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 2 2211 15
CAGTGACATAGT _A_Concatenation ACTTGATGGTTT _B 115 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 3 2211 77
CAGTGACATAGT _A_Concatenation AGAGTGCGCTCT _B 116 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 3 2211 14
CAGTGACATAGT _A_Concatenation CTACGTGGCCCC _B 117 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 3 2211 86
CAGTGACATAGT _A_Concatenation ATAACTCCACGC _B 118 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 3 2211 44
CAGTGACATAGT _A_Concatenation GAGATTGTGTCC _B 119 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 3 2211 64
CAGTGACATAGT _A_Concatenation ATGACCCCTTGT _B 120 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 3 2211 17
CAGTGACATAGT _A_Concatenation CCCTATGTTCTA _B 121 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGATGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 3 2211 15
CTCACGACAAGA _A_Concatenation ATAACTCCACGC _B 122 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 4 2211 44
CTCACGACAAGA _A_Concatenation CTACGTGGCCCC _B 123 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 4 2211 86
CTCACGACAAGA _A_Concatenation ATGACCCCTTGT _B 124 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 4 2211 17
CTCACGACAAGA _A_Concatenation GAGATTGTGTCC _B 125 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 4 2211 64
CTCACGACAAGA _A_Concatenation ACTTGATGGTTT _B 126 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 4 2211 77
CTCACGACAAGA _A_Concatenation AGAGTGCGCTCT _B 127 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 4 2211 14
CTCACGACAAGA _A_Concatenation CCCTATGTTCTA _B 128 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 4 2211 15
CATTAGTCCCGC _A_Concatenation CTACGTGGCCCC _B 129 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 5 2211 86
CATTAGTCCCGC _A_Concatenation ATAACTCCACGC _B 130 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 5 2211 44
CATTAGTCCCGC _A_Concatenation AGAGTGCGCTCT _B 131 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 5 2211 14
CATTAGTCCCGC _A_Concatenation ATGACCCCTTGT _B 132 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 5 2211 17
CATTAGTCCCGC _A_Concatenation GAGATTGTGTCC _B 133 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 5 2211 64
CATTAGTCCCGC _A_Concatenation ACTTGATGGTTT _B 134 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 5 2211 77
CATTAGTCCCGC _A_Concatenation CCCTATGTTCTA _B 135 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEFGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 5 2211 15
CTCGCACCGAGG _A_Concatenation ACTTGATGGTTT _B 136 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 6 2211 77
CTCGCACCGAGG _A_Concatenation ATGACCCCTTGT _B 137 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 6 2211 17
CTCGCACCGAGG _A_Concatenation ATAACTCCACGC _B 138 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 6 2211 44
CTCGCACCGAGG _A_Concatenation GAGATTGTGTCC _B 139 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 6 2211 64
CTCGCACCGAGG _A_Concatenation AGAGTGCGCTCT _B 140 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 6 2211 14
CTCGCACCGAGG _A_Concatenation CTACGTGGCCCC _B 141 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 6 2211 86
CTCGCACCGAGG _A_Concatenation CCCTATGTTCTA _B 142 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNNGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 6 2211 15
AAACTTTGAGCG _A_Concatenation ATGACCCCTTGT _B 143 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 7 2211 17
AAACTTTGAGCG _A_Concatenation CTACGTGGCCCC _B 144 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 7 2211 86
AAACTTTGAGCG _A_Concatenation GAGATTGTGTCC _B 145 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 7 2211 64
AAACTTTGAGCG _A_Concatenation ACTTGATGGTTT _B 146 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 7 2211 77
AAACTTTGAGCG _A_Concatenation AGAGTGCGCTCT _B 147 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 7 2211 14
AAACTTTGAGCG _A_Concatenation ATAACTCCACGC _B 148 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 7 2211 44
AAACTTTGAGCG _A_Concatenation CCCTATGTTCTA _B 149 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 7 2211 15
CATTGTTTAAAT _A_Concatenation CTACGTGGCCCC _B 150 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 8 2211 86
CATTGTTTAAAT _A_Concatenation GAGATTGTGTCC _B 151 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 8 2211 64
CATTGTTTAAAT _A_Concatenation AGAGTGCGCTCT _B 152 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 8 2211 14
CATTGTTTAAAT _A_Concatenation ATAACTCCACGC _B 153 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 8 2211 44
CATTGTTTAAAT _A_Concatenation ACTTGATGGTTT _B 154 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 8 2211 77
CATTGTTTAAAT _A_Concatenation ATGACCCCTTGT _B 155 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 8 2211 17
CATTGTTTAAAT _A_Concatenation CCCTATGTTCTA _B 156 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 8 2211 15
CTCGTTATGGCA _A_Concatenation AGAGTGCGCTCT _B 157 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 9 2211 14
CTCGTTATGGCA _A_Concatenation ATAACTCCACGC _B 158 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 9 2211 44
CTCGTTATGGCA _A_Concatenation CTACGTGGCCCC _B 159 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 9 2211 86
CTCGTTATGGCA _A_Concatenation ATGACCCCTTGT _B 160 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 9 2211 17
CTCGTTATGGCA _A_Concatenation GAGATTGTGTCC _B 161 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 9 2211 64
CTCGTTATGGCA _A_Concatenation ACTTGATGGTTT _B 162 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 9 2211 77
CTCGTTATGGCA _A_Concatenation CCCTATGTTCTA _B 163 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTFGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 9 2211 15
CCCGTAGGGGCT _A_Concatenation CTACGTGGCCCC _B 164 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 10 2211 86
CCCGTAGGGGCT _A_Concatenation ATGACCCCTTGT _B 165 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 10 2211 17
CCCGTAGGGGCT _A_Concatenation ACTTGATGGTTT _B 166 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 10 2211 77
CCCGTAGGGGCT _A_Concatenation AGAGTGCGCTCT _B 167 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 10 2211 14
CCCGTAGGGGCT _A_Concatenation GAGATTGTGTCC _B 168 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 10 2211 64
CCCGTAGGGGCT _A_Concatenation ATAACTCCACGC _B 169 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 10 2211 44
CCCGTAGGGGCT _A_Concatenation CCCTATGTTCTA _B 170 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 10 2211 15
TCTTGGTTTTGA _A_Concatenation ATGACCCCTTGT _B 171 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 11 2211 17
TCTTGGTTTTGA _A_Concatenation GAGATTGTGTCC _B 172 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 11 2211 64
TCTTGGTTTTGA _A_Concatenation ACTTGATGGTTT _B 173 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 11 2211 77
TCTTGGTTTTGA _A_Concatenation ATAACTCCACGC _B 174 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 11 2211 44
TCTTGGTTTTGA _A_Concatenation CTACGTGGCCCC _B 175 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 11 2211 86
TCTTGGTTTTGA _A_Concatenation AGAGTGCGCTCT _B 176 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 11 2211 14
TCTTGGTTTTGA _A_Concatenation CCCTATGTTCTA _B 177 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 11 2211 15
CATCTCACCTAG _A_Concatenation ACTTGATGGTTT _B 178 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 12 2211 77
CATCTCACCTAG _A_Concatenation GAGATTGTGTCC _B 179 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 12 2211 64
CATCTCACCTAG _A_Concatenation AGAGTGCGCTCT _B 180 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 12 2211 14
CATCTCACCTAG _A_Concatenation CTACGTGGCCCC _B 181 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 12 2211 86
CATCTCACCTAG _A_Concatenation ATGACCCCTTGT _B 182 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 12 2211 17
CATCTCACCTAG _A_Concatenation ATAACTCCACGC _B 183 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 12 2211 44
CATCTCACCTAG _A_Concatenation CCCTATGTTCTA _B 184 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEAGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 12 2211 15
TAAGATTCAACG _A_Concatenation ATGACCCCTTGT _B 185 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 13 2211 17
TAAGATTCAACG _A_Concatenation GAGATTGTGTCC _B 186 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 13 2211 64
TAAGATTCAACG _A_Concatenation CTACGTGGCCCC _B 187 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 13 2211 86
TAAGATTCAACG _A_Concatenation ATAACTCCACGC _B 188 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 13 2211 44
TAAGATTCAACG _A_Concatenation AGAGTGCGCTCT _B 189 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 13 2211 14
TAAGATTCAACG _A_Concatenation ACTTGATGGTTT _B 190 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 13 2211 77
TAAGATTCAACG _A_Concatenation CCCTATGTTCTA _B 191 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 13 2211 15
AGAGTGCGCTCT _A_Concatenation CCGCCCTTATGT _B 192 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 14 2211 98
AGAGTGCGCTCT _A_Concatenation CTCCATTAATGA _B 193 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 14 2211 28
AGAGTGCGCTCT _A_Concatenation TGCCAGACGGTC _B 194 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 14 2211 75
AGAGTGCGCTCT _A_Concatenation EPICXV.47_B 195 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 14 2211 34
AGAGTGCGCTCT _A_Concatenation AGCCAGGTCACC _B 196 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 14 2211 39
AGAGTGCGCTCT _A_Concatenation GACTGTGGGCGG _B 197 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 14 2211 89
AGAGTGCGCTCT _A_Concatenation GAGTGACCTATT _B 198 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 14 2211 41
AGAGTGCGCTCT _A_Concatenation CATTAGTCCCGC _B 199 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 14 2211 5
AGAGTGCGCTCT _A_Concatenation TCTTGGTTTTGA _B 200 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 14 2211 11
AGAGTGCGCTCT _A_Concatenation ATGACAAAACGA _B 201 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 14 2211 40
AGAGTGCGCTCT _A_Concatenation TGATGAGCGCCC _B 202 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 14 2211 68
AGAGTGCGCTCT _A_Concatenation ATTTGTAGACCG _B 203 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 14 2211 66
AGAGTGCGCTCT _A_Concatenation TAAGATTCAACG _B 204 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 14 2211 13
AGAGTGCGCTCT _A_Concatenation CTCGCACCGAGG _B 205 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 14 2211 6
AGAGTGCGCTCT _A_Concatenation GCAACGAGGGTC _B 206 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 14 2211 74
AGAGTGCGCTCT _A_Concatenation CATCTCACCTAG _B 207 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 14 2211 12
AGAGTGCGCTCT _A_Concatenation ACTCGTGTTGTC _B 208 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 14 2211 91
AGAGTGCGCTCT _A_Concatenation AAGGGTAAATTG _B 209 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 14 2211 56
AGAGTGCGCTCT _A_Concatenation CTGCCGGTTGCG _B 210 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 14 2211 45
AGAGTGCGCTCT _A_Concatenation TATTAAAAGTGC _B 211 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 14 2211 53
AGAGTGCGCTCT _A_Concatenation ACAACGTGCATA _B 212 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 14 2211 71
AGAGTGCGCTCT _A_Concatenation CGATGGCGGAAT _B 213 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 14 2211 19
AGAGTGCGCTCT _A_Concatenation GTACATACCGAA _B 214 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 14 2211 55
AGAGTGCGCTCT _A_Concatenation CCCTATGTTCTA _B 215 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 14 2211 15
AGAGTGCGCTCT _A_Concatenation CATTGTTTAAAT _B 216 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 14 2211 8
AGAGTGCGCTCT _A_Concatenation TTAAATGAGGGC _B 217 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 14 2211 46
AGAGTGCGCTCT _A_Concatenation CACACCCGGCAG _B 218 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 14 2211 95
AGAGTGCGCTCT _A_Concatenation CCAGACGGTCTG _B 219 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 14 2211 59
AGAGTGCGCTCT _A_Concatenation CCCCTTAGGAAT _B 220 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 14 2211 80
AGAGTGCGCTCT _A_Concatenation CTCAGCGATATA _B 221 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 14 2211 27
AGAGTGCGCTCT _A_Concatenation AGCCGTGCTTGA _B 222 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 14 2211 32
AGAGTGCGCTCT _A_Concatenation CAGTGACATAGT _B 223 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 14 2211 3
AGAGTGCGCTCT _A_Concatenation GCAGATTCGAAT _B 224 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 14 2211 43
AGAGTGCGCTCT _A_Concatenation CATGCTAACACC _B 225 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 14 2211 88
AGAGTGCGCTCT _A_Concatenation AATCGAGACCAG _B 226 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 14 2211 24
AGAGTGCGCTCT _A_Concatenation GTTCCACGCCTG _B 227 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 14 2211 97
AGAGTGCGCTCT _A_Concatenation ATTGTATCTAAC _B 228 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 14 2211 92
AGAGTGCGCTCT _A_Concatenation GGCACAGCTCCA _B 229 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 14 2211 87
AGAGTGCGCTCT _A_Concatenation linker 2_B 230 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 14 2211 37
AGAGTGCGCTCT _A_Concatenation CAACGCATGCCT _B 231 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 14 2211 100
AGAGTGCGCTCT _A_Concatenation GGAGGCACATCG _B 232 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 14 2211 76
AGAGTGCGCTCT _A_Concatenation linker 3_B 233 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 14 2211 36
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B 234 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 14 2211 14
AGAGTGCGCTCT _A_Concatenation ACTTTAGTAACA _B 235 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 14 2211 57
AGAGTGCGCTCT _A_Concatenation ATGTGGCACGAC _B 236 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 14 2211 69
AGAGTGCGCTCT _A_Concatenation CTCACGACAAGA _B 237 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 14 2211 4
AGAGTGCGCTCT _A_Concatenation ACCAGTATTTAA _B 238 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 14 2211 54
AGAGTGCGCTCT _A_Concatenation TTTAACGGATTG _B 239 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 14 2211 81
AGAGTGCGCTCT _A_Concatenation linker 1_B 240 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 14 2211 35
AGAGTGCGCTCT _A_Concatenation AAGACCGGTGCC _B 241 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 14 2211 73
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B 242 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 14 2211 17
AGAGTGCGCTCT _A_Concatenation GAGACAGCTCTC _B 243 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 14 2211 48
AGAGTGCGCTCT _A_Concatenation TATCGCAAGAAC _B 244 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 14 2211 63
AGAGTGCGCTCT _A_Concatenation CGGCAATTGCTT _B 245 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 14 2211 82
AGAGTGCGCTCT _A_Concatenation ACGACTCACCGC _B 246 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 14 2211 60
AGAGTGCGCTCT _A_Concatenation AGGCTTCCCGCT _B 247 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 14 2211 93
AGAGTGCGCTCT _A_Concatenation TAAAATTTATCA _B 248 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 14 2211 49
AGAGTGCGCTCT _A_Concatenation TTCTAATTTCCT _B 249 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 14 2211 42
AGAGTGCGCTCT _A_Concatenation AAAACAAGCATT _B 250 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 14 2211 84
AGAGTGCGCTCT _A_Concatenation CGCCGGCATACA _B 251 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 14 2211 16
AGAGTGCGCTCT _A_Concatenation TCTTAGAGGGTG _B 252 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 14 2211 29
AGAGTGCGCTCT _A_Concatenation CTGGTAGTTGTT _B 253 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 14 2211 83
AGAGTGCGCTCT _A_Concatenation TACATGAGGCAC _B 254 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 14 2211 65
AGAGTGCGCTCT _A_Concatenation GGCACTGTCGAG _B 255 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 14 2211 99
AGAGTGCGCTCT _A_Concatenation GCGAGGCAGACC _B 256 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 14 2211 70
AGAGTGCGCTCT _A_Concatenation ATCTCTCAGTTC _B 257 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 14 2211 94
AGAGTGCGCTCT _A_Concatenation CTCGTTATGGCA _B 258 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 14 2211 9
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B 259 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 14 2211 77
AGAGTGCGCTCT _A_Concatenation AATCAACGAGCA _B 260 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 14 2211 79
AGAGTGCGCTCT _A_Concatenation TATCTATCGTGT _B 261 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 14 2211 78
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B 262 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 14 2211 64
AGAGTGCGCTCT _A_Concatenation AGCCATAAACCT _B 263 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 14 2211 61
AGAGTGCGCTCT _A_Concatenation TGGCCCCCAGTT _B 264 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 14 2211 31
AGAGTGCGCTCT _A_Concatenation TCTGTACCAACG _B 265 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 14 2211 25
AGAGTGCGCTCT _A_Concatenation CCCTGCCTCGGG _B 266 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 14 2211 26
AGAGTGCGCTCT _A_Concatenation AATCGATCCTAC _B 267 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 14 2211 96
AGAGTGCGCTCT _A_Concatenation AGTTGCTTTGCT _B 268 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 14 2211 20
AGAGTGCGCTCT _A_Concatenation AGTTAGGCTCTT _B 269 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 14 2211 21
AGAGTGCGCTCT _A_Concatenation GATCGCATGTTC _B 270 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 14 2211 58
AGAGTGCGCTCT _A_Concatenation TCAGCCGTAGGC _B 271 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 14 2211 72
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B 272 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 14 2211 44
AGAGTGCGCTCT _A_Concatenation AAGCAACGATAT _B 273 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 14 2211 38
AGAGTGCGCTCT _A_Concatenation AAACTTTGAGCG _B 274 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 14 2211 7
AGAGTGCGCTCT _A_Concatenation Stop_1_B 275 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 14 2211 1
AGAGTGCGCTCT _A_Concatenation GATGGAGCTACA _B 276 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 14 2211 23
AGAGTGCGCTCT _A_Concatenation AATGGCACTAGC _B 277 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 14 2211 22
AGAGTGCGCTCT _A_Concatenation ACGGCGTAAGTA _B 278 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 14 2211 52
AGAGTGCGCTCT _A_Concatenation ATCCAATCTGTG _B 279 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 14 2211 67
AGAGTGCGCTCT _A_Concatenation CATCTGAGTCGT _B 280 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 14 2211 33
AGAGTGCGCTCT _A_Concatenation CCCGTAGGGGCT _B 281 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 14 2211 10
AGAGTGCGCTCT _A_Concatenation CCTAAATGCAAG _B 282 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 14 2211 85
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B 283 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 14 2211 86
AGAGTGCGCTCT _A_Concatenation GGCCTTTCGGGC _B 284 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 14 2211 30
AGAGTGCGCTCT _A_Concatenation GGGAGTATGTCG _B 285 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 14 2211 47
AGAGTGCGCTCT _A_Concatenation GGTGTCTCGATT _B 286 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 14 2211 51
AGAGTGCGCTCT _A_Concatenation GTCACCCTCCTT _B 287 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 14 2211 18
AGAGTGCGCTCT _A_Concatenation Stop_2_B 288 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 14 2211 2
AGAGTGCGCTCT _A_Concatenation TACTTCTAGCGG _B 289 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 14 2211 90
AGAGTGCGCTCT _A_Concatenation TCTATCACAACT _B 290 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 14 2211 50
AGAGTGCGCTCT _A_Concatenation TTAGTGCCATAC _B 291 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKPGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 14 2211 62
CCCTATGTTCTA _A_Concatenation ATAACTCCACGC _B 292 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 15 2211 44
CCCTATGTTCTA _A_Concatenation CTACGTGGCCCC _B 293 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 15 2211 86
CCCTATGTTCTA _A_Concatenation ACTTGATGGTTT _B 294 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 15 2211 77
CCCTATGTTCTA _A_Concatenation ATGACCCCTTGT _B 295 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 15 2211 17
CCCTATGTTCTA _A_Concatenation GAGATTGTGTCC _B 296 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 15 2211 64
CCCTATGTTCTA _A_Concatenation AGAGTGCGCTCT _B 297 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 15 2211 14
CCCTATGTTCTA _A_Concatenation AAAACAAGCATT _B 298 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 15 2211 84
CCCTATGTTCTA _A_Concatenation AAACTTTGAGCG _B 299 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 15 2211 7
CCCTATGTTCTA _A_Concatenation AAGACCGGTGCC _B 300 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 15 2211 73
CCCTATGTTCTA _A_Concatenation AAGCAACGATAT _B 301 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 15 2211 38
CCCTATGTTCTA _A_Concatenation AAGGGTAAATTG _B 302 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 15 2211 56
CCCTATGTTCTA _A_Concatenation AATCAACGAGCA _B 303 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 15 2211 79
CCCTATGTTCTA _A_Concatenation AATCGAGACCAG _B 304 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 15 2211 24
CCCTATGTTCTA _A_Concatenation AATCGATCCTAC _B 305 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 15 2211 96
CCCTATGTTCTA _A_Concatenation AATGGCACTAGC _B 306 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 15 2211 22
CCCTATGTTCTA _A_Concatenation ACAACGTGCATA _B 307 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 15 2211 71
CCCTATGTTCTA _A_Concatenation ACCAGTATTTAA _B 308 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 15 2211 54
CCCTATGTTCTA _A_Concatenation ACGACTCACCGC _B 309 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 15 2211 60
CCCTATGTTCTA _A_Concatenation ACGGCGTAAGTA _B 310 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 15 2211 52
CCCTATGTTCTA _A_Concatenation ACTCGTGTTGTC _B 311 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 15 2211 91
CCCTATGTTCTA _A_Concatenation ACTTTAGTAACA _B 312 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 15 2211 57
CCCTATGTTCTA _A_Concatenation AGCCAGGTCACC _B 313 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 15 2211 39
CCCTATGTTCTA _A_Concatenation AGCCATAAACCT _B 314 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 15 2211 61
CCCTATGTTCTA _A_Concatenation AGCCGTGCTTGA _B 315 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 15 2211 32
CCCTATGTTCTA _A_Concatenation AGGCTTCCCGCT _B 316 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 15 2211 93
CCCTATGTTCTA _A_Concatenation AGTTAGGCTCTT _B 317 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 15 2211 21
CCCTATGTTCTA _A_Concatenation AGTTGCTTTGCT _B 318 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 15 2211 20
CCCTATGTTCTA _A_Concatenation ATCCAATCTGTG _B 319 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 15 2211 67
CCCTATGTTCTA _A_Concatenation ATCTCTCAGTTC _B 320 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 15 2211 94
CCCTATGTTCTA _A_Concatenation ATGACAAAACGA _B 321 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 15 2211 40
CCCTATGTTCTA _A_Concatenation ATGTGGCACGAC _B 322 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 15 2211 69
CCCTATGTTCTA _A_Concatenation ATTGTATCTAAC _B 323 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 15 2211 92
CCCTATGTTCTA _A_Concatenation ATTTGTAGACCG _B 324 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 15 2211 66
CCCTATGTTCTA _A_Concatenation CAACGCATGCCT _B 325 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 15 2211 100
CCCTATGTTCTA _A_Concatenation CACACCCGGCAG _B 326 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 15 2211 95
CCCTATGTTCTA _A_Concatenation CAGTGACATAGT _B 327 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 15 2211 3
CCCTATGTTCTA _A_Concatenation CATCTCACCTAG _B 328 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 15 2211 12
CCCTATGTTCTA _A_Concatenation CATCTGAGTCGT _B 329 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 15 2211 33
CCCTATGTTCTA _A_Concatenation CATGCTAACACC _B 330 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 15 2211 88
CCCTATGTTCTA _A_Concatenation CATTAGTCCCGC _B 331 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 15 2211 5
CCCTATGTTCTA _A_Concatenation CATTGTTTAAAT _B 332 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 15 2211 8
CCCTATGTTCTA _A_Concatenation CCAGACGGTCTG _B 333 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 15 2211 59
CCCTATGTTCTA _A_Concatenation CCCCTTAGGAAT _B 334 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 15 2211 80
CCCTATGTTCTA _A_Concatenation CCCGTAGGGGCT _B 335 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 15 2211 10
CCCTATGTTCTA _A_Concatenation CCCTATGTTCTA _B 336 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 15 2211 15
CCCTATGTTCTA _A_Concatenation CCCTGCCTCGGG _B 337 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 15 2211 26
CCCTATGTTCTA _A_Concatenation CCGCCCTTATGT _B 338 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 15 2211 98
CCCTATGTTCTA _A_Concatenation CCTAAATGCAAG _B 339 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 15 2211 85
CCCTATGTTCTA _A_Concatenation CGATGGCGGAAT _B 340 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 15 2211 19
CCCTATGTTCTA _A_Concatenation CGCCGGCATACA _B 341 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 15 2211 16
CCCTATGTTCTA _A_Concatenation CGGCAATTGCTT _B 342 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 15 2211 82
CCCTATGTTCTA _A_Concatenation CTCACGACAAGA _B 343 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 15 2211 4
CCCTATGTTCTA _A_Concatenation CTCAGCGATATA _B 344 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 15 2211 27
CCCTATGTTCTA _A_Concatenation CTCCATTAATGA _B 345 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 15 2211 28
CCCTATGTTCTA _A_Concatenation CTCGCACCGAGG _B 346 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 15 2211 6
CCCTATGTTCTA _A_Concatenation CTCGTTATGGCA _B 347 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 15 2211 9
CCCTATGTTCTA _A_Concatenation CTGCCGGTTGCG _B 348 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 15 2211 45
CCCTATGTTCTA _A_Concatenation CTGGTAGTTGTT _B 349 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 15 2211 83
CCCTATGTTCTA _A_Concatenation EPICXV.47_B 350 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 15 2211 34
CCCTATGTTCTA _A_Concatenation GACTGTGGGCGG _B 351 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 15 2211 89
CCCTATGTTCTA _A_Concatenation GAGACAGCTCTC _B 352 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 15 2211 48
CCCTATGTTCTA _A_Concatenation GAGTGACCTATT _B 353 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 15 2211 41
CCCTATGTTCTA _A_Concatenation GATCGCATGTTC _B 354 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 15 2211 58
CCCTATGTTCTA _A_Concatenation GATGGAGCTACA _B 355 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 15 2211 23
CCCTATGTTCTA _A_Concatenation GCAACGAGGGTC _B 356 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 15 2211 74
CCCTATGTTCTA _A_Concatenation GCAGATTCGAAT _B 357 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 15 2211 43
CCCTATGTTCTA _A_Concatenation GCGAGGCAGACC _B 358 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 15 2211 70
CCCTATGTTCTA _A_Concatenation GGAGGCACATCG _B 359 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 15 2211 76
CCCTATGTTCTA _A_Concatenation GGCACAGCTCCA _B 360 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 15 2211 87
CCCTATGTTCTA _A_Concatenation GGCACTGTCGAG _B 361 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 15 2211 99
CCCTATGTTCTA _A_Concatenation GGCCTTTCGGGC _B 362 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 15 2211 30
CCCTATGTTCTA _A_Concatenation GGGAGTATGTCG _B 363 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 15 2211 47
CCCTATGTTCTA _A_Concatenation GGTGTCTCGATT _B 364 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 15 2211 51
CCCTATGTTCTA _A_Concatenation GTACATACCGAA _B 365 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 15 2211 55
CCCTATGTTCTA _A_Concatenation GTCACCCTCCTT _B 366 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 15 2211 18
CCCTATGTTCTA _A_Concatenation GTTCCACGCCTG _B 367 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 15 2211 97
CCCTATGTTCTA _A_Concatenation Stop_1_B 368 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 15 2211 1
CCCTATGTTCTA _A_Concatenation Stop_2_B 369 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 15 2211 2
CCCTATGTTCTA _A_Concatenation TAAAATTTATCA _B 370 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 15 2211 49
CCCTATGTTCTA _A_Concatenation TAAGATTCAACG _B 371 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 15 2211 13
CCCTATGTTCTA _A_Concatenation TACATGAGGCAC _B 372 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 15 2211 65
CCCTATGTTCTA _A_Concatenation TACTTCTAGCGG _B 373 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 15 2211 90
CCCTATGTTCTA _A_Concatenation TATCGCAAGAAC _B 374 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 15 2211 63
CCCTATGTTCTA _A_Concatenation TATCTATCGTGT _B 375 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 15 2211 78
CCCTATGTTCTA _A_Concatenation TATTAAAAGTGC _B 376 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 15 2211 53
CCCTATGTTCTA _A_Concatenation TCAGCCGTAGGC _B 377 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 15 2211 72
CCCTATGTTCTA _A_Concatenation TCTATCACAACT _B 378 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 15 2211 50
CCCTATGTTCTA _A_Concatenation TCTGTACCAACG _B 379 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 15 2211 25
CCCTATGTTCTA _A_Concatenation TCTTAGAGGGTG _B 380 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 15 2211 29
CCCTATGTTCTA _A_Concatenation TCTTGGTTTTGA _B 381 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 15 2211 11
CCCTATGTTCTA _A_Concatenation TGATGAGCGCCC _B 382 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 15 2211 68
CCCTATGTTCTA _A_Concatenation TGCCAGACGGTC _B 383 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 15 2211 75
CCCTATGTTCTA _A_Concatenation TGGCCCCCAGTT _B 384 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 15 2211 31
CCCTATGTTCTA _A_Concatenation TTAAATGAGGGC _B 385 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 15 2211 46
CCCTATGTTCTA _A_Concatenation TTAGTGCCATAC _B 386 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 15 2211 62
CCCTATGTTCTA _A_Concatenation TTCTAATTTCCT _B 387 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 15 2211 42
CCCTATGTTCTA _A_Concatenation TTTAACGGATTG _B 388 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 15 2211 81
CCCTATGTTCTA _A_Concatenation linker 1_B 389 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 15 2211 35
CCCTATGTTCTA _A_Concatenation linker 2_B 390 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 15 2211 37
CCCTATGTTCTA _A_Concatenation linker 3_B 391 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 15 2211 36
CGCCGGCATACA _A_Concatenation CTACGTGGCCCC _B 392 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 16 2211 86
CGCCGGCATACA _A_Concatenation ACTTGATGGTTT _B 393 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 16 2211 77
CGCCGGCATACA _A_Concatenation GAGATTGTGTCC _B 394 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 16 2211 64
CGCCGGCATACA _A_Concatenation AGAGTGCGCTCT _B 395 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 16 2211 14
CGCCGGCATACA _A_Concatenation ATGACCCCTTGT _B 396 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 16 2211 17
CGCCGGCATACA _A_Concatenation ATAACTCCACGC _B 397 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 16 2211 44
CGCCGGCATACA _A_Concatenation CCCTATGTTCTA _B 398 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPIGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 16 2211 15
ATGACCCCTTGT _A_Concatenation AAGACCGGTGCC _B 399 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 17 2211 73
ATGACCCCTTGT _A_Concatenation ATGTGGCACGAC _B 400 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 17 2211 69
ATGACCCCTTGT _A_Concatenation GATGGAGCTACA _B 401 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 17 2211 23
ATGACCCCTTGT _A_Concatenation GTTCCACGCCTG _B 402 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 17 2211 97
ATGACCCCTTGT _A_Concatenation ACAACGTGCATA _B 403 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 17 2211 71
ATGACCCCTTGT _A_Concatenation CGGCAATTGCTT _B 404 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 17 2211 82
ATGACCCCTTGT _A_Concatenation TGATGAGCGCCC _B 405 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 17 2211 68
ATGACCCCTTGT _A_Concatenation EPICXV.47_B 406 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 17 2211 34
ATGACCCCTTGT _A_Concatenation CAGTGACATAGT _B 407 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 17 2211 3
ATGACCCCTTGT _A_Concatenation CTCACGACAAGA _B 408 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 17 2211 4
ATGACCCCTTGT _A_Concatenation TAAAATTTATCA _B 409 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 17 2211 49
ATGACCCCTTGT _A_Concatenation GGCACAGCTCCA _B 410 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 17 2211 87
ATGACCCCTTGT _A_Concatenation TCAGCCGTAGGC _B 411 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 17 2211 72
ATGACCCCTTGT _A_Concatenation CATTGTTTAAAT _B 412 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 17 2211 8
ATGACCCCTTGT _A_Concatenation TGGCCCCCAGTT _B 413 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 17 2211 31
ATGACCCCTTGT _A_Concatenation ACTTTAGTAACA _B 414 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 17 2211 57
ATGACCCCTTGT _A_Concatenation CATCTCACCTAG _B 415 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 17 2211 12
ATGACCCCTTGT _A_Concatenation TACATGAGGCAC _B 416 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 17 2211 65
ATGACCCCTTGT _A_Concatenation GACTGTGGGCGG _B 417 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 17 2211 89
ATGACCCCTTGT _A_Concatenation TACTTCTAGCGG _B 418 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 17 2211 90
ATGACCCCTTGT _A_Concatenation ATTGTATCTAAC _B 419 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 17 2211 92
ATGACCCCTTGT _A_Concatenation GGGAGTATGTCG _B 420 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 17 2211 47
ATGACCCCTTGT _A_Concatenation TTCTAATTTCCT _B 421 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 17 2211 42
ATGACCCCTTGT _A_Concatenation CCAGACGGTCTG _B 422 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 17 2211 59
ATGACCCCTTGT _A_Concatenation CTCGCACCGAGG _B 423 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 17 2211 6
ATGACCCCTTGT _A_Concatenation ACGGCGTAAGTA _B 424 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 17 2211 52
ATGACCCCTTGT _A_Concatenation TCTTAGAGGGTG _B 425 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 17 2211 29
ATGACCCCTTGT _A_Concatenation CGATGGCGGAAT _B 426 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 17 2211 19
ATGACCCCTTGT _A_Concatenation CTGCCGGTTGCG _B 427 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 17 2211 45
ATGACCCCTTGT _A_Concatenation GTACATACCGAA _B 428 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 17 2211 55
ATGACCCCTTGT _A_Concatenation GCAACGAGGGTC _B 429 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 17 2211 74
ATGACCCCTTGT _A_Concatenation ATGACAAAACGA _B 430 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 17 2211 40
ATGACCCCTTGT _A_Concatenation GATCGCATGTTC _B 431 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 17 2211 58
ATGACCCCTTGT _A_Concatenation GCGAGGCAGACC _B 432 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 17 2211 70
ATGACCCCTTGT _A_Concatenation TTAAATGAGGGC _B 433 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 17 2211 46
ATGACCCCTTGT _A_Concatenation ACTCGTGTTGTC _B 434 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 17 2211 90
ATGACCCCTTGT _A_Concatenation AAGGGTAAATTG _B 435 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 17 2211 56
ATGACCCCTTGT _A_Concatenation AATGGCACTAGC _B 436 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 17 2211 22
ATGACCCCTTGT _A_Concatenation CATTAGTCCCGC _B 437 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 17 2211 5
ATGACCCCTTGT _A_Concatenation AATCAACGAGCA _B 438 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 17 2211 79
ATGACCCCTTGT _A_Concatenation CGCCGGCATACA _B 439 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 17 2211 16
ATGACCCCTTGT _A_Concatenation TATTAAAAGTGC _B 440 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 17 2211 53
ATGACCCCTTGT _A_Concatenation AATCGAGACCAG _B 441 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 17 2211 24
ATGACCCCTTGT _A_Concatenation CCCTGCCTCGGG _B 442 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 17 2211 26
ATGACCCCTTGT _A_Concatenation CTCGTTATGGCA _B 443 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 17 2211 9
ATGACCCCTTGT _A_Concatenation TAAGATTCAACG _B 444 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 17 2211 13
ATGACCCCTTGT _A_Concatenation ACGACTCACCGC _B 445 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 17 2211 60
ATGACCCCTTGT _A_Concatenation Stop_1_B 446 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 17 2211 1
ATGACCCCTTGT _A_Concatenation Stop_2_B 447 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 17 2211 2
ATGACCCCTTGT _A_Concatenation AGCCGTGCTTGA _B 448 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 17 2211 32
ATGACCCCTTGT _A_Concatenation GGAGGCACATCG _B 449 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 17 2211 76
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B 450 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 17 2211 77
ATGACCCCTTGT _A_Concatenation ACCAGTATTTAA _B 451 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 17 2211 54
ATGACCCCTTGT _A_Concatenation AGCCATAAACCT _B 452 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 17 2211 61
ATGACCCCTTGT _A_Concatenation ATTTGTAGACCG _B 453 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 17 2211 66
ATGACCCCTTGT _A_Concatenation CATGCTAACACC _B 454 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 17 2211 88
ATGACCCCTTGT _A_Concatenation TATCGCAAGAAC _B 455 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 17 2211 63
ATGACCCCTTGT _A_Concatenation AAGCAACGATAT _B 456 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 17 2211 38
ATGACCCCTTGT _A_Concatenation linker 3_B 457 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 17 2211 36
ATGACCCCTTGT _A_Concatenation CTCCATTAATGA _B 458 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 17 2211 28
ATGACCCCTTGT _A_Concatenation CTGGTAGTTGTT _B 459 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 17 2211 83
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B 460 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 17 2211 14
ATGACCCCTTGT _A_Concatenation AGTTGCTTTGCT _B 461 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 17 2211 20
ATGACCCCTTGT _A_Concatenation CTCAGCGATATA _B 462 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 17 2211 27
ATGACCCCTTGT _A_Concatenation linker 2_B 463 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 17 2211 37
ATGACCCCTTGT _A_Concatenation linker 1_B 464 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 17 2211 35
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B 465 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 17 2211 64
ATGACCCCTTGT _A_Concatenation TGCCAGACGGTC _B 466 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 17 2211 75
ATGACCCCTTGT _A_Concatenation GAGTGACCTATT _B 467 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 17 2211 41
ATGACCCCTTGT _A_Concatenation AATCGATCCTAC _B 468 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 17 2211 96
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B 469 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 17 2211 17
ATGACCCCTTGT _A_Concatenation TCTTGGTTTTGA _B 470 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 17 2211 11
ATGACCCCTTGT _A_Concatenation TATCTATCGTGT _B 471 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 17 2211 78
ATGACCCCTTGT _A_Concatenation TTTAACGGATTG _B 472 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 17 2211 81
ATGACCCCTTGT _A_Concatenation CACACCCGGCAG _B 473 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 17 2211 95
ATGACCCCTTGT _A_Concatenation TCTGTACCAACG _B 474 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 17 2211 25
ATGACCCCTTGT _A_Concatenation GCAGATTCGAAT _B 475 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 17 2211 43
ATGACCCCTTGT _A_Concatenation CAACGCATGCCT _B 476 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 17 2211 100
ATGACCCCTTGT _A_Concatenation AGTTAGGCTCTT _B 477 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 17 2211 21
ATGACCCCTTGT _A_Concatenation CCCTATGTTCTA _B 478 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 17 2211 15
ATGACCCCTTGT _A_Concatenation CCCCTTAGGAAT _B 479 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 17 2211 80
ATGACCCCTTGT _A_Concatenation AAAACAAGCATT _B 480 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 17 2211 84
ATGACCCCTTGT _A_Concatenation AAACTTTGAGCG _B 481 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 17 2211 7
ATGACCCCTTGT _A_Concatenation AGCCAGGTCACC _B 482 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 17 2211 39
ATGACCCCTTGT _A_Concatenation AGGCTTCCCGCT _B 483 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 17 2211 93
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B 484 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 17 2211 44
ATGACCCCTTGT _A_Concatenation ATCCAATCTGTG _B 485 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 17 2211 67
ATGACCCCTTGT _A_Concatenation ATCTCTCAGTTC _B 486 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 17 2211 94
ATGACCCCTTGT _A_Concatenation CATCTGAGTCGT _B 487 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 17 2211 33
ATGACCCCTTGT _A_Concatenation CCCGTAGGGGCT _B 488 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 17 2211 10
ATGACCCCTTGT _A_Concatenation CCGCCCTTATGT _B 489 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 17 2211 98
ATGACCCCTTGT _A_Concatenation CCTAAATGCAAG _B 490 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 17 2211 85
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B 491 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 17 2211 86
ATGACCCCTTGT _A_Concatenation GAGACAGCTCTC _B 492 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 17 2211 48
ATGACCCCTTGT _A_Concatenation GGCACTGTCGAG _B 493 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 17 2211 99
ATGACCCCTTGT _A_Concatenation GGCCTTTCGGGC _B 494 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 17 2211 30
ATGACCCCTTGT _A_Concatenation GGTGTCTCGATT _B 495 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 17 2211 51
ATGACCCCTTGT _A_Concatenation GTCACCCTCCTT _B 496 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 17 2211 18
ATGACCCCTTGT _A_Concatenation TCTATCACAACT _B 497 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 17 2211 50
ATGACCCCTTGT _A_Concatenation TTAGTGCCATAC _B 498 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSVGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 17 2211 62
GTCACCCTCCTT _A_Concatenation AGAGTGCGCTCT _B 499 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 18 2211 14
GTCACCCTCCTT _A_Concatenation ACTTGATGGTTT _B 500 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 18 2211 77
GTCACCCTCCTT _A_Concatenation ATGACCCCTTGT _B 501 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 18 2211 17
GTCACCCTCCTT _A_Concatenation ATAACTCCACGC _B 502 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 18 2211 44
GTCACCCTCCTT _A_Concatenation CTACGTGGCCCC _B 503 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 18 2211 86
GTCACCCTCCTT _A_Concatenation GAGATTGTGTCC _B 504 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 18 2211 64
GTCACCCTCCTT _A_Concatenation CCCTATGTTCTA _B 505 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 18 2211 15
CGATGGCGGAAT _A_Concatenation AGAGTGCGCTCT _B 506 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 19 2211 14
CGATGGCGGAAT _A_Concatenation ACTTGATGGTTT _B 507 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 19 2211 77
CGATGGCGGAAT _A_Concatenation ATAACTCCACGC _B 508 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 19 2211 44
CGATGGCGGAAT _A_Concatenation ATGACCCCTTGT _B 509 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 19 2211 17
CGATGGCGGAAT _A_Concatenation CTACGTGGCCCC _B 510 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 19 2211 86
CGATGGCGGAAT _A_Concatenation GAGATTGTGTCC _B 511 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 19 2211 64
CGATGGCGGAAT _A_Concatenation CCCTATGTTCTA _B 512 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLGGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 19 2211 15
AGTTGCTTTGCT _A_Concatenation CTACGTGGCCCC _B 513 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 20 2211 86
AGTTGCTTTGCT _A_Concatenation ATAACTCCACGC _B 514 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 20 2211 44
AGTTGCTTTGCT _A_Concatenation GAGATTGTGTCC _B 515 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 20 2211 64
AGTTGCTTTGCT _A_Concatenation AGAGTGCGCTCT _B 516 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 20 2211 14
AGTTGCTTTGCT _A_Concatenation ACTTGATGGTTT _B 517 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 20 2211 77
AGTTGCTTTGCT _A_Concatenation ATGACCCCTTGT _B 518 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 20 2211 17
AGTTGCTTTGCT _A_Concatenation CCCTATGTTCTA _B 519 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLMGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 20 2211 15
AGTTAGGCTCTT _A_Concatenation ACTTGATGGTTT _B 520 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 21 2211 77
AGTTAGGCTCTT _A_Concatenation AGAGTGCGCTCT _B 521 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 21 2211 14
AGTTAGGCTCTT _A_Concatenation GAGATTGTGTCC _B 522 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 21 2211 64
AGTTAGGCTCTT _A_Concatenation ATAACTCCACGC _B 523 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 21 2211 44
AGTTAGGCTCTT _A_Concatenation CTACGTGGCCCC _B 524 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 21 2211 86
AGTTAGGCTCTT _A_Concatenation ATGACCCCTTGT _B 525 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 21 2211 17
AGTTAGGCTCTT _A_Concatenation CCCTATGTTCTA _B 526 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 21 2211 15
AATGGCACTAGC _A_Concatenation GAGATTGTGTCC _B 527 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 22 2211 64
AATGGCACTAGC _A_Concatenation ATAACTCCACGC _B 528 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 22 2211 44
AATGGCACTAGC _A_Concatenation ATGACCCCTTGT _B 529 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 22 2211 17
AATGGCACTAGC _A_Concatenation ACTTGATGGTTT _B 530 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 22 2211 77
AATGGCACTAGC _A_Concatenation AGAGTGCGCTCT _B 531 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 22 2211 14
AATGGCACTAGC _A_Concatenation CTACGTGGCCCC _B 532 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 22 2211 86
AATGGCACTAGC _A_Concatenation CCCTATGTTCTA _B 533 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYARGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 22 2211 15
GATGGAGCTACA _A_Concatenation CTACGTGGCCCC _B 534 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 23 2211 86
GATGGAGCTACA _A_Concatenation AGAGTGCGCTCT _B 535 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 23 2211 14
GATGGAGCTACA _A_Concatenation ACTTGATGGTTT _B 536 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 23 2211 77
GATGGAGCTACA _A_Concatenation GAGATTGTGTCC _B 537 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 23 2211 64
GATGGAGCTACA _A_Concatenation ATGACCCCTTGT _B 538 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 23 2211 17
GATGGAGCTACA _A_Concatenation ATAACTCCACGC _B 539 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 23 2211 44
GATGGAGCTACA _A_Concatenation CCCTATGTTCTA _B 540 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLFGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 23 2211 15
AATCGAGACCAG _A_Concatenation CTACGTGGCCCC _B 541 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 24 2211 86
AATCGAGACCAG _A_Concatenation AGAGTGCGCTCT _B 542 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 24 2211 14
AATCGAGACCAG _A_Concatenation ATGACCCCTTGT _B 543 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 24 2211 17
AATCGAGACCAG _A_Concatenation GAGATTGTGTCC _B 544 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 24 2211 64
AATCGAGACCAG _A_Concatenation ATAACTCCACGC _B 545 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 24 2211 44
AATCGAGACCAG _A_Concatenation ACTTGATGGTTT _B 546 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 24 2211 77
AATCGAGACCAG _A_Concatenation CCCTATGTTCTA _B 547 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEHGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 24 2211 15
TCTGTACCAACG _A_Concatenation AGAGTGCGCTCT _B 548 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 25 2211 14
TCTGTACCAACG _A_Concatenation ACTTGATGGTTT _B 549 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 25 2211 77
TCTGTACCAACG _A_Concatenation GAGATTGTGTCC _B 550 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 25 2211 64
TCTGTACCAACG _A_Concatenation ATGACCCCTTGT _B 551 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 25 2211 17
TCTGTACCAACG _A_Concatenation ATAACTCCACGC _B 552 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 25 2211 44
TCTGTACCAACG _A_Concatenation CTACGTGGCCCC _B 553 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 25 2211 86
TCTGTACCAACG _A_Concatenation CCCTATGTTCTA _B 554 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSYGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 25 2211 15
CCCTGCCTCGGG _A_Concatenation AGAGTGCGCTCT _B 555 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 26 2211 14
CCCTGCCTCGGG _A_Concatenation CTACGTGGCCCC _B 556 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 26 2211 86
CCCTGCCTCGGG _A_Concatenation GAGATTGTGTCC _B 557 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 26 2211 64
CCCTGCCTCGGG _A_Concatenation ATGACCCCTTGT _B 558 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 26 2211 17
CCCTGCCTCGGG _A_Concatenation ACTTGATGGTTT _B 559 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 26 2211 77
CCCTGCCTCGGG _A_Concatenation ATAACTCCACGC _B 560 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 26 2211 44
CCCTGCCTCGGG _A_Concatenation CCCTATGTTCTA _B 561 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIYGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 26 2211 15
CTCAGCGATATA _A_Concatenation GAGATTGTGTCC _B 562 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 27 2211 64
CTCAGCGATATA _A_Concatenation AGAGTGCGCTCT _B 563 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 27 2211 14
CTCAGCGATATA _A_Concatenation CTACGTGGCCCC _B 564 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 27 2211 86
CTCAGCGATATA _A_Concatenation ACTTGATGGTTT _B 565 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 27 2211 77
CTCAGCGATATA _A_Concatenation ATAACTCCACGC _B 566 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 27 2211 44
CTCAGCGATATA _A_Concatenation ATGACCCCTTGT _B 567 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 27 2211 17
CTCAGCGATATA _A_Concatenation CCCTATGTTCTA _B 568 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGGGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 27 2211 15
CTCCATTAATGA _A_Concatenation ATAACTCCACGC _B 569 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 28 2211 44
CTCCATTAATGA _A_Concatenation ACTTGATGGTTT _B 570 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 28 2211 77
CTCCATTAATGA _A_Concatenation GAGATTGTGTCC _B 571 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 28 2211 64
CTCCATTAATGA _A_Concatenation AGAGTGCGCTCT _B 572 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 28 2211 14
CTCCATTAATGA _A_Concatenation ATGACCCCTTGT _B 573 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 28 2211 17
CTCCATTAATGA _A_Concatenation CTACGTGGCCCC _B 574 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 28 2211 86
CTCCATTAATGA _A_Concatenation CCCTATGTTCTA _B 575 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 28 2211 15
TCTTAGAGGGTG _A_Concatenation ATGACCCCTTGT _B 576 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 29 2211 17
TCTTAGAGGGTG _A_Concatenation AGAGTGCGCTCT _B 577 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 29 2211 14
TCTTAGAGGGTG _A_Concatenation CTACGTGGCCCC _B 578 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 29 2211 86
TCTTAGAGGGTG _A_Concatenation GAGATTGTGTCC _B 579 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 29 2211 64
TCTTAGAGGGTG _A_Concatenation ATAACTCCACGC _B 580 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 29 2211 44
TCTTAGAGGGTG _A_Concatenation ACTTGATGGTTT _B 581 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 29 2211 77
TCTTAGAGGGTG _A_Concatenation CCCTATGTTCTA _B 582 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGALGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 29 2211 15
GGCCTTTCGGGC _A_Concatenation GAGATTGTGTCC _B 583 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 30 2211 64
GGCCTTTCGGGC _A_Concatenation ACTTGATGGTTT _B 584 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 30 2211 77
GGCCTTTCGGGC _A_Concatenation ATAACTCCACGC _B 585 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 30 2211 44
GGCCTTTCGGGC _A_Concatenation ATGACCCCTTGT _B 586 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 30 2211 17
GGCCTTTCGGGC _A_Concatenation CTACGTGGCCCC _B 587 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 30 2211 86
GGCCTTTCGGGC _A_Concatenation AGAGTGCGCTCT _B 588 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 30 2211 14
GGCCTTTCGGGC _A_Concatenation CCCTATGTTCTA _B 589 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 30 2211 15
TGGCCCCCAGTT _A_Concatenation CTACGTGGCCCC _B 590 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 31 2211 86
TGGCCCCCAGTT _A_Concatenation ATGACCCCTTGT _B 591 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 31 2211 17
TGGCCCCCAGTT _A_Concatenation GAGATTGTGTCC _B 592 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 31 2211 64
TGGCCCCCAGTT _A_Concatenation ACTTGATGGTTT _B 593 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 31 2211 77
TGGCCCCCAGTT _A_Concatenation AGAGTGCGCTCT _B 594 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 31 2211 14
TGGCCCCCAGTT _A_Concatenation ATAACTCCACGC _B 595 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 31 2211 44
TGGCCCCCAGTT _A_Concatenation CCCTATGTTCTA _B 596 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQINGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 31 2211 15
AGCCGTGCTTGA _A_Concatenation GAGATTGTGTCC _B 597 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 32 2211 64
AGCCGTGCTTGA _A_Concatenation ATGACCCCTTGT _B 598 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 32 2211 17
AGCCGTGCTTGA _A_Concatenation AGAGTGCGCTCT _B 599 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 32 2211 14
AGCCGTGCTTGA _A_Concatenation CTACGTGGCCCC _B 600 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 32 2211 86
AGCCGTGCTTGA _A_Concatenation ACTTGATGGTTT _B 601 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 32 2211 77
AGCCGTGCTTGA _A_Concatenation ATAACTCCACGC _B 602 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 32 2211 44
AGCCGTGCTTGA _A_Concatenation CCCTATGTTCTA _B 603 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIYGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 32 2211 15
CATCTGAGTCGT _A_Concatenation CTACGTGGCCCC _B 604 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 33 2211 86
CATCTGAGTCGT _A_Concatenation GAGATTGTGTCC _B 605 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 33 2211 64
CATCTGAGTCGT _A_Concatenation ACTTGATGGTTT _B 606 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 33 2211 77
CATCTGAGTCGT _A_Concatenation ATGACCCCTTGT _B 607 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 33 2211 17
CATCTGAGTCGT _A_Concatenation ATAACTCCACGC _B 608 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 33 2211 44
CATCTGAGTCGT _A_Concatenation AGAGTGCGCTCT _B 609 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 33 2211 14
CATCTGAGTCGT _A_Concatenation CCCTATGTTCTA _B 610 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 33 2211 15
EPICXV.47-A\u splice_ CTACGTGGCCCC _b 611 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 34 2211 86
EPICXV.47-A\u splice_ ATAACTCCACGC _b 612 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 34 2211 44
EPICXV.47-A\u splice_ AGAGTGCGCTCT _b 613 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 34 2211 14
EPICXV.47-A\u splice_ ATGACCCCTTGT _b 614 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 34 2211 17
EPICXV.47-A\u splice_ GAGATTGTGTCC _b 615 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 34 2211 64
EPICXV.47-A\u splice_ ACTTGATGGTTT _b 616 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 34 2211 77
EPICXV.47-A\u splice_ CCCTATGTTCTA _b 617 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 34 2211 15
Linker_1_A/u splice_ CTACGTGGCCCC _b 618 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 35 2211 86
Linker_1_A/u splice_ ATGACCCCTTGT _b 619 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 35 2211 17
Linker_1_A/u splice_ ACTTGATGGTTT _b 620 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 35 2211 77
Linker_1_A/u splice_ AGAGTGCGCTCT _b 621 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 35 2211 14
Linker_1_A/u splice_ GAGATTGTGTCC _b 622 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 35 2211 64
Linker_1_A/u splice_ ATAACTCCACGC _b 623 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 35 2211 44
Linker_1_A/u splice_ CCCTATGTTCTA _b 624 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 35 2211 15
Linker_ 3_A/u splice_ ATGACCCCTTGT _b 625 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 36 2211 17
Linker_ 3_A/u splice_ AGAGTGCGCTCT _b 626 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 36 2211 14
Linker_ 3_A/u splice_ GAGATTGTGTCC _b 627 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 36 2211 64
Linker_ 3_A/u splice_ ATAACTCCACGC _b 628 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 36 2211 44
Linker_ 3_A/u splice_ ACTTGATGGTTT _b 629 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 36 2211 77
Linker_ 3_A/u splice_ CTACGTGGCCCC _b 630 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 36 2211 86
Linker_ 3_A/u splice_ CCCTATGTTCTA _b 631 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKDGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 36 2211 15
Linker_ 2_A/u splice_ ATAACTCCACGC _b 632 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 37 2211 44
Linker_ 2_A/u splice_ ATGACCCCTTGT _b 633 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 37 2211 17
Linker_ 2_A/u splice_ ACTTGATGGTTT _b 634 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 37 2211 77
Linker_ 2_A/u splice_ AGAGTGCGCTCT _b 635 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 37 2211 14
Linker_ 2_A/u splice_ CTACGTGGCCCC _b 636 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 37 2211 86
Linker_ 2_A/u splice_ GAGATTGTGTCC _b 637 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 37 2211 64
Linker_ 2_A/u splice_ CCCTATGTTCTA _b 638 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPAGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 37 2211 15
AAGCAACGATAT _A_Concatenation ACTTGATGGTTT _B 639 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 38 2211 77
AAGCAACGATAT _A_Concatenation CTACGTGGCCCC _B 640 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 38 2211 86
AAGCAACGATAT _A_Concatenation AGAGTGCGCTCT _B 641 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 38 2211 14
AAGCAACGATAT _A_Concatenation ATAACTCCACGC _B 642 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 38 2211 44
AAGCAACGATAT _A_Concatenation ATGACCCCTTGT _B 643 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 38 2211 17
AAGCAACGATAT _A_Concatenation GAGATTGTGTCC _B 644 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 38 2211 64
AAGCAACGATAT _A_Concatenation CCCTATGTTCTA _B 645 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 38 2211 15
AGCCAGGTCACC _A_Concatenation GAGATTGTGTCC _B 646 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 39 2211 64
AGCCAGGTCACC _A_Concatenation ACTTGATGGTTT _B 647 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 39 2211 77
AGCCAGGTCACC _A_Concatenation CTACGTGGCCCC _B 648 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 39 2211 86
AGCCAGGTCACC _A_Concatenation ATAACTCCACGC _B 649 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 39 2211 44
AGCCAGGTCACC _A_Concatenation AGAGTGCGCTCT _B 650 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 39 2211 14
AGCCAGGTCACC _A_Concatenation ATGACCCCTTGT _B 651 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 39 2211 17
AGCCAGGTCACC _A_Concatenation CCCTATGTTCTA _B 652 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 39 2211 15
ATGACAAAACGA _A_Concatenation ATGACCCCTTGT _B 653 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 40 2211 17
ATGACAAAACGA _A_Concatenation ACTTGATGGTTT _B 654 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 40 2211 77
ATGACAAAACGA _A_Concatenation GAGATTGTGTCC _B 655 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 40 2211 64
ATGACAAAACGA _A_Concatenation AGAGTGCGCTCT _B 656 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 40 2211 14
ATGACAAAACGA _A_Concatenation ATAACTCCACGC _B 657 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 40 2211 44
ATGACAAAACGA _A_Concatenation CTACGTGGCCCC _B 658 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 40 2211 86
ATGACAAAACGA _A_Concatenation CCCTATGTTCTA _B 659 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 40 2211 15
GAGTGACCTATT _A_Concatenation ACTTGATGGTTT _B 660 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 41 2211 77
GAGTGACCTATT _A_Concatenation CTACGTGGCCCC _B 661 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 41 2211 86
GAGTGACCTATT _A_Concatenation GAGATTGTGTCC _B 662 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 41 2211 64
GAGTGACCTATT _A_Concatenation ATGACCCCTTGT _B 663 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 41 2211 17
GAGTGACCTATT _A_Concatenation AGAGTGCGCTCT _B 664 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 41 2211 14
GAGTGACCTATT _A_Concatenation ATAACTCCACGC _B 665 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 41 2211 44
GAGTGACCTATT _A_Concatenation CCCTATGTTCTA _B 666 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLIGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 41 2211 15
TTCTAATTTCCT _A_Concatenation CTACGTGGCCCC _B 667 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 42 2211 86
TTCTAATTTCCT _A_Concatenation AGAGTGCGCTCT _B 668 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 42 2211 14
TTCTAATTTCCT _A_Concatenation GAGATTGTGTCC _B 669 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 42 2211 64
TTCTAATTTCCT _A_Concatenation ATGACCCCTTGT _B 670 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 42 2211 17
TTCTAATTTCCT _A_Concatenation ATAACTCCACGC _B 671 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 42 2211 44
TTCTAATTTCCT _A_Concatenation ACTTGATGGTTT _B 672 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 42 2211 77
TTCTAATTTCCT _A_Concatenation CCCTATGTTCTA _B 673 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 42 2211 15
GCAGATTCGAAT _A_Concatenation ATGACCCCTTGT _B 674 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 43 2211 17
GCAGATTCGAAT _A_Concatenation AGAGTGCGCTCT _B 675 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 43 2211 14
GCAGATTCGAAT _A_Concatenation GAGATTGTGTCC _B 676 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 43 2211 64
GCAGATTCGAAT _A_Concatenation ACTTGATGGTTT _B 677 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 43 2211 77
GCAGATTCGAAT _A_Concatenation ATAACTCCACGC _B 678 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 43 2211 44
GCAGATTCGAAT _A_Concatenation CTACGTGGCCCC _B 679 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 43 2211 86
GCAGATTCGAAT _A_Concatenation CCCTATGTTCTA _B 680 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 43 2211 15
ATAACTCCACGC _A_Concatenation GGCACAGCTCCA _B 681 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 44 2211 87
ATAACTCCACGC _A_Concatenation CCGCCCTTATGT _B 682 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 44 2211 98
ATAACTCCACGC _A_Concatenation CAACGCATGCCT _B 683 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 44 2211 100
ATAACTCCACGC _A_Concatenation TATTAAAAGTGC _B 684 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 44 2211 53
ATAACTCCACGC _A_Concatenation ATGACAAAACGA _B 685 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 44 2211 40
ATAACTCCACGC _A_Concatenation ATCTCTCAGTTC _B 686 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 44 2211 94
ATAACTCCACGC _A_Concatenation ACCAGTATTTAA _B 687 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 44 2211 54
ATAACTCCACGC _A_Concatenation ATCCAATCTGTG _B 688 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 44 2211 67
ATAACTCCACGC _A_Concatenation ATGTGGCACGAC _B 689 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 44 2211 69
ATAACTCCACGC _A_Concatenation CATTAGTCCCGC _B 690 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 44 2211 5
ATAACTCCACGC _A_Concatenation TCTGTACCAACG _B 691 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 44 2211 25
ATAACTCCACGC _A_Concatenation AAGCAACGATAT _B 692 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 44 2211 38
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B 693 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 44 2211 14
ATAACTCCACGC _A_Concatenation linker 3_B 694 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 44 2211 36
ATAACTCCACGC _A_Concatenation CTCGTTATGGCA _B 695 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 44 2211 9
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B 696 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 44 2211 44
ATAACTCCACGC _A_Concatenation TAAAATTTATCA _B 697 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 44 2211 49
ATAACTCCACGC _A_Concatenation CGGCAATTGCTT _B 698 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 44 2211 82
ATAACTCCACGC _A_Concatenation CTCGCACCGAGG _B 699 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 44 2211 6
ATAACTCCACGC _A_Concatenation CATCTCACCTAG _B 700 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 44 2211 12
ATAACTCCACGC _A_Concatenation CGCCGGCATACA _B 701 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 44 2211 16
ATAACTCCACGC _A_Concatenation ATTTGTAGACCG _B 702 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 44 2211 66
ATAACTCCACGC _A_Concatenation CATCTGAGTCGT _B 703 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 44 2211 33
ATAACTCCACGC _A_Concatenation CTCCATTAATGA _B 704 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 44 2211 28
ATAACTCCACGC _A_Concatenation TCTTAGAGGGTG _B 705 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 44 2211 29
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B 706 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 44 2211 77
ATAACTCCACGC _A_Concatenation ATTGTATCTAAC _B 707 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 44 2211 92
ATAACTCCACGC _A_Concatenation CCAGACGGTCTG _B 708 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 44 2211 59
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B 709 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 44 2211 64
ATAACTCCACGC _A_Concatenation GCAACGAGGGTC _B 710 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 44 2211 74
ATAACTCCACGC _A_Concatenation AAGACCGGTGCC _B 711 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 44 2211 73
ATAACTCCACGC _A_Concatenation GGCACTGTCGAG _B 712 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 44 2211 99
ATAACTCCACGC _A_Concatenation AGCCGTGCTTGA _B 713 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 44 2211 32
ATAACTCCACGC _A_Concatenation AGCCATAAACCT _B 714 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 44 2211 61
ATAACTCCACGC _A_Concatenation TACATGAGGCAC _B 715 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 44 2211 65
ATAACTCCACGC _A_Concatenation TGGCCCCCAGTT _B 716 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 44 2211 31
ATAACTCCACGC _A_Concatenation CATGCTAACACC _B 717 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 44 2211 88
ATAACTCCACGC _A_Concatenation EPICXV.47_B 718 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 44 2211 34
ATAACTCCACGC _A_Concatenation ACGACTCACCGC _B 719 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 44 2211 60
ATAACTCCACGC _A_Concatenation CGATGGCGGAAT _B 720 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 44 2211 19
ATAACTCCACGC _A_Concatenation AAAACAAGCATT _B 721 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 44 2211 84
ATAACTCCACGC _A_Concatenation CCCCTTAGGAAT _B 722 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 44 2211 80
ATAACTCCACGC _A_Concatenation TGCCAGACGGTC _B 723 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 44 2211 75
ATAACTCCACGC _A_Concatenation AATCAACGAGCA _B 724 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 44 2211 79
ATAACTCCACGC _A_Concatenation AATGGCACTAGC _B 725 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 44 2211 22
ATAACTCCACGC _A_Concatenation GCAGATTCGAAT _B 726 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 44 2211 43
ATAACTCCACGC _A_Concatenation AGTTAGGCTCTT _B 727 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 44 2211 21
ATAACTCCACGC _A_Concatenation TATCGCAAGAAC _B 728 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 44 2211 63
ATAACTCCACGC _A_Concatenation ACAACGTGCATA _B 729 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 44 2211 71
ATAACTCCACGC _A_Concatenation GAGTGACCTATT _B 730 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 44 2211 41
ATAACTCCACGC _A_Concatenation ACTTTAGTAACA _B 731 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 44 2211 57
ATAACTCCACGC _A_Concatenation CACACCCGGCAG _B 732 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 44 2211 95
ATAACTCCACGC _A_Concatenation linker 1_B 733 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 44 2211 35
ATAACTCCACGC _A_Concatenation AAACTTTGAGCG _B 734 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 44 2211 7
ATAACTCCACGC _A_Concatenation GCGAGGCAGACC _B 735 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 44 2211 70
ATAACTCCACGC _A_Concatenation TAAGATTCAACG _B 736 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 44 2211 13
ATAACTCCACGC _A_Concatenation CCTAAATGCAAG _B 737 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 44 2211 85
ATAACTCCACGC _A_Concatenation CAGTGACATAGT _B 738 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 44 2211 3
ATAACTCCACGC _A_Concatenation CCCTGCCTCGGG _B 739 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 44 2211 26
ATAACTCCACGC _A_Concatenation AGTTGCTTTGCT _B 740 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 44 2211 20
ATAACTCCACGC _A_Concatenation TCAGCCGTAGGC _B 741 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 44 2211 72
ATAACTCCACGC _A_Concatenation GATGGAGCTACA _B 742 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 44 2211 23
ATAACTCCACGC _A_Concatenation CCCTATGTTCTA _B 743 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 44 2211 15
ATAACTCCACGC _A_Concatenation ACGGCGTAAGTA _B 744 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 44 2211 52
ATAACTCCACGC _A_Concatenation AATCGAGACCAG _B 745 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 44 2211 24
ATAACTCCACGC _A_Concatenation CTCAGCGATATA _B 746 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 44 2211 27
ATAACTCCACGC _A_Concatenation Stop_1_B 747 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 44 2211 1
ATAACTCCACGC _A_Concatenation GGGAGTATGTCG _B 748 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 44 2211 47
ATAACTCCACGC _A_Concatenation TCTTGGTTTTGA _B 749 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 44 2211 11
ATAACTCCACGC _A_Concatenation GTTCCACGCCTG _B 750 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 44 2211 97
ATAACTCCACGC _A_Concatenation CTCACGACAAGA _B 751 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 44 2211 4
ATAACTCCACGC _A_Concatenation TTAAATGAGGGC _B 752 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 44 2211 46
ATAACTCCACGC _A_Concatenation CTGGTAGTTGTT _B 753 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 44 2211 83
ATAACTCCACGC _A_Concatenation TATCTATCGTGT _B 754 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 44 2211 78
ATAACTCCACGC _A_Concatenation GAGACAGCTCTC _B 755 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 44 2211 48
ATAACTCCACGC _A_Concatenation GGAGGCACATCG _B 756 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 44 2211 76
ATAACTCCACGC _A_Concatenation TGATGAGCGCCC _B 757 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 44 2211 68
ATAACTCCACGC _A_Concatenation TTTAACGGATTG _B 758 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 44 2211 81
ATAACTCCACGC _A_Concatenation TACTTCTAGCGG _B 759 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 44 2211 90
ATAACTCCACGC _A_Concatenation TCTATCACAACT _B 760 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 44 2211 50
ATAACTCCACGC _A_Concatenation AGGCTTCCCGCT _B 761 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 44 2211 93
ATAACTCCACGC _A_Concatenation GATCGCATGTTC _B 762 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 44 2211 58
ATAACTCCACGC _A_Concatenation CATTGTTTAAAT _B 763 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 44 2211 8
ATAACTCCACGC _A_Concatenation ACTCGTGTTGTC _B 764 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 44 2211 91
ATAACTCCACGC _A_Concatenation CTGCCGGTTGCG _B 765 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 44 2211 45
ATAACTCCACGC _A_Concatenation linker 2_B 766 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 44 2211 37
ATAACTCCACGC _A_Concatenation AATCGATCCTAC _B 767 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 44 2211 96
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B 768 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 44 2211 17
ATAACTCCACGC _A_Concatenation AAGGGTAAATTG _B 769 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 44 2211 56
ATAACTCCACGC _A_Concatenation GTACATACCGAA _B 770 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 44 2211 55
ATAACTCCACGC _A_Concatenation AGCCAGGTCACC _B 771 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 44 2211 39
ATAACTCCACGC _A_Concatenation CCCGTAGGGGCT _B 772 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 44 2211 10
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B 773 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 44 2211 86
ATAACTCCACGC _A_Concatenation GACTGTGGGCGG _B 774 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 44 2211 89
ATAACTCCACGC _A_Concatenation GGCCTTTCGGGC _B 775 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 44 2211 30
ATAACTCCACGC _A_Concatenation GGTGTCTCGATT _B 776 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 44 2211 51
ATAACTCCACGC _A_Concatenation GTCACCCTCCTT _B 777 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 44 2211 18
ATAACTCCACGC _A_Concatenation Stop_2_B 778 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 44 2211 2
ATAACTCCACGC _A_Concatenation TTAGTGCCATAC _B 779 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 44 2211 62
ATAACTCCACGC _A_Concatenation TTCTAATTTCCT _B 780 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 44 2211 42
CTGCCGGTTGCG _A_Concatenation GAGATTGTGTCC _B 781 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 45 2211 64
CTGCCGGTTGCG _A_Concatenation ATGACCCCTTGT _B 782 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 45 2211 17
CTGCCGGTTGCG _A_Concatenation CTACGTGGCCCC _B 783 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 45 2211 86
CTGCCGGTTGCG _A_Concatenation AGAGTGCGCTCT _B 784 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 45 2211 14
CTGCCGGTTGCG _A_Concatenation ATAACTCCACGC _B 785 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 45 2211 44
CTGCCGGTTGCG _A_Concatenation ACTTGATGGTTT _B 786 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 45 2211 77
CTGCCGGTTGCG _A_Concatenation CCCTATGTTCTA _B 787 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 45 2211 15
TTAAATGAGGGC _A_Concatenation ACTTGATGGTTT _B 788 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 46 2211 77
TTAAATGAGGGC _A_Concatenation AGAGTGCGCTCT _B 789 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 46 2211 14
TTAAATGAGGGC _A_Concatenation ATAACTCCACGC _B 790 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 46 2211 44
TTAAATGAGGGC _A_Concatenation ATGACCCCTTGT _B 791 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 46 2211 17
TTAAATGAGGGC _A_Concatenation CTACGTGGCCCC _B 792 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 46 2211 86
TTAAATGAGGGC _A_Concatenation GAGATTGTGTCC _B 793 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 46 2211 64
TTAAATGAGGGC _A_Concatenation CCCTATGTTCTA _B 794 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 46 2211 15
GGGAGTATGTCG _A_Concatenation CTACGTGGCCCC _B 795 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 47 2211 86
GGGAGTATGTCG _A_Concatenation GAGATTGTGTCC _B 796 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 47 2211 64
GGGAGTATGTCG _A_Concatenation ATGACCCCTTGT _B 797 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 47 2211 17
GGGAGTATGTCG _A_Concatenation ATAACTCCACGC _B 798 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 47 2211 44
GGGAGTATGTCG _A_Concatenation ACTTGATGGTTT _B 799 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 47 2211 77
GGGAGTATGTCG _A_Concatenation AGAGTGCGCTCT _B 800 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 47 2211 14
GGGAGTATGTCG _A_Concatenation CCCTATGTTCTA _B 801 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 47 2211 15
GAGACAGCTCTC _A_Concatenation ATAACTCCACGC _B 802 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 48 2211 44
GAGACAGCTCTC _A_Concatenation CTACGTGGCCCC _B 803 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 48 2211 86
GAGACAGCTCTC _A_Concatenation GAGATTGTGTCC _B 804 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 48 2211 64
GAGACAGCTCTC _A_Concatenation ATGACCCCTTGT _B 805 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 48 2211 17
GAGACAGCTCTC _A_Concatenation ACTTGATGGTTT _B 806 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 48 2211 77
GAGACAGCTCTC _A_Concatenation AGAGTGCGCTCT _B 807 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 48 2211 14
GAGACAGCTCTC _A_Concatenation CCCTATGTTCTA _B 808 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMMGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 48 2211 15
TAAAATTTATCA _A_Concatenation ACTTGATGGTTT _B 809 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 49 2211 77
TAAAATTTATCA _A_Concatenation AGAGTGCGCTCT _B 810 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 49 2211 14
TAAAATTTATCA _A_Concatenation CTACGTGGCCCC _B 811 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 49 2211 86
TAAAATTTATCA _A_Concatenation ATAACTCCACGC _B 812 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 49 2211 44
TAAAATTTATCA _A_Concatenation GAGATTGTGTCC _B 813 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 49 2211 64
TAAAATTTATCA _A_Concatenation ATGACCCCTTGT _B 814 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 49 2211 17
TAAAATTTATCA _A_Concatenation CCCTATGTTCTA _B 815 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 49 2211 15
TCTATCACAACT _A_Concatenation CTACGTGGCCCC _B 816 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 50 2211 86
TCTATCACAACT _A_Concatenation ACTTGATGGTTT _B 817 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 50 2211 77
TCTATCACAACT _A_Concatenation AGAGTGCGCTCT _B 818 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 50 2211 14
TCTATCACAACT _A_Concatenation GAGATTGTGTCC _B 819 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 50 2211 64
TCTATCACAACT _A_Concatenation ATGACCCCTTGT _B 820 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 50 2211 17
TCTATCACAACT _A_Concatenation ATAACTCCACGC _B 821 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 50 2211 44
TCTATCACAACT _A_Concatenation CCCTATGTTCTA _B 822 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 50 2211 15
GGTGTCTCGATT _A_Concatenation AGAGTGCGCTCT _B 823 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 51 2211 14
GGTGTCTCGATT _A_Concatenation ACTTGATGGTTT _B 824 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 51 2211 77
GGTGTCTCGATT _A_Concatenation CTACGTGGCCCC _B 825 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 51 2211 86
GGTGTCTCGATT _A_Concatenation ATGACCCCTTGT _B 826 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 51 2211 17
GGTGTCTCGATT _A_Concatenation ATAACTCCACGC _B 827 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 51 2211 44
GGTGTCTCGATT _A_Concatenation GAGATTGTGTCC _B 828 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 51 2211 64
GGTGTCTCGATT _A_Concatenation CCCTATGTTCTA _B 829 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 51 2211 15
ACGGCGTAAGTA _A_Concatenation ATGACCCCTTGT _B 830 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 52 2211 17
ACGGCGTAAGTA _A_Concatenation AGAGTGCGCTCT _B 831 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 52 2211 14
ACGGCGTAAGTA _A_Concatenation ACTTGATGGTTT _B 832 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 52 2211 77
ACGGCGTAAGTA _A_Concatenation ATAACTCCACGC _B 833 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 52 2211 44
ACGGCGTAAGTA _A_Concatenation CTACGTGGCCCC _B 834 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 52 2211 86
ACGGCGTAAGTA _A_Concatenation GAGATTGTGTCC _B 835 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 52 2211 64
ACGGCGTAAGTA _A_Concatenation CCCTATGTTCTA _B 836 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 52 2211 15
TATTAAAAGTGC _A_Concatenation GAGATTGTGTCC _B 837 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 53 2211 64
TATTAAAAGTGC _A_Concatenation ATGACCCCTTGT _B 838 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 53 2211 17
TATTAAAAGTGC _A_Concatenation AGAGTGCGCTCT _B 839 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 53 2211 14
TATTAAAAGTGC _A_Concatenation ACTTGATGGTTT _B 840 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 53 2211 77
TATTAAAAGTGC _A_Concatenation ATAACTCCACGC _B 841 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 53 2211 44
TATTAAAAGTGC _A_Concatenation CTACGTGGCCCC _B 842 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 53 2211 86
TATTAAAAGTGC _A_Concatenation CCCTATGTTCTA _B 843 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEIGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 53 2211 15
ACCAGTATTTAA _A_Concatenation ATGACCCCTTGT _B 844 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 54 2211 17
ACCAGTATTTAA _A_Concatenation GAGATTGTGTCC _B 845 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 54 2211 64
ACCAGTATTTAA _A_Concatenation AGAGTGCGCTCT _B 846 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 54 2211 14
ACCAGTATTTAA _A_Concatenation ACTTGATGGTTT _B 847 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 54 2211 77
ACCAGTATTTAA _A_Concatenation ATAACTCCACGC _B 848 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 54 2211 44
ACCAGTATTTAA _A_Concatenation CTACGTGGCCCC _B 849 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 54 2211 86
ACCAGTATTTAA _A_Concatenation CCCTATGTTCTA _B 850 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 54 2211 15
GTACATACCGAA _A_Concatenation GAGATTGTGTCC _B 851 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 55 2211 64
GTACATACCGAA _A_Concatenation ACTTGATGGTTT _B 852 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 55 2211 77
GTACATACCGAA _A_Concatenation CTACGTGGCCCC _B 853 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 55 2211 86
GTACATACCGAA _A_Concatenation AGAGTGCGCTCT _B 854 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 55 2211 14
GTACATACCGAA _A_Concatenation ATGACCCCTTGT _B 855 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 55 2211 17
GTACATACCGAA _A_Concatenation ATAACTCCACGC _B 856 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 55 2211 44
GTACATACCGAA _A_Concatenation CCCTATGTTCTA _B 857 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIREGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 55 2211 15
AAGGGTAAATTG _A_Concatenation ACTTGATGGTTT _B 858 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 56 2211 77
AAGGGTAAATTG _A_Concatenation CTACGTGGCCCC _B 859 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 56 2211 86
AAGGGTAAATTG _A_Concatenation ATGACCCCTTGT _B 860 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 56 2211 17
AAGGGTAAATTG _A_Concatenation GAGATTGTGTCC _B 861 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 56 2211 64
AAGGGTAAATTG _A_Concatenation AGAGTGCGCTCT _B 862 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 56 2211 14
AAGGGTAAATTG _A_Concatenation ATAACTCCACGC _B 863 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 56 2211 44
AAGGGTAAATTG _A_Concatenation CCCTATGTTCTA _B 864 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 56 2211 15
ACTTTAGTAACA _A_Concatenation ATGACCCCTTGT _B 865 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 57 2211 17
ACTTTAGTAACA _A_Concatenation ATAACTCCACGC _B 866 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 57 2211 44
ACTTTAGTAACA _A_Concatenation ACTTGATGGTTT _B 867 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 57 2211 77
ACTTTAGTAACA _A_Concatenation AGAGTGCGCTCT _B 868 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 57 2211 14
ACTTTAGTAACA _A_Concatenation GAGATTGTGTCC _B 869 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 57 2211 64
ACTTTAGTAACA _A_Concatenation CTACGTGGCCCC _B 870 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 57 2211 86
ACTTTAGTAACA _A_Concatenation CCCTATGTTCTA _B 871 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVESGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 57 2211 15
GATCGCATGTTC _A_Concatenation ATGACCCCTTGT _B 872 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 58 2211 17
GATCGCATGTTC _A_Concatenation AGAGTGCGCTCT _B 873 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 58 2211 14
GATCGCATGTTC _A_Concatenation GAGATTGTGTCC _B 874 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 58 2211 64
GATCGCATGTTC _A_Concatenation CTACGTGGCCCC _B 875 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 58 2211 86
GATCGCATGTTC _A_Concatenation ACTTGATGGTTT _B 876 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 58 2211 77
GATCGCATGTTC _A_Concatenation ATAACTCCACGC _B 877 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 58 2211 44
GATCGCATGTTC _A_Concatenation CCCTATGTTCTA _B 878 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKERGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 58 2211 15
CCAGACGGTCTG _A_Concatenation ACTTGATGGTTT _B 879 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 59 2211 77
CCAGACGGTCTG _A_Concatenation CTACGTGGCCCC _B 880 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 59 2211 86
CCAGACGGTCTG _A_Concatenation GAGATTGTGTCC _B 881 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 59 2211 64
CCAGACGGTCTG _A_Concatenation ATAACTCCACGC _B 882 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 59 2211 44
CCAGACGGTCTG _A_Concatenation ATGACCCCTTGT _B 883 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 59 2211 17
CCAGACGGTCTG _A_Concatenation AGAGTGCGCTCT _B 884 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 59 2211 14
CCAGACGGTCTG _A_Concatenation CCCTATGTTCTA _B 885 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNNGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 59 2211 15
ACGACTCACCGC _A_Concatenation ATAACTCCACGC _B 886 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 60 2211 44
ACGACTCACCGC _A_Concatenation GAGATTGTGTCC _B 887 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 60 2211 64
ACGACTCACCGC _A_Concatenation ATGACCCCTTGT _B 888 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 60 2211 17
ACGACTCACCGC _A_Concatenation CTACGTGGCCCC _B 889 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 60 2211 86
ACGACTCACCGC _A_Concatenation ACTTGATGGTTT _B 890 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 60 2211 77
ACGACTCACCGC _A_Concatenation AGAGTGCGCTCT _B 891 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 60 2211 14
ACGACTCACCGC _A_Concatenation CCCTATGTTCTA _B 892 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 60 2211 15
AGCCATAAACCT _A_Concatenation ATGACCCCTTGT _B 893 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 61 2211 17
AGCCATAAACCT _A_Concatenation AGAGTGCGCTCT _B 894 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 61 2211 14
AGCCATAAACCT _A_Concatenation CTACGTGGCCCC _B 895 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 61 2211 86
AGCCATAAACCT _A_Concatenation GAGATTGTGTCC _B 896 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 61 2211 64
AGCCATAAACCT _A_Concatenation ACTTGATGGTTT _B 897 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 61 2211 77
AGCCATAAACCT _A_Concatenation ATAACTCCACGC _B 898 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 61 2211 44
AGCCATAAACCT _A_Concatenation CCCTATGTTCTA _B 899 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 61 2211 15
TTAGTGCCATAC _A_Concatenation AGAGTGCGCTCT _B 900 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 62 2211 14
TTAGTGCCATAC _A_Concatenation ATGACCCCTTGT _B 901 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 62 2211 17
TTAGTGCCATAC _A_Concatenation CTACGTGGCCCC _B 902 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 62 2211 86
TTAGTGCCATAC _A_Concatenation ACTTGATGGTTT _B 903 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 62 2211 77
TTAGTGCCATAC _A_Concatenation ATAACTCCACGC _B 904 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 62 2211 44
TTAGTGCCATAC _A_Concatenation GAGATTGTGTCC _B 905 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 62 2211 64
TTAGTGCCATAC _A_Concatenation CCCTATGTTCTA _B 906 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 62 2211 15
TATCGCAAGAAC _A_Concatenation ATGACCCCTTGT _B 907 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 63 2211 17
TATCGCAAGAAC _A_Concatenation ACTTGATGGTTT _B 908 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 63 2211 77
TATCGCAAGAAC _A_Concatenation CTACGTGGCCCC _B 909 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 63 2211 86
TATCGCAAGAAC _A_Concatenation GAGATTGTGTCC _B 910 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 63 2211 64
TATCGCAAGAAC _A_Concatenation AGAGTGCGCTCT _B 911 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 63 2211 14
TATCGCAAGAAC _A_Concatenation ATAACTCCACGC _B 912 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 63 2211 44
TATCGCAAGAAC _A_Concatenation CCCTATGTTCTA _B 913 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 63 2211 15
GAGATTGTGTCC _A_Concatenation CCAGACGGTCTG _B 914 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 64 2211 59
GAGATTGTGTCC _A_Concatenation TAAGATTCAACG _B 915 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 64 2211 13
GAGATTGTGTCC _A_Concatenation TTAAATGAGGGC _B 916 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 64 2211 46
GAGATTGTGTCC _A_Concatenation TCTGTACCAACG _B 917 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 64 2211 25
GAGATTGTGTCC _A_Concatenation CCCTGCCTCGGG _B 918 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 64 2211 26
GAGATTGTGTCC _A_Concatenation CATGCTAACACC _B 919 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 64 2211 88
GAGATTGTGTCC _A_Concatenation GCAACGAGGGTC _B 920 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 64 2211 74
GAGATTGTGTCC _A_Concatenation GACTGTGGGCGG _B 921 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 64 2211 89
GAGATTGTGTCC _A_Concatenation CCGCCCTTATGT _B 922 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 64 2211 98
GAGATTGTGTCC _A_Concatenation EPICXV.47_B 923 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 64 2211 34
GAGATTGTGTCC _A_Concatenation CACACCCGGCAG _B 924 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 64 2211 95
GAGATTGTGTCC _A_Concatenation CCCTATGTTCTA _B 925 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 64 2211 15
GAGATTGTGTCC _A_Concatenation GAGTGACCTATT _B 926 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 64 2211 41
GAGATTGTGTCC _A_Concatenation CTCAGCGATATA _B 927 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 64 2211 27
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B 928 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 64 2211 77
GAGATTGTGTCC _A_Concatenation ATGACAAAACGA _B 929 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 64 2211 40
GAGATTGTGTCC _A_Concatenation Stop_1_B 930 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 64 2211 1
GAGATTGTGTCC _A_Concatenation AAGGGTAAATTG _B 931 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 64 2211 56
GAGATTGTGTCC _A_Concatenation ATTTGTAGACCG _B 932 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 64 2211 66
GAGATTGTGTCC _A_Concatenation AATCGAGACCAG _B 933 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 64 2211 24
GAGATTGTGTCC _A_Concatenation GATCGCATGTTC _B 934 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 64 2211 58
GAGATTGTGTCC _A_Concatenation AATGGCACTAGC _B 935 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 64 2211 22
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B 936 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 64 2211 64
GAGATTGTGTCC _A_Concatenation GTTCCACGCCTG _B 937 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 64 2211 97
GAGATTGTGTCC _A_Concatenation ATCCAATCTGTG _B 938 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 64 2211 67
GAGATTGTGTCC _A_Concatenation CGCCGGCATACA _B 939 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 64 2211 16
GAGATTGTGTCC _A_Concatenation linker 1_B 940 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 64 2211 35
GAGATTGTGTCC _A_Concatenation GATGGAGCTACA _B 941 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 64 2211 23
GAGATTGTGTCC _A_Concatenation linker 3_B 942 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 64 2211 36
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B 943 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 64 2211 17
GAGATTGTGTCC _A_Concatenation ACGACTCACCGC _B 944 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 64 2211 60
GAGATTGTGTCC _A_Concatenation TTTAACGGATTG _B 945 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 64 2211 81
GAGATTGTGTCC _A_Concatenation CTCACGACAAGA _B 946 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 64 2211 4
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B 947 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 64 2211 44
GAGATTGTGTCC _A_Concatenation ATGTGGCACGAC _B 948 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 64 2211 69
GAGATTGTGTCC _A_Concatenation GGGAGTATGTCG _B 949 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 64 2211 47
GAGATTGTGTCC _A_Concatenation CGATGGCGGAAT _B 950 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 64 2211 19
GAGATTGTGTCC _A_Concatenation TGCCAGACGGTC _B 951 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 64 2211 75
GAGATTGTGTCC _A_Concatenation TCAGCCGTAGGC _B 952 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 64 2211 72
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B 953 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 64 2211 14
GAGATTGTGTCC _A_Concatenation TGGCCCCCAGTT _B 954 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 64 2211 31
GAGATTGTGTCC _A_Concatenation GGCACAGCTCCA _B 955 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 64 2211 87
GAGATTGTGTCC _A_Concatenation ACTTTAGTAACA _B 956 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 64 2211 57
GAGATTGTGTCC _A_Concatenation ACTCGTGTTGTC _B 957 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 64 2211 91
GAGATTGTGTCC _A_Concatenation AAGACCGGTGCC _B 958 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 64 2211 73
GAGATTGTGTCC _A_Concatenation TCTTAGAGGGTG _B 959 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 64 2211 29
GAGATTGTGTCC _A_Concatenation AGCCGTGCTTGA _B 960 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 64 2211 32
GAGATTGTGTCC _A_Concatenation CTCGCACCGAGG _B 961 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 64 2211 6
GAGATTGTGTCC _A_Concatenation TACTTCTAGCGG _B 962 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 64 2211 90
GAGATTGTGTCC _A_Concatenation AAAACAAGCATT _B 963 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 64 2211 84
GAGATTGTGTCC _A_Concatenation AGTTGCTTTGCT _B 964 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 64 2211 20
GAGATTGTGTCC _A_Concatenation AATCAACGAGCA _B 965 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 64 2211 79
GAGATTGTGTCC _A_Concatenation AGCCATAAACCT _B 966 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 64 2211 61
GAGATTGTGTCC _A_Concatenation GGAGGCACATCG _B 967 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 64 2211 76
GAGATTGTGTCC _A_Concatenation CTCGTTATGGCA _B 968 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 64 2211 9
GAGATTGTGTCC _A_Concatenation TAAAATTTATCA _B 969 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 64 2211 49
GAGATTGTGTCC _A_Concatenation TCTTGGTTTTGA _B 970 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 64 2211 11
GAGATTGTGTCC _A_Concatenation CATTAGTCCCGC _B 971 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 64 2211 5
GAGATTGTGTCC _A_Concatenation CAGTGACATAGT _B 972 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 64 2211 3
GAGATTGTGTCC _A_Concatenation CATCTCACCTAG _B 973 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 64 2211 12
GAGATTGTGTCC _A_Concatenation ATCTCTCAGTTC _B 974 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 64 2211 94
GAGATTGTGTCC _A_Concatenation CCTAAATGCAAG _B 975 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 64 2211 85
GAGATTGTGTCC _A_Concatenation GCGAGGCAGACC _B 976 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 64 2211 70
GAGATTGTGTCC _A_Concatenation CGGCAATTGCTT _B 977 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 64 2211 82
GAGATTGTGTCC _A_Concatenation AAGCAACGATAT _B 978 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 64 2211 38
GAGATTGTGTCC _A_Concatenation ACAACGTGCATA _B 979 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 64 2211 71
GAGATTGTGTCC _A_Concatenation CCCCTTAGGAAT _B 980 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 64 2211 80
GAGATTGTGTCC _A_Concatenation GTACATACCGAA _B 981 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 64 2211 55
GAGATTGTGTCC _A_Concatenation TGATGAGCGCCC _B 982 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 64 2211 68
GAGATTGTGTCC _A_Concatenation GCAGATTCGAAT _B 983 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 64 2211 43
GAGATTGTGTCC _A_Concatenation AGCCAGGTCACC _B 984 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 64 2211 39
GAGATTGTGTCC _A_Concatenation TACATGAGGCAC _B 985 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 64 2211 65
GAGATTGTGTCC _A_Concatenation ATTGTATCTAAC _B 986 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 64 2211 92
GAGATTGTGTCC _A_Concatenation CTGCCGGTTGCG _B 987 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 64 2211 45
GAGATTGTGTCC _A_Concatenation CTCCATTAATGA _B 988 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 64 2211 28
GAGATTGTGTCC _A_Concatenation CTGGTAGTTGTT _B 989 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 64 2211 83
GAGATTGTGTCC _A_Concatenation linker 2_B 990 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 64 2211 37
GAGATTGTGTCC _A_Concatenation AGTTAGGCTCTT _B 991 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 64 2211 21
GAGATTGTGTCC _A_Concatenation ACCAGTATTTAA _B 992 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 64 2211 54
GAGATTGTGTCC _A_Concatenation TATTAAAAGTGC _B 993 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 64 2211 53
GAGATTGTGTCC _A_Concatenation CAACGCATGCCT _B 994 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 64 2211 100
GAGATTGTGTCC _A_Concatenation GGCACTGTCGAG _B 995 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 64 2211 99
GAGATTGTGTCC _A_Concatenation CATTGTTTAAAT _B 996 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 64 2211 8
GAGATTGTGTCC _A_Concatenation AAACTTTGAGCG _B 997 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 64 2211 7
GAGATTGTGTCC _A_Concatenation AATCGATCCTAC _B 998 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 64 2211 96
GAGATTGTGTCC _A_Concatenation ACGGCGTAAGTA _B 999 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 64 2211 52
GAGATTGTGTCC _A_Concatenation AGGCTTCCCGCT _B 1000 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 64 2211 93
GAGATTGTGTCC _A_Concatenation CATCTGAGTCGT _B 1001 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 64 2211 33
GAGATTGTGTCC _A_Concatenation CCCGTAGGGGCT _B 1002 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 64 2211 10
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B 1003 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 64 2211 86
GAGATTGTGTCC _A_Concatenation GAGACAGCTCTC _B 1004 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 64 2211 48
GAGATTGTGTCC _A_Concatenation GGCCTTTCGGGC _B 1005 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 64 2211 30
GAGATTGTGTCC _A_Concatenation GGTGTCTCGATT _B 1006 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 64 2211 51
GAGATTGTGTCC _A_Concatenation GTCACCCTCCTT _B 1007 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 64 2211 18
GAGATTGTGTCC _A_Concatenation Stop_2_B 1008 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 64 2211 2
GAGATTGTGTCC _A_Concatenation TATCGCAAGAAC _B 1009 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 64 2211 63
GAGATTGTGTCC _A_Concatenation TATCTATCGTGT _B 1010 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 64 2211 78
GAGATTGTGTCC _A_Concatenation TCTATCACAACT _B 1011 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 64 2211 50
GAGATTGTGTCC _A_Concatenation TTAGTGCCATAC _B 1012 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 64 2211 62
GAGATTGTGTCC _A_Concatenation TTCTAATTTCCT _B 1013 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLPGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 64 2211 42
TACATGAGGCAC _A_Concatenation ACTTGATGGTTT _B 1014 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 65 2211 77
TACATGAGGCAC _A_Concatenation GAGATTGTGTCC _B 1015 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 65 2211 64
TACATGAGGCAC _A_Concatenation AGAGTGCGCTCT _B 1016 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 65 2211 14
TACATGAGGCAC _A_Concatenation ATAACTCCACGC _B 1017 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 65 2211 44
TACATGAGGCAC _A_Concatenation ATGACCCCTTGT _B 1018 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 65 2211 17
TACATGAGGCAC _A_Concatenation CTACGTGGCCCC _B 1019 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 65 2211 86
TACATGAGGCAC _A_Concatenation CCCTATGTTCTA _B 1020 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 65 2211 15
ATTTGTAGACCG _A_Concatenation AGAGTGCGCTCT _B 1021 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 66 2211 14
ATTTGTAGACCG _A_Concatenation ACTTGATGGTTT _B 1022 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 66 2211 77
ATTTGTAGACCG _A_Concatenation CTACGTGGCCCC _B 1023 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 66 2211 86
ATTTGTAGACCG _A_Concatenation GAGATTGTGTCC _B 1024 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 66 2211 64
ATTTGTAGACCG _A_Concatenation ATAACTCCACGC _B 1025 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 66 2211 44
ATTTGTAGACCG _A_Concatenation ATGACCCCTTGT _B 1026 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 66 2211 17
ATTTGTAGACCG _A_Concatenation CCCTATGTTCTA _B 1027 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENEGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 66 2211 15
ATCCAATCTGTG _A_Concatenation ATGACCCCTTGT _B 1028 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 67 2211 17
ATCCAATCTGTG _A_Concatenation ATAACTCCACGC _B 1029 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 67 2211 44
ATCCAATCTGTG _A_Concatenation CTACGTGGCCCC _B 1030 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 67 2211 86
ATCCAATCTGTG _A_Concatenation AGAGTGCGCTCT _B 1031 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 67 2211 14
ATCCAATCTGTG _A_Concatenation GAGATTGTGTCC _B 1032 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 67 2211 64
ATCCAATCTGTG _A_Concatenation ACTTGATGGTTT _B 1033 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 67 2211 77
ATCCAATCTGTG _A_Concatenation CCCTATGTTCTA _B 1034 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLTGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 67 2211 15
TGATGAGCGCCC _A_Concatenation AGAGTGCGCTCT _B 1035 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 68 2211 14
TGATGAGCGCCC _A_Concatenation ATGACCCCTTGT _B 1036 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 68 2211 17
TGATGAGCGCCC _A_Concatenation CTACGTGGCCCC _B 1037 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 68 2211 86
TGATGAGCGCCC _A_Concatenation GAGATTGTGTCC _B 1038 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 68 2211 64
TGATGAGCGCCC _A_Concatenation ATAACTCCACGC _B 1039 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 68 2211 44
TGATGAGCGCCC _A_Concatenation ACTTGATGGTTT _B 1040 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 68 2211 77
TGATGAGCGCCC _A_Concatenation CCCTATGTTCTA _B 1041 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 68 2211 15
ATGTGGCACGAC _A_Concatenation ACTTGATGGTTT _B 1042 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 69 2211 77
ATGTGGCACGAC _A_Concatenation ATGACCCCTTGT _B 1043 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 69 2211 17
ATGTGGCACGAC _A_Concatenation AGAGTGCGCTCT _B 1044 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 69 2211 14
ATGTGGCACGAC _A_Concatenation ATAACTCCACGC _B 1045 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 69 2211 44
ATGTGGCACGAC _A_Concatenation CTACGTGGCCCC _B 1046 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 69 2211 86
ATGTGGCACGAC _A_Concatenation GAGATTGTGTCC _B 1047 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 69 2211 64
ATGTGGCACGAC _A_Concatenation CCCTATGTTCTA _B 1048 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLACGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 69 2211 15
GCGAGGCAGACC _A_Concatenation GAGATTGTGTCC _B 1049 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 70 2211 64
GCGAGGCAGACC _A_Concatenation ATAACTCCACGC _B 1050 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 70 2211 44
GCGAGGCAGACC _A_Concatenation AGAGTGCGCTCT _B 1051 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 70 2211 14
GCGAGGCAGACC _A_Concatenation ACTTGATGGTTT _B 1052 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 70 2211 77
GCGAGGCAGACC _A_Concatenation CTACGTGGCCCC _B 1053 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 70 2211 86
GCGAGGCAGACC _A_Concatenation ATGACCCCTTGT _B 1054 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 70 2211 17
GCGAGGCAGACC _A_Concatenation CCCTATGTTCTA _B 1055 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRDGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 70 2211 15
ACAACGTGCATA _A_Concatenation GAGATTGTGTCC _B 1056 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 71 2211 64
ACAACGTGCATA _A_Concatenation ATGACCCCTTGT _B 1057 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 71 2211 17
ACAACGTGCATA _A_Concatenation CTACGTGGCCCC _B 1058 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 71 2211 86
ACAACGTGCATA _A_Concatenation AGAGTGCGCTCT _B 1059 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 71 2211 14
ACAACGTGCATA _A_Concatenation ACTTGATGGTTT _B 1060 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 71 2211 77
ACAACGTGCATA _A_Concatenation ATAACTCCACGC _B 1061 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 71 2211 44
ACAACGTGCATA _A_Concatenation CCCTATGTTCTA _B 1062 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDETGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 71 2211 15
TCAGCCGTAGGC _A_Concatenation ATAACTCCACGC _B 1063 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 72 2211 44
TCAGCCGTAGGC _A_Concatenation CTACGTGGCCCC _B 1064 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 72 2211 86
TCAGCCGTAGGC _A_Concatenation ATGACCCCTTGT _B 1065 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 72 2211 17
TCAGCCGTAGGC _A_Concatenation AGAGTGCGCTCT _B 1066 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 72 2211 14
TCAGCCGTAGGC _A_Concatenation ACTTGATGGTTT _B 1067 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 72 2211 77
TCAGCCGTAGGC _A_Concatenation GAGATTGTGTCC _B 1068 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 72 2211 64
TCAGCCGTAGGC _A_Concatenation CCCTATGTTCTA _B 1069 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRRGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 72 2211 15
AAGACCGGTGCC _A_Concatenation ATAACTCCACGC _B 1070 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 73 2211 44
AAGACCGGTGCC _A_Concatenation GAGATTGTGTCC _B 1071 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 73 2211 64
AAGACCGGTGCC _A_Concatenation ACTTGATGGTTT _B 1072 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 73 2211 77
AAGACCGGTGCC _A_Concatenation ATGACCCCTTGT _B 1073 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 73 2211 17
AAGACCGGTGCC _A_Concatenation CTACGTGGCCCC _B 1074 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 73 2211 86
AAGACCGGTGCC _A_Concatenation AGAGTGCGCTCT _B 1075 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 73 2211 14
AAGACCGGTGCC _A_Concatenation CCCTATGTTCTA _B 1076 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAFGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 73 2211 15
GCAACGAGGGTC _A_Concatenation CTACGTGGCCCC _B 1077 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 74 2211 86
GCAACGAGGGTC _A_Concatenation ATAACTCCACGC _B 1078 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 74 2211 44
GCAACGAGGGTC _A_Concatenation GAGATTGTGTCC _B 1079 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 74 2211 64
GCAACGAGGGTC _A_Concatenation AGAGTGCGCTCT _B 1080 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 74 2211 14
GCAACGAGGGTC _A_Concatenation ACTTGATGGTTT _B 1081 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 74 2211 77
GCAACGAGGGTC _A_Concatenation ATGACCCCTTGT _B 1082 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 74 2211 17
GCAACGAGGGTC _A_Concatenation CCCTATGTTCTA _B 1083 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 74 2211 15
TGCCAGACGGTC _A_Concatenation ACTTGATGGTTT _B 1084 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 75 2211 77
TGCCAGACGGTC _A_Concatenation GAGATTGTGTCC _B 1085 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 75 2211 64
TGCCAGACGGTC _A_Concatenation ATGACCCCTTGT _B 1086 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 75 2211 17
TGCCAGACGGTC _A_Concatenation CTACGTGGCCCC _B 1087 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 75 2211 86
TGCCAGACGGTC _A_Concatenation AGAGTGCGCTCT _B 1088 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 75 2211 14
TGCCAGACGGTC _A_Concatenation ATAACTCCACGC _B 1089 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 75 2211 44
TGCCAGACGGTC _A_Concatenation CCCTATGTTCTA _B 1090 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVGGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 75 2211 15
GGAGGCACATCG _A_Concatenation ATAACTCCACGC _B 1091 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 76 2211 44
GGAGGCACATCG _A_Concatenation GAGATTGTGTCC _B 1092 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 76 2211 64
GGAGGCACATCG _A_Concatenation ATGACCCCTTGT _B 1093 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 76 2211 17
GGAGGCACATCG _A_Concatenation ACTTGATGGTTT _B 1094 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 76 2211 77
GGAGGCACATCG _A_Concatenation AGAGTGCGCTCT _B 1095 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 76 2211 14
GGAGGCACATCG _A_Concatenation CTACGTGGCCCC _B 1096 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 76 2211 86
GGAGGCACATCG _A_Concatenation CCCTATGTTCTA _B 1097 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 76 2211 15
ACTTGATGGTTT _A_Concatenation CATTGTTTAAAT _B 1098 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 77 2211 8
ACTTGATGGTTT _A_Concatenation AATCGATCCTAC _B 1099 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 77 2211 96
ACTTGATGGTTT _A_Concatenation AATGGCACTAGC _B 1100 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 77 2211 22
ACTTGATGGTTT _A_Concatenation AGTTGCTTTGCT _B 1101 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 77 2211 20
ACTTGATGGTTT _A_Concatenation TACTTCTAGCGG _B 1102 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 77 2211 90
ACTTGATGGTTT _A_Concatenation CGGCAATTGCTT _B 1103 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 77 2211 82
ACTTGATGGTTT _A_Concatenation CACACCCGGCAG _B 1104 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 77 2211 95
ACTTGATGGTTT _A_Concatenation TATCGCAAGAAC _B 1105 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 77 2211 63
ACTTGATGGTTT _A_Concatenation CCAGACGGTCTG _B 1106 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 77 2211 59
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 1107 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 77 2211 64
ACTTGATGGTTT _A_Concatenation CGCCGGCATACA _B 1108 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 77 2211 16
ACTTGATGGTTT _A_Concatenation GGAGGCACATCG _B 1109 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 77 2211 76
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B 1110 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 77 2211 86
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B 1111 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 77 2211 17
ACTTGATGGTTT _A_Concatenation CTCACGACAAGA _B 1112 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 77 2211 4
ACTTGATGGTTT _A_Concatenation CAGTGACATAGT _B 1113 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 77 2211 3
ACTTGATGGTTT _A_Concatenation CCGCCCTTATGT _B 1114 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 77 2211 98
ACTTGATGGTTT _A_Concatenation CATTAGTCCCGC _B 1115 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 77 2211 5
ACTTGATGGTTT _A_Concatenation GCAACGAGGGTC _B 1116 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 77 2211 74
ACTTGATGGTTT _A_Concatenation CTCGCACCGAGG _B 1117 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 77 2211 6
ACTTGATGGTTT _A_Concatenation AAGGGTAAATTG _B 1118 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 77 2211 56
ACTTGATGGTTT _A_Concatenation linker 1_B 1119 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 77 2211 35
ACTTGATGGTTT _A_Concatenation linker 3_B 1120 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 77 2211 36
ACTTGATGGTTT _A_Concatenation TCTTAGAGGGTG _B 1121 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 77 2211 29
ACTTGATGGTTT _A_Concatenation TTCTAATTTCCT _B 1122 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 77 2211 42
ACTTGATGGTTT _A_Concatenation TGCCAGACGGTC _B 1123 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 77 2211 75
ACTTGATGGTTT _A_Concatenation GATGGAGCTACA _B 1124 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 77 2211 23
ACTTGATGGTTT _A_Concatenation AAGACCGGTGCC _B 1125 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 77 2211 73
ACTTGATGGTTT _A_Concatenation GCGAGGCAGACC _B 1126 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 77 2211 70
ACTTGATGGTTT _A_Concatenation TCTTGGTTTTGA _B 1127 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 77 2211 11
ACTTGATGGTTT _A_Concatenation CAACGCATGCCT _B 1128 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 77 2211 100
ACTTGATGGTTT _A_Concatenation GTTCCACGCCTG _B 1129 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 77 2211 97
ACTTGATGGTTT _A_Concatenation TATCTATCGTGT _B 1130 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 77 2211 78
ACTTGATGGTTT _A_Concatenation CTCAGCGATATA _B 1131 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 77 2211 27
ACTTGATGGTTT _A_Concatenation CATCTGAGTCGT _B 1132 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 77 2211 33
ACTTGATGGTTT _A_Concatenation TACATGAGGCAC _B 1133 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 77 2211 65
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B 1134 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 77 2211 14
ACTTGATGGTTT _A_Concatenation TAAGATTCAACG _B 1135 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 77 2211 13
ACTTGATGGTTT _A_Concatenation AGCCATAAACCT _B 1136 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 77 2211 61
ACTTGATGGTTT _A_Concatenation CTCGTTATGGCA _B 1137 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 77 2211 9
ACTTGATGGTTT _A_Concatenation TTTAACGGATTG _B 1138 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 77 2211 81
ACTTGATGGTTT _A_Concatenation CTGCCGGTTGCG _B 1139 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 77 2211 45
ACTTGATGGTTT _A_Concatenation ATGTGGCACGAC _B 1140 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 77 2211 69
ACTTGATGGTTT _A_Concatenation TCAGCCGTAGGC _B 1141 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 77 2211 72
ACTTGATGGTTT _A_Concatenation TGGCCCCCAGTT _B 1142 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 77 2211 31
ACTTGATGGTTT _A_Concatenation AGTTAGGCTCTT _B 1143 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 77 2211 21
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B 1144 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 77 2211 77
ACTTGATGGTTT _A_Concatenation linker 2_B 1145 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 77 2211 37
ACTTGATGGTTT _A_Concatenation EPICXV.47_B 1146 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 77 2211 34
ACTTGATGGTTT _A_Concatenation TCTGTACCAACG _B 1147 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 77 2211 25
ACTTGATGGTTT _A_Concatenation GTACATACCGAA _B 1148 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 77 2211 55
ACTTGATGGTTT _A_Concatenation TAAAATTTATCA _B 1149 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 77 2211 49
ACTTGATGGTTT _A_Concatenation CCCTATGTTCTA _B 1150 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 77 2211 15
ACTTGATGGTTT _A_Concatenation ATGACAAAACGA _B 1151 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 77 2211 40
ACTTGATGGTTT _A_Concatenation CATGCTAACACC _B 1152 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 77 2211 88
ACTTGATGGTTT _A_Concatenation GATCGCATGTTC _B 1153 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 77 2211 58
ACTTGATGGTTT _A_Concatenation AATCGAGACCAG _B 1154 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 77 2211 24
ACTTGATGGTTT _A_Concatenation GGCACAGCTCCA _B 1155 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 77 2211 87
ACTTGATGGTTT _A_Concatenation ACCAGTATTTAA _B 1156 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 77 2211 54
ACTTGATGGTTT _A_Concatenation Stop_1_B 1157 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 77 2211 1
ACTTGATGGTTT _A_Concatenation GACTGTGGGCGG _B 1158 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 77 2211 89
ACTTGATGGTTT _A_Concatenation GGTGTCTCGATT _B 1159 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 77 2211 51
ACTTGATGGTTT _A_Concatenation ACTCGTGTTGTC _B 1160 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 77 2211 91
ACTTGATGGTTT _A_Concatenation GAGTGACCTATT _B 1161 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 77 2211 41
ACTTGATGGTTT _A_Concatenation TGATGAGCGCCC _B 1162 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 77 2211 68
ACTTGATGGTTT _A_Concatenation GGGAGTATGTCG _B 1163 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 77 2211 47
ACTTGATGGTTT _A_Concatenation TTAAATGAGGGC _B 1164 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 77 2211 46
ACTTGATGGTTT _A_Concatenation CGATGGCGGAAT _B 1165 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 77 2211 19
ACTTGATGGTTT _A_Concatenation AGCCGTGCTTGA _B 1166 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 77 2211 32
ACTTGATGGTTT _A_Concatenation CTCCATTAATGA _B 1167 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 77 2211 28
ACTTGATGGTTT _A_Concatenation CTGGTAGTTGTT _B 1168 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 77 2211 83
ACTTGATGGTTT _A_Concatenation CATCTCACCTAG _B 1169 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 77 2211 12
ACTTGATGGTTT _A_Concatenation ATTTGTAGACCG _B 1170 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 77 2211 66
ACTTGATGGTTT _A_Concatenation AATCAACGAGCA _B 1171 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 77 2211 79
ACTTGATGGTTT _A_Concatenation AAGCAACGATAT _B 1172 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 77 2211 38
ACTTGATGGTTT _A_Concatenation ACTTTAGTAACA _B 1173 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 77 2211 57
ACTTGATGGTTT _A_Concatenation ACGACTCACCGC _B 1174 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 77 2211 60
ACTTGATGGTTT _A_Concatenation GGCACTGTCGAG _B 1175 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 77 2211 99
ACTTGATGGTTT _A_Concatenation AGCCAGGTCACC _B 1176 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 77 2211 39
ACTTGATGGTTT _A_Concatenation ACAACGTGCATA _B 1177 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 77 2211 71
ACTTGATGGTTT _A_Concatenation ATCTCTCAGTTC _B 1178 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 77 2211 94
ACTTGATGGTTT _A_Concatenation GCAGATTCGAAT _B 1179 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 77 2211 43
ACTTGATGGTTT _A_Concatenation CCCTGCCTCGGG _B 1180 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 77 2211 26
ACTTGATGGTTT _A_Concatenation TATTAAAAGTGC _B 1181 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 77 2211 53
ACTTGATGGTTT _A_Concatenation ATTGTATCTAAC _B 1182 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 77 2211 92
ACTTGATGGTTT _A_Concatenation CCCCTTAGGAAT _B 1183 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 77 2211 80
ACTTGATGGTTT _A_Concatenation AAAACAAGCATT _B 1184 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 77 2211 84
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B 1185 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 77 2211 44
ACTTGATGGTTT _A_Concatenation GAGACAGCTCTC _B 1186 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 77 2211 48
ACTTGATGGTTT _A_Concatenation AGGCTTCCCGCT _B 1187 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 77 2211 93
ACTTGATGGTTT _A_Concatenation AAACTTTGAGCG _B 1188 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 77 2211 7
ACTTGATGGTTT _A_Concatenation ACGGCGTAAGTA _B 1189 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 77 2211 52
ACTTGATGGTTT _A_Concatenation ATCCAATCTGTG _B 1190 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 77 2211 67
ACTTGATGGTTT _A_Concatenation CCCGTAGGGGCT _B 1191 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 77 2211 10
ACTTGATGGTTT _A_Concatenation CCTAAATGCAAG _B 1192 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 77 2211 85
ACTTGATGGTTT _A_Concatenation GGCCTTTCGGGC _B 1193 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 77 2211 30
ACTTGATGGTTT _A_Concatenation GTCACCCTCCTT _B 1194 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 77 2211 18
ACTTGATGGTTT _A_Concatenation Stop_2_B 1195 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 77 2211 2
ACTTGATGGTTT _A_Concatenation TCTATCACAACT _B 1196 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 77 2211 50
ACTTGATGGTTT _A_Concatenation TTAGTGCCATAC _B 1197 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRRGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 77 2211 62
TATCTATCGTGT _A_Concatenation GAGATTGTGTCC _B 1198 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 78 2211 64
TATCTATCGTGT _A_Concatenation ATAACTCCACGC _B 1199 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 78 2211 44
TATCTATCGTGT _A_Concatenation ACTTGATGGTTT _B 1200 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 78 2211 77
TATCTATCGTGT _A_Concatenation CTACGTGGCCCC _B 1201 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 78 2211 86
TATCTATCGTGT _A_Concatenation ATGACCCCTTGT _B 1202 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 78 2211 17
TATCTATCGTGT _A_Concatenation AGAGTGCGCTCT _B 1203 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 78 2211 14
TATCTATCGTGT _A_Concatenation CCCTATGTTCTA _B 1204 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 78 2211 15
AATCAACGAGCA _A_Concatenation AGAGTGCGCTCT _B 1205 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 79 2211 14
AATCAACGAGCA _A_Concatenation CTACGTGGCCCC _B 1206 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 79 2211 86
AATCAACGAGCA _A_Concatenation GAGATTGTGTCC _B 1207 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 79 2211 64
AATCAACGAGCA _A_Concatenation ATGACCCCTTGT _B 1208 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 79 2211 17
AATCAACGAGCA _A_Concatenation ATAACTCCACGC _B 1209 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 79 2211 44
AATCAACGAGCA _A_Concatenation ACTTGATGGTTT _B 1210 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 79 2211 77
AATCAACGAGCA _A_Concatenation CCCTATGTTCTA _B 1211 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEEGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 79 2211 15
CCCCTTAGGAAT _A_Concatenation AGAGTGCGCTCT _B 1212 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 80 2211 14
CCCCTTAGGAAT _A_Concatenation ACTTGATGGTTT _B 1213 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 80 2211 77
CCCCTTAGGAAT _A_Concatenation GAGATTGTGTCC _B 1214 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 80 2211 64
CCCCTTAGGAAT _A_Concatenation ATAACTCCACGC _B 1215 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 80 2211 44
CCCCTTAGGAAT _A_Concatenation ATGACCCCTTGT _B 1216 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 80 2211 17
CCCCTTAGGAAT _A_Concatenation CTACGTGGCCCC _B 1217 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 80 2211 86
CCCCTTAGGAAT _A_Concatenation CCCTATGTTCTA _B 1218 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLDGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 80 2211 15
TTTAACGGATTG _A_Concatenation GAGATTGTGTCC _B 1219 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 81 2211 64
TTTAACGGATTG _A_Concatenation AGAGTGCGCTCT _B 1220 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 81 2211 14
TTTAACGGATTG _A_Concatenation ATAACTCCACGC _B 1221 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 81 2211 44
TTTAACGGATTG _A_Concatenation ACTTGATGGTTT _B 1222 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 81 2211 77
TTTAACGGATTG _A_Concatenation ATGACCCCTTGT _B 1223 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 81 2211 17
TTTAACGGATTG _A_Concatenation CTACGTGGCCCC _B 1224 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 81 2211 86
TTTAACGGATTG _A_Concatenation CCCTATGTTCTA _B 1225 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDNGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 81 2211 15
CGGCAATTGCTT _A_Concatenation ACTTGATGGTTT _B 1226 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 82 2211 77
CGGCAATTGCTT _A_Concatenation AGAGTGCGCTCT _B 1227 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 82 2211 14
CGGCAATTGCTT _A_Concatenation ATAACTCCACGC _B 1228 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 82 2211 44
CGGCAATTGCTT _A_Concatenation ATGACCCCTTGT _B 1229 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 82 2211 17
CGGCAATTGCTT _A_Concatenation CTACGTGGCCCC _B 1230 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 82 2211 86
CGGCAATTGCTT _A_Concatenation GAGATTGTGTCC _B 1231 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 82 2211 64
CGGCAATTGCTT _A_Concatenation CCCTATGTTCTA _B 1232 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 82 2211 15
CTGGTAGTTGTT _A_Concatenation GAGATTGTGTCC _B 1233 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 83 2211 64
CTGGTAGTTGTT _A_Concatenation ATGACCCCTTGT _B 1234 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 83 2211 17
CTGGTAGTTGTT _A_Concatenation ACTTGATGGTTT _B 1235 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 83 2211 77
CTGGTAGTTGTT _A_Concatenation AGAGTGCGCTCT _B 1236 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 83 2211 14
CTGGTAGTTGTT _A_Concatenation ATAACTCCACGC _B 1237 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 83 2211 44
CTGGTAGTTGTT _A_Concatenation CTACGTGGCCCC _B 1238 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 83 2211 86
CTGGTAGTTGTT _A_Concatenation CCCTATGTTCTA _B 1239 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKTGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 83 2211 15
AAAACAAGCATT _A_Concatenation CTACGTGGCCCC _B 1240 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 84 2211 86
AAAACAAGCATT _A_Concatenation ATGACCCCTTGT _B 1241 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 84 2211 17
AAAACAAGCATT _A_Concatenation ATAACTCCACGC _B 1242 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 84 2211 44
AAAACAAGCATT _A_Concatenation AGAGTGCGCTCT _B 1243 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 84 2211 14
AAAACAAGCATT _A_Concatenation ACTTGATGGTTT _B 1244 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 84 2211 77
AAAACAAGCATT _A_Concatenation GAGATTGTGTCC _B 1245 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 84 2211 64
AAAACAAGCATT _A_Concatenation CCCTATGTTCTA _B 1246 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 84 2211 15
CCTAAATGCAAG _A_Concatenation ATAACTCCACGC _B 1247 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 85 2211 44
CCTAAATGCAAG _A_Concatenation ACTTGATGGTTT _B 1248 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 85 2211 77
CCTAAATGCAAG _A_Concatenation AGAGTGCGCTCT _B 1249 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 85 2211 14
CCTAAATGCAAG _A_Concatenation ATGACCCCTTGT _B 1250 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 85 2211 17
CCTAAATGCAAG _A_Concatenation CTACGTGGCCCC _B 1251 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 85 2211 86
CCTAAATGCAAG _A_Concatenation GAGATTGTGTCC _B 1252 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 85 2211 64
CCTAAATGCAAG _A_Concatenation CCCTATGTTCTA _B 1253 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFALGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 85 2211 15
CTACGTGGCCCC _A_Concatenation GGAGGCACATCG _B 1254 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL 86 2211 76
CTACGTGGCCCC _A_Concatenation TATCTATCGTGT _B 1255 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS 86 2211 78
CTACGTGGCCCC _A_Concatenation CTCGTTATGGCA _B 1256 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF 86 2211 9
CTACGTGGCCCC _A_Concatenation ATCTCTCAGTTC _B 1257 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK 86 2211 94
CTACGTGGCCCC _A_Concatenation AGTTAGGCTCTT _B 1258 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGEFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV 86 2211 21
CTACGTGGCCCC _A_Concatenation TCTGTACCAACG _B 1259 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY 86 2211 25
CTACGTGGCCCC _A_Concatenation AGGCTTCCCGCT _B 1260 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV 86 2211 93
CTACGTGGCCCC _A_Concatenation TCTTGGTTTTGA _B 1261 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL 86 2211 11
CTACGTGGCCCC _A_Concatenation TGATGAGCGCCC _B 1262 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGQHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL 86 2211 68
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B 1263 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 86 2211 64
CTACGTGGCCCC _A_Concatenation CATTGTTTAAAT _B 1264 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV 86 2211 8
CTACGTGGCCCC _A_Concatenation ATGACAAAACGA _B 1265 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGHFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL 86 2211 40
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B 1266 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 86 2211 14
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B 1267 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 86 2211 77
CTACGTGGCCCC _A_Concatenation TCTTAGAGGGTG _B 1268 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGFGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL 86 2211 29
CTACGTGGCCCC _A_Concatenation AGCCATAAACCT _B 1269 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ 86 2211 61
CTACGTGGCCCC _A_Concatenation CCCCTTAGGAAT _B 1270 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD 86 2211 80
CTACGTGGCCCC _A_Concatenation CTGGTAGTTGTT _B 1271 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT 86 2211 83
CTACGTGGCCCC _A_Concatenation AGTTGCTTTGCT _B 1272 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM 86 2211 20
CTACGTGGCCCC _A_Concatenation GTTCCACGCCTG _B 1273 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS 86 2211 97
CTACGTGGCCCC _A_Concatenation GCGAGGCAGACC _B 1274 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGQWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD 86 2211 70
CTACGTGGCCCC _A_Concatenation GAGTGACCTATT _B 1275 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGHKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI 86 2211 41
CTACGTGGCCCC _A_Concatenation TAAGATTCAACG _B 1276 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGCFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP 86 2211 13
CTACGTGGCCCC _A_Concatenation AATCGATCCTAC _B 1277 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA 86 2211 96
CTACGTGGCCCC _A_Concatenation TCAGCCGTAGGC _B 1278 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR 86 2211 72
CTACGTGGCCCC _A_Concatenation CATTAGTCCCGC _B 1279 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF 86 2211 5
CTACGTGGCCCC _A_Concatenation linker 3_B 1280 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD 86 2211 36
CTACGTGGCCCC _A_Concatenation GAGACAGCTCTC _B 1281 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM 86 2211 48
CTACGTGGCCCC _A_Concatenation linker 2_B 1282 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA 86 2211 37
CTACGTGGCCCC _A_Concatenation linker 1_B 1283 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK 86 2211 35
CTACGTGGCCCC _A_Concatenation CTGCCGGTTGCG _B 1284 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL 86 2211 45
CTACGTGGCCCC _A_Concatenation AAGACCGGTGCC _B 1285 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF 86 2211 73
CTACGTGGCCCC _A_Concatenation CATCTGAGTCGT _B 1286 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS 86 2211 33
CTACGTGGCCCC _A_Concatenation GTACATACCGAA _B 1287 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGNENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE 86 2211 55
CTACGTGGCCCC _A_Concatenation ATTTGTAGACCG _B 1288 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE 86 2211 66
CTACGTGGCCCC _A_Concatenation Stop_1_B 1289 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSG***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK 86 2211 1
CTACGTGGCCCC _A_Concatenation CAGTGACATAGT _B 1290 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT 86 2211 3
CTACGTGGCCCC _A_Concatenation ATGTGGCACGAC _B 1291 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGQQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC 86 2211 69
CTACGTGGCCCC _A_Concatenation TGCCAGACGGTC _B 1292 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG 86 2211 75
CTACGTGGCCCC _A_Concatenation CGATGGCGGAAT _B 1293 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG 86 2211 19
CTACGTGGCCCC _A_Concatenation ACAACGTGCATA _B 1294 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET 86 2211 71
CTACGTGGCCCC _A_Concatenation GCAACGAGGGTC _B 1295 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGRGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ 86 2211 74
CTACGTGGCCCC _A_Concatenation AATGGCACTAGC _B 1296 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGEGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR 86 2211 22
CTACGTGGCCCC _A_Concatenation TTCTAATTTCCT _B 1297 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGIEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS 86 2211 42
CTACGTGGCCCC _A_Concatenation CTCGCACCGAGG _B 1298 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN 86 2211 6
CTACGTGGCCCC _A_Concatenation GCAGATTCGAAT _B 1299 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGKPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP 86 2211 43
CTACGTGGCCCC _A_Concatenation TGGCCCCCAGTT _B 1300 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN 86 2211 31
CTACGTGGCCCC _A_Concatenation ATTGTATCTAAC _B 1301 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD 86 2211 92
CTACGTGGCCCC _A_Concatenation TACATGAGGCAC _B 1302 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL 86 2211 65
CTACGTGGCCCC _A_Concatenation CGGCAATTGCTT _B 1303 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK 86 2211 82
CTACGTGGCCCC _A_Concatenation ACTCGTGTTGTC _B 1304 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT 86 2211 91
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B 1305 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 86 2211 17
CTACGTGGCCCC _A_Concatenation AAACTTTGAGCG _B 1306 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ 86 2211 7
CTACGTGGCCCC _A_Concatenation ACTTTAGTAACA _B 1307 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGNNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES 86 2211 57
CTACGTGGCCCC _A_Concatenation CCAGACGGTCTG _B 1308 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN 86 2211 59
CTACGTGGCCCC _A_Concatenation AGCCGTGCTTGA _B 1309 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY 86 2211 32
CTACGTGGCCCC _A_Concatenation TTTAACGGATTG _B 1310 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN 86 2211 81
CTACGTGGCCCC _A_Concatenation AATCGAGACCAG _B 1311 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGEPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH 86 2211 24
CTACGTGGCCCC _A_Concatenation CCGCCCTTATGT _B 1312 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH 86 2211 98
CTACGTGGCCCC _A_Concatenation GGCACAGCTCCA _B 1313 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGTTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG 86 2211 87
CTACGTGGCCCC _A_Concatenation AGCCAGGTCACC _B 1314 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK 86 2211 39
CTACGTGGCCCC _A_Concatenation GATCGCATGTTC _B 1315 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGNSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER 86 2211 58
CTACGTGGCCCC _A_Concatenation GATGGAGCTACA _B 1316 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGEMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF 86 2211 23
CTACGTGGCCCC _A_Concatenation AATCAACGAGCA _B 1317 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE 86 2211 79
CTACGTGGCCCC _A_Concatenation TTAAATGAGGGC _B 1318 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ 86 2211 46
CTACGTGGCCCC _A_Concatenation AAGCAACGATAT _B 1319 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL 86 2211 38
CTACGTGGCCCC _A_Concatenation GGTGTCTCGATT _B 1320 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV 86 2211 51
CTACGTGGCCCC _A_Concatenation TAAAATTTATCA _B 1321 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY 86 2211 49
CTACGTGGCCCC _A_Concatenation CTCAGCGATATA _B 1322 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGFATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG 86 2211 27
CTACGTGGCCCC _A_Concatenation EPICXV.47_B 1323 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 86 2211 34
CTACGTGGCCCC _A_Concatenation CACACCCGGCAG _B 1324 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGVTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS 86 2211 95
CTACGTGGCCCC _A_Concatenation CATGCTAACACC _B 1325 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGTTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL 86 2211 88
CTACGTGGCCCC _A_Concatenation ACCAGTATTTAA _B 1326 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGNCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL 86 2211 54
CTACGTGGCCCC _A_Concatenation CAACGCATGCCT _B 1327 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGYPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS 86 2211 100
CTACGTGGCCCC _A_Concatenation CATCTCACCTAG _B 1328 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA 86 2211 12
CTACGTGGCCCC _A_Concatenation AAGGGTAAATTG _B 1329 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGNKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS 86 2211 56
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B 1330 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 86 2211 44
CTACGTGGCCCC _A_Concatenation GGGAGTATGTCG _B 1331 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS 86 2211 47
CTACGTGGCCCC _A_Concatenation CTCACGACAAGA _B 1332 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP 86 2211 4
CTACGTGGCCCC _A_Concatenation TATCGCAAGAAC _B 1333 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV 86 2211 63
CTACGTGGCCCC _A_Concatenation CCCGTAGGGGCT _B 1334 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGAPFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS 86 2211 10
CTACGTGGCCCC _A_Concatenation CCCTATGTTCTA _B 1335 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 86 2211 15
CTACGTGGCCCC _A_Concatenation CCCTGCCTCGGG _B 1336 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGEVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY 86 2211 26
CTACGTGGCCCC _A_Concatenation TACTTCTAGCGG _B 1337 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGTWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ 86 2211 90
CTACGTGGCCCC _A_Concatenation GGCACTGTCGAG _B 1338 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGYALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA 86 2211 99
CTACGTGGCCCC _A_Concatenation CTCCATTAATGA _B 1339 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGFCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV 86 2211 28
CTACGTGGCCCC _A_Concatenation AAAACAAGCATT _B 1340 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSTFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS 86 2211 84
CTACGTGGCCCC _A_Concatenation TATTAAAAGTGC _B 1341 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGMHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI 86 2211 53
CTACGTGGCCCC _A_Concatenation CGCCGGCATACA _B 1342 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI 86 2211 16
CTACGTGGCCCC _A_Concatenation ACGACTCACCGC _B 1343 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK 86 2211 60
CTACGTGGCCCC _A_Concatenation ACGGCGTAAGTA _B 1344 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL 86 2211 52
CTACGTGGCCCC _A_Concatenation ATCCAATCTGTG _B 1345 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT 86 2211 67
CTACGTGGCCCC _A_Concatenation CCTAAATGCAAG _B 1346 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGSVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL 86 2211 85
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B 1347 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 86 2211 86
CTACGTGGCCCC _A_Concatenation GACTGTGGGCGG _B 1348 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGTWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL 86 2211 89
CTACGTGGCCCC _A_Concatenation GGCCTTTCGGGC _B 1349 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGGAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ 86 2211 30
CTACGTGGCCCC _A_Concatenation GTCACCCTCCTT _B 1350 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGDLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN 86 2211 18
CTACGTGGCCCC _A_Concatenation Stop_2_B 1351 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSG***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE 86 2211 2
CTACGTGGCCCC _A_Concatenation TCTATCACAACT _B 1352 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGLSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH 86 2211 50
CTACGTGGCCCC _A_Concatenation TTAGTGCCATAC _B 1353 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLGSGGSGGSGGSGPLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP 86 2211 62
GGCACAGCTCCA _A_Concatenation ATGACCCCTTGT _B 1354 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 87 2211 17
GGCACAGCTCCA _A_Concatenation ACTTGATGGTTT _B 1355 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 87 2211 77
GGCACAGCTCCA _A_Concatenation ATAACTCCACGC _B 1356 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 87 2211 44
GGCACAGCTCCA _A_Concatenation GAGATTGTGTCC _B 1357 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 87 2211 64
GGCACAGCTCCA _A_Concatenation CTACGTGGCCCC _B 1358 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 87 2211 86
GGCACAGCTCCA _A_Concatenation AGAGTGCGCTCT _B 1359 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 87 2211 14
GGCACAGCTCCA _A_Concatenation CCCTATGTTCTA _B 1360 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIGGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 87 2211 15
CATGCTAACACC _A_Concatenation ATAACTCCACGC _B 1361 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 88 2211 44
CATGCTAACACC _A_Concatenation GAGATTGTGTCC _B 1362 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 88 2211 64
CATGCTAACACC _A_Concatenation ATGACCCCTTGT _B 1363 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 88 2211 17
CATGCTAACACC _A_Concatenation ACTTGATGGTTT _B 1364 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 88 2211 77
CATGCTAACACC _A_Concatenation CTACGTGGCCCC _B 1365 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 88 2211 86
CATGCTAACACC _A_Concatenation AGAGTGCGCTCT _B 1366 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 88 2211 14
CATGCTAACACC _A_Concatenation CCCTATGTTCTA _B 1367 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 88 2211 15
GACTGTGGGCGG _A_Concatenation GAGATTGTGTCC _B 1368 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 89 2211 64
GACTGTGGGCGG _A_Concatenation ACTTGATGGTTT _B 1369 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 89 2211 77
GACTGTGGGCGG _A_Concatenation AGAGTGCGCTCT _B 1370 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 89 2211 14
GACTGTGGGCGG _A_Concatenation ATAACTCCACGC _B 1371 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 89 2211 44
GACTGTGGGCGG _A_Concatenation ATGACCCCTTGT _B 1372 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 89 2211 17
GACTGTGGGCGG _A_Concatenation CTACGTGGCCCC _B 1373 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 89 2211 86
GACTGTGGGCGG _A_Concatenation CCCTATGTTCTA _B 1374 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLLGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 89 2211 15
TACTTCTAGCGG _A_Concatenation ATGACCCCTTGT _B 1375 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 90 2211 17
TACTTCTAGCGG _A_Concatenation GAGATTGTGTCC _B 1376 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 90 2211 64
TACTTCTAGCGG _A_Concatenation ATAACTCCACGC _B 1377 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 90 2211 44
TACTTCTAGCGG _A_Concatenation CTACGTGGCCCC _B 1378 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 90 2211 86
TACTTCTAGCGG _A_Concatenation AGAGTGCGCTCT _B 1379 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 90 2211 14
TACTTCTAGCGG _A_Concatenation ACTTGATGGTTT _B 1380 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 90 2211 77
TACTTCTAGCGG _A_Concatenation CCCTATGTTCTA _B 1381 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 90 2211 15
ACTCGTGTTGTC _A_Concatenation AGAGTGCGCTCT _B 1382 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 91 2211 14
ACTCGTGTTGTC _A_Concatenation ATGACCCCTTGT _B 1383 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 91 2211 17
ACTCGTGTTGTC _A_Concatenation GAGATTGTGTCC _B 1384 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 91 2211 64
ACTCGTGTTGTC _A_Concatenation ACTTGATGGTTT _B 1385 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 91 2211 77
ACTCGTGTTGTC _A_Concatenation ATAACTCCACGC _B 1386 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 91 2211 44
ACTCGTGTTGTC _A_Concatenation CTACGTGGCCCC _B 1387 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 91 2211 86
ACTCGTGTTGTC _A_Concatenation CCCTATGTTCTA _B 1388 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNTGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 91 2211 15
ATTGTATCTAAC _A_Concatenation ATGACCCCTTGT _B 1389 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 92 2211 17
ATTGTATCTAAC _A_Concatenation ACTTGATGGTTT _B 1390 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 92 2211 77
ATTGTATCTAAC _A_Concatenation CTACGTGGCCCC _B 1391 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 92 2211 86
ATTGTATCTAAC _A_Concatenation GAGATTGTGTCC _B 1392 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 92 2211 64
ATTGTATCTAAC _A_Concatenation ATAACTCCACGC _B 1393 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 92 2211 44
ATTGTATCTAAC _A_Concatenation AGAGTGCGCTCT _B 1394 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 92 2211 14
ATTGTATCTAAC _A_Concatenation CCCTATGTTCTA _B 1395 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSDGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 92 2211 15
AGGCTTCCCGCT _A_Concatenation ATGACCCCTTGT _B 1396 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 93 2211 17
AGGCTTCCCGCT _A_Concatenation ATAACTCCACGC _B 1397 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 93 2211 44
AGGCTTCCCGCT _A_Concatenation ACTTGATGGTTT _B 1398 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 93 2211 77
AGGCTTCCCGCT _A_Concatenation AGAGTGCGCTCT _B 1399 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 93 2211 14
AGGCTTCCCGCT _A_Concatenation CTACGTGGCCCC _B 1400 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 93 2211 86
AGGCTTCCCGCT _A_Concatenation GAGATTGTGTCC _B 1401 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 93 2211 64
AGGCTTCCCGCT _A_Concatenation CCCTATGTTCTA _B 1402 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIVGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 93 2211 15
ATCTCTCAGTTC _A_Concatenation GAGATTGTGTCC _B 1403 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 94 2211 64
ATCTCTCAGTTC _A_Concatenation ATGACCCCTTGT _B 1404 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 94 2211 17
ATCTCTCAGTTC _A_Concatenation AGAGTGCGCTCT _B 1405 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 94 2211 14
ATCTCTCAGTTC _A_Concatenation ATAACTCCACGC _B 1406 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 94 2211 44
ATCTCTCAGTTC _A_Concatenation ACTTGATGGTTT _B 1407 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 94 2211 77
ATCTCTCAGTTC _A_Concatenation CTACGTGGCCCC _B 1408 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 94 2211 86
ATCTCTCAGTTC _A_Concatenation CCCTATGTTCTA _B 1409 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLKGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 94 2211 15
CACACCCGGCAG _A_Concatenation GAGATTGTGTCC _B 1410 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 95 2211 64
CACACCCGGCAG _A_Concatenation CTACGTGGCCCC _B 1411 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 95 2211 86
CACACCCGGCAG _A_Concatenation ATGACCCCTTGT _B 1412 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 95 2211 17
CACACCCGGCAG _A_Concatenation ACTTGATGGTTT _B 1413 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 95 2211 77
CACACCCGGCAG _A_Concatenation ATAACTCCACGC _B 1414 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 95 2211 44
CACACCCGGCAG _A_Concatenation AGAGTGCGCTCT _B 1415 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 95 2211 14
CACACCCGGCAG _A_Concatenation CCCTATGTTCTA _B 1416 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 95 2211 15
AATCGATCCTAC _A_Concatenation ACTTGATGGTTT _B 1417 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 96 2211 77
AATCGATCCTAC _A_Concatenation CTACGTGGCCCC _B 1418 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 96 2211 86
AATCGATCCTAC _A_Concatenation ATGACCCCTTGT _B 1419 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 96 2211 17
AATCGATCCTAC _A_Concatenation ATAACTCCACGC _B 1420 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 96 2211 44
AATCGATCCTAC _A_Concatenation GAGATTGTGTCC _B 1421 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 96 2211 64
AATCGATCCTAC _A_Concatenation AGAGTGCGCTCT _B 1422 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 96 2211 14
AATCGATCCTAC _A_Concatenation CCCTATGTTCTA _B 1423 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHAGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 96 2211 15
GTTCCACGCCTG _A_Concatenation ACTTGATGGTTT _B 1424 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 97 2211 77
GTTCCACGCCTG _A_Concatenation GAGATTGTGTCC _B 1425 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 97 2211 64
GTTCCACGCCTG _A_Concatenation ATAACTCCACGC _B 1426 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 97 2211 44
GTTCCACGCCTG _A_Concatenation ATGACCCCTTGT _B 1427 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 97 2211 17
GTTCCACGCCTG _A_Concatenation CTACGTGGCCCC _B 1428 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 97 2211 86
GTTCCACGCCTG _A_Concatenation AGAGTGCGCTCT _B 1429 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 97 2211 14
GTTCCACGCCTG _A_Concatenation CCCTATGTTCTA _B 1430 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 97 2211 15
CCGCCCTTATGT _A_Concatenation ACTTGATGGTTT _B 1431 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 98 2211 77
CCGCCCTTATGT _A_Concatenation AGAGTGCGCTCT _B 1432 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 98 2211 14
CCGCCCTTATGT _A_Concatenation ATGACCCCTTGT _B 1433 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 98 2211 17
CCGCCCTTATGT _A_Concatenation CTACGTGGCCCC _B 1434 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 98 2211 86
CCGCCCTTATGT _A_Concatenation GAGATTGTGTCC _B 1435 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 98 2211 64
CCGCCCTTATGT _A_Concatenation ATAACTCCACGC _B 1436 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 98 2211 44
CCGCCCTTATGT _A_Concatenation CCCTATGTTCTA _B 1437 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLHGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 98 2211 15
GGCACTGTCGAG _A_Concatenation AGAGTGCGCTCT _B 1438 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 99 2211 14
GGCACTGTCGAG _A_Concatenation CTACGTGGCCCC _B 1439 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 99 2211 86
GGCACTGTCGAG _A_Concatenation ATAACTCCACGC _B 1440 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 99 2211 44
GGCACTGTCGAG _A_Concatenation GAGATTGTGTCC _B 1441 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 99 2211 64
GGCACTGTCGAG _A_Concatenation ATGACCCCTTGT _B 1442 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 99 2211 17
GGCACTGTCGAG _A_Concatenation ACTTGATGGTTT _B 1443 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 99 2211 77
GGCACTGTCGAG _A_Concatenation CCCTATGTTCTA _B 1444 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDAGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 99 2211 15
CAACGCATGCCT _A_Concatenation ATGACCCCTTGT _B 1445 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGDLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV 100 2211 17
CAACGCATGCCT _A_Concatenation CTACGTGGCCCC _B 1446 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGTPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL 100 2211 86
CAACGCATGCCT _A_Concatenation GAGATTGTGTCC _B 1447 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP 100 2211 64
CAACGCATGCCT _A_Concatenation AGAGTGCGCTCT _B 1448 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGCPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP 100 2211 14
CAACGCATGCCT _A_Concatenation ATAACTCCACGC _B 1449 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML 100 2211 44
CAACGCATGCCT _A_Concatenation ACTTGATGGTTT _B 1450 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGRPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR 100 2211 77
CAACGCATGCCT _A_Concatenation CCCTATGTTCTA _B 1451 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLSGSGGSGGSGGSGDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL 100 2211 15
The peptide amino acid sequence comprising the combination of "x" shows a stop codon at that position that prevents translation of the remainder of the listed sequence encoded by the nucleic acid.
Heterologous endonucleases
In some embodiments, the engineered gene effector of the present disclosure comprises a polypeptide coupled to a heterologous endonuclease (e.g., an enzymatically active Cas protein, an enzymatically inactive Cas protein, etc.). In some embodiments, the engineered gene effectors disclosed herein, or proteins comprising engineered gene effectors (e.g., proteins comprising engineered gene effectors coupled to heterologous endonucleases), may be referred to as effector moieties. The engineered gene effector and the heterologous endonuclease may be coupled to each other, e.g., directly or indirectly (e.g., via a linker). For example, the engineered gene effector and the heterologous endonuclease may be fused to each other, e.g., directly or indirectly (e.g., via a linker). In another example, the engineered gene effector and the heterologous endonuclease may be non-covalently coupled to each other, e.g., by ionic bonding, hydrogen bonding, oligomerization or dimerization domain-mediated interactions, and the like. In some cases, the engineered gene effector and the heterologous endonuclease may be part of a single polypeptide molecule (e.g., a chimeric or fusion polypeptide).
In a variety of organisms, including a variety of mammals, animals, plants, microorganisms, and yeasts, the CRISPR/Cas system (e.g., modified and/or unmodified) can be used as a genome engineering tool, or can be modified to direct the specific binding of an engineered protein to a target locus disclosed herein. The CRISPR/Cas system may comprise a guide nucleic acid (e.g., guide RNA (gRNA)) complexed with a Cas protein for targeted regulation of gene expression and/or activity or nucleic acid binding. RNA-guided Cas proteins (e.g., cas nucleases, such as Cas9 nucleases) can specifically bind target polynucleotides (e.g., DNA) in a sequence-dependent manner. Cas proteins, if having nuclease activity, can cleave DNA.
Non-limiting examples of heterologous endonucleases disclosed and used herein include, but are not limited to, CRISPR-associated (Cas) proteins or Cas nucleases, including type I CRISPR-associated (Cas) polypeptides, type II CRISPR-associated (Cas) polypeptides, type III CRISPR-associated (Cas) polypeptides, type IV CRISPR-associated (Cas) polypeptides, type V CRISPR-associated (Cas) polypeptides, and type VI CRISPR-associated (Cas) polypeptides, zinc Finger Nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), meganucleases (meganucleases), RNA Binding Proteins (RBPs), CRISPR-associated RNA binding proteins, recombinases, invertases, argonaute (Ago) proteins (e.g., prokaryotic Argonaute (pAgo), archaebacterium Argonaute (aAgo), or eukaryotic Argonaute (eAgo)), or any derivatives thereof, any variants thereof, and any fragments thereof.
In some embodiments, the heterologous endonucleases disclosed herein can be nuclease deficient. In some embodiments, the heterologous endonuclease may be a nuclease-deleted (nucleic-null) DNA binding protein that does not induce transcriptional activation or repression of the target DNA sequence unless it is present in the complex with one or more engineered gene effectors of the present disclosure. In some embodiments, the heterologous endonuclease may be a nuclease-deleted DNA-binding protein that can induce transcriptional activation or repression of the target DNA sequence (e.g., which can be altered or enhanced by the presence of an engineered gene effector provided herein). In some cases, the Cas protein is mutated and/or modified to produce a nuclease-deficient protein or a protein having reduced nuclease activity relative to a wild-type Cas protein. Nuclease-deficient proteins may retain the ability to bind DNA, but may lack or have reduced nucleic acid cleavage activity.
In some embodiments, the heterologous endonucleases disclosed herein can be RNA nucleases, such as engineered (e.g., programmable or targetable) RNA nucleases. In some embodiments, the heterologous endonuclease disclosed herein can be a nuclease-deleted RNA-binding protein that does not induce transcriptional activation or repression of the target RNA sequence unless it is present in a complex with one or more engineered gene effectors of the present disclosure. In some embodiments, the heterologous endonucleases disclosed herein can be nuclease deleted RNA binding proteins that can induce transcriptional activation or repression of a target RNA sequence (e.g., which can be altered or enhanced by the presence of an engineered gene effector provided herein).
In some embodiments, the heterologous endonuclease may be a nucleic acid-guided targeting system. In some embodiments, the heterologous endonuclease may be a DNA-guided targeting system. In some embodiments, the heterologous endonuclease may be an RNA-guided targeting system. The nucleic acid-guided targeting system can comprise and utilize, for example, a guide nucleic acid sequence that facilitates specific binding of the CRISPR-Cas system (e.g., a nuclease-deficient form thereof, such as dCas9 or dCas 14) to a target gene (e.g., a target endogenous gene) or target gene regulatory sequence. For example, the target gene may be any of the genes listed in table 6, and the target gene regulatory sequence may be operably coupled to any of the genes listed in table 6. Binding specificity may be determined by using a guide nucleic acid (e.g., single guide RNA (sgRNA) or a portion thereof). In some embodiments, the use of different sgrnas allows the compositions, combinations, systems, and methods of the present disclosure to be used with (e.g., targeted to) different target genes (e.g., target endogenous genes) or target gene regulatory sequences.
In some embodiments, prokaryotic CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR related) systems, such as class II CRISPR-Cas systems (e.g., cas9 and Cpfl) can be reused as tools in the compositions, combinations, systems, and methods of the present disclosure to regulate gene expression, epigenomic editing, and chromatin cyclization. In some embodiments, nuclease-inactivated Cas (dCas) protein complexed with a heterologous gene effector can allow for modulation of expression of a target gene (e.g., a target endogenous gene) adjacent to the dCas binding site.
Any suitable CRISPR/Cas system may be used. CRISPR/Cas systems can be referred to using a variety of naming schemes. The CRISPR/Cas system can be a type I, type II, type III, type IV, type V, type VI system, or any other suitable CRISPR/Cas system. As used herein, a CRISPR/Cas system can be a class 1, a class 2, or any other suitable classification of CRISPR/Cas system. The determination of class 1 or class 2 may be based on the gene encoding the effector module. Class 1 systems typically have a multi-subunit crRNA effector complex, while class 2 systems typically have a single protein (e.g., cas9, cpfl, C2C1, C2, C2C 3) or crRNA effector complex. Class 1 CRISPR/Cas systems can use complexes of multiple Cas proteins to achieve regulation. Class 1 CRISPR/Cas systems may comprise, for example, type I (e.g., I, IA, IB, IC, ID, IE, IF or IU), type III (e.g., III, IIIA, IIIB, IIIC or IIID), and type IV (e.g., IV, IVA or IVB) CRISPR/Cas types. Class 2 CRISPR/Cas systems can use a single large Cas protein to achieve regulation. Class 2 CRISPR/Cas systems may comprise, for example, type II (e.g., II, IIA, or IIB) and type V CRISPR/Cas types. CRISPR systems can be complementary to each other and/or can provide trans-functional units to facilitate CRISPR locus targeting.
When the heterologous endonuclease comprises a Cas protein or a derivative thereof, the Cas protein or derivative thereof may be a class 1 or class 2 Cas protein. The Cas protein may be a type I, type II, type III, type IV, type V Cas protein or a type VI Cas protein. The Cas protein may comprise one or more domains. Non-limiting examples of domains include guide recognition and/or binding domains, nuclease domains (e.g., DNase or RNase domains, ruvC or HNH), DNA binding domains, RNA binding domains, helicase domains, protein-protein interaction domains, or dimerization domains. The guide recognition and/or binding domains may interact with the guide. The nuclease domain may comprise catalytic activity for nucleic acid cleavage. The nuclease domain may lack catalytic activity to prevent nucleic acid cleavage. The Cas protein may be a chimeric Cas protein or a fragment thereof fused to other proteins or polypeptides. The Cas protein may be a chimera of multiple Cas proteins, e.g., comprising domains from different Cas proteins.
Non-limiting examples of Cas proteins include c2c1、C2c2、c2c3、Cas1、Cas1B、Cas2、Cas3、Cas4、Cas5、Cas5e(CasD)、Cash、Cas6e、Cas6f、Cas7、Cas8a、Cas8a1、Cas8a2、Cas8b、Cas8c、Cas9(Csn1 or Csx12)、Cas10、Cas10d、CasF、CasG、CasH、Cpfl、Csyl、Csy2、Csy3、Cse1(CasA)、Cse2(CasB)、Cse3(CasE)、Cse4(CasC)、Csc1、Csc2、Csa5、Csn2、Csm2、Csm3、Csm4、Csm5、Csm6、Cmrl、Cmr3、Cmr4、Cmr5、Cmr6、Csb1、Csb2、Csb3、Csx17、Csx14、Csx10、Csx16、CsaX、Csx3、Csx1、Csx15、Csf1、Csf2、Csf3、Csf4、Cul966、Cas13a、Cas13b、Cas13c、Cas13d、Cas13X or Cas13Y, as well as homologs or modified versions thereof.
In some embodiments, the Cas protein disclosed herein may not be and need not be Cas9 or Cas12a. The Cas proteins disclosed herein may have smaller dimensions compared to Cas9 or Cas12a. The Cas proteins disclosed herein may be derived from Un1Cas12f1. In some embodiments, a heterologous endonuclease may comprise an amino acid sequence that has at least or at most about 50%, at least or at most about 55%, at least or at most about 60%, at least or at most about 65%, at least or at most about 70%, at least or at most about 75%, at least or at most about 80%, at least or at most about 85%, at least or at most about 90%, at least or at most about 91%, at least or at most about 92%, at least or at most about 93%, at least or at most about 94%, at least or at most about 95%, at least or at most about 96%, at least or at most about 97%, at least or at most about 98%, at least or at least about 99% or about 100% sequence identity to the polypeptide sequence of SEQ ID NO 2222 (e.g., casMini). In some embodiments, a heterologous endonuclease may comprise an amino acid sequence that has at least or at most about 50%, at least or at most about 55%, at least or at most about 60%, at least or at most about 65%, at least or at most about 70%, at least or at most about 75%, at least or at most about 80%, at least or at most about 85%, at least or at most about 90%, at least or at most about 91%, at least or at most about 92%, at least or at most about 93%, at least or at most about 94%, at least or at most about 95%, at least or at most about 96%, at least or at most about 97%, at least or at most about 98%, at least or at least about 99% or about 100% sequence identity to the polypeptide sequence (e.g., dCasMini) of SEQ ID NO 2231. As disclosed herein, SEQ ID No. 2222 encodes the polypeptide sequence of Un1Cas12f1. As disclosed herein, SEQ ID No. 2231 encodes an engineered variant of Un1Cas12f1 with reduced nuclease activity. As disclosed herein, SEQ ID No. 2232 encodes a non-limiting example of a Cas12f variant suitable for use in the systems, compositions, combinations, and methods of the present disclosure. In some embodiments, cas12f variants disclosed herein may comprise an amino acid sequence that is at least or at least about 50%, at least or at least about 60%, at least or at least about 70%, at least or at least about 75%, at least or at least about 80%, at least or at least about 85%, at least or at least about 90%, at least or at least about 95%, at least or at least about 96%, at least or at least about 97%, at least or at least about 98%, at least or at least about 99%, or substantially about 100% identical to the polypeptide sequence of SEQ ID No. 2232.
SEQ ID NO: 2222(Un1Cas12f1)
1 MAKNTITKTL KLRIVRPYNS AEVEKIVADE KNNREKIALE KNKDKVKEAC
51 SKHLKVAAYC TTQVERNACL FCKARKLDDK FYQKLRGQFP DAVFWQEISE
101 IFRQLQKQAA EIYNQSLIEL YYEIFIKGKG IANASSVEHY LSDVCYTRAA
151 ELFKNAAIAS GLRSKIKSNF RLKELKNMKS GLPTTKSDNF PIPLVKQKGG
201 QYTGFEISNH NSDFIIKIPF GRWQVKKEID KYRPWEKFDF EQVQKSPKPI
251 SLLLSTQRRK RNKGWSKDEG TEAEIKKVMN GDYQTSYIEV KRGSKIGEKS
301 AWMLNLSIDV PKIDKGVDPS IIGGIDVGVK SPLVCAINNA FSRYSISDND
351 LFHFNKKMFA RRRILLKKNR HKRAGHGAKN KLKPITILTE KSERFRKKLI
401 ERWACEIADF FIKNKVGTVQ MENLESMKRK EDSYFNIRLR GFWPYAEMQN
451 KIEFKLKQYG IEIRKVAPNN TSKTCSKCGH LNNYFNFEYR KKNKFPHFKC
501 EKCNFKENAD YNAALNISNP KLKSTKEEP
2231 (Inactivated nuclease variant of Un1Cas12f 1)
1 MAKNTITKTL KLRIVRPYNS AEVEKIVADE KNNREKIALE KNKDKVKEAC
51 SKHLKVAAYC TTQVERNACL FCKARKLDDK FYQKLRGQFP DAVFWQEISE
101 IFRQLQKQAA EIYNQSLIEL YYEIFIKGKG IANASSVEHY LSRVCYRRAA
151 ELFKNAAIAS GLRSKIKSNF RLKELKNMKS GLPTTKSDNF PIPLVKQKGG
201 QYTGFEISNH NSDFIIKIPF GRWQVKKEID KYRPWEKFDF EQVQKSPKPI
251 SLLLSTQRRK RNKGWSKDEG TEAEIKKVMN GDYQTSYIEV KRGSKICEKS
301 AWMLNLSIDV PKIDKGVDPS IIGGIAVGVR SPLVCAINNA FSRYSISDND
351 LFHFNKKMFA RRRILLKKNR HKRAGHGAKN KLKPITILTE KSERFRKKLI
401 ERWACEIADF FIKNKVGTVQ MENLESMKRK EDSYFNIRLR GFWPYAEMQN
451 KIEFKLKQYG IEIRKVAPNN TSKTCSKCGH LNNYFNFEYR KKNKFPHFKC
501 EKCNFKENAA YNAALNISNP KLKSTKERP
SEQ ID NO. 2232 (Cas 12f variant)
MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIAGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
In some embodiments, the amino acid sequences of the heterologous endonucleases disclosed herein can be mutated and/or modified to produce nuclease-deficient proteins or proteins having reduced nuclease activity relative to wild-type Cas proteins. Nuclease-deficient proteins may retain the ability to bind to a target gene (e.g., DNA), but may lack or have reduced nucleic acid cleavage activity. In some embodiments, the heterologous endonuclease can exhibit reduced nuclease activity (e.g., nuclease-deficient or nuclease-deleted) compared to wild-type Un1Cas12f 1. The reduced nuclease activity can be at most about 95%, at most about 90%, at most about 80%, at most about 70%, at most about 60%, at most about 50%, at most about 40%, at most about 30%, at most about 20%, at most about 10%, at most about 5%, at most about 1%, at most about 0.5%, at most about 0.1% or less of wild-type Un1Cas12f 1.
In some cases, the Cas protein provided herein may not be a Cas14 protein.
The Cas protein or fragment or derivative thereof may be from any suitable organism. Non-limiting examples include Streptococcus pyogenes (Streptococcus pyogenes), streptococcus thermophilus (Streptococcus thermophilus), streptococcus sp, staphylococcus aureus (Staphylococcus aureus), north Amycolatopsis darunae (Nocardiopsis dassonvillei), streptomyces roseoflotates (Streptomyces PRISTINAE SPIRALIS), streptococcus sp, Streptomyces viridochromogenes (Streptomyces viridochromo genes), streptomyces viridochromogenes (Streptomyces viridochromogenes), streptomyces roseoflash (Streptosporangium roseum), streptomyces roseoflash (Streptosporangium roseum), alicyclobacHlus acidocaldarius, bacillus pseudomycoides (Bacillus pseudomycoides), Bacillus selenitireducens, exiguobacterium sibiricum, lactobacillus delbrueckii, lactobacillus salivarius 、Microscilla marina、Burkholderiales bacterium、Polaromonas nap hthalenivorans、Polaromonas sp.、Crocosphaera watsonii、Cyanothece sp.、 Microcystis aeruginosa (Microcystis aeruginosa), pseudomonas aeruginosa (Pseudomonas aeruginosa)、Synechococcus sp.、Acetohalobium arabaticum、Ammonifex degensii、Caldicelulosiruptor becscii、Candidatus Desulforudis、 Clostridium botulinum (Clostridium botulinum), Clostridium difficile (Clostridium difficile)、Finegoldia magna、Natranaerobius thermophilus、Pelotomaculum thermopropionicum、 Acidophilic thiobacillus caldus (Acidithiobacillus caldus), acidothiobacillus ferrooxidans (Acidithiobacillus ferrooxidans)、Allochromatium vinosum、Marinobacter sp.、Nitrosococcus halophilus、Nitrosococcus watsoni、 Acidovorax facilis (Pseudoalteromonas haloplanktis), ktedonobacter racemifer, methanohalobium evestigatum, Anabaena variabilis (Anabaena variabilis), nodularia spumigena, nostoc (Nostoc sp.), arthrospira maxima (Arthrospira maxima), arthrospira platensis (Arthrospira platensis), arthrospira (Arthrospira sp.), lyngbya sp., microcystis prototheca (Microcoleus chthonoplastes), oscillatoria (oscilloria sp.), fusarium sp, Petrotoga mobilis, african Thermomyces (Thermosipho africanus), acaryochloris marina, leptotrichia shahii or FRANCISELLA NOVICIDA. In some aspects, the organism is streptococcus pyogenes (s. pyogenes). In some aspects, the organism is staphylococcus aureus (s. In some aspects, the organism is streptococcus thermophilus (s.
Cas proteins may be derived from a variety of bacterial species including, but not limited to Veillonella atypical, fusobacterium nucleatum (Fusobacterium nucleatum), filifactor alocis, solobacterium moorei, coprococcus catus, treponema pallidum (Treponema denticola), peptoniphilus duerdenii, catenibacterium mitsuokai, Streptococcus mutans (Streptococcus mutans), listeria innocens (Listeria innocua)、Staphylococcus pseudintermedius、Acidaminococcus intestine、Olsenella uli、Oenococcus kitaharae、 bifidobacterium bifidum (Bifidobacterium bifidum), lactobacillus rhamnosus (Lactobacillus rhamnosus), lactobacillus gasseri (Lactobacillus gasseri), finegoldia magna, mycoplasma mobile, Mycoplasma gallisepticum (Mycoplasma gallisepticum), mycoplasma ovipneumoniae (Mycoplasma ovipneumoniae), mycoplasma canis (Mycoplasma canis), mycoplasma synovium (Mycoplasma synoviae), mycobacterium rectum (Eubacterium rectale), streptococcus thermophilus (Streptococcus thermophilus), eubacterium dolichum, lactobacillus curvatus subspecies (Lactobacillus coryniformis subsp. Torquens), Ilyobacter polytropus, ruminococcus albus (Ruminococcus albus), acremonium muciniphilum (AKKERMANSIA MUCINIPHILA), acidothermus cellulolyticus, bifidobacterium longum (Bifidobacterium longum), bifidobacterium dentium, bacteroides fragilis (Bacteroides fragilis) of diphtheria bacillus (Corynebacterium diphtheria)、Elusimicrobium minutum、Nitratifractorsalsuginis、Sphaerochaeta globus、Fibrobacter succinogenes subsp. Succinogenes、, Carbon dioxide philic bacteria (Capnocytophaga ochracea), rhodopseudomonas palustris (Rhodopseudomonas palustris), prevotella micans, prevotella ruminicola, flavobacterium columniformis (Flavobacterium columnare), aminomonas paucivorans, rhodospirillum (Azospiralum) (Rhodospirillum rubrum)、Candidatus Puniceispirillum marinum、Verminephrobacter eiseniae、Ralstonia syzygii、Dinoroseobacter shibae、, azospiralum, Nitrobacter hamburgensis, rhizobium (Bradyrhizobium), wolinella succinogenes, campylobacter jejuni subspecies jejuni (Campylobacter jejuni subsp. Jejuni), helicobacter ferret (Helicobacter mustelae), bacillus cereus (Bacillus cereus), acidovorax ebreus, clostridium aerogenes (Clostridium perfringens), parvibaculum lavamentivorans, roseburia intestinalis, neisseria meningitidis (NEISSERIA MENINGITIDIS), pasteurella spinosad (Pasteurella multocida subsp. Multocida), sutterella wadsworthensis, proteobacterium, legionella pneumophila (Legionella pneumophila), Parasutterella excrementihominis, wolinella succinogenes or FRANCISELLA NOVICIDA.
Cas proteins used herein may be wild-type or modified forms of Cas proteins. The Cas protein may be an active variant, inactive variant or fragment of a wild-type or modified Cas protein. The Cas protein may comprise amino acid changes, such as deletions, insertions, substitutions, variants, mutations, fusions, chimeras, or any combination thereof, relative to the wild-type version of the Cas protein. The Cas protein may be a polypeptide having at least or at least about 5%, at least or at least about 10%, at least or at least about 20%, at least or at least about 30%, at least or at least about 40%, at least or at least about 50%, at least or at least about 60%, at least or at least about 70%, at least or at least about 80%, at least or at least about 90%, at least or at least about 91%, at least or at least about 92%, at least or at least about 93%, at least or at least about 94%, at least or at least about 95%, at least or at least about 96%, at least or at least about 97%, at least or at least about 98%, at least or at least about 99%, or 100% sequence identity or sequence similarity to the wild-type Cas protein. Cas proteins may be polypeptides having at most or at most about 5%, at most or at most about 10%, at most or at most about 20%, at most or at most about 30%, at most or at most about 40%, at most or at most about 50%, at most or at most about 60%, at most or at most about 70%, at most or at most about 80%, at most or at most about 90%, or at most about 100% sequence identity and/or sequence similarity to wild-type exemplary Cas proteins. The variant or fragment may comprise at least or at least about 5%, at least or at least about 10%, at least or at least about 20%, at least or at least about 30%, at least or at least about 40%, at least or at least about 50%, at least or at least about 60%, at least or at least about 70%, at least or at least about 80%, at least or at least about 90%, at least or at least about 91%, at least or at least about 92%, at least or at least about 93%, at least or at least about 94%, at least or at least about 95%, at least or at least about 96%, at least or at least about 97%, at least or at least about 98%, at least or at least about 99%, or 100% sequence identity or sequence similarity to the wild-type or modified Cas protein or a portion thereof. The variant or fragment may target a nucleic acid locus that is complexed with a guide nucleic acid, while lacking nucleic acid cleavage activity.
The Cas protein may comprise one or more nuclease domains, such as DNase domains. For example, the Cas9 protein may comprise a RuvC-like nuclease domain and/or an HNH-like 20 nuclease domain. In the nuclease-active form of Cas9, the RuvC domain and HNH domain can each cleave a different strand of double-stranded DNA, thereby creating a double-strand break in the DNA. Cas proteins may contain only one nuclease domain (e.g., cpfl contains a RuvC domain, but lacks an HNH domain). In some embodiments, the nuclease domain is absent. In some embodiments, the nuclease domain is present but inactive, or has reduced or minimal activity. In some embodiments, the nuclease domain is present and active.
One or more nuclease domains (e.g., ruvC or HNH) of the Cas protein may be deleted or mutated such that they are no longer functional or comprise reduced nuclease activity. For example, in Cas proteins (e.g., cas 9) comprising at least two nuclease domains, if one of the nuclease domains is deleted or mutated, the resulting Cas protein (referred to as a nickase) may produce a single-strand break, but not a double-strand break, at the CRISPR RNA (crRNA) recognition sequence within the double-stranded DNA. Such nicking enzymes may cleave either the complementary strand or the non-complementary strand, but may not cleave both simultaneously. If all nuclease domains of the Cas protein (e.g., both RuvC and HNH nuclease domains in Cas9 protein; ruvC nuclease domain in Cpfl protein) are deleted or mutated, the resulting Cas protein may have reduced or no ability to cleave both strands of double-stranded DNA. Examples of mutations that can convert Cas9 proteins to nickases are D10A (aspartic acid to alanine at position 10 of Cas 9) mutations in the RuvC domain of Cas9 from streptococcus pyogenes. Either H939A (histidine at amino acid 839 to alanine) or H840A (histidine at amino acid 840 to alanine) in the HNH domain of Cas9 from streptococcus pyogenes can convert Cas9 to a nickase. Examples of mutations that can convert Cas9 protein to dead Cas9 are the D10A (aspartic acid to alanine at position 10 of Cas 9) mutation in the RuvC domain of Cas9 from streptococcus pyogenes or the H939A (histidine to alanine at amino acid 839) or the H840A (histidine to alanine at amino acid 840) mutation in the HNH domain.
The nuclease-dead Cas protein (e.g., a protein derived from any Cas protein (e.g., un1Cas12f 1)) may comprise one or more mutations relative to a wild-type version of the protein. Mutations can cause no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid cleavage activity of one or more of the plurality of nucleic acid cleavage domains of the wild-type Cas protein. Mutations can cause one or more of the plurality of nucleic acid cleavage domains to retain the ability to cleave a complementary strand of a target nucleic acid, but reduce its ability to cleave a non-complementary strand of the target nucleic acid. Mutations can cause one or more of the plurality of nucleic acid cleavage domains to retain the ability to cleave a non-complementary strand of a target nucleic acid, but reduce its ability to cleave a complementary strand of the target nucleic acid. Mutations can cause one or more of the plurality of nucleic acid cleavage domains to lack the ability to cleave both the complementary strand and the non-complementary strand of the target nucleic acid. Residues to be mutated in the nuclease domain may correspond to one or more catalytic residues of the nuclease. For example, residues in wild-type exemplary streptococcus pyogenes Cas9 polypeptides, such as Asp10, his840, asn854, and Asn856, can be mutated to inactivate one or more of the plurality of nucleic acid cleavage domains (e.g., nuclease domains). Residues to be mutated in the Cas protein nuclease domain may correspond to Asp10, his840, asn854, and Asn856 residues in the wild-type streptococcus pyogenes Cas9 polypeptide, e.g., as determined by sequence and/or structural alignment.
As non-limiting examples, residues D10, G12, G17, E762, H840, N854, N863, H982, H983, a984, D986, and/or a987 (or corresponding mutations of any Cas protein) can be mutated. Such as D10A, G12A, G17A, E762A, H840A, N854A, N863A, H982A, H983A, A984A and/or D986A. Mutations other than alanine substitutions may be suitable.
The D10A mutation can be combined with one or more of the H840A, N854A or N856A mutations to produce a Cas9 protein (e.g., a death Cas9 protein) that substantially lacks DNA cleavage activity. The H840A mutation may be combined with one or more of the D10A, N854A or N856A mutations to produce a site-directed polypeptide that substantially lacks DNA cleavage activity. The N854A mutation may be combined with one or more of the H840A, D a or N856A mutations to produce a site-directed polypeptide that substantially lacks DNA cleavage activity. The N856A mutation may be combined with one or more of the H840A, N854A or D10A mutations to produce a site-directed polypeptide that substantially lacks DNA cleavage activity.
In some embodiments, the Cas protein is a class 2 Cas protein. In some embodiments, the Cas protein is a type II Cas protein. In some embodiments, the Cas protein is a Cas9 protein, a modified version of a Cas9 protein, or is derived from a Cas9 protein. For example, cas9 proteins lacking cleavage activity. In some embodiments, the Cas9 protein is a Cas9 protein from streptococcus pyogenes (e.g., swissProt accession Q99ZW 2). In some embodiments, the Cas9 protein is Cas9 from staphylococcus aureus (e.g., swissProt accession number J7RUA 5). In some embodiments, the Cas9 protein is a modified version of a Cas9 protein from streptococcus pyogenes or staphylococcus aureus. In some embodiments, the Cas9 protein is derived from a Cas9 protein from streptococcus pyogenes or staphylococcus aureus. For example, a streptococcus pyogenes or staphylococcus aureus Cas9 protein that lacks cleavage activity.
In some embodiments, cas9 may generally refer to a polypeptide having at least or at least about 5%, at least or at least about 10%, at least or at least about 20%, at least or at least about 30%, at least or at least about 40%, at least or at least about 50%, at least or at least about 60%, at least or at least about 70%, at least or at least about 80%, at least or at least about 90%, or about 100% sequence identity and/or sequence similarity to a wild-type exemplary Cas9 polypeptide (e.g., cas9 from streptococcus pyogenes). In some embodiments, cas9 may refer to a polypeptide having up to about 5%, up to about 10%, up to about 20%, up to about 30%, up to about 40%, up to about 50%, up to about 60%, up to about 70%, up to about 80%, up to about 90%, or about 100% sequence identity and/or sequence similarity with a wild-type Cas9 polypeptide (e.g., from streptococcus pyogenes). Cas9 may refer to a wild-type or modified form of Cas9 protein, which may comprise amino acid changes, such as deletions, insertions, substitutions, variants, mutations, fusions, chimeras, or any combination thereof.
The Cas protein may comprise an amino acid sequence having at least or at least about 5%, at least or at least about 10%, at least or at least about 20%, at least or at least about 30%, at least or at least about 40%, at least or at least about 50%, at least or at least about 60%, at least or at least about 70%, at least or at least about 80%, at least or at least about 90%, at least or at least about 91%, at least or at least about 92%, at least or at least about 93%, at least or at least about 94%, at least or at least about 95%, at least or at least about 96%, at least or at least about 97%, at least or at least about 98%, at least or at least about 99%, or 100% sequence identity or sequence similarity to a nuclease domain (e.g., ruvC domain or HNH domain) of the wild-type Cas protein.
Cas proteins, variants or derivatives thereof, may be modified to enhance the modulation of gene expression by the compositions, combinations, systems and methods of the present disclosure, e.g., as part of the complexes disclosed herein. Cas proteins may be modified to increase or decrease nucleic acid binding affinity, nucleic acid binding specificity, enzymatic activity, and/or binding to other factors (e.g., heterodimerization or oligomerization domains), as well as to induce ligands. Cas proteins may also be modified to alter any other activity or property of the protein, such as stability. For example, one or more nuclease domains of the Cas protein may be modified, deleted, or inactivated, or the Cas protein may be truncated to remove domains that are not necessary for the desired function of the protein or complex. Cas proteins may be modified to modulate (e.g., enhance or reduce) the activity of the Cas protein to regulate gene expression by the presently disclosed complexes comprising heterologous gene effectors.
For example, the Cas protein can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to a heterologous gene effector (e.g., an epigenetic modification domain, a transcriptional activation domain, and/or a transcriptional repression domain). Cas proteins can be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to an oligomerization or dimerization domain (e.g., a heterodimerization domain) disclosed herein. Cas proteins may be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to heterologous polypeptides that provide increased or decreased stability. Cas proteins may be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to sequences that can promote degradation of the Cas protein or a complex containing the Cas protein, e.g., a down-resolution stator, such as an inducible down-resolution stator (e.g., auxin-inducible).
Cas proteins may be coupled (e.g., fused, covalently coupled, or non-covalently coupled) to any suitable number of partners, such as at least 1, at least 2, at least 3, at least 4, or at least 5, at least 6, at least 7, or at least 8 partners. In some embodiments, cas proteins of the present disclosure are coupled (e.g., fused, covalently coupled, or non-covalently coupled) to at most 2, at most 3, at most 4, at most 5, at most 6, at most 7, at most 8, or at most 10 partners. In some embodiments, the Cas proteins of the present disclosure are coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 1-5, 1-4, 1-3, 1-2, 2-5, 2-4, 2-3, 3-5, 3-4, or 4-5 partners. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 1 partner. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 2 partners. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 3 partners. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 4 partners. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 5 partners. In some embodiments, the Cas protein of the present disclosure is coupled (e.g., fused, covalently coupled, or non-covalently coupled) to 6 partners.
The Cas protein may be a fusion protein. The fused domain or heterologous polypeptide may be located at the N-terminus, C-terminus, or inside the Cas protein.
Cas proteins may be provided in any form. For example, the Cas protein may be provided in the form of a protein, such as Cas protein alone, or Cas protein complexed with a guide nucleic acid to ribonucleoprotein. Cas proteins may be provided in a complex form, e.g., complexed with a guide nucleic acid and/or one or more heterologous gene effectors of the present disclosure. Cas proteins, such as RNAs (e.g., messenger RNAs (mrnas)), or DNAs, may be provided in the form of nucleic acids encoding Cas proteins. The nucleic acid encoding the Cas protein may be codon optimized for efficient translation into the protein in a particular cell or organism.
The nucleic acid encoding the Cas protein, fragment or derivative thereof may be stably integrated into the cell genome. The nucleic acid encoding the Cas protein may be operably linked to a promoter, e.g., a promoter that is constitutively or inducible in the cell. The nucleic acid encoding the Cas protein may be operably linked to a promoter in an expression construct. The expression construct may include any nucleic acid construct capable of directing expression of a gene of interest or other nucleic acid sequence of interest (e.g., cas gene), and such nucleic acid sequence of interest may be transferred to a target cell.
In some embodiments, the Cas protein, variant or derivative thereof is a nuclease-dead Cas (dCas) protein. The death Cas protein may be a protein lacking nucleic acid cleavage activity.
The Cas protein may comprise a modified form of the wild-type Cas protein. The modified form of the wild-type Cas protein may comprise an amino acid change (e.g., a deletion, insertion, or substitution) that reduces the nucleic acid cleavage activity of the Cas protein. For example, a modified form of Cas protein may have no more than 90%, no more than 80%, no more than 70%, no more than 60%, no more than 50%, no more than 40%, no more than 30%, no more than 20%, no more than 10%, no more than 5%, or no more than 1% of the nucleic acid cleavage activity of a wild-type Cas protein (e.g., cas9 from streptococcus pyogenes). The modified form of Cas protein may not have substantial nucleic acid cleavage activity. When the Cas protein is a modified form that does not have substantial nucleic acid cleavage activity, it may be referred to as enzymatically inactive, "inactive" and/or "dead" (abbreviated as "d"). The death Cas protein (e.g., dCas or dCas 9) can bind to the target polynucleotide, but may not cleave or minimally cleave the target polynucleotide. In some aspects, the death Cas protein is a death Cas9 protein.
The dCas9 polypeptide can bind to a single guide RNA (sgRNA) to activate or repress transcription of a target gene (e.g., a target endogenous gene), e.g., to a heterologous gene effector as disclosed herein. The sgrnas can be introduced into cells expressing Cas or guide portion components of the present disclosure. In some cases, such cells can comprise one or more different sgrnas that target the same target gene (e.g., target endogenous gene) or target gene regulatory sequence. In other cases, the sgrnas target different nucleic acids in the cell (e.g., different target genes, different target gene regulatory sequences, or different sequences within the same target genes or target gene regulatory sequences).
In some embodiments, the engineered gene effector comprises a polypeptide coupled to a heterologous endonuclease, wherein the heterologous endonuclease has a length of 450, 460, 470, 480, 490, 500, 520, 540, 560, 580, 600, 620, 640, 660, 680, or 700 amino acids, or optionally, the heterologous endonuclease has a length within a range defined by any two of the foregoing values (e.g., 450-700 amino acids, 480-600 amino acids, 500-530 amino acids, 500-600 amino acids, etc.). In some embodiments, the engineered gene effector comprises a polypeptide coupled to a heterologous endonuclease, wherein the heterologous endonuclease is a Cas protein and is 450, 460, 470, 480, 490, 500, 520, 540, 560, 580, 600, 620, 640, 660, 680, or 700 amino acids in length, or optionally, the Cas protein is within a range of any two of the foregoing values (e.g., 450-700 amino acids, 480-600 amino acids, 500-530 amino acids, 500-600 amino acids, etc.).
In some embodiments, the engineered gene effector comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the heterologous endonuclease comprises any of the amino acid sequences listed in table 7. In some embodiments, the engineered gene effector comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOs 2222-2422. In some embodiments, the engineered gene effector comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the heterologous endonuclease comprises an amino acid sequence that is, at least or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any of SEQ ID NOs 2222-2422, or optionally the heterologous endonuclease comprises a percent identity (e.g., 85% -100%) within a range defined by any two of the foregoing values for any of SEQ ID NOs 2222-2422, 90% -100%, 95% -100%, 98% -100%, etc.). In some embodiments, the engineered gene effector comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOs 2222-2422. In some embodiments, the engineered gene effector comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the polypeptide comprises a first peptide comprising any of the sequences provided in table 3 and a second peptide heterologous to the first peptide and comprising any of the sequences provided in table 3, wherein the heterologous endonuclease comprises a polypeptide that is, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, a polypeptide that is heterologous to any of SEQ ID NOs 2222-2422 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 2222-2422. In some embodiments, the engineered gene effector comprises any one of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the polypeptide comprises a first peptide comprising any one of the sequences provided in table 3 and a second peptide heterologous to the first peptide and comprising any one of the sequences provided in table 3, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of the SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide, wherein the heterologous endonuclease comprises a polypeptide having a sequence of any one of SEQ ID NOs 2222-2422 About, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical amino acid sequence, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOS 2222-2422. In some embodiments, the engineered gene effector comprises a polypeptide having a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide comprises a sequence that is about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any of SEQ ID NOs 75, 4, 40, and 77, or optionally the first peptide comprises a percent identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98-100%, etc.), the second peptide comprising a sequence that is identical to any one of SEQ ID NOs 64, 44, 77, 17, and 96, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the second peptide comprising a percentage of identity (e.g., 85% -100%, 85% to 100%, 90% -100%, 95% -100%, 98% -100%, etc.), wherein the polypeptide is coupled to a heterologous endonuclease, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise permutations of the SEQ ID NOs of the first and second peptides set forth in table a, wherein the heterologous endonuclease comprises a polypeptide having, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, a sequence which is identical to any one of SEQ ID NOs 2222-2422 97%, 98%, 99% or 100% identical, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 2222-2422. In some embodiments, the engineered gene effector comprises a polypeptide having a sequence that is, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, or optionally, the first peptide comprises a percentage of identity (e.g., 85% -100%, 85% >, 100% >, 90% -100%, 95% -100%, 98% -100%, etc.), wherein the polypeptide is coupled to a heterologous endonuclease comprising an amino acid sequence that is, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any of SEQ ID NOs 2222-2422, or optionally, a heterologous endonuclease comprising a percent identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98% -100%, etc.).
The engineered gene effector polypeptide may be coupled to the heterologous endonuclease in any suitable manner. In some embodiments, the heterologous endonuclease is fused to a heterologous endonuclease (e.g., a Cas protein). In some embodiments, the engineered gene effector comprises a polypeptide coupled to a heterologous endonuclease, wherein the polypeptide is fused to the heterologous endonuclease. In some embodiments, the engineered gene effector polypeptide is coupled to the C-terminus of a heterologous endonuclease. In some embodiments, the engineered gene effector comprises a polypeptide coupled to a heterologous endonuclease, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease. In some embodiments, the engineered gene effector comprises a polypeptide coupled to a heterologous endonuclease, wherein the heterologous endonuclease is a Cas protein, and wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease. In some embodiments, the engineered gene effector is coupled to the heterologous endonuclease through a linker. Any suitable linker (such as those described herein) may be used to couple the engineered gene effector to the heterologous endonuclease.
Fusion proteins comprising the engineered gene effector of the present disclosure, and a heterologous endonuclease coupled to or fused to a polypeptide of the engineered gene effector are also provided. In some embodiments, the fusion protein comprises an engineered gene effector, and a heterologous endonuclease coupled to or fused to the polypeptide. In some embodiments, the fusion protein comprises an engineered gene effector, and a heterologous endonuclease coupled to or fused to the polypeptide, wherein the heterologous endonuclease is a Cas protein. The fusion protein may comprise any suitable engineered gene effector, e.g., any of the engineered gene effectors described herein, e.g., comprising a first peptide of 75-95 amino acids in length comprising the sequence of any one of SEQ ID NOs 3-100 or variants at least 85% identical thereto, and a second peptide of 75-95 amino acids in length, which is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOs 3-100 or variants at least 85% identical thereto. The fusion protein can comprise any suitable heterologous endonuclease, e.g., any of the heterologous endonucleases or Cas proteins described herein, e.g., a heterologous endonuclease having the amino acid sequence of any of SEQ ID NOs 2222-2422 or a sequence at least 85% identical thereto. The engineered gene effector may be coupled or fused to the heterologous endonuclease in any suitable manner, for example, by a linker as described herein. The engineered gene effector fusion polypeptide may be located at the N-terminus, C-terminus, or within the heterologous endonuclease.
In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector described herein, e.g., an engineered gene effector comprising a polypeptide comprising a first peptide that is 75-95 amino acids in length and comprises the sequence of any one of SEQ ID NOs 3-100 or a variant at least 85% identical thereto, and a second peptide that is 75-95 amino acids in length and is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOs 3-100 or a variant at least 85% identical thereto. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the N-terminus of the second peptide, wherein the first peptide is 75-95 amino acids in length and comprises the sequence of any one of SEQ ID NOs 3-100 (or variants at least 85% identical thereto), the second peptide is 75-95 amino acids in length, the second peptide is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOs 3-100 (or variants at least 85% identical thereto), wherein the SEQ ID NOs of the first peptide and the second peptide are selected in pairs according to any one of table 4. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide is 75-95 amino acids in length and comprises the sequence of any one of SEQ ID NOs 3-100 (or variants at least 85% identical thereto), the second peptide is 75-95 amino acids in length, the second peptide is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOs 3-100 (or variants at least 85% identical thereto), wherein the SEQ ID NOs of the first peptide and the second peptide are selected according to any one of the pairwise permutations of table 4, wherein the first peptide and the second peptide are linked by any one of the spacers (e.g., peptide linkers) described herein. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the N-terminus of the second peptide, wherein the first peptide is 85 amino acids in length and comprises the sequence of any one of SEQ ID NOs 3-33, 35-100, and the second peptide is 85 amino acids in length, the second peptide being heterologous to the first peptide and comprising the sequence of any one of SEQ ID NOs 3-33, 35-100, wherein the SEQ ID NOs of the first peptide and the second peptide are selected according to any one of the pairwise permutations of table 4, wherein the first peptide and the second peptide are linked by the sequence of SEQ ID NO 2211. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the N-terminus of the second peptide, wherein the first peptide is 85 amino acids in length and comprises the sequence of any one of SEQ ID NOS: 3-33, 35-100, and the second peptide being heterologous to the first peptide and comprising the sequence of SEQ ID NO: 34, wherein the SEQ ID NOS of the first peptide and the second peptide are selected according to any one of the pairwise permutations of Table 4, wherein the first peptide and the second peptide are linked by the sequence of SEQ ID NO: 2211. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the end of the second peptide N, wherein the first peptide comprises the sequence of SEQ ID NO: 34 and the second peptide is 85 amino acids in length, the second peptide being heterologous to the first peptide and comprising the sequence of any one of SEQ ID NOs: 3-33, 35-100, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of table 4 in a pairwise arrangement, wherein the first and second peptides are linked by the sequence of SEQ ID NO: 2211. the engineered gene effector may be fused or fused to the heterologous endonuclease in any suitable manner, for example, by a linker as described herein. In some embodiments, the fusion protein comprises any of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the polypeptide comprises a first peptide comprising any of the sequences provided in table 3 and a second peptide heterologous to the first peptide and comprising any of the sequences provided in table 3, wherein the heterologous endonuclease comprises a polypeptide which is, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93% >, a polypeptide which is heterologous to any of SEQ ID NOs 2222-2422 94%, 95%, 96%, 97%, 98%, 99% or 100% identical amino acid sequence, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 2222-2422. In some embodiments, the fusion protein comprises any one of the polypeptides of the disclosure (e.g., having a first peptide and a second peptide as described herein) coupled to a heterologous endonuclease, wherein the polypeptide comprises a first peptide comprising any one of the sequences provided in table 3 and a second peptide heterologous to the first peptide and comprising any one of the sequences provided in table 3, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of the SEQ ID NOs of the first and second peptides listed in table 4, wherein the first peptide is at the N-terminus of the second peptide, wherein the heterologous endonuclease comprises a polypeptide having the sequence of any one of SEQ ID NOs 2222-2422, About, at least, or at most 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical amino acid sequence, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOS 2222-2422. In some embodiments, the fusion protein comprises a polypeptide having a first peptide and a second peptide, the first peptide being N-terminal to the second peptide, wherein the first peptide comprises a sequence that is, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any of SEQ ID NOs 75, 4, 40, and 77, or optionally the first peptide comprises a percentage of identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98% -100%, etc.), the second peptide comprising a sequence that is identical to any one of SEQ ID NOs 64, 44, 77, 17, and 96, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or optionally, the second peptide comprising a percentage of identity (e.g., 85% -100%, 85% to 100%, 90% -100%, 95% -100%, 98% -100%, etc.), wherein the polypeptide is coupled to a heterologous endonuclease, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise permutations of the SEQ ID NOs of the first and second peptides set forth in table a, wherein the heterologous endonuclease comprises a polypeptide having, about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, a sequence which is identical to any one of SEQ ID NOs 2222-2422 97%, 98%, 99% or 100% identical, or optionally, a heterologous endonuclease comprises a sequence having a percent identity (e.g., 85% -100%, 90% -100%, 95% -100%, 98% -100%, etc.) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 2222-2422. In some embodiments, the fusion protein comprises a polypeptide having a sequence that is about, at least, or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107, or optionally, the first peptide comprises a percentage of identity (e.g., 85% -100%, v/v) within a range defined by any two of the foregoing values to any one of SEQ ID NOs 1085, 122, 1084, 653, 1099, and 1107 90% -100%, 95% -100%, 98% -100%, etc.), wherein the polypeptide is coupled to a heterologous endonuclease comprising an amino acid sequence that is, about, at least or up to 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any of SEQ ID NOs 2222-2422, or optionally, a heterologous endonuclease comprising a percent identity (e.g., 85% -100% >, 90% -100%, 95% -100%, 98% -100%, etc.).
In some embodiments, the fusion protein comprises a heterologous endonuclease fused to any of the engineered gene effectors described herein by a linker (e.g., but not limited to any of SEQ ID NOS: 2211-2221). In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector described herein by a linker (e.g., but not limited to any one of SEQ ID NOS: 2211-2221), e.g., the engineered gene effector comprises a polypeptide comprising a first peptide having a length of 75-95 amino acids and comprising the sequence of any one of SEQ ID NOS: 3-100 or a variant at least 85% identical thereto and a second peptide having a length of 75-95 amino acids and being heterologous to the first peptide and comprising the sequence of any one of SEQ ID NOS: 3-100 or a variant at least 85% identical thereto. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the N-terminus of the second peptide, wherein the first peptide is 75-95 amino acids in length and comprises the sequence of any one of SEQ ID NOS: 3-100 (or a variant at least 85% identical thereto), the second peptide is 75-95 amino acids in length, the second peptide is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOS: 3-100 (or a variant at least 85% identical thereto), wherein the SEQ ID NOS of the first peptide and the second peptide are selected according to any one of the pairwise permutations of Table 4. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide is 75-95 amino acids in length and comprises the sequence of any one of SEQ ID NOs: 3-100 (or a variant at least 85% identical thereto), the second peptide is 75-95 amino acids in length, the second peptide is heterologous to the first peptide and comprises the sequence of any one of SEQ ID NOs: 3-100 (or a variant at least 85% identical thereto), wherein the SEQ ID NOs of the first peptide and the second peptide are selected according to any one of the pairwise arrangements in Table 4, wherein the first peptide and the second peptide are linked by any one of the linkers (e.g., peptide linkers) described herein. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being N-terminally positioned on the second peptide, wherein the first peptide is 85 amino acids in length and comprises the sequence of any one of SEQ ID NOs: 3-35-100, and the second peptide is 85 amino acids in length and is heterologous to the first peptide and comprises SEQ ID NOs: 3-100, by way of a linker (e.g., any one of SEQ ID NOs: 2211-2221), 35-100, wherein the SEQ ID NOs of the first and second peptides are selected according to any one of the pairwise arrangements of table 4, wherein the first and second peptides are linked by the sequence of SEQ ID NO: 2211. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being at the N-terminus of the second peptide, wherein the first peptide is 85 amino acids in length and comprises the sequence of any one of SEQ ID NOS: 3-33, 35-100, and the second peptide is heterologous to the first peptide and comprises the sequence of SEQ ID NO: 34, wherein the SEQ ID NOS of the first peptide and the second peptide are selected according to any one of the pairwise arrangements of SEQ ID NOS: 2211. In some embodiments, the fusion protein comprises a heterologous endonuclease fused to an engineered gene effector comprising a polypeptide comprising a first peptide and a second peptide, the first peptide being located at the N-terminus of the second peptide, wherein the first peptide comprises the sequence of SEQ ID NO: 34 and the second peptide is 85 amino acids in length, the second peptide being heterologous to the first peptide and comprising the sequence of any one of SEQ ID NO: 3-33, 35-100, wherein the SEQ ID NO of the first peptide and the second peptide are selected according to any one of Table 4 in a pairwise arrangement, wherein the first peptide and the second peptide are linked by the sequence of SEQ ID NO: 2211.
TABLE 7 amino acid sequences of heterologous endonuclease variants
Heterologous endonucleases SEQ ID NO Amino acid sequence
Un1Cas12fl, "CasMini" 2222 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSDVCYTRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKIGEKSAWMLNLSIDVPKIDKGVDPSIIGGIDVGVKSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENADYNAALNISNPKLKSTKEEP
Un2Cas12fl 2223 MEVQKTVMKTLSLRILRPLYSQEIEKEIKEEKERRKQAGGTGELDGGFYKKLEKKHSEMFSFDRLNLLLNQLQREIAKVYNHAISELYIATIAQGNKSNKHYISSIVYNRAYGYFYNAYIALGICSKVEANFRSNELLTQQSALPTAKSDNFPIVLHKQKGAEGEDGGFRISTEGSDLIFEIPIPFYEYNGENRKEPYKWVKKGGQKPVLKLILSTFRRQRNKGWAKDEGTDAEIRKVTEGKYQVSQIEINRGKKLGEHQKWFANFSIEQPIYERKPNRSIVGGLDVGIRSPLVCAINNSFSRYSVDSNDVFKFSKQVFAFRRRLLSKNSLKRKGHGAAHKLEPITEMTEKNDKFRKKIIERWAKEVTNFFVKNQVGIVQIEDLSTMKDREDHFFNQYLRGFWPYYQMQTLIENKLKEYGIEVKRVQAKYTSQLCSNPNCRYWNNYFNFEYRKVNKFPKFKCEKCNLEISADYNAARNLSTPDIEKFVAKATKGINLPEK
AsCas12f 2224 MIKVYRYEIVKPLDLDWKEFGTILRQLQQETRFALNKATQLAWEWMGFSSDYKDNHGEYPKSKDILGYTNVHGYAYHTIKTKAYRLNSGNLSQTIKRATDRFKAYQKEILRGDMSIPSYKRDIPLDLIKENISVNRMNHGDYIASLSLLSNPAKQEMNVKRKISVIIIVRGAGKTIMDRILSGEYQVSASQIIHDDRKNKWYLNISYDFEPQTRVLDLNKIMGIDLGVAVAVYMAFQHTPARYKLEGGEIENFRRQVESRRISMLRQGKYAGGARGGHGRDKRIKPIEQLRDKIANFRDTTNHRYSRYIVDMAIKEGCGTIQMEDLTNIRDIGSRFLQNWTYYDLQQKIIYKAEEAGIKVIKIDPQYTSQRCSECGNIDSGNRIGQAIFKCRACGYEANADYNAARNIAIPNIDKIIAESIK
Un1Cas12fl comprising an N-terminal deletion, e.g. amino acid residues 2-21 2225 MEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Un1Cas12fl comprising an N-terminal deletion, e.g. amino acid residues 2-31 2226 MNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Un1Cas12fl comprising a C-terminal deletion, e.g. amino acid residues 510-529 2227 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENA
Un1Cas12fl comprising a C-terminal deletion, e.g. amino acid residues 500-529 2228 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFK
Un1Cas12fl comprising a partial N-terminal deletion, e.g. amino acid residues 47-66 2229 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Un1Cas12fl comprising a partial N-terminal deletion, e.g. amino acid residues 47-71 2230 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALECKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Inactivated nuclease variant of Un1Cas12fl, i.e., "dCasMini" 2231 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Cas12f variants 2232 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIAGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2 2233 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Mi1Cas12f2 2234 MNMSKTTISVKLKIIDLSSEKKEFLDNYFNEYAKATTFCQLRIRRLLRNTHWLGKKEKSSKKWIFESGICDLCGENKELVNEDRNSGEPAKICKRCYNGRYGNQMIRKLFVSTKKREVQENMDIRRVAKLNNTHYHRIPEEAFDMIKAADTAEKRRKKNVEYDKKRQMEFIEMFNDEKKRAARPKKPNERETRYVHISKLESPSKGYTLNGIKRKIDGMGKKIERAEKGLSRKKIFGYQGNRIKLDSNWVRFDLAESEITIPSLFKEMKLRITGPTNVHSKSGQIYFAEWFERINKQPNNYCYLIRKTSSNGKYEYYLQYTYEAEVEANKEYAGCLGVDIGCSKLAAAVYYDSKNKKAQKPIEIFTNPIKKIKMRREKLIKLLSRVKVRHRRRKLMQLSKTEPIIDYTCHKTARKIVEMANTAKAFISMENLETGIKQKQQARETKKQKFYRNMFLFRKLSKLIEYKALLKGIKIVYVKPDYTSQTCSSCGADKEKTERPSQAIFRCLNPTCRYYQRDINADFNAAVNIAKKALNNTEVVTTLL
Mi2Cas12f2 2235 MPSETYITKTLSLKLIPSDEEKQALENYFITFQRAVNFAIDRIVDIRSSFRYLNKNEQFPAVCDCCGKKEKIMYVNISNKTFKFKPSRNQKDRYTKDIYTIKPNAHICKTCYSGVAGNMFIRKQMYPNDKEGWKVSRSYNIKVNAPGLTGTEYAMAIRKAISILRSFEKRRRNAERRIIEYEKSKKEYLELIDDVEKGKTNKIVVLEKEGHQRVKRYKHKNWPEKWQGISLNKAKSKVKDIEKRIKKLKEWKHPTLNRPYVELHKNNVRIVGYETVELKLGNKMYTIHFASISNLRKPFRKQKKKSIEYLKHLLTLALKRNLETYPSIIKRGKNFFLQYPVRVTVKVPKLTKNFKAFGIDRGVNRLAVGCIISKDGKLTNKNIFFFHGKEAWAKENRYKKIRDRLYAMAKKLRGDKTKKIRLYHEIRKKFRHKVKYFRRNYLHNISKQIVEIAKENTPTVIVLEDLRYLRERTYRGKGRSKKAKKTNYKLNTFTYRMLIDMIKYKAEEAGVPVMIIDPRNTSRKCSKCGYVDENNRKQASFKCLKCGYSLNADLNAAVNIAKAFYECPTFRWEEKLHAYVCSEPDK
AuCas12f2 2236 MKSFKLKLLPTDEQNVLLNEVFCKWASLCTRMASKGHDKERLAPPDSSGNYFNKTQLNQVNTDVTDHMGALEESASQKERAVEKVKRRLKLISDMLSEPNLRDVSQQKPTTFRPLEWVKEGLLKTKYHTVHYWQKECDKLTKQKERMEKTIEKIKKGKITFKPTKMSLHQNCFSLSFGKGTFSMRPFSDTKRGINLDMLTAPIQPAIGKNDGKSSLRSKEFIARNIENYIIFSIHSQLFGLSRSEELLLNAKKEELVAKRDAMLKKKSDSLSKKIKELEKIVGRKITDSERSEIMSQGGKLSSEKFSEDNSYLKTLKVLAKDIIGREELFRLKKYPIVIRKPLNERKKLKNLKPDEWEYYLQLSYDELEKKEFTPKTIMGIDRGLKHILAIAIYDPVQNKFVKNMLIPNPILGWKWKLRKIKRSIQHMERRIRAQQNAHVPENQLKKRLKSIENKIDYYYHNVSRQILNLAHDFKSAIVVEDLQNMKQHGRKKSKGLRGLNYALSNFDYGKIMGLVKYKAESENVPLLTVLPAGTSQNCAYCLLYGKEQGNYVRNNVNSKIGKCKLHGEIDADINAARTIAICYHKNINEPKPYGERKTFKRK
PtCas12f1 2237 MKYTKVMRYQIIKPLNAEWDELGMVLRDIQKETRAALNKTIQLCWEYQGFSADYKQIHGQYPKPKDVLGYTSMHGYAYDRLKNEFSKIASSNLSQTIKRAVDKWNSDLKEILRGDRSIPNFRKDCPIDIVKQSTKIQKCNDGYVLSLGLINREYKNELGRKNGVFDVLIKANDKTQQTILERIINGDYTYTASQIINHKNKWFINLTYQFETKETALDPNNVMGVDLGIVYPVYIAFNNSLHRYHIKGGEIERFRRQVEKRKRELLNQGKYCGDGRKGHGYATRTKSIESISDKIARFRDTCNHKYSRFIVDMALKHNCGIIQMEDLTGISKESTFLKNWTYYDLQQKIEYKAREAGIQVIKIEPQYTSQRCSKCGYIDKENRQEQATFKCIECGFKTNADYNAARNIAIPNIDKIIRKTLKMQ
RuCas12f1 2238 MTLLVKVVKIHLISEQFDKAGNRIDYEEVNKILWELQKQTREAKNKTVQLLWEWNNFSSDYVKASGIYPKAKDIFGYSSVHGQANKELRTKLALNSSNLSTTTMDVCKNFNTYKKEVWKGKRSVPSYKSDQPLDLHKDSIKLIYENNEFYVRLALLKKAEFAKYGFKDGFRFKMQVKDNSTKTILERCFDEVYKINASKLLYDQKKKKWKLNLSYSFDNKNISELDKEKILGVDVGVNCPLVASVFGDRDRFIIKGGEIEKFRKSVEARRRSMLEQTKYCGDGRIGHGRKKRTEPALNIGDKIARFRDTTNHKYSRALIEYAVKKGCGTIQMEKLTGITSKSDRFLKDWTYYDLQTKIENKAKEVGINVVYIAPKYTSQRCSKCGYIHKDNRPNQAKFRCLECDFESNADYNASQNIGIKNIDKIIEKDLQKQESEVQVNENK
SpCas12f1 2239 MGESVKAIKLKILDMFLDPECTKQDDNWRKDLSTMSRFCAEAGNMCLRDLYNYFSMPKEDRISSKDLYNAMYHKTKLLHPELPGKVANQIVNHAKDVWKRNAKLIYRNQISMPTYKITTAPIRLQNNIYKLIKNKNKYIIDVQLYSKEYSKDSGKGTHRYFLVAVRDSSTRMIFDRIMSKDHIDSSKSYTQGQLQIKKDHQGKWYCIIPYTFPTHETVLDPDKVMGVDLGVAKAVYWAFNSSYKRGCIDGGEIEHFRKMIRARRVSIQNQIKHSGDARKGHGRKRALKPIETLSEKEKNFRDTINHRYANRIVEAAIKQGCGTIQIENLEGIADTTGSKFLKNWPYYDLQTKIVNKAKEHGITVVAINPQYTSQRCSMCGYIEKTNRSSQAVFECKQCGYGSRTICINCRHVQVSGDVCEECGGIVKKENVNADYNAAKNISTPYIDQIIMEKCLELGIPYRSITCKECGHIQASGNTCEVCGSTNILKPKKIRKAK
CnCas12f1 2240 MITVRKIKLTIMGDKDTRNSQYKWIRDEQYNQYRALNMGMTYLAVNDILYMNESGLEIRTIKDLKDCEKDIDKNKKEIEKLTARLEKEQNKKNSSSEKLDEIKYKISLVENKIEDYKLKIVELNKILEETQKERMDIQKEFKEKYVDDLYQVLDKIPFKHLDNKSLVTQRIKADIKSDKSNGLLKGERSIRNYKRNFPLMTRGRDLKFKYDDNDDIEIKWMEGIKFKVILGNRIKNSLELRHTLHKVIEGKYKICDSSLQFDKNNNLILNLTLDIPIDIVNKKVSGRVVGVDLGLKIPAYCALNDVEYIKKSIGRIDDFLKVRTQMQSRRRRLQIAIQSAKGGKGRVNKLQALERFAEKEKNFAKTYNHFLSSNIVKFAVSNQAEQINMELLSLKETQNKSILRNWSYYQLQTMIEYKAQREGIKVKYIDPYHTSQTCSKCGNYEEGQRESQADFICKKCGYKVNADYNAARNIAMSNKYITKKEESKYYKIKESMV
cA1 2241 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDGGFYKKLEKKHSEMFSFDRLNLLLNQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA3 2242 MEVQKTVMKTLSLRILRPLYSQEIEKEIKEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA4 2243 MAKNTITKTLKLRIVRPLYSQEIEKEIKEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA5 2244 MIKVYRYEIVKPLDLDWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA6 2245 MITVRKIKLTIMGDKDTRNSQYKWIRDEQYNQYRALNMGMTYLAVNDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA7 2246 MGESVKAIKLKILDMFLDPECTKQDDNWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
Cas variant/cA 8 2247 MKYTKVMRYQIIKPLNAEWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA9 2248 MTLLVKVVKIHLISEQFDKAGNRIDYEEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA1 2249 MAKNTITKTLKLRIVRPYYSQEIEKIVAEEKNRREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA2 2250 MAKNTITKTLKLRIVRPYYSAEVEKIVAEEKNNREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRGQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA3 2251 MAKNTITKTLKLRIVRPYYSAEIEKIVADEKNRREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRKQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA4 2252 MAKNTITKTLKLRIVRPYNSQEVEKIVAEEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA5 2253 MAKNTITKTLKLRIVRPYNSQEVEKIVAEEKNNREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA6 2254 MAKNTITKTLKLRIVRPYNSQEVEKIVAEEKNNREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRGQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA7 2255 MAKNTITKTLKLRIVRPYYSAEVEKIVAEEKNNREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA8 2256 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA9 2257 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRGQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA10 2258 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA11 2259 MAKNTITKTLKLRIVRPYNSAEIEKIVAEEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA12 2260 MAKNTITKTLKLRIVRPYNSAEVEKIVAEEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA13 2261 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA14 2262 MAKNTITKTLKLRIVRPYYSAEIEKIVADEKNRREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA15 2263 MAKNTITKTLKLRIVRPYYSAEIEKIVAEEKNRREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mA16 2264 MAKNTITKTLKLRIVRPYYSAEIEKIVAEEKNNREKIALDKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAREIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB1 2265 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAARLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFKISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB2 2266 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFKISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB3 2267 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAGLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFKISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB4 2268 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAARLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFRISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB5 2269 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFRISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB6 2270 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAGLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFRISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB7 2271 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAARLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFSISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB8 2272 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFSISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mB9 2273 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAGLFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFSISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFRQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIRKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC1 2274 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLENFNKKMFARRRILLKKNRHKRGGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC2 2275 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIEGGDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC3 2276 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIEGGDLENFNKKMFARRRILLKKNRHKRGGHGRDKKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC4 2277 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIEGGDLENFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC5 2278 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGRDKKLKPIEQLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC6 2279 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLENFNKKMFARRRILLKKNRHKRAGHGRDKKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC7 2280 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLEHFNKKMFARRRILLKKNRHKRKGHGAKNKLKPIETLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC8 2281 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIDGGDLEHFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC9 2282 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIDGGDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPIETLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC10 2283 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIDGGDLEHFNKKMFARRRILLKKNRHKRKGHGAKNKLKPIETLTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC11 2284 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSIDSNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC12 2285 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSIDSNDLFHFNKKMFARRRILLKKNRHKRAGHGAAHKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC13 2286 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSIDSNDLFKFNKKMFARRRILLKKNRHKRAGHGAAHKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC14 2287 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIDSNDLFKFNKKMFARRRILLKKNRHKRAGHGAAHKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC15 2288 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSIDSNDLFKFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC16 2289 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC17 2290 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIDSNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC18 2291 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAAHKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC19 2292 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIKGGDLERFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC20 2293 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIKGGDLERFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC21 2294 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIKGGDLEKFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC22 2295 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIKGGDLFHFNKKMFARRRILLKKNRHKRAGHGRKKKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC23 2296 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLEKFNKKMFARRRILLKKNRHKRAGHGRKKKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mC24 2297 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSIKGGDLEKFNKKMFARRRILLKKNRHKRAGHGRKKKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mD1 2298 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWAKEIADFFIKNKVGTVQMEDLSTMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAARNISTPDIKSTKERP
mD2 2299 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
mD3 2300 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWAKEIADFFIKNKVGTVQMEDLSTMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mD4 2301 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
mD5 2302 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISTPDIKSTKERP
mD6 2303 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLSTMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mD7 2304 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
mD8 2305 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAARNISTPDIKSTKERP
mD9 2306 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWSRYIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIKIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKANAAYNAARNISNPNIKSTKERP
mD10 2307 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACYIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
mD11 2308 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACYIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAARNISNPKLKSTKERP
mD12 2309 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACYIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
mD13 2310 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWARYIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIKIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mD14 2311 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAARNISNPNIKSTKERP
mD15 2312 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
mD16 2313 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAARNISNPNIKSTKERP
mD17 2314 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWANRIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIKIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAAKNISNPKLKSTKERP
mD18 2315 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAAKNISNPKLKSTKERP
mD19 2316 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIKIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAAKNISNPKLKSTKERP
mD20 2317 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWANRIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIKIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
mD21 2318 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWANRIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKRNAAYNAALNISNPKLKSTKERP
mD22 2319 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWSRFIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYYEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cB2 2320 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIDVQLYSKEYSKDSGKGTHRYFLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cB3 2321 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIASLSLLSNPAKQEMNVKRKISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cD1 2322 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPIYERKPNRSIVGGLAVGIRSPLVCAINNSFSRYSVDSNDVFKFSKQVFAFRRRLLSKNSLKRKGHGAAHKLEPITEMTEKNDKFRKKIIERWAKEVTNFFVKNQVGIVQIEDLSTMKDREDHFFNQYLRGFWPYYQMQTLIENKLKEYGIEVKRVQAKYTSQLCSNPNCRYWNNYFNFEYRKVNKFPKFKCEKCNLEISAAYNAARNLSTPDIEKFVAKATKGINLPEK
cD2 2323 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPTHETVLDPDKVMGVALGVAKAVYWAFNSSYKRGCIDGGEIEHFRKMIRARRVSIQNQIKHSGDARKGHGRKRALKPIETLSEKEKNFRDTINHRYANRIVEAAIKQGCGTIQIENLEGIADTTGSKFLKNWPYYDLQTKIVNKAKEHGITVVAINPQYTSQRCSMCGYIEKTNRSSQAVFECKQCGYGSRTICINCRHVQVSGDVCEECGGIVKKENVNAAYNAAKNISTPYIDQIIMEKCLELGIPYRSITCKECGHIQASGNTCEVCGSTNILKPKK
cD3 2324 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPQTRVLDLNKIMGIALGVAVAVYMAFQHTPARYKLEGGEIENFRRQVESRRISMLRQGKYAGGARGGHGRDKRIKPIEQLRDKIANFRDTTNHRYSRYIVDMAIKEGCGTIQMEDLTNIRDIGSRFLQNWTYYDLQQKIIYKAEEAGIKVIKIDPQYTSQRCSECGNIDSGNRIGQAIFKCRACGYEANAAYNAARNIAIPNIDKIIAESIK
cD4 2325 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPIDIVNKKVSGRVVGVALGLKIPAYCALNDVEYIKKSIGRIDDFLKVRTQMQSRRRRLQIAIQSAKGGKGRVNKLQALERFAEKEKNFAKTYNHFLSSNIVKFAVSNQAEQINMELLSLKETQNKSILRNWSYYQLQTMIEYKAQREGIKVKYIDPYHTSQTCSKCGNYEEGQRESQADFICKKCGYKVNAAYNAARNIAMSNKYITKKEESKYYKIKESMV
cD5 2326 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVETKETALDPNNVMGVALGIVYPVYIAFNNSLHRYHIKGGEIERFRRQVEKRKRELLNQGKYCGDGRKGHGYATRTKSIESISDKIARFRDTCNHKYSRFIVDMALKHNCGIIQMEDLTGISKESTFLKNWTYYDLQQKIEYKAREAGIQVIKIEPQYTSQRCSKCGYIDKENRQEQATFKCIECGFKTNAAYNAARNIAIPNIDKIIRKTLKMQ
cD6 2327 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVNRSIVGGLAVGIRSPLVCAINNSFSRYSVDSNDVFKFSKQVFAFRRRLLSKNSLKRKGHGAAHKLEPITEMTEKNDKFRKKIIERWAKEVTNFFVKNQVGIVQIEDLSTMKDREDHFFNQYLRGFWPYYQMQTLIENKLKEYGIEVKRVQAKYTSQLCSNPNCRYWNNYFNFEYRKVNKFPKFKCEKCNLEISAAYNAARNLSTPDIEKFVAKATKGINLPEK
cD7 2328 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPDKVMGVALGVAKAVYWAFNSSYKRGCIDGGEIEHFRKMIRARRVSIQNQIKHSGDARKGHGRKRALKPIETLSEKEKNFRDTINHRYANRIVEAAIKQGCGTIQIENLEGIADTTGSKFLKNWPYYDLQTKIVNKAKEHGITVVAINPQYTSQRCSMCGYIEKTNRSSQAVFECKQCGYGSRTICINCRHVQVSGDVCEECGGIVKKENVNAAYNAAKNISTPYIDQIIMEKCLELGIPYRSITCKECGHIQASGNTCEVCGSTNILKPKK
cD8 2329 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDLNKIMGIALGVAVAVYMAFQHTPARYKLEGGEIENFRRQVESRRISMLRQGKYAGGARGGHGRDKRIKPIEQLRDKIANFRDTTNHRYSRYIVDMAIKEGCGTIQMEDLTNIRDIGSRFLQNWTYYDLQQKIIYKAEEAGIKVIKIDPQYTSQRCSECGNIDSGNRIGQAIFKCRACGYEANAAYNAARNIAIPNIDKIIAESIK
cD9 2330 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPNNVMGVALGIVYPVYIAFNNSLHRYHIKGGEIERFRRQVEKRKRELLNQGKYCGDGRKGHGYATRTKSIESISDKIARFRDTCNHKYSRFIVDMALKHNCGIIQMEDLTGISKESTFLKNWTYYDLQQKIEYKAREAGIQVIKIEPQYTSQRCSKCGYIDKENRQEQATFKCIECGFKTNAAYNAARNIAIPNIDKIIRKTLKMQ
cD10 2331 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDKEKILGVAVGVNCPLVASVFGDRDRFIIKGGEIEKFRKSVEARRRSMLEQTKYCGDGRIGHGRKKRTEPALNIGDKIARFRDTTNHKYSRALIEYAVKKGCGTIQMEKLTGITSKSDRFLKDWTYYDLQTKIENKAKEVGINVVYIAPKYTSQRCSKCGYIHKDNRPNQAKFRCLECDFESNAAYNASQNIGIKNIDKIIEKDLQKQESEVQVNENK
t1 2332 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.1 2333 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.2 2334 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.3 2335 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.4 2336 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.5 2337 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.6 2338 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.7 2339 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.8 2340 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.9 2341 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.10 2342 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.11 2343 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.12 2344 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.13 2345 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.14 2346 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.15 2347 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.16 2348 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.17 2349 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.18 2350 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.19 2351 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.20 2352 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.21 2353 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.22 2354 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.23 2355 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.24 2356 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.25 2357 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.26 2358 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.27 2359 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.28 2360 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.29 2361 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.30 2362 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.31 2363 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.32 2364 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.33 2365 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.34 2366 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.35 2367 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.36 2368 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.37 2369 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.38 2370 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.39 2371 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.40 2372 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.41 2373 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.42 2374 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.43 2375 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.44 2376 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.45 2377 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.46 2378 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.47 2379 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.48 2380 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.49 2381 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.50 2382 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.51 2383 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.52 2384 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.53 2385 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.54 2386 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.55 2387 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.56 2388 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.57 2389 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.58 2390 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.59 2391 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.60 2392 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.61 2393 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.62 2394 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.63 2395 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.64 2396 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.65 2397 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.66 2398 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.67 2399 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.68 2400 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.69 2401 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.70 2402 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.71 2403 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.72 2404 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.73 2405 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISTPDIKSTKERP
cA2.74 2406 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNISNPNIKSTKERP
cA2.75 2407 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.76 2408 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.77 2409 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.78 2410 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNI
cA2.79 2411 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQRCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAARNI
cA2.80 2412 MAKNTITKTLKLRIVRPYNSQEIEKIVAEEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNI
cA2.81 2413 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNSFSRYSISDNDLFKFNKKMFARRRILLKKNRHKRKGHGAKNKLKPITELTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.82 2414 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKNNREKIALEKNKDKVKEACSKHLKVAAYCTTQVERNACLFCKARKLDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.83 2415 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKERRKQAGGTGELDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.84 2416 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKERRKQAGGTGELDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAALFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.85 2417 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKERRKQAGGTGELDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.86 2418 MAKNTITKTLKLRIVRPYNSAEIEKIVADEKERRKQAGGTGELDDKFYKKLRKQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPKFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.87 2419 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMEDLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.88 2420 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPKLKSTKERP
cA2.89 2421 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSKTCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
cA2.90 2422 MAKNTITKTLKLRIVRPYNSAEVEKIVADEKERRKQAGGTGELDDKFYQKLRGQFPDAVFWQEISEIFRQLQKQAAEIYNQSLIELYYEIFIKGKGIANASSVEHYLSRVCYRRAAELFKNAAIASGLRSKIKSNFRLKELKNMKSGLPTTKSDNFPIPLVKQKGGQYTGFEISNHNSDFIIKIPFGRWQVKKEIDKYRPWEKFDFEQVQKSPKPISLLLSTQRRKRNKGWSKDEGTEAEIKKVMNGDYQTSYIEVKRGSKICEKSAWMLNLSIDVPKIDKGVDPSIIGGIAVGVRSPLVCAINNAFSRYSISDNDLFHFNKKMFARRRILLKKNRHKRAGHGAKNKLKPITILTEKSERFRKKLIERWACEIADFFIKNKVGTVQMENLESMKRKEDSYFNIRLRGFWPYAEMQNKIEFKLKQYGIEIRKVAPNNTSQLCSKCGHLNNYFNFEYRKKNKFPHFKCEKCNFKENAAYNAALNISNPDIKSTKERP
Polynucleotides, vectors, and cells
Also provided herein are polynucleotides comprising a nucleotide sequence encoding any one of the engineered gene effector or fusion proteins of the present disclosure. In some embodiments, the polynucleotide comprises a nucleotide sequence encoding an engineered gene effector as described herein. In some embodiments, the polynucleotide comprises a nucleotide sequence encoding a fusion protein described herein.
Also provided are vectors (e.g., expression vectors) comprising polynucleotides of the disclosure, e.g., comprising a nucleotide sequence encoding an engineered gene effector as described herein. In some embodiments, the vector comprises a polynucleotide comprising a nucleotide sequence encoding any one of the engineered gene effectors described herein. In some embodiments, the vector comprises a polynucleotide comprising a nucleotide sequence encoding any one of the fusion proteins described herein. In some embodiments, the vector is a viral vector.
In some embodiments, the methods of delivering a polynucleotide provided herein to a cell can involve viral or non-viral delivery methods. Thus, the polynucleotide may be one or more viral vectors (e.g., one or more AAV vectors). Or the polynucleotide may be a non-viral vector such as a cationic lipid and/or lipid particle (e.g., a Lipid Nanoparticle (LNP)) complexed with or encapsulated by a non-viral delivery moiety.
Non-viral delivery methods of nucleic acids may include lipofection, nuclear transfection, microinjection, gene guns, virosomes, liposomes, immunoliposomes, polycations or lipids: nucleic acid conjugates, naked DNA, artificial viral particles and agent-enhanced DNA uptake. Cationic lipids and neutral lipids suitable for lipid transfection can be identified using high efficiency receptors for polynucleotides. May be delivered to cells (e.g., administered in vitro or ex vivo) or target tissue (e.g., administered in vivo).
In some embodiments, the compositions and systems provided herein are delivered to a subject using a viral vector. In some cases, the viral vector is an adeno-associated virus (AAV) vector. The term "AAV" is an abbreviation for adeno-associated virus and may be used to refer to the virus itself or derivatives thereof. Unless otherwise required, the term encompasses all serotypes, subtypes, and naturally occurring and recombinant forms. The abbreviation "rAAV" refers to recombinant adeno-associated virus, also known as recombinant AAV vector (or "rAAV vector"). The term "AAV" includes AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, rh10 and hybrids thereof, avian AAV, bovine AAV, canine AAV, equine AAV, primate AAV, non-primate AAV, and ovine AAV. The genomic sequences of AAV of different serotypes and the sequences of the natural Terminal Repeat (TR), rep proteins, and capsid subunits are known in the art. These sequences can be found in the literature or in public databases such as GenBank. As used herein, "rAAV vector" refers to an AAV vector comprising a polynucleotide sequence that is not AAV-derived (i.e., a polynucleotide heterologous to AAV), typically a sequence of interest for cellular gene transformation. Typically, the heterologous polynucleotide is flanked by at least one, and typically two, AAV Inverted Terminal Repeats (ITRs). The term rAAV vector encompasses both rAAV vector particles and rAAV vector plasmids. The rAAV vector may be single stranded (ssAAV) or self-complementary (scAAV). "AAV virus" or "AAV viral particle" or "rAAV vector particle" refers to a viral particle comprised of at least one AAV capsid protein and a packaged polynucleotide rAAV vector. If the particle comprises a heterologous polynucleotide (i.e., a polynucleotide other than the wild-type AAV genome, such as a transgene to be delivered to a mammalian cell), it is generally referred to as a "rAAV vector particle" or simply "rAAV vector". Thus, production of a rAAV particle must include production of a rAAV vector because such a vector is contained in the rAAV particle. In some cases, AAV vectors are selected based on the tropism of the viral vectors. In some embodiments, AAV vectors having tropism for a target tissue may be used to deliver polynucleotides encoding the compositions and systems provided herein to the target tissue.
RNA or DNA virus-based systems can be used to target specific cells in the body and deliver viral payloads to the nucleus. Viral vectors may be administered directly (in vivo), or they may be used to treat cells in vitro, and modified cells may optionally be administered (ex vivo). Viral-based systems may include retroviral, lentiviral, adenoviral, adeno-associated viral and herpes simplex viral vectors for gene transfer. Integration in the host genome can occur using retroviral, lentiviral, and adeno-associated viral gene transfer methods, which can lead to long-term expression of the inserted transgene. High transduction efficiencies can be observed in many different cell types and target tissues.
The potential targeting population of target cells can be expanded by altering the tropism of retroviruses by incorporating exogenous envelope proteins. Lentiviral vectors are retroviral vectors that transduce or infect non-dividing cells and produce high viral titers. The choice of retroviral gene transfer system may depend on the target tissue. Retroviral vectors may contain cis-acting long terminal repeats with packaging capability for foreign sequences up to 6-10kb in length. The minimal cis-acting LTR may be sufficient to replicate and package a vector that can be used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Retroviral vectors may include vectors based on murine leukemia virus (MuLV), gibbon simian leukemia virus (GaLV), simian Immunodeficiency Virus (SIV) or Human Immunodeficiency Virus (HIV), and combinations thereof.
Adenovirus-based systems may be used. Adenovirus-based systems can cause transient expression of transgenes. Adenovirus-based vectors can have high transduction efficiency in cells, and cell division may not be required. High titres and high levels of expression can be obtained with adenovirus-based vectors. Adeno-associated virus ("AAV") vectors are useful for transducing cells with target nucleic acids (e.g., in the in vitro production of nucleic acids and peptides), as well as for in vivo and ex vivo gene therapy procedures.
Also provided are cells comprising the polynucleotides of the disclosure or vectors of the disclosure. In some embodiments, the cell comprises a polynucleotide comprising a nucleotide sequence encoding an engineered gene effector or fusion protein of the present disclosure. In some embodiments, the cell comprises a vector comprising a nucleotide sequence encoding an engineered gene effector of the present disclosure. In some embodiments, the cell comprises a vector comprising a polynucleotide comprising a nucleotide sequence encoding a fusion protein of the present disclosure.
Packaging cells can be used to form viral particles capable of infecting host cells. Such cells may include 293 cells (e.g., for packaging adenovirus), and Psi2 cells or PA317 cells (e.g., for packaging retrovirus). Viral vectors can be generated by generating cell lines that package the nucleic acid vector into viral particles. The vector may contain minimal viral sequences required for packaging and subsequent integration into a host. The vector may comprise other viral sequences substituted with an expression cassette for the polynucleotide to be expressed. The deleted viral function may be provided in trans by the packaging cell line. For example, an AAV vector may comprise ITR sequences from the AAV genome that are necessary for packaging and integration into the host genome. Viral DNA may be packaged in a cell line that may contain helper plasmids encoding other AAV genes (i.e., rep and cap) while lacking ITR sequences. The cell line may also be infected with adenovirus as a helper. Helper viruses can promote replication of AAV vectors and expression of AAV genes from helper plasmids. Adenovirus contamination can be reduced by, for example, heat treatment (adenovirus is more sensitive to heat treatment than AAV).
Host cells may be transiently or non-transiently transfected with one or more of the vectors described herein. The cell may be transfected as it naturally occurs in the subject. Cells may be removed from or derived from a subject and transfected. The cells may be derived from cells (e.g., cell lines) taken from the subject. In some embodiments, cells transfected with one or more vectors described herein are used to establish new cell lines comprising one or more vector-derived sequences. In some embodiments, cells transiently transfected with the compositions of the present disclosure (e.g., transiently transfected with one or more vectors, or transfected with RNA) and modified by the activity of a heterologous polypeptide comprising the engineered gene effectors and heterologous endonucleases disclosed herein are used to establish a new cell line comprising cells containing the modifications but lacking any other exogenous sequences.
Any suitable vector that is compatible with the host cell may be used in the methods of the present disclosure. Non-limiting examples of vectors for eukaryotic host cells include pXT1, pSG5 (Stratagene TM), pSVK3, pBPV, pMSG and pSVLSV40 (Pharmacia TM).
In some embodiments, the polynucleotide may further comprise one or more promoters (or one or more transcriptional control elements as used interchangeably herein). The different active ingredients encoded by the polynucleotides may be under the control of the same promoter or under the control of different promoters. Promoters disclosed and used herein may be active in eukaryotic cells, mammalian cells, non-human mammalian cells, or human cells. The promoter may be an inducible or a constitutively active promoter. Alternatively or additionally, the promoter may be tissue or cell specific. Non-limiting examples of suitable eukaryotic promoters (i.e., promoters that function in eukaryotic cells) may include the Cytomegalovirus (CMV) immediate early promoter, the Herpes Simplex Virus (HSV) thymidine kinase promoter, the SV40 early and late promoters, the retroviral Long Terminal Repeat (LTR) promoter, the human elongation factor-1 promoter (EF 1), hybrid constructs comprising a Cytomegalovirus (CMV) enhancer fused to a chicken beta-active promoter (CAG), the murine stem cell virus promoter (MSCV), the phosphoglycerate kinase-1 locus Promoter (PGK), or the mouse metallothionein-I promoter. The promoter may be a fungal promoter. The promoter may be a plant promoter. A plant promoter database (e.g., plantProm) can be found. The vector may also contain a ribosome binding site for translation initiation and a transcription terminator. The vector may also contain suitable sequences for amplified expression. In some cases, the promoters disclosed herein may be any of the tissue-specific promoters provided herein, or any of the cell type-specific promoters provided herein.
The polynucleotides (e.g., a single heterologous polynucleotide) in a polynucleotide combination can have a size of at least or up to about 2.5 kilobases, at least or up to about 2.6 kilobases, at least or up to about 2.7 kilobases, at least or up to about 2.8 kilobases, at least or up to about 2.9 kilobases, at least or up to about 3.0 kilobases, at least or up to about 3.1 kilobases, at least or up to about 3.2 kilobases, at least or up to about 3.3 kilobases, at least or up to about 3.4 kilobases, at least or up to about 3.5 kilobases, at least or up to about 3.6 kilobases, at least or up to about 3.7 kilobases, at least or up to about 3.8 kilobases, at least or up to about 3.9 kilobases, at least or up to about 4.0 kilobases, at least or up to about 4.1 kilobases, at least or up to about 4.2 kilobases, at least or up to about 4.3 kilobases, at least or up to about 4.4 kilobases, at least or up to about 4.5 kilobases, at least or up to about 4.6 kilobases, at least or up to about 7.7 kilobases, at least or up to about 4.0, at least or up to about 5 kilobases, at least or up to about 4.0.7 kilobases, at least or up to about 5 kilobases, at least or up to about 4.5 kilobases, or up to about 5 kilobases, at least about 0.5 kilobases, or up to about 5 kilobases. In some cases, a heterologous polynucleotide (e.g., a single heterologous polynucleotide) of one or more heterologous polynucleotides can have a size of between about 3 kilobases and about 5 kilobases, between about 3 kilobases and about 4.8 kilobases, between about 3 kilobases and about 4.6 kilobases, between about 3 kilobases and about 4.4 kilobases, between about 3 kilobases and about 4.2 kilobases, between about 3 kilobases and about 4.0 kilobases, between about 3 kilobases and about 3.5 kilobases, between about 3.5 kilobases and about 5 kilobases, between about 3.5 kilobases and about 4.8 kilobases, between about 3.5 kilobases and about 4.6 kilobases, between about 3.5 kilobases and about 4.4 kilobases, between about 3.5 kilobases and about 4.2 kilobases, between about 3.5 kilobases and about 4 kilobases, between about 4 kilobases and about 4.2 kilobases, between about 3.5 kilobases and about 4.5 kilobases, between about 4.5 kilobases and about 4.8 kilobases, between about 4.5 kilobases and about 4.5 kilobases, between about 4.5 kilobases and about 4.8 kilobases.
System and method for controlling a system
In some aspects, disclosed herein are systems comprising an engineered gene effector, a guide nucleic acid, a heterologous endonuclease (e.g., an endonuclease that lacks or substantially lacks cleavage activity). The systems of the present disclosure can be used, for example, to bring one or more heterologous gene effectors into close proximity with a target gene (e.g., a target endogenous gene) or target gene regulatory sequence, thereby facilitating regulation of the level of expression or activity of the target gene.
In some embodiments, the systems of the present disclosure bind to DNA (e.g., genomic DNA). In some embodiments, the systems of the present disclosure bind to RNA (e.g., mRNA, microRNA, siRNA or non-coding RNA). In some embodiments, the systems of the present disclosure bind to DNA and RNA.
In some embodiments, the system can modulate (e.g., increase) expression and/or activity of a target gene (e.g., a target endogenous gene) by physically blocking a polynucleotide sequence (e.g., a promoter, enhancer, repressor, operator or silencer, insulator, cis-regulatory element, trans-regulatory element, epigenetic modification (e.g., DNA methylation) site or coding sequence).
In some embodiments, the system can modulate (e.g., increase) expression and/or activity of a target gene (e.g., a target endogenous gene) by recruiting additional factors that are effective to repress or enhance expression of the target gene.
In some embodiments, the systems of the present disclosure are used to introduce epigenetic modifications to a target gene (e.g., a target endogenous gene) or a target gene regulatory sequence (e.g., a promoter, enhancer, silencer, insulator, cis regulatory element, trans regulatory element, or epigenetic modification (e.g., DNA methylation) site). In some embodiments, the systems of the present disclosure are used to generate three-dimensional structures, topologically related domains, or genomic boundaries comprising a target gene or target gene regulatory sequence (e.g., a gene from the distal or proximal end of a target gene).
In some embodiments, modulation of a target gene (e.g., a target endogenous gene) by a system disclosed herein (e.g., a system comprising one or more heterologous gene effectors, heterologous endonucleases, and guide nucleic acids) can utilize endogenous target gene regulatory sequences operably coupled to the target gene, e.g., promoters, enhancers, repressors, silencers, insulators, cis-regulatory elements, trans-regulatory elements, epigenetic modification (e.g., DNA methylation) sites, and the like. Thus, such modulation of a target gene by a system may not involve and need not involve exogenous, synthetic, and/or heterologous regulatory sequences, such as promoters, enhancers, repressors, silencers, insulators, cis-regulatory elements, trans-regulatory elements, epigenetic modification (e.g., DNA methylation) sites, and the like, that are heterologous with respect to a subject or host cell. In some embodiments, the modulation of the target gene by the complex does not involve the use of engineered induction systems, repression systems, and/or reporting systems. In some embodiments, the modulation of a target gene by the system does not involve the use of exogenous, engineered, or synthetic regulatory elements, e.g., does not involve responsive elements that are modulated by tetracycline or an analog thereof. In some embodiments, the modulation of the target gene by the system does not involve the use of transactivators or reverse transactivators that function as part of the engineered induction system, repressor system, and/or reporter system. In some embodiments, the regulation of the target gene by the system does not involve a tetoff or tTA dependent system or components thereof. In some embodiments, the modulation of the target gene by the system does not involve a Tet On or rtTA dependent system or components thereof.
In some embodiments, the systems disclosed herein (e.g., complexes comprising one or more engineered gene effectors, heterologous endonucleases, and guide nucleic acids) may be capable of modulating a target gene (e.g., a target endogenous gene) without further control of the modulator (e.g., an agent that directly or indirectly allows the complex to increase or decrease expression of the target gene). In some embodiments, the system is capable of modulating a target gene without involving a transactivator, a reverse transactivator, a small molecule, a drug, a chemical inducer of dimerization or multimerization, an additional inducer, an additional repressor, or any combination thereof. For example, after a single system is introduced into a host cell (e.g., each individual component of a single complex is expressed and/or transfected into a host cell), such introduction may be sufficient to allow the single system to regulate the expression or activity of a target gene.
In some embodiments, the system comprises an engineered gene effector of the present disclosure, a heterologous endonuclease, and a guide nucleic acid. In some embodiments, the system comprises an engineered gene effector, a heterologous endonuclease, and a guide nucleic acid. In some embodiments, the system comprises two engineered gene effectors, a heterologous endonuclease, and a guide nucleic acid. In some embodiments, the system comprises more than three engineered gene effectors, a heterologous endonuclease, and a guide nucleic acid. In some embodiments, the system comprises an engineered gene effector, a heterologous endonuclease, and a guide. In some embodiments, the complex comprises two engineered gene effectors, a heterologous endonuclease, and a guide. In some embodiments, the complex comprises more than three engineered gene effectors, a heterologous endonuclease, and one guide.
In some embodiments, the system comprises an engineered gene effector, a heterologous endonuclease coupled to the engineered gene effector, and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene (e.g., a promoter or regulatory sequence of the target gene) in a cell when the system is expressed in the cell. In some embodiments, the system comprises an engineered gene effector, a heterologous endonuclease coupled to the engineered gene effector (wherein the heterologous endonuclease is a Cas protein), and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell. In some embodiments, the binding specificity is determined by using a guide nucleic acid, such as a single guide RNA (sgRNA) or a portion thereof. In some embodiments, the use of different sgrnas allows the compositions, combinations, systems, and methods of the present disclosure to be used with (e.g., targeted to) different target genes (e.g., target endogenous genes) or target gene regulatory sequences.
In some embodiments, the heterologous endonuclease (e.g., and thus by an engineered gene effector operably coupled to the heterologous endonuclease) forms a complex with a guide nucleic acid (e.g., a guide RNA or a portion thereof). In some embodiments, the heterologous endonuclease forms a complex with a unidirectional guide nucleic acid, such as unidirectional guide RNA (sgRNA). In some embodiments, the heterologous endonuclease is an RNA Binding Protein (RBP), which is optionally complexed with a guide nucleic acid (e.g., a guide RNA (e.g., sgRNA)) capable of forming a complex with a Cas protein. In some embodiments, the heterologous endonuclease is a nuclease deleted DNA binding protein that can induce transcriptional activation or repression of the target DNA sequence. In some embodiments, the heterologous endonuclease is an RNA-derived nuclease-deleted RNA-binding protein.
The guide nucleic acids used in the compositions, combinations, systems and methods of the present disclosure can be, for example, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39 or at least 40 nucleotides.
In some embodiments, the guide nucleic acids used in the compositions, combinations, systems, and methods of the present disclosure are up to 10, up to 11, up to 12, up to 13, up to 14, up to 15, up to 16, up to 17, up to 18, up to 19, up to 20, up to 21, up to 22, up to 23, up to 24, up to 25, up to 26, up to 27, up to 28, up to 29, up to 30, up to 31, up to 32, up to 33, up to 34, up to 35, up to 36, up to 37, up to 38, up to 39, or up to 40 nucleotides.
In some embodiments, the guide nucleic acids used in the compositions, combinations, systems, and methods of the present disclosure are from about 8 to about 40 nucleotides, from about 10 to about 40 nucleotides, from about 11 to about 40 nucleotides, from about 12 to about 40 nucleotides, from about 13 to about 40 nucleotides, from about 14 to about 40 nucleotides, from about 15 to about 40 nucleotides, from about 16 to about 40 nucleotides, from about 17 to about 40 nucleotides, from about 18 to about 40 nucleotides, from about 19 to about 40 nucleotides, from about 20 to about 40 nucleotides, from about 22 to about 40 nucleotides, About 24 to about 40 nucleotides, about 26 to about 40 nucleotides, about 28 to about 40 nucleotides, about 30 to about 40 nucleotides, about 8 to about 30 nucleotides, about 10 to about 30 nucleotides, about 11 to about 30 nucleotides, about 12 to about 30 nucleotides, about 13 to about 30 nucleotides, about 14 to about 30 nucleotides, about 15 to about 30 nucleotides, about 16 to about 30 nucleotides, about 17 to about 30 nucleotides, about 18 to about 30 nucleotides, about 19 to about 30 nucleotides, about 20 to about 30 nucleotides, about 22 to about 30 nucleotides, about 24 to about 30 nucleotides, about 26 to about 30 nucleotides, about 28 to about 30 nucleotides, about 8 to about 25 nucleotides, about 10 to about 25 nucleotides, about 11 to about 25 nucleotides, about 12 to about 25 nucleotides, about 13 to about 25 nucleotides, about 14 to about 25 nucleotides, about 15 to about 25 nucleotides, about 16 to about 25 nucleotides, about 17 to about 25 nucleotides, about 18 to about 25 nucleotides, about 19 to about 25 nucleotides, about 20 to about 25 nucleotides, About 22 to about 25 nucleotides, about 24 to about 25 nucleotides, about 8 to about 20 nucleotides, about 10 to about 20 nucleotides, about 11 to about 20 nucleotides, about 12 to about 20 nucleotides, about 13 to about 20 nucleotides, about 14 to about 20 nucleotides, about 15 to about 20 nucleotides, about 16 to about 20 nucleotides, about 17 to about 20 nucleotides, about 18 to about 20 nucleotides, about 19 to about 20 nucleotides, about 8 to about 18 nucleotides, about 10 to about 18 nucleotides, about 11 to about 18 nucleotides, About 12 to about 18 nucleotides, about 13 to about 18 nucleotides, about 14 to about 18 nucleotides, about 15 to about 18 nucleotides, about 16 to about 18 nucleotides, about 8 to about 16 nucleotides, about 10 to about 16 nucleotides, about 11 to about 16 nucleotides, about 12 to about 16 nucleotides, about 13 to about 16 nucleotides, about 14 to about 16 nucleotides, or about 15 to about 16 nucleotides. in some embodiments, the guide nucleic acid is a guide RNA or a portion thereof.
In some embodiments, the system comprises any of the engineered gene effectors of the present disclosure, a heterologous endonuclease coupled to a polypeptide of the engineered gene effector (wherein the heterologous endonuclease comprises the amino acid sequence of any of SEQ ID NOS: 2222-2422), and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell. In some embodiments, the system comprises any of the engineered gene effectors of the present disclosure, a heterologous endonuclease fused to a polypeptide of the engineered gene effector, and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell. In some embodiments, the system comprises any of the engineered gene effectors of the present disclosure, a heterologous endonuclease fused to a polypeptide of the engineered gene effector (wherein the heterologous endonuclease is a Cas protein described herein), and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell. In some embodiments, the engineered gene effector polypeptide is fused to the C-terminus of the heterologous endonuclease (directly or indirectly through a linker).
In some embodiments, the system comprises an active ingredient (e.g., an engineered gene effector, a heterologous endonuclease, a guide nucleic acid, etc.), and optionally additional ingredients (e.g., excipients). The system may be segmented, shaped and/or packaged as necessary and/or desired as a single or multiple dose unit, or as a single or multiple implant unit.
Also provided are polynucleotide combinations encoding any of the systems of the present disclosure, wherein the polynucleotide combinations are configured to express in a cell a heterologous endonuclease coupled to an engineered gene effector and a guide nucleic acid.
Kit for detecting a substance in a sample
Also provided are kits comprising any one or more of the engineered gene effectors, fusion proteins, combinations, systems, polynucleotides, vectors, and/or cells of the disclosure. In some embodiments, the kit comprises instructions directing the cells to be contacted with any of the compositions (e.g., in vitro, ex vivo, or in vivo). The information and instructions may be in the form of text, pictures, or both, etc. In some embodiments, the kit comprises one or more containers (e.g., vials, ampoules, tubes, flasks, or bottles) for containing one or more components of the kit.
In some aspects, the disclosure provides a kit comprising any of the compositions disclosed herein (e.g., a combination of systems or polynucleotides) and instructions for (i) contacting a cell with any of the compositions (e.g., in vitro, ex vivo, or in vivo), or (ii) administering a cell comprising any of the compositions disclosed herein to a subject. The subject may have or be suspected of having a disorder (e.g., a genetic disease).
Method for regulating gene expression
Also provided herein are methods of controlling a target gene in a cell, the method comprising contacting the cell with a combination of any of the polynucleotides described herein or any of the systems described herein. In some embodiments, a method of controlling a target gene in a cell comprises contacting a cell of interest (e.g., a cell in which it is desired to regulate expression of the target gene) with any of the systems of the present disclosure. In some embodiments, a method of controlling a target gene in a cell comprises contacting the cell with a combination of any of the polynucleotides of the present disclosure. In some embodiments, the target gene is endogenous to the cell.
The present disclosure provides compositions, methods, and systems for modulating the expression of one or more target genes. Any suitable gene may be the target gene. The target gene may be one or more heterologous target genes. The target gene may be one or more endogenous target genes, such as (i) a pathogenic allele, e.g., a mutant allele, and/or (ii) a non-pathogenic allele, e.g., a wild-type allele. For example, disclosed herein are systems comprising a guide nucleic acid and one or more heterologous polypeptides or fusion proteins comprising an engineered gene effector and a heterologous endonuclease that can modulate (e.g., increase) the activity level or expression level of a target gene (e.g., in a cell).
In some embodiments, the target gene or regulatory sequence thereof is endogenous to the cell, e.g., present in the genome of the cell, or endogenous to the subject, e.g., present in the genome of the subject. In some embodiments, the target gene or regulatory sequence thereof is not part of an engineered reporter system.
In some embodiments, the target gene is exogenous to the host subject, e.g., a pathogen target gene, or an exogenous gene expressed as a result of a therapeutic intervention (e.g., gene therapy and/or cell therapy). In some embodiments, the target gene is an exogenous reporter gene. In some embodiments, the target gene is an exogenous synthetic gene.
In some embodiments, the systems, compositions, combinations, and methods disclosed herein are used to modulate (e.g., increase) expression of a target gene (e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide into a cell or cell population). In some embodiments, the expression level is an RNA expression level, which is determined by, for example, RNAseq, qPCR, microarray, gene array, FISH, or the like. In some embodiments, the expression level is a protein expression level, as determined by, for example, western blot, ELISA, multiplex immunoassay, mass spectrometry, NMR, proteomics, flow cytometry, or mass spectrometry flow cytometry, or the like.
In some embodiments, the target gene (e.g., endogenous target gene) is a mutant of a pathogenic allele, such as a wild-type allele. The disease may be a genetic disease, such as a genetic disorder. Non-limiting examples of genetic disorders may include Duchenne Muscular Dystrophy (DMD), hemophilia, cystic fibrosis, huntington's disease, familial hypercholesterolemia (LDL receptor deficiency), hepatoblastoma, wilson's disease, congenital hepatoporphyrin, hereditary liver metabolic disorders, LESCH NYHAN syndrome, sickle cell anemia, thalassemia, xeroderma pigmentosum, fanconi anemia, retinal pigment degeneration, ataxia telangiectasia, bloom syndrome, retinoblastoma, or tay-saxophone. In some cases, the target gene is a gene encoding a protein. In some cases, the target gene is a gene regulatory sequence (e.g., promoter, enhancer, repressor, silencer, insulator, cis-regulatory element, trans-regulatory element, epigenetic modification (e.g., DNA methylation) site, etc.) that affects the expression of a gene encoding a protein of interest provided herein. For example, the target gene regulatory sequence may be physically located outside of the transcriptional unit or open reading frame encoding the target gene product.
In some embodiments, the target gene regulatory sequence does not comprise a nucleotide sequence that is exogenous to the subject or host cell. In some embodiments, the target gene regulatory sequence does not comprise an engineered or artificially generated or introduced nucleotide sequence.
In some embodiments, the target gene (e.g., a target endogenous gene) is a gene that is over-expressed or under-expressed in a disease or disorder. In some embodiments, the target gene is a gene that is over-expressed or under-expressed in a genetic disease.
In some embodiments, the target gene (e.g., a target endogenous gene) is a gene that is overexpressed or underexpressed in the cancer, e.g., acute leukemia, astrocytoma, biliary tract cancer (cholangiocarcinoma), bone cancer, breast cancer, brain stem glioma, bronchioloalveolar lung cancer, adrenal gland cancer, anal region cancer, bladder cancer, endocrine system cancer, esophageal cancer, head and neck cancer, kidney cancer, parathyroid cancer, penile cancer, pleural/peritoneal cancer, salivary gland cancer, small intestine cancer, thyroid cancer, ureter cancer, urinary tract cancer, cervical cancer, endometrial cancer, fallopian tube cancer, renal pelvis cancer, vaginal cancer, vulval cancer, cervical cancer, chronic leukemia, colon cancer, colorectal cancer, skin melanoma, ependymoma, epidermoid tumor, ewing's sarcoma, gastric cancer, glioblastoma multiforme, glioma hematological malignancy, hepatocellular (liver) carcinoma, hepatocellular carcinoma, hodgkin's disease, intraocular melanoma, kaposi's sarcoma, lung cancer, lymphoma, medulloblastoma, melanoma, meningioma, mesothelioma, multiple myeloma, muscle carcinoma, central Nervous System (CNS) neoplasm, neuronal carcinoma, small cell lung carcinoma, non-small cell lung carcinoma, osteosarcoma, ovarian carcinoma, pancreatic carcinoma, pediatric malignancy, pituitary adenoma, prostate carcinoma, rectal carcinoma, renal cell carcinoma, soft tissue sarcoma, schwannoma, skin carcinoma, spinal axis tumor, squamous cell carcinoma, gastric carcinoma, synovial sarcoma, testicular carcinoma, uterine carcinoma, or tumor, or metastasis thereof, including refractory types of any of the cancers described above, or combinations thereof.
Non-limiting examples of target genes or genes encoding proteins of interest for use in some embodiments herein are disclosed in table 6.
In some embodiments, the methods disclosed herein activate the expression level of a target gene in a cell as compared to a control. In some embodiments, the method of contacting a cell with an engineered gene effector (e.g., an engineered gene activator) disclosed herein activates (e.g., alone or together with a heterologous endonuclease, such as Cas or dCas protein, e.g., an inactivated variant of Un1Cas12f1 comprising the polypeptide sequence of SEQ ID NO: 2231) the expression level of a target gene in the cell as compared to a control. The control can be the expression level (e.g., basal expression level) of the target gene in the control cell without any external manipulation of the target gene (e.g., the absence of engineered gene effectors and/or heterologous endonucleases). The control may be the expression level of the target gene in a control cell that is treated with a separate heterologous endonuclease (e.g., dCas) without any gene effector operatively coupled to the gene effector. The control can be the expression level of the target gene in a control cell that is treated with a control gene effector (e.g., alone or with a heterologous endonuclease). Non-limiting examples of control gene effectors may include VP16, VP64, p65, rta, and VPR. In some embodiments, the target gene is a silenced gene, e.g., a target gene at a methylation site. In some embodiments, the target gene is a methylated gene.
In some embodiments, the methods disclosed herein increase, about or increase the expression level of a target gene by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 250%, 300%, 400% or 500%, or more, or optionally, an engineered gene effector (e.g., with a heterologous endonuclease) is capable of increasing the expression level of a target gene by a percentage (e.g., 10% -100%, 100% -200%, 200% -400%, 250% -500%, 10% -50%, 50% -100%, etc.) within a range defined by any two of the foregoing values. In some embodiments, the methods comprise contacting a cell with an engineered gene effector of the present disclosure (e.g., with a heterologous endonuclease) to increase, at least or increase, by about 10%, at least or about 20%, at least or about 30%, at least or about 40%, at least or about 50%, at least or about 60%, at least or about 70%, at least or about 80%, at least or about 90%, at least or about 100%, at least or about 200%, at least or about 250%, at least or about 300%, at least or about 400%, or at least or about 500%, or more, the expression level of the target gene by a percentage (e.g., 10% -100%, 100% -200%, 200% -400%, 250% -500%, 10% -50%, 50% -100%, etc.) within a range defined by any two of the foregoing values.
In some embodiments, the engineered gene effector is capable of increasing the expression level of the target gene by at least about 0.1-fold, at least about 0.2-fold, at least about 0.3-fold, at least about 0.4-fold, at least about 0.5-fold, at least about 0.6-fold, at least about 0.7-fold, at least about 0.8-fold, at least about 0.9-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.5-fold, at least about 3-fold, at least about 3.5-fold, at least about 4-fold, at least about 4.5-fold, at least about 6-fold, at least about 7-fold, at least about 8-fold, at least about 9-fold, at least about 10-fold, at least about 15-fold, at least about 20-fold, at least about 30-fold, or at least about 50-fold.
In some embodiments, the engineered gene effector is capable of increasing the expression level of the target gene by at least about 0.1-fold, at least about 0.2-fold, at least about 0.3-fold, at least about 0.4-fold, at least about 0.5-fold, at least about 0.6-fold, at least about 0.7-fold, at least about 0.8-fold, at least about 0.9-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 1.6-fold, at least about 1.7-fold, at least about 1.8-fold, at least about 1.9-fold, at least about 2-fold, at least about 2.5-fold, at least about 3-fold, at least about 3.5-fold, at least about 4-fold, at least about 4.5-fold, at least about 5-fold, at least about 6-fold, at least about 7-fold, at least about 8-fold, at least about 9-fold, at least about 10-fold, at least about 15-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, or at least about 50-fold, wherein the engineered gene effector is capable of increasing the expression level of the target gene by a factor value (e.g., 0.1-1.1-fold, 1-3-fold, 2.5-5-fold, 5-10-fold, 10-50-fold, 0.1-3-fold, 1-10-fold, or 0.1-20-fold) within a range defined by any two of the foregoing values.
In some embodiments, the target gene expression level increased by the engineered gene effector disclosed herein can be substantially sustained (e.g., at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 99%, or about 100%) for at least or up to about 1 hour, at least or up to about 2 hours, at least or up to about 6 hours, at least or up to about 12 hours, at least or up to about 18 hours, at least or up to about 24 hours, at least or up to about 2 days, at least or up to about 3 days, at least or up to about 4 days, at least or up to about, At least or up to about 5 days, at least or up to about 6 days, at least or up to about 7 days, at least or up to about 8 days, at least or up to about 9 days, at least or up to about 10 days, at least or up to about 11 days, at least or up to about 12 days, at least or up to about 13 days, at least or up to about 14 days, at least or up to about 3 weeks, at least or up to about 4 weeks, at least or up to about 2 months, at least or up to about 4 months, or at least or up to about 6 months. In some embodiments, the engineered gene effector is capable of activating a target gene in a cell (e.g., when the engineered gene effector is expressed in a cell and is effective to target a site of the target gene, e.g., as provided herein, when coupled or fused to a heterologous endonuclease complexed with a guide nucleic acid), wherein the expression level of the target gene activated by the engineered gene effector is for a period of time that is, about or at least 9 days, 10 days, 11 days, 12 days, 13 days, 14 days, 15 days, 16 days, 17 days, 18 days, or more, or optionally, the expression level is for a period of time that is within a range defined by any two of the foregoing values (e.g., 9-18 days, 18 days, 9-14 days, 12-18 days, 14-16 days, etc.). In some embodiments, the engineered gene effector is capable of activating a target gene in a cell (e.g., when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, e.g., when coupled or fused to a heterologous endonuclease complexed with a guide nucleic acid, as provided herein), wherein the level of expression of the target gene activated by the engineered gene effector lasts for a period of 9-14 days. In some embodiments, the engineered gene effector is capable of activating a target gene in a cell (e.g., when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, e.g., when coupled or fused to a heterologous endonuclease complexed with a guide nucleic acid, as provided herein), wherein the level of expression of the target gene activated by the engineered gene effector lasts for a period of 9-18 days. In some embodiments, the engineered gene effector is capable of activating a target gene in a cell (e.g., when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, e.g., when coupled or fused to a heterologous endonuclease complexed with a guide nucleic acid, as provided herein), wherein the level of expression of the target gene activated by the engineered gene effector lasts for about 14 days. In some embodiments, sustained activation comprises sustaining or maintaining at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 99% or about 100% of the peak or regulated level, or optionally, the percentage of sustained activation comprising the peak or regulated level is within a range defined by any two of the foregoing values (e.g., 50% -100%, 70% -95%, 80% -90% or 60% -95%, etc.). In some embodiments, sustained activation comprises sustaining or maintaining at least 50% of peak or regulated levels. In some embodiments, sustained activation comprises sustaining or maintaining at least 75% of peak or regulated levels. In some embodiments, sustained activation comprises sustaining or maintaining at least 90% of peak or regulated levels. In some embodiments, sustained activation comprises sustaining or maintaining at least 95% of peak or regulated levels.
In some embodiments, the engineered gene effectors disclosed herein (e.g., engineered gene activators) repress (e.g., alone or together with a heterologous endonuclease, such as a Cas or dCas protein, e.g., an inactivated variant of Un1Cas12f1 comprising the polypeptide sequence of SEQ ID NO: 2231) the expression level of the target gene in the cell as compared to the control.
In some embodiments, the cells provided herein can be referred to as target cells. In some embodiments, the systems and methods provided herein are applied to modify a target cell (e.g., to modify the expression profile of a target gene of a target cell, such as one or more genes in table 6). The target cells may include a variety of cell types. The target cells may be in vitro. The target cell may be in vivo. The target cells may be ex vivo. The target cell may be an isolated cell. The target cell may be a cell in an organism. The target cell may be an organism. The target cell may be a cell in a cell culture. The target cell may be one of a collection of cells. The target cell may be a mammalian cell or derived from a mammalian cell. The target cell may be a rodent cell or derived from a rodent cell. The target cell may be a human cell or derived from a human cell. The target cells may be prokaryotic cells or derived from prokaryotic cells. The target cell may be a bacterial cell or may be derived from a bacterial cell. The target cell may be an archaebacterium cell or derived from an archaebacterium cell. The target cell may be a eukaryotic cell or derived from a eukaryotic cell. The target cells may be pluripotent stem cells. The target cell may be a plant cell or derived from a plant cell. The target cell may be an animal cell or derived from an animal cell. The target cell may be an invertebrate cell or derived from an invertebrate cell. The target cell may be a vertebrate cell or derived from a vertebrate cell. The target cell may be a microbial cell or derived from a microbial cell. The target cell may be a fungal cell or derived from a fungal cell. The target cells may be from a particular organ or tissue.
The target cells may be stem cells or progenitor cells. The target cells may include stem cells (e.g., adult stem cells, embryonic stem cells, induced Pluripotent Stem (iPS) cells) and progenitor cells (e.g., cardiac progenitor cells, neural progenitor cells, etc.). The target cells may comprise mammalian stem cells and progenitor cells, including rodent stem cells, rodent progenitor cells, human stem cells, human progenitor cells, and the like. The cloned cells may comprise the progeny of the cells. The target cell may comprise a target nucleic acid. The target cell may be in a living organism. The target cell may be a genetically modified cell. The target cell may be a host cell.
The target cell may be a primary cell. For example, the primary cell culture may be passaged 0, 1, 2, 4, 5, 10, 15, or more times. The cell may be a unicellular organism. Cells may be grown in culture.
The target cell may be a diseased cell. The diseased cells may have altered metabolic, gene expression, and/or morphological characteristics. The diseased cells may be cancer cells, diabetic cells, and apoptotic cells. The diseased cell may be a cell from a diseased subject. Exemplary diseases may include blood disorders, cancers, metabolic disorders, ocular disorders, organ disorders, musculoskeletal disorders, heart diseases, and the like.
If the target cells are primary cells, they may be obtained from the individual by any method. For example, the white blood cells may be obtained by apheresis, white blood cell separation, density gradient separation, or the like. Cells from tissue (e.g., skin, muscle, bone marrow, spleen, liver, pancreas, lung, intestine, stomach, etc.) can be obtained by biopsy.
Non-limiting examples of cells that can be used as target cells include, but are not limited to, lymphocytes such as B cells, T cells (cytotoxic T cells, natural killer T cells, regulatory T cells, helper T cells), natural killer cells, cytokine-induced killer (CIK) cells, myeloid cells such as granulocytes (basophils, eosinophils, neutrophils/multi-lobular neutrophils), monocytes/macrophages, erythrocytes (reticulocytes), mast cells, platelets/megakaryocytes, dendritic cells, cells from the endocrine system including thyroid cells (thyroid epithelium, follicular paracellular), cells, Parathyroid cells (parathyroid main cells, eosinophils), adrenal cells (pheochromocytes), pineal cells (pineal gland cells), cells of the nervous system including glial cells (astrocytes, microglia), large-cell neurosecretory cells (Magnocellular neurosecretory cell), astrocytes, boettcher cells and pituitary (gonadotrophin cells, corticotropin cells, thyroid stimulating hormone cells, somatic cells, lactogen cells), cells of the respiratory system including lung cells (type I lung cells, Type II lung cells), clara cells, goblet cells, dust cells, cells of the circulatory system including cardiomyocytes, pericytes, cells of the digestive system including gastric cells (gastric host cells, parietal cells), goblet cells, pan' S cells, G cells, D cells, ECL cells, I cells, K cells, S cells, enteroendocrine cells including enterochromaffin cells, APUD cells, liver (hepatocytes, kupffer cells), cartilage/bone/muscle cells, bone cells including osteoblasts, osteocytes, osteoclasts, teeth (cementoblasts, ameloblasts), cartilage cells including chondroblasts, Chondrocytes, skin cells including hair cells, keratinocytes, melanocytes (nevi cells), muscle cells including muscle cells, urinary system cells including podocytes, periglomerular cells, mesangial/mesangial cells, perirenal tubular brush border cells, compact plaque cells, germ system cells including sperm, supporting cells, interstitial cells, ovum, and other cells including adipocytes, fibroblasts, tendon cells, epidermal keratinocytes (differentiated epidermal cells), epidermal basal cells (stem cells), nail and toenail keratinocytes, nail bed basal cells (stem cells), nail, Medullary hair stem cells, cortical hair stem cells, epidermal root sheath cells, huxley's layered root sheath cells, henle's layered root sheath cells, outer root sheath cells, hair matrix cells (stem cells), wet-layered barrier epithelial cells (WET STRATIFIED Barrier EPITHELIAL CELLS), surface epithelial cells of the multilayer squamous epithelium of the cornea, tongue, mouth, esophagus, anal canal, distal urethra and vagina, surface epithelial cells of the cornea, tongue, mouth, esophagus, anal canal, distal urethra and vagina, urothelial basal cells (stem cells), urothelial cells (lining bladder and urethra), and combinations thereof, Exocrine epithelial cells, salivary gland mucus cells (polysaccharide-rich secretions), salivary gland serous cells (glycoprotein-rich secretions), von Ebner's gland cells of the tongue (flushing taste buds), breast cells (milk secretion), lacrimal gland cells (lacrimal secretion), earwax gland cells of the ear (cerumen secretion), exocrine sweat gland dark cells (glycoprotein secretion), exocrine sweat gland clear cells (small molecule secretion), apocrine sweat gland cells (odor secretion, sex hormone sensitivity), moll gland cells of the eyelid (specialised sweat glands), sebaceous gland cells (lipid-rich sebum secretion), bowman's gland cells of the nose (irrigation of olfactory epithelium), and, Brunner's gland cells of the duodenum (enzyme and alkaline mucus), seminal vesicle cells (secretion of semen components including fructose for sperm motility), prostate cells (secretion of semen components), urinary tract bulbar gland cells (mucus secretion), bartholin's gland cells (vaginal lubrication secretion), littre gland cells (mucus secretion), endometrial cells (carbohydrate secretion), isolated goblet cells of the respiratory and digestive tracts (mucus secretion), gastric mucosa mucus cells (mucus secretion), gastric gland zymogen cells (pepsinogen secretion), gastric gland acid secretion cells (hydrochloric acid secretion), pancreatic gland acinar cells (bicarbonate and digestive enzyme secretion), and, Small intestine panus cells (lysozyme secretion), lung type II lung cells (surface active substance secretion), lung Clara cells, hormone secreting cells, pituitary anterior lobe cells, parent somatic cells, prolactin cells, thyroid stimulating hormone cells, gonadotrophin cells, corticotropin cells, pituitary intermediate cells, large cell nerve secreting cells, intestinal and respiratory tract cells, thyroid epithelial cells, follicular paracellular cells, parathyroid main cells, eosinophils, adrenal cells, pheochromocytes, testicular mesenchyme cells, follicular intimal cells, ruptured follicular corpus luteum cells, granulosa corpus luteum cells, follicular pericytes, Membranous corpus luteum cells, glomerular cells (renin secretion), renal compact plaque cells, metabolic and storage cells, cells of barrier function (lung, intestinal tract, exocrine glands and genitourinary tract), kidneys, type I lung cells (lining air cavities of the lung), pancreatic duct cells (acinar cells), (sweat glands, salivary glands, mammary glands, etc.) non-striated duct cells, (seminal vesicles, prostate, etc.) duct cells, epithelial cells lining closed internal body cavities, ciliated cells with propulsion function, extracellular matrix secreting cells, contractile cells, skeletal muscle cells, stem cells, myocardial cells, blood and immune system cells, erythrocytes (red blood cells), megakaryocytes (platelet precursors), Monocytes, connective tissue macrophages (types), epidermal langerhans cells, osteoclasts (in bone), dendritic cells (in lymphoid tissue), microglia (in central nervous system), neutrophils, eosinophils, basophils, mast cells, helper T cells, suppressor T cells, cytotoxic T cells, natural killer T cells, B cells, natural killer cells, reticulocytes, stem cells and committed progenitors of the blood and immune system (types), pluripotent stem cells, totipotent stem cells, induced pluripotent stem cells, adult stem cells, sensory conducting cells, autonomic neurons cells, Sensory and peripheral neuron support cells, central nervous system neurons and glial cells, lens cells, pigment cells, melanocytes, retinal pigment epithelial cells, germ cells, oogonial/oocyte, sperm cells, spermatocyte, spermatogonial cells (stem cells of spermatocyte), sperm, trophoblast or follicular cells.
In some embodiments, a method of controlling a target gene in a cell comprises contacting a cell (e.g., a cell of interest for regulating expression of the target gene) with any of the systems of the present disclosure, wherein contacting is performed in vitro. In some embodiments, a method of controlling a target gene in a cell comprises contacting the cell with a combination of polynucleotides of the present disclosure, wherein contacting is performed in vitro. In some embodiments, a method of controlling a target gene in a cell comprises contacting a cell (e.g., a cell of interest for regulating expression of the target gene) with any of the systems of the present disclosure, wherein the contacting is performed ex vivo. In some embodiments, a method of controlling a target gene in a cell comprises contacting the cell with a combination of polynucleotides of the present disclosure, wherein the contacting is performed ex vivo.
In some embodiments, the cells (or target cells) may be engineered to comprise (or exhibit) any of the systems or compositions disclosed herein, or may be treated in vitro or ex vivo by any of the methods disclosed herein, and then administered to a subject, e.g., for treating or inhibiting a disorder in a subject. For example, any subject's modified cell product can be administered to a subject to treat or inhibit a disorder of the subject's body tissue. In some embodiments, the cells may reside in a subject, and either system or a composition thereof may be administered to the subject, thereby contacting the system/composition with the cells (e.g., to engineer the cells with the system/composition).
In some embodiments, any of the compositions disclosed herein can be administered to a subject orally, intraperitoneally, intravenously, intraarterially, transdermally, intramuscularly, liposomally, locally by catheter or stent delivery, subcutaneously, intrafat, or intrathecally. In some embodiments, the compositions and systems provided herein (including polynucleotides encoding the compositions and systems, e.g., contained in AAV vectors) can be administered to a subject by intravenous administration.
The engineered gene effectors, fusion proteins, polynucleotides, systems, and/or combinations disclosed herein may be provided in compositions (e.g., pharmaceutical compositions) suitable for administration to humans. Furthermore, such compositions may be suitable for administration to any other animal, such as a non-human animal (e.g., a non-human mammal). It is well known to modify pharmaceutical compositions suitable for administration to humans to render the compositions suitable for administration to a variety of animals, and a ordinarily skilled veterinary pharmacologist may design and/or make such modifications using only routine experimentation, if any. Subjects contemplated for administration of the pharmaceutical compositions include, but are not limited to, humans and/or other primates, mammals, including commercially relevant mammals, such as cows, pigs, horses, sheep, cats, dogs, mice and/or rats, and/or birds, including commercially relevant birds, such as poultry, chickens, ducks, geese and/or turkeys.
The heterologous endonuclease in the system may be provided in any suitable form. For example, the heterologous endonuclease may be provided in the form of a protein, such as a heterologous endonuclease alone, or in complex with a guide nucleic acid as a ribonucleoprotein. The heterologous endonuclease may be provided in the form of a complex, for example, complexed with a guide nucleic acid and/or one or more heterologous gene effectors of the present disclosure. The heterologous endonuclease may be provided in the form of a nucleic acid encoding at least the heterologous endonuclease, such as RNA (e.g., messenger RNA (mRNA)) or DNA. Nucleic acids encoding at least heterologous endonucleases can be codon optimized for efficient translation into a protein (e.g., optimized human codons) in a particular cell or organism.
The nucleic acids, fragments or derivatives thereof disclosed herein encoding at least heterologous endonucleases can be stably integrated into the cell genome. The nucleic acid encoding at least the heterologous endonuclease may be operably linked to a promoter, e.g., a promoter that is constitutively or inductively activated in the cell. The nucleic acid encoding at least the heterologous endonuclease may be operably linked to a promoter in an expression construct. The expression construct may comprise any nucleic acid construct capable of directing expression of a gene or other nucleic acid sequence of interest (e.g., at least a heterologous endonuclease), and such nucleic acid sequence of interest may be transferred to a target cell.
In some embodiments, a heterologous endonuclease disclosed herein can bind to a single guide RNA (sgRNA) to activate or repress transcription of a target gene (e.g., a target endogenous gene), e.g., to a heterologous gene effector disclosed herein. The sgrnas can be introduced into cells expressing the heterologous endonucleases or variants thereof provided herein. In some cases, such cells can comprise one or more different sgrnas that target the same target gene (e.g., target endogenous gene) or target gene regulatory sequence. In other cases, the sgrnas target different nucleic acids in the cell (e.g., different target genes, different target gene regulatory sequences, or different sequences within the same target genes or target gene regulatory sequences).
In some embodiments, the target nucleic acid of a heterologous endonuclease disclosed herein can be dsDNA. In these embodiments, dsDNA targeting specificity is determined, at least in part, by two parameters, targeting the gRNA spacer of the protospacer in the target dsDNA (the sequence in the target dsDNA corresponding to the gRNA spacer on the non-complementary DNA strand), and the short sequence immediately 5' (upstream) of the protospacer on the non-complementary DNA strand, i.e., the Protospacer Adjacent Motif (PAM). In some embodiments, PAM is 5'-TTTG-3' or 5'-TTTA-3'. In some embodiments, PAM is 5'-TTTG-3'. In some embodiments, PAM is 5'-TTTA-3'.
In some embodiments, the target nucleic acid of a heterologous endonuclease disclosed herein can be RNA. In these embodiments, RNA targeting specificity is determined, at least in part, by the gRNA spacer targeting the protospacer-like sequence in the target RNA (the sequence in the target RNA that is complementary to the gRNA spacer), and is independent of the sequence immediately 5' (upstream) of the protospacer-like sequence. In some embodiments, the heterologous endonuclease may be further capable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that it targets a protospacer in the target dsDNA molecule, said protospacer having a PAM selected from the group consisting of 5'-TTTG-3' and 5 '-TTTA-3'. In other embodiments, the heterologous endonuclease is not capable of targeting a dsDNA molecule, wherein the gRNA spacer is selected such that none of the protospacer sequences in the dsDNA molecule targeted by the gRNA spacer have PAM selected from the group consisting of 5'-TTTG-3' and 5 '-TTTA-3'.
In some embodiments, a heterologous polypeptide (e.g., and/or a complex comprising a heterologous polypeptide) comprising an engineered gene effector and a heterologous endonuclease can regulate expression and/or activity of a target gene (e.g., a target endogenous gene). In some embodiments, the heterologous polypeptide and/or complex thereof can edit the sequence of a nucleic acid (e.g., a gene and/or gene product). Variants of the nuclease activity of the heterologous endonuclease can edit the nucleic acid sequence by creating a double-strand break or single-strand break in the target polynucleotide.
In some embodiments, a heterologous polypeptide (e.g., and/or a complex comprising a heterologous polypeptide) comprising an engineered gene effector and a heterologous endonuclease can create a double strand break in a target polynucleotide (e.g., DNA). Double strand breaks in DNA can cause DNA break repair, which allows for the introduction of genetic modifications (e.g., nucleic acid editing). In some embodiments, the nuclease induces site-specific single-stranded DNA breaks or nicks, thereby causing HDR.
Double strand breaks in DNA can cause DNA break repair, which allows for the introduction of genetic modifications (e.g., nucleic acid editing). DNA break repair may occur through non-homologous end joining (NHEJ) or Homology Directed Repair (HDR). In HDR, a donor DNA repair template or template polynucleotide comprising sites flanking homology arms of the target DNA may be provided.
In some embodiments, a heterologous polypeptide (e.g., and/or a complex comprising a heterologous polypeptide) comprising an engineered gene effector and a heterologous endonuclease does not create a double strand break in a target polynucleotide (e.g., DNA). The binding of the heterologous polypeptide or a complex comprising the heterologous polypeptide (e.g., a complex comprising a nuclease-deficient variant of a heterologous endonuclease and a guide RNA) without nucleic acid cleavage can be sufficient to regulate expression (e.g., enhance or repress) of a target gene (e.g., an endogenous target gene).
In some embodiments, the methods disclosed herein (e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide into a cell or population of cells) modulate (e.g., increase) expression of a target gene by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 6-fold, at least about 7-fold, at least about 8-fold, at least about 9-fold, at least about 10-fold, at least about 11-fold, at least about 12-fold, at least about 13-fold, at least about 14-fold, at least about 15-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 60-fold, at least about 70-fold, at least about 80-fold, at least about 90-fold, at least about 100-fold, at least about 150-fold, at least about 200-fold, at least about 250-fold, at least about 300-fold, at least about 350-fold, at least about 400-fold, at least about 500-fold, at least about 600-fold, at least about 700-fold, at least about 800-fold, or at least about 1000-fold.
In some embodiments, the methods disclosed herein (e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide into a cell or population of cells) modulate (e.g., increase) expression of a target gene by at most about 50%, at most about 60%, at most about 70%, at most about 80%, at most about 90%, at most about 2-fold, at most about 3-fold, at most about 4-fold, at most about 5-fold, at most about 6-fold, at most about 7-fold, at most about 8-fold, at most about 9-fold, at most about 10-fold, at most about 11-fold, at most about 12-fold, at most about 13-fold, at most about 14-fold, at most about 15-fold, at most about 20-fold, at most about 30-fold, at most about 40-fold, at most about 50-fold, at most about 60-fold, at most about 70-fold, at most about 80-fold, at most about 90-fold, at most about 100-fold, at most about 150-fold, at most about 200-fold, at most about 250-fold, at most about 300-fold, at most about 350-fold, at most about 400-fold, at most about 500-fold, at most about 600-fold, at most about 700-fold, at most about 800-fold, at most about 1000-fold, at most about 2000-fold, at most about 2000-fold or at most about 2000-fold).
In some embodiments, the methods disclosed herein (e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide into a cell or population of cells) modulate (e.g., increase) the expression of a target gene by about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 2-fold, about 3-fold, about 4-fold, about 5-fold, about 6-fold, about 7-fold, about 8-fold, about 9-fold, about 10-fold, about 11-fold, about 12-fold, about 13-fold, about 14-fold, about 15-fold, about 20-fold, about 30-fold, about 40-fold, about 50-fold, about 60-fold, about 70-fold, about 80-fold, about 90-fold, about 100-fold, about 150-fold, about 200-fold, about 250-fold, about 300-fold, about 350-fold, about 400-fold, about 500-fold, about 600-fold, about 700-fold, about 800-fold, about 900-fold, about 1000-fold, about 1500-fold, about 2000-fold, about 3000-fold, about 5000-fold, or about 5000-fold) in a cell population.
In some embodiments, the methods disclosed herein (e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide into a cell or cell population) modulate (e.g., increase) expression of a target gene from below a detection limit to a detectable level.
In some embodiments, the degree of change in expression is relative to prior to introducing the system of the present disclosure (e.g., a complex comprising an engineered gene effector and a heterologous polypeptide) into a cell or cell population. In some embodiments, the degree of change in expression is relative to a corresponding control cell or cell population not treated with the system of the present disclosure. In some embodiments, the degree of change in expression is relative to a corresponding control cell or cell population treated with an alternative to the system of the present disclosure.
In some embodiments, the methods disclosed herein modulate (e.g., increase) the activity level of a target gene, e.g., after introducing a complex comprising an engineered gene effector and a heterologous polypeptide (comprising a heterologous endonuclease disclosed herein) into a cell or cell population. Depending on the functional characteristics of the target gene, the level of activity may be determined by suitable functional assays for the target gene in question. For example, the level of activity of a target gene that is a mitogen may be determined by measuring cell proliferation, the level of activity of a target gene that induces apoptosis may be determined by an annexin V assay or other suitable cell death assay, and the level of activity of an anti-inflammatory cytokine may be determined by an LPS-induced cytokine release assay.
In some cases, the methods of the present disclosure result in a change in the expression level and/or activity level of a target gene (e.g., a target endogenous gene) that is longer lasting than can be achieved with alternative compositions and methods (e.g., via RNAi repression, such as using siRNA). In some embodiments, sustained modulation of gene expression (e.g., sustained gene activation or sustained gene repression) is beneficial as compared to transient modulation.
In some embodiments, the methods disclosed herein modulate (e.g., increase) the expression level and/or activity level of a target gene for at least about 1 hour, at least about 2 hours, at least about 3 hours, at least about 4 hours, at least about 5 hours, at least about 6 hours, at least about 7 hours, at least about 8 hours, at least about 9 hours, at least about 10 hours, at least about 12 hours, at least about 14 hours, at least about 18 hours, at least about 20 hours, at least about 1 day, at least about 2 days, at least about 3 days, at least about 4 days, at least about 5 days, at least about 6 days, at least about 7 days, at least about 8 days, at least about 9 days, at least about 10 days, at least about 14 days, at least about 28 days, at least about 5 weeks, at least about 6 weeks, at least about 7 weeks, at least about 8 weeks, at least about 9 weeks, at least about 10 weeks, at least about 12 weeks, at least about 14 weeks, at least about 18 weeks, at least about 20 weeks, at least about 26 weeks, or at least about 5 months, at least about 6 months, at least about 12 months, or more.
In some embodiments, the methods disclosed herein modulate (e.g., increase) the expression level and/or activity level of a target gene (e.g., a target endogenous gene) above a certain threshold for up to about 1 hour, up to about 2 hours, up to about 3 hours, up to about 4 hours, up to about 5 hours, up to about 6 hours, up to about 7 hours, up to about 8 hours, up to about 9 hours, up to about 10 hours, up to about 12 hours, up to about 14 hours, up to about 18 hours, up to about 20 hours, up to about 1 day, up to about 2 days, up to about 3 days, up to about 4 days, up to about 5 days, up to about 6 days, up to about 7 days, up to about 8 days, up to about 9 days, up to about 10 days, up to about 28 days, up to about 5 weeks, up to about 6 weeks, up to about 7 weeks, up to about 8 weeks, up to about 9 weeks, up to about 10 weeks, up to about 12 weeks, up to about 14 weeks, up to about 18 weeks, up to about 26 months, up to about 12 months, up to about 5 months, up to about 12 months, or more.
In some embodiments, the methods disclosed herein modulate (e.g., increase) the expression level and/or activity level of a target gene (e.g., a target endogenous gene) above a certain threshold for about 1 hour, about 2 hours, about 3 hours, about 4 hours, about 5 hours, about 6 hours, about 7 hours, about 8 hours, about 9 hours, about 10 hours, about 12 hours, about 14 hours, about 18 hours, about 20 hours, about 1 day, about 2 days, about 3 days, about 4 days, about 5 days, about 6 days, about 7 days, about 8 days, about 9 days, about 10 days, about 14 days, about 21 days, about 28 days, about 5 weeks, about 6 weeks, about 7 weeks, about 8 weeks, about 9 weeks, about 10 weeks, about 12 weeks, about 14 weeks, about 18 weeks, about 20 weeks, about 26 weeks, about 5 months, about 6 months, about 9 months, or about 12 months.
Computer-implemented method for generating functional biological sequences
Also provided herein are computer-implemented methods for generating functional biological sequences. Referring to fig. 21A, a non-limiting embodiment of a computer-implemented method for generating a functional biological sequence is provided. The method may include, at block 2110a, providing an fitness function trained on a biological data set comprising functionally determined biological sequences having a fixed length. The method may further include providing, at block 2120a, a plurality of different sequences comprising a fixed length in a computer, each sequence associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of the temperature gradient. with respect to computer-implemented methods, as used herein, "sequence" refers to the order of monomer units in a polymer molecule, wherein the monomer units are of the same type (e.g., amino acids, nucleotides, etc.) as are present in the functionally determined biological sequence of the biological dataset. As used herein, "temperature gradient" means a series of monotonically increasing or decreasing temperature values. Any suitable temperature gradient may be used. In some embodiments, the temperatures are geometrically spaced apart in a temperature gradient. In some embodiments, each of the plurality of different sequences is associated (e.g., randomly associated) with a different temperature of the temperature gradient. At block 2130a, the method may include, in parallel, by the computer, in a plurality of different sequences (1) selecting one or more random locations for introducing substitutions in one or more of the plurality of different sequences, and for each of the one or more sequences, evaluating a first fitness change resulting from introducing substitutions at the one or more randomly selected locations, and based on the evaluated first fitness change, And optionally further accepting or rejecting substitutions based on a temperature associated with the sequences, and/or (2) selecting one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a successive temperature of a temperature gradient, and for each selected pair, selecting one or more domains for exchange between the sequences of the selected pair, and evaluating a difference in fitness of the sequences of the selected pair due to the exchange of one or more domains, and accepting or rejecting one or more domain exchanges between the selected pair based on the difference in fitness and the temperature associated with each sequence of the selected pair. The method may include, at block 2140a, iteratively performing block 2130a, wherein, in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domains of a previous iteration are swapped into a plurality of different sequences, thereby generating one or more functional sequences having fitness that reaches or is above a desired fitness threshold. As used herein, "functionally determined" refers to a biological sequence associated with one or more functional features of interest, e.g., experimentally determined functional features. Without limitation, a biological sequence may be functionally defined when the effect (or lack of effect) of placing the biological sequence in a biological context (e.g., expressed in a cell, expressed in vitro, expressed in vivo, etc.) is determined or known. Any suitable functional feature may be used. The functional characteristics of a biological sequence are any suitable cellular process that can be determined experimentally. In some embodiments, the functional features used to functionally define a biological sequence are transcription, translation, DNA methylation, DNA demethylation, cell death, cell signaling, mitosis, meiosis, energy metabolism, differentiation or dedifferentiation, and the like.
In some embodiments, after performing the predetermined maximum number of iterations of block 2130a, the method terminates. In some embodiments, the predetermined maximum number of iterations is, is about, is at least, or is on the order of 103、104、105、106、107、108、109、1010、1011 or more, or optionally, the predetermined maximum number of iterations is within a range defined by any two of the foregoing values (e.g., 10 3-1011、104-1010、105-109、106-108, etc.). In some embodiments, the predetermined maximum number of iterations is on the order of 10 6. In some embodiments, the predetermined maximum number of iterations is on the order of 10 7. In some embodiments, the predetermined maximum number of iterations is on the order of 10 8. In some embodiments, the predetermined maximum number of iterations is on the order of 10 9. In some embodiments, the predetermined maximum number of iterations is on the order of 10 10. In some embodiments, the method terminates after generating one or more sequences having a fitness above a predetermined threshold fitness based on a fitness function. In some embodiments, after performing the minimum number of iterations of block 2130a, the method terminates after generating one or more sequences having a fitness based fitness function above a predetermined threshold fitness.
Any suitable number of random positions may be selected for introduction at block 2130a (1) (e.g., each iteration). In some embodiments, the number of random positions selected per iteration is or is at least 0, 1, 2,3, 4, 5, 6, 7, 8, 9, 10, or more, or optionally is a number within a range defined by any two of the foregoing values (e.g., 0-10, 0-5, 0-6, 1-3, 1-5, 5-10, or 2-8, etc.). In some embodiments, the number of random positions selected per iteration is in the range of 1-5. In some embodiments, the number of random positions selected per iteration is 1. In some embodiments, the number of random positions selected per iteration is, is about, or at least is, 0.01%, 0.05%, 0.1%, 0.5%, 1%, 1.5%, 2%, 2.5%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10% or more of a fixed length, or is optionally a percentage (e.g., 0.01% -10%, 0.05% -5%, 0.1% -3%, 1% -5% or 1% -10%, etc.) within a range defined by any two of the foregoing values of fixed length. In some embodiments, the number of random positions selected per iteration is in the range of about 1% to about 5% of the fixed length. in some embodiments, the number of random positions selected per iteration is about 1% of the fixed length. In some embodiments, at least 1 random position is selected at block 2130a (1) using a uniform probability distribution. In some implementations, at least 1 random position is selected at block 2130a (1) with a predetermined probability for each iteration. Any suitable predetermined probability may be used. In some embodiments, the predetermined probability is, is about or at most 1%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or higher, or optionally, the predetermined probability is within a range defined by any two of the foregoing values (e.g., 1% -10%, 10% -20%, 20% -30%, 30% -40%, 40% -50%, 50% -60%, 60% -70%, 70% -80%, 80% -90% or 90% -95%, etc.). In some embodiments, no position is selected for replacement in a given iteration. Each selected substitution at a random position, if any, may be a substitution of a residue (e.g., an amino acid residue) in the starting sequence (e.g., a residue at a selected position in each iteration prior to introduction of any substitution) with any other residue. In some embodiments, the substitution is random. For example, for an amino acid sequence, an alanine at a randomly selected position can be selected with equal probability to be replaced with any of the other 19 naturally occurring amino acid residues. In some embodiments, the substitution is a conservative amino acid substitution. In some embodiments, the substitution is a non-conservative amino acid substitution. In some embodiments, the one or more randomly selected locations are selected uniformly over a fixed length, e.g., over multiple iterations. The selected locations and/or properties of the substitutions may be the same or different in a plurality of different sequences evaluated in parallel (e.g., in the same iteration). In some embodiments, the selection of the number, location, and/or nature of substitutions in one sequence does not affect the selection of the number, location, and/or nature of substitutions for any other sequence in the plurality of different sequences evaluated in parallel (e.g., in the same iteration).
At block 2130a, the method may include any suitable criteria for accepting substitutions or accepting domain exchanges at one or more randomly selected locations. In some embodiments, accepting or rejecting a substitution at block 2130a option (1) includes accepting a substitution when the fitness of the sequence after the substitution is introduced (e.g., all substitutions for the iteration) is greater than the fitness of the sequence before the substitution is introduced (e.g., before any substitution is introduced in the iteration). For example, in fig. 9A, random positions in the starting sequence may be selected for introducing mutations (as an alternative). The fitness of the starting sequence and the mutant sequence may be determined (e.g., based on a fitness function), and if the fitness of the mutant sequence is higher than the starting sequence, the mutation at the selected position may be accepted and the mutant sequence may be used as a starting sequence for the next iteration, exchange domain, and/or crossover of the selection and evaluation of mutations (as described herein), or the method may terminate as one of the outputs if the desired fitness is reached by the mutant sequence and/or a predetermined number of iterations have been performed.
In some embodiments, accepting or rejecting a substitution at block 2130a option (1) includes accepting or rejecting a substitution at one or more randomly selected locations based on a probability weighted by a ratio of the fitness of the sequence after the substitution was introduced (e.g., all substitutions for the iteration) to the fitness of the sequence before the substitution was introduced (e.g., before any substitution was introduced in the iteration). In some embodiments, the method includes accepting or rejecting substitutions at one or more randomly selected locations based on Boltzmann Metropolis-hastins acceptance criterion r mh. In some embodiments, the substitution is accepted with a probability of min (1, r mh), wherein,Where f (i) is the fitness of sequence i, f (j) is the fitness of sequence j that contains substitutions at randomly selected locations of sequence i, and T is the temperature associated with sequence i. For example, in fig. 9B, random positions in the starting sequence may be selected for introducing mutations (as an alternative). The fitness of the starting sequence and the mutant sequence may be determined (e.g., based on a fitness function), and if the fitness of the mutant sequence is lower than the starting sequence, the mutation at the selected position may in some cases be rejected and the same starting sequence used in the next iteration, domain of exchange, and/or crossover of the mutation (as described herein) is selected and evaluated, or if a predetermined number of iterations has been performed, the method may terminate the mutant sequence as one of the outputs. Or if the fitness of the mutated sequence is lower than the starting sequence, the mutation at the selected position may be accepted with a probability proportional to the decrease in fitness, e.g., using Boltzmann Metropolis-hasts acceptance criterion r mh. As another example, in fig. 9C, random positions in the starting sequence are selected for introducing mutations (as alternatives). The fitness of the starting sequence and the mutant sequence may be determined (e.g., based on a fitness function), and the fitness of the mutant sequence may be found to be lower (0.007) than the starting sequence (0.01). The mutation at the selected position is accepted and the mutated sequence can be used as the starting sequence in the next iteration, wherein a random position can be selected for introducing the mutation. The fitness (e.g., based on a fitness function) of the starting sequence (i.e., the mutant sequence from the previous iteration) and the mutant sequence of the current iteration may be determined, and the fitness (e.g., 0.4) of the mutant sequence may be found to be higher than the starting sequence (e.g., 0.007).
At block 2130a (2) (e.g., each iteration), any domain between any two sequences (or pairs of sequences) associated with consecutive temperatures of a temperature gradient may be selected for exchange. As used herein, domain exchange means exchanging (a) one or more consecutive residues (e.g., amino acid residues) along a fixed length in one sequence associated with a temperature, (b) corresponding one or more consecutive residues at the same position along a fixed length in another sequence associated with a temperature in a temperature gradient immediately preceding or following the temperature associated with the first sequence. In some embodiments, block 2130a (2) involves selecting one or more pairs of the plurality of different sequences for exchange of complete sequences (e.g., the selected domain comprises the complete sequence of the selected pair) such that temperatures of consecutive temperatures associated with the sequence of the selected pair are exchanged. In some embodiments, when block 2130a (2) involves selecting one or more pairs of the plurality of different sequences for exchange of a complete sequence, the selecting the one or more pairs is the same step as selecting for exchange of one or more domains.
Any suitable pair of sequences associated with successive temperatures of the temperature gradient may be used or selected for exchange (e.g., each iteration). For example, for N sequence chains [0, 1, ], i, ], N ] with corresponding temperature gradients [ T 0, T1, . . . , Ti, . . . , TN ], such that for i > j T i ≥ Tj, the selected pair may include sequences i and j corresponding to T i and T j, respectively, such that j=i±1. Any suitable number of sequence pairs (e.g., each iteration) may be selected from a plurality of different sequences. In some embodiments, the number of pairs selected per iteration is, about or at most, 1, 5, 10, 20, 50, 100, 150, 200, 500, 1000, 2000, 5000, 10000 or more, or optionally, the number of pairs per iteration is within a range defined by any two of the foregoing values (e.g., 1-10000, 50-5000, 100, 10000, or 1000-5000). In some embodiments, the number of pairs selected per iteration is at most 3 pairs. In some embodiments, the number of pairs selected per iteration is 1 pair. The domains selected for exchange may be located at any suitable position along a fixed length. The domain selected for exchange may comprise any suitable number of consecutive residues. In some embodiments, the domain selected for exchange comprises, comprises about or comprises up to 1,2, 3, 4, 5,6, 7,8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, or more consecutive residues, or optionally, is comprised within a range defined by any two of the foregoing values (e.g., 1-75, 1-10, 10-20, 20-30, 1-75, 30-40, 40-50, 50-60, 60-70, 1-50, etc.). in some embodiments, the complete sequence of each selected pair is exchanged (e.g., the temperatures associated with each sequence of the selected pair are exchanged). In some embodiments, the domain selected for exchange comprises, about, or comprises 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 100% contiguous residues of length, about, or at most about, or of a fixed length, or optionally comprises a percentage of length to fixed length within a range defined by any two of the foregoing values (e.g., 1% -100%, 5% -40%, 10% -30%, 5% -30%, 50% -100% or 1% -25%, etc.).
In some embodiments, accepting or rejecting the exchange of one or more domains at block 2130a option (2) includes accepting the exchange of the selected domain between the selected pair when the fitness of the sequence after the exchange associated with a lower temperature of the consecutive temperature is greater than the fitness of the sequence after the exchange associated with a higher temperature of the consecutive temperature. In some embodiments, accepting or rejecting the exchange of one or more domains at block 2130a option (2) includes accepting the exchange of a selected domain between the selected pair when the fitness of the sequence associated with a lower temperature of the consecutive temperature after the exchange is greater than the fitness of the sequence associated with a higher temperature of the consecutive temperature after the exchange, wherein the selected domain comprises the complete sequence of the selected pair. In some embodiments, the method includes accepting an exchange of the selected domains between the selected pairs based on a probability inversely proportional to a difference between temperatures associated with each sequence of the pairs and a ratio of fitness of the exchanged sequence pairs. In some embodiments, the method comprises accepting an exchange of selected domains between the selected pairs based on a probability inversely proportional to a difference between temperatures associated with each sequence of the pairs, and a ratio of fitness of exchanged sequence pairs, wherein the selected domains comprise the complete sequence of the selected pair. In some embodiments, the exchange of selected domains between the pair is accepted or rejected based on the parallel tempering criterion r re. In some embodiments, the exchange of selected domains between the pairs is accepted or rejected based on the parallel tempering criterion r re, wherein the selected domains comprise the complete sequence of the selected pairs. In some embodiments, the domain comprises the complete sequence, then the selected sequence is accepted for exchange with probability min (1, r re), wherein Where f (i) is the fitness of the swapped sequence i and f (j) is the fitness of the swapped sequence j. In some embodiments, if the domain contains a portion of a sequence, then the selected domain is accepted for exchange with probability min (1, r c), where. Here, i 1 and j 1 are sequences before domain exchange, respectively, and i 2.j2 is a sequence after domain exchange, respectively. For both temperatures T i and T j, i 2、j2 and i 1、j1 are ordered such that f (i 2)>f(i1) and f (j 2)>f(j1).
In some embodiments, at block 2130a, the method includes (3) selecting a crossover site between one or more pairs of the plurality of different sequences, and for each of the one or more pairs in which a crossover site was selected, evaluating a second fitness change for each sequence of the selected pair due to crossover at the crossover site, and accepting or rejecting crossover at the selected crossover site based on the second fitness change and a temperature associated with each sequence of the selected pair. Any suitable pair of a plurality of different sequences may be selected for interleaving. In some embodiments, the sequence pairs are randomly selected from a plurality of different sequences. The crossover point may be any suitable location along the fixed length. In some embodiments, the crossover sites are randomly selected along a fixed length (e.g., regardless of the position of the crossover site selected for any other sequence pair). In some embodiments, the method includes accepting or rejecting the crossover at the selected crossover site based on a probability weighted by a ratio of the second fitness change for each sequence of the selected pair. In some embodiments, the method includes accepting or rejecting the intersection at the selected intersection site based on an intersection criterion r c. In some embodiments, the crossover is accepted with a probability of min (1, r c), whereWherein for the sequence pair i 1 and j 1,i2 is the sequence of i 1 up to the crossover site and the sequence of j 1 after the crossover site, j 2 is the sequence of j 1 up to the crossover site and the sequence of i 1 after the crossover site, and for the two temperatures T i and T j, i 2、j2 and i 1、j1 are ordered such that f (i 2)>f(i1) and f (j 2)>f(j1). For example, in fig. 9D, pairs of sequences are selected, and crossover sites may be selected for crossover between the selected pairs of sequences. To determine whether to accept or reject a crossover at a selected crossover site, the fitness change between the two sequences after the selected domain crossover is based on a second fitness change for each sequence of the selected pair due to the crossover at the crossover site and a temperature associated with each sequence of the selected pair, e.g., a weighted probability based on a ratio of the fitness changes for each sequence of the selected pair.
In some embodiments, at least one of the one or more functional sequences produced has a fitness based on a fitness function that is greater than the fitness of each of the plurality of different sequences prior to any iteration of (c). In some embodiments, at least one of the one or more functional sequences produced has a fitness based on a fitness function that is greater than a fitness of 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or about 100% of the plurality of different sequences prior to any iteration of (c), or optionally, a fitness of a percentage of the plurality of different sequences prior to any iteration of (c) within a range (e.g., 10% -100%, 20% -95%, 30% -80%, 40% -100%, etc.) defined by any two of the foregoing values. In some embodiments, the desired fitness threshold is based on fitness associated with a corresponding sequence of the plurality of different sequences in (b). In some embodiments, the desired fitness threshold is based on a maximum fitness among a plurality of different sequences. In some embodiments, the desired fitness threshold is (e.g., prior to any iteration of (c)) a factor of 1.1, 1.2, 1.5, 2, 2.5, 3, 4,5, 6, 7, 8, 9, 10, 20, 50,100, or more of the greatest fitness among the plurality of different sequences, or, optionally, is (e.g., prior to any iteration of (c)) a factor of the greatest fitness among the plurality of different sequences within a range defined by any two of the foregoing values (e.g., 1.1-100, 1.2-50, 1.5-50, 5-100, etc.).
Any suitable sequence may be used as the plurality of different sequences. In some embodiments, the plurality of different sequences comprises a plurality of different, fixed length naturally occurring sequences. In some embodiments, the plurality of different sequences comprises a plurality of fixed length random sequences.
Also provided herein are computer-implemented methods for generating or generating a functional biological sequence. In one aspect, a computer-implemented method includes evaluating, by a computer, sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functional (or functionally determined) biological sequences having a fixed length. The computer-implemented method may further comprise replacing, by a computer, one or more random residues in the sequence to generate the mutated sequence. The computer-implemented method may further comprise evaluating, by the computer, the mutant sequence based on the fitness function. The computer-implemented method may further include collecting, by the computer, the functional sequences accepted by the fitness function.
In some cases, the computer-implemented method further comprises randomly exchanging, by the computer, one or more subsequences from the mutated sequence with different sequences of the plurality of sequences.
In some cases, the fitness function comprises a threshold selected from the group consisting of a binary threshold, a numeric threshold, a multi-class threshold, a confidence threshold, a decision threshold, and any combination thereof. In some cases, the functional sequence is accepted by the fitness function when the fitness score assigned to the functional sequence by the fitness function exceeds a threshold.
In any of the computer-implemented methods herein, in some cases, the functional (or functionally defined) biological sequence may comprise an amino acid sequence or a nucleotide sequence of a protein or peptide. The protein or peptide may be any protein or peptide of interest, including any protein or peptide having a biological function. Non-limiting examples of protein or peptide sequences that can be generated using the computer-implemented methods provided herein include epigenetic modulators, transcription factors, enzymes, nucleases, agonists, antagonists, regulatory factors, and inhibitors. In some embodiments, the functionally or functionally defined biological sequence comprises a transcriptional activator. In some embodiments, the functionally or functionally determined biological sequence comprises an engineered gene effector.
In any of the computer-implemented methods herein, in some cases, the functional biological sequence can comprise an amino acid sequence, a nucleotide sequence, or both. The functional biological sequence or functionally defined sequence may be of any suitable length. In some embodiments, the functionally or functionally defined biological sequence comprises an amino acid sequence. In some cases, the functionally or functionally determined biological sequence may comprise an amino acid sequence, and the fixed length may be at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 105, at least 110, at least 115, at least 120, at least 125, at least 130, at least 140, at least 150, or at least 200 amino acids, or at most 500, at most 300, at most 200, at most 150, at most 120, at most 100, at most 95, at most 90, at most 85, at most 75, or at most 70 amino acids, optionally, the fixed length is within a range defined by any two of the foregoing values (e.g., 30-500 amino acids, 50-300 amino acids, 50-200 amino acids, 75-95 amino acids, 80-90 amino acids, or 60-150 amino acids, etc.).
In any of the computer-implemented methods herein, in some cases, the fitness function is based on one or more machine learning models. The machine learning model may be, but is not limited to, a supervised machine learning model, an unsupervised machine learning model, a reinforcement learning model, a deep learning model, a transfer learning model, or any combination thereof. In some cases, the one or more machine learning models may be, but are not limited to, classification models, regression models, convolutional Neural Networks (CNNs), recurrent Neural Networks (RNNs), limit gradient boosting (XGBoost), long-term memory networks, generation countermeasure networks (GANs), automatic encoders, transformer networks, evolutionary Monte Carlo, or any combination thereof. In some embodiments, the fitness function is based on an integrated model comprising CNNs and XGBoost.
In any of the computer-implemented methods herein, in some embodiments, the method includes evaluating, by a computer, the biological data set to generate the fitness function, the evaluating including generating a sequence embedding from a Large Protein Language Model (LPLM) based on the biological data set, and training a machine learning model with the generated sequence embedding as input. In some embodiments LPLM comprises an Evolutionary Scale Modeling (ESM) language model, optionally wherein LPLM comprises ESM-2. In some implementations, the machine learning model includes an integrated model of more than two different models. In some embodiments, the integrated model includes a decision tree model and a convolutional neural network. Any suitable decision tree model and convolutional neural network may be used for the integrated model. In some embodiments, the integrated model comprises CNNs and XGBoost.
In any of the computer-implemented methods herein, in some embodiments, the biological dataset comprises up to about 10 5 or on the order of about 10 5 functionally defined biological sequences. In some embodiments, the biological data set comprises, comprises about, comprises at least, or comprises on the order of 10 3, 10 4, 10 5, 10 6, 10 7 or more functionally defined biological sequences, or optionally, comprising a functionally defined number of biological sequences within a range defined by any two of the foregoing values (e.g., 10 3-107, 10 4-107, 10 4-106, 10 5-106, etc.). In some embodiments, up to 5% of the functionally determined biological sequences comprise functional sequences (e.g., sequences associated with, or determined or known to have, at least some of the desired functions in a relevant context). In some embodiments, the proportion of functional sequences among functionally defined biological sequences of a biological dataset is, is about or at most 0.01%, 0.02%, 0.05%, 0.1%, 0.2%, 0.5%, 0.75%, 1%, 1.2%, 1.4%, 1.6%, 1.8%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5%, 5% or more, or optionally, the proportion is within a range defined by any two of the foregoing values (e.g., 0.01% -5%, 0.1% -3% or 0.5% -2%, etc.). In some embodiments, up to 1% of the functionally defined biological sequences comprise functional sequences (e.g., sequences associated with, or determined or known to have, at least some of the desired functions in a relevant context). In some embodiments, 0.1% -1% of the functionally defined biological sequences comprise functional sequences (e.g., sequences associated with, or determined or known to have, at least some desired functions in a relevant context). In some embodiments, 0.5% -1.5% of the functionally defined biological sequences comprise functional sequences (e.g., sequences associated with, or determined or known to have, at least some of the desired functions in a relevant context).
In some embodiments, the biological dataset comprises one or more amino acid sequences of fixed length or a portion thereof of a heterologous gene effector disclosed in International publication No. WO 2023/004338, which is expressly incorporated herein by reference in its entirety, including but not limited to SEQ ID NOS: 16155-47350 of the above-mentioned publication.
Further provided herein are computer-implemented systems for performing the computer-implemented methods of the present disclosure. In some embodiments, a computer-implemented system is provided that includes a computing device including at least one processor and instructions executed by the at least one processor to provide an application including one or more software modules for performing any of the computer-implemented methods described herein. In one aspect, a computer-implemented system is provided, the system comprising a computing device comprising at least one processor and instructions for execution by the at least one processor to provide an application, the application comprising a software module configured to evaluate, by a computer, sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising a functional (or functionally determined) biological sequence having the fixed length. In some cases, the application further comprises a software module configured to replace, by a computer, one or more random residues in the sequence to generate the mutant sequence. In some cases, the application further comprises a software module configured to evaluate, by the computer, the mutation sequence based on the fitness function. In some cases, the application further includes a software module configured to collect, by the computer, the sequence of functionalities accepted by the fitness function.
Further provided herein are non-transitory computer-readable media having computer-readable instructions stored thereon, which, when executed by a processor, cause the processor to perform the computer-implemented methods described herein. In some cases, the computer readable instructions, when executed by the processor, cause the processor to evaluate a sequence of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising a functional (or functionally determined) biological sequence having the fixed length. In some cases, the computer readable instructions, when executed by the processor, cause the processor to replace one or more random residues in the sequence to generate the mutated sequence. In some cases, the computer readable instructions, when executed by the processor, cause the processor to evaluate the mutation sequence based on the fitness function. In some cases, computer readable instructions, when executed by a processor, cause the processor to collect a sequence of functionalities accepted by the fitness function.
Also provided are engineered gene effectors comprising one or more polypeptides produced by any computer-implemented method of the disclosure. In some embodiments, the engineered gene effector has an amino acid sequence that has, or has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to any one of the polypeptides produced by the computer-implemented methods of the present disclosure, or optionally, has a percent sequence identity within a range of any two of the foregoing values (e.g., 80% -100%, 85% -95%, 90% -98%, etc.). In some embodiments, an engineered gene effector (e.g., generated or produced by a computer-implemented method of the present disclosure) comprises a polypeptide having any one of the sequences listed in table 8. Also provided are engineered gene effectors comprising a polypeptide of 85 amino acids in length comprising any one of SEQ ID NOs 1452-2199, or a sequence at least 85% identical thereto. In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to any of SEQ ID NOS: 1452-2199, or optionally, having a percent sequence identity to any of any two of the foregoing values (e.g., 80% -100%, 85% -95%, 90% -98%, etc.). In some embodiments, the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and effectively targets a site of the target gene, optionally, wherein the target gene is endogenous to the cell. The target gene may be any suitable target gene described herein.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1495, or optionally, having a percentage of sequence identity to SEQ ID NO 1495 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1495.
In some embodiments, the polypeptide comprises an amino acid sequence that has, or has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1592, or optionally, has a percentage of sequence identity to SEQ ID NO 1592 that is within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1592.
In some embodiments, the polypeptide comprises an amino acid sequence that has, or has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1595, or optionally, has a percentage of sequence identity to SEQ ID NO 1595 that is within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1595.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1634, or optionally, having a percentage of sequence identity to SEQ ID NO 1634 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1634.
In some embodiments, the polypeptide comprises an amino acid sequence that has, or has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID No. 1654, or optionally, has a percentage of sequence identity to SEQ ID No. 1654 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1654.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1665, or optionally, having a percentage of sequence identity to SEQ ID NO 1665 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO 1665.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1677, or optionally, having a percentage of sequence identity to SEQ ID NO 1677 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO. 1677.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID No. 1686, or, optionally, a percentage of sequence identity of the amino acid sequence to SEQ ID No. 1686 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO 1686.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID No. 1689, or, optionally, having a percentage of sequence identity to SEQ ID No. 1689 within a range of any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO 1689.
In some embodiments, the polypeptide comprises an amino acid sequence having, or having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to SEQ ID NO 1716, or optionally, having a percentage of sequence identity to SEQ ID NO 1716 within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95% or 90% -98%, etc.). In some embodiments, the polypeptide comprises the amino acid sequence of SEQ ID NO 1716.
TABLE 8 amino acid sequences of engineered gene effectors encoded by functional biological sequences generated using computer-implemented methods
SEQ ID NO Amino acid sequence
1452 ICPSTATPGNSGSDALDEFELDMEWGMLLDDFDLDMLDLPAEEKHCMDTCTDFAALDGDCLFDDMPDHSPSPDDDCLFENFPPLQ
1453 PPFPATQVSGDTPEINNDIDQGDGVNFDCDMLESDAHDDFWLDMLGAMQQPEFDLADLDLVHGGWVEGFNWMNGDDMPEPGPGAG
1454 AIPDVDQFNSFEDFFQDMWDTSLPIDFDTDEFNTATTDFDDFLKLLGDNQAGIPSTPEGAEMQFHVHTGCPPVASQVSSHDTIKL
1455 PDDPDEPVSPNNLATTLMYHQLGTDGNVSGGGPCGDSPLDLTLFSLHDDLSAEWPEFDPATLDMADFEFDQWKWDDLGVSDLDDV
1456 DQPPSEEGFPPHWKEPTEPSDQGYDIPSLDEFLDDMGLVSPMTNMMSGLMGYNDLGTIDQSELAELCMLDIEQYTSDAGEGGLWF
1457 SPDSDWPHFSGRSDPIPDDVPVGWELPDLNFYIPDFALDDFDLDMLGASFACDEQNDMLHNPEQEETPLQTPRPTEWFYGAYNID
1458 GWHNEDMPDYTHMLLNSMPAHETQLAWFPPSPDTWINTNPWTDMDLPDLDINDASIQELLNLEVPVQDPDGWDSISPGFSSMEPE
1459 TPGETEYGPRLTPSPMPTDLGPDTFNFGFDMFAGLDTSPEEPIPSLASDLDDYLHTVDHQEGCCPDALDDVSYILEDLFDFMDRN
1460 GQSEPSMSCDFFNPFTMGAEPEYSSNDPPRFSFAPAKFTTPECFQELLYDDLDLNPSTDGEDVIMAMADALDDFDLDAWPEWPSP
1461 HAQRNASMYTGFDCDSDIGPIPDCFDTSPFMDTCSDGWCHTLPHDTEDDIWMQLDFPVFLGDAPIEGPELTPDFISLLDDAIDQW
1462 AHPSQPPEEYWDDFLLDMLMSDDHWIGTQYPELMHALDDFDLDMLGSDTYDMSCLTTEDFQAGGSFGGLSALCGSEGPDSPGHGD
1463 SSPASFEIPILETPDYALPSDGPGVNDDEDIIEMLLASAPPTGLVLSDQLEGQPPSGSCDEAGSLSMPHTYWGNMFEDMFGDSMT
1464 VGTLPSDSENFPRSDTPPQFGMSDDNLCEMTPSDTGDSFDLHLNLPDDQAALHALLQEAGPLDWDELVPLDDFDMNFTLNGMYDQ
1465 GQMDDSNTDITDDLLAPHLVSMMDEWLGDCDHIEGSVLDLFNDHEQTLPDLGILEELLIGDATAPEYQAGQCGNIMTVDDPPHLQ
1466 PPVGSMPDYPHELNHRAWELLFGEEAHNGDNFPSHPSPELVHPSMLESASWDDFDQDMLKFMGYDDLWSGPVQSQILDDLHLFED
1467 IGECQLPSFDGFPALQPPVEAEFVFDFPEPSEGGEEMCYHDLPDYDVFGFEASTPTNTPRLAPGDNPEWEDYDSLFPDHFGFGWD
1468 SSENPSNLMSPGLDETGGSSEPYRDHPFPEWDTFIDDLMLDDDLPDPFTWTPGVEDTGDCTDQPLSPLWVDTYFTWDWPGENYAC
1469 CEAPMEQFDPGCNIALVAGPLDGLGEYDFNDLDLNWFEGYHLTDHSCEQLIDFMDPDPTDFDWDDLADPSFGTSPLTPDPPIYPA
1470 SGGSSPSPQYGSPEDSLLDLVDHEEHVCPSDDWLDDFNDLFSDTAELWIPTELNNWEPYGRASPSHGSSPKEASQDDPIGWIPDM
1471 GHGDELMQSPDPRAQMPFLFADPLGTPALGDCAVPDALSDFNLDTLGSGNEHMFIWDMNDVDAFDHMLDDMAKDMYDCEQLGLDI
1472 AAFSDLLSPEELWSDTHFDAEFTEDWWDEINCLENPAAPGGAAMDSGSGLDDCLPPVDHLEINWPSPLQDEIQEFLDLGDDCDLS
1473 PDAAQSVEDDNMTLGGEWFTGWGYDFFAGFDHIEGALLDDPPTVEYDLDINELDSILLALSAMDMPRPKEDAAECGSTQSGATTK
1474 LCGSELSLDSLDALEPSGVSMPSNSDECALWDPLYDLPDNFDWLEEIDRSLGIPPLPPTSMGPDLCGFIWSLEHENTTQDWGHHE
1475 QCTGIDEMSTWDDVMTHEYDTQTQFDFPEPLEGSVSPFRPNLGPLQYQELMAEPAEDPSWSWPYDQFDLDALGLDALDVCWYPPN
1476 DLTQYEVPSCSDQFIGEHFAWPVPPDLEFDQPFEFDQMEYFPAGDTDATNAGDDWPPDLSGIEEIFTIVPEDHVDMFAGLDMLFS
1477 GGGGSHPDEDLNIATDGSIVHPLCVHDELDDLTYPGALDTYDGDQLGIFDLFDFDLDMLGGTIPNTADPVISYLLQQVTHDTNDI
1478 IDATRDLIGPLDEIASDGWDTSYPIDFDFDQFNTETTDFDTLHELYDLDYTAPEWPNEEAVSGSDPIACLPDPSVGIIVDTQEWQ
1479 ANDNAASAEIIYWDTLSPCHKWLPDDFDLNMLPEDMLDEFDCDLLVSDAIHQFDLEMLGYGQGGPEMRHGLNIYSDDYVIIPSGE
1480 TTTASGGSSAGPPMDLDNLGSLDLDFFDLDYQHEPAHPFFDLDMHGLDDLYDCLAMYEDLCKADEGDCNDAGFMDPMGGSLGSNS
1481 DNPPHSHVDPHQGCFLDTALPTIVGCQTIEPTVCSDILDLFDPYALGSEALDTDNLDTLGIDDLLRECCAGVASWQQEPEPPQCD
1482 LSEPEGSPGTSLDTWPVNYVTWDQVFALQGTPAEALPHDVNMTDDLDVDQGLMEMLMTDESELETLLDCLLPVETQLPPAYGPQD
1483 STGAPEGSLDENFNMIPGPLLPLPMQEDPWQLLPDMGTYPSGDLLGDLPDLGELSELLCPEVSMAIESQHDFFGEGSELELFASG
1484 PDVDNLTGCDFDTLLNHLPGIMDQNIHDLMEPSTQPNQGGASASLLSDGSQLDFDSLYEWDADASAGEWQHLEAEAWEFVCHTFS
1485 ECPGGDDNIPAPEGYPLNNENDDCLDNHLMMDYSNIDDLLTLCGVDLSDIPSPGPPVGMSFGPFDEWFDDIDGGQNSRTAGSLLG
1486 DTPMHSEMQAQHEGEMDLFHDVGWEWPDLGFAWYEDALDPFDWKILGADALYDADILASPTPASDYNLLDEHGGFMDDDFYFDFN
1487 CDSMDNDKNSETMDGLGTLPVVCDFDLDYNFDSSQLQEFLGNAGPDFDALWDMLDTYMSEDDRALFTEGAGPVCEEYGQPASWHC
1488 HLCLDCTIVETLGSGDLMLEQHPSLECPQHSGESHNLTFPATDLLAQDALDHFDLDMTMWLEEWSFNIDSADWDLAGMESGGSGI
1489 VGALLGLDSYPKTTPSLMDLSVCEYDDPDGSLGTKQVNDLDGLDMLGSRLCDWMDLDNTAFDNLMPAEGAMPDEGGDIDWFMMDL
1490 TENNLSPMANWWDEQGLMDNDQYSDWPDLQELPPDLDLMTEALDLGSPLTPELHEILDMFGIDECGNHDSAAGDGGVVYMTKGCQ
1491 ATAQDDEAPLQVHLDEVIIPNHEPCADDSPAQEWDNWFDMLEWTDAYNHAFSSMPSFPWFEDVALWSDCNIDTVEQPDEPPCFDI
1492 QIYRFTADCTPIQCDEGQYELDFDWDFDLLEWCDPTHMTPNDEWTPEPPQGCSSSVHGFTSEFEEEYLMDQWEEFMPTPPPHHDL
1493 GWEISGGAGDADGDPHPGQAFSGPMDFDAALPNIPTFDFDDFPLFADLESFMEPLGQEGLDCLGYDASLMKAVQENLGDLTQSSW
1494 AILENWVIPDSSSAVVCGDFSTDELQPTHVMHAWEEPPHQMDISLEEFDNFWQEQAQHDFDAEMDEWLSSEILSIFNDAPPLGTF
1495 PLCPLGAEPQHGVGWEPCPLSCEPQHAAPPIHCYLWPDNADFLDNGAALDEVPFCEIDIFDGDEFPEIMDLGVNWLEEWDPFSES
1496 GHADNPAGSSLLPSPVEPDPTQQDLLAPWSLPDLDGFNPSLWFDMESISDSETQLGDHLDVDVYALLETMWNLPDPDEIPELPKD
1497 ECEICMALDSLDEMFEQWHYVDRELIQGPSNEVIAPLITLDLMSEATYPAPGVWGTTLGMTGETVDDIFDWNIEDYLEIDFSELM
1498 TYAIWPEPPPLDLHLEDLCACLEQEANGEDTPFESDAAATFDAMLDFDAALEDCDTGMADSPASLNFSLDMGDSDYFSNWMTMML
1499 APPVFDDICETLDEAEPQHGYPDHTDGLVHQVLEPELEAFDLGMLGDDSLDNFDLTHMWAMLEEEAWGESPAAECNQSVQEGQGV
1500 TEAEEEASTPTEPPGSEFTDIMCDWLPCEGDTPPSDALDGTLLDILGQALDPDPWPHMNGSIAHLEFDLDMGYWCSPVQHQPEMP
1501 CSHDRNHSTFEMFPLSESTWDVICPDAHDACLPTTTEMLPEDLYLDSPLEPELMEILDDFLGAECQLNDGFIADFLLGAGDENEP
1502 CDVTSEYAYNQLAEEHLIPMEPHRPLCTSGPLSDTFAMDDFDSPDLNDQALDDVPALWDFFDALNNAPNAMPLLDMLTDFDLHNA
1503 FDQMWLYSFNSIMMANHEWDNQPSHDGGPNTPPPDHETMIPDLNLDWLDTPEEPDDVDMFLNDWGLVSQMLDLQDCVDWPMPYDG
1504 EHTEFMATGMDQGESNDYCWMYDLACECMADSGSPNISPSLTLSEYEEFCRLLPDMDTVDVGGPWAAQEPLFMHEIYESDEPQDL
1505 LGDFDLDSMLLPAVEPPCVAEPAEPPESELSSIYRPADWLAQADWHMNTTEDALSSDVNQFLSELFEPDNWENWVGENINYGLDE
1506 QDMDDAEAASTGEFPVTDWLEGPLPRQHGIASGIEGFGSTGPSNLDEDSWLDYLPMVDQILSDEYLPIADVFNELDDLLDFSCTM
1507 VDWPDEGTEFEHTDYFQNYVLPDVDIDDLLALLGDASPSKEQMGESTVPRTDMTLWGYLSDGAMPEENWKDGFCDWEMGGGEGTR
1508 AREAIGIKTPDGAGAMACAEVFGSELFPTGDLGDNDALDDFDLDDLLNDEFVMPDMLPDILDTQFLTTLQPHSFDFMDGHGHGHL
1509 NEAFCSTICETTGEAWIDPDPRGPQPWEYAFEELCASPMFDTSFDDLIATPWYLEGLQPDHLLSIDPVCSEILGDTEENKTMQHE
1510 FDDQSMLGLLDNFTPECLSSPSEQQDLCPEEDPYAFNPAGSISGDIIDELMLCEIRMDPWSYHTPADEPWWHPHCDLPQPSYEMP
1511 QPYVYPSVNTNISPSEPAHEPNENEWLGWDNSLAWNVGSLNDEFWLDLLTAGQPCFELPDVGGEPMNTGLADLMEFWAASEESMS
1512 AQAAPAAVEQLDDTISLLLQFGDDFDLYKGGLPAGGPWHGGPYYSTDSGGDDFDLDCIEPWVMEDSELSDVGLLDTDIADLILQG
1513 KTPVTVDTVSTCYPLERCNTVAELFDAPMAQDYTQWDNALELCQELFLDWDTEFNLHHAFEDDIMAGFDMLDQFEGTAEFLPTSP
1514 PWVCPRPMGDSDAVAPEIDTVGEWHFEIPDVDIINSPPVLHDLLGESSTLPDWALPWMDDQFCELDLMGQLGGPPAEFDEHSVPY
1515 HTPRGSMQFSTFCELEAVDFVCPASGETPCLHDEEELLDWKDFDDLPQLAPMMDVGDLDSIGPGWSWLDSYTSPIEEFLGCFDAE
1516 EGGPAWFTQPPSQDQGLPCCTCSAPECMIMPVAEWECLSQPIADFEQFTQAYLDNLNRDMSGEDNMSEQWCWDFMNSFDDTAWDT
1517 VMQMDVEATGCCDTTDTNENWLAGCEWPLPEPYAHQDMSSLDNLLHSDLESIDTIEMWAGGDTLALLETFMETMEMPHDYHDGPY
1518 LDEKCSMGFSDFIPTCPLNPMCGFMEWSSEPTDVEFDIPPTWLDDLTSLDVSDVVGIFDDLAFPPPAQTPEDQAHYSAGGGGTQG
1519 SPLRAPDIEFALPAWEDGLDLDWCFDATFDDFNFDTTVEDAFLDIFNWMEGPCNILSELGQGEPPPTRCQPEQGEDQYETPVSYP
1520 WSYPCGIEWPGPYSSSGRNSSQTQPRSDMTDLDLDNFLDLLKTDVESIDLGDFNDDLLGDSDTLDLNQYLFPVDEEDDTCMQPDV
1521 PGLPDHVTPCPDDEFNLYDNTQHPFDMPEPELCFEEMPPLDLFEILEISLTADINCPAVDTNSLEAFDLEDIQQTDNNPFTYFNE
1522 MATNSPVDSIRDETSPNKLEGSHIDPGDLDEFLDHMDLDDLWKTLWKDDQIMSHNGSINSCLENDMDVLFMHCGDFEIPLEHEEW
1523 GTRVNSQPITTEAMDSVAQSQAHELYCTPSEPFDTPCLPMYSLDSDDSLMGPTSDEAIGALLDIESFFDSILGGPSLGTTDDDPM
1524 KQYPQGVHGAIPQDPDAPAWPEFEDAEMFFACEACDLSPPETNTEDPELQPHSWEPLLAYEPSFISETIDTMWGCWDNWPWPDLA
1525 DWWEHFCPTTDWGDPLQGELYTPWSDVPLDQIQPIEPTAGFSDADLFNMLFHSDPSSQFDISHEDWLVNEGGSMLPTTPQESENV
1526 AEPSGSGMPTSQGWITGPQTPHLIDWITHDDIPGWPTDPSKDMSTLELDSSKFSLLDMISDEWLTQGSFEWPTDGEGMTGQGGGE
1527 TAEVIPYVDCPEASANAPTGANALEQPDLYDLASGSTWPLYASDMEMSDFLGDFCLDMLGMHDDIMWHNTLLDMDVLDDWDLSEP
1528 QEQVDLEGFALEQPIDDSNLEMNSPDGPVPQSAPRISEIGSNWFDIIDELGSPDMFDWLWCPPTPWDDICFTLDDLITDPQFEDH
1529 PPSDTDQWEFASLDFFDYDPEGPGGDGGGEPTHSTLLGMWSDLHLGDSNLTESMPSPELPGPVGALLETWDISTMAEWLDIPIDG
1530 AQTEEIQTDCAVGSIPQLTISPLDWHLDMDPHQPMFSGSLSQSHPTEIDDLGFLPEDLDNLAADVDAFTEIQNGMSLAEYYLWME
1531 DPSLGCLQPASLPIDPASELINPSPSRDSNYEPENHGSDLSPMVIQEMPDLGGLWMNASELAMDWREFTACDSDLWDLLGDGSDL
1532 GAAYQDDTLSYLSELVEQQLQQYHMEQWGSLWDSPATGHSTMSSDLELPDLGDLTELLCPPADPQLSSHFTGLSNWIQQVDGCNE
1533 GGAMTTRDDTSAISMPCFGGSQHAGDHPPSETPNEALADFDLHLLGDSDFAGVNFDWMNKGCDVADFFEICDAAFGDIRGFPEQS
1534 ALNVNAIEGSEHGDMPLMSELEHHVTHCPEWWYNDEAADFLQWFEETTNEYDNILGDHEFDPFQEDYNMPVCWPNPDFQGDINSP
1535 ALQQHTTPVPSHSEIADIMQDWPTGLIQEDLTISPPPPETQAFWQTLINEFFNNTAMWGEYGDHNPDEWAPCGSGGDTDQNPITQ
1536 VNPYFSYEDITPMTGFAPTQTCNDELPELFLEPQVAESFDEAELEMPSPHMNDFDLDMLLPDEWPPFDDWLTELMHFMDVQHAHL
1537 MDPDNMLFGVEAWDHCFMGVEHMDNVQEFAETLGFNDMDFDDMPNDIELLNCLGSPDSPAHTSLWCPGYMEQTHDPEPTSTDHSG
1538 YNGCHPGSSNWFGDQMYTQWEEGMDLDDILGSIGEITPPTSHSLDFFTDFFALSPMNSETDLEWADFFEGQEWHGQDGCTHTEWE
1539 ELGLYGELPHRAIGSELEEYPCVWSGQTPNTQHKDANSLMSCQDFQDILDAGFEDFTQGVDFDDFTYSVLDDAWLDQGGATNQWM
1540 PPDSMDCIGSPGVGVAGPSSPHSESQDYQGWETSTHEYNLEDDLQLLDLETYSPGSPLTEIEPDMLARNWFGECLFGDWVEWPPE
1541 YNANHMEPPTPPTTFFTDNLLSNEDWGCWNGPNDSAVNELFSVWGPQAAPPEQSPSFIGPSFDELMDWSYGWPCGGEWSLNWEDW
1542 EMPEADQQSGSPWSNLLFEEVGHEPDRSPADLDQDMFSGGVTLGLNWYDLNDLMDFDCMDFNDLMNGEGETFPTLEGFLTGICSP
1543 ATGRDCAEFQCDSPGSVSTSNWLQLVEDMLDPDPEGLGDFDWREILEPDCVMELDDPAFASLDMASQEMPMPGSVGFHDDLGQQE
1544 PSSNYDGTGHSIITINPDLDDCQSQDEPGWLWDTMTHAESQEPIPFSSVEDVFRPFPCDDLGQFQEDFYTFPGDFFDYIAWGEMQ
1545 LPEADDDFCPSESELFSQIMSLWDDLQYDDPGADLCQCILDDCIIDNDIFVNAFITSGGDSPSHQSGGRFNHNDSPQAFNPCESH
1546 PGPEPEDVGMGALWGETSGLGDSEQGDVFSDLDLDFWIGSLACDMDESLAMLDSVGHKLCSIYPNLPPSPLSLPDAWGQGMDQEM
1547 NWFDELWAEIMDDELPNLHDVRWPQGDYQSIEAQEPLSDDCFQEMLCDNALMGNDGLPDDANLLTWLESEWSASGSTDEWPAQPR
1548 NPNDDDYNPGDDNISWDPSDDDTMHTSPTSPVNPFDAPLGYDFNLDDFMFAQDFTDWPVFEEWMGHFEMPEWCFDPMYEDISFQM
1549 ADMADFSGMECVDMFTEVELSPDVFDFELPEYTTQEMSVSEDLFLDADTFGTPQPAPAQIPGDPYIQASVRNLAAWMLEDPWDFV
1550 AVPVTQEFPCQAYDMLPEQHVEAMDSHDSTLPPGQTGSMTCPMLALDDLMIDDILPGLRLSPNLCDPFECEFGTSFGFPVTPLDQ
1551 TNIPGKHINNWADDPLWDELLDMVPTHPCWFPMADHLEGANLFDVTMDDLEHPDLGDLGFNIFWNDALDSNHGEGQGQNESRVND
1552 SFAATNMASPCSLDSTSCSHPERGHEPMPAWLNLDMFSMTEDLTTRSPLTWDLDNLLDTFDKDPWGNDAMNDVLGGDGCYTEAPQ
1553 YPPVPSTTESATCTPHDGWPTETPEDFNFDEFNTDYTGFDDFLMLLCDNSNGIQSPWEGAQMPEFVDSDPVCDAPEVEGNGWIWL
1554 CEAHHCSHEGDHSVGMPHMGELHDAYAEEGVNLPNSPVIDSVDCDSASSYLDLGSSQWLKEDYLDDNLLAPFAWDTIWDFEVPDW
1555 EPTDLYDVPGGLPCDIDAEWNISGCLGDSDLESSGLFSGSTGSPMDWNVDMKDFDMDMLIPPCFPPADSDDGQEPPDLGYPLFLL
1556 GWGNLDATTSVWYDLPSEYWSCPDFEDVSWYCDIPLESPDLVDASLFDLFNSLDDPYAIANPQIDPESSEPFNTDIPGAEYHMLG
1557 QLPEWQLTADQDPLLELPDAAQYDLDALCTRPLKSDDDTLGSAWYNFSDFQSIWGEVSGMWDAIPSSDYYEILDDSLMTLLGDDD
1558 LDPGLVPMGDIQALQQDRALEHARGAGQVGQPFQSLYNTPDSLHDLWYGDPATVEPWDTYDPCDFFYSLLENEPEIPHSQNTPHN
1559 LAWTEHPLHDFCSIHNDGFDTSEPIDWDFDEFNDCFTDFDDFLKLLGDNSPSKGGGGGGGGEPLEAFDLFIPESNGDDVTVLDAY
1560 TETDGVPGPAIMTLELHPADASYATDALPPPFDLDMLGSDADDNFILDMLSGANLPCLPCLDQQPWMDQCIDFDDFDKFDDFISK
1561 ASDPPIPYLPGAENRCPAFLPNTFPDSGFTEFNPLDLNNLQIEDIHPWEEGLGNILSELEEWPKHFDTFECDESMDPMWCPGGGI
1562 LDNHEGADEPSDEFPLEGFAHHINDYLNYVYEKECPTGPSSASLPDLYDLEFDMPPLDGMPFWPEVNFTSVLGPTAESANPPSSD
1563 CCNMSQEPIWEDCIWQEARATVHFVAPQQDALSNANLWPDTPYDTEQISFPSLGDTPWDWFEPFPEFLDTTATPPLDIPQFTEPL
1564 GGGGSCIDTEGFPDWLDDAMGQISELPPPSNLESSAPTALVEQMGMELPCLSDLSELLCEHDEFMAGSAQLDDLDLLGQGEFKYP
1565 DPFPDMSDPLLDMWICNPDAICHESLSTNQQQGQNFPDPEDNPHDMPFLEFVLNELDFYPICEDVKMQEALYLDDPVANEKNFDH
1566 RPGSSNGAVAWPPAAEAPMENVPPHLCTPDELDLDFFSGNLGLALDSQLLSEDVLNLNTFLNMEPEDLGFVLDDAAGVCDLSYIL
1567 GSLPDVADLEDHEWGGFLSDWEYTGDCPKGLEAWDEKSLGDLFDQYVDGCENDFIEDDGSGMGLVDLLLQMDASSGEDGSMLGGS
1568 ADALDTSGPVENWLNWLPDDACYYPDEPWPPPLESFAVEPTIKEILEESLPADGFEPLCLTPDAEAFDADLTMEFGWLPNMVEGE
1569 EEEATALLWGMSPMLLNPDGADTGPGGPPPIQDLDDFNLSLGWLDEQKAVPILEVLCSEDPCWEESAAPGWTLDDFLLNAHGPLD
1570 IENATWDACECYDCDFLDNLWPYEDSFQWTPTPDVSHNFDNPRDSPIFEELFAGPTFFGDMLCRDLEDLDEYNQEYELPPPHIDL
1571 AVGSSDSWGLLDMPYQLDVMDHARLVESGALLDLDGFDDFDLTLPTDRDAADDMLSDLIDFKGQQALPPPPMDTYALGEDGDPCF
1572 CDIDSSLWAFEHFLEDDQFATCSNMPVDPLLDMTGDEFSGGDLFDWFDWDIAPCIHPMTYDQPDGPHHPYEWVFPEELTPPWSGP
1573 VGIVPTWPLENEAQPCEPMAIHESQACPADSTAYNPTPSEWDFDTAINWDVWNYSSLLDFSDENMWGSLYDWDPAPCATCMGDFS
1574 PDQTAQDFWLANDFDFDFFNIEYTDLDWWQWLSPPAGPTDEQQSNEYMASDYFMDLPFSPLSADDYTWATAHYPFNLGDANEQEG
1575 CHDNVCPGPMVEEPPKENLPEHDGLDNLLGMDYYNIDTLLTQPLTDLDPPLCPGAFGDTWQVEEQASEGGLGDWGDFSDIQWTLG
1576 DGHVDTDAEPPMENWIPMLFDTWWEQCHQLPPNHEDLTTYDPGFNLLDQVLEDDICAQMFDMPWGPEARPSDQFMLMGWPMQETD
1577 EFRHTEPDPESEAQSWDEATNANEDWPLWELTPWTPSPEFTFLMGEDYDLEDLEVHLFSGPFQDIDFMHSEDLLAGEDASSPLNE
1578 FEHVPHYLHSGAPRVDMDVLPTLCYDASPGEVHTYALDDFDLNALDFSALWDYDDPAPMDPMDTYILAPFCGDGGSPSWPIPFEG
1579 DATWHSLTTIGNDNEGPCNEWDDGLDNLLGMDESDFFDTITQEPRDLDIWLPYIHGVMMPQPTEGEGPGEDPCPGYLDIPLSESQ
1580 GGGTDALVDLLSDYIPPLAPSLGNAEPLSEPPTWVTEMLLENMLWLSAATPDQDDPGNIGGYTEEMHGDSICMWSELGFDGGQGT
1581 AQMDHACFWETNGFDTWTDTGADDLPQVSWGSIPATPGATPDGPSHAGILDDILPMYDHLDGCLQDWPSPMEPMDVWAHLWYQLE
1582 DDPFNSYAPYEWTVDELPDHPGGQSFATQPHPVLSGETDAMDDDPWSPLMGDVDLFGLLTCSPTPADTPQEALDFLEYPCDLGDG
1583 ANKGHTTHLGESHSNDKDAPAVPGLTYPWCTMHSQAGSPSPDIWDELMSPVSLGDGLLSNPIGDDTNHMCWDMLFDFDSYITWDS
1584 AGVCHSDTQISGDGPGIMDVGPDASMQHCEQSPQGPVAGDWEQPTAEFNRMEQEWTGEMHAELMDFLLAWIQCDDLGATDMPWVG
1585 WHNYDEAMPQSWTDPDLKTVLTTLDFWDLSVESFFAPSMDEQTQAPPTDESLGTTGNTPGEDVGMFPRTELWDFLLDDMGEPTSP
1586 THGPEHFDTVASGFAIDDFDMGMLFDSAPCWWDCHALGSDQYDLFDLNLGDPSPDAAMYDFAWDTFCLDHEAGMAYQDPAGTDQT
1587 SADTEPVPAEPGSQCPKWPGEWLMDDWLDVSAFPDFDLDMLGDGLLMETVSEPDPYAPSKELDDMLDNFGNHFMNALRADDWMYG
1588 TQIPPVEMPGMGFDCSLDDDLPPLGSLSPDESDWLFGGPHSTHPWWDCGDLCPDYMDPTTGGWGSTDGCADLEQLLQGSMWLENE
1589 PPELLEGMHRTNDNTQTQHDYCGFGYDTSAFDDLFAPDGDWAGTLFEWLDSSHASIQELEDCVPWPYPPFTGNSETDLEMEPIWP
1590 GGSDSQAEDHGELPQLGYAGSIYDVDDRDLAIIGLNPASFHLCDEFSESNSPAPADCLDPGHEWAEQFLDVWDFVDDMFDFSSGM
1591 AIHVAPTGMGDMQEWLYAECQAEEAPWRPVNCLEMALDDFDLDQLGMDFLWDVDLDPAAHSMLPSIANICVSDPFQGGAQGMDGY
1592 PGPMSFVEEHDNLPMLHLGIPMDNVVTPPQMGEDHEWPAPNLTEYSEQILEQIASNNTLFDEQFSELLETWDLYDNSEPLGDPFV
1593 DDIVPPHTPMTFCPYWMPWEDSSGPGCVSDQCIIGDPLPVAAMQDGIDEDLSFLGMMYDEDWECPPFCVNLLSSMDQDSLPPWDM
1594 PAPQPAYFAKMIDGQLNEDPLWDFMDFDLDIPWDSFFHGAASPIDLEDYSPPPHDQDWQSLDASLSMEGVSQLDDFDLQDMYTVV
1595 GLIDTTVDIPGSPQIDEMFYEHNFEPEHVSNWGWQPGQVLQQLLGESSNLPSLEVHDAGQDLPTLMEDLGYLGLLPPLEYDEDFC
1596 GQGGTGIPMTFGNMSQTDLSIGGANDHDHTEPPSTANPGCWMSTWFPDVDSILQSDWPLVDMFQPVPDDEVHDPSLQTHEFDEWW
1597 GPTSDPGPKSEYLDVFDLFDLDMLGSDDEEGADWDMQAPKVNEDLSLDMLATSEDDCLPPVQDPSDDLCQYNNWMGLGSPMGSLS
1598 GGGAERFTQPETPDGPYMDLASWLQGLGQEFPLAPPMPLLPGLLEEWIDIEIPDTSGSGDNMWEEVHFCSPPLDNSFDLWGPPLD
1599 GGGDLGPPISFTPTLEEIEDYTTNFNVPIDDHPDAFLPTVWDLAAVWEDIDLITGPEEELSWEPYPWQIWGDGGGESSSSPPGFK
1600 TDPVVPCASGPQNQCSTPTHQCESGHIEDDSQCMDMFDGWLEMDIINELESALRDADWLDDEWTLPVIPAGLDDLCGLGDPGDFQ
1601 EPDFDGFIEHALNFLNSMDTNLAADYEHPQIEPDTNAQLTAATQPAFHAILPDFYWDAPGECFSDFDWLMSITEHNNFESDGMVN
1602 AESSNTTPGPLAGSTFIFDSYCIGWPQYMPNVPPEDLRESQFPQYSEEFQDLDELICDQDLGAEPWTWPWSPTYDALLSDIREDW
1603 YGGQTNTTTQAQIQELGFPMEPQRPPVAENISLDSGNQLVHYTLNSLGLDDPWGYNEGARDEPIHYYLDWGAYEMDLDDFLDWSG
1604 HYDMSAEEPMEIIPNFDKEFDELFPCFPFHEGDFAIDVALDACQDLFWLDPPDGPWQPPLASQEWLQEDFAWPHGARSPGQVTQE
1605 LDTGEQQPPTQESLQGWFNPPPWTNQIPFFDALDDFDLDMLSNADSPGPSFIDVHPLWDALPWPMCKPPVGVGQDVPWGDFPDFL
1606 QPAQTYQETDKQFDNVGPGIRPRAGALQPGPEAITTEMLCMEDMTQLSLGDYAELCLPEGDSSDMAELSMLMDWWLSGDADNTFG
1607 PQLSEYDGPLAASQDSAVSIPDMVLLPATYDEQWNTDLSFDYNDLQELDDLGDLCDLHFQLLPPMDDTEGELWRCFETEFGMSMG
1608 PDDCCWGDDIPFDVSQVPSLEATTECLGLSQGDFEGLLDLPDPIALPDESGPVADWMSFLNGCADLEPLHTGGMGQSSNLEDPYA
1609 ELGASTPEMDPVSANHAADVMQERSIPDNTCNMCPQADSFLRQWLGDLADPFLEEFAQSFNPDDWLPQEAWDWHLCPGQYGENNG
1610 GGSEPLNEINENWLNFPLCAEFSTSDEHMLLDPLYALDDFDLTLMGIDDLAPTYPAISDTHRIENTNHLYHQPQTPCHICGLGWC
1611 SVFYDLTPCMIMMMAMSLRMEFGMDFETETWWCPEQIKPPPWTLRPLNSSICGYHCHADANWYKPHIGHFDYETSSQIWMDTEKF
1612 EDPWACDLEWLMEFINADAWGHYVDLATLDEMLDPGMLDDLWLDKWAVAASTDVVGHIDRPADPGSSQMGMHDGCMHNSANDWEQ
1613 DDWASSVLECVLSDAEDCTHGAPYSQCETQSHWAVGNHLAPPDMLECIIEEAPPLWAPMCSGGDEDLWGKALFGEWDFNNDNSST
1614 AFSGADWDNLDEFLCTCHFNEGRDVADHAQADPLPEISEPTSTLVPLQFNMQEDNANPPPIYTFNAPDLMAFDFDLDDWICEQPL
1615 DLMVYELGFPPEDDTPAPPMSGPETGPTPSNMDMEFEIPPISLYSDWDELMSAEVNDLLRLSPPSEQPTVDLYQESAPQYSPVSH
1616 PPRDGSDGEDINLLMENMEECMDWSTPLRFPNSWDVDEMDLPLQLDSYDPEGFDITPYPEAFASWEEEPPDSNGWDPQFHVWYPS
1617 IHPDAIDDWNQEETWNNVQPCSLEYDFDFDEFNTETTDFDDFLSLLLDNSPSKCHKPPELCGVFEIPENWRHGGYKAEGVDIPFD
1618 SQARDGGAESDMMDAFDIIDLDWLGLLDDCTAPESHNMMFPDDDALMAEAGKVAANGATPHGEPYEMGPMMTLDVVLGSLDGLFA
1619 LEFLDPTMHMCEYGDQPLLEIVLDDLPPHDIGTFSSWEDFVQDFFDSLGPLDDLGMSIDEPSNWISQSNSEETQNEDTEDPLMSF
1620 PIVGPMCSAFDCFCDAIYEMTFDDPATNHNSPARLPPMDESLLNPYECDWAEFLLEDTPCGCETMGWTCDNMLAHQENQPNQHDP
1621 AYAGLPIVSDEPGMDCIVPLSLAVRDTPEMMWSPPPPAYTAAANDSEIALMDQLPAMDEGLADEPLMDLDVVMWLDDLGFGPWEL
1622 ETTCLPINPHDGDSVEETVTEILEPILSVGECVTLPPAPTTLSKDGTHEFLEDWGWPGIESIPPEWDDYNFVWSDVDQTLDFGYP
1623 MEYTSPWNPAQWLETNVTLDDMWVRFPAHDEILEDFHLHMFADDNFEGDLFQDLLDVPSELWTWLDMEEETTIPLPWSPPESGSS
1624 DCELYEGFPPYPAGFNVVPPEELTIDNDFDEFFASATEFDDFSFDHALMTAGATQDSSEKMCRDFQDIMEELDILDPEWTCIPFM
1625 PVPGFDMSPEQHGMEFPTTDFDFEWSEDTLWGWVTTPTGSETSFWSQPDCLFDFWSLCGQPNPETMPELPDHAVSDPQQESNDMD
1626 AQEPADTPNHAFLPETSDLLPLMPDPWFPAWAPGSSPLMSLEQLMESWDDLELMDADFATSTGMHMSPSESGPSEAKNGEHGATT
1627 MNEGDREDPDEFAQIVGGSLDNTLDVDLFQHYIQSLLWPGNGDPLDPTPPTESPLFPPMYEPWVQYDFQPFSEYDAPELEGNFDP
1628 QAAPLPQIPFLENLDQPEAPQGLPLGAFPSMAPSTMEDACDYTSEGSGPNLCDLSELLGGPFWEADDEKEGMFEDFLPLHLGSTG
1629 IYKPQETDPTQESYIFYNQDTSSWTEPFIDGFIPQDSNAHDLGGFMLDMMPALTSDMSRDEWEFPIWLDPLAPQNMHHEWEFFDW
1630 CQNDIASQFNLMDNVTLWNEDPPHSAQALWGSPMNESLMTPEMFWELLCDDFAYAATPEDQTMENSYASPGDPLDWFAHLETWDG
1631 RPNDPPQQMQMPIDEQNEDPNVEDCACNPMHDDGLTVDFRFDLYGANLPNEDLVPPPLGLPLTDLEWNEEFLVDLVDFLGEGNLS
1632 GIGGSGGGGSGGVDALDDFDLEYVFDMDLYCEASEMISMHDLFGSDVDGFETEANLEAFYGNFCYEPTNHAGDPIDPMCHGWTSQ
1633 TQWCMYPPDENQIQPMLANEEHGQPHDDSSADVARQPCVLKTAWEGAMCLDETLEDLDLADFDLAIDTWACDAPITYDFSVNSPW
1634 DGPYMPTPQSSGTLSKLWADEDYDADQPMPPPHDDYIVWESVPPGLELPGLNDLWMDELFFDASDFDWPSAALDVDDFDKLWFEH
1635 MAENSAYWPQIEGLDEQCFFNDTALEGDGDLPALSFLSMPEGLPSDFPPTFNTPEILDTFLNAWPSFTWSDCLDDQPWPDTGDGE
1636 DWSGFSPFDEYSDWFIGIAEDDHGSPQVDWDTPTAMPTYVAPHSPDTQFDMDTQQADNWPPHWNVDFDYMLHAYGMGTTFPQLED
1637 SNVMPLHDDFDITDISVECPGIHDTFLTTESGDTFWSAPEFSEFSTNILSLEDPYLDSLDLASPDLEPYWPGALIQQDEGMIDIV
1638 YPQDVGEAEADSVIDFQLGDCSEHWGFALPGPPPAPQHALLSYNGQNQIELDELLRQMFDYPDVNLLEEPIPPNLCCNSLLDTDL
1639 QAAQEAAGPGTDAVEQWHEEPCDLGPFASLDDPNMGPGGVQMEPLWQTPGLDDDMEQWLGPIGTYCPEQADLMLGGLDEISISDW
1640 ASMPPTPEAQGPQLFEDLAVYGQSDERALLDQLHTLLSNTDGEGPKTCTHALEFGRELTGMQSCQSDDGWWDPDIMLGLDFGQED
1641 ERTPGFPHTPLPEELDVDCWHEYVPEGFPQGHHAANQPFFHPLAEENYTDMDNCNMGLDFWGESLQHIEPEFWSISPWDFNDDEI
1642 SQPQVTAPHDFSPQIMPLPWWAPSFEQIMGPEPHLDMTPPSEATFFTTTMGDPHDQENWYRELTLEVSNWASMNLDDLDAAFLDD
1643 GPDEDISDGQLIPGDPVAMIQPMDGTLEHWGLDLDTSMGSLSFDLHSPAEEDIFGAWGLGGGLDNPISTQDCDGTGPWGGPWQQQ
1644 FSNSNLQIPDWAFETCNDNYVWEQSYFPDPLGLPSTLSALPGETLLEWEELVLTPEPAHSSFYWEGHTSGTGAGIGGGEGPVDQA
1645 YTPPHGQWMDLDYFFAGLPETETFFDFLEPQEASEEISISSQSIDELMGNDVFFAQPGCDPDVDPYPFNLDMGPVVSDHGGSSAQ
1646 FDWHSQNAMSWEPGQNDNALFAPWPSNDNLPPPEYGLIYPSPIAQLDELLSHDVPMPADKLAELADVYNFGQDDLVGVEDLFMFM
1647 DVYNTWPAASCQESIKPIPPESSPGLVYDFDSVFNSFEDNFQDFFNSLECLTEFMHDAGPSATGCRIGPTDGGGGVFSTTGDWHD
1648 PPNVNHAVVVHANTFNDVYDVCETAPEPFHPIPPMFDAFQDLLGMNLDDCMSLVPEFDQLPLTPSAVMDSFHDETHGDVLWESIG
1649 KTAPEGTEFTIPHPQEVEAEQCFVQPHVRDPWTPSLETKNADKDTEFMTCISTEPSMDQITNPLEDVQDDDMFYHWMDAWCDNVF
1650 HQPCWVGDGNCPETPPAPLMMGGDVDMEPLWDGDFEPFSFDEPCTWEPSNIIMELDELPFTHWDNDNNCFKPCVNWEFFAEFNNE
1651 MDAAEFPLAGTSSCNPDNCDGLCHQEYHQSDETPPFVERPSDDMESIILTDLMDVDDMDFTAFPDGLDYIADGDWGLWVVPGFFE
1652 HETITVSSISMEGPSGDSLTAEDDSDYAPMPSFLIGGHTEYPTLFGDDDHAPWAEMRVCASLDAMDFADPQLFMDWLAIPDFGSE
1653 GCITPWLSPRPAEVEWPEIPFSEEFILDILGQPHPPTPFGTPSLHDLYMVEFDTPVNDPNTSSIDDDHLELLGMKMNGGAGGQSG
1654 WDFDWVGDSIMDIIPIEDMFDEVAPRNSCATWLAETCALSNIDTETLLDMTAGEDLCPSNDAGILSNLDEMEANTVWQWPPYQDE
1655 DWCTLINMSPTPFYDGPNLVEDYGSLQDPEETYQTPFWPEVLSGDFTEESLEDLALMGMPAMPNLDCLQDVDCEQPDWGDLGELP
1656 GRGVSTCCSIIPGLYDETTNHWADVLMAPLNALDDFDLDFLSALPSVDSMEPQSWSDLLDLDLMAWIGSMDPPECGRPADEKTSQ
1657 DWEAHCFERFPPPPFNAMWDHGDVQPNPEPTDPSADLDGFDLTSLDCMLHDSIWDTPMIEFMNELEPGNRENPTCALGYEHTHKM
1658 DHFLAAFVQPEPNEYIPIDDGGQFVDDPPPPEPLSFATGGPWNIDEFISSDVFNLEHFLAEGGSPWMMPDCALMDDDLTDLSDHR
1659 LSHAPTMDEMPSIGTVMFDDAVEEPGSHCTDMEYDLMGLLSILLEESSEPMSQLGLFTGGTTINPLAQLDEYLMSTLENDASPGN
1660 PEEFPWWSSFEPLELEVPEIDDVTLPWPMTNLGVSTIFPQDVYLTDLHFMLDLMGNTDHGPEMDTAMDAMCDQDVQEPDGWPATP
1661 PTNPSSNPGAHIIATGSEWLDWSWGDNMGDPLEHGPTPCGTPSLIDLWGMEYDVPTHDNSEGFCNDLFGVPDDFWQQQFQWCQEY
1662 GQMIIDTWDVPSLDSPNIPLLQFEDLGTLDEFLAWGPCMLNDEFWTDLFMNDSYPALLGGGELDRDGNHMSGDICMHGGRHPGSD
1663 GMTDSELSDEPALWGPGSQVSPPPPHFDAHDLMLDYQDLFDLDCLGRLGLDELDSNMLFSESGGGINWPGQHQVIDGCGEIGPSQ
1664 EMWPFPEANDEESLVEFWPWRDPSPNEPFNPCNTDSFGDMCEALDLDPLPAYVCNMWNNDVDTDWNPDISLAPWVDAELEIDETF
1665 VEWGSSSGGTWDPPHDWVDPIQHMWDCPPADLDGMFIERLFHDELWSDSLMPAPSNMMGYDFDAYEAEHGITDPWDPTDIGNDME
1666 DDISMPGSHVDECNGDSVDSPTNPIPFDPDMANWETAIDDTSDEKLLDNLPSMSGTGPMWGLTLSNLQGEDICTWEEGLGHWCFE
1667 DTFCPTPSAMGPGLSEDLTPGCYQDALPMLDHILGCPAVSDLLGENPKDEGMGIFELLPESWNPSLEQVDLDGYLDTWELGGGEG
1668 TDDGLELDDFLQPIVYDFLAETNDFDAPPQDYGAGHMLDGSDLEMLTSLLDDCHFMGPWIDDAFCFMPSPEPQDMTQDDFALDRQ
1669 ENNAAPGHAQSPIAAVNPLDLPDPLGDHWFVWNNDSDAFWDILGEILSDIPSLGHNDMDNWFDSLAEQPSTPGHWTWHPEPPGFI
1670 EDDCYPPSHSQPPVDAVWVYGPQSAPIDENDEAVSDRADCNVWSPSPGPEGLGNILSELQSLDESGNFFTDMFQSDTWLDLPHML
1671 QESMDHVIGTPFGTPSPHEPYDPEIDVPEGDPWELAHCDLFHDIDLLADYMVSMIGDEQLASKGVNNHGWGMQGEACSWNNHEQQ
1672 SSAFPEPWTNLGLLEDLLPSEYWGFCKAESTAIQGSLLDTSEDFLAMLPELGDLGSLLYETAHQQVDAVPDEGADSTQGGGQDVH
1673 WEWFPLDPQAACPSDPNEHMEAQEHLTFQCDHTVPFCPFGTHSLHDLFDLDVDVPEMDNMLQEFWQDELAMDANDHDEQAAPAWE
1674 LPGEKRQKTDAPPPYIFFEEGPCHYNEPCWDMDLMSPDTVEWMMDIPDLSPAMAPMNHDMQEYWDELFPLFDIGDTECVGDLWDA
1675 PSDVLQPAEEMFDDMLWCLPNEINEWWPHEPGWPMGGISPWHALHGENEDVNVEYLQIYEEDCATANMMDSTDVNQDMDIPQPQW
1676 DWRTDFYCVTDDIDIESVMSGLGLELAEFQACAWAASPPFQPVQECYPGRDMGEEFNLDAPLSDGNNWELDDLLNIDEFASLPDD
1677 PDMNDSMGEACPPDTGPSILVPRPDTPDILGSPHPFFDPAIYGWSTMELFDPLFPELHEIGGLFEHGGSRDCTTYTEGGGRPGGD
1678 PDAGYSQVPMPPEGGEDHEYLEESPYMIPDYHYEGIYLDNFDNFAEDWFWPGIFDELLCVHGPPVGPMLPTTGNSFPPDIDGSQE
1679 HVQGADRMWDIPEWLQGMTSEHWYVVDATEPMESSQPGPTWGGSELEDWDLGGLTSLLEMGEYEPAHSPIPDTWSLDSGYNFIGQ
1680 TEDDSPSQSTIEPALQDILLHPIITLDDDCLQNMDHAPGDFLDPFDRVDALPPPDPDLLEFGKAWGQLDDLGILLPTTSMHWNYF
1681 SPIGAPTLMFPDDDNCIEIANWCYSDLGGLPWFEPRTPFGTPSLQELIDMENDVEPPSQIAQACPDSCEGMDSNIMLEPSAWNSG
1682 DDLEHLEGPCPPGTPEAPLPATYGRVADLWNNHSSTPSPWLLDFELNDFFLGSSPSAKETSEPVDFLPADYLPDFDLSPTMDETQ
1683 SMMDVMDPIMDIVMGSSQGTGYSCDVYQHPQPPNPEVAPVTEPMELGNNPFSEAWVTATWDEEVHHWVNGLNDPCDFDMCHWWVD
1684 DDDVNRGPWYIPDMFQGGLEDLYHLGPPEPSWTSTESWLGDLFDVEVDGNAWGASQFAGDNWLPEEALLEAYCEDIESQFCRKIQ
1685 DETTGGFDDMLIEPPISPVLLPTDSSCQLGDLLRPPTPITEDLLHNLFGLDVFPPESHYMLWDDFWVNDEFINRADPGGAKPFTG
1686 GDQSPEALLPEGGDDPMWPVVSELCQPQTYHYETADVDLQPDMWISTTNIAKTVEGNEDATPMNFYQQEEDIIWAWLYDAMMCAC
1687 EEDHVKYDIWSIQNAFHEIHNMSNTTGQGLVWWFHRHETGLWHLKLMACYIQAAYVKTSPDVWPFNQYGRPRADDINNFMCFIKG
1688 TMDDFMPDHINDPPKDYDAADPVLDNETLPWALDHCDLDDLWATIWTDVYTSQPNLSLSHLPADEETPNDTPGTWEMPLFAQSPM
1689 YTPDTNMGPLMSAPGDDSSTSEHTGGRTCGGLVSGAKGMSRLHTAPPTLDLGDLFELLCETANPQGPMQSLDDFDLQGLLDGDVM
1690 ALLDDTQPPPLHSYALDGDYQGGHFPSAIDTFNSSVSSVGSEIDDWLLSDLGDLSELLFETACSEDWLESMYGGWGDVLPDQALL
1691 AFDSQQGPHDQAGLSMHTGIPNDVLPDVWLEMIAETIGQGIQYPCHDSTEMYNLPSPEDISSNDGMCLEDWWDEIMGFTPDFFPY
1692 FPNDPWDTSLPIDYDFDYFNAIEVPDPDFLKLDSPEMSSLDLFAVECDQWELDHNMNAVPRMFPEHPSLEAEEGSMAAPTTGCGT
1693 PLLPQVPPPTPTDEGDELPLPGWDVIGDSADAHCVFELPMPGDIDQVFEDMLPCMSNPYGWLDEADFEFDIQEQMPPGVHFEGTL
1694 ADMIENLYAQECPAMGDIAEMYHWPTDQTPHPPGWEQYTDIYGLCYDVDLVPPSTSLGGMEPLPPDWECNLFELSSDWFEGTLSF
1695 QMRGDDITDPFMFWGFADSNTLSPNPRRCLEQLTNPEPAPTWSYVSDGVEDEIAQFMDNLENCWHDWDDDANQPSVDPWSWLELP
1696 DLSGHRTSDDLGEAQPHTWDSGWSSHACPCKLPFYDEEDGPEIPAISALDTSLENVSNHEEDWSTLMEFFFPDMTCVMEDNRPFF
1697 EMAGQSPETSVALEEEVGGLWPEQSNVPPPIGNNLTLSDLDSYVDQDLKEGLENELEFPVAPFFMEDIFTWASEGQIQPLPSMDM
1698 NEAHGNCLDNAMQRFNDAFECFDVTDHAEPWRSENEMTHPFFSEDYFLGLDFNDLPFTCDYEDWEPIEECVCQGWMDPCAFTPDI
1699 NHDFDFNLNAWADIPDLNTVLPIWELDDAAVESFFFSSEDSTPMTEGYSKQTKHTFPPIQGTDDPYWGENMQTVTFIGGTWPPCC
1700 VVENSIQSVHWSSDEFITWLFDTDLTEVTPAVATWELPVEEPGMDLPDWTPWPMLFPNPSEEGTWNMADLPHDQLFDMCDQDLWT
1701 WGDGCDSSNPMDLSPPLMDEHFSEDEIFPRDWDLDSFGLLDLPDLDIPEEAELMDARLSFLMSGGLGKNYEGPGVDLSLGSELAQ
1702 TQDLCPTGADPIDIFEDIFATQTLDFDDALQLGIEGLLDLPMDDMWPSETGLVYLDELLRQYNLMEAKESPIAPHDNVSLYPGEQ
1703 QFDDPAFDELWNAPPSLPPEYVNETMPDFLHQESGSIFAPCELTMPSLGEMLLILDIDSQENMEGFEFFMQPNIEESNIHTLGGE
1704 WQGLLPPSHLEPPTLLEDGLTYEVEALDQMLHDCNTLSPSQLMPPAAPAQPVYGVIPLGQLSVLPIGPTIDIDSWEEFLQNILST
1705 NQPDNAGNYVMDDLDLFMGLNLFPQKIIPDIGDVEFLWAPAQCLPDPSALDDSDLDLLDGQAIPSSSQSWGLSFDVTEHTETDRF
1706 DLDGNYFHSSPTGMWWDDFFNEMPVMELPVNCTSNELAIDPDGLEDLDMLGMIPELYSFIASLDNFDPAQQASDVVYEEGGHSGM
1707 PWAHAQHDDLMDWLMEYFDHDATNFEPVPRTQDQWQPQCFSTADTDLSIYLYDDIMPNVDWELVPCTSADQDWCWNQVTPQPGDK
1708 DNTDSPLNLTYTDMEPWQPDEEAFAWGSNHALPHELVDFDLDITNLDFEEGGLAYYLAATNMSFWDFEFEDWYFNAHGMDMDFLE
1709 FLKAETPTESQISMCVGTDPTVHASIMQPILGDEPHPTPTMGEEGPPLMSPWTEDDPFNFWEVADTRIWDVLTFPDMSSAEEWWQ
1710 LEQADRAPNALIEGLEDFESDHCIQDTPDFCFTSTPDITQDTMYPWIYDPYLGPGFMEEFVYDMNPPEMLSIIADTCQEGPSTNH
1711 SWYEIHVHPDMAISAPPQTSPREVNEQRSDNYEPDPLEPFYLPWPEESNDLAEDMLGALNFPEIECFDWDEQLRSNDIDSCGGHY
1712 SEGPTSPYVMGTPSAAEGAHPEIGNAMTQPAQWDCQCQLFEYIDANNMCHEAPEGLLCAFGFEDLNDWINETGEGLEWLLGDFTY
1713 VDTDLLAQAMTELSQTVSQIPPPPDLPCDGCMSEALDDFDLTLLGDDEHWDGLDMFAALFSAVNEHGKCAHMETPVWDFPSMISW
1714 DGPTTYYVLPQVPIPFEVSQEEFRCYFEMMPDPSPAQDNAFGSWMYQSDDWYEDFMQQPTSNDLDTEVSDWVDFDPNTTNSEALG
1715 SPTHPIFGPTPAFEEWSPECLGAAHHDVWWQCFDYTPDHDITQSLPQNNWDVNLEMFIQHGDPEMYGGIEMYEELVHGDTPESQP
1716 ETENDVEVYVANHHGGIFEPEQEFNCADMPSDFPPETPRTMSIVGQELLDLGSLAEWSWDMLDGSPDYDQEWEILLGWGDKFMLH
1717 TDDTPANEPDMWWISIIDGPLPMEPQQTVQNDMSSAGPVAEDLFETLHSRDVWNVGPDTYLNDECLLQMGGDPEYGGIGDGFLFN
1718 HRFTTPSPSPSILHNTPEQTVGEGLGFKPTTENSEALDAGSISEDISDELMSSHDAAAYEWFDWEENHEIGMFLDMLVNPHYGIN
1719 NAMPLDCNTDMGDGETLNINLPSHWCPHHGVDRNAGWATSESAHLDMEDFFPDMWFENVYAFAPIQPVSDQDSINTGLDDFMADM
1720 KELDDFEMDMEDFLPGELHDLMFTHFWAEDTVWLDMLESSAFPDFDLDMLGAQLPMPEDAGNLHAPDSKTNMIDSWSELWALSDD
1721 DPMCLTTNCYTDLSAIQQELPPQWQECDDLDDLMTWLSNSDAELGLEDAMRFSPLELWGQVALSDFGGDLFDGWGGFGDMYTPTV
1722 PDEPTQDPGDSWFQPLDWDLVDLEYETGPYEPPLNLEDQGYHIFDSVMDEPMSQWWMDWDSWLLSGDQHPQLPWFVNENAAGIDF
1723 GGRTEPAPDLASSLEDILFADQSPSDEGALLDQLTCALKDDPAESLTDMALNGEDGEPEITLWDCEPFDEWGELAMLEQGCPNSP
1724 SPQEGAGPQCSHGPPLDSQQVADEFAPEALVSPCTWSNNDQIWLEEICNDQFPIDDEWILNLINPIWAFDLHAGVTEAEATDFHI
1725 GTRSSDEQGQIPEDASWALGLADEMGSEFNIFVPPELPDFGAQWDDLDFGEIFNDPWEMYSYIRDFQLSPSALPHHPNYAPTHDA
1726 DQDKDMYSTLMDDPWIFTSHDVELDSYDDFLSEWIWEGGTYWSFAEYSNWQDPANLTAPEQHPDNVQSGIGSQPEQPASHPFYES
1727 DLYDFDGIFDNIQMIIDDGDMWRPDVDPSWTPPEPLSDFDWTLLNDPAVQLPLMDPLNGLEWHQSIPSEEVFMSKIEDSNFAHDC
1728 DDDKNEFLHWLPLTDPWDLELECMCEYGAPFPPCQPTLHATQDIEDSQNMSTELCWELEQWAESSIDSFDFPAEALPDIHDAWPQ
1729 SMPPHANGDLICPPEQQWEELQEWFNRDWCLPPLEDPVEHQLSEFSMSEQTLPGLPDDLLLYELVHQFEAIFDDMSYDPVGPMGI
1730 RTPPHPQGPTCELPGEPMDLEPQSDGNMHVSSACDIRDLMDGLGIIDELMSTDGQCSSYLALDCPWWEDDPMFTDPLGQMHLGGW
1731 CAQQNNPIPAIYHQLLDEVSDVANFNFDSLGWWTTPFSPHSTDWAQQMEPLEDFDMATLNTCDLDLFDADYGEIWTDDPMWTEAP
1732 PYAVDAMIQPGFPGPIPYDLEPPADWDNVAWMEEHINPSYCADLFLPTDDLLPYLQREDVEAAIWEPFFHNLAPNQDMDQDAPYN
1733 AQDDDSEDDHAEWDICHVELNAQAAAADLVSPVMQGDHPELNAEDLYWCFPELNEAWMNFLNDEFLTADPTNVINPEGTYSQVQH
1734 DITSTEQDSSTPEATGPILFEDLAVYANQSSEDNNCNPFYDLSPDGPWESMSLDKFMMARDSPTPAIDLSLESRMLWDIMDVFDP
1735 YEGFSTIHMDLWQTNVDHPCTLPDVPAAPMHCEDPDTWMCGLEPLFEDMFMDMDIDFANWDDWALEPINNLGATRDEHPGASTKV
1736 TSMRQSQIMTDVHNGQTLLNECDFSAPWHIDLDLMGPSIADWSFDDPDLDILWSFGMMEDVPQVSTACNNDEECIPDWGTTQSAQ
1737 LEACAGCQAETQGENWADMFSSYADTPWEPDLLPTWDGADLNNELLWTEPWTMPVDGLNAVPMFRQAEYPHHWDPMAESPEWQTP
1738 DMLRPGTHTSMQDGDPAGQPPALPHCVDPPSNASEWMSLHDFAFGENMAGEDINPMTMPLGDVLELLETWMPLSSWNWHLDEPNL
1739 FDDWDEAMTPRSGAFDLFNQLSPEDWSLQISCPLEDSLTVMGELGDLFGVLEQDFPLEHPYCGLLGGEASSEPPGGDGGSGLQLQ
1740 PGVNMRPMQGEAFELSDDFKMLEAGAFAPPLSPDICTPWGTGSWTEPSDDTVSGWAGLTSEDLYMADPDWLDNLDWGEPLGWVCK
1741 FDHFWDITLPPAANDPPSHCSETTGPQDFPDLLTDPTMSHQGMFIQELPDWRDLAELLGETTSPLMPGTPLGMLNGDGENDFSCY
1742 HSILDQNAPDGSCPNTMLEFDSPPGEGSGPELDMDIFMHDNWDSVEDIWHPWWEVALNEDFAGTLALPPIPDGLAATPWGTSPNW
1743 TSQEYLGSMEDVCENPGLPDTYFDLARFDLDFPRTPTPGILDSWIEDIWGGSDPMMTPPLLPNRDFTNSTVEYLAFCFNTLGSSE
1744 TETGSRGSSSPDDSLDLLPEAEWLSEWCPFDDQLNFWEALDMDMESLIYNDLSPGLVLDFIDDNWGGPALGGIDSGYEDDAGTSS
1745 SDGPVIHQPGTTMLNEMDEHLNLWPQLQLEQCDLFWDNPFYEHPDLNAEDPWEPLTFDMADNSEMESLLAHACNEWDMPFQWNTP
1746 QPSWGELPLGMPINEWFQTENDQLFDPLLDWSPMSETGSGLCGLSKQGTFDEGLADLLRDALDGYCVGDGSKQQGSCIDLGGGGV
1747 WPDPPDDANINYGLTFQHPNCLEANFGLDIIEDTDPQVPILDEGISQRWLNDLNSIDELDIMQCMYDCYAPAEPEMQPMSQDDCP
1748 ADPGTSDHGGPDALDTASMWESCLHPWDVMDTPSMWPFPCDVLSDPEVQSAWEDPLWEEQQETFPYFSGAEAWETSYACGSHGGI
1749 SPIEDTGDGTREGSISQAADCGGINLGTLDLGGSPGLSSDLYDVLDSDLYHDPDFCGGNGRPGWDEGSDLSYQMFELLGMPPSPF
1750 QAFVDTMDGPMEWPMICVEPMPDVDPWIECSAPATDLSWETFMVDLPQMSHNLMNDLENMELTFHPTPLGTSAWSSSKRSEENEP
1751 LLGGDSMLEGLPDFCVFPEFEESELFAPDLMNYQFATPHSEPWTWEITDTVVNPPGSNAGQPTDDEVFSPYDYWSYGPCDILHSD
1752 GTPSPTTAQTCSDHIATPEWPGLYDVPTLDEPDEALPDLYSGMPLGSCLYSDAGPMLEAMMGGEGCWAPLMTTQDALDWAIWAGQ
1753 FEDFGPAMSWLWEFDFDEFHLVPEDPDTFLGPLGDNMFTPGAVDQNLFECPELNFNEQAVNLEFPWGAMPKDDEAEQGPPIDLHI
1754 DQDPLDSFPHTMENSDAEQCDETINYAADQPDHNNPHGCEFDMALNTVDCFGILYDTWDPLDFAYILGLGSDPPSSPVEPDGQYQ
1755 LCNMTTFNPVLHRVTQECTEFPTEGSWDTASDAPHTVPASELENDPDICFETGWLDDMNSACFWPEVLTNTWDDVQNHLPNEWDT
1756 IPDHMQMLVEQITDSQSCEWSWGSPPPDDPSICHESITPILGEFGWQTPPPEESTSTDPLFNNLVDNMFACWPESELDLMAREAD
1757 WFSWEWYPSYKSMFCAPPDEGMYLMFVAEDDWAMIWNALHSEAEAPPAEFSAMVFHGTCEGCQWDGCMDWEWYHYITFCGEPDWF
1758 FDWFEQKPCLQEQTPYTPYPLQGPEACVDDVLDLVCSDMLEQMSNTDMDSSLPALLDMLRDDTWNLNQILVPNELENNPETGWGQ
1759 YDNTNPPSGFENPFVVELCDWRDTIDTDALIGPVPHGAPDLSMLAFQEFIDMLTMQLAEGSGSQDMQEWGSDWLEQPATGPWIIP
1760 TPDTWCPQAPEGNIDEGPSTHPSDDFPPLPEEPTTYWSDALRDLSLDPLTPELCTTSFQAADDMIDFEQWFMNTYCANMYAMNSS
1761 SPDDNQPHPPSRDVDAQDIVPYVYDPFMDVQFLDQLGHDWRDAASDAVYDFNPVDDQYFTQMTQTTCWLPTMEMIQHPWEGTDTP
1762 CHQDQCNADMPGSGDLWMSGHCYSRARTPAWEMSAIEGLSMWPPDDAQSVYSQVEFESLLFPIPEDMRECLDMDLFDLSLDELPW
1763 DGPPKDTFGQDWALRPPLCDENECFADDILGSPSALFDMFDLSFSDLILPGYESFFGLGNTPADDAPFNMDDNEVWNEGPGGGAW
1764 EAGIIIPDSQPLIPETPVKILPCAQDAHDLSCDCINCSPEGGAQDSTIPQLDMWGGSGILSELDMLDLPGWPEYIDPAELDVFWA
1765 ATMLTFPAAFPQEPFPLLDEMSDNEFGEVPSMADWAPGSTAETDSWFSSPQPAPEYAAQAVAPPQLDDVSDMFVTEYNPCLWWLS
1766 HWPPMLGPEVEEPPFLEMPPEEWYSDEIHTCSYDEFHSGAWVEIDMLEAEDGWNDSWLDGQFNDNNDTPLEELVGSLLHDFIPMD
1767 SMLDPSACAQLFDISMWDPLCEAEADWAEWWGIAQPGAVMYGNTQLDPLDWLMPSAMQESEDSMECPQFWESGEGTSQWPGWEAL
1768 FCDQNHENSHLANDLLELLEPGWGDPYPTPSEAPEEMSLHDIFDVMVDGFESLMPPPLGEPTTQCELQGMTICTWEEGLGYATPS
1769 PEWSTNDLYFMDDPNQWWDLDWQRPEPHEDDDHMMEDHPFHEMLGSFINPPPAPDSCCDHFFPISDSYIFAHDNWETCTLPDMFP
1770 WDDSIEMWPPYDLMEDMIMQNIEAVHASQGWCWDDSAHSQAFTHPFCGAPTDLMVRDTQNNEAPASEWFEDIFSDMCSNLLTTNF
1771 DWQLMDQMSHPDDWLCPELIGGGFSDEGALLDQLYEALVEFDCTEEPDLDLSLPELEDQWKFPAPPCPPDGDSNWYTGGPTEEPA
1772 FTDVHTTSYPIVPWIPCLQDLRTYDVPWIPSPIEFSPEPLWLPNMLDDLAWNEANLDFKDLYDVFYEVDGSFHAQQFPSSGNGST
1773 MNESCNQCYPNDDMLYFYEAPGYDIWAHWWPPGSSNATGIFDWLDLEPSSSSGSVPASIPPQTPQPSTASEALSDFELTFLGLEG
1774 RDVEALEGLLDGIEDMLDYAMSWEPCQDCTPPDWNPGPLLEPLLAPSSNLPNLHDGGGLELDHGFSGWLDDGGLPESGDEQWLIG
1775 PYGMIAQTVTDISLPELPDWAQDPFANTIPLDDDMCHPSMYTELWGSTEFWDESGAGMWHGLIPNPQELQDWMDDFPLASEASVI
1776 PDIDNALWQLVWDFNDLEPWPDPMYDFLEDQLGSVGYPVESTGTNHPQQSPVENSYTDLDAVCWTLGPVMYSDEFANLPVHGPAW
1777 MDEENSCDPTISENMPNKDAHMAGPDGSFWSLLFTELDAFDEFIPESWTTPELNEILDTFLNSECLSEEFNALPYGLGLCSHRGL
1778 IFDDGMWGSSMQFLNLPPQHVSPNSQGHCEPPRESTSPANSEYPPQDELWSAFDMPDLWGPEVWASDCPLSPEDDALIMHALWDL
1779 DLCPFFRTADATELGEIQCGCWNREEPSETYVQADAMFQNLPPSFPPAFDQPDWHNMPATPFELDNDDWEPPMCIFADTIDYMMP
1780 FECMTPLNISCSDDEFDWFNMYNTEANCVPVMEPTDQWRLCELIDLDGCMAEDFDAFAGMNEVVGMIPSPAEPIPEPHPINDEYP
1781 GGGVARPMPPTPENAPNMEAVCMAQGCSPTLDLTLEAWEVSQENLGSGPEQSAAALFGELSAPEHCLQGWDDSMFLDWLGNIIEE
1782 ASDDVFMDPLFANLLNFDQNTTSDVLAPHECQAHAPFNPVMPVYEPHSGLRDTNNMHDLLEDLLKDCEGNLIAWLPEDIGYSPMG
1783 AIAAADPTDTNEYVAEWTENLSSPHFYIPDVDQFNSFEDFFAADPEAQAMEPSDPFPLLPCDFLALFESEVQQGVIYSIFQDPLD
1784 LVEMSCECPQMHTSMEDVPLPSEPMFDEPITWEEPAGSSDLNTWPSSAADLMGELMGLLEDDALSLTCMELDELAAHGATRTGGP
1785 TDGPMIDLEGLGAIPLNDPKPITIEAYYAATHETNSAAGLALGSPLMHAPSDVVSSMMENLFDDISWNDEFWFATIMPDPEDPWQ
1786 GCEDPLMALIQDDLLCPAQPASEPFDEYELLRELQMFSSQDLEDVFSQFDNDLYVMYDVGPNFDGDLDPPPSWLVGQLSAEVEGP
1787 PSMWEPGPPGDVFSTIDESPNIDPDAWAEWTDNILDFLDPLSPQWLYALFDNSGDPYPARDHPTVDVNNEFISPNSPGEMSSQAG
1788 PPMETNTVHEWAPDLPDDPHVSSPYQALEQTEQTHSSWSGSTPRLSPDELMNLDITPLDGSSPTAWLEGNAGPPEYFCLDWHDGL
1789 GCHCSAPTNMHDLLQGTLFGVDESDVSDQVFNISPMPAIWSESPPLPVDEMLFDLPLLAAEFTLDEWDVFNPSMDALANAQSPEQ
1790 LANNEHSNWDDDPNLCNLNYAYTEEDNIWPNLTTEHACVDCWLQGWEEIDNMDLAWGIANAWSLMDWKQDTNPVEEGPSTFELPN
1791 GWDLFDDIIQESNNEQLLYAGLLWQPALPSDLDGMPIPGSASLFDLEIDEFNKGQDLPCFDDAARLGDGSQASEASEDWFFDTLE
1792 DAATDFTSNPNELIELDDDTLDNEQSYDMDDFWLDAERLLALEDFWWGNLPSLGDTELSMPVHTETMQQLNAMVCSGAWDIPQDM
1793 SGGGSVFGSIPTNDLLNMTDPDMLGSGETDLWQPDVLGHDDLSWFDIDYKFGSLLDSLIAVDASEGCKLALDDDMPLVGSEPGDP
1794 PNGTDLDSFMGGLGAQLQHPMGANPPEFPADLDLDYAEADMEMDTMSLILNDFQDSSLCSFAFDDELGGEGGGLDQPWSNGGPWS
1795 DGCTETGSQPMEHNMDQLDMVHNLFGDDLSQLLEDLSNPDHGGGHCGNGCWDKWMSDMLNLFDDIEWNDDFIRHGQPVVPGGSGE
1796 YLGVPNPSTYADMQELGNPATMFVPVLSFDDLLFPELEAFDTFIPDSNNSPLGNWLSEAQSDECSNTGQTLDRWLWGYDWPWAWQ
1797 QEPSWLAVSSPFLPGLDDDPLSMQHPMIPDFTYDDFHEIDFNDIFDGLRSHLADEAFGRNSGGPAPGVGSGPQEPYGLGPVGGME
1798 GGQDGVENGKAPRPQVVQPEPFVAIADYFASDMTQLSPHRDLFMGDFLLDNLDAPDWSFDAMWGWQVPQPAPFQPYQHPDDDGNA
1799 NRDSFGPPVTGVFDQLLGNDYLYELQAPAPWDASSAGPTSPYPRSSTNLFEDVSLGDLWNDDASDPPFNLDDLWHVIDHNMPLEP
1800 AGGGSPFAMSEGWADPPHELELQQWTFNIFGSTIEGMLDLPDFDDAPAEIGASILGLDNAGGTGAGGFDPGAGQEGEVDMYHDKR
1801 TCAELNMEHLPAIWEDAWPPPTDSEFAPYDSQLNPFTPRLPDMCMPSNDTPPATLRPSWDMPGVESADLCALFQALGAFEPDFGL
1802 WENTPNHTGEFDEGPDNAFYDPTWNGFQEEFVWPTTDHTPDNMDVVHPMFMIDMDEADSSDVWAHFCECSGDTLHDIWQLPFDAL
1803 MPLQLNEGEYTWHADMEDHHMDDLPHTFADHPSGYDTYYINPPPFNQLPQMEGPILNANMFRPFLHEWEAANDDPWVDWCWSSDF
1804 GGGPSGGSPPPDDPTEPKWMCLNHNKLSDDMDGSFALDRLFTDMLGWDPSDALDVDEFGFELGAPPQQGGDTEDEPMMFNMFEYH
1805 HESPDLWCDPLPDQYELFEPGLDGLDHLLGMDQYNIDDLLTQELRDMDIPLVPGASVIDEPLGAHEGGSGSEGDEECHGEKISRP
1806 THSQDTMYPNPFCVPYMGLWGTDQDADMSPINDQDEGFPSEVDFWITNYVSDNNIFTFNLGQIANEWDQELNSCFETPPGEDMDD
1807 EPAKPSGNGMWDHLEDQLFEFDTVFDFPEPWEISEELFAYPSLQTQNALPELTYMDWNFVESFAEPMQELCRWGLNQDENPHDVG
1808 PIPAQPEDTYTALIDDDFMPEIFPGIFMDVEINSDEAPWDGCDQLCMDLEDWCHVLDNQIPMPWPGWDVEEFCNFMNLEWFGDPV
1809 IALGDFPQVLQPIQVDLSPPNNMDTPEWMYETVFAASVHPSNSGTMDHSLCPDYIEDLLNWFDDIENPTGASDPVMPSPMWWWYE
1810 GCMFPDTPWPQNNGIMLEPEPLSEDCGAWWVTTPAPLDAVMCDEPSELPWSPGSLELLVPHLSPAYFMHDEQGWEDCTEINMWLD
1811 ALDVLPHDPWGQWFAEDFTAISEELALNELLDNHNFDALWDLLGDNLSDTPSLGDGQPGGGSSCSWQFGFGYEAGLPAGGYEGPH
1812 CDNAMTDDDYICFDYLMDPVDYNLFGWYDGPDYANIPDLIEQELRDVDILALLDPRLDPGVHEFGEMSGAEGSPQGGGGGEHYAM
1813 WFSVPEQEAWDTLANLYWGDDGINDHFPVPVDIQPDVLEFEWAMPCFDPIIGPENEHGPCIPDGELPLELAWTLDPGNDDAMHPP
1814 FPVPEMIDLGTIWPDLTTIPEREEEGEPFVREWFPLDCPEFDLNLDFSCEQHLHEILDTILPDEEPYLPMHMSTGASIFETDFWG
1815 DIDWEHLPGDDDSSMAFDSEWVEHWCSRSRPTCINIPMPAYAGQCSHDDKPLEPGVEAMQFPNDFLESQLPEALMDFEETFMFPW
1816 ITPPDVDMIDIDPMEDPFMGCYWPETDECSAIHVSAPQPYTPEADLGNFWRNDFSNTDPYHGNSLEEDLCQDLLDMVALDPDGWW
1817 RGAVAPQIEIPCSAPQDEQAPGPVLEYRPMSCTLEHFSLPDLADLQDEDFDGLSNEMQVIGNWLSILENGGMPEFEHFWDLMDKE
1818 VEECMWEDVIDKIMPPMDHVWGIGPDALCTDYNPGPGPVLQGWLGEVSPLPSVSLWDHGDDFHLSSMDPSSKLDDFDYTNLCDMG
1819 LLDIPTDFDFAENLPVLDDDGQSFEDAVSAWYDLPFPPPRSWWDLLEQEPYDYSCFDELCLDASPMVTPELALADPDLGWMGDDD
1820 NEAQPTLEDQAWIQFQGLLMDEWLFDNPDPLEHIQCMSTCDLADGCAPPFTPLNMPELWNPLTPEWFMGYDPCEGALTDPQWFDF
1821 ARIHHVMQPTPHEMSCLEQLPWMGPGSMDDVDWSSLGPGADMPDIFDELMSSDVFRVLDLSPLPASGVGTVLEADIDVCQTYDSQ
1822 PDPQIGPWDALDDMDLLEAPNIDFDAHLLDWLQDAYLDDLSNPSVESNVLDDGLLTTTLGFMGQYIPSHSGSHRGMGGAPYFKPM
1823 CSDSPHSANDFQGWFDNNDYADDTPFACPWPDFASFAMGMPAMPESLSPRNDVPAGDELLLSALPDMTGSEAFDEFMYELLTIIE
1824 PSGGPSGEEDVVSRNLEDILGAVTSDEQAIQQIFEDSLMCDTSFDDLALPLSQLGIHLGGGTICPFAQDTENLDDNEWTCMLFDK
1825 FPGTFGNWDQAMTKDLPPVLWGPSLSDEWPMWDQLDMPMWADLSDAMNSTEDVLSTDFLNTLWGILDNDNTHCGVGGVCGTQPSD
1826 ELAQLMPYMTTSQAGVQPEDHLFVDPHVHNVCYEPELPSQNTPVDHENSPLDLLSGGFIEELWDEILAFPVDWIFSDWEEVGTEA
1827 PWLEELQYWDWYLDQNEGDTEALPEVLHDEPQPRHEPPNPPAMEDWNPLSEYISWDFGYCDAGDMWGIMSDLINQDDGAFSPQST
1828 SSDPEPSNIECGASYMDYFEFPAFWDQSVFEVPLDCRPAEFSMERRPHDDSLAFGEVIHDAIQALMMGPQPYWDPHDDDPPPCCF
1829 VDNPPTDSTVNMPLHPEHSHQAQCFHEPNSCNMILDYINAVGITEPMEQWLHNDFFCPDAASLCEQGDCIPNWFEYVDDPLGWPV
1830 GTADPMQESKPKGHCQQVATQPDQPDWTMSAGSPKSFNIDESWFEHLFPIDLDPFAFLDIQNGLADLYFEDDQVELSDNPVGYPH
1831 CDDFEIQFRTPPWVGPPMFDQLLEEVLADNTPPPSHAEPQTLDNCYDLDMTLPTWTQLAAFMNIEHAFGGLSDNLNFDDIDIGMG
1832 FDADASPLADSLFTDIAGNGLPFFNAHDLDASWDPVPMSGWSSWELNDLALSGLWELGDILADYPKSMEGSPDAGGPADFADLMF
1833 NGGTPETQCGSWSGKQGITCEPSFYDWYDWYEFPTAMLQHELPELWGFDTPDVTPNDSIGDLRISPDPMEELEMLLGLPEPWSPT
1834 GDDNHLGNNGVLNDSNNLIEELLGWDYWGIPQPESPLLYGHEIQIDCWMPCPRYSPEDIGEPMDMPEMDPPHDVPAIDLMSDLQQ
1835 SDTPNNAGSIMEDSNCNIPTLEGYDTLTLDEFLAGSDFGDLWKPIWSDPYDLDLNPFVFVPAITSATDQQIESEPTAVMMDTFAT
1836 RESVGPQTSLWGWPNDPHQWALHDVTDHPQNDATTALSANKPSGWTIDELMSSDLGSWMCLTALPDMDYNFVLDPNDIDLLGFDD
1837 SVPGMEPPLPPTYWASLGCNADCMVECMPHEAPTTDYDDRVLYEEFDIYNFVDDMMQFTLWPPEMQDDEDMMHAGLDASIPDPGD
1838 PWDDVNWFVQEEGTQIEDDQPEQPFHPADYITAPTHVRTAAPTFDTYNDLDDLNNPMLDWWNTENLHTWCEFFPLIMEPANQSPE
1839 GNMTCDLAVDAPGGPVWETHLTAPDELPSDPMLDDFWLDLLDHVGQELPPLGVLSKDMLNLEDWAECTDWFSQSGTDGGDARNDK
1840 DNMTPLLEDLANIDWGAVPLQPEESHCQWVMPPQFSLMYPSIDEMRNPSPMHLDCWGLFSPLQEELEDWELLEFDLNVIPGDPPY
1841 EDEEMWETRSDYMSAAVFREHAIEEVVYDNLWPPEGLEPPNLHELYDLDVNAMDDLHWEEVGQILIPDDFLPLSLNCYEEFFPDP
1842 FDFCLEPLDIPLCWDWDAWEQDVNAEQNWMPPGESGLQQPLDPYDLDDFALALDEAARTNDWQIYMDDGTQMFLDALEIDDMSGH
1843 DTGKVLQAPNQEQIPKLPLFEDLAAYFDQEYCPMWPGPLLSGELEGWKESGEDAATMLENGDMLDNVMFSLESWPLGPEDFHGGP
1844 TFFDTDNIGLSNDIPDLFYEIAYMGWPPLADETTQHNDTCELHTTHPETDILLGGVGDMLGDGSWFPFGDGSDDLFDLLGECWAQ
1845 GHAIEELDMLLSLTEIDMTQDVAADHVRHSPPYSTTGGAMNLVLGNQYSNTQAAPAEWDGMASPQQEIHPLLELMPTIYDFEPYT
1846 FPQEAEDTFASVHTCCRIQDASDAPPFMPSEWGAVHDEIPDASQVLWPNCTDWQDFMDRFLGDPTYDCFEMPFGADEQYDYWPMM
1847 GQGGRSQPGAGPPGPSAPESQAMEFDALPSMMPDDALDDFDLTDSDPSLAQPTWDHFTYALADPHSSQDWQMFTDALACMFFDMG
1848 DGMDGWLTEWVDTLLLDQPIDDDALCNTLPWFNSCNSWDFPVDCDCICPWNNYDVHDELFAGGMINGPLDNFDVYPISGPSNTSS
1849 RDMPGPQYYGTMDMDDVIPITEHLSTDFLMGFLDDMDLDMGFLEALWTDTAPANPGSQLPGDLLMPLAYGGSGMALGVASDGSTE
1850 GWGSSYSSPPVVTGPSHPFLSGVEQMHTVFRNDYEFEIPPDLYDIDAPEEALLAAPDVLDWMFQIGISTDTTPESQDSDQDLNDY
1851 FAELADIPCLNACIPCDEDEPMVQEAEWFWNICSMPLPMAEGALPERQAMEPSQLSPLDPGDLMALLETWDPYSSDWGGPEHPAM
1852 ETCDLLDIDWDALLDPSDTLTGGVGLDDAGPESFFWSSIDSTPAFEYENLEDNGGGEGGCDHFQGMHGGDDGLQGEGGPGNGPWV
1853 DPYSAFEPPADAHETSEHDIMDLLGTLHMNLWPPIDSQGTIGQSGCLADAPDGASEQAGIDELLQYDWDLYCEDCFGIYGDTEQW
1854 KQSTVGPSLMPQNPQQAWPVVSCDSDDFPQSPYVNHYEDYCPLASGLQTDDSPYTADWFEDLVSKAWQENEEAYQYEFGLPIFES
1855 TQPPVTTPNMMAGTILTLNDELVFDCIDLDNWRNPNIELLSLESPMDVDEWLFALLDEAENMDVPVGSYGGCGMDAEGGSECGDR
1856 NFAASPPSMEAALMDIENEAYEFPADEDQQVFLLPPTPFQTPPLDMTHDLLMDSPEDTGHPHDLPMLFSFVLGEDGFNVAYNEFY
1857 HDDSVIDGSIMDDSNPNIPLLQGYDLGTLPEFPDINDLDDLWKQAWHECIGDPSTTNTNFDSMLDALDPDAGTNLDAQTECYYAM
1858 WQWPPDELWSINMQPHQPFLGSGWCFNPFGGVDFADNPNDMIMHDLFNTQESLDSQDMLNLFDDILSLIPAEPEGAIDPEPINPI
1859 TFDEPFMSNEFTALLNNLLTQGPSLQCEWVDWLGASEDTTLEFWEPWTEDIEQGLVGLGFGGGGGGGPSGDHGDTCYVKDPTLQR
1860 VFDAWLDVVDEENEIFADIMPSCPDIPEPQWREIHHTFTMPMQEEDLEVGQFTFPNVEEANMHSGAPLSTGYIWPQGQDEWNFLD
1861 NAYFTNSTYTDDARALPEQHTNLDDYILQQTWAISDLELGSLSAEIIDELMCSDVFPEMYLWPTPAMDPNGPLERTEWTPEKPPV
1862 YHRTGRDLGMDGNSSEPLWPSDIEEVLASQLPPPNPGDLLQGHELYMPDHAAPELPNETMIHQMFWDYVALPEYEFDDPTQWPLP
1863 QPAHTDDGEICLPFLAGGLLGPFVDGDNDMTPYEIMDMMDLPDLLSTDHDQLDFDALTDNDDVTMSAFLAWLESVVTFNDNAYNF
1864 AMNPPSEFMWDMFSDLCALDSTSDQHLMHGQTEFSTPSTVLMSGYKEQIMGVGPTDFDQVDDPSWDSLARWFIEYTDNDPSEHWE
1865 HRDNRDTDPNSIKHPHWDGAPDECFNELEIESLSYHSAGTTILCPGLSQRDQGILPGDLFEFDDPIEFDFNFWNDLATELYMCPE
1866 LDYPSSPSDYVGSIDLDDFDIDGPGATVAPGGPLRQDEPPFDSDKSDWLQMLFDLGDWTEDATSSWNQTEFPVCAEYDQPQSVWF
1867 DTPFWPDFPQDLSNTNAVARTPVDVQVVDMCDPDYIALNFPYTNVPCDEEWLSEFDNQDLPDYEWFAQWYMSEVPLDYQDAMSTE
1868 DLGDYPSDGSDGCTMTFFEPLEGLWDGICGCDNWSDVETPEDHIFDFEYLQRMDEFSYQEFDVSQFLGIGVPFGPQYDTQDIIEE
1869 FDDHINDWDELVGQPLMDPLMEFAWLDYECVYEQAPLTLPQSQIPSDPPNMWHEGTCMDSLLPEPYMTEPIMFADVGGDAHESPN
1870 RPAFEGMWTWMSNDWSPGMSKELAMVEDSFQYFPVNDDFDMTLLGDTELLDDEDLATAYAWPNMDPDAYLNQWTDETWFDFDAHQ
1871 GDGGEAYMPPIPKCNVSYLGDECCLGDECQGPLWCGLDDFSGFLGENSDSWAFADASWIDAMLEEQFGSSLDMIEEDLLACMITP
1872 ELSACECPSQQNPDEGPPSKVPHLLDQTQDTDISNAGSLSNDITDALMYLDVEPVPFSDIGLDVEMDDWPLEPPCELLDVLAELG
1873 APPCHPGPSSEEHDPLIIFDMYPGDLPMYTPHWQSLYDDLFWEEHCGDLHNMAEVMNNHVNDQFHTDELGQALDDGLLLNEFLPP
1874 TMFDANTTNTMSTCIDPNPVALDCAGTCCCPGERPANAHPVVFDNKLCFEQFELGHLNELCEVDPTCACEVSPEALGWDWWDVLG
1875 GDGTSEQPGARIPGDPQAMSQHLLHYLTDINSLAEDVNGLNDEFWTDLFTNDIEIGSWEIKEFIEGMNFEQYGQGGCGPQEGSLG
1876 TCEGPSSNDKFAPWLQDLHFWPPPQVLDFPNEDEGISPILAQPQNHMNTDTESPPWHHLNYSLYPELEEWDHNDFLSGPPSPPPE
1877 ATVAYPQSASSDSSWLPEDVDWMHVLDDAFDPSWATLLAMSMSGDIIDEEPDAHDVPLAPISPGNADGLQKCWDFFDCWWDEYMQ
1878 LNDNTMGGVAPMDYNFGDLPDSHKDDFLLDSDVYSDLSDFDLGMLDSELAQDSVDAAMHISELMEDLGMFGEWFMPVFDSSCWRQ
1879 DAYVDSMVTILNDLECPLFDFPEPETPDILGLPTVFFDTAISGWSNHSLGDPHFPDLMEESGAPPDGMTCDGGGGGGNGGGQGGI
1880 VPDGPDLDAITDIYDDSLLGMSVVDDDDLWGLGSDALDGFDKDMFQSDWWLEEIWGDVGNPALEPSPGQSAGPAQCNGLWGHMFP
1881 SPGMTPADSSASIFELLCPIEPCPPPLYECHEPDLYCPSFWELFDPVNANYQGIMDALDENWDNVLPNQDMNASDRGDGTFEMVQ
1882 MDNPYMSLLQACLESDHDETAYTDSQATYLPACPPWPFDFLIPDSPQWLGDTPDIVGDACPPGSEWCRADFDFNSEFPIMSPWFE
1883 LIECHPHMMGVQPSPPWLGDDIPCVLDWPPLSYTEPRANQRHYATMQSMEEAIDLDQFFENIDQPCFDLQDPGCDCEGEAGCGFT
1884 YDEKPEQIGVWSPRAEANDDICHDHTYMECPHAETTSLYPLSLFDSETPELGPGDDWGHFYNGVIPSPLNSMIEQPRWETWDTMS
1885 RFSEAFGGWQWYDMFAGMLDTSFYGLFPELLETAEPTSMQPEVQSTELSDLGDLSELLCETPSTETLANFAQHEWSHGPCDSGSD
1886 NLDRWFADHLMDFWDEFDCAGFSFMMSENMNMHGYANVDWDMDNHAFEDILADWVDGPVISELCEELMGPDDPQPLLSSQDGPDD
1887 PEPGPTNASMPSQHLQMILYPEMPCVFPSDCDMSSRGSLHDLFDFEVDNWESTVNMELSPAPRWDLFGPMDNPAAEVEMSGDAAD
1888 FSSQPSWHSSIMNLDDLPELPGSACDESLVNMVPPTDLSDLDNKLWWDLWIPSVVSPLLPGDVDALLEPWMPGDPLEQLAGMGLQ
1889 GGQGSLLQSETDLPAPSLDTREPSDGIVPDAPQEDPFWADIECILGFDGDGLRDLTFPFASLLMGDDFFEQMIENTDLCQMWGGQ
1890 WHRYSQQKTVVDVYPEGSGSKWESEQEPPCDSMNFPVVEHSIAEILENALKSDGSLEALLQDMDWLMRLDCDPPTDGPLGDDLGL
1891 EPDPSSWAPPCYDWEPVLHMPPIFPECMIMNPSETLDMELEDMDLGHPAQGDDFNALIDASLDKPDATFEQMFTICISGPDDPED
1892 LSVSDPFFVAQMDLFPEILGAIPEEWLADLLPPLAYHPDFPLDMLAEPRDPGADLQDLGSTGMWWESMDDLGGPSGPDLVCFPNK
1893 VPSDSTPSGFTDNEGDDDKPNEAWSHAPRYSYMPEWIALHSHEPLISDGAQLDFDCLRDLDLTGDAIFLDLLFDFDNEMLGPVES
1894 GHDASSDGSLLPDLFDELHQEHETIDVPYMPQLEGFTLDMLGSHDSQSEWSLPWDHAPYYGHILPPALGDEQWLTTTQSDEDFDI
1895 DPADADSMSAWDPFFHELLGDDMAIADQEPEDIIHFHPCPHDTCMEPPLHEPCCPICDIFRDLTEHCNCFAELEQFFQDYVTAFN
1896 GSSAMDLDDDVSEFLSAMDVAPDPVEDPQGLHLDINPLGIMTWEDAIWLCADDSSFHNIPGLDPGLCEALADDDTDFVWWDNWDA
1897 ADSSDFNPNPSMMAIEDLWCSEMSEFAPDLRAFPWATPRTEPTETYNLDWPSGVEGTYEPNDQLSFEKDILDELYCSPKIPPDTI
1898 ASQPSGPEAGTGPEFHELYDSANVGHQASLWEALSQMDQDGLNLINIEDIDLEMFMHDPFSLHIGEAWTVPLEDGLGVGMGTLPL
1899 TEDFPHEPMEYPSCLPDTLWSDYFSGEHLSDCDFDNLCDDTPHDGLDVPASMNLLPTPVDTPLGESWEHLFAQIIGMNPWMDDIV
1900 DWPDEFSKLLSQWMDDMMDNCEFPHDLPDPPDNYIRFCDPFDGLEVYESHHSILGTDHVMPDPFQIDLPGDGYGADNMGVMHAWY
1901 DFEVDASQPQYEILQTYKDSDMDEQCYCPSLCPQGPSNPPPASTAPELDEMVRDELHLTELDNIQLWMYLLDDMFEQMSGDGSES
1902 ERQPDIQAYMDQCNPMQPEMHDDAPQPLNPETPEPSPSCWSQYDQDEVFNFIGSLHSMYMEMFTPNPNLDDDAILSLHWGEHFDQ
1903 GRADPTQSGDAWIWEMMGMLPPLDSLKCFSDMYSTDIPCAPPIPTWHDTPSTEGVTDGLDPLPPSIESLQELEHLLGIASDGWKQ
1904 IDYEATNLDHLDWMCDEPFGFCIHEALATANPWPPCDDEIDDNAMLDYMCEETPNHTFVPWEGSDWLGAEQEMVEPEHMPWQDEI
1905 TNVGFSDHTSIPIICNLPSIPGPYHGGTLDAPVCYTDFEDLMNELPSDMCFDEGGQGDGQPFWEEVWGDSFPISPETTWPSWVSG
1906 AAPAELPCDIEPCFCITDNLLEPCVVQHQPLTPYSGLGLLDLDMESLVQPDFLLGDTADDGWDDMWQASPSWPPWEFLTNMCDLF
1907 QMEPASKEAQPLCDDPAGFSTFFVDDFPAPSEASFEAQLELNTPELFDNWIEWMRGTFNGQGGSGRALSEESNMAEHGLDWWMVG
1908 EPSNSPQDDEPYNTDMGPLGFTPIEFCHFWWDDMYQGIPPMDIGQEGTTEGQTASEILPMNYDSVGDDTFDQMLQLVDIPAWMFY
1909 GGGGSAAPGECFDTVLMGELEAMLGHHSDKNPDVDSLEDNRLDLLSDDGAPLPDLGDLGFGFFAAGMDGSVWPMGAVCDQDTTGM
1910 GNDDVWPDDTPMDFCQDWWPDNLFDHTAGAKGFGKGDCWLELGLNTDLDLRAEPLDLDGLGMLMDWHALLPDLGPVGGASPNWEN
1911 DTQSPFQFYNDDDLFGPVELESMTFPHVGLNYPPFELPDVELPPPAASDPPLPGVINSNGGFGHLGYELDWHRMFDCPEPLPSSQ
1912 DWNGIDILFDDPCGNIEHSLLEPSETAPTKASYFCATMFPLEELDQLFDCPEQTSPFDMNMLDPDMDGWSGQQLMDFAQDYDNGA
1913 CPGGSSGANNALEFNIDLVDTHIWEDFCTLETTSLPGSPWAWQDTWEDELRMEPDHLMGLLDLEYPADLMPDSNAFELAPDAQPA
1914 PTDGADNAGFSTSWSPSDVGQGPWDGSALYDCLSGLMGDVLWEVVPELSDLDQMGLLMAETDPMLPAQTDEGMEPGNIEQDSNGE
1915 IFIDPPETHSCPEEPPIPPSHWLPIPDDLLDQDVLDWMGEDSTHDNAFDPWSPTFPNLADMSYNLPYFVDVEMCQFMEDSGPGWP
1916 AEMDFFHLTMPQSGYPLMGGRGPQPRGALDELMWSAPPYGLDLNLDSPDTSASCNMLDQFLNDPEMDRAMCDEDLFMLLGTLEWD
1917 DTEDFGWEGMPSNDINGLVYDEDGFIGIPDVADYPTPWLTPMPDMDSILPLMYENIFALPNIGGSQEGHQTGATQQMHEQGPFYE
1918 NTDFFNFDGECSDWEAWANTIAEPDFNTIYYPIESPEPEWLGQMDLPDVETAYAPFCSNNLMDSDPTPTGRGSEVEELRWWIHDC
1919 VWRAHIGSPSPFMLGEHQPLFFPMPDVDNDELTTYEESMMPTLNLLSPGSEGVGYAMDDLLEELESMDGFWYGSTLIDEETPWLG
1920 ISEIQHFLQQHFNTCDDYALEPELDVGDSSPGWEAPVQSASGDLSIQAMEYTLADFFQDLNMWEWPNPDAGSWTLVDAMDGDCHT
1921 ATILPIWADEPNLFTWAYTPRMEALQQTCATDISSAEMGLPGTYANEPSSEDFWGLQGPNPYTDSETEGWDTNAFWDILGNMLSH
1922 QTWDFWDFWQGIGNISQEQEIDEIETWVPETSWTNPLHSPVAIPSHMPAPLVMDDLYLQNDLPPPVDDDFLINYPGWALEDINLT
1923 PDCEDLLQKPEPDQPPSDYPFPEDWGPCDSSKLETDSLMCDTSVDDLILQGLEHDPLLEMVTGGLGDTGYADFPDGEDLCLMIGE
1924 VMTDCEFLAFSSADEGPSSFVPRCDVLAYDLWPPPHTMFPNLHELCESLTDPPTQWEPPEVGEDLFDESMVAHNVFDLMRGDFAY
1925 QDQGSTLPDYPTFEDTISSWEPVWQECDQGWRGGSTAPDDFQGTSEHHPMTGITQPDMLGSDALDDFDLHMCGEWFAWINSHWMA
1926 DTPDEECPIAAHVVNSWHAEEHWGWDAWPPENETSWNTLFLGNGINPDDPIAGLDLDLMTFDWMLEMPDLWRMSQLSMSPTMDWD
1927 FDTWKSDLFTALCDDMAGFCEALEKSFDLDNWWAPEATSGAGQTSLEDIFLDSPGDHWGLEQAPVVEGGGSSLDQVMEAHAPSGL
1928 AHDCCYTALCPHPCCEASGGASDFGDILTSMPYEPPLESDIEWQVYEHCFTPDGIYTLFDSEQYIDMGSVQEWLMPWGMDGNAEQ
1929 CCPNCLLEYAWIIDICQNNWDDWDPFPDPYEIIWMMLITSQCHDEGDRDIGRTCVVHWDDMLCADCENNDEMNMTNMIVYQNNWW
1930 SVSDYSVATSNGMSVMEEDGVDWFDDFDLDWLGSDALDLLDLFDEMPALGASRAATMSFLSWQDYMGSPVGNMVGGGVSPGDDLA
1931 IWDIPPLDCNWVGITDDSSAACGQELPGMELPEPMHPYPPTLHPLFDLQNVAPPDLLWLFPDENCQMWLDFLVDEGSAPCNGPMC
1932 YPNNLLEDYWPLENIMVDDCRNDEDVGLSQQEPWTELIELTLHELYDFDVTAPEFPFEEAVSQAFWDLDEPNSYYQACGGSGVGD
1933 DGGPHCVCLPGACQMPCPEPCRTNFASQWALYDIPMMIYEHPPDWASLPSPQPQDSNSWIEDAATMLASDWSTSDFVDIWSILDA
1934 IGEDTPPCDMIMTQMIEAPPSDGPGEFTPCNWRGSATDTDVPHSQEDFNLATEIMLDALMSNPQWGPNTNCFPDDMLVNLDISHL
1935 VTDPPSQDTSSYNGPVLPEIPCGTNNCDLPEFVWNFAETCQLDLSSECAILMGEEGAFPGLHGQKYGDNDIWFPSLEGGMPGGTD
1936 LDTQPSPEAAEHVSLVPEDNYYTTDFDDVLKLLGEGLLDLPDLLQAPDWTLGIPEMVAQHQKRQDSNVSSQLDLLMDGGGDGPGP
1937 HPDMEVIDLPILIPEVTDFFDQDVAFTFGDDGPEPPTPWQTPSHCDLYDNNVDVGALDYLEEMTTIGMTFAPPVESGLMNMTPFE
1938 TTPGTYDHDMYKLLDNIIFQNPPAFEDLSDLDALWMHGPVIGTDHDTWHDISEMTLEPAKFPAAFEGSSLHANFLDDLSNAWDGE
1939 APPVDPQKSDQAHQNLPEMHDMLWVDLTDTSSDAWGPNQLNDEFWADLFMMYDDLWPGGWSGGFGQDEYEMPCVMGTEGYPSDPP
1940 ETCQGGWQHMTELWPYDIFLGEATEHNVTDVWTFNPWNDLSPDAAGLAAEPLSHFWDDSLNWLDQLELMEDFCGDGEDSNWLTDI
1941 STESNGEQTNEDLGETHLDYLCDTDTHDQFWTDHEFYFPPDSLDDLDLYSAPDVWGHFSALYAPGCPDGGAQFHEGSNPDQYWAM
1942 PDDQPGHPEPHASPPTTPGSPEPEPSHFPWDPFVYARSESWEPPLSSNGIFDIFQALEAIDMVDFMLPTEPQNWAPAEPQSRNFS
1943 NDSNFEDPDWAIGMPDEYNCLGSPFMSEEYTGSDREAYLADLDDLLGLWDFMDEDLYVEEMAGLPEFEPGTNNHIWFTDMCAENP
1944 EPCPETRDLDEWIDDYFHEFMAITMDVPMLPPTAGQTPSLDGLFETSENVSTEELWFLGGGTPYMSMPSSILDPWDHIGLGWSMD
1945 FNCDGPDTPPIIDQITEILEMCLQGDVSFESLDDLVLESLDCFSPEPSLDSMWCDVNDSSIVPEGPSEGFMNSEIVAAPQMDRHD
1946 EAESWQPSIIGEPMAEIMDWPCPQEACCNCSEQVNMGSCCTSFNLDPVEGHSGPDIFLGLDNDEVEELFDWHVESPNIMPPSDGP
1947 GGGGSGSMPPTPEDQGPIWWNPFADYTDSDLPSGPMRVTSDDAILDFDALDYKPDPPCVDFQWYDMTPADASDWFDLLMLFDEEP
1948 DAQWPPPTEDMDIHMESWVPENSFDEWHPVDWEYAAETSGMMAKDMLYSSPEDNGFGFLHGIMPSPITDWNMEFGIPEFVQTDQV
1949 DEIFNETLAMGLLDYLDDFDGDWPLSLPDSTPEVDQIFGCRMPLSLNDASELKDLCWTGPPNDEGDGPGSWGTIPINVDPYPMGS
1950 CPGPLPHVADLDILGLMFALEEYAPCAECDNFSAAVTLVDSSGLWDSDQDPARDVPFDGSFLDWIMEDFDVWPPSIGHEEDSALL
1951 NWFASSIDLTAHWDLDLPSDDEEEEGAFNSGSTTEYLLDLPDDDDLLAITWGDPIGDELNLDASPPMTPSEPLDYFLSTFHGPLP
1952 DMTSGQPLRLQNINHGTNEIADEWFPNIEPDCVWNLPCDDSPSPAWGMPTQLWSIWTCSEMHDSVDSFDMDINLFDPNAAVSPPD
1953 DGTPNDDLFANITQNLPWHHEEPPLNALFDTSATQDSSMAWDLWLDSPESQGCVVPLGGQPTGGLLPYMNIGIDCGDIFDLGGDG
1954 HVMPNDPDMDNETICEHIQVYNDYQGYDHQYHNDPLMDTPGSSLQGFRDPGAQIEWVPMDYESFINWLDVPAWDPFDLCSPLWYD
1955 NMDEDPNTDNEWDNPWIGPPNNWHIWEDLQCSTPFQLSDYDKECEFNGMADNHLAAQICDDLDWAPFMQEQEMWTEPLQDEQGGS
1956 DDILFNMYEPANPGSVIHNELGGHNPDLESNTEPDTPIEWTVNTGPLLNDCCEGYNLGNDMQHEMLNTVPIPWFEGCNVDDLPWV
1957 TSSGAAQSTPESLYDEVRVEVPEPHQDLPTNLTFTTFDGVLVCDLESIINSMLMPPMQLDANFQDDFVTDLWGWGDPDGPDEGAP
1958 DFDDFQELLCADTLWPDLLEGGGDVGGQPHVCPMGPFADVPMTDWTSGLCGHSQLTCCDLSELLSQLGDSSDMGPFAPGDSCYWA
1959 GNAMENDATTCVTMCTDEGPIDNGPAMPADALDNPSDWSDFGTSSAITPDNYWSDLSFEETIFNRMAIEPLDWYDILVGLMNGPQ
1960 DAIPPTEEEMRDTHNLPAPEEFSMDEQAAIWGSGTNVTMPACALNFPDWEDMMTHGSLFVADTMEVNEEELDCLLQVAEPLCYRH
1961 FPPEDPPGNACLDMAPHILGPNGCMPEGTPWPVPFPMSDETNQLGLEFCDLDIPIEMLFDWPHDIEFWWTEFPNELDIAWDPNMI
1962 NDDYFCADWDMGNTAEEFFNIYEMFDLSHLPPIDSTAYPVPTSLPSGPSSSDGSAPSFGWIYGGTDMQVQEWTAGQPIGALNPTR
1963 LSDGGTMDLDHSSLLWLQDLPQATIDLPPTDFYDGLMGHSDWSTGDFSEGYTPHMHDELCLDDLLSMTTSESWGAVCPPPPAKRP
1964 SCNWSREGRMTPSDYRHTYFDDQHALTPLAHLVSSAPAQFDCLDLIDWPDFGDLSDKPYGMGDSLDFEAIFGITGHEMDWNQLWA
1965 DDPPVGQWDPTFYDDTDAFLPPLDHLENWGFDIQENMYLFFQSMFASHDDLQTPSPNVYDQTPAAEWICQAASPQEEEGNNPYGK
1966 HGKNTLMWPGHCGSTGIEPASLELPFDGTYNALPHIEPPHIDGWLPHSLPEDSAALDWMDILEGMDFTPTDWFNLNDGPTGTIWN
1967 PMADGGCSRHGCLWHWEDALLPGYDLGTLDEFLDPIDLGDLWKTIESGAGDFGQGDMMDPSDTWMQAHDSSQCTEHTCALDDPVG
1968 PEGPTQPTHDDEYATSSAHGLEAHQDGAEMDPCMLDWMDYVYPSIDDWMCDWMHWFEAPDPAPNSPQMCMGTNCNLQWYPDHCDI
1969 ADDLCDIDDTNTKEDDDWSAFDLGVYPDWGWPDPEHIQMHFQPFHTNFIEDQHILDHVNNPWPLDNGAQGSAPQLPEMLWEAWYA
1970 ADDATTWFDMLPGDLLDDFDLTEGWAAMPWAGPANLLGQLNLDDFDASDISMGAFFSESEDAATVESTGETSHPHTWLYLTTQQW
1971 QNPIDMALYNADFMDIFDTIPWQEGELFNLATDPGFESYQPEIIEQWTNVPTAEPLFDMCDLNHLAQQAQNTDNISQSSFPSPCQ
1972 GADTLMVSHISLGQEGSHPGSEGCLCAAPSMDPEEALDFFPPEPLTSDGAFWDFDCLRCNDDTMMDFPVETMFTDALGCDIGDGT
1973 LDLSGLIPPDDNYQHCNCYCEPCKPMFCFTPQDPWGTDEDALIDSPWFASVAEDTDYYNGDNMFWDGFNPMEGVLDHILDFHQNE
1974 DSIVGDSSHEQEGMAFRCPSLSHPWPEDMFDPSQYNDLDLPVIDFPDIELKSMERPPTMLWMHSEGYGPEAMYTHNSVEDWLMQV
1975 DYGGDALFPADVSLMDLYAAEPVDIDLTDTDCFDDFDLDLQLKTEDRHELLGMLDNAAYGWDGIPDRGFPDMSSDMTDWFATNTS
1976 VAPATPQGPYNQHWDEVPDWPVFDHQVPCDFPVPTWPNCLTTLGELNNGLTDLDDLMNGSPNCMQDDTASPEEQEPDLYFYFALS
1977 PGHNPSNRDDFANFFNDLNACEAEISQHNWIVDWDADAWTEVLRTGPEEFDIPSAVWNMEWDDVLWWDCEDNLDSPWWNQPCDGS
1978 ELNMLEMPQDQPGAPNHTDQCEFADPLAHSQIAIPLDWMGWFHPPPSPPYMDHLNPNDYTEPWDLAWDYFLPLNHMSVPEDGADD
1979 MRPSPGTLFDMDTLFNMAPEDDEHLYLDLGDLTASDTVNIFDGYLAGSTCLDEFMMDMLGFAALGDMDTDALGSPDGNFPEPVGH
1980 LNLDDFCDWPPLGDQNENEDVCMCPEPPDAGCYLDMSSSDFLKGFDLDSLGGSMDDLIDIAAHWNTIQEGWSEAHSPELPSVPEP
1981 SCQLQEGAEQWRYHTENQMDCEHPVVTHEFDPIFVNPEFPSQDTRLSHEEDLGDPMYMPPNLDMFFYAWDFDYAPPTEWMTDGWW
1982 YDVNESWTELTTFWDEVSEILLAPSGQSHADAHEEFHPLFQADGPAETPQPVPCVDEEWVWLHCNYPMAWFEVGPPQPEAGNAMD
1983 DTDYFDAMSWFDNILAQLPQHGPHLPEHAYVAPPAFDPVMAPRRGWFCEDLLASPGLSGEDLATIMEANESVFPWEGLWCNTTDM
1984 TIPQYGPNIDLTCDTSCGELLPPSDALAPVQGFEVVSMIEGDIIDELLWETDKLSSLEGPTPLADPWVWTDNDELRELMGARNQW
1985 DAQACGVTEETLETVTDLDFPEEGSVISPPFEFTPAEHTTAPLGLWFSAPMPAPELAYLYSDVALGMVNMDIDFEPWHYAGQELP
1986 YDPGLYHGFPPPADETQWMDHIFDNSIPPDWWLTAEAAPSAPSSHDLLASGSLVPSNDRPQGDPAVWASFQCWLGPDFGGNPQMD
1987 YEERLEVKQETMPVDSDGPACELWIDFDFDEFDTGLALLNGDAIDGLFSSDVTPLLRDYNTDAEQTKHDWCLNEEINVEPWGPAP
1988 WDDMMTSTPEERLLWSCMDPLAIPLLGAPDQDTTQEMLAPYPMTNSQPMSTGDAVTAPDELETLIRHWCALEDFDLGLPDPLSSG
1989 TMNQARDNIAWSCASPESEVLVPGDLVQHPCPWITPDLSDIALYDPITFDESFDPYFIDAWYFSQDDGDVGTMFIMGDDIEHMNE
1990 NACTWHPEMDQLCSQLWTYDDVYFPSFDNTSDQGMELQDAMLDPAHIQWIKEMDQLLVESDYEVAMDLQDDFPSFGECNDFGDSE
1991 QQDLVIPTDWLMPDYILCNEPDDGLDELLGMDYTNIDHLLTPELRDLDIPLVPWGNGDPQVHEFLERSYAHFGFDGMFGDADFDG
1992 VDTASDMFNLCKQPADNDLWTPYCWDHMDTAPPASNAHPAPTEFSQFNEDDDMLCDLFIAGTSQAGFDLDDWMSMYSGITCREPV
1993 PFDIFQEVNPHPCPPSILMQPDPAWEEENQNDIIWFSGDQIDFPDMMNWSDPDVPEGEFHDFPPLTSNMYPEPLSWAAPPPELSE
1994 DDIWWPMTHNWDDTFQGLLEYEASIDFDTSFPPMPSDSEWPLPDVMVDPIAWEPLGKLWNGMFTDTPEDGAESAAEEPKDEAECG
1995 STCHPCHNGWTIMEPGNMDTQESEADEWGFACGPLWGSSGISLDSDWLAANNMDQNDLGSDFAYYMEAESMNLADGSGATGPIWE
1996 EPSKPMDTDGDAEHHCTVVEVPAFWGADPAGAQLTFDLWPESLMNMCSEDFDVLQPQGAGMIPLDGWGFNEMFQDYIGRSHQGSY
1997 CEDYLSTLVPPGRPDEHGPSSRGFLPQFDMNLDPGFDFDLGLDMALDDIDRLLCFIGREMPFQTVTQSGSAECNQGIPQQATHDV
1998 EAAQMSEAAQWWEDLEPLPLPLDYLTYETFDMCQGTYQWIATPQEPQYNEQIPFEVHIDRHQPFKEFYWPEPMASDDTCDQGDGP
1999 FVMDDGDAFDFDSIPGDDAQLPPLSSDELVICKLDMLGSDTLTAEMLDKLHPGGVDTESNDSDVWFSLGDLPDLGGWCSPCPMTF
2000 PDDIYILVPSVSNSPEMIECPLFPDDHQNLLQSHIDPLLATGSMLDMSQEEGLRNDAMAMFMNYLDHADWGLMFEHAPDSQESQA
2001 PTPGDHFEGIYDDLFQGLAENEYVFDFPEPVERPERMNSNDDIIDHLSSADDFPLLWLSPHPGGGDDGGKYPGQGYAPAGHDWTP
2002 FDNAIPQEPHWWEGMEEGSLESFEWEGPNTFGGHGSFGATSTDPLLWLPQLNDVSSVLCETAEPHESFMNYLGEEWSAGRAAFEG
2003 TDLFGILADVEGGLDILPSQLWPTTPINDCPPLFMHDMTPNMTPINETLEQPLETWDVENCMQRWHHMPEREWLDYEFRMCGDMQ
2004 SIGDPPPGAHAPVWLHEPYDHDCNAPEDPWEEDVSQTTFDDFLELLLDSSPHMYFPPIYFMNEELVAEPGNGQPGDGACPPYGGL
2005 DPLEFSELAMGPKDADCYELYNDWIEAHCATDPPPSMQPEGQLWDVWGLDPPFTVDPLFAEQVQLEPWTLMHPGNVDPLCEEDWG
2006 SPHGTAMDESIEGLTELPEGCNPCFCCQPLPPPSAFGPTFSTKLWQFASDPNLDAHMSGLPAQEEWYNDWILQPDFDLTLQGWDA
2007 DPTDLDMLFYDIFGPPPSNMFVGSCERSDSEALLSIPDMMDITPLGASDDALGFGWGLDEYADPQPSGAGLYKLEGETSPTSVPP
2008 EECELGTADALLDFDITMWDTEEVFDRVLDMIDQDDQSLWDLFDVEPDDFEPFDLDKHIGGGMTGLDLSMNEAHSELSMLDMLYS
2009 VPAVHPWVPPMLLHDGVIPNDAGPDFGSDILYPSTPEQDETMWLEDWNFWECLPLISELFAMPLEIEQAQSMPADAGTNVYGDLN
2010 TWGAAPKPGMQELWCGLCEGEEGLGQLPDGFTTSEWPPATMEQQTCLDPSMTWGGDIFFDNNMLGPASEWQALLEEYVFNFECDQ
2011 GFPLNDIWASSYNPGEEFFPIVDEEALAHLLEANNFDADWDKLGEHLSDMPNDPFHDPGEMPEIGAFDLFNPESSGGDGEGLSGS
2012 LSDDIPWSWINFDNFDVVPPWMMSSWDTDAMLDLAATALQGPGMSAGTDTPPPTPRPQWHNDLWGADPEAELVFLPDATEFGNQS
2013 TPEPFGQPRWTDQMQDIQAIESITDYSVDNLYPPSFLLPTLAEEIYDLDDNAPPDNLVNPGMYLEPLNDAFCPYNDFDLSTFWLA
2014 DAPDPPLCVSYNINAHGSTGQILSKRNEELMDDPVQDPEFDATLLLADEPDNQPFLTMLFMNDPIDFWFEEMCMDAWGCDGGLNH
2015 IATHDLWTRAPEQFDLWDYNQSDYDTEQAFLGYENPTLSYDILTPWMATQPAIHDGQCEGASPTMIDEEWTLAPPDHSHNDCRWR
2016 TDLDDLMYSPSLSDENTDWFGANNMGKDWTQSEIEGLLDLPDDWLAPAMLPDLKVHMSPGPDWGTPDDGQKGPPSTDHSLMDTCG
2017 GDGQPAPAMAKPMPLDLGDTISAAEAGASHGDAPACPGFQATAELDFDVLGHLDDTLIYAKMFQGELLGEIGDIDVLALLDQGLQ
2018 LMDDPWDLFNNLDELFTDKEHPAVCELGLPPPDIGQTAWPMGTPRNPHDGVSTYYSMVGPGLEIEAFFDEIPNTSDAFLNDCWCF
2019 AGDIAWQDATADESDYGGNCFDMLNVNPEFFQPDPWCVSDSEFWQFWDLPPWVNDDDIQDLMRNNMMILGEGLHDADGRSEMDVQ
2020 TQTGQASCTGTWNSVEDIWPEEEIEGPSTPHSSHYADNRFAEGLAGSNAMADTCIVSVGSGYGFDTIEELYWGLDLFSDQDYDAA
2021 EGANDCTCAPGCGQPSRYSPFAPEHAYQECELPSFPTLDDYDWMSIMSPPTHLDLQGCLMAMLEWDNFSDLPTQYQQWEWHDMIE
2022 EVQDPHTPPDCFFGWPTVPNIDEYSWEWESNGENASLWHELMEHSDSPRPTLGMGTPHTFHIFPLDEFMYDTGIQDMQTLPPFDF
2023 VVDDIISTTNRPWWMDHEETEHNGEWNLPEGPNPPTPSPWLHSEENSSRPMEQPRVTWALDDLVMTNDMWAPCIGLTTHWDWGGI
2024 EGPVEPTWSLTCSEHECSDDTTEQCDAQVWFEDPREFSASFNEMPLPQFAYESWWEEMNHNDDPIADIHLIWPWGDMWWEGLSDQ
2025 WCNYSATGASGSGQNVECEYNEHLYEITAEQHAQPNPWPTGPTGQELPCTLWDDSGWFQDVMFDTFWDMTLDDLLQPVMPSWFSG
2026 KVAPIHQMFTYGSVTSSPVNDPYIETEFDSDWMEPMRTFILHNTDMNCLPNPEDTAAAMEQPEDWIMDTWFQNWLNGPGNGLPTN
2027 ATLHSPAPPSMWVQGESEGHDPEHEFTLHADDLSRFDGGSILGSIIDELMDMDVFDGPAKSTPPNDWEWMAWGPSFTEWQSHLNF
2028 GDLDDLPDENRWSEFQGLLETLDDFQFPQPSEPWHSTPAESPFVEDPYDLEGDTPYHDVGECHWNDLRIMWDICLFEELGHMTSD
2029 WGESSHTTLIDLGYQDLPPEDEYQPWHEFPIVDFGEQDTPWDWQKWDCDIDTSNACSMGALGTSCWSDAWEFWEELDLTTPWPED
2030 APPIDGSSTMPAGLEDFISMPCSPGVLDPFFSYTANRTSGEVTGDESLEFELLPIAIEQIYCQEDWPDPFSQYAESDTPSQFWAF
2031 VGLMDTTTAPDLGAVFYDFNADDYPVQPMCHMVDNLAQADMAELKWWTFSTGAALDSIMDQPDWSQASLDPPETWAEFDFSFDLD
2032 GVPGGDSTQGQITMDADPQEYQHTNSHNYGGRLAPAQIDPSATTHDWPVIDDIWYEPALRLLPLDWFGEDPWVVGHDMQMDTLPL
2033 PYDDLSPCDGILPFDPMGMENDNTLGEPWPWSLSERPSEICLWVNDHVPGTLQDLVYAENIDNMSFQDLDWMDLHSPWTAILGDE
2034 PYPIVGPGSPPGLEGVHTSAQPDHWLSPLQDMPAVPTHSAVDTTIWEPEWSLPAFNHYAFYPEDSWEDWVDDWNIINFTMLETNF
2035 MAYDSPTVPLNFDEQLMEEWFYDPQLDLGPDTAEPPTPNGEPSILDIEDLDVYMPKDCPMEIALLDDFYLWDARLLEEGGGTPPP
2036 SDSWLMEDPLSDMFDAFHDLLELDVPVSQLYGLGDLSPTLEHLESALSEAFTWDPSSLSDSGGGLGLWEGGMPEAPGELDCAGFH
2037 NDTDGGASHYAGGDIFTPTDVLLHDSNMHPMQPDDYAFMEIPTWMEDLGIDLYAELDAPGSPEPPHPPAPPAPCTIREYFEDVGC
2038 ADTCVAVMNNMHLCNPPQPELQMSDTHLYPMDEPMPNIFSMNDIHEPLYEQFELPDIDDIVEIVCNALEAGGLHSEMPWPDTWWP
2039 ASGGTIFETSPMDIEFPMPAAHPWEHLGLYSAFEFWDGALPNPLDLDSLALGDHEATLSHGEDVGFCHLMKDDDIMTDPPNTVND
2040 ENEESNEAVTYYESLSDIGDCLHPGRRLDNHEVYDAFPPHDPNYCYGVPCYEPSIQDLFENEMTLDCLMQFWTINPCDGDNIGDS
2041 PPVQHQAPEDFFIFESQCDPQEVTEYFDFWFPCEVAETVPHIGPHLESMMSQSPYTEHIDDQGGQDNFDANWAECMLCMLDGFAL
2042 DMPIWCELDKFFDIDDCTPPVSQSAALNQMSQLSAPSPALESQGLAMTDMPLGCGELLDGEDFEAGHLDDFLDWVDLEMGAGNQF
2043 QEWASQSMPDTPEAQTPFLMEDLQYYFSADEIAADTVLYPSPMKDLDGTSLEPLPPADDGLESLIRLFTAGESMHLDEVTVQMWG
2044 DMPGSSEFLEMIIDLPEEPWVGNACGWHDMDGPHQSDELPDWVESYLIDHQELATTGGCINPLPPWNMTTLDSAMWDLFTDDDFW
2045 EPATEGPSVPCLGPPPLFMDGLSSLGPMHLPPGSSTSGAESGPEILWESLKSDVSFESLLFQGPSMGGIFDLMSSITWSTWGEGE
2046 VGPWMYGMGEIINNMPLDALQVDELFEGFTCDFDCQFPAAHTMLSSVLQAPETTAREEPTSWEENWPYGQASDPDPYVQATGQDS
2047 ETEPGPSFMTTIDTELEHTIPLGHSNGHTSAYGSTIAGLLDLPADNAIDELFMGMDLPGPEPPHHYHQWAELDGTDPPLMPWPHL
2048 PAGAPSRCVCYSDWEPIILCDDILGLDALNKPPTWEGGSDLLDPFDLDMFGHELAQHDFLGGQDYFTGYDGGSCSVNDPPDPAIV
2049 PGMDMEPPPDIMPLTPMPPPFEHGDWCVPTDMTQCKTMALNRDSLDLPLTPCMTGELDGFFNDMEHGELECDQFLMDVDFYILGP
2050 SHDDMQCEATPLEHLHTNIQFHYAFNQDFIEPNMTDITNYWEDGLYLADLGAHLFGEPEPAAEPCSDEGLGPAINNMYIFGLDEP
2051 HDSQWCSTCAQRNDDPPTLPPPVPPWELSEEQSWCNENLTPPELWGLNDIEDAFVTYDLGGLGAMDIEEQTFLNPTWEHFLLGGQ
2052 DKGGEVQESTPSGAPLLHDPGDFSTDVPLDDEWMFPMMQLVPDMTLPLDDYSDAEIMDLLSCCGVGWPIEADNWTPDGTEFGEPW
2053 CIPQSIGSSVMDMLPPLSRLHDGGPVYDVGTADHMPDNVPGPEELGFTDWSWESMDNDQDGIWNVNHFPEPTTWSSPCWDLCYLD
2054 APLRADCSADGMPPSFGLDDWDRSENNTLVTDCSSDGIIFGGTTLEEIDLDELTFWPAIGSMADDEAFNSMLYEWWANMMDSDDT
2055 RDDALFQNENWLHEYPMDVILGDLDCPFEPPTQEECFWQLITHALWQCQEDEPAPPPEFFAWDLPMHDNELAWDPLSQDIEDHNM
2056 QPIEDIMLGMWDEPFQGWDLPPIYPTCVEPPPSWTPAPWHGPIYDELDLESQALSAAQGPPMLHELVNTEDFYIEGGTAFDMGTG
2057 YYNYWERGGASGLEDAMDHTLQGECMPCPDHDIPFDDPGPFWDLEWTACSDIEQIFWPEPMPLQEDDNLWPSLGNSHDCAQWGTD
2058 GGTDDFELWALMEPILGSLFEDPYQFVPDTDMTMGLDLAALGIGDSLSPGETTHDEALGENLDMPPEISTQMFLWTVESPHNDFC
2059 DDFDHDFEMLNAEDMATDIEIVDNFFDTVDTTDFLYLGSPAWFDFLLDIYDPNVANRLPPCDFVTEEDTQPSAGGQGGPPDLGVF
2060 GGTDDIPDAIPLTHIFSNLCFGVDDFDYSPPCPDNADIAEDVYNFEDLMSVIIADGWDDEDQQGEDSSPGFLGLGSTPYVDHFGG
2061 GTYADPSAEPSAPIRNNSPGISDANDHDLWEPSHECNPMGLPDQESIIMNCLFDYDQGGEDDIDSLPFEWFPPSVWATTDWWYVE
2062 VHYPAPDEFESAMIDLDPLDAVLTWNPYGCETHSIEEIHPPGDEFDEFMEALGWSNHQWLPYNLLQEPVQPNDLDILGHPQDHVC
2063 GMLPYHDGELWWDLPLPEPQEPTPWDNAEWMEYMDIFPNLTQELEWPSDHFDWCMHGLSALDMFGLQLPESQWFSELCGDGSPNP
2064 AMDAPDMSPVPQEWRPSHSTAASVNIDMIAPVDYNEQTTMSDRPWCDTLEDTFTFGLMDHWFTVGQVEPNLADWTQWTHWPDQLF
2065 RIYEHGHFSADYNTGTGISTDEHAFWCPDDSTFPLPEPTFDLTLLGADPTFSGLLGNSWPIQNASQDEVEWMEFLLNGMFDLDEG
2066 CECPYENHETYLYESLWEASAIEATIPTREPTPDMAPPYDVLWGMMESQISLNLSGMCPIIDDAMFNIPHDEPVQFSLFGPQDPQ
2067 GPSTGLDYLLMITPSEVNPQICPDFFSPEIEGESSSNLPSMHMFNIASSMDESWAEFWINNLGEYPDDPCRQADDAYDENGAANE
2068 WALDEGPLDHLGMYAMDAPHSWDLGTDALDDHVLDASNPPSWDDFVADIIGGSEGKCAPSSTPDSQNFLDWLIDYDSIFPRLGGH
2069 IPADDSTSKESYPDPYGQNNDDGDLLFGLLEPIDTCIFCEDNMLGLPPQMPVDPDLMGLIGASGNVPHDQYCDFGPIGPYFWECW
2070 AGGGGPTLSPASIWELLSFDEPGDFPKDHPMTELPAGALLDTDNESIIAEDLMDEGLLSPDMPVEFDMNFFPTEWLTCLQALDFP
2071 TITDTVDDTPPTDNCWSEFMPCNYPCWDATRQNNSTNAVFWHEPLASDLASEDPFYLGPNDHPGMAQLQNEHEWEMGLWDYGNIY
2072 EPDWNVCFVNDEPASESDLFRELMQAPWALSDLWHGMYTIQPSDDCVLFTTTEQPAGNPMEGVPSLYTTDPMQSFTDCQPVENPM
2073 KDNGLSAMGPLQCAPDIPPTLPNLELDIWEHWGEGVTGGLTLPDMGFPDPAIGPTGLEGTLPDFDDCIQMWSYLYECAEWMETMY
2074 DGLDNHHWWNCLMNNPCTEHPTEFGDAHRPQYPYEPNIMVGSLDPVLDWDLEYQPGGQDLDNWMEEMYNDNSIARTEEGCMDATM
2075 AQVPYDFAWMADMYNTHPPGQLLQISFDTQESADHDSQPSTSHTSCEQFANMFFDLDWTAIPLPQCDVFTDTQMHEYTPQHNIDE
2076 GEAMAWLCTILNDLVGPLEDFSEPFMLDDLGSPSANPDTAFSGWAMLNDSVAIAGGETGDVWAEYWSLPEDDGGPSQSESSKGCK
2077 LSPLDGCIPLYNDPRQCGPLFQPWNVWTDTTTVPRPIFEYDEQEEHEVWSLINPLAGDLNTCADNPFLNHDNFMSDFEETMRDLN
2078 QWTDPPAQVEYAMQEIDLEACRFHPADMGHQFPNPYNAQFDSVLFRSEFANLPDWDLMGDNEPQPCNEYQYWVTHSDFHELQEEI
2079 DGEPDTTPTAYPESITCDQDWLQGLNWMEGGVDSGQPPTLSQSPPLDHDDWLFAALDVPLSVYAGFFGHIEPGEGFFYGTYGGED
2080 CPGCYHPMPDWINKAMEVPTTPHTQHRDIQDGFTAVASEWPEVSNDYMSQFLEPLMCQDDYSQEDEFMTEFLHFNEWHQEATDST
2081 YNDDEGVLPPFTESVIDDIMGGQMGCLYMLTTPTPLYSPSPAPNDAPAEAGRCADTSLFLTQLQGDYWQEWMYPDEGPSGYALQP
2082 AMCGTSMGIIPLMPTELAFMSEIEPWMQPVGLPGWEESPSPACSLPESPLVQPSPLLEFITQPIPGWESRQNPWDEFMDETGVDQ
2083 HFLNMSITGVPLPDPMSEYLPAFEPSRDHVWDLEGCTLDFLSDVTQDLKDLPDIGELMCGWFQDMPPMYFFGESTPEAWSPPHSG
2084 NPAVRQRTHVIDWGANDFMHSAPSPCYDSDYFDCDDLSSPSLCDGHLPGDFPGLDDILENVLCQPIGEAPFEWDFMGVERPPFGW
2085 IDASPADFPIFVNDASPEPVDNCDPSQMEEWLWPIRQPCPERGEYMCFDVEDILHLPSDHDRMDIENLYYQFQDEHFDIMTYENL
2086 KAQEHPPFDIPLDEFFTLYESWDPQSYDEAQICLAIHNPPWHEPTLSDSPQLDNDILSWSDDRGGVLDFGMPLDELGCGKPGDFM
2087 ASPTLTMDCDFWDIPTHPTISGFQNFPYPFLGDEEFEIPPIPLDSDESLQVSQLYLCDDGGTIQPPAQHLEDFLSTLEEHQDMFG
2088 LILELYEDAMATQDQFIPGEASIGSPQEVADIAETDTVGIFDSDWLQPCGPMNGLNYFLGVLSSFETQYQNSNVPEENWWWSPMQ
2089 SIMSELSSFIWEDNWQGLPDLEGVFDCEANAFEPLVWGTLDMPEDVGMPLDDDFLTMDRMALLPDTRPGHGPGESGGGKDGGKNT
2090 FVPTIMEMPSLGPSPSDQEWTHFEYQNMECIDTEPASWPGAGPSMSDEVPEFLLPPYHCDFFEWLEPAMRNWGFPVQMPSEHETE
2091 AMSTPDPTETPHEMPFDWIEETVVCLGKSSTDSESLDDFDLTPLGDMDHAGQLDDFACSLCSMDIAGWGGGSPVSFPTFNSLDSK
2092 PDTECYTNSLDDLVPQPSTLEEYESLTPYIFIPENDEKNTDASGWEEPWQTPPGVELHIQGQGLEPTDDAIFGILDEPQFWDGPG
2093 LEDTPLPAYDLEQCMSIMDLWANEGGESCWTEPWNTGPLLSGNLLNEEPPPTLLVPYNTGHFHEEIMSRSCVEYVGPMEEGPSWM
2094 DSPMPEMDVPTGMGNMPAPLLEEGPTHPMDTQLPGLFPHNPQGWSMTEDWFQGFIDKIGWESSCDSHADFWDEAHSLEMWMELFE
2095 HDDLPPYNHFEPALLGNPLFQGPSLADERTDWNNTGIPFWDCLFEQLETLGLLPYGPQPYSPELPYLSVQPEGHDQGGSLDAQLN
2096 EVEGNTMPETPDQSPSGHITAGNVEDIPWPQEASDAVWDTPSDQVDDLGENMSTPEWAVDLIFPDLEVCPGEWWPASYTHQLWFG
2097 ALDFENALLMAAWAPGPNELGLDCSELPEPGCLPCDFIWGTEGDYQAMCSTRDAAYMGEEFFPWINQQLSMEDTMLPEDGGQGGA
2098 GMEPLPLPMSMVDLSDAFHLPPLPMTDMPDQWDDLPPALHTDHQLSNVDATMPPTLDHILQNMPGEYYWEDIAYAAGENEFDLMN
2099 PEFLADFHIDSWTNEESHNWIEDWISDPFDRDWPIDPFVTSHLPTGGNSVQAANEANDNMAMLPLFLQDEDITTFEPELLMIYQI
2100 PEQTQDTMSGCVSDHDIDEAIEGAGGAAPELSFLMGHHWGVPAEEWHPFSTNDPAPQHWAFNIMEAPMDMNYLDLYNLLDEWWFG
2101 EGMEPTPKRTIAEHCNSMLHCEQTTYQEADPTNETFHTDVMTSWDIPNNGFNPVESWISWDTDATWALPEAFLREEWELCLFGGS
2102 VEDLLDDMWSSYLNMNPDILMGLEPPCVPPDPSADMESMTCALNADGGSLNDFMEILDTFCMDDTWVEAFTPAWGNCEFMAALGS
2103 DNNNDMPDPYAGHPRCPSPILGDLAPCNDDGCTHTQYPGPAGFNPLAPPPEDVLSSDMLNLFDDIPWCDYHFNMMIDHMLFFPPN
2104 DDFFMIDDSSLGNVLNLFADFECDAFYDDILPPPSMYHPGTDTNLFDKLFNLCEDPNWEAVFEVFEEGGGNAGEWTGDNIGAPVQ
2105 TPDDDGDDPLFSNPFRMQDWPPDISVSTSIDQDLVDSLMCFQDMALPDLGDLESHELWTPESQEPPAPNEDFLRLGGELVLPALG
2106 FDGLPFANEHTWPPAQQWEDLQSDDWNVPNEFYVSPIFPDMLGPSVQHCHDPSDPPGYPGMAEFPEESFAHFLLNDLDTVLEVLT
2107 PNYIELEEFFERMIDPMLPEWIYDGQIPQAMDIPDPMMGAQPTYDGHGFMDLDPCMVFQPWDGFQDVLDKLFCDNPEGETGVQSG
2108 AEYEMPPHGVVEMLTDDIFEELSKYDSFDKEPLPEFMLPSMLSLKDLMDFDLDKDLPDSNNDEIWTAANDAILLPEVQPSGQQAT
2109 PCEGVWQSDGDTKDLSNSDVCWLTPEAWDVTFTMQESLPWHFLQEIGMDNPWVDIPMDDWAAIFPPMDQEFPDLSAPWPDQSPSD
2110 QEITTDWINQWPQLNQIPFDDEMESHEFPPNDDMLCHLHGPAIPVGWPLQDSVFGSPNAQSNTPTMEEWWECWLNMLIDPEVHCN
2111 SEAMDVPDSVAHFDAAFPEVLQPLEDEEPDPFSTALDMFGCFSTLYDDQPKSLDDILAYQLLDMMDTEFYGLNGLGGAEIGAGGR
2112 PEGGSSDVDYDGEAQGSHGSNVSPQGYPWSWEFAPETPNTLLPEAIFDDTPPATPMIHTSQEFSSFISWLFDNLQDGSGQAYMDP
2113 ASMEADPEQPLGGYEEDNAEFAPEPTHDTLSPAQYDGSKPCIMTADQPGSTSPLPFADDELEAILWYQPVEDELAVLLNWSDNGF
2114 RQADMLDSFTASLAPDAMYVEPIDEEDASELLDFFSDDMYMLPMFEVELLDPGSEDLGQLFPQEWPGGYWGGGGGETDGRAPQFL
2115 QDLPDPTQLPMDDMNPVIPLLHGLDLGTQDEFLDHGDLDDLWKTLDCSLPVLDRVDHFFELEGQGQGGDMGGWGPQGGGREGGTG
2116 AGGSEGADALASYGTLLSQLDPGDPPKLALWLDGLEIHDFDFELLSVDEPAQDDGCATTPMPQLWDSLFEPSNFVPCGTMEMDPS
2117 HGFTTQQMGDGSTTYTHNFHEDIAGFWDDFLPSEPFEIPPLLGVLSPEEQEPPWLGVDLFNIWSDECFLTPDVCADGSWGGFPGY
2118 EPSHPQYLDEEAIDPLQDLGQDYPGPSCPLLYPGACPPMPPQPCINIEFNLPDINHEFSSAICEYFLDWMDNQYGWPGPDEHTPQ
2119 AEYEVDDMGVQLVPTGDNLLQSILQYWDQDDLTEQCADPDLGPTWDLNWLEFEWLIPDQILDPVNDVMAPSEPKQWPLSMPPDGI
2120 GPWWHFYPNNDEEPWWPCCQECFLGQQNARADDPPPESVEHLIGHPFPLVDQLPPEDMPDMPIPEMGNIAYEDDIFADCAWYDPD
2121 TDDPGTDLNLGNLGDAQPGVGHELNEEHFVTYFECWTPVFSMGDQLEESLKADSAFESLWHPEEDHCYPPMMPDIEEIISPEAYL
2122 DPAMTDSMDAMMAGFPDDRPLIDFDSIMDEAKFDSLFSFAALETIFQSEVQLGDWLMLGGGQESMAHPPSFNPFDWDWGSDTDPV
2123 SDAVDLDTQEAEIQYLLSPFLPPRVPEASWTAPDFPRINTSEDFIDTSELDPGSASTTMNDLYMLWEQPMPLYWEYLFNELEDPM
2124 DVLHSMIELPPTLNWTPFPLAGVPVVEPMALDQLYSALDNLDGLGMELPDAMIPELESNSEAQEGYDPVDTGFEEGWEGFMYQWP
2125 PCWVCYQDPVWCPNAYESPSNHEDDGAELGIHEFMEPFTLMHSEPTEMEHNFYVLPEFDEPWEIASPEFNWMCEDIPGIADDVEH
2126 PSTQPQLSVMTASHGSLADLYDLMVDYPVDSPTCKASMDHRKDAKPNTTEEATLDAAAYASLDMFDFELDGMFEQGSMWIMEDYA
2127 PVRDEDEPPGNCFAESMYVPQGFEFVSEEMWGSLISGLCREWDAPVAEHSPETTDNAFPGDAEFDLNNPNSLMYSILYDCDTNDP
2128 IAGLMEHAEAPSQHRAVEQYDQSQDSLVGFDVDHQSAATCNMYLDYWMATLRGSPMQLDDIYPLFPGWEDWNPESFCIDFATLDP
2129 GWSNDESGFTGMDLIGKLLEESLTCDVQVESLIFPELAAMALGSLDENELAMEPLGPEGLNPLLATYTMEASPAGSESFYMETMP
2130 PTSTTLDSTNIGGIITLPEAPGHYNPEPPMVDIYSTALSDLWNCLWSDLYSDMDTQPDLNLSYTPADTWNPPLDGGNQGGAGDDD
2131 KAAYQDNGPSPCTDFVNDWCISPELLADIHDFGWQWDHLPKFDPMTTDDLLNFNDPISTEHFKHPHMPSLEDPLPEETQAAPGSP
2132 ICEETPSMISPLSEAWDDFLGGGSMSLPGGGFGGGHNEWCLLDHLSDQGILLYDLGPCGSGDNTPLDNDGTDIIQLLSSIMGQEP
2133 PGENNTLDACDNDINLAPTGACEFSPPLPCWGLPYTLSNEQHEDIICWEEGLGNYGNELEEELDLLGNAWQGMLMEQPAEPGISG
2134 DACCDSWLMFDCGDMSPGCPPEHAAIMADTPLQMQSATDLELGLGLSTEDWMGSLNLDLLNMLLDSDGGEFTYFPDFLGGLGWAG
2135 ALTGEPMIDDMCGMITADELYYTPNPAPPVNLLDTMLSGGLEDDVESLILNLFFDSAMHDAWTDSMLGPLESGLAGAPSQSHEEQ
2136 HHMGSSTETLVRPIASSPAGTEYVHDEPEPVYMTPYGPPDSWTPMDSQNFFDHGMQELLYEDLQAMVESWWKPPIMDIFGDEGGW
2137 SDTDWTADNLECFPSLDLDSDQMAELLSLGHAGNNQQLCWFELDMLVSDSPDPGGAFPGDSDGSNDSPLDMLGYAGPLGNTLHLE
2138 EPDHENGNCFKAWEAAHKWMYPFTGAWDGNYFWCQALPAQMEDLIMPNDPNVEDHGEVASEESYPSFDITQDCWDLFDNPLMTDD
2139 PGLSESSPSAWKGVPGYFTDGIGDPFRGPSMEQHANPHTQPGPTEGGDTPYDPPAGDHDFWMLLSGIDGPEIWPNVLSQIIEHDQ
2140 EFAWQLLQPVGVEPSWCTLSQLDLPDVDAHLEQATSNLFSQPYMHDFDPTQDDFSTINELPADAASQSDDPEPAWMMFDLDTTWP
2141 LCGSPALQINDSFSTSGMFQEATWLDNDLAMCPYTPGPHLQDLLGESSDLPDCEVDLLISNELHEECLWEGLMPDNSMIMPGIGL
2142 TAANVAQPDHTPHDTAECSPRFALTPAFDANFDMFDWINMNDSQFDSAIPEAFDYLTTLLDYIPMINMEVAPPDMEHIPFTWDAM
2143 CPDEMVPMAEDFFQPPDWLHELDYDWVCEPMEEFNEHLWMDWHIIPDDLLAPFSALPDSFMQGASGCSDDIQDESTQEDPSDEDP
2144 WETMPGPGAAYEEMIVPKWCVTPWDDDDIQELLCDDDPFGMPSLDDLYDLEVTFPEDIPHGKYVTDVFSDDEFHQHLAFPDPMVL
2145 GSSGGHLDICPELWEWHNTLGEAAPCFYPDEDDPVNGIPPQLDLDYDLWLKFEHLFEGMAPQDSVSHPDISFSWDSCPEGLPPIF
2146 DPGGQAVQTADTFDPYLIDYMHGCDTPWQADEFTGREWDPMDMLIFLDDLQLDLGLLYDLDDTAMAAFMNQEAGEWLGTGGAGDP
2147 SSDTCLQFWEGLLSMNDFTQPLPFPSYDMLLPPAEEMSLPMTDTCLTPEVVPGVPEMDHKDALAMLHCVNLDDFLFGLAPPPGSK
2148 IVHGVGEDLMDFLEDPFVDELCDNDWPPQPWSWAISDAHYMTMPRDVACPWMEEENLDMCADYIYMMDSPHNYDLEFPLPDENQQ
2149 EDLGVQGTDMILPNIWTVAPNHPQPARDSDLPPTTLESYAGDSHSLSQGLQGSNYILDDLLKELAQQFDEYMGFDLLPCNAWDLP
2150 TGYPPVAGPNFSAYLGFDLGCCPHHPVNFEDVHTADAESSEIILGDSLLDHPMFEPVDWNSACYEDFFDICLQNEDLDTMIMTEQ
2151 DDGNSLFWCNVRTDLGPCVANDEIIPCEAATCVYNPEFSFDLDQAAEQIWRSEQLDLSPPTQTEPNDMNWEDMGMDFEHVEKCFC
2152 SASGSMLEGLGGTCQPYDYTVVPEYTLDADDAFFPPGPALESSLMEMNWPPDTWLWDMHGDFFDIGLNDDQLQGVLQDWIAFFHE
2153 TNPMYQELWDFEWFHVHNSDNITPEPHGVNDQPTISTGIFTPIEELAFEWLDHEPMAQDPCACPCDFTQPDHDQSLVPMWTALND
2154 PGGFRQILFENDHDECDPIVHEVSPVSHWEVYNPYPWPPLESYSDWAMQQYDVGGQPSNNPESNFDQIWAHLEEEYFSGMFTMQM
2155 LDSTEPKDTGAPHPGTAGCEFHPLELLNENNPEEDQPLIETWLAGSPLRLEPPQWDAGAWEFKEWVPADADMDTMGMDQIYTMNC
2156 AVAPLSDHTNLCAAMSPINIPFGYALGTLDEFVSPPTSVSHMGAIDDVFQSLFELLDGAGTESWDVFSPISPTDDGLWMFSTVGE
2157 NDPTITGPEPPNQEHGHGDMTNGGCEDADTCYKDPLQEDSMVPPIYTWSAEQAELHWEDFDGPFGIWGMNWDTILVDEIWGDDFR
2158 GFPAVLAPPTDLGNLDELTLLNTLASIGYSGELDDPPLDALGDGDSPDVDSLQASIQELFHPQEQPVETQEMSWFDDQYAPTTLK
2159 LSDFDFALSMDPWNPDTTSLEPEEGTQDTAEGISDAGSPPVEDILELISRDTSPHCTGSNDAIDMFFSTLTEWEMWDGILPVIPG
2160 APPFTRNMSDTVHDYSGGAPIYEWDHTQEQEAQEPIDLGEPHEWHDYEPSMLWYQYWPGMDFHDIVGPQEDDEWLGLDIFESTLL
2161 TGGLATTTPDFTINLDGDWFPVEAFVSADRLDSPSLFSGALSPEAMPICPDLLDELMLEGADVWMQDCGQIDSFTLFDDKVSCFM
2162 YGEMVNGFGDFADACPCDDLMGPTYVTDHPLPPGWEQRVDLLPEDWGTDTPPATPSSEMPMPFSDNDPDMTLEQLWGIFSDIIMG
2163 VTPLGSSPWYGDGNHLEEDFLKTNVTQPVEDEWYVVDCDVGPDLLGLETQALEPAMDLWDMGTLLDNFMLGGPEPTADICIIDQV
2164 EMSMEPMDASDRPLFSGLEITNMCTMSNEHLYWSDALESAHDANLINICDDLTITPNAAQDGEWDPWYDQVWQDGNIRMFDWMNV
2165 DSSHSGHGGWGGVDALGDFDEPPITSVDSILWWVNDTMQTIFPEAAICDTDIPDPGLLWPMPMPSPPPPDWEFEMDHLESMNISQ
2166 NLGYGGPPEDCMYEEIAGLEDDLWTNFTNWEPVTPVDLSDLNNLLWAPPYTEDALLMDFMSLTPGGMDGLANWLLGLDPPNSAVP
2167 DTETRDTSREWPSGIITQEFTAAQPEVPSVHSNCDNSPYEDMGMGDSEHTPFEFDNEWALDHHDWGDLDVNPYNFCDFFAPCTDP
2168 SEMVSDDSSATEEHGQEIPLEIDDLYEALGQPVIAPLDPWTMCGEELLWDLGDLAELLDPGECYQHKGGDGGGQDMGPVPWFMIA
2169 VTVTNPPGAEMMDSNPDIWDLEVPQIDDLSEILDEMLTDDLWDTIWSDIDTEDVCNGGWTLCQPLEYPNYAGYGLSQDTGDGWPG
2170 PQDILSVLPGIDDLFQLLLETEYVFDSTHFEDANNEMDLAAASAPEGQAVMPSPEQEPVLSPSDLFAEHELPTYFQGTINWYGKG
2171 APPKGVIDGEQSAEDAAWIDWFPLPFSLHPVTDHCFGEWDIDDWEEPLPMVQPEQDQLWKHAAAESWWENTNVDWALFQPGYTGN
2172 GEDGEAGITMEGCDLLSGPTLGDDNPYSERDFDLFWMTADWDMQDDSMLTPEIISMCLPIGPHLEMGQLEGTSDEPHIFDVPTGL
2173 VHFMEELVPDWPTWQDNGPFFLIPHVLDTNLFPPSHNEPSAEDELLDLDNPAKLFPDVDFPQFVPQPFTYTWPQELEDLLTYDHI
2174 DSPCNLVGSPNMDDNPLIDLLQGYDLMQDDEFLDHGALDDLWATILSDVYPWDEPDRGLPGENMGGTGGLQTFATSLPTGASGAF
2175 CESPQIQPGDAQMGSPSVWHDCTVTEEWSTIPYVDAMIMQDITQSHMDVIAPPPPLAAHPSIWGEIADGAPDWMVAAWDEFTNYL
2176 MPEAKPHTSAILPSWPMCPCEVHSLVEDQRQEGPPMTPGVLQQIENLDSAPGLFNLDNWFTEFDLFRLDNEVYEFLGILPQEEMY
2177 PMPLEPGMPGDFEGPPHPVDFEPSPQRCSPNPPQYLYLDLPDDDTYLLPDLGNLFETDDIQLMDMTPTPMWLDYLDIDGMSSVWR
2178 PGYEHNDNQGGGPDPCASAYGEGEHVECQVVREALKPCAWSMAETTNIYQPPSNFDDLPMNWLAPLDYGTLHMWDDDQMDHWWDF
2179 PATVDTSYPFGPTEDMAPALYEAITLSDEPGDSPTPTNDGSECAPPAWDQDNLPWTLDFFLNHKMYWDHMIFNQDDQEVWDCDLQ
2180 DDCGANSDIDSQYTAEEFFNYSEWEALNMLLERCNFDAFWDLGTENLSDMFSLLGTEALGPHGGLMDGGGEGMGLGGGGSGGGAL
2181 GCETDVAELNANDGPKPYPPEDEYNCDTPIMYGLTTSFHDHTTLTQAQPPAQHDDPAAMEFMFDNFNDFMNSYIDTWTSSDIAIC
2182 GGPVQSQSADMWWDTLFPDVPMEDFEPEATEYTHGQVATMMHEPCTLDPYKDIMSPDPDAAFPIMLASFDETVGLNLQCMFPELP
2183 GGGGMPAMGDHLSDVGMDFDLDMLLSNGAELPRTSDCEFGLHDLLLPLLRESSSVYPLDLGTVQDLMPTRGSVDGGGFDYGGSDA
2184 RAALDIEVPLGTWSGTTQLDMEECWDMIPDVDEFYTFEDFFQDFLDSLSPSPDANLCDSLAMFAGSENDEAMMDGESMQNEDRGL
2185 PPGEDCEVPASWTREQPVRASRGVGEHSSDVPSEVTNSISWDDIDELLNSWVSPWLSLSPTPYDFTNFNLEPEELDIDHFDWVYA
2186 AMAFTEELMDNTLDDITNHVFPYPLTEDWLDMYVLGDMLNMIEMPDILAGDLQFIPHEFANPCGPQHDFQAPPMTHESARWQWDD
2187 DESDLSLAAPIPNILDPPSDVHPYDEAVMEGDVDAGDLSLHNPPDLDDLYSDEIQEAMWSNETWDTGSPLYPFDAALLDAIFQGS
2188 CAAPEGVEFDIDYPWDNVLPNVHAMEVHQSFPLPPMDPFDLQDPNEEQVPPSMSWEPYELIEFWDSIANWQHESPGIMLTFENSG
2189 DGVQAQAFPDAKWLDCLWPLLDDWDWCHDPTISYGSPFAPSDMTELDLCQDDWTLMEFLAGVPDMSPPEASNIDSSGDKENMDWC
2190 ADPDYSVDLLADQELFNEFNTMWGDFDDFWGWPCTPPGALAIPNSLPSALPDNDTHTMADFEADPYAISGLLSWCMGPGLQQAEY
2191 PPTQQNLEADEDIHNNFILDPEPPSPGVSDNWPYMPLPHSVLPPQDSPLTPYLNYITDGQLNMEMYAEFGPFSWTDLDECCGGQC
2192 PMPLSGAADRDNDDAEEYFPISEEEALTQLLELNNWDAFLDLMTEMLSDMPMGGEEATTVQLHGGPMGPVPLLGADLFDHFGDDA
2193 LPGEQSAEDSHIWPRYMGPSPSLDSQPHPFDEDKSYTALFSHEPRIFDLPSLDTEWLADLGEMDCAAFMDELEDMGFTLGSGTLP
2194 ETPDLFDVDMPLTPTDDVLVRWILDNDGEHDAPMPLWTPSSFSELDTSDWGIPSPFVGEYDEWQNYMASMTNNITGGGGDEKGEY
2195 MFDFEDWTPFDLDPGLPWSGATPTEVLPMVLHSEGQTPDLLSDWEQTLPDLGDPLALCNNTFDDFSDSTWLSIATGGWHMDPVLF
2196 QYPLEPHSDDPTVDMSSGDTVPLDVEPLYEGTFWTPGPVLQDFLSWFETPNPAPNNAYQEQWGNWLEGFCENPPYSESFNHPQDG
2197 EPDQGNAPDIFMPSDHEENPEVHEYDIMALMQDFPTGVAAGPTISLPCGLHGACPCYLGWDDSAFNEEPLCMDLNLDDVMREAQY
2198 TNHALYCDCVKEGTPYVQPQEHPRPPEAEDSWTDVFLLDDKGYADIYDLLDPGSEQFNSMDLPPSFELGEGTWDDEGGIGTTSHV
2199 ADDHFDHIIAEMAHTFVCNCNPNCPPENPNIFLPPTQEHHDLLDMWHNDSFECLSGLEMNPFVFGPLETPDYEWASFMPMDDNPD
In some embodiments, the polypeptide is conjugated to a heterologous endonuclease. In some embodiments, the heterologous endonuclease is or comprises a Cas protein. Any suitable Cas protein may be coupled to the polypeptide as described herein. In some embodiments, the Cas protein is a catalytically inactive Cas protein (e.g., lacks endonuclease activity). In some embodiments, the heterologous endonuclease comprises any of SEQ ID NOS 2222-2422, or a sequence at least 85% identical thereto. In some embodiments, the heterologous endonuclease comprises an amino acid sequence that has, or has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or about 100% sequence identity to any of SEQ ID NOS: 2222-2422, or, optionally, has a percent sequence identity to any of SEQ ID NOS: 2222-2422 that is within a range defined by any two of the foregoing values (e.g., 80% -100%, 85% -95%, 90% -98%, etc.).
In some embodiments, the polypeptide is fused to a heterologous endonuclease (e.g., covalently linked directly or indirectly through a linker). The engineered gene effector polypeptide may be fused to the heterologous endonuclease in any suitable manner. In some embodiments, the polypeptide is fused to the C-terminus of the heterologous endonuclease. In some embodiments, the polypeptide is fused to the N-terminus of the heterologous endonuclease. In some embodiments, the polypeptide is fused to a position within the heterologous endonuclease. Accordingly, provided herein are fusion proteins comprising any one or more of the engineered gene effectors of the present disclosure (e.g., engineered gene effectors produced by computer-implemented methods of the present disclosure), and a heterologous endonuclease coupled to a polypeptide.
Also provided are polynucleotides comprising nucleotide sequences encoding any of the engineered gene effectors of the present disclosure or any fusion proteins (e.g., engineered gene effectors produced by computer-implemented methods of the present disclosure, or fusion proteins comprising the gene effectors). Vectors comprising the polynucleotides are also provided. Also provided are cells comprising the polynucleotides or vectors.
Provided herein are systems comprising any engineered gene effector of the present disclosure (e.g., an engineered gene effector produced by a computer-implemented method of the present disclosure), a heterologous endonuclease coupled to the engineered gene effector, and a guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell. In some embodiments, the heterologous endonuclease is or comprises a Cas protein, as described herein. Also provided are combinations of polynucleotides encoding the system, wherein the combinations of polynucleotides are configured to express in a cell a heterologous endonuclease coupled to an engineered gene effector, and a guide nucleic acid.
Also provided are methods of controlling a target gene in a cell, the methods comprising contacting the cell with a system of the present disclosure or a combination of the present disclosure. As disclosed herein, the cells may be contacted with the systems of the present disclosure or the combinations of the present disclosure using any suitable option. In some embodiments, the target gene is endogenous to the cell. In some embodiments, the contacting is performed in vitro or ex vivo.
Computer system
The present disclosure provides computer systems programmed to implement one or more methods of the present disclosure (e.g., computer-implemented methods). Fig. 20 illustrates a computer system 2001 that is programmed or otherwise configured.
Computer system 2001 may regulate various aspects of analysis, computation, and generation, and may be a user's electronic device or a computer system remotely located relative to the electronic device. The electronic device may be a mobile electronic device.
Computer system 2001 includes a central processing unit (CPU, also referred to herein as a "processor" and a "computer processor") 2005, which may be a single-core or multi-core processor, or multiple processors for parallel processing. Computer system 2001 also includes memory or storage unit 2010 (e.g., random access memory, read only memory, flash memory), electronic storage unit 2015 (e.g., hard disk), communication interface 2020 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 2025 (e.g., cache, other memory, data storage, and/or electronic display adapter). The memory 2010, the storage unit 2015, the interface 2020, and the peripheral device 2025 communicate with the CPU 2005 through a communication bus (solid line) (e.g., motherboard). The storage unit 2015 may be a data storage unit (or data repository) for storing data. Computer system 2001 may be operatively coupled to a computer network ("network") 2030 by way of communication interface 2020. The network 2030 may be the internet, the internet and/or an extranet, or an intranet and/or an extranet in communication with the internet.
In some embodiments, network 2030 is a telecommunications network and/or a data network. Network 2030 may contain one or more computer servers that may implement distributed computing, such as cloud computing. For example, one or more computer servers may implement cloud computing over a network 2030 ("cloud") to perform various aspects of the analysis, computation, and generation of the present disclosure, such as evaluating sequences of fixed length based on fitness functions, replacing random residues to generate mutant sequences, and collecting functional sequences accepted by fitness functions. Such cloud computing may be provided by cloud computing platforms, such as Amazon Web Services (AWS), microsoft Azure, google cloud platform, and IBM cloud. In some embodiments, network 2030 may implement a peer-to-peer network with computer system 2001, which may enable devices coupled to computer system 2001 to act as clients or servers.
The CPU 2005 may include one or more computer processors and/or one or more Graphics Processing Units (GPUs). The CPU 2005 may execute a sequence of machine-readable instructions, which may be embodied in a program or software. The instructions may be stored in a storage unit, such as memory 2010. Instructions may be sent to the CPU 2005, which may then program or otherwise configure the CPU 2005 to implement the methods of the present disclosure. Examples of operations performed by the CPU 2005 may include fetch, decode, execute, and write back.
The CPU 2005 may be part of a circuit, such as an integrated circuit. One or more other components of system 2001 may be included in the circuit. In some embodiments, the circuit is an Application Specific Integrated Circuit (ASIC).
The storage unit 2015 may store files such as drivers, libraries, and saved programs. The storage unit 2015 may store user data, such as user preferences and user programs. In some embodiments, computer system 2001 may include one or more additional data storage units external to computer system 2001, such as on a remote server in communication with computer system 2001 via an intranet or the internet.
Computer system 2001 can communicate with one or more remote computer systems over network 2030. For example, computer system 2001 may communicate with a user's remote computer system. Examples of remote computer systems include personal computers (e.g., portable PCs), tablet or tablet computers (e.g., apple iPad, samsung, or Galaxy Tab), telephones, smartphones (e.g., apple iPhone, android-enabled devices, or Blackberry), or personal digital assistants. A user may access computer system 2001 via network 2030.
The methods described herein may be implemented by machine (e.g., a computer processor) executable code stored on an electronic storage location (e.g., memory 2010 or electronic storage 2015) of computer system 2001. The machine executable code or machine readable code may be provided in software. During use, code may be executed by processor 2005. In some embodiments, code may be retrieved from storage unit 2015 and stored in memory 2010 for ready access by processor 2005. In some cases, the electronic storage unit 2015 may be eliminated and the machine-executable instructions stored in memory 2010.
The code may be pre-compiled and configured for use with a machine having a processor adapted to execute the code, or may be compiled at runtime. The code may be provided in a programming language that is selectable to enable execution of the code in a pre-compiled or compiled manner.
Embodiments of the systems and methods provided herein (e.g., computer system 2001) may be implemented in programming. Aspects of the technology may be considered "articles of manufacture" or "articles of manufacture," typically in the form of machine (or processor) executable code and/or associated data, which are carried or embodied on a type of machine readable medium. The machine executable code may be stored on an electronic storage unit such as a memory (e.g., read only memory, random access memory, flash memory) or hard disk. A "storage" medium may include any or all of the tangible memory of a computer, processor, etc., or their associated modules (e.g., various semiconductor memory, tape drive, or disk drive), which may provide non-transitory storage for software programming at any time. All or part of the software may sometimes communicate over the internet or various other telecommunications networks. For example, such communication may enable loading of software from one computer or processor to another computer or processor, such as from a management server or host computer to a computer platform of an application server. Thus, another type of medium that can carry software includes optical, electrical, and electromagnetic waves, such as those used across physical interfaces between local devices through wired and fiber-optic landline networks, and various air links. Physical elements carrying such waves (e.g., wired or wireless links, optical links, etc.) may also be considered to be media carrying software. As used herein, unless limited to a non-transitory, tangible "storage" medium, terms such as computer or machine "readable medium" refer to any medium that participates in providing instructions to a processor for execution.
Accordingly, a machine-readable medium (e.g., computer-executable code) may take many forms, including tangible storage media, carrier wave media, or physical transmission media. Non-volatile storage media includes, for example, optical or magnetic disks (e.g., any storage device in any computer, etc.), such as may be used to implement the databases shown in the figures. Volatile storage media include dynamic memory, such as the main memory of such a computer platform. Tangible transmission media include coaxial cables, copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier wave transmission media can take the form of electrical or electromagnetic signals, or acoustic or light waves (such as those generated during Radio Frequency (RF) and Infrared (IR) data communications). Thus, common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD, or DVD-ROM, any other optical medium, punch cards and paper tape, any other physical storage medium with patterns of holes, RAM, ROM, PROM and EPROMs, FLASH-EPROMs, any other memory chip or cartridge, a carrier wave transporting data or instructions, a cable or link transporting such a carrier wave, or any other medium from which a computer can read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
The computer system 2001 may include an electronic display 2035 or be in communication with the electronic display 2035, the electronic display 2035 including a User Interface (UI) 2040. Examples of UIs include, but are not limited to, graphical User Interfaces (GUIs) and web-based user interfaces.
The methods and systems of the present disclosure may be implemented by one or more algorithms. The algorithm may be implemented by software when executed by the central processing unit 2005. For example, an algorithm may evaluate sequences with a fixed length based on fitness functions, replace random residues to generate mutated sequences, and collect functional sequences accepted by fitness functions.
Non-transitory computer readable storage medium
In some embodiments, the platforms, systems, media, and methods disclosed herein include one or more non-transitory computer-readable storage media encoded with a program comprising instructions executable by an operating system of an optionally networked computing device. In a further embodiment, the computer-readable storage medium is a tangible component of a computing device. In still further embodiments, the computer-readable storage medium is optionally removable from the computing device. In some embodiments, computer readable storage media include, by way of non-limiting example, CD-ROMs, DVDs, flash memory devices, solid state memory, magnetic disk drives, magnetic tape drives, optical disk drives, a distributed computing system comprising a cloud computing system and a server, and the like. In some cases, the programs and instructions are encoded on the medium permanently, substantially permanently, semi-permanently, or non-temporarily.
Computer program
In some embodiments, the platforms, systems, media, and methods disclosed herein include at least one computer program or use thereof. The computer program contains sequences of instructions that are executable by one or more processors of the CPU of the computing device, written to perform specific tasks. Computer readable instructions may be implemented as program modules, such as functions, objects, application Programming Interfaces (APIs), computing data structures, etc., that perform particular tasks or implement particular abstract data types. Based on the disclosure provided herein, one of ordinary skill in the art will recognize that computer programs may be written in various versions of various languages.
The functionality of the computer readable instructions may be combined or distributed as desired in various environments. In some implementations, the computer program includes a set of sequences of instructions. In some implementations, the computer program includes multiple sets of instruction sequences. In some embodiments, the computer program is provided from one location. In other embodiments, the computer program is provided from a plurality of locations. In various embodiments, the computer program includes one or more software modules. In various embodiments, the computer program comprises, in part or in whole, one or more web applications, one or more mobile applications, one or more stand-alone applications, one or more web browser plug-ins, extensions, add-in programs, or add-ons, or a combination thereof.
Web application program
In some implementations, the computer program includes a web application. Based on the disclosure provided herein, one of ordinary skill in the art will recognize that in various embodiments, web applications utilize one or more software frameworks and one or more database systems. In some embodiments, the web application is created on a software framework, such as Microsoft. In some embodiments, the web application utilizes one or more database systems including relational, non-relational, object-oriented, associative, XML, and document-oriented database systems, as non-limiting examples. In further embodiments, suitable relational database systems include Microsoft SQL SERVER, mySQL TM, and/or Oracle as non-limiting examples. Those skilled in the art will also recognize that in various embodiments, web applications are written in one or more versions of one or more languages. The web application may be written in one or more markup languages, presentation definition languages, client-side scripting languages, server-side coding languages, database query languages, or a combination thereof. In some embodiments, web applications are written in a markup language such as hypertext markup language (HTML), extensible hypertext markup language (XHTML), or extensible markup language (XML) to some extent. In some embodiments, the web application is written in a representation definition language (e.g., cascading Style Sheets (CSS)) to some extent. In some embodiments, the web application is written in a client-side scripting language, such as asynchronous JavaScript and XML (AJAX), flash ActionScript, javaScript, or Silverlight. In some embodiments, the web application is written in a server-side coded language such as ACTIVE SERVER PAGES (ASP), cold fusion, perl, java, javaServer Pages (JSP), hypertext preprocessor (PHP), pythonTM, ruby, tcl, smalltalk, webDNA, or Groovy to some extent. In some embodiments, web applications are written in a database query language (e.g., structured Query Language (SQL)) to some extent. In some embodiments, the web application integrates an enterprise server product, such as an IBM body Lotus Domino. In some implementations, the web application includes a media player element. In various further embodiments, the media player element utilizes one or more of a variety of suitable multimedia technologies, including, as non-limiting examples, adobe cube, HTML 5, apple cube, quickTime cube, microsoft, silverlight, java, and Unity cube.
Standalone applications
In some embodiments, the computer program comprises a stand-alone application that is a program that runs as a stand-alone computer process, rather than an add-on (e.g., not a plug-in) to an existing process. Those skilled in the art will recognize that stand-alone applications are typically compiled. A compiler is a computer program that converts source code written in a programming language into binary object code, such as assembly language or machine code. Suitable compiled programming languages include C, C ++, objective-C, COBOL, delphi, eiffel, javaTM, lisp, pythonTM, visual Basic, and VB. NET, or combinations thereof, as non-limiting examples. The execution of compilation is typically at least partially to create an executable program. In some implementations, the computer program includes one or more executable compiled applications.
Software module
In some embodiments, the platforms, systems, media, and methods disclosed herein include software modules, server modules, and/or database modules, or uses thereof. Based on the disclosure provided herein, software modules are created by techniques known to those of skill in the art using machines, software, and languages known in the art. The software modules disclosed herein are implemented in a variety of ways. In various embodiments, the software modules include files, code segments, programming objects, programming structures, distributed computing resources, cloud computing resources, or a combination thereof. In further various embodiments, the software module comprises a plurality of files, a plurality of code segments, a plurality of programming objects, a plurality of programming structures, a plurality of distributed computing resources, a plurality of cloud computing resources, or a combination thereof. In various embodiments, one or more software modules include, as non-limiting examples, a web application, a mobile application, a standalone application, and a distributed or cloud computing application. In some embodiments, the software module is in a computer program or application. In other embodiments, the software modules are in more than one computer program or application. In some embodiments, the software modules are hosted on one machine. In other embodiments, the software modules are hosted on more than one machine. In further embodiments, the software modules are hosted on a distributed computing platform, such as a cloud computing platform. In some embodiments, the software modules are hosted on one or more machines in one location. In other embodiments, the software modules are hosted on one or more machines in more than one location.
Database for storing data
In some embodiments, the systems, media, and methods disclosed herein include one or more databases or their uses. Based on the disclosure provided herein, one of ordinary skill in the art will recognize that many databases are suitable for storing and retrieving user information, genomic information, expression information, transcriptome information, and proteome information. In various embodiments, suitable databases include, as non-limiting examples, relational databases, non-relational databases, object-oriented databases, object databases, entity-relational model databases, associative databases, XML databases, document-oriented databases, and picture databases. Further non-limiting examples include SQL, postgreSQL, mySQL, oracle, DB, sybase, and/or MongoDB. In some embodiments, the database is internet-based. In a further embodiment, the database is web-based. In a further embodiment, the database is cloud computing based. In particular embodiments, the database is a distributed database. In other embodiments, the database is based on one or more local computer storage devices.
Additional embodiments
Systems and methods for gene regulation
Transcriptional and epigenetic regulatory factors can play a key role in modeling the chromatin microenvironment and corresponding gene expression during cell differentiation and homeostasis. Thus, natural or engineered transcriptional regulators constitute a powerful tool kit for controlling gene doses in future therapeutic applications. Robust and durable gene silencing by epigenetic memory, and robust but transient gene activation, respectively, can be achieved using polyprotein domain fusion, but long coding lengths can prevent their packaging into AAV and other vectors, and limit clinical applications. High throughput dCas modulator recruitment screening was performed to systematically query the relative transcriptional regulatory efficacy of tens of thousands of human, viral and archaeal peptide fragments in the opposite promoter environment, finding known and novel domains that have a strong propensity for activation of viral domains, repression of human domains, and background dependent dual activity features of archaeal domains. Sequence-based biochemical component analysis, combined with extensive validated testing of multiple target promoters in different chromatin contexts, reveals predictive biochemical scoring criteria for engineering functional improvement into minimal core activation domains. This procedure for modulator discovery and engineering resulted in ultra-compact transcriptional activators of human gene expression whose targeted upregulation levels exceeded the previous baseline activators in terms of magnitude, background-independent robustness and duration of action, albeit accounting for about 12% -20% of their protein-encoding payload size. When fused to the RNA-guided mini-dCas system for programmable recruitment, these ultra-compact transcriptional regulators are capable of achieving efficient and mitotically durable gene activation in different target environments with reduced payload sizes suitable for delivery and other therapeutic modalities by a single AAV.
Systems and methods for epigenomic editing
Epigenomic editing is an emerging therapeutic application new strategy by activating or silencing target gene expression. To increase the efficiency of epigenomic editing and achieve reduced payload sizes suitable for delivery by a single AAV, the CRISPR-Cas system is engineered to reduce its overall size while enhancing the function of the Ribonucleoprotein (RNP) complex. By semi-rational guided gRNA scaffold and Cas effector modification, a more compact RNP complex is obtained that exhibits improved epigenetic editing activity. The improved accuracy and efficiency of the epigenomic editing system was demonstrated when fused with an internally developed ultra-compact transcriptional regulator. To comprehensively characterize the PAM requirement of dCas protein, a novel mixed cell assay using synthetic target activation was developed to sensitively query the relaxed PAM spectrum of these effectors. In summary, a more compact RNP construct (with a Cas effect factor of 500 amino acids size complexed with a gRNA scaffold of about 100 nucleotides in length), and enhanced CRISPRa/CRISPRi efficiency is achieved when compared to the parental RNP.
Non-limiting embodiments of the present disclosure are further provided in the following numbered embodiments.
1. An engineered gene effector, the effector comprising a polypeptide comprising:
a first peptide of 75-110 (or 75-95) amino acids in length, wherein the first peptide comprises any one of SEQ ID NOS 3-100, or a sequence at least 85% identical thereto, and
A second peptide of 75-110 (or 75-95) amino acids in length and heterologous to the first peptide, wherein the second peptide comprises any one of SEQ ID NOs 3-100, or a sequence at least 85% identical thereto, optionally wherein the first peptide is different from the second peptide.
2. The engineered gene effector of embodiment 1, wherein the first peptide and/or the second peptide is 85 or 108 amino acids in length, optionally wherein the first peptide and the second peptide are each 85 or 108 amino acids in length.
3. The engineered gene effector of embodiments 1 or 2, wherein the first peptide comprises any one of SEQ ID NOs 3-100 with a mutation of 0-3 amino acid residues and/or the second peptide comprises any one of SEQ ID NOs 3-100 with a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
4. The engineered gene effector of embodiments 1 or 2, wherein the first peptide comprises any one of SEQ ID NOs 3-100 and/or the second peptide comprises any one of SEQ ID NOs 3-100.
5. The engineered gene effector of any one of the preceding embodiments, wherein the first peptide is located N-terminal to the second peptide, wherein any one of SEQ ID NOs 3-100 of the first peptide comprises SEQ ID NOs 3-100, and wherein any one of SEQ ID NOs 3-100 of the second peptide comprises SEQ ID NOs 3-100.
6. The engineered gene effector of embodiment 5, wherein the first peptide and the second peptide are arranged in pairs according to any one of the SEQ ID NOs of the first peptide and the second peptide listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
7. The engineered gene effector of any one of the preceding embodiments, wherein the first peptide and the second peptide are linked by a linker, optionally wherein the linker comprises any one or more of SEQ ID NOs 2211-2221, optionally wherein the linker comprises SEQ ID NO 2211.
8. The engineered gene effector of any one of the preceding embodiments, wherein the polypeptide comprises any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence at least 85% identical thereto.
9. The engineered gene effector of embodiment 7, wherein the polypeptide comprises any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451 having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
10. The engineered gene effector of any one of the preceding embodiments, wherein the polypeptide comprises any one of SEQ ID NOs 1085, 122, 1084, 653, 1099 and 1107, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
11. The engineered gene effector of any one of the preceding embodiments, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, optionally wherein the target gene is endogenous to the cell.
12. The engineered gene effector of embodiment 10, wherein the target gene is a silenced gene, optionally wherein the silenced gene is a methylated gene.
13. The engineered gene effector of embodiments 10 or 11, wherein the engineered gene effector is capable of increasing, or increasing by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 250%, 300%, 400%, 500% the expression level of the target gene, optionally wherein the engineered gene effector is capable of increasing the expression level of the target gene by a percentage within a range defined by any two of the foregoing values, e.g., 10% -100%, 100% -200%, 200% -400%, 250% -500%, 10% -50%, 50% -100%, etc.
14. The engineered gene effector of any one of the preceding embodiments, wherein the polypeptide is coupled to a heterologous endonuclease, optionally wherein the heterologous endonuclease is a Cas protein.
15. The engineered gene effector of embodiment 13, wherein the heterologous endonuclease has a length of, at most, or about 450, 460, 470, 480, 490, 500, 520, 540, 560, 580, 600, 620, 640, 660, 680, 700 amino acids, optionally wherein the length of the heterologous endonuclease is within a range defined by any two of the foregoing values, e.g., 450-700 amino acids, 480-600 amino acids, 500-530 amino acids, 500-600 amino acids, etc.
16. The engineered gene effector of embodiments 13 or 14, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
17. The engineered gene effector of any one of embodiments 13-15, wherein the polypeptide is fused to the heterologous endonuclease.
18. The engineered gene effector of embodiment 16, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
19. An engineered gene effector, the effector comprising a polypeptide comprising:
A first peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator, and
A second peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator, wherein the second peptide is heterologous to the first peptide, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene.
20. The engineered gene effector of any one of the preceding embodiments, wherein the first peptide and/or the second peptide has a beta factor of about 30 to about 65.
21. The engineered gene effector of any one of the preceding embodiments, wherein the first peptide and/or the second peptide is enriched in a negative electrostatic potential.
22. The engineered gene effector of any one of the preceding embodiments, wherein the first peptide and/or the second peptide has a negative net charge.
23. The engineered gene effector of any one of the preceding embodiments, wherein the engineered gene effector is capable of activating a target gene in a cell, wherein the level of target gene expression activated by the engineered gene effector lasts for a period of time that is, about or at least 9 days, 10 days, 11 days, 12 days, 13 days, 14 days, 15 days, 16 days, 17 days, 18 days, or longer, or optionally, the level of expression lasts for a period of time that is within a range defined by any two of the preceding values, e.g., 9-18 days, 9-14 days, 12-18 days, 14-16 days, etc.
24. The engineered gene effector of any one of the preceding embodiments, wherein the engineered gene effector is capable of activating a target gene in a cell, wherein the expression level of the target gene is increased by at least 0.1-fold, 0.2-fold, 0.3-fold, 0.4-fold, 0.5-fold, 0.6-fold, 0.7-fold, 0.8-fold, 0.9-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 5-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold or more, or optionally the expression level of the target gene is increased by a factor within a range defined by any two of the foregoing values, e.g., 0.1-50-fold, 0.5-10-fold, 1-40-fold, 2-30-fold, etc., as compared to a control.
25. A fusion protein, the protein comprising:
An engineered gene effector according to any one of the preceding embodiments, and
A heterologous endonuclease coupled to a polypeptide, optionally, wherein the heterologous endonuclease is a Cas protein.
26. The fusion protein of embodiment 24, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOS: 2222-2422, or a sequence at least 85% identical thereto.
27. The fusion protein of embodiment 24 or 25, wherein the polypeptide is fused to the heterologous endonuclease.
28. The fusion protein of embodiment 24, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
29. A polynucleotide comprising a nucleotide sequence encoding the engineered gene effector or fusion protein of any of the preceding embodiments.
30. A vector comprising the polynucleotide of embodiment 28.
31. A cell comprising the polynucleotide of embodiment 28 or the vector of embodiment 29.
32. A system, the system comprising:
the engineered gene effector of any one of embodiments 1-23;
Optionally, a heterologous endonuclease coupled to the polypeptide of the engineered gene effector, wherein the heterologous endonuclease is a Cas protein, and
A guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell.
33. The system of embodiment 31, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOS: 2222-2422, or a sequence at least 85% identical thereto.
34. The system of embodiment 31 or 32, wherein the polypeptide is fused to the heterologous endonuclease.
35. The system of embodiment 33, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
36. A combination of polynucleotides encoding the system of any one of embodiments 29-34, wherein the combination of polynucleotides is configured to express a heterologous endonuclease coupled to an engineered gene effector and a guide nucleic acid in a cell.
37. A kit comprising an engineered gene effector, fusion protein, combination, system, polynucleotide, vector, and/or cell of any of the preceding embodiments.
38. A method of controlling a target gene in a cell, the method comprising contacting the cell with the engineered gene effector of any one of embodiments 1-23, the fusion protein of any one of embodiments 24-27, the polynucleotide of embodiment 28, the vector of embodiment 29, the system of any one of embodiments 29-34, or the combination of polynucleotides of embodiment 33.
39. The method of embodiment 37, wherein the target gene is endogenous to the cell.
40. The method of embodiment 37 or 38, wherein the contacting is performed in vitro or ex vivo.
41. A computer-implemented method of generating a functional biological sequence, the method comprising:
(a) Providing an fitness function trained on a biological data set, the biological data set comprising functionally determined biological sequences having a fixed length;
(b) Providing in a computer a plurality of different sequences comprising a fixed length, each sequence being associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of the temperature gradient;
(c) By means of the computer, parallel in a plurality of different sequences:
(1) Selecting one or more random positions for introducing substitutions in one or more of said plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and
For each of the one or more sequences, evaluating a first fitness change resulting from introducing a replacement at one or more randomly selected locations, and accepting or rejecting the replacement based on the evaluated first fitness change, and optionally further based on a temperature associated with the sequence, and/or
(2) Selecting one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a successive temperature of the temperature gradient, optionally selecting up to 3 pairs of the plurality of different sequences, optionally selecting 1 pair of the plurality of different sequences, and
For each selected pair:
Selecting one or more domains for exchange between sequences of selected pairs, and
Evaluating the difference in fitness of the sequences of the selected pair due to the exchange of one or more domains, and
Accepting or rejecting one or more domain exchanges between the selected pairs based on the fitness differences and the temperatures associated with each sequence of the selected pairs, and
(D) Iteratively performing (c), wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domains of a previous iteration are swapped into the plurality of different sequences,
Thereby producing one or more functional sequences having fitness at or above a desired fitness threshold.
42. The method of embodiment 40, comprising accepting, at (c) (1), a substitution at one or more randomly selected positions when the fitness of the sequence after the substitution is introduced is greater than the fitness of the sequence before the substitution is introduced.
43. The method of embodiment 40 or 41, comprising accepting or rejecting, at (c) (1), a substitution at one or more randomly selected locations based on a probability weighted by a ratio of fitness of the sequence after introducing the substitution to fitness of the sequence before introducing the substitution.
44. The method of any one of embodiments 40-42, comprising, at (c) (1), accepting or rejecting a replacement at one or more randomly selected locations based on Boltzmann Metropolis-hastins acceptance criterion r mh.
45. The method of any of embodiments 40-43, wherein the one or more randomly selected locations are selected uniformly across the fixed length.
46. The method of any one of embodiments 40-44, comprising at (c) (2), accepting an exchange of the selected domain between the selected pair when the fitness of the sequence associated with a lower of the consecutive temperatures after the exchange is greater than the fitness of the sequence associated with a higher of the consecutive temperatures after the exchange, optionally wherein the selected domain comprises the complete sequence of the selected pair.
47. The method of any one of embodiments 40-45, comprising, at (c) (2), accepting an exchange of the selected domain between the selected pairs based on a probability inversely proportional to a difference between temperatures associated with each sequence of the pairs, and a ratio of fitness of exchanged sequence pairs, optionally wherein the selected domain comprises the complete sequence of the selected pair.
48. The method of any one of embodiments 40-46, comprising at (c) (2), accepting or rejecting an exchange of selected domains between the pairs based on parallel tempering criteria r re, optionally wherein the selected domains comprise the complete sequence of the selected pair.
49. The method of any one of embodiments 40-47, wherein (c) comprises:
(3) Selecting a crossover site between one or more pairs of said plurality of different sequences, and
For each of the one or more pairs in which a crossover site is selected, a second fitness change for each sequence of the selected pair resulting from crossover at the crossover site is evaluated, and crossover at the selected crossover site is accepted or rejected based on the second fitness change and a temperature associated with each sequence of the selected pair.
50. The method of embodiment 48 comprising accepting or rejecting crossover at the selected crossover site based on a probability weighted by a ratio of second fitness changes for each sequence of the selected pair.
51. The method of any of embodiments 40-49, wherein at least one of the one or more functional sequences produced has a fitness based on a fitness function that is greater than a fitness of each of the plurality of different sequences prior to any iteration of (c).
52. The method of any of embodiments 40-50, wherein the desired fitness threshold is based on fitness associated with a corresponding sequence of the plurality of different sequences in (b), optionally wherein the desired fitness threshold is based on a maximum fitness among the plurality of different sequences.
53. The method of any one of embodiments 40-51, wherein the plurality of different sequences in (b) comprises a plurality of different, naturally occurring sequences.
54. A computer-implemented method of generating a functional biological sequence, the method comprising:
(a) Evaluating, by a computer, sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functionally determined biological sequences having the fixed length;
(b) Replacing, by the computer, one or more random residues in the sequence to generate a mutated sequence;
(c) Evaluating, by the computer, the mutant sequence based on the fitness function, and
(D) The functional sequences accepted by the fitness function are collected by the computer.
55. The computer-implemented method of embodiment 53, further comprising randomly exchanging, by a computer, one or more subsequences from the mutated sequence with different sequences of the plurality of sequences.
56. The computer-implemented method of embodiment 53 or 54, wherein the fitness function comprises a threshold selected from the group consisting of a binary threshold, a numerical threshold, a multi-class threshold, a confidence threshold, a decision threshold, and any combination thereof.
57. The computer-implemented method of embodiment 55, wherein a functional sequence is accepted by the fitness function when a fitness score assigned to the functional sequence by the fitness function exceeds the threshold.
58. The computer-implemented method of any of embodiments 53-56, wherein the plurality of different sequences in (a) comprises a plurality of different naturally occurring sequences or different random sequences.
59. The computer-implemented method of any one of embodiments 40-57, wherein the functionally defined biological sequence comprises an amino acid sequence or nucleotide sequence of a protein or peptide, or an amino acid sequence or nucleotide sequence encoding a protein or peptide, optionally wherein the functionally defined biological sequence comprises a transcriptional activator, further optionally wherein the functionally defined biological sequence comprises an engineered gene effector.
60. The computer-implemented method of embodiment 58, wherein the protein or the peptide is an epigenetic regulator, a transcription factor, an enzyme, a nuclease, an agonist, an antagonist, a regulatory factor, or an inhibitor.
61. The computer-implemented method of any one of embodiments 40-59, wherein the functionally defined biological sequence comprises an amino acid sequence or a nucleotide sequence.
62. The computer-implemented method of any one of embodiments 40-60, wherein the functionally defined biological sequence comprises an amino acid sequence, and further wherein the fixed length is at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 105, at least 110, at least 115, at least 120, at least 125, at least 130, at least 140, at least 150, or at least 200 amino acids, or at most 500, at most 300, at most 200, at most 150, at most 100, at most 95, at most 90, at most 85, at most 75, or at most 70 amino acids, or optionally wherein the fixed length is within a range defined by any two of the foregoing values, e.g., 30-500 amino acids, 50-300 amino acids, at least 150-200 amino acids, 75-150 amino acids, or the like.
63. The computer-implemented method of any of embodiments 40-61, wherein the fitness function is based on one or more machine learning models, wherein the machine learning model is selected from the group consisting of a supervised machine learning model, an unsupervised machine learning model, a reinforcement learning model, a deep learning model, a transfer learning model, and any combination thereof.
64. The computer-implemented method of embodiment 62, wherein the one or more machine learning models are selected from the group consisting of classification models, regression models, decision tree models, convolutional Neural Networks (CNNs), recurrent Neural Networks (RNNs), limit gradient boosting (XGBoost), long and short term memory networks, generation of countermeasure networks (GAN), auto encoders, transformer networks, evolved monte carlo, and any combination thereof.
65. The computer-implemented method of embodiment 63, wherein the fitness function is based on an integrated model comprising a decision tree model and a convolutional neural network, optionally wherein the integrated model comprises CNNs and XGBoost.
66. A computer-implemented method as in any of the preceding embodiments, the method comprising evaluating, by a computer, a biological data set to generate an fitness function, the evaluating comprising:
Generating sequence embeddings from Large Protein Language Models (LPLM) based on biological datasets, and
Training a machine learning model with the generated sequence embedding as input, optionally wherein LPLM comprises an Evolutionary Scale Modeling (ESM) language model, optionally wherein LPLM comprises ESM-2.
67. The computer-implemented method of embodiment 65, wherein the machine learning model comprises an integrated model of two or more different models.
68. The computer-implemented method of embodiment 66, wherein the integrated model comprises CNNs and XGBoost.
69. The computer-implemented method of any of the preceding embodiments, wherein the plurality of different sequences comprises a plurality of different random sequences.
70. The computer-implemented method of any of the preceding embodiments, wherein the biological dataset comprises up to about 10 5 biological sequences or biological sequences on the order of about 10 5 biological sequences.
71. The computer-implemented method of any of the preceding embodiments, wherein up to 5% of the functionally defined biological sequences comprise functional sequences.
72. A computer-implemented system comprising a computing device including at least one processor and instructions executed by the at least one processor to provide an application, the application comprising:
(a) A software module configured to evaluate, by a computer, a sequence of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological data set, the biological data set comprising a functionally determined biological sequence having the fixed length;
(b) A software module configured to replace, by the computer, one or more random residues in the sequence to generate a mutated sequence;
(c) A software module configured to evaluate, by the computer, the mutant sequence based on the fitness function, and
(D) A software module configured to collect, by the computer, a sequence of functionalities accepted by the fitness function.
73. A non-transitory computer readable medium having stored thereon computer readable instructions that, when executed by a processor, cause the processor to perform a method comprising:
(a) Evaluating sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functionally determined biological sequences having the fixed length;
(b) Replacing one or more random residues in the sequence to generate a mutated sequence;
(c) Evaluating the mutant sequence based on the fitness function, and
(D) Functional sequences accepted by the fitness function are collected.
74. A computer-implemented system comprising a computing device comprising at least one processor and instructions executed by the at least one processor to provide an application comprising one or more software modules for performing the method of any of embodiments 40-67.
75. A non-transitory computer readable medium having stored thereon computer readable instructions which, when executed by a processor, cause the processor to perform the method of any of embodiments 37-67.
76. An engineered gene effector comprising one or more polypeptides produced by the method of any one of embodiments 37-67, or a sequence at least 85% identical thereto.
77. An engineered gene effector comprising a polypeptide of 85 amino acids in length comprising any one of SEQ ID NOs 1495, 1592, 1595, 1634, 1654, 1665, 1677, 1686, 1689, 1716, or a sequence at least 85% identical thereto.
78. The engineered gene effector of embodiments 72 or 73, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, optionally wherein the target gene is endogenous to the cell.
79. The engineered gene effector of any one of embodiments 72-74, wherein the polypeptide is coupled to a heterologous endonuclease, optionally wherein the heterologous endonuclease is a Cas protein.
80. The engineered gene effector of embodiment 75, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
81. The engineered gene effector of embodiments 75 or 76, wherein the polypeptide is fused to the heterologous endonuclease, optionally wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
82. A fusion protein, the fusion protein comprising:
the engineered gene effector of any one of embodiments 72-77, and
A heterologous endonuclease coupled to a polypeptide, optionally, wherein the heterologous endonuclease is a Cas protein.
83. A polynucleotide comprising a nucleotide sequence encoding the engineered gene effector or fusion protein of any one of embodiments 72-77.
84. A vector comprising the polynucleotide of embodiment 79.
85. A cell comprising the polynucleotide of embodiment 79 or the vector of embodiment 80.
86. A system, the system comprising:
The engineered gene effector of any one of embodiments 72-77;
Optionally, a heterologous endonuclease coupled to the engineered gene effector, wherein the heterologous endonuclease is a Cas protein, and
A guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell.
87. A combination of polynucleotides encoding the system of embodiment 82, wherein the combination of polynucleotides is configured to express in a cell a heterologous endonuclease coupled to an engineered gene effector and a guide nucleic acid.
88. A method of controlling a target gene in a cell, the method comprising contacting the cell with a system of embodiment 82 or a combination of polynucleotides of embodiment 83.
89. The method of embodiment 84, wherein the target gene is endogenous to the cell.
90. The method of embodiment 84 or 85, wherein the contacting is performed in vitro or ex vivo.
91. A computer-implemented system comprising a computing device comprising at least one processor and instructions executed by the at least one processor to provide an application, the application comprising:
(a) A software module configured to provide, by a computer, an fitness function trained on a biological data set, the biological data set comprising a functionally determined biological sequence having a fixed length;
(b) A software module configured to provide, by the computer, a plurality of different sequences comprising a fixed length, each sequence associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of a temperature gradient;
(c) Parallel in a plurality of different sequences:
(1) A software module configured to select, by the computer, one or more random positions for introducing substitutions in one or more of a plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and
For each of the one or more sequences, evaluating a first fitness change resulting from introducing a substitution at one or more randomly selected locations, and accepting or rejecting the substitution based on the evaluated first fitness change, and optionally further based on a temperature associated with the sequence, and/or
(2) A software module configured to select, by the computer, one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a successive temperature of a temperature gradient, optionally, up to 3 pairs of the plurality of different sequences, optionally, 1 pair of the plurality of different sequences, and
For each selected pair:
Selecting one or more domains for exchange between sequences of selected pairs, and
Evaluating the difference in fitness of the sequences of the selected pair due to the exchange of one or more domains, and
Accepting or rejecting one or more domain exchanges between the selected pairs based on the fitness differences and the temperatures associated with each sequence of the selected pairs, and
(D) A software module configured to be executed iteratively by the computer (c), wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domains of a previous iteration are swapped into a plurality of different sequences, resulting in one or more functional sequences having fitness that reaches or is above a desired fitness threshold.
92. An engineered gene effector comprising a polypeptide comprising any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
93. An engineered gene effector comprising a polypeptide comprising any one of SEQ ID NOs 1085, 122, 1084, 653, 1099 and 1107, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally, wherein any mutation thereof is a conservative substitution.
TABLE 6 list of examples of target genes (e.g., encoding proteins of interest)
List of proteins/target genes of interest
A1BG、CCND3、FAM163A、KCNK10、NRG4、REM1、TECTB、A1CF、CCNDBP1、FAM163B、KCNK12、NRGN、REM2、TEDDM1、A2M、CCNE1、FAM166A、KCNK13、NRIP1、REN、TEF、A2ML1、CCNE2、FAM166B、KCNK15、NRIP2、RENBP、TEFM、A3GALT2、CCNF、FAM167A、KCNK16、NRIP3、REP15、TEK、A4GALT、CCNG1、FAM167B、KCNK17、NRK、REPIN1、TEKT1、A4GNT、CCNG2、FAM168A、KCNK18、NRL、REPS1、TEKT2、AAAS、CCNH、FAM168B、KCNK2、NRM、REPS2、TEKT3、AACS、CCNI、FAM169A、KCNK3、NRN1、RER1、TEKT4、AADAC、CCNI2、FAM169B、KCNK4、NRN1L、RERE、TEKT5、AADACL2、CCNJ、FAM170A、KCNK5、NRP1、RERG、TELO2、AADACL3、CCNJL、FAM170B、KCNK6、NRP2、RERGL、TEN1、AADACL4、CCNK、FAM171A1、KCNK7、NRROS、RESP18、TENC1、AADAT、CCNL1、FAM171A2、KCNK9、NRSN1、REST、TENM1、AAED1、CCNL2、FAM171B、KCNMA1、NRSN2、RET、TENM2、AAGAB、CCNO、FAM172A、KCNMB1、NRTN、RETN、TENM3、AAK1、CCNT1、FAM173A、KCNMB2、NRXN1、RETNLB、TENM4、AAMDC、CCNT2、FAM173B、KCNMB3、NRXN2、RETSAT、TEP1、AAMP、CCNY、FAM174A、KCNMB4、NRXN3、REV1、TEPP、AANAT、CCNYL1、FAM174B、KCNN1、NSA2、REV3L、TERF1、AAR2、CCP110、FAM175A、KCNN2、NSD1、REXO1、TERF2、AARD、CCPG1、FAM175B、KCNN3、NSDHL、REXO2、TERF2IP、AARS、CCR1、FAM177A1、KCNN4、NSF、REXO4、TERT、AARS2、CCR10、FAM177B、KCNQ1、NSFL1C、RFC1、TES、AARSD1、CCR2、FAM178A、KCNQ2、NSG1、RFC2、TESC、AASDH、CCR3、FAM178B、KCNQ3、NSL1、RFC3、TESK1、AASDHPPT、CCR4、FAM179A、KCNQ4、NSMAF、RFC4、TESK2、AASS、CCR5、FAM179B、KCNQ5、NSMCE1、RFC5、TESPA1、AATF、CCR6、FAM180A、KCNRG、NSMCE2、RFESD、TET1、AATK、CCR7、FAM180B、KCNS1、NSMCE4A、RFFL、TET2、ABAT、CCR8、FAM181A、KCNS2、NSMF、RFK、TET3、ABCA1、CCR9、FAM181B、KCNS3、NSRP1、RFNG、TEX10、ABCA10、CCRL2、FAM183A、KCNT1、NSUN2、RFPL1、TEX101、ABCA12、CCRN4L、FAM184A、KCNT2、NSUN3、RFPL2、TEX11、ABCA13、CCS、FAM184B、KCNU1、NSUN4、RFPL3、TEX12、ABCA2、CCSAP、FAM185A、KCNV1、NSUN5、RFPL4A、TEX13A、ABCA3、CCSER1、FAM186A、KCNV2、NSUN6、RFPL4AL1、TEX13B、ABCA4、CCSER2、FAM186B、KCP、NSUN7、RFPL4B、TEX14、ABCA5、CCT2、FAM187B、KCTD1、NT5C、RFT1、TEX15、ABCA6、CCT3、FAM188A、KCTD10、NT5C1A、RFTN1、TEX19、ABCA7、CCT4、FAM188B、KCTD11、NT5C1B、RFTN2、TEX2、ABCA8、CCT5、FAM189A1、KCTD12、NT5C1B-RDH14、RFWD2、TEX22、ABCA9、CCT6A、FAM189A2、KCTD13、NT5C2、RFWD3、TEX26、ABCB1、CCT6B、FAM189B、KCTD14、NT5C3A、RFX1、TEX261、ABCB10、CCT7、FAM192A、KCTD15、NT5C3B、RFX2、TEX264、ABCB11、CCT8、FAM193A、KCTD16、NT5DC1、RFX3、TEX28、ABCB4、CCT8L2、FAM193B、KCTD17、NT5DC2、RFX4、TEX29、ABCB5、CCZ1、FAM194A、KCTD18、NT5DC3、RFX5、TEX30、ABCB6、CCZ1B、FAM194B、KCTD19、NT5E、RFX6、TEX33、ABCB7、CD101、FAM195A、KCTD2、NT5M、RFX7、TEX35、ABCB8、CD109、FAM195B、KCTD20、NTAN1、RFX8、TEX36、ABCB9、CD14、FAM196A、KCTD21、NTF3、RFXANK、TEX37、ABCC1、CD151、FAM196B、KCTD3、NTF4、RFXAP、TEX38、ABCC10、CD160、FAM198A、KCTD4、NTHL1、RGAG1、TEX40、ABCC11、CD163、FAM198B、KCTD5、NTM、RGAG4、TEX9、ABCC12、CD163L1、FAM199X、KCTD6、NTMT1、RGCC、TF、ABCC2、CD164、FAM19A1、KCTD7、NTN1、RGL1、TFAM、ABCC3、CD164L2、FAM19A2、KCTD8、NTN3、RGL2、TFAP2A、ABCC4、CD177、
FAM19A3、KCTD9、NTN4、RGL3、TFAP2B、ABCC5、CD180、FAM19A4、KDELC1、NTN5、RGL4、TFAP2C、ABCC6、CD19、FAM19A5、KDELC2、NTNG1、RGMA、TFAP2D、ABCC8、CD1A、FAM200A、KDELR1、NTNG2、RGMB、TFAP2E、ABCC9、CD1B、FAM203A、KDELR2、NTPCR、RGN、TFAP4、ABCD1、CD1C、FAM203B、KDELR3、NTRK1、RGP1、TFB1M、ABCD2、CD1D、FAM204A、KDM1A、NTRK2、RGPD1、TFB2M、ABCD3、CD1E、FAM205A、KDM1B、NTRK3、RGPD2、TFCP2、ABCD4、CD2、FAM206A、KDM2A、NTS、RGPD3、TFCP2L1、ABCE1、CD200、FAM207A、KDM2B、NTSR1、RGPD4、TFDP1、ABCF1、CD200R1、FAM208A、KDM3A、NTSR2、RGPD5、TFDP2、ABCF2、CD200R1L、FAM208B、KDM3B、NUAK1、RGPD6、TFDP3、ABCF3、CD207、FAM209A、KDM4A、NUAK2、RGPD8、TFE3、ABCG1、CD209、FAM209B、KDM4B、NUB1、RGR、TFEB、ABCG2、CD22、FAM20A、KDM4C、NUBP1、RGS1、TFEC、ABCG4、CD226、FAM20B、KDM4D、NUBP2、RGS10、TFF1、ABCG5、CD24、FAM20C、KDM4E、NUBPL、RGS11、TFF2、ABCG8、CD244、FAM210A、KDM5A、NUCB1、RGS12、TFF3、ABHD1、CD247、FAM210B、KDM5B、NUCB2、RGS13、TFG、ABHD10、CD248、FAM211A、KDM5C、NUCKS1、RGS14、TFIP11、ABHD11、CD27、FAM211B、KDM5D、NUDC、RGS16、TFPI、ABHD12、CD274、FAM212A、KDM6A、NUDCD1、RGS17、TFPI2、ABHD12B、CD276、FAM212B、KDM6B、NUDCD2、RGS18、TFPT、ABHD13、CD28、FAM213A、KDM8、NUDCD3、RGS19、TFR2、ABHD14A、CD2AP、FAM213B、KDR、NUDT1、RGS2、TFRC、ABHD14B、CD2BP2、FAM214A、KDSR、NUDT10、RGS20、TG、ABHD15、CD300A、FAM214B、KEAP1、NUDT11、RGS21、TGDS、ABHD16A、CD300C、FAM216A、KEL、NUDT12、RGS22、TGFA、ABHD16B、CD300E、FAM216B、KERA、NUDT13、RGS3、TGFB1、ABHD17A、CD300LB、FAM217A、KHDC1、NUDT14、RGS4、TGFB1I1、ABHD17B、CD300LD、FAM217B、KHDC1L、NUDT15、RGS5、TGFB2、ABHD17C、CD300LF、FAM218A、KHDC3L、NUDT16、RGS6、TGFB3、ABHD2、CD300LG、FAM219A、KHDRBS1、NUDT16L1、RGS7、TGFBI、ABHD3、CD302、FAM219B、KHDRBS2、NUDT17、RGS7BP、TGFBR1、ABHD4、CD320、FAM21A、KHDRBS3、NUDT18、RGS8、TGFBR2、ABHD5、CD33、FAM21B、KHK、NUDT19、RGS9、TGFBR3、ABHD6、CD34、FAM21C、KHNYN、NUDT2、RGS9BP、TGFBR3L、ABHD8、CD36、FAM220A、KHSRP、NUDT21、RGSL1、TGFBRAP1、ABI1、CD37、FAM221A、KIAA0020、NUDT22、RHAG、TGIF1、ABI2、CD38、FAM221B、KIAA0040、NUDT3、RHBDD1、TGIF2、ABI3、CD3D、FAM222A、KIAA0100、NUDT4、RHBDD2、TGIF2-C20orf24、ABI3BP、CD3E、FAM222B、KIAA0101、NUDT5、RHBDD3、TGIF2LX、ABL1、CD3EAP、FAM227A、KIAA0141、NUDT6、RHBDF1、TGIF2LY、ABL2、CD3G、FAM227B、KIAA0195、NUDT7、RHBDF2、TGM1、ABLIM1、CD4、FAM228A、KIAA0196、NUDT8、RHBDL1、TGM2、ABLIM2、CD40、FAM228B、KIAA0226、NUDT9、RHBDL2、TGM3、ABLIM3、CD40LG、FAM229A、KIAA0226L、NUF2、RHBDL3、TGM4、ABO、CD44、FAM229B、KIAA0232、NUFIP1、RHBG、TGM5、ABR、CD46、FAM230A、KIAA0247、NUFIP2、RHCE、TGM6、ABRA、CD47、FAM24A、KIAA0319、NUGGC、RHCG、TGM7、ABRACL、CD48、FAM24B、KIAA0319L、NUMA1、RHD、TGOLN2、ABT1、CD5、FAM25A、KIAA0355、NUMB、RHEB、TGS1、ABTB1、CD52、FAM25C、KIAA0368、NUMBL、RHEBL1、TH、ABTB2、CD53、FAM25G、KIAA0391、NUP107、RHNO1、THADA、ACAA1、CD55、FAM26D、KIAA0408、NUP133、RHO、THAP1、ACAA2、CD58、FAM26E、KIAA0430、NUP153、RHOA、THAP10、ACACA、CD59、FAM26F、KIAA0513、
NUP155、RHOB、THAP11、ACACB、CD5L、FAM32A、KIAA0556、NUP160、RHOBTB1、THAP2、ACAD10、CD6、FAM35A、KIAA0586、NUP188、RHOBTB2、THAP3、ACAD11、CD63、FAM3A、KIAA0753、NUP205、RHOBTB3、THAP4、ACAD8、CD68、FAM3B、KIAA0754、NUP210、RHOC、THAP5、ACAD9、CD69、FAM3C、KIAA0825、NUP210L、RHOD、THAP6、ACADL、CD7、FAM3D、KIAA0895、NUP214、RHOF、THAP7、ACADM、CD70、FAM43A、KIAA0895L、NUP35、RHOG、THAP8、ACADS、CD72、FAM43B、KIAA0907、NUP37、RHOH、THAP9、ACADSB、CD74、FAM45A、KIAA0922、NUP43、RHOJ、THBD、ACADVL、CD79A、FAM46A、KIAA0930、NUP50、RHOQ、THBS1、ACAN、CD79B、FAM46B、KIAA0947、NUP54、RHOT1、THBS2、ACAP1、CD80、FAM46C、KIAA1009、NUP62、RHOT2、THBS3、ACAP2、CD81、FAM46D、KIAA1024、NUP62CL、RHOU、THBS4、ACAP3、CD82、FAM47A、KIAA1024L、NUP85、RHOV、THEG、ACAT1、CD83、FAM47B、KIAA1033、NUP88、RHOXF1、THEG5、ACAT2、CD84、FAM47C、KIAA1045、NUP93、RHOXF2、THEGL、ACBD3、CD86、FAM47E、KIAA1107、NUP98、RHOXF2B、THEM4、ACBD4、CD8A、FAM47E-STBD1、KIAA1109、NUPL1、RHPN1、THEM5、ACBD5、CD8B、FAM49A、KIAA1143、NUPL2、RHPN2、THEM6、ACBD6、CD9、FAM49B、KIAA1147、NUPR1、RIBC1、THEMIS、ACBD7、CD93、FAM50A、KIAA1161、NUPR1L、RIBC2、THEMIS2、ACCS、CD96、FAM50B、KIAA1191、NUS1、RIC3、THG1L、ACCSL、CD97、FAM53A、KIAA1199、NUSAP1、RIC8A、THNSL1、ACD、CD99、FAM53B、KIAA1210、NUTF2、RIC8B、THNSL2、ACE、CD99L2、FAM53C、KIAA1211、NUTM1、RICTOR、THOC1、ACE2、CDA、FAM57A、KIAA1211L、NUTM2A、RIF1、THOC2、ACER1、CDADC1、FAM57B、KIAA1217、NUTM2B、RIIAD1、THOC3、ACER2、CDAN1、FAM58A、KIAA1239、NUTM2F、RILP、THOC5、ACER3、CDC123、FAM60A、KIAA1244、NUTM2G、RILPL1、THOC6、ACHE、CDC14A、FAM63A、KIAA1257、NVL、RILPL2、THOC7、ACIN1、CDC14B、FAM63B、KIAA1279、NWD1、RIMBP2、THOP1、ACKR1、CDC16、FAM64A、KIAA1324、NXF1、RIMBP3、THPO、ACKR2、CDC20、FAM65A、KIAA1324L、NXF2、RIMBP3B、THRA、ACKR3、CDC20B、FAM65B、KIAA1328、NXF2B、RIMBP3C、THRAP3、ACKR4、CDC23、FAM65C、KIAA1377、NXF3、RIMKLA、THRB、ACLY、CDC25A、FAM69A、KIAA1407、NXF5、RIMKLB、THRSP、ACMSD、CDC25B、FAM69B、KIAA1429、NXN、RIMS1、THSD1、ACN9、CDC25C、FAM69C、KIAA1430、NXNL1、RIMS2、THSD4、ACO1、CDC26、FAM71A、KIAA1432、NXNL2、RIMS3、THSD7A、ACO2、CDC27、FAM71B、KIAA1456、NXPE1、RIMS4、THSD7B、ACOT1、CDC34、FAM71C、KIAA1462、NXPE2、RIN1、THTPA、ACOT11、CDC37、FAM71D、KIAA1467、NXPE3、RIN2、THUMPD1、ACOT12、CDC37L1、FAM71E1、KIAA1468、NXPE4、RIN3、THUMPD2、ACOT13、CDC40、FAM71E2、KIAA1522、NXPH1、RING1、THUMPD3、ACOT2、CDC42、FAM71F1、KIAA1524、NXPH2、RINL、THY1、ACOT4、CDC42BPA、FAM71F2、KIAA1549、NXPH3、RINT1、THYN1、ACOT6、CDC42BPB、FAM72A、KIAA1549L、NXPH4、RIOK1、TIA1、ACOT7、CDC42BPG、FAM72B、KIAA1551、NXT1、RIOK2、TIAF1、ACOT8、CDC42EP1、FAM72D、KIAA1586、NXT2、RIOK3、TIAL1、ACOT9、CDC42EP2、FAM73A、KIAA1598、NYAP1、RIPK1、TIAM1、ACOX1、CDC42EP3、FAM73B、KIAA1614、NYAP2、RIPK2、TIAM2、ACOX2、CDC42EP4、FAM76A、KIAA1644、NYNRIN、RIPK3、TICAM1、ACOX3、CDC42EP5、FAM76B、KIAA1671、NYX、RIPK4、TICAM2、ACOXL、CDC42SE1、FAM78A、KIAA1683、OAF、RIPPLY1、TICRR、
ACP1、CDC42SE2、FAM78B、KIAA1715、OARD1、RIPPLY2、TIE1、ACP2、CDC45、FAM81A、KIAA1731、OAS1、RIPPLY3、TIFA、ACP5、CDC5L、FAM81B、KIAA1737、OAS2、RIT1、TIFAB、ACP6、CDC6、FAM83A、KIAA1751、OAS3、RIT2、TIGD2、ACPL2、CDC7、FAM83B、KIAA1755、OASL、RLBP1、TIGD3、ACPP、CDC73、FAM83C、KIAA1804、OAT、RLF、TIGD4、ACPT、CDCA2、FAM83D、KIAA1841、OAZ1、RLIM、TIGD5、ACR、CDCA3、FAM83E、KIAA1919、OAZ2、RLN1、TIGD6、ACRBP、CDCA4、FAM83F、KIAA1958、OAZ3、RLN2、TIGD7、ACRC、CDCA5、FAM83G、KIAA1984、OBFC1、RLN3、TIGIT、ACRV1、CDCA7、FAM83H、KIAA2013、OBP2A、RLTPR、TIMD4、ACSBG1、CDCA7L、FAM84A、KIAA2018、OBP2B、RMDN1、TIMELESS、ACSBG2、CDCA8、FAM84B、KIAA2022、OBSCN、RMDN2、TIMM10、ACSF2、CDCP1、FAM86A、KIAA2026、OBSL1、RMDN3、TIMM10B、ACSF3、CDCP2、FAM86B1、KIDINS220、OC90、RMI1、TIMM13、ACSL1、CDH1、FAM86B2、KIF11、OCA2、RMI2、TIMM17A、ACSL3、CDH10、FAM86C1、KIF12、OCEL1、RMND1、TIMM17B、ACSL4、CDH11、FAM86KP、KIF13A、OCIAD1、RMND5A、TIMM21、ACSL5、CDH12、FAM89A、KIF13B、OCIAD2、RMND5B、TIMM22、ACSL6、CDH13、FAM89B、KIF14、OCLM、RNASE1、TIMM23、ACSM1、CDH15、FAM8A1、KIF15、OCLN、RNASE10、TIMM23B、ACSM2A、CDH16、FAM90A1、KIF16B、OCM、RNASE11、TIMM44、ACSM2B、CDH17、FAM91A1、KIF17、OCM2、RNASE12、TIMM50、ACSM3、CDH18、FAM92A1、KIF18A、OCRL、RNASE13、TIMM8A、ACSM4、CDH19、FAM92B、KIF18B、OCSTAMP、RNASE2、TIMM8B、ACSM5、CDH2、FAM96A、KIF19、ODAM、RNASE3、TIMM9、ACSS1、CDH20、FAM96B、KIF1A、ODC1、RNASE4、TIMMDC1、ACSS2、CDH22、FAM98A、KIF1B、ODF1、RNASE6、TIMP1、ACSS3、CDH23、FAM98B、KIF1C、ODF2、RNASE7、TIMP2、ACTA1、CDH24、FAM98C、KIF20A、ODF2L、RNASE8、TIMP3、ACTA2、CDH26、FAM9A、KIF20B、ODF3、RNASE9、TIMP4、ACTB、CDH3、FAM9B、KIF21A、ODF3B、RNASEH1、TINAG、ACTBL2、CDH4、FAM9C、KIF21B、ODF3L1、RNASEH2A、TINAGL1、ACTC1、CDH5、FAN1、KIF22、ODF3L2、RNASEH2B、TINF2、ACTG1、CDH6、FANCA、KIF23、ODF4、RNASEH2C、TIPARP、ACTG2、CDH7、FANCB、KIF24、OFCC1、RNASEK、TIPIN、ACTL10、CDH8、FANCC、KIF25、OFD1、RNASEL、TIPRL、ACTL6A、CDH9、FANCD2、KIF26A、OGDH、RNASET2、TIRAP、ACTL6B、CDHR1、FANCD2OS、KIF26B、OGDHL、RND1、TJAP1、ACTL7A、CDHR2、FANCE、KIF27、OGFOD1、RND2、TJP1、ACTL7B、CDHR3、FANCF、KIF28P、OGFOD2、RND3、TJP2、ACTL8、CDHR4、FANCG、KIF2A、OGFOD3、RNF10、TJP3、ACTL9、CDHR5、FANCI、KIF2B、OGFR、RNF103、TK1、ACTN1、CDIP1、FANCL、KIF2C、OGFRL1、RNF103-CHMP3、TK2、ACTN2、CDIPT、FANCM、KIF3A、OGG1、RNF11、TKT、ACTN3、CDK1、FANK1、KIF3B、OGN、RNF111、TKTL1、ACTN4、CDK10、FAP、KIF3C、OGT、RNF112、TKTL2、ACTR10、CDK11A、FAR1、KIF4A、OIP5、RNF113A、TLCD1、ACTR1A、CDK11B、FAR2、KIF4B、OIT3、RNF113B、TLCD2、ACTR1B、CDK12、FARP1、KIF5A、OLA1、RNF114、TLDC1、ACTR2、CDK13、FARP2、KIF5B、OLAH、RNF115、TLDC2、ACTR3、CDK14、FARS2、KIF5C、OLFM1、RNF121、TLE1、ACTR3B、CDK15、FARSA、KIF6、OLFM2、RNF122、TLE2、ACTR3C、CDK16、FARSB、KIF7、OLFM3、RNF123、TLE3、ACTR5、CDK17、FAS、KIF9、OLFM4、RNF125、TLE4、ACTR6、CDK18、FASLG、KIFAP3、OLFML1、RNF126、TLE6、ACTR8、CDK19、FASN、KIFC1、OLFML2A、RNF128、TLK1、ACTRT1、CDK2、FASTK、KIFC2、OLFML2B、
RNF13、TLK2、ACTRT2、CDK20、FASTKD1、KIFC3、OLFML3、RNF130、TLL1、ACTRT3、CDK2AP1、FASTKD2、KIN、OLIG1、RNF133、TLL2、ACVR1、CDK2AP2、FASTKD3、KIR2DL1、OLIG2、RNF135、TLN1、ACVR1B、CDK3、FASTKD5、KIR2DL3、OLIG3、RNF138、TLN2、ACVR1C、CDK4、FAT1、KIR2DL4、OLR1、RNF139、TLR1、ACVR2A、CDK5、FAT2、KIR2DS4、OMA1、RNF14、TLR10、ACVR2B、CDK5R1、FAT3、KIR3DL1、OMD、RNF141、TLR2、ACVRL1、CDK5R2、FAT4、KIR3DL2、OMG、RNF144A、TLR3、ACY1、CDK5RAP1、FATE1、KIR3DL3、OMP、RNF144B、TLR4、ACY3、CDK5RAP2、FAU、KIRREL、ONECUT1、RNF145、TLR5、ACYP1、CDK5RAP3、FAXC、KIRREL2、ONECUT2、RNF146、TLR6、ACYP2、CDK6、FAXDC2、KIRREL3、ONECUT3、RNF148、TLR7、ADA、CDK7、FBF1、KISS1、OOEP、RNF149、TLR8、ADAD1、CDK8、FBL、KISS1R、OOSP2、RNF150、TLR9、ADAD2、CDK9、FBLIM1、KIT、OPA1、RNF151、TLX1、ADAL、CDKAL1、FBLN1、KITLG、OPA3、RNF152、TLX1NB、ADAM10、CDKL1、FBLN2、KL、OPALIN、RNF157、TLX2、ADAM11、CDKL2、FBLN5、KLB、OPCML、RNF165、TLX3、ADAM12、CDKL3、FBLN7、KLC1、OPHN1、RNF166、TM2D1、ADAM15、CDKL4、FBN1、KLC2、OPLAH、RNF167、TM2D2、ADAM17、CDKL5、FBN2、KLC3、OPN1LW、RNF168、TM2D3、ADAM18、CDKN1A、FBN3、KLC4、OPN1MW、RNF169、TM4SF1、ADAM19、CDKN1B、FBP1、KLF1、OPN1MW2、RNF17、TM4SF18、ADAM2、CDKN1C、FBP2、KLF10、OPN1SW、RNF170、TM4SF19、ADAM20、CDKN2A、FBRS、KLF11、OPN3、RNF175、TM4SF20、ADAM21、CDKN2AIP、FBRSL1、KLF12、OPN4、RNF180、TM4SF4、ADAM22、CDKN2AIPNL、FBXL12、KLF13、OPN5、RNF181、TM4SF5、ADAM23、CDKN2B、FBXL13、KLF14、OPRD1、RNF182、TM6SF1、ADAM28、CDKN2C、FBXL14、KLF15、OPRK1、RNF183、TM6SF2、ADAM29、CDKN2D、FBXL15、KLF16、OPRL1、RNF185、TM7SF2、ADAM30、CDKN3、FBXL16、KLF17、OPRM1、RNF186、TM7SF3、ADAM32、CDNF、FBXL17、KLF2、OPTC、RNF187、TM9SF1、ADAM33、CDO1、FBXL18、KLF3、OPTN、RNF19A、TM9SF2、ADAM7、CDON、FBXL19、KLF4、OR10A2、RNF19B、TM9SF3、ADAM8、CDPF1、FBXL2、KLF5、OR10A3、RNF2、TM9SF4、ADAM9、CDR1、FBXL20、KLF6、OR10A4、RNF20、TMA16、ADAMDEC1、CDR2、FBXL21、KLF7、OR10A5、RNF207、TMA7、ADAMTS1、CDR2L、FBXL22、KLF8、OR10A6、RNF208、TMBIM1、ADAMTS10、CDRT1、FBXL3、KLF9、OR10A7、RNF212、TMBIM4、ADAMTS12、CDRT15、FBXL4、KLHDC1、OR10AG1、RNF213、TMBIM6、ADAMTS13、CDRT15L2、FBXL5、KLHDC10、OR10C1、RNF214、TMC1、ADAMTS14、CDRT4、FBXL6、KLHDC2、OR10G2、RNF215、TMC2、ADAMTS15、CDS1、FBXL7、KLHDC3、OR10G3、RNF216、TMC3、ADAMTS16、CDS2、FBXL8、KLHDC4、OR10G4、RNF217、TMC4、ADAMTS17、CDSN、FBXO10、KLHDC7A、OR10G7、RNF219、TMC5、ADAMTS18、CDT1、FBXO11、KLHDC7B、OR10G8、RNF220、TMC6、ADAMTS19、CDV3、FBXO15、KLHDC8A、OR10G9、RNF222、TMC7、ADAMTS2、CDX1、FBXO16、KLHDC8B、OR10H1、RNF223、TMC8、ADAMTS20、CDX2、FBXO17、KLHDC9、OR10H2、RNF224、TMCC1、ADAMTS3、CDX4、FBXO18、KLHL1、OR10H3、RNF24、TMCC2、ADAMTS4、CDY1、FBXO2、KLHL10、OR10H4、RNF25、TMCC3、ADAMTS5、CDY1B、FBXO21、KLHL11、OR10H5、RNF26、TMCO1、ADAMTS6、CDY2A、FBXO22、KLHL12、OR10J1、RNF31、TMCO2、ADAMTS7、CDY2B、FBXO24、KLHL13、OR10J3、RNF32、TMCO3、ADAMTS8、CDYL、FBXO25、KLHL14、OR10J5、RNF34、TMCO4、ADAMTS9、CDYL2、FBXO27、KLHL15、OR10K1、RNF38、TMCO5A、ADAMTSL1、CEACAM1、FBXO28、
KLHL17、OR10K2、RNF39、TMCO6、ADAMTSL2、CEACAM16、FBXO3、KLHL18、OR10P1、RNF4、TMED1、ADAMTSL3、CEACAM18、FBXO30、KLHL2、OR10Q1、RNF40、TMED10、ADAMTSL4、CEACAM19、FBXO31、KLHL20、OR10R2、RNF41、TMED2、ADAMTSL5、CEACAM20、FBXO32、KLHL21、OR10S1、RNF43、TMED3、ADAP1、CEACAM21、FBXO33、KLHL22、OR10T2、RNF44、TMED4、ADAP2、CEACAM3、FBXO34、KLHL23、OR10V1、RNF5、TMED5、ADAR、CEACAM4、FBXO36、KLHL24、OR10W1、RNF6、TMED6、ADARB1、CEACAM5、FBXO38、KLHL25、OR10X1、RNF7、TMED7、ADARB2、CEACAM6、FBXO39、KLHL26、OR10Z1、RNF8、TMED7-TICAM2、ADAT1、CEACAM7、FBXO4、KLHL28、OR11A1、RNFT1、TMED8、ADAT2、CEACAM8、FBXO40、KLHL29、OR11G2、RNFT2、TMED9、ADAT3、CEBPA、FBXO41、KLHL3、OR11H1、RNGTT、TMEFF1、ADC、CEBPB、FBXO42、KLHL30、OR11H12、RNH1、TMEFF2、ADCK1、CEBPD、FBXO43、KLHL31、OR11H2、RNLS、TMEM100、ADCK2、CEBPE、FBXO44、KLHL32、OR11H4、RNMT、TMEM101、ADCK3、CEBPG、FBXO45、KLHL33、OR11H6、RNMTL1、TMEM102、ADCK4、CEBPZ、FBXO46、KLHL34、OR11L1、RNPC3、TMEM104、ADCK5、CECR1、FBXO47、KLHL35、OR12D2、RNPEP、TMEM105、ADCY1、CECR2、FBXO48、KLHL36、OR12D3、RNPEPL1、TMEM106A、ADCY10、CECR5、FBXO5、KLHL38、OR13A1、RNPS1、TMEM106B、ADCY2、CECR6、FBXO6、KLHL4、OR13C2、ROBO1、TMEM106C、ADCY3、CEL、FBXO7、KLHL40、OR13C3、ROBO2、TMEM107、ADCY4、CELA1、FBXO8、KLHL41、OR13C4、ROBO3、TMEM108、ADCY5、CELA2A、FBXO9、KLHL42、OR13C5、ROBO4、TMEM109、ADCY6、CELA2B、FBXW10、KLHL5、OR13C8、ROCK1、TMEM11、ADCY7、CELA3A、FBXW11、KLHL6、OR13C9、ROCK2、TMEM110、ADCY8、CELA3B、FBXW12、KLHL7、OR13D1、ROGDI、TMEM110-MUSTN1、ADCY9、CELF1、FBXW2、KLHL8、OR13F1、ROM1、TMEM114、ADCYAP1、CELF2、FBXW4、KLHL9、OR13G1、ROMO1、TMEM115、ADCYAP1R1、CELF3、FBXW5、KLK1、OR13H1、ROPN1、TMEM116、ADD1、CELF4、FBXW7、KLK10、OR13J1、ROPN1B、TMEM117、ADD2、CELF5、FBXW8、KLK11、OR14A16、ROPN1L、TMEM119、ADD3、CELF6、FBXW9、KLK12、OR14C36、ROR1、TMEM120A、ADGB、CELSR1、FCAMR、KLK13、OR14J1、ROR2、TMEM120B、ADH1A、CELSR2、FCAR、KLK14、OR1A1、RORA、TMEM121、ADH1B、CELSR3、FCER1A、KLK15、OR1A2、RORB、TMEM123、ADH1C、CEMP1、FCER1G、KLK2、OR1B1、RORC、TMEM125、ADH4、CEND1、FCER2、KLK3、OR1C1、ROS1、TMEM126A、ADH5、CENPA、FCF1、KLK4、OR1D2、RP1、TMEM126B、ADH6、CENPB、FCGBP、KLK5、OR1D5、RP1L1、TMEM127、ADH7、CENPBD1、FCGR1A、KLK6、OR1E1、RP2、TMEM128、ADHFE1、CENPC、FCGR1B、KLK7、OR1E2、RP9、TMEM129、ADI1、CENPE、FCGR2A、KLK8、OR1F1、RPA1、TMEM130、ADIG、CENPF、FCGR2B、KLK9、OR1G1、RPA2、TMEM131、ADIPOQ、CENPH、FCGR2C、KLKB1、OR1I1、RPA3、TMEM132A、ADIPOR1、CENPI、FCGR3A、KLLN、OR1J1、RPA4、TMEM132B、ADIPOR2、CENPJ、FCGR3B、KLRB1、OR1J4、RPAIN、TMEM132C、ADIRF、CENPK、FCGRT、KLRC1、OR1K1、RPAP1、TMEM132D、ADK、CENPL、FCHO1、KLRC2、OR1L1、RPAP2、TMEM132E、ADM、CENPM、FCHO2、KLRC3、OR1L3、RPAP3、TMEM133、ADM2、CENPN、FCHSD1、KLRC4、OR1L4、RPE、TMEM134、ADM5、CENPO、FCHSD2、KLRC4-KLRK1、OR1L6、RPE65、TMEM135、ADNP、CENPP、FCN1、KLRD1、OR1L8、RPF1、TMEM136、ADNP2、CENPQ、FCN2、KLRF1、OR1M1、RPF2、TMEM138、ADO、CENPT、FCN3、KLRF2、OR1N1、RPGR、
TMEM139、ADORA1、CENPU、FCRL1、KLRG1、OR1N2、RPGRIP1、TMEM140、ADORA2A、CENPV、FCRL2、KLRG2、OR1Q1、RPGRIP1L、TMEM141、ADORA2B、CENPW、FCRL3、KLRK1、OR1S1、RPH3A、TMEM143、ADORA3、CEP104、FCRL4、KMO、OR1S2、RPH3AL、TMEM144、ADPGK、CEP112、FCRL5、KMT2A、OR2A12、RPIA、TMEM145、ADPRH、CEP120、FCRL6、KMT2B、OR2A14、RPL10、TMEM147、ADPRHL1、CEP128、FCRLA、KMT2C、OR2A2、RPL10A、TMEM14A、ADPRHL2、CEP135、FCRLB、KMT2D、OR2A25、RPL10L、TMEM14B、ADPRM、CEP152、FDCSP、KMT2E、OR2A4、RPL11、TMEM14C、ADRA1A、CEP164、FDFT1、KNCN、OR2A5、RPL12、TMEM14E、ADRA1B、CEP170、FDPS、KNDC1、OR2A7、RPL13、TMEM150A、ADRA1D、CEP170B、FDX1、KNG1、OR2AE1、RPL13A、TMEM150B、ADRA2A、CEP19、FDX1L、KNOP1、OR2AG1、RPL14、TMEM150C、ADRA2B、CEP192、FDXACB1、KNSTRN、OR2AG2、RPL15、TMEM151A、ADRA2C、CEP250、FDXR、KNTC1、OR2AK2、RPL17、TMEM151B、ADRB1、CEP290、FECH、KPNA1、OR2AP1、RPL17-C18orf32、TMEM154、ADRB2、CEP350、FEM1A、KPNA2、OR2AT4、RPL18、TMEM155、ADRB3、CEP41、FEM1B、KPNA3、OR2B11、RPL18A、TMEM156、ADRBK1、CEP44、FEM1C、KPNA4、OR2B2、RPL19、TMEM158、ADRBK2、CEP55、FEN1、KPNA5、OR2B3、RPL21、TMEM159、ADRM1、CEP57、FER、KPNA6、OR2B6、RPL22、TMEM160、ADSL、CEP57L1、FER1L5、KPNA7、OR2C1、RPL22L1、TMEM161A、ADSS、CEP63、FER1L6、KPNB1、OR2C3、RPL23、TMEM161B、ADSSL1、CEP68、FERD3L、KPRP、OR2D2、RPL23A、TMEM163、ADTRP、CEP70、FERMT1、KPTN、OR2D3、RPL24、TMEM164、AEBP1、CEP72、FERMT2、KRAS、OR2F1、RPL26、TMEM165、AEBP2、CEP76、FERMT3、KRBA1、OR2F2、RPL26L1、TMEM167A、AEN、CEP78、FES、KRBA2、OR2G2、RPL27、TMEM167B、AES、CEP85、FETUB、KRBOX1、OR2G3、RPL27A、TMEM168、AFAP1、CEP85L、FEV、KRBOX4、OR2G6、RPL28、TMEM169、AFAP1L1、CEP89、FEZ1、KRCC1、OR2H1、RPL29、TMEM17、AFAP1L2、CEP95、FEZ2、KREMEN1、OR2J2、RPL3、TMEM170A、AFF1、CEP97、FEZF1、KREMEN2、OR2J3、RPL30、TMEM170B、AFF2、CEPT1、FEZF2、KRI1、OR2K2、RPL31、TMEM171、AFF3、CER1、FFAR1、KRIT1、OR2L13、RPL32、TMEM173、AFF4、CERCAM、FFAR2、KRR1、OR2L2、RPL34、TMEM174、AFG3L2、CERK、FFAR3、KRT1、OR2L3、RPL35、TMEM175、AFM、CERKL、FFAR4、KRT10、OR2L5、RPL35A、TMEM176A、AFMID、CERS1、FGA、KRT12、OR2L8、RPL36、TMEM176B、AFP、CERS2、FGB、KRT13、OR2M2、RPL36A、TMEM177、AFTPH、CERS3、FGD1、KRT14、OR2M3、RPL36A-HNRNPH2、TMEM178A、AGA、CERS4、FGD2、KRT15、OR2M4、RPL36AL、TMEM178B、AGAP1、CERS5、FGD3、KRT16、OR2M5、RPL37、TMEM179、AGAP10、CERS6、FGD4、KRT17、OR2M7、RPL37A、TMEM179B、AGAP11、CES1、FGD5、KRT18、OR2S2、RPL38、TMEM18、AGAP2、CES2、FGD6、KRT19、OR2T1、RPL39、TMEM180、AGAP3、CES3、FGF1、KRT2、OR2T10、RPL39L、TMEM181、AGAP4、CES4A、FGF10、KRT20、OR2T11、RPL3L、TMEM182、AGAP5、CES5A、FGF11、KRT222、OR2T12、RPL4、TMEM183A、AGAP6、CETN1、FGF12、KRT23、OR2T2、RPL41、TMEM183B、AGAP7、CETN2、FGF13、KRT24、OR2T27、RPL5、TMEM184A、AGAP8、CETN3、FGF14、KRT25、OR2T29、RPL6、TMEM184B、AGAP9、CETP、FGF16、KRT26、OR2T3、RPL7、TMEM184C、AGBL1、CFB、FGF17、KRT27、OR2T33、RPL7A、TMEM185A、AGBL2、CFC1、FGF18、KRT28、OR2T34、RPL7L1、TMEM185B、AGBL3、CFC1B、FGF19、KRT3、OR2T35、RPL8、TMEM186、
AGBL4、CFD、FGF2、KRT31、OR2T4、RPL9、TMEM187、AGBL5、CFDP1、FGF20、KRT32、OR2T5、RPLP0、TMEM189、AGER、CFH、FGF21、KRT33A、OR2T6、RPLP1、TMEM189-UBE2V1、AGFG1、CFHR1、FGF22、KRT33B、OR2T8、RPLP2、TMEM19、AGFG2、CFHR2、FGF23、KRT34、OR2V1、RPN1、TMEM190、AGGF1、CFHR3、FGF3、KRT35、OR2V2、RPN2、TMEM191B、AGK、CFHR4、FGF4、KRT36、OR2W1、RPP14、TMEM191C、AGL、CFHR5、FGF5、KRT37、OR2W3、RPP21、TMEM192、AGMAT、CFI、FGF6、KRT38、OR2W5、RPP25、TMEM194A、AGMO、CFL1、FGF7、KRT39、OR2Y1、RPP25L、TMEM194B、AGO1、CFL2、FGF8、KRT4、OR2Z1、RPP30、TMEM196、AGO2、CFLAR、FGF9、KRT40、OR3A1、RPP38、TMEM198、AGO3、CFP、FGFBP1、KRT5、OR3A2、RPP40、TMEM199、AGO4、CFTR、FGFBP2、KRT6A、OR3A3、RPRD1A、TMEM2、AGPAT1、CGA、FGFBP3、KRT6B、OR4A15、RPRD1B、TMEM200A、AGPAT2、CGB、FGFR1、KRT6C、OR4A16、RPRD2、TMEM200B、AGPAT3、CGB1、FGFR1OP、KRT7、OR4A47、RPRM、TMEM201、AGPAT4、CGB2、FGFR1OP2、KRT71、OR4A5、RPRML、TMEM202、AGPAT5、CGB5、FGFR2、KRT72、OR4B1、RPS10、TMEM203、AGPAT6、CGB7、FGFR3、KRT73、OR4C11、RPS10-NUDT3、TMEM204、AGPAT9、CGB8、FGFR4、KRT74、OR4C12、RPS11、TMEM205、AGPS、CGGBP1、FGFRL1、KRT75、OR4C13、RPS12、TMEM206、AGR2、CGN、FGG、KRT76、OR4C15、RPS13、TMEM207、AGR3、CGNL1、FGGY、KRT77、OR4C16、RPS14、TMEM208、AGRN、CGREF1、FGL1、KRT78、OR4C3、RPS15、TMEM209、AGRP、CGRRF1、FGL2、KRT79、OR4C45、RPS15A、TMEM210、AGT、CH25H、FGR、KRT8、OR4C46、RPS16、TMEM211、AGTPBP1、CHAC1、FH、KRT80、OR4C6、RPS17、TMEM212、AGTR1、CHAC2、FHAD1、KRT81、OR4D1、RPS17L、TMEM213、AGTR2、CHAD、FHDC1、KRT82、OR4D10、RPS18、TMEM214、AGTRAP、CHADL、FHIT、KRT83、OR4D11、RPS19、TMEM215、AGXT、CHAF1A、FHL1、KRT84、OR4D2、RPS19BP1、TMEM216、AGXT2、CHAF1B、FHL2、KRT85、OR4D5、RPS2、TMEM217、AHCTF1、CHAMP1、FHL3、KRT86、OR4D6、RPS20、TMEM218、AHCY、CHAT、FHL5、KRT9、OR4D9、RPS21、TMEM219、AHCYL1、CHCHD1、FHOD1、KRTAP10-1、OR4E2、RPS23、TMEM220、AHCYL2、CHCHD10、FHOD3、KRTAP10-10、OR4F15、RPS24、TMEM221、AHDC1、CHCHD2、FIBCD1、KRTAP10-11、OR4F16、RPS25、TMEM222、AHI1、CHCHD3、FIBIN、KRTAP10-12、OR4F21、RPS26、TMEM223、AHNAK、CHCHD4、FIBP、KRTAP10-2、OR4F29、RPS27、TMEM225、AHNAK2、CHCHD5、FICD、KRTAP10-3、OR4F3、RPS27A、TMEM229A、AHR、CHCHD6、FIG4、KRTAP10-4、OR4F4、RPS27L、TMEM229B、AHRR、CHCHD7、FIGF、KRTAP10-5、OR4F5、RPS28、TMEM230、AHSA1、CHD1、FIGLA、KRTAP10-6、OR4F6、RPS29、TMEM231、AHSA2、CHD1L、FIGN、KRTAP10-7、OR4K1、RPS3、TMEM232、AHSG、CHD2、FIGNL1、KRTAP10-8、OR4K13、RPS3A、TMEM233、AHSP、CHD3、FIGNL2、KRTAP10-9、OR4K14、RPS4X、TMEM234、AICDA、CHD4、FILIP1、KRTAP1-1、OR4K15、RPS4Y1、TMEM235、AIDA、CHD5、FILIP1L、KRTAP11-1、OR4K17、RPS4Y2、TMEM236、AIF1、CHD6、FIP1L1、KRTAP12-1、OR4K2、RPS5、TMEM237、AIF1L、CHD7、FIS1、KRTAP12-2、OR4K5、RPS6、TMEM238、AIFM1、CHD8、FITM1、KRTAP12-3、OR4L1、RPS6KA1、TMEM239、AIFM2、CHD9、FITM2、KRTAP12-4、OR4M1、RPS6KA2、TMEM240、AIFM3、CHDC2、FIZ1、KRTAP1-3、OR4M2、RPS6KA3、TMEM241、AIG1、CHDH、FJX1、KRTAP13-1、OR4N2、RPS6KA4、TMEM242、AIM1、CHEK1、FKBP10、KRTAP13-2、OR4N4、RPS6KA5、TMEM243、AIM1L、
CHEK2、FKBP11、KRTAP13-3、OR4N5、RPS6KA6、TMEM244、AIM2、CHERP、FKBP14、KRTAP13-4、OR4P4、RPS6KB1、TMEM245、AIMP1、CHFR、FKBP15、KRTAP1-4、OR4Q3、RPS6KB2、TMEM246、AIMP2、CHGA、FKBP1A、KRTAP1-5、OR4S1、RPS6KC1、TMEM247、AIP、CHGB、FKBP1B、KRTAP15-1、OR4S2、RPS6KL1、TMEM248、AIPL1、CHI3L1、FKBP2、KRTAP16-1、OR4X1、RPS7、TMEM249、AIRE、CHI3L2、FKBP3、KRTAP17-1、OR4X2、RPS8、TMEM25、AJAP1、CHIA、FKBP4、KRTAP19-1、OR51A2、RPS9、TMEM251、AJUBA、CHIC1、FKBP5、KRTAP19-2、OR51A4、RPSA、TMEM252、AK1、CHIC2、FKBP6、KRTAP19-3、OR51A7、RPTN、TMEM253、AK2、CHID1、FKBP7、KRTAP19-4、OR51B2、RPTOR、TMEM254、AK3、CHIT1、FKBP8、KRTAP19-5、OR51B4、RPUSD1、TMEM255A、AK4、CHKA、FKBP9、KRTAP19-6、OR51B5、RPUSD2、TMEM255B、AK5、CHKB、FKBPL、KRTAP19-7、OR51B6、RPUSD3、TMEM256、AK6、CHL1、FKRP、KRTAP19-8、OR51D1、RPUSD4、TMEM257、AK7、CHM、FKTN、KRTAP20-1、OR51E1、RQCD1、TMEM258、AK8、CHML、FLAD1、KRTAP20-2、OR51E2、RRAD、TMEM259、AK9、CHMP1A、FLCN、KRTAP20-3、OR51F1、RRAGA、TMEM26、AKAP1、CHMP1B、FLG、KRTAP2-1、OR51F2、RRAGB、TMEM260、AKAP10、CHMP2A、FLG2、KRTAP21-1、OR51G1、RRAGC、TMEM261、AKAP11、CHMP2B、FLI1、KRTAP21-2、OR51G2、RRAGD、TMEM27、AKAP12、CHMP3、FLII、KRTAP21-3、OR51I1、RRAS、TMEM30A、AKAP13、CHMP4A、FLJ22184、KRTAP2-2、OR51I2、RRAS2、TMEM30B、AKAP14、CHMP4B、FLJ25363、KRTAP22-1、OR51L1、RRBP1、TMEM31、AKAP17A、CHMP4C、FLJ44313、KRTAP22-2、OR51M1、RREB1、TMEM33、AKAP2、CHMP5、FLJ44635、KRTAP2-3、OR51Q1、RRH、TMEM35、AKAP3、CHMP6、FLJ45513、KRTAP23-1、OR51S1、RRM1、TMEM37、AKAP4、CHMP7、FLNA、KRTAP2-4、OR51T1、RRM2、TMEM38A、AKAP5、CHN1、FLNB、KRTAP24-1、OR51V1、RRM2B、TMEM38B、AKAP6、CHN2、FLNC、KRTAP25-1、OR52A1、RRN3、TMEM39A、AKAP7、CHODL、FLOT1、KRTAP26-1、OR52A5、RRNAD1、TMEM39B、AKAP8、CHORDC1、FLOT2、KRTAP27-1、OR52B2、RRP1、TMEM40、AKAP8L、CHP1、FLRT1、KRTAP29-1、OR52B4、RRP12、TMEM41A、AKAP9、CHP2、FLRT2、KRTAP3-1、OR52B6、RRP15、TMEM41B、AKIP1、CHPF、FLRT3、KRTAP3-2、OR52D1、RRP1B、TMEM42、AKIRIN1、CHPF2、FLT1、KRTAP3-3、OR52E2、RRP36、TMEM43、AKIRIN2、CHPT1、FLT3、KRTAP4-1、OR52E4、RRP7A、TMEM44、AKNA、CHRAC1、FLT3LG、KRTAP4-11、OR52E6、RRP8、TMEM45A、AKNAD1、CHRD、FLT4、KRTAP4-12、OR52E8、RRP9、TMEM45B、AKR1A1、CHRDL1、FLVCR1、KRTAP4-2、OR52H1、RRS1、TMEM47、AKR1B1、CHRDL2、FLVCR2、KRTAP4-3、OR52I1、RS1、TMEM5、AKR1B10、CHRFAM7A、FLYWCH1、KRTAP4-4、OR52I2、RSAD1、TMEM50A、AKR1B15、CHRM1、FLYWCH2、KRTAP4-5、OR52J3、RSAD2、TMEM50B、AKR1C1、CHRM2、FMN1、KRTAP4-6、OR52K1、RSBN1、TMEM51、AKR1C2、CHRM3、FMN2、KRTAP4-7、OR52K2、RSBN1L、TMEM52、AKR1C3、CHRM4、FMNL1、KRTAP4-8、OR52L1、RSC1A1、TMEM52B、AKR1C4、CHRM5、FMNL2、KRTAP4-9、OR52M1、RSF1、TMEM53、AKR1D1、CHRNA1、FMNL3、KRTAP5-1、OR52N1、RSG1、TMEM54、AKR1E2、CHRNA10、FMO1、KRTAP5-10、OR52N4、RSL1D1、TMEM55A、AKR7A2、CHRNA2、FMO2、KRTAP5-11、OR52N5、RSL24D1、TMEM55B、AKR7A3、CHRNA3、FMO3、KRTAP5-2、OR52R1、RSPH1、TMEM56、AKT1、CHRNA4、FMO4、KRTAP5-3、OR52W1、RSPH10B、TMEM56-RWDD3、AKT1S1、CHRNA5、FMO5、KRTAP5-4、
OR56A1、RSPH10B2、TMEM57、AKT2、CHRNA6、FMOD、KRTAP5-5、OR56A3、RSPH3、TMEM59、AKT3、CHRNA7、FMR1、KRTAP5- 6、OR56A4、RSPH4A、TMEM59L、AKTIP、CHRNA9、FMR1NB、KRTAP5-7、OR56A5、RSPH6A、TMEM60、ALAD、CHRNB1、FN1、KRTAP5-8、OR56B1、RSPH9、TMEM61、ALAS1、CHRNB2、FN3K、KRTAP5-9、OR56B4、RSPO1、TMEM62、ALAS2、CHRNB3、FN3KRP、KRTAP6-1、OR5A1、RSPO2、TMEM63A、ALB、CHRNB4、FNBP1、KRTAP6-2、OR5A2、RSPO3、TMEM63B、ALCAM、CHRND、FNBP1L、KRTAP6-3、OR5AC2、RSPO4、TMEM63C、ALDH16A1、CHRNE、FNBP4、KRTAP7-1、OR5AK2、RSPRY1、TMEM64、ALDH18A1、CHRNG、FNDC1、KRTAP8-1、OR5AN1、RSRC1、TMEM65、ALDH1A1、CHST1、FNDC3A、KRTAP9-1、OR5AP2、RSRC2、TMEM66、ALDH1A2、CHST10、FNDC3B、KRTAP9-2、OR5AR1、RSU1、TMEM67、ALDH1A3、CHST11、FNDC4、KRTAP9-3、OR5AS1、RTBDN、TMEM68、ALDH1B1、CHST12、FNDC5、KRTAP9-4、OR5AU1、RTCA、TMEM69、ALDH1L1、CHST13、FNDC7、KRTAP9-6、OR5B12、RTCB、TMEM70、ALDH1L2、CHST14、FNDC8、KRTAP9-7、OR5B17、RTDR1、TMEM71、ALDH2、CHST15、FNDC9、KRTAP9- 8、OR5B2、RTEL1、TMEM72、ALDH3A1、CHST2、FNIP1、KRTAP9-9、OR5B21、RTF1、TMEM74、ALDH3A2、CHST3、FNIP2、KRTCAP2、OR5B3、RTFDC1、TMEM74B、ALDH3B1、CHST4、FNTA、KRTCAP3、OR5C1、RTKN、TMEM79、ALDH3B2、CHST5、FNTB、KRTDAP、OR5D13、RTKN2、TMEM80、ALDH4A1、CHST6、FOCAD、KSR1、OR5D14、RTL1、TMEM81、ALDH5A1、CHST7、FOLH1、KSR2、OR5D16、RTN1、TMEM82、ALDH6A1、CHST8、FOLH1B、KTI12、OR5D18、RTN2、TMEM86A、ALDH7A1、CHST9、FOLR1、KTN1、OR5F1、RTN3、TMEM86B、ALDH8A1、CHSY1、FOLR2、KXD1、OR5H1、RTN4、TMEM87A、ALDH9A1、CHSY3、FOLR3、KY、OR5H14、RTN4IP1、TMEM87B、ALDOA、CHTF18、FOLR4、KYNU、OR5H15、RTN4R、TMEM88、ALDOB、CHTF8、FOPNL、L1CAM、OR5H2、RTN4RL1、TMEM88B、ALDOC、CHtop、FOS、L1TD1、OR5H6、RTN4RL2、TMEM89、ALG1、CHUK、FOSB、L2HGDH、OR5I1、RTP1、TMEM8A、ALG10、CHURC1、FOSL1、L3HYPDH、OR5J2、RTP2、TMEM8B、ALG10B、CHURC1-FNTB、FOSL2、L3MBTL1、OR5K1、RTP3、TMEM8C、ALG11、CIAO1、FOXA1、L3MBTL2、OR5K2、RTP4、TMEM9、ALG12、CIAPIN1、FOXA2、L3MBTL3、OR5K3、RTTN、TMEM91、ALG13、CIB1、FOXA3、L3MBTL4、OR5K4、RUFY1、TMEM92、ALG14、CIB2、FOXB1、LACC1、OR5L1、RUFY2、TMEM95、ALG1L、CIB3、FOXB2、LACE1、OR5L2、RUFY3、TMEM97、ALG1L2、CIB4、FOXC1、LACRT、OR5M1、RUFY4、TMEM98、ALG2、CIC、FOXC2、LACTB、OR5M10、RUNDC1、TMEM99、ALG3、CIDEA、FOXD1、LACTB2、OR5M11、RUNDC3A、TMEM9B、ALG5、CIDEB、FOXD2、LACTBL1、OR5M3、RUNDC3B、TMF1、ALG6、CIDEC、FOXD3、LAD1、OR5M8、RUNX1、TMIE、ALG8、CIITA、FOXD4、LAG3、OR5M9、RUNX1T1、TMIGD1、ALG9、CILP、FOXD4L1、LAGE3、OR5P2、RUNX2、TMIGD2、ALK、CILP2、FOXD4L2、LAIR1、OR5P3、RUNX3、TMLHE、ALKBH1、CINP、FOXD4L3、LAIR2、OR5R1、RUSC1、TMOD1、ALKBH2、CIR1、FOXD4L4、LALBA、OR5T1、RUSC1-AS1、TMOD2、ALKBH3、CIRBP、FOXD4L5、LAMA1、OR5T2、RUSC2、TMOD3、ALKBH4、CIRH1A、FOXD4L6、LAMA2、OR5T3、RUVBL1、TMOD4、ALKBH5、CISD1、FOXE1、LAMA3、OR5V1、RUVBL2、TMPO、ALKBH6、CISD2、FOXE3、LAMA4、OR5W2、RWDD1、TMPPE、ALKBH7、CISD3、FOXF1、LAMA5、OR6A2、RWDD2A、TMPRSS11A、ALKBH8、CISH、FOXF2、LAMB1、OR6B1、RWDD2B、TMPRSS11B、ALLC、CIT、FOXG1、LAMB2、OR6B3、RWDD3、TMPRSS11D、
ALMS1、CITED1、FOXH1、LAMB3、OR6C1、RWDD4、TMPRSS11E、ALOX12、CITED2、FOXI1、LAMB4、OR6C2、RXFP1、TMPRSS11F、ALOX12B、CITED4、FOXI2、LAMC1、OR6C3、RXFP2、TMPRSS12、ALOX15、CIZ1、FOXI3、LAMC2、OR6C4、RXFP3、TMPRSS13、ALOX15B、CKAP2、FOXJ1、LAMC3、OR6C6、RXFP4、TMPRSS15、ALOX5、CKAP2L、FOXJ2、LAMP1、OR6C65、RXRA、TMPRSS2、ALOX5AP、CKAP4、FOXJ3、LAMP2、OR6C68、RXRB、TMPRSS3、ALOXE3、CKAP5、FOXK1、LAMP3、OR6C70、RXRG、TMPRSS4、ALPI、CKB、FOXK2、LAMP5、OR6C74、RYBP、TMPRSS5、ALPK1、CKLF、FOXL1、LAMTOR1、OR6C75、RYK、TMPRSS6、ALPK2、CKLF-CMTM1、FOXL2、LAMTOR2、OR6C76、RYR1、TMPRSS7、ALPK3、CKM、FOXM1、LAMTOR3、OR6F1、RYR2、TMPRSS9、ALPL、CKMT1A、FOXN1、LAMTOR4、OR6K2、RYR3、TMSB10、ALPP、CKMT1B、FOXN2、LAMTOR5、OR6K3、S100A1、TMSB15A、ALPPL2、CKMT2、FOXN3、LANCL1、OR6K6、S100A10、TMSB15B、ALS2、CKS1B、FOXN4、LANCL2、OR6M1、S100A11、TMSB4X、ALS2CL、CKS2、FOXO1、LANCL3、OR6N1、S100A12、TMSB4Y、ALS2CR11、CLASP1、FOXO3、LAP3、OR6N2、S100A13、TMTC1、ALS2CR12、CLASP2、FOXO4、LAPTM4A、OR6P1、S100A14、TMTC2、ALX1、CLASRP、FOXO6、LAPTM4B、OR6Q1、S100A16、TMTC3、ALX3、CLC、FOXP1、LAPTM5、OR6S1、S100A2、TMTC4、ALX4、CLCA1、FOXP2、LARGE、OR6T1、S100A3、TMUB1、ALYREF、CLCA2、FOXP3、LARP1、OR6V1、S100A4、TMUB2、AMACR、CLCA4、FOXP4、LARP1B、OR6X1、S100A5、TMX1、AMBN、CLCC1、FOXQ1、LARP4、OR6Y1、S100A6、TMX2、AMBP、CLCF1、FOXR1、LARP4B、OR7A10、S100A7、TMX3、AMBRA1、CLCN1、FOXR2、LARP6、OR7A17、S100A7A、TMX4、AMD1、CLCN2、FOXRED1、LARP7、OR7A5、S100A7L2、TNC、AMDHD1、CLCN3、FOXRED2、LARS、OR7C1、S100A8、TNF、AMDHD2、CLCN4、FOXS1、LARS2、OR7C2、S100A9、TNFAIP1、AMELX、CLCN5、FPGS、LAS1L、OR7D2、S100B、TNFAIP2、AMELY、CLCN6、FPGT、LASP1、OR7D4、S100G、TNFAIP3、AMER1、CLCN7、FPGT-TNNI3K、LAT、OR7E24、S100P、TNFAIP6、AMER2、CLCNKA、FPR1、LAT2、OR7G1、S100PBP、TNFAIP8、AMER3、CLCNKB、FPR2、LATS1、OR7G2、S100Z、TNFAIP8L1、AMFR、CLDN1、FPR3、LATS2、OR7G3、S1PR1、TNFAIP8L2、AMH、CLDN10、FRA10AC1、LAX1、OR8A1、S1PR2、TNFAIP8L2-SCNM1、AMHR2、CLDN11、FRAS1、LAYN、OR8B12、S1PR3、TNFAIP8L3、AMICA1、CLDN12、FRAT1、LBH、OR8B2、S1PR4、TNFRSF10A、AMIGO1、CLDN14、FRAT2、LBP、OR8B3、S1PR5、TNFRSF10B、AMIGO2、CLDN15、FREM1、LBR、OR8B4、SAA1、TNFRSF10C、AMIGO3、CLDN16、FREM2、LBX1、OR8B8、SAA2、TNFRSF10D、AMMECR1、CLDN17、FREM3、LBX2、OR8D1、SAA2-SAA4、TNFRSF11A、AMMECR1L、CLDN18、FRG1、LCA5、OR8D2、SAA4、TNFRSF11B、AMN、CLDN19、FRG2、LCA5L、OR8D4、SAAL1、TNFRSF12A、AMN1、CLDN2、FRG2B、LCAT、OR8G1、SAC3D1、TNFRSF13B、AMOT、CLDN20、FRG2C、LCE1A、OR8G2、SACM1L、TNFRSF13C、AMOTL1、CLDN22、FRK、LCE1B、OR8G5、SACS、TNFRSF14、AMOTL2、CLDN23、FRMD1、LCE1C、OR8H1、SAE1、TNFRSF17、AMPD1、CLDN24、FRMD3、LCE1D、OR8H2、SAFB、TNFRSF18、AMPD2、CLDN25、FRMD4A、LCE1E、OR8H3、SAFB2、TNFRSF19、AMPD3、CLDN3、FRMD4B、LCE1F、OR8I2、SAG、TNFRSF1A、AMPH、CLDN4、FRMD5、LCE2A、OR8J1、SAGE1、TNFRSF1B、AMT、CLDN5、FRMD6、LCE2B、OR8J3、SALL1、TNFRSF21、AMTN、CLDN6、FRMD7、LCE2C、OR8K1、SALL2、TNFRSF25、AMY1A、CLDN7、FRMD8、LCE2D、OR8K3、
SALL3、TNFRSF4、AMY1B、CLDN8、FRMPD1、LCE3A、OR8K5、SALL4、TNFRSF6B、AMY1C、CLDN9、FRMPD2、LCE3B、OR8S1、SAMD1、TNFRSF8、AMY2A、CLDND1、FRMPD3、LCE3C、OR8U1、SAMD10、TNFRSF9、AMY2B、CLDND2、FRMPD4、LCE3D、OR8U8、SAMD11、TNFSF10、AMZ1、CLEC10A、FRRS1、LCE3E、OR9A2、SAMD12、TNFSF11、AMZ2、CLEC11A、FRRS1L、LCE4A、OR9A4、SAMD13、TNFSF12、ANAPC1、CLEC12A、FRS2、LCE5A、OR9G1、SAMD14、TNFSF12-TNFSF13、ANAPC10、CLEC12B、FRS3、LCE6A、OR9G4、SAMD15、TNFSF13、ANAPC11、CLEC14A、FRY、LCK、OR9G9、SAMD3、TNFSF13B、ANAPC13、CLEC16A、FRYL、LCLAT1、OR9I1、SAMD4A、TNFSF14、ANAPC15、CLEC17A、FRZB、LCMT1、OR9K2、SAMD4B、TNFSF15、ANAPC16、CLEC18A、FSBP、LCMT2、OR9Q1、SAMD5、TNFSF18、ANAPC2、CLEC18B、FSCB、LCN1、OR9Q2、SAMD7、TNFSF4、ANAPC4、CLEC18C、FSCN1、LCN10、ORAI1、SAMD8、TNFSF8、ANAPC5、CLEC19A、FSCN2、LCN12、ORAI2、SAMD9、TNFSF9、ANAPC7、CLEC1A、FSCN3、LCN15、ORAI3、SAMD9L、TNIK、ANG、CLEC1B、FSD1、LCN2、ORAOV1、SAMHD1、TNIP1、ANGEL1、CLEC2A、FSD1L、LCN6、ORC1、SAMM50、TNIP2、ANGEL2、CLEC2B、FSD2、LCN8、ORC2、SAMSN1、TNIP3、ANGPT1、CLEC2D、FSHB、LCN9、ORC3、SAP130、TNK1、ANGPT2、CLEC2L、FSHR、LCNL1、ORC4、SAP18、TNK2、ANGPT4、CLEC3A、FSIP1、LCOR、ORC5、SAP25、TNKS、ANGPTL1、CLEC3B、FSIP2、LCORL、ORC6、SAP30、TNKS1BP1、ANGPTL2、CLEC4A、FST、LCP1、ORM1、SAP30BP、TNKS2、ANGPTL3、CLEC4C、FSTL1、LCP2、ORM2、SAP30L、TNMD、ANGPTL4、CLEC4D、FSTL3、LCT、ORMDL1、SAPCD1、TNN、ANGPTL5、CLEC4E、FSTL4、LCTL、ORMDL2、SAPCD2、TNNC1、ANGPTL6、CLEC4F、FSTL5、LDB1、ORMDL3、SAR1A、TNNC2、ANGPTL7、CLEC4G、FTCD、LDB2、OS9、SAR1B、TNNI1、ANHX、CLEC4M、FTH1、LDB3、OSBP、SARDH、TNNI2、ANK1、CLEC5A、FTH1P18、LDHA、OSBP2、SARM1、TNNI3、ANK2、CLEC6A、FTHL17、LDHAL6A、OSBPL10、SARNP、TNNI3K、ANK3、CLEC7A、FTL、LDHAL6B、OSBPL11、SARS、TNNT1、ANKAR、CLEC9A、FTMT、LDHB、OSBPL1A、SARS2、TNNT2、ANKDD1A、CLECL1、FTO、LDHC、OSBPL2、SART1、TNNT3、ANKDD1B、CLGN、FTSJ1、LDHD、OSBPL3、SART3、TNP1、ANKEF1、CLHC1、FTSJ2、LDLR、OSBPL5、SASH1、TNP2、ANKFN1、CLIC1、FTSJ3、LDLRAD1、OSBPL6、SASH3、TNPO1、ANKFY1、CLIC2、FUBP1、LDLRAD2、OSBPL7、SASS6、TNPO2、ANKH、CLIC3、FUBP3、LDLRAD3、OSBPL8、SAT1、TNPO3、ANKHD1、CLIC4、FUCA1、LDLRAD4、OSBPL9、SAT2、TNR、ANKHD1-EIF4EBP3、CLIC5、FUCA2、LDLRAP1、OSCAR、SATB1、TNRC18、ANKIB1、CLIC6、FUK、LDOC1、OSCP1、SATB2、TNRC6A、ANKK1、CLINT1、FUNDC1、LDOC1L、OSER1、SATL1、TNRC6B、ANKLE1、CLIP1、FUNDC2、LEAP2、OSGEP、SAV1、TNRC6C、ANKLE2、CLIP2、FUOM、LECT1、OSGEPL1、SAYSD1、TNS1、ANKMY1、CLIP3、FURIN、LECT2、OSGIN1、SBDS、TNS3、ANKMY2、CLIP4、FUS、LEF1、OSGIN2、SBF1、TNS4、ANKRA2、CLK1、FUT1、leftY1、OSM、SBF2、TNXB、ANKRD1、CLK2、FUT10、leftY2、OSMR、SBK1、TOB1、ANKRD10、CLK3、FUT11、LEKR1、OSR1、SBK2、TOB2、ANKRD11、CLK4、FUT2、LELP1、OSR2、SBK3、TOE1、ANKRD12、CLLU1、FUT3、LEMD1、OST4、SBNO1、TOLLIP、ANKRD13A、CLLU1OS、FUT4、LEMD2、OSTC、SBNO2、TOM1、ANKRD13B、CLMN、FUT5、LEMD3、OSTF1、SBSN、TOM1L1、ANKRD13C、CLMP、FUT6、LENEP、OSTM1、SBSPON、TOM1L2、ANKRD13D、CLN3、FUT7、LENG1、OSTN、SC5D、TOMM20、
ANKRD16、CLN5、FUT8、LENG8、OTC、SCAF1、TOMM20L、ANKRD17、CLN6、FUT9、LENG9、OTOA、SCAF11、TOMM22、ANKRD18A、CLN8、FUZ、LEO1、OTOF、SCAF4、TOMM34、ANKRD18B、CLNK、FXN、LEP、OTOG、SCAF8、TOMM40、ANKRD2、CLNS1A、FXR1、LEPR、OTOGL、SCAI、TOMM40L、ANKRD20A1、CLOCK、FXR2、LEPRE1、OTOL1、SCAMP1、TOMM5、ANKRD20A2、CLP1、FXYD1、LEPREL1、Otop1、SCAMP2、TOMM6、ANKRD20A3、CLPB、FXYD2、LEPREL2、Otop2、SCAMP3、TOMM7、ANKRD20A4、CLPP、FXYD3、LEPREL4、Otop3、SCAMP4、TOMM70A、ANKRD22、CLPS、FXYD4、LEPROT、OTOR、SCAMP5、TONSL、ANKRD23、CLPSL1、FXYD5、LEPROTL1、OTOS、SCAND1、top1、ANKRD24、CLPSL2、FXYD6、LETM1、OTP、SCAND3、top1MT、ANKRD26、CLPTM1、FXYD6-FXYD2、LETM2、OTUB1、SCAP、top2A、ANKRD27、CLPTM1L、FXYD7、LETMD1、OTUB2、SCAPER、top2B、ANKRD28、CLPX、FYB、LEUTX、OTUD1、SCARA3、top3A、ANKRD29、CLRN1、FYCO1、LFNG、OTUD3、SCARA5、top3B、ANKRD30A、CLRN2、FYN、LGALS1、OTUD4、SCARB1、topAZ1、ANKRD30B、CLRN3、FYTTD1、LGALS12、OTUD5、SCARB2、topBP1、ANKRD31、CLSPN、FZD1、LGALS13、OTUD6A、SCARF1、topORS、ANKRD32、CLSTN1、FZD10、LGALS14、OTUD6B、SCARF2、TOR1A、ANKRD33、CLSTN2、FZD2、LGALS16、OTUD7A、SCCPDH、TOR1AIP1、ANKRD33B、CLSTN3、FZD3、LGALS2、OTUD7B、SCD、TOR1AIP2、ANKRD34A、CLTA、FZD4、LGALS3、OTX1、SCD5、TOR1B、ANKRD34B、CLTB、FZD5、LGALS3BP、OTX2、SCEL、TOR2A、ANKRD34C、CLTC、FZD6、LGALS4、OVCA2、SCFD1、TOR3A、ANKRD35、CLTCL1、FZD7、LGALS7、OVCH1、SCFD2、TOR4A、ANKRD36、CLU、FZD8、LGALS7B、OVCH2、SCG2、TOX、ANKRD36B、CLUAP1、FZD9、LGALS8、OVGP1、SCG3、TOX2、ANKRD36C、CLUH、FZR1、LGALS9、OVOL1、SCG5、TOX3、ANKRD37、CLUL1、G0S2、LGALS9B、OVOL2、SCGB1A1、TOX4、ANKRD39、CLVS1、G2E3、LGALS9C、OVOL3、SCGB1C1、TP53、ANKRD40、CLVS2、G3BP1、LGALSL、OVOS、SCGB1D1、TP53AIP1、ANKRD42、CLYBL、G3BP2、LGI1、OVOS2、SCGB1D2、TP53BP1、ANKRD44、CMA1、G6PC、LGI2、OXA1L、SCGB1D4、TP53BP2、ANKRD45、CMAS、G6PC2、LGI3、OXCT1、SCGB2A1、TP53I11、ANKRD46、CMBL、G6PC3、LGI4、OXCT2、SCGB2A2、TP53I13、ANKRD49、CMC1、G6PD、LGMN、OXER1、SCGB2B2、TP53I3、ANKRD50、CMC2、GAA、LGR4、OXGR1、SCGB3A1、TP53INP1、ANKRD52、CMC4、GAB1、LGR5、OXLD1、SCGB3A2、TP53INP2、ANKRD53、CMIP、GAB2、LGR6、OXNAD1、SCGN、TP53RK、ANKRD54、CMKLR1、GAB3、LGSN、OXR1、SCHIP1、TP53TG3、ANKRD55、CMPK1、GAB4、LHB、OXSM、SCIMP、TP53TG3B、ANKRD6、CMPK2、GABARAP、LHCGR、OXSR1、SCIN、TP53TG3C、ANKRD60、CMSS1、GABARAPL1、LHFP、OXT、SCLT1、TP53TG3D、ANKRD61、CMTM1、GABARAPL2、LHFPL1、OXTR、SCLY、TP53TG5、ANKRD62、CMTM2、GABBR1、LHFPL2、P2RX1、SCMH1、TP63、ANKRD63、CMTM3、GABBR2、LHFPL3、P2RX2、SCML1、TP73、ANKRD65、CMTM4、GABPA、LHFPL4、P2RX3、SCML2、TPBG、ANKRD66、CMTM5、GABPB1、LHFPL5、P2RX4、SCML4、TPBGL、ANKRD7、CMTM6、GABPB2、LHPP、P2RX5、SCN10A、TPCN1、ANKRD9、CMTM7、GABRA1、LHX1、P2RX6、SCN11A、TPCN2、ANKS1A、CMTM8、GABRA2、LHX2、P2RX7、SCN1A、TPD52、ANKS1B、CMTR1、GABRA3、LHX3、P2RY1、SCN1B、TPD52L1、ANKS3、CMTR2、GABRA4、LHX4、P2RY10、SCN2A、TPD52L2、ANKS4B、CMYA5、GABRA5、LHX5、P2RY11、SCN2B、TPD52L3、ANKS6、CNBD1、GABRA6、LHX6、
P2RY12、SCN3A、TPGS1、ANKUB1、CNBD2、GABRB1、LHX8、P2RY13、SCN3B、TPGS2、ANKZF1、CNBP、GABRB2、LHX9、P2RY14、SCN4A、TPH1、ANLN、CNDP1、GABRB3、LIAS、P2RY2、SCN4B、TPH2、ANO1、CNDP2、GABRD、LIF、P2RY4、SCN5A、TPI1、ANO10、CNEP1R1、GABRE、LIFR、P2RY6、SCN7A、TPK1、ANO2、CNFN、GABRG1、LIG1、P2RY8、SCN8A、TPM1、ANO3、CNGA1、GABRG2、LIG3、P4HA1、SCN9A、TPM2、ANO4、CNGA2、GABRG3、LIG4、P4HA2、SCNM1、TPM3、ANO5、CNGA3、GABRP、LILRA1、P4HA3、SCNN1A、TPM4、ANO6、CNGA4、GABRQ、LILRA2、P4HB、SCNN1B、TPMT、ANO7、CNGB1、GABRR1、LILRA3、P4HTM、SCNN1D、TPO、ANO8、CNGB3、GABRR2、LILRA4、PA2G4、SCNN1G、TPP1、ANO9、CNIH1、GABRR3、LILRA5、PAAF1、SCO1、TPP2、ANP32A、CNIH2、GAD1、LILRA6、PABPC1、SCO2、TPPP、ANP32B、CNIH3、GAD2、LILRB1、PABPC1L、SCOC、TPPP2、ANP32C、CNIH4、GADD45A、LILRB2、PABPC1L2A、SCP2、TPPP3、ANP32D、CNKSR1、GADD45B、LILRB3、PABPC1L2B、SCP2D1、TPR、ANP32E、CNKSR2、GADD45G、LILRB4、PABPC3、SCPEP1、TPRA1、ANPEP、CNKSR3、GADD45GIP1、LILRB5、PABPC4、SCRG1、TPRG1、ANTXR1、CNN1、GADL1、LIM2、PABPC4L、SCRIB、TPRG1L、ANTXR2、CNN2、GAGE1、LIMA1、PABPC5、SCRN1、TPRKB、ANTXRL、CNN3、GAGE10、LIMCH1、PABPN1、SCRN2、TPRN、ANXA1、CNNM1、GAGE12B、LIMD1、PABPN1L、SCRN3、TPRX1、ANXA10、CNNM2、GAGE12C、LIMD2、PACRG、SCRT1、TPSAB1、ANXA11、CNNM3、GAGE12D、LIME1、PACRGL、SCRT2、TPSB2、ANXA13、CNNM4、GAGE12E、LIMK1、PACS1、SCT、TPSD1、ANXA2、CNOT1、GAGE12F、LIMK2、PACS2、SCTR、TPSG1、ANXA2R、CNOT10、GAGE12G、LIMS1、PACSIN1、SCUBE1、TPST1、ANXA3、CNOT11、GAGE12H、LIMS2、PACSIN2、SCUBE2、TPST2、ANXA4、CNOT2、GAGE12I、LIMS3、PACSIN3、SCUBE3、TPT1、ANXA5、CNOT3、GAGE12J、LIN28A、PADI1、SCXA、TPTE、ANXA6、CNOT4、GAGE13、LIN28B、PADI2、SCXB、TPTE2、ANXA7、CNOT6、GAGE2A、LIN37、PADI3、SCYL1、TPX2、ANXA8、CNOT6L、GAGE2B、LIN52、PADI4、SCYL2、TRA2A、ANXA8L1、CNOT7、GAGE2C、LIN54、PADI6、SCYL3、TRA2B、ANXA8L2、CNOT8、GAGE2D、LIN7A、PAEP、SDAD1、TRABD、ANXA9、CNP、GAGE2E、LIN7B、PAF1、SDC1、TRABD2A、AOAH、CNPPD1、GAGE4、LIN7C、PAFAH1B1、SDC2、TRABD2B、AOC1、CNPY1、GAGE5、LIN9、PAFAH1B2、SDC3、TRADD、AOC2、CNPY2、GAGE6、LINC00452、PAFAH1B3、SDC4、TRAF1、AOC3、CNPY3、GAGE7、LINC00984、PAFAH2、SDCBP、TRAF2、AOX1、CNPY4、GAGE8、LINGO1、PAG1、SDCBP2、TRAF3、AP1AR、CNR1、GAK、LINGO2、PAGE1、SDCCAG3、TRAF3IP1、AP1B1、CNR2、GAL、LINGO3、PAGE2、SDCCAG8、TRAF3IP2、AP1G1、CNRIP1、GAL3ST1、LINGO4、PAGE2B、SDE2、TRAF3IP3、AP1G2、CNST、GAL3ST2、LINS、PAGE4、SDF2、TRAF4、AP1M1、CNTD1、GAL3ST3、LIPA、PAGE5、SDF2L1、TRAF5、AP1M2、CNTD2、GAL3ST4、LIPC、PAGR1、SDF4、TRAF6、AP1S1、CNTF、GALC、LIPE、PAH、SDHA、TRAF7、AP1S2、CNTFR、GALE、LIPF、PAICS、SDHAF1、TRAFD1、AP1S3、CNTLN、GALK1、LIPG、PAIP1、SDHAF2、TRAIP、AP2A1、CNTN1、GALK2、LIPH、PAIP2、SDHB、TRAK1、AP2A2、CNTN2、GALM、LIPI、PAIP2B、SDHC、TRAK2、AP2B1、CNTN3、GALNS、LIPJ、PAK1、SDHD、TRAM1、AP2M1、CNTN4、GALNT1、LIPK、PAK1IP1、SDK1、TRAM1L1、AP2S1、CNTN5、GALNT10、LIPM、PAK2、SDK2、TRAM2、AP3B1、CNTN6、GALNT11、LIPN、PAK3、SDPR、TRANK1、AP3B2、CNTNAP1、GALNT12、LIPT1、PAK4、SDR16C5、TRAP1、
AP3D1、CNTNAP2、GALNT13、LIPT2、PAK6、SDR39U1、TRAPPC1、AP3M1、CNTNAP3、GALNT14、LITAF、PAK7、SDR42E1、TRAPPC10、AP3M2、CNTNAP3B、GALNT15、LIX1、PALB2、SDR9C7、TRAPPC11、AP3S1、CNTNAP4、GALNT16、LIX1L、PALD1、SDS、TRAPPC12、AP3S2、CNTNAP5、GALNT18、LLGL1、PALLD、SDSL、TRAPPC13、AP4B1、CNTRL、GALNT2、LLGL2、PALM、SEBOX、TRAPPC2、AP4E1、CNTROB、GALNT3、LLPH、PALM2、SEC11A、TRAPPC2L、AP4M1、COA1、GALNT4、LMAN1、PALM2-AKAP2、SEC11C、TRAPPC3、AP4S1、COA3、GALNT5、LMAN1L、PALM3、SEC13、TRAPPC3L、AP5B1、COA4、GALNT6、LMAN2、PALMD、SEC14L1、TRAPPC4、AP5M1、COA5、GALNT7、LMAN2L、PAM、SEC14L2、TRAPPC5、AP5S1、COA6、GALNT8、LMBR1、PAM16、SEC14L3、TRAPPC6A、AP5Z1、COASY、GALNT9、LMBR1L、PAMR1、SEC14L4、TRAPPC6B、APAF1、COBL、GALNTL5、LMBRD1、PAN2、SEC14L5、TRAPPC8、APBA1、COBLL1、GALNTL6、LMBRD2、PAN3、SEC14L6、TRAPPC9、APBA2、COCH、GALP、LMCD1、PANK1、SEC16A、TRAT1、APBA3、COG1、GALR1、LMF1、PANK2、SEC16B、TRDMT1、APBB1、COG2、GALR2、LMF2、PANK3、SEC22A、TRDN、APBB1IP、COG3、GALR3、LMLN、PANK4、SEC22B、TREH、APBB2、COG4、GALT、LMNA、PANX1、SEC22C、TREM1、APBB3、COG5、GAMT、LMNB1、PANX2、SEC23A、TREM2、APC、COG6、GAN、LMNB2、PANX3、SEC23B、TREML1、APC2、COG7、GANAB、LMO1、PAOX、SEC23IP、TREML2、APCDD1、COG8、GANC、LMO2、PAPD4、SEC24A、TREML4、APCDD1L、COIL、GAP43、LMO3、PAPD5、SEC24B、TRERF1、APCS、COL10A1、GAPDH、LMO4、PAPD7、SEC24C、TREX1、APEH、COL11A1、GAPDHS、LMO7、PAPL、SEC24D、TREX2、APEX1、COL11A2、GAPT、LMOD1、PAPLN、SEC31A、TRH、APEX2、COL12A1、GAPVD1、LMOD2、PAPOLA、SEC31B、TRHDE、APH1A、COL13A1、GAR1、LMOD3、PAPOLB、SEC61A1、TRHR、APH1B、COL14A1、GAREM、LMTK2、PAPOLG、SEC61A2、TRIAP1、API5、COL15A1、GAREML、LMTK3、PAPPA、SEC61B、TRIB1、APIP、COL16A1、GARNL3、LMX1A、PAPPA2、SEC61G、TRIB2、APITD1、COL17A1、GARS、LMX1B、PAPSS1、SEC62、TRIB3、APITD1-CORT、COL18A1、GART、LNP1、PAPSS2、SEC63、TRIL、APLF、COL19A1、GAS1、LNPEP、PAQR3、SECISBP2、TRIM10、APLN、COL1A1、GAS2、LNX1、PAQR4、SECISBP2L、TRIM11、APLNR、COL1A2、GAS2L1、LNX2、PAQR5、SECTM1、TRIM13、APLP1、COL20A1、GAS2L2、LOH12CR1、PAQR6、SEH1L、TRIM14、APLP2、COL21A1、GAS2L3、LONP1、PAQR7、SEL1L、TRIM15、APMAP、COL22A1、GAS6、LONP2、PAQR8、SEL1L2、TRIM16、APOA1、COL23A1、GAS7、LONRF1、PAQR9、SEL1L3、TRIM16L、APOA1BP、COL24A1、GAS8、LONRF2、PARD3、SELE、TRIM17、APOA2、COL25A1、GAST、LONRF3、PARD3B、SELENBP1、TRIM2、APOA4、COL26A1、GATA1、LOR、PARD6A、SELK、TRIM21、APOA5、COL27A1、GATA2、LOX、PARD6B、SELL、TRIM22、APOB、COL28A1、GATA3、LOXHD1、PARD6G、SELM、TRIM23、APOBEC1、COL2A1、GATA4、LOXL1、PARG、SELO、TRIM24、APOBEC2、COL3A1、GATA5、LOXL2、PARK2、SELP、TRIM25、APOBEC3A、COL4A1、GATA6、LOXL3、PARK7、SELPLG、TRIM26、APOBEC3B、COL4A2、GATAD1、LOXL4、PARL、SELRC1、TRIM27、APOBEC3C、COL4A3、GATAD2A、LPA、PARM1、SELT、TRIM28、APOBEC3D、COL4A3BP、GATAD2B、LPAR1、PARN、SELV、TRIM29、APOBEC3F、COL4A4、GATC、LPAR2、PARP1、SEMA3A、TRIM3、APOBEC3G、COL4A5、GATM、LPAR3、PARP10、SEMA3B、TRIM31、APOBEC3H、COL4A6、GATS、LPAR4、
PARP11、SEMA3C、TRIM32、APOBEC4、COL5A1、GATSL1、LPAR5、PARP12、SEMA3D、TRIM33、APOBR、COL5A2、GATSL2、LPAR6、PARP14、SEMA3E、TRIM34、APOC1、COL5A3、GATSL3、LPCAT1、PARP15、SEMA3F、TRIM35、APOC2、COL6A1、GBA、LPCAT2、PARP16、SEMA3G、TRIM36、APOC3、COL6A2、GBA2、LPCAT3、PARP2、SEMA4A、TRIM37、APOC4、COL6A3、GBA3、LPCAT4、PARP3、SEMA4B、TRIM38、APOD、COL6A5、GBAS、LPGAT1、PARP4、SEMA4C、TRIM39、APOE、COL6A6、GBE1、LPHN1、PARP6、SEMA4D、TRIM39-RPP21、APOF、COL7A1、GBF1、LPHN2、PARP8、SEMA4F、TRIM4、APOH、COL8A1、GBGT1、LPHN3、PARP9、SEMA4G、TRIM40、APOL1、COL8A2、GBP1、LPIN1、PARPBP、SEMA5A、TRIM41、APOL2、COL9A1、GBP2、LPIN2、PARS2、SEMA5B、TRIM42、APOL3、COL9A2、GBP3、LPIN3、PARVA、SEMA6A、TRIM43、APOL4、COL9A3、GBP4、LPL、PARVB、SEMA6B、TRIM43B、APOL5、COLCA2、GBP5、LPO、PARVG、SEMA6C、TRIM44、APOL6、COLEC10、GBP6、LPP、PASD1、SEMA6D、TRIM45、APOLD1、COLEC11、GBP7、LPPR1、PASK、SEMA7A、TRIM46、APOM、COLEC12、GBX1、LPPR2、PATE1、SEMG1、TRIM47、APOO、COLGALT1、GBX2、LPPR3、PATE2、SEMG2、TRIM48、APOOL、COLGALT2、GC、LPPR4、PATE3、SENP1、TRIM49、APOPT1、COLQ、GCA、LPPR5、PATE4、SENP2、TRIM49B、APP、COMMD1、GCAT、LPXN、PATL1、SENP3、TRIM49C、APPBP2、COMMD10、GCC1、LRAT、PATL2、SENP5、TRIM49D1、APPL1、COMMD2、GCC2、LRBA、PATZ1、SENP6、TRIM49D2P、APPL2、COMMD3、GCDH、LRCH1、PAWR、SENP7、TRIM5、APRT、COMMD3-BMI1、GCFC2、LRCH2、PAX1、SENP8、TRIM50、APTX、COMMD4、GCG、LRCH3、PAX2、SEPHS1、TRIM51、AQP1、COMMD5、GCGR、LRCH4、PAX3、SEPHS2、TRIM52、AQP10、COMMD6、GCH1、LRCOL1、PAX4、SEPN1、TRIM54、AQP11、COMMD7、GCHFR、LRFN1、PAX5、SEPP1、TRIM55、AQP12A、COMMD8、GCK、LRFN2、PAX6、SEPSECS、TRIM56、AQP12B、COMMD9、GCKR、LRFN3、PAX7、SEPT1、TRIM58、AQP2、COMP、GCLC、LRFN4、PAX8、SEPT10、TRIM59、AQP3、COMT、GCLM、LRFN5、PAX9、SEPT11、TRIM6、AQP4、COMTD1、GCM1、LRG1、PAXBP1、SEPT12、TRIM60、AQP5、COPA、GCM2、LRGUK、PAXIP1、SEPT14、TRIM61、AQP6、COPB1、GCN1L1、LRIF1、PBDC1、SEPT15、TRIM62、AQP7、COPB2、GCNT1、LRIG1、PBK、SEPT2、TRIM63、AQP8、COPE、GCNT2、LRIG2、PBLD、SEPT3、TRIM64、AQP9、COPG1、GCNT3、LRIG3、PBOV1、SEPT4、TRIM64B、AQPEP、COPG2、GCNT4、LRIT1、PBRM1、SEPT5、TRIM64C、AQR、COPRS、GCNT7、LRIT2、PBX1、SEPT6、TRIM65、AR、COPS2、GCOM1、LRIT3、PBX2、SEPT7、TRIM66、ARAF、COPS3、GCSAM、LRMP、PBX3、SEPT8、TRIM67、ARAP1、COPS4、GCSAML、LRP1、PBX4、SEPT9、TRIM68、ARAP2、COPS5、GCSH、LRP10、PBXIP1、SEPW1、TRIM69、ARAP3、COPS6、GDA、LRP11、PC、SERAC1、TRIM6-TRIM34、ARC、COPS7A、GDAP1、LRP12、PCBD1、SERBP1、TRIM7、ARCN1、COPS7B、GDAP1L1、LRP1B、PCBD2、SERF1A、TRIM71、AREG、COPS8、GDAP2、LRP2、PCBP1、SERF1B、TRIM72、AREGB、COPZ1、GDE1、LRP2BP、PCBP2、SERF2、TRIM73、AREL1、COPZ2、GDF1、LRP3、PCBP3、SERGEF、TRIM74、ARF1、COQ10A、GDF10、LRP4、PCBP4、SERHL2、TRIM77、ARF3、COQ10B、GDF11、LRP5、PCCA、SERINC1、TRIM8、ARF4、COQ2、GDF15、LRP5L、PCCB、SERINC2、TRIM9、ARF5、COQ3、GDF2、LRP6、PCDH1、SERINC3、TRIML1、ARF6、COQ4、GDF3、LRP8、PCDH10、SERINC4、TRIML2、ARFGAP1、COQ5、GDF5、LRPAP1、PCDH11X、SERINC5、TRIO、ARFGAP2、COQ6、GDF6、LRPPRC、
PCDH11Y、SERP1、TRIOBP、ARFGAP3、COQ7、GDF7、LRR1、PCDH12、SERP2、TRIP10、ARFGEF1、COQ9、GDF9、LRRC1、PCDH15、SERPINA1、TRIP11、ARFGEF2、CORIN、GDI1、LRRC10、PCDH17、SERPINA10、TRIP12、ARFIP1、CORO1A、GDI2、LRRC10B、PCDH18、SERPINA11、TRIP13、ARFIP2、CORO1B、GDNF、LRRC14、PCDH19、SERPINA12、TRIP4、ARFRP1、CORO1C、GDNF-AS1、LRRC14B、PCDH20、SERPINA3、TRIP6、ARG1、CORO2A、GDPD1、LRRC15、PCDH7、SERPINA4、TRIQK、ARG2、CORO2B、GDPD2、LRRC16A、PCDH8、SERPINA5、TRIT1、ARGFX、CORO6、GDPD3、LRRC16B、PCDH9、SERPINA6、TRMT1、ARGLU1、CORO7、GDPD4、LRRC17、PCDHA1、SERPINA7、TRMT10A、ARHGAP1、CORO7-PAM16、GDPD5、LRRC18、PCDHA10、SERPINA9、TRMT10B、ARHGAP10、CORT、GDPGP1、LRRC19、PCDHA11、SERPINB1、TRMT10C、ARHGAP11A、COTL1、GEM、LRRC2、PCDHA12、SERPINB10、TRMT11、ARHGAP11B、COX10、GEMIN2、LRRC20、PCDHA13、SERPINB11、TRMT112、ARHGAP12、COX11、GEMIN4、LRRC23、PCDHA2、SERPINB12、TRMT12、ARHGAP15、COX14、GEMIN5、LRRC24、PCDHA3、SERPINB13、TRMT13、ARHGAP17、COX15、GEMIN6、LRRC25、PCDHA4、SERPINB2、TRMT1L、ARHGAP18、COX16、GEMIN7、LRRC26、PCDHA5、SERPINB3、TRMT2A、ARHGAP19、COX17、GEMIN8、LRRC27、PCDHA6、SERPINB4、TRMT2B、ARHGAP20、COX18、GEN1、LRRC28、PCDHA7、SERPINB5、TRMT44、ARHGAP21、COX19、GET4、LRRC29、PCDHA8、SERPINB6、TRMT5、ARHGAP22、COX20、GFAP、LRRC3、PCDHA9、SERPINB7、TRMT6、ARHGAP23、COX4I1、GFER、LRRC30、PCDHAC1、SERPINB8、TRMT61A、ARHGAP24、COX4I2、GFI1、LRRC31、PCDHAC2、SERPINB9、TRMT61B、ARHGAP25、COX5A、GFI1B、LRRC32、PCDHB1、SERPINC1、TRMU、ARHGAP26、COX5B、GFM1、LRRC34、PCDHB10、SERPIND1、TRNAU1AP、ARHGAP27、COX6A1、GFM2、LRRC36、PCDHB11、SERPINE1、TRNP1、ARHGAP28、COX6A2、GFOD1、LRRC37A2、PCDHB12、SERPINE2、TRNT1、ARHGAP29、COX6B1、GFOD2、LRRC37A3、PCDHB13、SERPINE3、TRO、ARHGAP30、COX6B2、GFPT1、LRRC37B、PCDHB14、SERPINF1、TROAP、ARHGAP31、COX6C、GFPT2、LRRC38、PCDHB15、SERPINF2、TROVE2、ARHGAP32、COX7A1、GFRA1、LRRC39、PCDHB16、SERPING1、TRPA1、ARHGAP33、COX7A2、GFRA2、LRRC3B、PCDHB2、SERPINH1、TRPC1、ARHGAP35、COX7A2L、GFRA3、LRRC3C、PCDHB3、SERPINI1、TRPC3、ARHGAP36、COX7B、GFRA4、LRRC4、PCDHB4、SERPINI2、TRPC4、ARHGAP39、COX7B2、GFRAL、LRRC40、PCDHB5、SERTAD1、TRPC4AP、ARHGAP4、COX7C、GGA1、LRRC41、PCDHB6、SERTAD2、TRPC5、ARHGAP40、COX8A、GGA2、LRRC42、PCDHB7、SERTAD3、TRPC5OS、ARHGAP42、COX8C、GGA3、LRRC43、PCDHB8、SERTAD4、TRPC6、ARHGAP44、CP、GGACT、LRRC45、PCDHB9、SERTM1、TRPC7、ARHGAP5、CPA1、GGCT、LRRC46、PCDHGA1、SESN1、TRPM1、ARHGAP6、CPA2、GGCX、LRRC47、PCDHGA10、SESN2、TRPM2、ARHGAP8、CPA3、GGH、LRRC48、PCDHGA11、SESN3、TRPM3、ARHGAP9、CPA4、GGN、LRRC49、PCDHGA12、SESTD1、TRPM4、ARHGDIA、CPA5、GGNBP2、LRRC4B、PCDHGA2、SET、TRPM5、ARHGDIB、CPA6、GGPS1、LRRC4C、PCDHGA3、SETBP1、TRPM6、ARHGDIG、CPAMD8、GGT1、LRRC52、PCDHGA4、SETD1A、TRPM7、ARHGEF1、CPB1、GGT2、LRRC55、PCDHGA5、SETD1B、TRPM8、ARHGEF10、CPB2、GGT5、LRRC56、PCDHGA6、SETD2、TRPS1、ARHGEF10L、CPD、GGT6、LRRC57、PCDHGA7、SETD3、TRPT1、ARHGEF11、CPE、GGT7、LRRC58、PCDHGA8、SETD4、
TRPV1、ARHGEF12、CPEB1、GGTLC1、LRRC59、PCDHGA9、SETD5、TRPV2、ARHGEF15、CPEB2、GGTLC2、LRRC6、PCDHGB1、SETD6、TRPV3、ARHGEF16、CPEB3、GH1、LRRC61、PCDHGB2、SETD7、TRPV4、ARHGEF17、CPEB4、GH2、LRRC63、PCDHGB3、SETD8、TRPV5、ARHGEF18、CPED1、GHDC、LRRC66、PCDHGB4、SETD9、TRPV6、ARHGEF19、CPLX1、GHITM、LRRC69、PCDHGB5、SETDB1、TRRAP、ARHGEF2、CPLX2、GHR、LRRC7、PCDHGB6、SETDB2、TRUB1、ARHGEF25、CPLX3、GHRH、LRRC70、PCDHGB7、SETMAR、TRUB2、ARHGEF26、CPLX4、GHRHR、LRRC71、PCDHGC3、SETSIP、TSACC、ARHGEF28、CPM、GHRL、LRRC72、PCDHGC4、SETX、TSC1、ARHGEF3、CPN1、GHSR、LRRC73、PCDHGC5、SEZ6、TSC2、ARHGEF33、CPN2、GID4、LRRC8A、PCDP1、SEZ6L、TSC22D1、ARHGEF35、CPNE1、GID8、LRRC8B、PCED1A、SEZ6L2、TSC22D2、ARHGEF37、CPNE2、GIF、LRRC8C、PCED1B、SF1、TSC22D3、ARHGEF38、CPNE3、GIGYF1、LRRC8D、PCF11、SF3A1、TSC22D4、ARHGEF39、CPNE4、GIGYF2、LRRC8E、PCGF1、SF3A2、TSEN15、ARHGEF4、CPNE5、GIMAP1、LRRCC1、PCGF2、SF3A3、TSEN2、ARHGEF40、CPNE6、GIMAP1-GIMAP5、LRRD1、PCGF3、SF3B1、TSEN34、ARHGEF5、CPNE7、GIMAP2、LRRFIP1、PCGF5、SF3B14、TSEN54、ARHGEF6、CPNE8、GIMAP4、LRRFIP2、PCGF6、SF3B2、TSFM、ARHGEF7、CPNE9、GIMAP5、LRRIQ1、PCID2、SF3B3、TSG101、ARHGEF9、CPO、GIMAP6、LRRIQ3、PCIF1、SF3B4、TSGA10、ARID1A、CPOX、GIMAP7、LRRIQ4、PCK1、SF3B5、TSGA10IP、ARID1B、CPPED1、GIMAP8、LRRK1、PCK2、SFI1、TSGA13、ARID2、CPQ、GIMD1、LRRK2、PCLO、SFMBT1、TSHB、ARID3A、CPS1、GIN1、LRRN1、PCM1、SFMBT2、TSHR、ARID3B、CPSF1、GINM1、LRRN2、PCMT1、SFN、TSHZ1、ARID3C、CPSF2、GINS1、LRRN3、PCMTD1、SFPQ、TSHZ2、ARID4A、CPSF3、GINS2、LRRN4、PCMTD2、SFR1、TSHZ3、ARID4B、CPSF3L、GINS3、LRRN4CL、PCNA、SFRP1、TSKS、ARID5A、CPSF4、GINS4、LRRTM1、PCNP、SFRP2、TSKU、ARID5B、CPSF4L、GIP、LRRTM2、PCNT、SFRP4、TSLP、ARIH1、CPSF6、GIPC1、LRRTM3、PCNX、SFRP5、TSN、ARIH2、CPSF7、GIPC2、LRRTM4、PCNXL2、SFSWAP、TSNARE1、ARIH2OS、CPT1A、GIPC3、LRSAM1、PCNXL3、SFT2D1、TSNAX、ARL1、CPT1B、GIPR、LRTM1、PCNXL4、SFT2D2、TSNAXIP1、ARL10、CPT1C、GIT1、LRTM2、PCOLCE、SFT2D3、TSPAN1、ARL11、CPT2、GIT2、LRTOMT、PCOLCE2、SFTA2、TSPAN10、ARL13A、CPVL、GJA1、LRWD1、PCP2、SFTA3、TSPAN11、ARL13B、CPXCR1、GJA10、LSAMP、PCP4、SFTPA1、TSPAN12、ARL14、CPXM1、GJA3、LSG1、PCP4L1、SFTPA2、TSPAN13、ARL14EP、CPXM2、GJA4、LSM1、PCSK1、SFTPB、TSPAN14、ARL14EPL、CPZ、GJA5、LSM10、PCSK1N、SFTPC、TSPAN15、ARL15、CR1、GJA8、LSM11、PCSK2、SFTPD、TSPAN16、ARL16、CR1L、GJA9、LSM12、PCSK4、SFXN1、TSPAN17、ARL17A、CR2、GJB1、LSM14A、PCSK5、SFXN2、TSPAN18、ARL17B、CRABP1、GJB2、LSM14B、PCSK6、SFXN3、TSPAN19、ARL2、CRABP2、GJB3、LSM2、PCSK7、SFXN4、TSPAN2、ARL2BP、CRADD、GJB4、LSM3、PCSK9、SFXN5、TSPAN3、ARL3、CRAMP1L、GJB5、LSM4、PCTP、SGCA、TSPAN31、ARL4A、CRAT、GJB6、LSM5、PCYOX1、SGCB、TSPAN32、ARL4C、CRB1、GJB7、LSM6、PCYOX1L、SGCD、TSPAN33、ARL4D、CRB2、GJC1、LSM7、PCYT1A、SGCE、TSPAN4、ARL5A、CRB3、GJC2、LSMD1、PCYT1B、SGCG、TSPAN5、ARL5B、CRBN、GJC3、LSMEM1、PCYT2、SGCZ、TSPAN6、ARL5C、CRCP、GJD2、LSMEM2、PDAP1、SGIP1、TSPAN7、ARL6、CRCT1、GJD3、LSP1、PDC、SGK1、TSPAN8、ARL6IP1、
CREB1、GJD4、LSR、PDCD1、SGK2、TSPAN9、ARL6IP4、CREB3、GK、LSS、PDCD10、SGK223、TSPEAR、ARL6IP5、CREB3L1、GK2、LST1、PDCD11、SGK3、TSPO、ARL6IP6、CREB3L2、GK5、LTA、PDCD1LG2、SGK494、TSPO2、ARL8A、CREB3L3、GKAP1、LTA4H、PDCD2、SGMS1、TSPY1、ARL8B、CREB3L4、GKN1、LTB、PDCD2L、SGMS2、TSPY10、ARL9、CREB5、GKN2、LTB4R、PDCD4、SGOL1、TSPY2、ARMC1、CREBBP、GLA、LTB4R2、PDCD5、SGOL2、TSPY3、ARMC10、CREBL2、GLB1、LTBP1、PDCD6、SGPL1、TSPY4、ARMC12、CREBRF、GLB1L、LTBP2、PDCD6IP、SGPP1、TSPY8、ARMC2、CREBZF、GLB1L2、LTBP3、PDCD7、SGPP2、TSPYL1、ARMC3、CREG1、GLB1L3、LTBP4、PDCL、SGSH、TSPYL2、ARMC4、CREG2、GLCCI1、LTBR、PDCL2、SGSM1、TSPYL4、ARMC5、CRELD1、GLCE、LTC4S、PDCL3、SGSM2、TSPYL5、ARMC6、CRELD2、GLDC、LTF、PDDC1、SGSM3、TSPYL6、ARMC7、CREM、GLDN、LTK、PDE10A、SGTA、TSR1、ARMC8、CRH、GLE1、LTN1、PDE11A、SGTB、TSR2、ARMC9、CRHBP、GLG1、LTV1、PDE12、SH2B1、TSR3、ARMCX1、CRHR1、GLI1、LUC7L、PDE1A、SH2B2、TSSC1、ARMCX2、CRHR2、GLI2、LUC7L2、PDE1B、SH2B3、TSSC4、ARMCX3、CRIM1、GLI3、LUC7L3、PDE1C、SH2D1A、TSSK1B、ARMCX4、CRIP1、GLI4、LUM、PDE2A、SH2D1B、TSSK2、ARMCX5、CRIP2、GLIPR1、LURAP1、PDE3A、SH2D2A、TSSK3、ARMCX5-GPRASP2、CRIP3、GLIPR1L1、LURAP1L、PDE3B、SH2D3A、TSSK4、ARMCX6、CRIPAK、GLIPR1L2、LUZP1、PDE4A、SH2D3C、TSSK6、ARMS2、CRIPT、GLIPR2、LUZP2、PDE4B、SH2D4A、TST、ARNT、CRISP1、GLIS1、LUZP4、PDE4C、SH2D4B、TSTA3、ARNT2、CRISP2、GLIS2、LUZP6、PDE4D、SH2D5、TSTD1、ARNTL、CRISP3、GLIS3、LXN、PDE4DIP、SH2D6、TSTD2、ARNTL2、CRISPLD1、GLMN、LY6D、PDE5A、SH2D7、TSTD3、ARPC1A、CRISPLD2、GLO1、LY6E、PDE6A、SH3BGR、TTBK1、ARPC1B、CRK、GLOD4、LY6G5B、PDE6B、SH3BGRL、TTBK2、ARPC2、CRKL、GLOD5、LY6G5C、PDE6C、SH3BGRL2、TTC1、ARPC3、CRLF1、GLP1R、LY6G6C、PDE6D、SH3BGRL3、TTC12、ARPC4、CRLF2、GLP2R、LY6G6D、PDE6G、SH3BP1、TTC13、ARPC4-TTLL3、CRLF3、GLRA1、LY6G6F、PDE6H、SH3BP2、TTC14、ARPC5、CRLS1、GLRA2、LY6H、PDE7A、SH3BP4、TTC16、ARPC5L、CRMP1、GLRA3、LY6K、PDE7B、SH3BP5、TTC17、ARPP19、CRNKL1、GLRA4、LY75、PDE8A、SH3BP5L、TTC18、ARPP21、CRNN、GLRB、LY75-CD302、PDE8B、SH3D19、TTC19、ARR3、CROCC、GLRX、LY86、PDE9A、SH3D21、TTC21A、ARRB1、CROT、GLRX2、LY9、PDF、SH3GL1、TTC21B、ARRB2、CRP、GLRX3、LY96、PDGFA、SH3GL2、TTC22、ARRDC1、CRTAC1、GLRX5、LYAR、PDGFB、SH3GL3、TTC23、ARRDC2、CRTAM、GLS、LYG1、PDGFC、SH3GLB1、TTC23L、ARRDC3、CRTAP、GLS2、LYG2、PDGFD、SH3GLB2、TTC24、ARRDC4、CRTC1、GLT1D1、LYL1、PDGFRA、SH3KBP1、TTC25、ARRDC5、CRTC2、GLT6D1、LYN、PDGFRB、SH3PXD2A、TTC26、ARSA、CRTC3、GLT8D1、LYNX1、PDGFRL、SH3PXD2B、TTC27、ARSB、CRX、GLT8D2、LYPD1、PDHA1、SH3RF1、TTC28、ARSD、CRY1、GLTP、LYPD2、PDHA2、SH3RF2、TTC29、ARSE、CRY2、GLTPD1、LYPD3、PDHB、SH3RF3、TTC3、ARSF、CRYAA、GLTPD2、LYPD4、PDHX、SH3TC1、TTC30A、ARSG、CRYAB、GLTSCR1、LYPD5、PDIA2、SH3TC2、TTC30B、ARSH、CRYBA1、GLTSCR1L、LYPD6、PDIA3、SH3YL1、TTC31、ARSI、CRYBA2、GLTSCR2、LYPD6B、PDIA4、SHANK1、TTC32、ARSJ、CRYBA4、GLUD1、LYPD8、PDIA5、SHANK2、TTC33、ARSK、CRYBB1、GLUD2、LYPLA1、PDIA6、SHANK3、TTC34、ART1、CRYBB2、GLUL、LYPLA2、PDIK1L、SHARPIN、
TTC36、ART3、CRYBB3、GLYAT、LYPLAL1、PDILT、SHB、TTC37、ART4、CRYBG3、GLYATL1、LYRM1、PDK1、SHBG、TTC38、ART5、CRYGA、GLYATL2、LYRM2、PDK2、SHC1、TTC39A、ARTN、CRYGB、GLYATL3、LYRM4、PDK3、SHC2、TTC39B、ARV1、CRYGC、GLYCTK、LYRM5、PDK4、SHC3、TTC39C、ARVCF、CRYGD、GLYR1、LYRM7、PDLIM1、SHC4、TTC4、ARX、CRYGN、GM2A、LYRM9、PDLIM2、SHCBP1、TTC40、AS3MT、CRYGS、GMCL1、LYSMD1、PDLIM3、SHCBP1L、TTC5、ASAH1、CRYL1、GMDS、LYSMD2、PDLIM4、SHD、TTC6、ASAH2、CRYM、GMEB1、LYSMD3、PDLIM5、SHE、TTC7A、ASAH2B、CRYZ、GMEB2、LYSMD4、PDLIM7、SHF、TTC7B、ASAP1、CRYZL1、GMFB、LYST、PDP1、SHFM1、TTC8、ASAP2、CS、GMFG、LYVE1、PDP2、SHH、TTC9、ASAP3、CSAD、GMIP、LYZ、PDPK1、SHISA2、TTC9B、ASB1、CSAG1、GML、LYZL1、PDPN、SHISA3、TTC9C、ASB10、CSAG2、GMNC、LYZL2、PDPR、SHISA4、TTF1、ASB11、CSAG3、GMNN、LYZL4、PDRG1、SHISA5、TTF2、ASB12、CSDC2、GMPPA、LYZL6、PDS5A、SHISA6、TTI1、ASB13、CSDE1、GMPPB、LZIC、PDS5B、SHISA7、TTI2、ASB14、CSE1L、GMPR、LZTFL1、PDSS1、SHISA8、TTK、ASB15、CSF1、GMPR2、LZTR1、PDSS2、SHISA9、TTL、ASB16、CSF1R、GMPS、LZTS1、PDX1、SHKBP1、TTLL1、ASB17、CSF2、GNA11、LZTS2、PDXDC1、SHMT1、TTLL10、ASB18、CSF2RA、GNA12、LZTS3、PDXK、SHMT2、TTLL11、ASB2、CSF2RB、GNA13、M1AP、PDXP、SHOC2、TTLL12、ASB3、CSF3、GNA14、M6PR、PDYN、SHOX、TTLL13、ASB4、CSF3R、GNA15、MAATS1、PDZD11、SHOX2、TTLL2、ASB5、CSGALNACT1、GNAI1、MAB21L1、PDZD2、SHPK、TTLL3、ASB6、CSGALNACT2、GNAI2、MAB21L2、PDZD3、SHPRH、TTLL4、ASB7、CSH1、GNAI3、MAB21L3、PDZD4、SHQ1、TTLL5、ASB8、CSH2、GNAL、MACC1、PDZD7、SHROOM1、TTLL6、ASB9、CSHL1、GNAO1、MACF1、PDZD8、SHROOM2、TTLL7、ASCC1、CSK、GNAQ、MACROD1、PDZD9、SHROOM3、TTLL8、ASCC2、CSMD1、GNAS、MACROD2、PDZK1、SHROOM4、TTLL9、ASCC3、CSMD2、GNAT1、MAD1L1、PDZK1IP1、SI、TTN、ASCL1、CSMD3、GNAT2、MAD2L1、PDZRN3、SIAE、TTPA、ASCL2、CSN1S1、GNAT3、MAD2L1BP、PDZRN4、SIAH1、TTPAL、ASCL3、CSN2、GNAZ、MAD2L2、PEA15、SIAH2、TTR、ASCL4、CSN3、GNB1、MADCAM1、PEAK1、SIAH3、TTYH1、ASCL5、CSNK1A1、GNB1L、MADD、PEAR1、SIDT1、TTYH2、ASF1A、CSNK1A1L、GNB2、MAEA、PEBP1、SIDT2、TTYH3、ASF1B、CSNK1D、GNB2L1、MAEL、PEBP4、SIGIRR、TUB、ASGR1、CSNK1E、GNB3、MAF、PECAM1、SIGLEC1、TUBA1A、ASGR2、CSNK1G1、GNB4、MAF1、PECR、SIGLEC10、TUBA1B、ASH1L、CSNK1G2、GNB5、MAFA、PEF1、SIGLEC11、TUBA1C、ASH2L、CSNK1G3、GNE、MAFB、PEG10、SIGLEC12、TUBA3C、ASIC1、CSNK2A1、GNG10、MAFF、PEG3、SIGLEC14、TUBA3D、ASIC2、CSNK2A2、GNG11、MAFG、PELI1、SIGLEC15、TUBA3E、ASIC3、CSNK2A3、GNG12、MAFK、PELI2、SIGLEC5、TUBA4A、ASIC4、CSNK2B、GNG13、MAG、PELI3、SIGLEC6、TUBA8、ASIC5、CSPG4、GNG2、MAGEA1、PELO、SIGLEC7、TUBAL3、ASIP、CSPG5、GNG3、MAGEA10、PELP1、SIGLEC8、TUBB、ASL、CSPP1、GNG4、MAGEA10-MAGEA5、PEMT、SIGLEC9、TUBB1、ASMT、CSRNP1、GNG5、MAGEA11、PENK、SIGLECL1、TUBB2A、ASMTL、CSRNP2、GNG7、MAGEA12、PEPD、SIGMAR1、TUBB2B、ASNA1、CSRNP3、GNG8、MAGEA3、PER1、SIK1、TUBB3、ASNS、CSRP1、GNGT1、MAGEA4、PER2、SIK2、TUBB4A、ASNSD1、CSRP2、GNGT2、MAGEA5、PER3、SIK3、TUBB4B、ASPA、CSRP2BP、GNL1、MAGEA6、PERM1、SIKE1、TUBB6、ASPDH、CSRP3、GNL2、MAGEA8、PERP、SIL1、TUBB8、
ASPG、CST1、GNL3、MAGEB1、PES1、SIM1、TUBD1、ASPH、CST11、GNL3L、MAGEB10、PET100、SIM2、TUBE1、ASPHD1、CST2、GNLY、MAGEB16、PET112、SIMC1、TUBG1、ASPHD2、CST3、GNMT、MAGEB17、PET117、SIN3A、TUBG2、ASPM、CST4、GNPAT、MAGEB18、PEX1、SIN3B、TUBGCP2、ASPN、CST5、GNPDA1、MAGEB2、PEX10、SIPA1、TUBGCP3、ASPRV1、CST6、GNPDA2、MAGEB3、PEX11A、SIPA1L1、TUBGCP4、ASPSCR1、CST7、GNPNAT1、MAGEB4、PEX11B、SIPA1L2、TUBGCP5、ASRGL1、CST8、GNPTAB、MAGEB5、PEX11G、SIPA1L3、TUBGCP6、ASS1、CST9、GNPTG、MAGEB6、PEX12、SIRPA、TUFM、ASTE1、CST9L、GNRH1、MAGEC1、PEX13、SIRPB1、TUFT1、ASTL、CSTA、GNRH2、MAGEC2、PEX14、SIRPB2、TULP1、ASTN1、CSTB、GNRHR、MAGEC3、PEX16、SIRPD、TULP2、ASTN2、CSTF1、GNS、MAGED1、PEX19、SIRPG、TULP3、ASUN、CSTF2、GOLGA1、MAGED2、PEX2、SIRT1、TULP4、ASXL1、CSTF2T、GOLGA2、MAGEE1、PEX26、SIRT2、TUSC1、ASXL2、CSTF3、GOLGA3、MAGEE2、PEX3、SIRT3、TUSC2、ASXL3、CSTL1、GOLGA4、MAGEF1、PEX5、SIRT4、TUSC3、ASZ1、CT45A1、GOLGA5、MAGEH1、PEX5L、SIRT5、TUSC5、ATAD1、CT45A2、GOLGA6A、MAGEL2、PEX6、SIRT6、TUT1、ATAD2、CT45A3、GOLGA6B、MAGI1、PEX7、SIRT7、TVP23A、ATAD2B、CT45A4、GOLGA6C、MAGI2、PF4、SIT1、TVP23B、ATAD3A、CT45A5、GOLGA6D、MAGI3、PF4V1、SIVA1、TVP23C、ATAD3B、CT45A6、GOLGA6L1、MAGIX、PFAS、SIX1、TVP23C-CDRT4、ATAD3C、CT47A1、GOLGA6L10、MAGOH、PFDN1、SIX2、TWF1、ATAD5、CT47A10、GOLGA6L2、MAGOHB、PFDN2、SIX3、TWF2、ATAT1、CT47A11、GOLGA6L3、MAGT1、PFDN4、SIX4、TWIST1、ATCAY、CT47A12、GOLGA6L4、MAK、PFDN5、SIX5、TWIST2、ATE1、CT47A2、GOLGA6L6、MAK16、PFDN6、SIX6、TWISTNB、ATF1、CT47A3、GOLGA6L9、MAL、PFKFB1、SKA1、TWSG1、ATF2、CT47A4、GOLGA7、MAL2、PFKFB2、SKA2、TXK、ATF3、CT47A5、GOLGA7B、MALL、PFKFB3、SKA3、TXLNA、ATF4、CT47A6、GOLGA8A、MALSU1、PFKFB4、SKAP1、TXLNB、ATF5、CT47A7、GOLGA8B、MALT1、PFKL、SKAP2、TXLNG、ATF6、CT47A8、GOLGA8H、MAMDC2、PFKM、SKI、TXN、ATF6B、CT47A9、GOLGA8J、MAMDC4、PFKP、SKIDA1、TXN2、ATF7、CT47B1、GOLGA8K、MAML1、PFN1、SKIL、TXNDC11、ATF7IP、CT62、GOLGA8M、MAML2、PFN2、SKIV2L、TXNDC12、ATF7IP2、CTAG1A、GOLGA8O、MAML3、PFN3、SKIV2L2、TXNDC15、ATG10、CTAG1B、GOLGA8R、MAMLD1、PFN4、SKOR1、TXNDC16、ATG12、CTAG2、GOLGB1、MAMSTR、PGA3、SKOR2、TXNDC17、ATG13、CTAGE1、GOLIM4、MAN1A1、PGA4、SKP1、TXNDC2、ATG14、CTAGE15、GOLM1、MAN1A2、PGA5、SKP2、TXNDC5、ATG16L1、CTAGE4、GOLPH3、MAN1B1、PGAM1、SLA、TXNDC8、ATG16L2、CTAGE5、GOLPH3L、MAN1C1、PGAM2、SLA2、TXNDC9、ATG2A、CTAGE6、GOLT1A、MAN2A1、PGAM4、SLAIN1、TXNIP、ATG2B、CTAGE8、GOLT1B、MAN2A2、PGAM5、SLAIN2、TXNL1、ATG3、CTAGE9、GON4L、MAN2B1、PGAP1、SLAMF1、TXNL4A、ATG4A、CTBP1、GOPC、MAN2B2、PGAP2、SLAMF6、TXNL4B、ATG4B、CTBP2、GORAB、MAN2C1、PGAP3、SLAMF7、TXNRD1、ATG4C、CTBS、GORASP1、MANBA、PGBD1、SLAMF8、TXNRD2、ATG4D、CTC1、GORASP2、MANBAL、PGBD2、SLAMF9、TXNRD3、ATG5、CTCF、GOSR1、MANEA、PGBD3、SLBP、TXNRD3NB、ATG7、CTCFL、GOSR2、MANEAL、PGBD4、SLC10A1、TYK2、ATG9A、CTDNEP1、GOT1、MANF、PGBD5、SLC10A2、TYMP、ATG9B、CTDP1、GOT1L1、MANSC1、PGC、SLC10A3、TYMS、ATHL1、CTDSP1、GOT2、MANSC4、PGD、SLC10A4、
TYR、ATIC、CTDSP2、GP1BA、MAOA、PGF、SLC10A5、TYRO3、ATL1、CTDSPL、GP1BB、MAOB、PGGT1B、SLC10A6、TYROBP、ATL2、CTDSPL2、GP2、MAP10、PGK1、SLC10A7、TYRP1、ATL3、CTF1、GP5、MAP1A、PGK2、SLC11A1、TYSND1、ATM、CTGF、GP6、MAP1B、PGLS、SLC11A2、TYW1、ATMIN、CTH、GP9、MAP1LC3A、PGLYRP1、SLC12A1、TYW1B、ATN1、CTHRC1、GPA33、MAP1LC3B、PGLYRP2、SLC12A2、TYW3、ATOH1、CTIF、GPAA1、MAP1LC3B2、PGLYRP3、SLC12A3、TYW5、ATOH7、CTLA4、GPALPP1、MAP1LC3C、PGLYRP4、SLC12A4、U2AF1、ATOH8、CTNNA1、GPAM、MAP1S、PGM1、SLC12A5、U2AF1L4、ATOX1、CTNNA2、GPANK1、MAP2、PGM2、SLC12A6、U2AF2、ATP10A、CTNNA3、GPAT2、MAP2K1、PGM2L1、SLC12A7、U2SURP、ATP10B、CTNNAL1、GPATCH1、MAP2K2、PGM3、SLC12A8、UACA、ATP10D、CTNNB1、GPATCH11、MAP2K3、PGM5、SLC12A9、UAP1、ATP11A、CTNNBIP1、GPATCH2、MAP2K4、PGP、SLC13A1、UAP1L1、ATP11B、CTNNBL1、GPATCH2L、MAP2K5、PGPEP1、SLC13A2、UBA1、ATP11C、CTNND1、GPATCH3、MAP2K6、PGPEP1L、SLC13A3、UBA2、ATP12A、CTNND2、GPATCH4、MAP2K7、PGR、SLC13A4、UBA3、ATP13A1、CTNS、GPATCH8、MAP3K1、PGRMC1、SLC13A5、UBA5、ATP13A2、CTPS1、GPBAR1、MAP3K10、PGRMC2、SLC14A1、UBA52、ATP13A3、CTPS2、GPBP1、MAP3K11、PGS1、SLC14A2、UBA6、ATP13A4、CTR9、GPBP1L1、MAP3K12、PHACTR1、SLC15A1、UBA7、ATP13A5、CTRB1、GPC1、MAP3K13、PHACTR2、SLC15A2、UBAC1、ATP1A1、CTRB2、GPC2、MAP3K14、PHACTR3、SLC15A3、UBAC2、ATP1A2、CTRC、GPC3、MAP3K15、PHACTR4、SLC15A4、UBALD1、ATP1A3、CTRL、GPC4、MAP3K19、PHAX、SLC15A5、UBALD2、ATP1A4、CTSA、GPC5、MAP3K2、PHB、SLC16A1、UBAP1、ATP1B1、CTSB、GPC6、MAP3K3、PHB2、SLC16A10、UBAP1L、ATP1B2、CTSC、GPCPD1、MAP3K4、PHC1、SLC16A11、UBAP2、ATP1B3、CTSD、GPD1、MAP3K5、PHC2、SLC16A12、UBAP2L、ATP1B4、CTSE、GPD1L、MAP3K6、PHC3、SLC16A13、UBASH3A、ATP2A1、CTSF、GPD2、MAP3K7、PHEX、SLC16A14、UBASH3B、ATP2A2、CTSG、GPER1、MAP3K7CL、PHF1、SLC16A2、UBB、ATP2A3、CTSH、GPHA2、MAP3K8、PHF10、SLC16A3、UBC、ATP2B1、CTSK、GPHB5、MAP3K9、PHF11、SLC16A4、UBD、ATP2B2、CTSL、GPHN、MAP4、PHF12、SLC16A5、UBE2A、ATP2B3、CTSO、GPI、MAP4K1、PHF13、SLC16A6、UBE2B、ATP2B4、CTSS、GPIHBP1、MAP4K2、PHF14、SLC16A7、UBE2C、ATP2C1、CTSV、GPKOW、MAP4K3、PHF19、SLC16A8、UBE2D1、ATP2C2、CTSW、GPLD1、MAP4K4、PHF2、SLC16A9、UBE2D2、ATP4A、CTSZ、GPM6A、MAP4K5、PHF20、SLC17A1、UBE2D3、ATP4B、CTTN、GPM6B、MAP6、PHF20L1、SLC17A2、UBE2D4、ATP5A1、CTTNBP2、GPN1、MAP6D1、PHF21A、SLC17A3、UBE2E1、ATP5B、CTTNBP2NL、GPN2、MAP7、PHF21B、SLC17A4、UBE2E2、ATP5C1、CTU1、GPN3、MAP7D1、PHF23、SLC17A5、UBE2E2-AS1、ATP5D、CTU2、GPNMB、MAP7D2、PHF3、SLC17A6、UBE2E3、ATP5E、CTXN1、GPR1、MAP7D3、PHF5A、SLC17A7、UBE2F、ATP5F1、CTXN2、GPR101、MAP9、PHF6、SLC17A8、UBE2G1、ATP5G1、CTXN3、GPR107、MAPK1、PHF7、SLC17A9、UBE2G2、ATP5G2、CUBN、GPR108、MAPK10、PHF8、SLC18A1、UBE2H、ATP5G3、CUEDC1、GPR110、MAPK11、PHGDH、SLC18A2、UBE2I、ATP5H、CUEDC2、GPR111、MAPK12、PHGR1、SLC18A3、UBE2J1、ATP5I、CUL1、GPR112、MAPK13、PHIP、SLC18B1、UBE2J2、ATP5J、CUL2、GPR113、MAPK14、PHKA1、SLC19A1、UBE2K、ATP5J2、CUL3、GPR114、MAPK15、PHKA2、SLC19A2、UBE2L3、
ATP5J2-PTCD1、CUL4A、GPR115、MAPK1IP1L、PHKB、SLC19A3、UBE2L6、ATP5L、CUL4B、GPR116、MAPK3、PHKG1、SLC1A1、UBE2M、ATP5L2、CUL5、GPR119、MAPK4、PHKG2、SLC1A2、UBE2N、ATP5O、CUL7、GPR12、MAPK6、PHLDA1、SLC1A3、UBE2NL、ATP5S、CUL9、GPR123、MAPK7、PHLDA2、SLC1A4、UBE2O、ATP5SL、CUTA、GPR124、MAPK8、PHLDA3、SLC1A5、UBE2Q1、ATP6AP1、CUTC、GPR125、MAPK8IP1、PHLDB1、SLC1A6、UBE2Q2、ATP6AP1L、CUX1、GPR126、MAPK8IP2、PHLDB2、SLC1A7、UBE2QL1、ATP6AP2、CUX2、GPR128、MAPK8IP3、PHLDB3、SLC20A1、UBE2R2、ATP6V0A1、CUZD1、GPR132、MAPK9、PHLPP1、SLC20A2、UBE2S、ATP6V0A2、CWC15、GPR133、MAPKAP1、PHLPP2、SLC22A1、UBE2T、ATP6V0A4、CWC22、GPR135、MAPKAPK2、PHOSPHO1、SLC22A10、UBE2U、ATP6V0B、CWC25、GPR137、MAPKAPK3、PHOSPHO2、SLC22A11、UBE2V1、ATP6V0C、CWC27、GPR137B、MAPKAPK5、PHOSPHO2-KLHL23、SLC22A12、UBE2V2、ATP6V0D1、CWF19L1、GPR137C、MAPKBP1、PHOX2A、SLC22A13、UBE2W、ATP6V0D2、CWF19L2、GPR139、MAPRE1、PHOX2B、SLC22A14、UBE2Z、ATP6V0E1、CWH43、GPR142、MAPRE2、PHPT1、SLC22A15、UBE3A、ATP6V0E2、CX3CL1、GPR143、MAPRE3、PHRF1、SLC22A16、UBE3B、ATP6V1A、CX3CR1、GPR148、MAPT、PHTF1、SLC22A17、UBE3C、ATP6V1B1、CXADR、GPR149、1-Mar、PHTF2、SLC22A18、UBE3D、ATP6V1B2、CXCL1、GPR15、10-Mar、PHYH、SLC22A18AS、UBE4A、ATP6V1C1、CXCL10、GPR150、11- Mar、PHYHD1、SLC22A2、UBE4B、ATP6V1C2、CXCL11、GPR151、2-Mar、PHYHIP、SLC22A20、UBFD1、ATP6V1D、CXCL12、GPR152、3-Mar、PHYHIPL、SLC22A23、UBIAD1、ATP6V1E1、CXCL13、GPR153、4- Mar、PHYKPL、SLC22A24、UBL3、ATP6V1E2、CXCL14、GPR155、5-Mar、PI15、SLC22A25、UBL4A、ATP6V1F、CXCL16、GPR156、6-Mar、PI16、SLC22A3、UBL4B、ATP6V1G1、CXCL17、GPR157、7-Mar、PI3、SLC22A31、UBL5、ATP6V1G2、CXCL2、GPR158、8-Mar、PI4K2A、SLC22A4、UBL7、ATP6V1G3、CXCL3、GPR160、9-Mar、PI4K2B、SLC22A5、UBLCP1、ATP6V1H、CXCL5、GPR161、MARCKS、PI4KA、SLC22A6、UBN1、ATP7A、CXCL6、GPR162、MARCKSL1、PI4KB、SLC22A7、UBN2、ATP7B、CXCL9、GPR17、MARCO、PIANP、SLC22A8、UBOX5、ATP8A1、CXCR1、GPR171、MARK1、PIAS1、SLC22A9、UBP1、ATP8A2、CXCR2、GPR173、MARK2、PIAS2、SLC23A1、UBQLN1、ATP8B1、CXCR3、GPR174、MARK3、PIAS3、SLC23A2、UBQLN2、ATP8B2、CXCR4、GPR176、MARK4、PIAS4、SLC23A3、UBQLN3、ATP8B3、CXCR5、GPR179、MARS、PIBF1、SLC24A1、UBQLN4、ATP8B4、CXCR6、GPR18、MARS2、PICALM、SLC24A2、UBQLNL、ATP9A、CXorf21、GPR180、MARVELD1、PICK1、SLC24A3、UBR1、ATP9B、CXorf22、GPR182、MARVELD2、PID1、SLC24A4、UBR2、ATPAF1、CXorf23、GPR183、MARVELD3、PIDD、SLC24A5、UBR3、ATPAF2、CXorf27、GPR19、MAS1、PIEZO1、SLC25A1、UBR4、ATPIF1、CXorf30、GPR20、MAS1L、PIEZO2、SLC25A10、UBR5、ATR、CXorf36、GPR21、MASP1、PIF1、SLC25A11、UBR7、ATRAID、CXorf38、GPR22、MASP2、PIFO、SLC25A12、UBTD1、ATRIP、CXorf40A、GPR25、MAST1、PIGA、SLC25A13、UBTD2、ATRN、CXorf40B、GPR26、MAST2、PIGB、SLC25A14、UBTF、ATRNL1、CXorf48、GPR27、MAST3、PIGC、SLC25A15、UBTFL1、ATRX、CXorf49、GPR3、MAST4、PIGF、SLC25A16、UBXN1、ATXN1、CXorf49B、GPR31、MASTL、PIGG、SLC25A17、UBXN10、ATXN10、CXorf51A、GPR32、MAT1A、PIGH、SLC25A18、UBXN11、ATXN1L、
CXorf51B、GPR33、MAT2A、PIGK、SLC25A19、UBXN2A、ATXN2、CXorf56、GPR34、MAT2B、PIGL、SLC25A2、UBXN2B、ATXN2L、CXorf57、GPR35、MATK、PIGM、SLC25A20、UBXN4、ATXN3、CXorf58、GPR37、MATN1、PIGN、SLC25A21、UBXN6、ATXN3L、CXorf61、GPR37L1、MATN2、PIGO、SLC25A22、UBXN7、ATXN7、CXorf64、GPR39、MATN3、PIGP、SLC25A23、UBXN8、ATXN7L1、CXorf65、GPR4、MATN4、PIGQ、SLC25A24、UCHL1、ATXN7L2、CXorf66、GPR45、MATR3、PIGR、SLC25A25、UCHL3、ATXN7L3、CXXC1、GPR50、MAU2、PIGS、SLC25A26、UCHL5、ATXN7L3B、CXXC11、GPR52、MAVS、PIGT、SLC25A27、UCK1、AUH、CXXC4、GPR55、MAX、PIGU、SLC25A28、UCK2、AUNIP、CXXC5、GPR56、MAZ、PIGV、SLC25A29、UCKL1、AUP1、CYB561、GPR6、MB、PIGW、SLC25A3、UCMA、AURKA、CYB561A3、GPR61、MB21D1、PIGX、SLC25A30、UCN、AURKAIP1、CYB561D1、GPR62、MB21D2、PIGY、SLC25A31、UCN2、AURKB、CYB561D2、GPR63、MBD1、PIGZ、SLC25A32、UCN3、AURKC、CYB5A、GPR64、MBD2、PIH1D1、SLC25A33、UCP1、AUTS2、CYB5B、GPR65、MBD3、PIH1D2、SLC25A34、UCP2、AVEN、CYB5D1、GPR68、MBD3L1、PIH1D3、SLC25A35、UCP3、AVIL、CYB5D2、GPR75、MBD4、PIK3AP1、SLC25A36、UEVLD、AVL9、CYB5R1、GPR75-ASB3、MBD5、PIK3C2A、SLC25A37、UFC1、AVP、CYB5R2、GPR78、MBD6、PIK3C2B、SLC25A38、UFD1L、AVPI1、CYB5R3、GPR82、MBIP、PIK3C2G、SLC25A39、UFL1、AVPR1A、CYB5R4、GPR83、MBL2、PIK3C3、SLC25A4、UFM1、AVPR1B、CYB5RL、GPR84、MBLAC1、PIK3CA、SLC25A40、UFSP1、AVPR2、CYBA、GPR85、MBLAC2、PIK3CB、SLC25A41、UFSP2、AWAT1、CYBB、GPR87、MBNL1、PIK3CD、SLC25A42、UGCG、AWAT2、CYBRD1、GPR88、MBNL2、PIK3CG、SLC25A43、UGDH、AXDND1、CYC1、GPR89A、MBNL3、PIK3IP1、SLC25A44、UGGT1、AXIN1、CYCS、GPR89B、MBOAT1、PIK3R1、SLC25A45、UGGT2、AXIN2、CYFIP1、GPR89C、MBOAT2、PIK3R2、SLC25A46、UGP2、AXL、CYFIP2、GPR97、MBOAT4、PIK3R3、SLC25A47、UGT1A1、AZGP1、CYGB、GPR98、MBOAT7、PIK3R4、SLC25A48、UGT1A10、AZI1、CYHR1、GPRASP1、MBP、PIK3R5、SLC25A5、UGT1A3、AZI2、CYLC1、GPRASP2、MBTD1、PIK3R6、SLC25A51、UGT1A4、AZIN1、CYLC2、GPRC5A、MBTPS1、PIKFYVE、SLC25A52、UGT1A5、AZU1、CYLD、GPRC5B、MBTPS2、PILRA、SLC25A53、UGT1A6、B2M、CYorf17、GPRC5C、MC1R、PILRB、SLC25A6、UGT1A7、B3GALNT1、CYP11A1、GPRC5D、MC2R、PIM1、SLC26A1、UGT1A8、B3GALNT2、CYP11B1、GPRC6A、MC3R、PIM2、SLC26A10、UGT1A9、B3GALT1、CYP11B2、GPRIN1、MC4R、PIM3、SLC26A11、UGT2A1、B3GALT2、CYP17A1、GPRIN2、MC5R、PIN1、SLC26A2、UGT2A2、B3GALT4、CYP19A1、GPRIN3、MCAM、PIN4、SLC26A3、UGT2A3、B3GALT5、CYP1A1、GPS1、MCAT、PINK1、SLC26A4、UGT2B10、B3GALT6、CYP1A2、GPS2、MCC、PINLYP、SLC26A5、UGT2B11、B3GALTL、CYP1B1、GPSM1、MCCC1、PINX1、SLC26A6、UGT2B15、B3GAT1、CYP20A1、GPSM2、MCCC2、PIP、SLC26A7、UGT2B17、B3GAT2、CYP21A2、GPSM3、MCCD1、PIP4K2A、SLC26A8、UGT2B28、B3GAT3、CYP24A1、GPT、MCEE、PIP4K2B、SLC26A9、UGT2B4、B3GNT1、CYP26A1、GPT2、MCF2、PIP4K2C、SLC27A1、UGT2B7、B3GNT2、CYP26B1、GPX1、MCF2L、PIP5K1A、SLC27A2、UGT3A1、B3GNT3、CYP26C1、GPX2、MCF2L2、PIP5K1B、SLC27A3、UGT3A2、B3GNT4、CYP27A1、GPX3、MCFD2、PIP5K1C、SLC27A4、UGT8、B3GNT5、CYP27B1、GPX4、MCHR1、PIP5KL1、SLC27A5、UHMK1、B3GNT6、
CYP27C1、GPX5、MCHR2、PIPOX、SLC27A6、UHRF1、B3GNT7、CYP2A13、GPX6、MCIDAS、PIR、SLC28A1、UHRF1BP1、B3GNT8、CYP2A6、GPX7、MCL1、PIRT、SLC28A2、UHRF1BP1L、B3GNT9、CYP2A7、GPX8、MCM10、PISD、SLC28A3、UHRF2、B3GNTL1、CYP2B6、GRAMD1A、MCM2、PITHD1、SLC29A1、UIMC1、B4GALNT1、CYP2C18、GRAMD1B、MCM3、PITPNA、SLC29A2、ULBP1、B4GALNT2、CYP2C19、GRAMD1C、MCM3AP、PITPNB、SLC29A3、ULBP2、B4GALNT3、CYP2C8、GRAMD2、MCM4、PITPNC1、SLC29A4、ULBP3、B4GALNT4、CYP2C9、GRAMD3、MCM5、PITPNM1、SLC2A1、ULK1、B4GALT1、CYP2D6、GRAMD4、MCM6、PITPNM2、SLC2A10、ULK2、B4GALT2、CYP2E1、GRAP、MCM7、PITPNM3、SLC2A11、ULK3、B4GALT3、CYP2F1、GRAP2、MCM8、PITRM1、SLC2A12、ULK4、B4GALT4、CYP2J2、GRAPL、MCM9、PITX1、SLC2A13、UMOD、B4GALT5、CYP2R1、GRASP、MCMBP、PITX2、SLC2A14、UMODL1、B4GALT6、CYP2S1、GRB10、MCMDC2、PITX3、SLC2A2、UMPS、B4GALT7、CYP2U1、GRB14、MCOLN1、PIWIL1、SLC2A3、UNC119、B9D1、CYP2W1、GRB2、MCOLN2、PIWIL2、SLC2A4、UNC119B、B9D2、CYP39A1、GRB7、MCOLN3、PIWIL3、SLC2A4RG、UNC13A、BAALC、CYP3A4、GREB1、MCPH1、PIWIL4、SLC2A5、UNC13B、BAAT、CYP3A43、GREM1、MCRS1、PJA1、SLC2A6、UNC13C、BABAM1、CYP3A5、GREM2、MCTP1、PJA2、SLC2A7、UNC13D、BACE1、CYP3A7、GRHL1、MCTP2、PKD1、SLC2A8、UNC45A、BACE2、CYP3A7- CYP3AP1、GRHL2、MCTS1、PKD1L1、SLC2A9、UNC45B、BACH1、CYP46A1、GRHL3、MCU、PKD1L2、SLC30A1、UNC50、BACH2、CYP4A11、GRHPR、MCUR1、PKD1L3、SLC30A10、UNC5A、BAD、CYP4A22、GRIA1、MDC1、PKD2、SLC30A2、UNC5B、BAG1、CYP4B1、GRIA2、MDFI、PKD2L1、SLC30A3、UNC5C、BAG2、CYP4F11、GRIA3、MDFIC、PKD2L2、SLC30A4、UNC5CL、BAG3、CYP4F12、GRIA4、MDGA1、PKDCC、SLC30A5、UNC5D、BAG4、CYP4F2、GRID1、MDGA2、PKDREJ、SLC30A6、UNC79、BAG5、CYP4F22、GRID2、MDH1、PKHD1、SLC30A7、UNC80、BAG6、CYP4F3、GRID2IP、MDH1B、PKHD1L1、SLC30A8、UNC93A、BAGE、CYP4F8、GRIFIN、MDH2、PKIA、SLC30A9、UNC93B1、BAGE2、CYP4V2、GRIK1、MDK、PKIB、SLC31A1、UNCX、BAGE3、CYP4X1、GRIK2、MDM1、PKIG、SLC31A2、UNG、BAHCC1、CYP4Z1、GRIK3、MDM2、PKLR、SLC32A1、UNK、BAHD1、CYP51A1、GRIK4、MDM4、PKM、SLC33A1、UNKL、BAI1、CYP7A1、GRIK5、MDN1、PKMYT1、SLC34A1、UPB1、BAI2、CYP7B1、GRIN1、MDP1、PKN1、SLC34A2、UPF1、BAI3、CYP8B1、GRIN2A、ME1、PKN2、SLC34A3、UPF2、BAIAP2、CYR61、GRIN2B、ME2、PKN3、SLC35A1、UPF3A、BAIAP2L1、CYS1、GRIN2C、ME3、PKNOX1、SLC35A2、UPF3B、BAIAP2L2、CYSLTR1、GRIN2D、MEA1、PKNOX2、SLC35A3、UPK1A、BAIAP3、CYSLTR2、GRIN3A、MEAF6、PKP1、SLC35A4、UPK1B、BAK1、CYSTM1、GRIN3B、MECOM、PKP2、SLC35A5、UPK2、BAMBI、CYTH1、GRINA、MECP2、PKP3、SLC35B1、UPK3A、BANF1、CYTH2、GRIP1、MECR、PKP4、SLC35B2、UPK3B、BANF2、CYTH3、GRIP2、MED1、PLA1A、SLC35B3、UPK3BL、BANK1、CYTH4、GRIPAP1、MED10、PLA2G10、SLC35B4、UPP1、BANP、CYTIP、GRK1、MED11、PLA2G12A、SLC35C1、UPP2、BAP1、CYTL1、GRK4、MED12、PLA2G12B、SLC35C2、UPRT、BARD1、CYYR1、GRK5、MED12L、PLA2G15、SLC35D1、UQCC1、BARHL1、D2HGDH、GRK6、MED13、PLA2G16、SLC35D2、UQCC2、BARHL2、DAAM1、GRK7、MED13L、PLA2G1B、SLC35D3、UQCR10、BARX1、DAAM2、GRM1、MED14、PLA2G2A、SLC35E1、UQCR11、BARX2、DAB1、GRM2、MED15、PLA2G2C、SLC35E2、UQCRB、BASP1、
DAB2、GRM3、MED16、PLA2G2D、SLC35E2B、UQCRC1、BATF、DAB2IP、GRM4、MED17、PLA2G2E、SLC35E3、UQCRC2、BATF2、DACH1、GRM5、MED18、PLA2G2F、SLC35E4、UQCRFS1、BATF3、DACH2、GRM6、MED19、PLA2G3、SLC35F1、UQCRH、BAX、DACT1、GRM7、MED20、PLA2G4A、SLC35F2、UQCRHL、BAZ1A、DACT2、GRM8、MED21、PLA2G4B、SLC35F3、UQCRQ、BAZ1B、DACT3、GRN、MED22、PLA2G4C、SLC35F4、URAD、BAZ2A、DAD1、GRP、MED23、PLA2G4D、SLC35F5、URB1、BAZ2B、DAG1、GRPEL1、MED24、PLA2G4E、SLC35F6、URB2、BBC3、DAGLA、GRPEL2、MED25、PLA2G4F、SLC35G1、URGCP、BBIP1、DAGLB、GRPR、MED26、PLA2G5、SLC35G2、URGCP-MRPS24、BBOX1、DAK、GRSF1、MED27、PLA2G6、SLC35G3、URI1、BBS1、DALRD3、GRTP1、MED28、PLA2G7、SLC35G4、URM1、BBS10、DAND5、GRWD1、MED29、PLA2R1、SLC35G5、UROC1、BBS12、DAO、GRXCR1、MED30、PLAA、SLC35G6、UROD、BBS2、DAOA、GRXCR2、MED31、PLAC1、SLC36A1、UROS、BBS4、DAP、GSAP、MED4、PLAC4、SLC36A2、USB1、BBS5、DAP3、GSC、MED6、PLAC8、SLC36A3、USE1、BBS7、DAPK1、GSC2、MED7、PLAC8L1、SLC36A4、USF1、BBS9、DAPK2、GSDMA、MED8、PLAC9、SLC37A1、USF2、BBX、DAPK3、GSDMB、MED9、PLAG1、SLC37A2、USH1C、BCAM、DAPL1、GSDMC、MEDAG、PLAGL1、SLC37A3、USH1G、BCAN、DAPP1、GSDMD、MEF2A、PLAGL2、SLC37A4、USH2A、BCAP29、DARS、GSE1、MEF2B、PLAT、SLC38A1、USHBP1、BCAP31、DARS2、GSG1、MEF2BNB、PLAU、SLC38A10、USMG5、BCAR1、DAW1、GSG1L、MEF2BNB-MEF2B、PLAUR、SLC38A11、USO1、BCAR3、DAXX、GSG2、MEF2C、PLB1、SLC38A2、USP1、BCAS1、DAZ1、GSK3A、MEF2D、PLBD1、SLC38A3、USP10、BCAS2、DAZ2、GSK3B、MEFV、PLBD2、SLC38A4、USP11、BCAS3、DAZ3、GSKIP、MEGF10、PLCB1、SLC38A5、USP12、BCAS4、DAZ4、GSN、MEGF11、PLCB2、SLC38A6、USP13、BCAT1、DAZAP1、GSPT1、MEGF6、PLCB3、SLC38A7、USP14、BCAT2、DAZAP2、GSPT2、MEGF8、PLCB4、SLC38A8、USP15、BCCIP、DAZL、GSR、MEGF9、PLCD1、SLC38A9、USP16、BCDIN3D、DBF4、GSS、MEI1、PLCD3、SLC39A1、USP17L24、BCHE、DBF4B、GSTA1、MEI4、PLCD4、SLC39A10、USP17L25、BCKDHA、DBH、GSTA2、MEIG1、PLCE1、SLC39A11、USP17L26、BCKDHB、DBI、GSTA3、MEIOB、PLCG1、SLC39A12、USP17L28、BCKDK、DBN1、GSTA4、MEIS1、PLCG2、SLC39A13、USP17L29、BCL10、DBNDD1、GSTA5、MEIS2、PLCH1、SLC39A14、USP18、BCL11A、DBNDD2、GSTCD、MEIS3、PLCH2、SLC39A2、USP19、BCL11B、DBNL、GSTK1、MELK、PLCL1、SLC39A3、USP2、BCL2、DBP、GSTM1、MEMO1、PLCL2、SLC39A4、USP20、BCL2A1、DBR1、GSTM2、MEN1、PLCXD1、SLC39A5、USP21、BCL2L1、DBT、GSTM3、MEOX1、PLCXD2、SLC39A6、USP22、BCL2L10、DBX1、GSTM4、MEOX2、PLCXD3、SLC39A7、USP24、BCL2L11、DBX2、GSTM5、MEP1A、PLCZ1、SLC39A8、USP25、BCL2L12、DCAF10、GSTO1、MEP1B、PLD1、SLC39A9、USP26、BCL2L13、DCAF11、GSTO2、MEPCE、PLD2、SLC3A1、USP27X、BCL2L14、DCAF12、GSTP1、MEPE、PLD3、SLC3A2、USP28、BCL2L15、DCAF12L1、GSTT1、MERTK、PLD4、SLC40A1、USP29、BCL2L2、DCAF12L2、GSTT2、MESDC1、PLD5、SLC41A1、USP3、BCL2L2-PABPN1、DCAF13、GSTT2B、MESDC2、PLD6、SLC41A2、USP30、BCL3、DCAF15、GSTZ1、MESP1、PLEC、SLC41A3、USP31、BCL6、DCAF16、GSX1、MESP2、PLEK、SLC43A1、USP32、BCL6B、DCAF17、GSX2、MEST、PLEK2、SLC43A2、USP33、BCL7A、DCAF4、GTDC1、MET、PLEKHA1、SLC43A3、
USP34、BCL7B、DCAF4L1、GTF2A1、METAP1、PLEKHA2、SLC44A1、USP35、BCL7C、DCAF4L2、GTF2A1L、METAP1D、PLEKHA3、SLC44A2、USP36、BCL9、DCAF5、GTF2A2、METAP2、PLEKHA4、SLC44A3、USP37、BCL9L、DCAF6、GTF2B、METRN、PLEKHA5、SLC44A4、USP38、BCLAF1、DCAF7、GTF2E1、METRNL、PLEKHA6、SLC44A5、USP39、BCMO1、DCAF8、GTF2E2、METTL1、PLEKHA7、SLC45A1、USP4、BCO2、DCAF8L1、GTF2F1、METTL10、PLEKHA8、SLC45A2、USP40、BCOR、DCAKD、GTF2F2、METTL11B、PLEKHB1、SLC45A3、USP41、BCORL1、DCBLD1、GTF2H1、METTL12、PLEKHB2、SLC45A4、USP42、BCR、DCBLD2、GTF2H2、METTL13、PLEKHD1、SLC46A1、USP43、BCS1L、DCC、GTF2H2C、METTL14、PLEKHF1、SLC46A2、USP44、BDH1、DCD、GTF2H3、METTL15、PLEKHF2、SLC46A3、USP45、BDH2、DCDC1、GTF2H4、METTL16、PLEKHG1、SLC47A1、USP46、BDKRB1、DCDC2、GTF2H5、METTL17、PLEKHG2、SLC47A2、USP47、BDKRB2、DCDC2B、GTF2I、METTL18、PLEKHG3、SLC48A1、USP48、BDNF、DCDC2C、GTF2IRD1、METTL20、PLEKHG4、SLC4A1、USP49、BDP1、DCDC5、GTF2IRD2、METTL21A、PLEKHG4B、SLC4A10、USP5、BEAN1、DCHS1、GTF2IRD2B、METTL21B、PLEKHG5、SLC4A11、USP50、BECN1、DCHS2、GTF3A、METTL21C、PLEKHG6、SLC4A1AP、USP51、BECN1P1、DCK、GTF3C1、METTL22、PLEKHG7、SLC4A2、USP53、BEGAIN、DCLK1、GTF3C2、METTL23、PLEKHH1、SLC4A3、USP54、BEND2、DCLK2、GTF3C3、METTL24、PLEKHH2、SLC4A4、USP6、BEND3、DCLK3、GTF3C4、METTL25、PLEKHH3、SLC4A5、USP6NL、BEND4、DCLRE1A、GTF3C5、METTL2A、PLEKHJ1、SLC4A7、USP7、BEND5、DCLRE1B、GTF3C6、METTL2B、PLEKHM1、SLC4A8、USP8、BEND6、DCLRE1C、GTPBP1、METTL3、PLEKHM2、SLC4A9、USP9X、BEND7、DCN、GTPBP10、METTL4、PLEKHM3、SLC50A1、USP9Y、BEST1、DCP1A、GTPBP2、METTL5、PLEKHN1、SLC51A、USPL1、BEST2、DCP1B、GTPBP3、METTL6、PLEKHO1、SLC51B、UST、BEST3、DCP2、GTPBP4、METTL7A、PLEKHO2、SLC52A1、UTF1、BEST4、DCPS、GTPBP6、METTL7B、PLEKHS1、SLC52A2、UTP11L、BET1、DCST1、GTPBP8、METTL8、PLET1、SLC52A3、UTP14A、BET1L、DCST2、GTSCR1、METTL9、PLG、SLC5A1、UTP14C、BEX1、DCSTAMP、GTSE1、MEX3A、PLGLB1、SLC5A10、UTP15、BEX2、DCT、GTSF1、MEX3B、PLGLB2、SLC5A11、UTP18、BEX4、DCTD、GTSF1L、MEX3C、PLGRKT、SLC5A12、UTP20、BEX5、DCTN1、GUCA1A、MEX3D、PLIN1、SLC5A2、UTP23、BFAR、DCTN2、GUCA1B、MFAP1、PLIN2、SLC5A3、UTP3、BFSP1、DCTN3、GUCA1C、MFAP2、PLIN3、SLC5A4、UTP6、BFSP2、DCTN4、GUCA2A、MFAP3、PLIN4、SLC5A5、UTRN、BGLAP、DCTN5、GUCA2B、MFAP3L、PLIN5、SLC5A6、UTS2、BGN、DCTN6、GUCD1、MFAP4、PLK1、SLC5A7、UTS2B、BHLHA15、DCTPP1、GUCY1A2、MFAP5、PLK1S1、SLC5A8、UTY、BHLHA9、DCUN1D1、GUCY1A3、MFF、PLK2、SLC5A9、UVRAG、BHLHB9、DCUN1D2、GUCY1B3、MFGE8、PLK3、SLC6A1、UVSSA、BHLHE22、DCUN1D3、GUCY2C、MFHAS1、PLK4、SLC6A11、UXS1、BHLHE23、DCUN1D4、GUCY2D、MFI2、PLK5、SLC6A12、UXT、BHLHE40、DCUN1D5、GUCY2F、MFN1、PLLP、SLC6A13、VAC14、BHLHE41、DCX、GUF1、MFN2、PLN、SLC6A14、VAMP1、BHMT、DCXR、GUK1、MFNG、PLOD1、SLC6A15、VAMP2、BHMT2、DDA1、GULP1、MFRP、PLOD2、SLC6A16、VAMP3、BICC1、DDAH1、GUSB、MFSD1、PLOD3、SLC6A17、VAMP4、BICD1、DDAH2、GXYLT1、MFSD10、PLP1、SLC6A18、VAMP5、BICD2、DDB1、GXYLT2、MFSD11、PLP2、SLC6A19、
VAMP7、BID、DDB2、GYG1、MFSD12、PLRG1、SLC6A2、VAMP8、BIK、DDC、GYG2、MFSD2A、PLS1、SLC6A20、VANGL1、BIN1、DDHD1、GYLTL1B、MFSD2B、PLS3、SLC6A3、VANGL2、BIN2、DDHD2、GYPA、MFSD3、PLSCR1、SLC6A4、VAPA、BIN3、DDI1、GYPB、MFSD4、PLSCR2、SLC6A5、VAPB、BIRC2、DDI2、GYPC、MFSD5、PLSCR3、SLC6A6、VARS、BIRC3、DDIT3、GYPE、MFSD6、PLSCR4、SLC6A7、VARS2、BIRC5、DDIT4、GYS1、MFSD6L、PLSCR5、SLC6A8、VASH1、BIRC6、DDIT4L、GYS2、MFSD7、PLTP、SLC6A9、VASH2、BIRC7、DDN、GZF1、MFSD8、PLVAP、SLC7A1、VASN、BIRC8、DDO、GZMA、MFSD9、PLXDC1、SLC7A10、VASP、BIVM、DDOST、GZMB、MGA、PLXDC2、SLC7A11、VAT1、BIVM-ERCC5、DDR1、GZMH、MGAM、PLXNA1、SLC7A13、VAT1L、BLCAP、DDR2、GZMK、MGARP、PLXNA2、SLC7A14、VAV1、BLID、DDRGK1、GZMM、MGAT1、PLXNA3、SLC7A2、VAV2、BLK、DDT、H1F0、MGAT2、PLXNA4、SLC7A3、VAV3、BLM、DDTL、H1FNT、MGAT3、PLXNB1、SLC7A4、VAX1、BLMH、DDX1、H1FOO、MGAT4A、PLXNB2、SLC7A5、VAX2、BLNK、DDX10、H1FX、MGAT4B、PLXNB3、SLC7A6、VBP1、BLOC1S1、DDX11、H2AFB1、MGAT4C、PLXNC1、SLC7A6OS、VCAM1、BLOC1S2、DDX17、H2AFB2、MGAT5、PLXND1、SLC7A7、VCAN、BLOC1S3、DDX18、H2AFB3、MGAT5B、PM20D1、SLC7A8、VCL、BLOC1S4、DDX19A、H2AFJ、MGEA5、PM20D2、SLC7A9、VCP、BLOC1S5、DDX19B、H2AFV、MGLL、PMAIP1、SLC8A1、VCPIP1、BLOC1S6、DDX20、H2AFX、MGME1、PMCH、SLC8A2、VCPKMT、BLVRA、DDX21、H2AFY、MGMT、PMEL、SLC8A3、VCX、BLVRB、DDX23、H2AFY2、MGP、PMEPA1、SLC8B1、VCX2、BLZF1、DDX24、H2AFZ、MGRN1、PMF1、SLC9A1、VCX3A、BMF、DDX25、H2BFM、MGST1、PMF1-BGLAP、SLC9A2、VCX3B、BMI1、DDX26B、H2BFWT、MGST2、PMFBP1、SLC9A3、VCY、BMP1、DDX27、H3F3A、MGST3、PML、SLC9A3R1、VCY1B、BMP10、DDX28、H3F3B、MIA、PMM1、SLC9A3R2、VDAC1、BMP15、DDX31、H3F3C、MIA2、PMM2、SLC9A4、VDAC2、BMP2、DDX39A、H6PD、MIA3、PMP2、SLC9A5、VDAC3、BMP2K、DDX39B、HAAO、MIB1、PMP22、SLC9A6、VDR、BMP3、DDX3X、HABP2、MIB2、PMPCA、SLC9A7、VEGFA、BMP4、DDX3Y、HABP4、MICA、PMPCB、SLC9A8、VEGFB、BMP5、DDX4、HACE1、MICAL1、PMS1、SLC9A9、VEGFC、BMP6、DDX41、HACL1、MICAL2、PMS2、SLC9B1、VENTX、BMP7、DDX42、HADH、MICAL3、PMVK、SLC9B2、VEPH1、BMP8A、DDX43、HADHA、MICALCL、PNCK、SLC9C1、VEZF1、BMP8B、DDX46、HADHB、MICALL1、PNISR、SLC9C2、VEZT、BMPER、DDX47、HAGH、MICALL2、PNKD、SLCO1A2、VGF、BMPR1A、DDX49、HAGHL、MICB、PNKP、SLCO1B1、VGLL1、BMPR1B、DDX5、HAL、MICU1、PNLDC1、SLCO1B3、VGLL2、BMPR2、DDX50、HAMP、MICU2、PNLIP、SLCO1B7、VGLL3、BMS1、DDX51、HAND1、MICU3、PNLIPRP1、SLCO1C1、VGLL4、BMX、DDX52、HAND2、MID1、PNLIPRP2、SLCO2A1、VHL、BNC1、DDX53、HAO1、MID1IP1、PNLIPRP3、SLCO2B1、VHLL、BNC2、DDX54、HAO2、MID2、PNMA2、SLCO3A1、VIL1、BNIP1、DDX55、HAP1、MIDN、PNMA3、SLCO4A1、VILL、BNIP2、DDX56、HAPLN1、MIEF1、PNMA5、SLCO4C1、VIM、BNIP3、DDX58、HAPLN2、MIEF2、PNMA6A、SLCO5A1、VIMP、BNIP3L、DDX59、HAPLN3、MIEN1、PNMA6C、SLCO6A1、VIP、BNIPL、DDX6、HAPLN4、MIER1、PNMAL1、SLFN11、VIPAS39、BOC、DDX60、HARBI1、MIER2、PNMAL2、SLFN12、VIPR1、BOD1、DDX60L、HARS、MIER3、PNMT、SLFN12L、VIPR2、BOD1L1、DEAF1、HARS2、MIF、PNN、SLFN13、VIT、BOD1L2、1-Dec、HAS1、MIF4GD、PNO1、SLFN14、VKORC1、BOK、DECR1、HAS2、MIIP、PNOC、SLFN5、
VKORC1L1、BOLA1、DECR2、HAS3、MILR1、PNP、SLFNL1、VLDLR、BOLA2、DEDD、HAT1、MINA、PNPLA1、SLIRP、VMA21、BOLA2B、DEDD2、HAUS1、MINK1、PNPLA2、SLIT1、VMAC、BOLA3、DEF6、HAUS2、MINOS1、PNPLA3、SLIT2、VMO1、BOLL、DEF8、HAUS3、MINOS1-NBL1、PNPLA4、SLIT3、VMP1、BOP1、DEFA1、HAUS4、MINPP1、PNPLA5、SLITRK1、VN1R1、BORA、DEFA1B、HAUS5、MIOS、PNPLA6、SLITRK2、VN1R2、BPGM、DEFA3、HAUS6、MIOX、PNPLA7、SLITRK3、VN1R4、BPHL、DEFA4、HAUS7、MIP、PNPLA8、SLITRK4、VN1R5、BPI、DEFA5、HAUS8、MIPEP、PNPO、SLITRK5、VNN1、BPIFA1、DEFA6、HAVCR1、MIPOL1、PNPT1、SLITRK6、VNN2、BPIFA2、DEFB1、HAVCR2、MIR205HG、PNRC1、SLK、VOPP1、BPIFA3、DEFB103A、HAX1、MIS12、PNRC2、SLMAP、VPRBP、BPIFB1、DEFB103B、HBA1、MIS18A、POC1A、SLMO1、VPREB1、BPIFB2、DEFB104A、HBA2、MIS18BP1、POC1B、SLMO2、VPREB3、BPIFB3、DEFB104B、HBB、MISP、POC1B-GALNT4、SLN、VPS11、BPIFB4、DEFB105A、HBD、MITD1、POC5、SLPI、VPS13A、BPIFB6、DEFB105B、HBE1、MITF、PODN、SLTM、VPS13B、BPIFC、DEFB106A、HBEGF、MIXL1、PODNL1、SLU7、VPS13C、BPNT1、DEFB106B、HBG1、MKI67、PODXL、SLURP1、VPS13D、BPTF、DEFB107A、HBG2、MKKS、PODXL2、SLX1A、VPS16、BPY2、DEFB107B、HBM、MKL1、POF1B、SLX1B、VPS18、BPY2B、DEFB108B、HBP1、MKL2、POFUT1、SLX4、VPS25、BPY2C、DEFB110、HBQ1、MKLN1、POFUT2、SLX4IP、VPS26A、BRAF、DEFB112、HBS1L、MKNK1、POGK、SMAD1、VPS26B、BRAP、DEFB113、HBZ、MKNK2、POGLUT1、SMAD2、VPS28、BRAT1、DEFB114、HCAR1、MKRN1、POGZ、SMAD3、VPS29、BRCA1、DEFB115、HCAR2、MKRN2、POLA1、SMAD4、VPS33A、BRCA2、DEFB116、HCAR3、MKRN3、POLA2、SMAD5、VPS33B、BRCC3、DEFB118、HCCS、MKS1、POLB、SMAD6、VPS35、BRD1、DEFB119、HCFC1、MKX、POLD1、SMAD7、VPS36、BRD2、DEFB121、HCFC1R1、MLANA、POLD2、SMAD9、VPS37A、BRD3、DEFB123、HCFC2、MLC1、POLD3、SMAGP、VPS37B、BRD4、DEFB124、HCK、MLEC、POLD4、SMAP1、VPS37C、BRD7、DEFB125、HCLS1、MLF1、POLDIP2、SMAP2、VPS37D、BRD8、DEFB126、HCN1、MLF2、POLDIP3、SMARCA1、VPS39、BRD9、DEFB127、HCN2、MLH1、POLE、SMARCA2、VPS41、BRDT、DEFB128、HCN3、MLH3、POLE2、SMARCA4、VPS45、BRE、DEFB129、HCN4、MLIP、POLE3、SMARCA5、VPS4A、BRF1、DEFB130、HCRT、MLKL、POLE4、SMARCAD1、VPS4B、BRF2、DEFB131、HCRTR1、MLLT1、POLG、SMARCAL1、VPS51、BRI3、DEFB132、HCRTR2、MLLT10、POLG2、SMARCB1、VPS52、BRI3BP、DEFB133、HCST、MLLT11、POLH、SMARCC1、VPS53、BRICD5、DEFB134、HDAC1、MLLT3、POLI、SMARCC2、VPS54、BRINP1、DEFB135、HDAC10、MLLT4、POLK、SMARCD1、VPS72、BRINP2、DEFB136、HDAC11、MLLT6、POLL、SMARCD2、VPS8、BRINP3、DEFB4A、HDAC2、MLN、POLM、SMARCD3、VPS9D1、BRIP1、DEFB4B、HDAC3、MLNR、POLN、SMARCE1、VRK1、BRIX1、DEGS1、HDAC4、MLPH、POLQ、SMC1A、VRK2、BRK1、DEGS2、HDAC5、MLST8、POLR1A、SMC1B、VRK3、BRMS1、DEK、HDAC6、MLX、POLR1B、SMC2、VRTN、BRMS1L、DENND1A、HDAC7、MLXIP、POLR1C、SMC3、VSIG1、BROX、DENND1B、HDAC8、MLXIPL、POLR1D、SMC4、VSIG10、BRPF1、DENND1C、HDAC9、MLYCD、POLR1E、SMC5、VSIG10L、BRPF3、DENND2A、HDC、MMAA、POLR2A、SMC6、VSIG2、BRS3、DENND2C、HDDC2、MMAB、POLR2B、SMCHD1、VSIG4、BRSK1、DENND2D、HDDC3、MMACHC、POLR2C、SMCO2、VSIG8、BRSK2、DENND3、HDGF、MMADHC、POLR2D、SMCO3、
VSNL1、BRWD1、DENND4A、HDGFL1、MMD、POLR2E、SMCO4、VSTM1、BRWD3、DENND4B、HDGFRP2、MMD2、POLR2F、SMCP、VSTM2A、BSCL2、DENND4C、HDGFRP3、MME、POLR2G、SMCR8、VSTM2B、BSDC1、DENND5A、HDHD1、MMEL1、POLR2H、SMCR9、VSTM2L、BSG、DENND5B、HDHD2、MMGT1、POLR2I、SMDT1、VSTM4、BSN、DENND6A、HDHD3、MMP1、POLR2J、SMEK1、VSTM5、BSND、DENND6B、HDLBP、MMP10、POLR2J2、SMEK2、VSX1、BSPH1、DENR、HDX、MMP11、POLR2J3、SMG1、VSX2、BSPRY、DEPDC1、HEATR1、MMP12、POLR2K、SMG5、VTA1、BST1、DEPDC1B、HEATR2、MMP13、POLR2L、SMG6、VTCN1、BST2、DEPDC4、HEATR3、MMP14、POLR2M、SMG7、VTI1A、BSX、DEPDC5、HEATR4、MMP15、POLR3A、SMG8、VTI1B、BTAF1、DEPDC7、HEATR5A、MMP16、POLR3B、SMG9、VTN、BTBD1、DEPTOR、HEATR5B、MMP17、POLR3C、SMIM1、VWA1、BTBD10、DERA、HEATR6、MMP19、POLR3D、SMIM10、VWA2、BTBD11、DERL1、HEBP1、MMP2、POLR3E、SMIM11、VWA3A、BTBD16、DERL2、HEBP2、MMP20、POLR3F、SMIM12、VWA3B、BTBD17、DERL3、HECA、MMP21、POLR3G、SMIM13、VWA5A、BTBD18、DES、HECTD1、MMP23B、POLR3GL、SMIM14、VWA5B1、BTBD19、DESI1、HECTD2、MMP24、POLR3H、SMIM15、VWA5B2、BTBD2、DESI2、HECTD3、MMP25、POLR3K、SMIM17、VWA7、BTBD3、DET1、HECTD4、MMP26、POLRMT、SMIM18、VWA8、BTBD6、DEXI、HECW1、MMP27、POM121、SMIM19、VWA9、BTBD7、DFFA、HECW2、MMP28、POM121C、SMIM2、VWC2、BTBD8、DFFB、HEG1、MMP3、POM121L12、SMIM20、VWC2L、BTBD9、DFNA5、HELB、MMP7、POM121L2、SMIM21、VWCE、BTC、DFNB31、HELLS、MMP8、POMC、SMIM22、VWDE、BTD、DFNB59、HELQ、MMP9、POMGNT1、SMIM3、VWF、BTF3、DGAT1、HELT、MMRN1、POMGNT2、SMIM4、WAC、BTF3L4、DGAT2、HELZ、MMRN2、POMK、SMIM5、WAPAL、BTG1、DGAT2L6、HELZ2、MMS19、POMP、SMIM6、WARS、BTG2、DGCR14、HEMGN、MMS22L、POMT1、SMIM7、WARS2、BTG3、DGCR2、HEMK1、MN1、POMT2、SMIM8、WAS、BTG4、DGCR6、HENMT1、MNAT1、POMZP3、SMIM9、WASF1、BTK、DGCR6L、HEPACAM、MND1、PON1、SMKR1、WASF2、BTLA、DGCR8、HEPACAM2、MNDA、PON2、SMLR1、WASF3、BTN1A1、DGKA、HEPH、MNS1、PON3、SMN1、WASH1、BTN2A1、DGKB、HEPHL1、MNT、POP1、SMN2、WASL、BTN2A2、DGKD、HEPN1、MNX1、POP4、SMNDC1、WBP1、BTN3A1、DGKE、HERC1、MOAP1、POP5、SMO、WBP11、BTN3A2、DGKG、HERC2、MOB1A、POP7、SMOC1、WBP1L、BTN3A3、DGKH、HERC3、MOB1B、POPDC2、SMOC2、WBP2、BTNL10、DGKI、HERC4、MOB2、POPDC3、SMOX、WBP2NL、BTNL2、DGKK、HERC5、MOB3A、POR、SMPD1、WBP4、BTNL3、DGKQ、HERC6、MOB3B、PORCN、SMPD2、WBP5、BTNL8、DGKZ、HERPUD1、MOB3C、POSTN、SMPD3、WBSCR16、BTNL9、DGUOK、HERPUD2、MOB4、POT1、SMPD4、WBSCR17、BTRC、DHCR24、HES1、MOBP、POTEA、SMPD5、WBSCR22、BUB1、DHCR7、HES2、MOCOS、POTEB、SMPDL3A、WBSCR27、BUB1B、DHDDS、HES3、MOCS1、POTEB2、SMPDL3B、WBSCR28、BUB3、DHDH、HES4、MOCS2、POTEC、SMPX、WDFY1、BUD13、DHFR、HES5、MOCS3、POTED、SMR3A、WDFY2、BUD31、DHFRL1、HES6、MOG、POTEE、SMR3B、WDFY3、BVES、DHH、HES7、MOGAT1、POTEF、SMS、WDFY4、BYSL、DHODH、HESX1、MOGAT2、POTEG、SMTN、WDHD1、BZRAP1、DHPS、HEXA、MOGAT3、POTEH、SMTNL1、WDPCP、BZW1、DHRS1、HEXB、MOGS、POTEI、SMTNL2、WDR1、BZW2、DHRS11、HEXDC、MOK、POTEJ、SMU1、WDR11、C10orf10、DHRS12、HEXIM1、MON1A、POTEM、SMUG1、WDR12、C10orf105、DHRS13、HEXIM2、MON1B、POU1F1、SMURF1、WDR13、
C10orf107、DHRS2、HEY1、MON2、POU2AF1、SMURF2、WDR16、C10orf11、DHRS3、HEY2、MORC1、POU2F1、SMYD1、WDR17、C10orf111、DHRS4、HEYL、MORC2、POU2F2、SMYD2、WDR18、C10orf112、DHRS4L1、HFE、MORC3、POU2F3、SMYD3、WDR19、C10orf113、DHRS4L2、HFE2、MORC4、POU3F1、SMYD4、WDR20、C10orf118、DHRS7、HFM1、MORF4L1、POU3F2、SMYD5、WDR24、C10orf12、DHRS7B、HGC6.3、MORF4L2、POU3F3、SNAI1、WDR25、C10orf120、DHRS7C、HGD、MORN1、POU3F4、SNAI2、WDR26、C10orf126、DHRS9、HGF、MORN2、POU4F1、SNAI3、WDR27、C10orf128、DHRSX、HGFAC、MORN3、POU4F2、SNAP23、WDR3、C10orf129、DHTKD1、HGS、MORN4、POU4F3、SNAP25、WDR31、C10orf131、DHX15、HGSNAT、MORN5、POU5F1、SNAP29、WDR33、C10orf137、DHX16、HHAT、MOS、POU5F1B、SNAP47、WDR34、C10orf2、DHX29、HHATL、MOSPD1、POU5F2、SNAP91、WDR35、C10orf25、DHX30、HHEX、MOSPD2、POU6F1、SNAPC1、WDR36、C10orf32、DHX32、HHIP、MOSPD3、POU6F2、SNAPC2、WDR37、C10orf35、DHX33、HHIPL1、MOV10、PP2D1、SNAPC3、WDR38、C10orf53、DHX34、HHIPL2、MOV10L1、PPA1、SNAPC4、WDR4、C10orf54、DHX35、HHLA1、MOXD1、PPA2、SNAPC5、WDR41、C10orf55、DHX36、HHLA2、MPC1、PPAN、SNAPIN、WDR43、C10orf62、DHX37、HHLA3、MPC1L、PPAN-P2RY11、SNCA、WDR44、C10orf67、DHX38、HIAT1、MPC2、PPAP2A、SNCAIP、WDR45、C10orf68、DHX40、HIATL1、MPDU1、PPAP2B、SNCB、WDR45B、C10orf71、DHX57、HIBADH、MPDZ、PPAP2C、SNCG、WDR46、C10orf76、DHX58、HIBCH、MPEG1、PPAPDC1A、SND1、WDR47、C10orf82、DHX8、HIC1、MPG、PPAPDC1B、SNED1、WDR48、C10orf88、DHX9、HIC2、MPHOSPH10、PPAPDC2、SNF8、WDR49、C10orf90、DIABLO、HID1、MPHOSPH6、PPAPDC3、SNIP1、WDR5、C10orf91、DIAPH1、HIF1A、MPHOSPH8、PPARA、SNN、WDR52、C10orf95、DIAPH2、HIF1AN、MPHOSPH9、PPARD、SNPH、WDR53、C10orf99、DIAPH3、HIF3A、MPI、PPARG、SNRK、WDR54、C11orf1、DICER1、HIGD1A、MPL、PPARGC1A、SNRNP200、WDR55、C11orf16、DIDO1、HIGD1B、MPLKIP、PPARGC1B、SNRNP25、WDR59、C11orf21、DIEXF、HIGD1C、MPND、PPAT、SNRNP27、WDR5B、C11orf24、DIMT1、HIGD2A、MPO、PPBP、SNRNP35、WDR6、C11orf30、DIO1、HILPDA、MPP1、PPCDC、SNRNP40、WDR60、C11orf31、DIO2、HINFP、MPP2、PPCS、SNRNP48、WDR61、C11orf35、DIO3、HINT1、MPP3、PPDPF、SNRNP70、WDR62、C11orf40、DIP2A、HINT2、MPP4、PPEF1、SNRPA、WDR63、C11orf42、DIP2B、HINT3、MPP5、PPEF2、SNRPA1、WDR64、C11orf44、DIP2C、HIP1、MPP6、PPFIA1、SNRPB、WDR65、C11orf45、DIRAS1、HIP1R、MPP7、PPFIA2、SNRPB2、WDR66、C11orf48、DIRAS2、HIPK1、MPPE1、PPFIA3、SNRPC、WDR7、C11orf49、DIRAS3、HIPK2、MPPED1、PPFIA4、SNRPD1、WDR70、C11orf52、DIRC1、HIPK3、MPPED2、PPFIBP1、SNRPD2、WDR72、C11orf53、DIRC2、HIPK4、MPRIP、PPFIBP2、SNRPD3、WDR73、C11orf54、DIS3、HIRA、MPST、PPHLN1、SNRPE、WDR74、C11orf57、DIS3L、HIRIP3、MPV17、PPIA、SNRPF、WDR75、C11orf58、DIS3L2、HIST1H1A、MPV17L、PPIAL4A、SNRPG、WDR76、C11orf63、DISC1、HIST1H1B、MPV17L2、PPIAL4B、SNRPN、WDR77、C11orf65、DISP1、HIST1H1C、MPZ、PPIAL4C、SNTA1、WDR78、C11orf68、DISP2、HIST1H1D、MPZL1、PPIAL4D、SNTB1、WDR81、C11orf70、DIXDC1、HIST1H1E、MPZL2、PPIAL4E、SNTB2、WDR82、C11orf71、DKC1、
HIST1H1T、MPZL3、PPIAL4F、SNTG1、WDR83、C11orf73、DKK1、HIST1H2AA、MR1、PPIAL4G、SNTG2、WDR83OS、C11orf74、DKK2、HIST1H2AB、MRAP、PPIB、SNTN、WDR86、C11orf80、DKK3、HIST1H2AC、MRAP2、PPIC、SNUPN、WDR87、C11orf82、DKK4、HIST1H2AD、MRAS、PPID、SNURF、WDR88、C11orf83、DKKL1、HIST1H2AE、MRC2、PPIE、SNW1、WDR89、C11orf84、DLAT、HIST1H2AG、MRE11A、PPIF、SNX1、WDR90、C11orf85、DLC1、HIST1H2AH、MREG、PPIG、SNX10、WDR91、C11orf86、DLD、HIST1H2AI、MRFAP1、PPIH、SNX11、WDR92、C11orf87、DLEC1、HIST1H2AJ、MRFAP1L1、PPIL1、SNX12、WDR93、C11orf88、DLEU7、HIST1H2AK、MRGBP、PPIL2、SNX13、WDR96、C11orf91、DLG1、HIST1H2AL、MRGPRD、PPIL3、SNX14、WDSUB1、C11orf94、DLG2、HIST1H2AM、MRGPRE、PPIL4、SNX15、WDTC1、C11orf95、DLG3、HIST1H2BA、MRGPRF、PPIL6、SNX16、WDYHV1、C11orf96、DLG4、HIST1H2BB、MRGPRG、PPIP5K1、SNX17、WEE1、C12orf10、DLG5、HIST1H2BC、MRGPRX1、PPIP5K2、SNX18、WEE2、C12orf23、DLGAP1、HIST1H2BD、MRGPRX2、PPL、SNX19、WFDC1、C12orf29、DLGAP2、HIST1H2BE、MRGPRX3、PPM1A、SNX2、WFDC10A、C12orf39、DLGAP4、HIST1H2BF、MRGPRX4、PPM1B、SNX20、WFDC10B、C12orf4、DLGAP5、HIST1H2BG、MRI1、PPM1D、SNX21、WFDC11、C12orf40、DLK1、HIST1H2BH、MRM1、PPM1E、SNX22、WFDC12、C12orf42、DLK2、HIST1H2BI、MRO、PPM1F、SNX24、WFDC13、C12orf43、DLL1、HIST1H2BJ、MROH1、PPM1G、SNX25、WFDC2、C12orf44、DLL3、HIST1H2BK、MROH2A、PPM1H、SNX27、WFDC3、C12orf45、DLL4、HIST1H2BL、MROH2B、PPM1J、SNX29、WFDC5、C12orf49、DLST、HIST1H2BM、MROH5、PPM1K、SNX3、WFDC6、C12orf5、DLX1、HIST1H2BN、MROH6、PPM1L、SNX30、WFDC8、C12orf50、DLX2、HIST1H2BO、MROH7、PPM1M、SNX31、WFDC9、C12orf52、DLX3、HIST1H3A、MROH8、PPM1N、SNX32、WFIKKN1、C12orf54、DLX4、HIST1H3B、MROH9、PPME1、SNX33、WFIKKN2、C12orf55、DLX5、HIST1H3C、MRP63、PPOX、SNX4、WFS1、C12orf56、DLX6、HIST1H3D、MRPL1、PPP1CA、SNX5、WHAMM、C12orf57、DMAP1、HIST1H3E、MRPL10、PPP1CB、SNX6、WHSC1、C12orf60、DMBT1、HIST1H3F、MRPL11、PPP1CC、SNX7、WHSC1L1、C12orf61、DMBX1、HIST1H3G、MRPL12、PPP1R10、SNX8、WIBG、C12orf65、DMC1、HIST1H3H、MRPL13、PPP1R11、SNX9、WIF1、C12orf66、DMD、HIST1H3I、MRPL14、PPP1R12A、SOAT1、WIPF1、C12orf68、DMGDH、HIST1H3J、MRPL15、PPP1R12B、SOAT2、WIPF2、C12orf71、DMKN、HIST1H4A、MRPL16、PPP1R12C、SOBP、WIPF3、C12orf73、DMP1、HIST1H4B、MRPL17、PPP1R13B、SOCS1、WIPI1、C12orf74、DMPK、HIST1H4C、MRPL18、PPP1R13L、SOCS2、WIPI2、C12orf75、DMRT1、HIST1H4D、MRPL19、PPP1R14A、SOCS3、WISP1、C12orf76、DMRT2、HIST1H4E、MRPL2、PPP1R14B、SOCS4、WISP2、C12orf77、DMRT3、HIST1H4F、MRPL20、PPP1R14C、SOCS5、WISP3、C12orf79、DMRTA1、HIST1H4G、MRPL21、PPP1R14D、SOCS6、WIZ、C12orf80、DMRTA2、HIST1H4H、MRPL22、PPP1R15A、SOCS7、WLS、C13orf35、DMRTB1、HIST1H4I、MRPL23、PPP1R15B、SOD1、WNK1、C13orf45、DMRTC1、HIST1H4J、MRPL24、PPP1R16A、SOD2、WNK2、C14orf1、DMRTC1B、HIST1H4K、MRPL27、PPP1R16B、SOD3、WNK3、C14orf105、DMRTC2、HIST1H4L、MRPL28、PPP1R17、SOGA1、WNK4、C14orf119、DMTF1、HIST2H2AA3、MRPL3、PPP1R18、SOGA2、WNT1、C14orf132、DMTN、HIST2H2AA4、
MRPL30、PPP1R1A、SOGA3、WNT10A、C14orf142、DMWD、HIST2H2AB、MRPL32、PPP1R1B、SOHLH1、WNT10B、C14orf159、DMXL1、HIST2H2AC、MRPL33、PPP1R1C、SOHLH2、WNT11、C14orf164、DMXL2、HIST2H2BE、MRPL34、PPP1R2、SON、WNT16、C14orf166、DNA2、HIST2H2BF、MRPL35、PPP1R21、SORBS1、WNT2、C14orf166B、DNAAF1、HIST2H3A、MRPL36、PPP1R26、SORBS2、WNT2B、C14orf169、DNAAF2、HIST2H3C、MRPL37、PPP1R27、SORBS3、WNT3、C14orf177、DNAAF3、HIST2H3D、MRPL38、PPP1R32、SORCS1、WNT3A、C14orf178、DNAH1、HIST2H4A、MRPL39、PPP1R35、SORCS2、WNT4、C14orf180、DNAH10、HIST2H4B、MRPL4、PPP1R36、SORCS3、WNT5A、C14orf182、DNAH11、HIST3H2A、MRPL40、PPP1R37、SORD、WNT5B、C14orf183、DNAH12、HIST3H2BB、MRPL41、PPP1R3A、SORL1、WNT6、C14orf2、DNAH14、HIST3H3、MRPL42、PPP1R3B、SORT1、WNT7A、C14orf28、DNAH17、HIST4H4、MRPL43、PPP1R3C、SOS1、WNT7B、C14orf37、DNAH2、HIVEP1、MRPL44、PPP1R3D、SOS2、WNT8A、C14orf39、DNAH3、HIVEP2、MRPL45、PPP1R3E、SOST、WNT8B、C14orf79、DNAH5、HIVEP3、MRPL46、PPP1R3F、SOSTDC1、WNT9A、C14orf80、DNAH6、HJURP、MRPL47、PPP1R3G、SOWAHA、WNT9B、C14orf93、DNAH7、HK1、MRPL48、PPP1R42、SOWAHB、WRAP53、C15orf26、DNAH8、HK2、MRPL49、PPP1R7、SOWAHC、WRAP73、C15orf27、DNAH9、HK3、MRPL50、PPP1R8、SOWAHD、WRB、C15orf32、DNAI1、HKDC1、MRPL51、PPP1R9A、SOX1、WRN、C15orf38、DNAI2、HKR1、MRPL52、PPP1R9B、SOX10、WRNIP1、C15orf38-AP3S2、DNAJA1、HLa-a、MRPL53、PPP2CA、SOX11、WSB1、C15orf39、DNAJA2、HLA-B、MRPL54、PPP2CB、SOX12、WSB2、C15orf40、DNAJA3、HLA-C、MRPL55、PPP2R1A、SOX13、WSCD1、C15orf41、DNAJA4、HLA-DMA、MRPL9、PPP2R1B、SOX14、WSCD2、C15orf43、DNAJB1、HLA-DMB、MRPS10、PPP2R2A、SOX15、WT1、C15orf48、DNAJB11、HLA-DOA、MRPS11、PPP2R2B、SOX17、WTAP、C15orf52、DNAJB12、HLA-DOB、MRPS12、PPP2R2C、SOX18、WTH3DI、C15orf53、DNAJB13、HLA-DPA1、MRPS14、PPP2R2D、SOX2、WTIP、C15orf54、DNAJB14、HLA-DPB1、MRPS15、PPP2R3A、SOX21、WWC1、C15orf56、DNAJB2、HLA- DQA1、MRPS16、PPP2R3B、SOX3、WWC2、C15orf57、DNAJB3、HLA-DQA2、MRPS17、PPP2R3C、SOX30、WWC3、C15orf59、DNAJB4、HLA-DQB1、MRPS18A、PPP2R4、SOX4、WWOX、C15orf60、DNAJB5、HLA- DQB2、MRPS18B、PPP2R5A、SOX5、WWP1、C15orf61、DNAJB6、HLA-DRA、MRPS18C、PPP2R5B、SOX6、WWP2、C15orf62、DNAJB7、HLA-DRB1、MRPS2、PPP2R5C、SOX7、WWTR1、C15orf65、DNAJB8、HLA- DRB5、MRPS21、PPP2R5D、SOX8、XAB2、C16orf11、DNAJB9、HLA-E、MRPS22、PPP2R5E、SOX9、XAF1、C16orf13、DNAJC1、HLA-F、MRPS23、PPP3CA、SP1、XAGE1D、C16orf3、DNAJC10、HLA-G、MRPS24、PPP3CB、SP100、XAGE2、C16orf45、DNAJC11、HLCS、MRPS25、PPP3CC、SP110、XAGE3、C16orf46、DNAJC12、HLF、MRPS26、PPP3R1、SP140、XAGE5、C16orf47、DNAJC13、HLTF、MRPS27、PPP3R2、SP140L、XBP1、C16orf52、DNAJC14、HLX、MRPS28、PPP4C、SP2、XCL1、C16orf54、DNAJC15、HM13、MRPS30、PPP4R1、SP3、XCL2、C16orf58、DNAJC16、HMBOX1、MRPS31、PPP4R2、SP4、XCR1、C16orf59、DNAJC17、HMBS、MRPS33、PPP4R4、SP5、XDH、C16orf62、DNAJC18、HMCES、MRPS34、PPP5C、SP6、XG、C16orf70、DNAJC19、HMCN1、MRPS35、PPP5D1、SP7、XIAP、C16orf71、DNAJC2、HMCN2、MRPS36、
PPP6C、SP8、XIRP1、C16orf72、DNAJC21、HMG20A、MRPS5、PPP6R1、SP9、XIRP2、C16orf74、DNAJC22、HMG20B、MRPS6、PPP6R2、SPA17、XK、C16orf78、DNAJC24、HMGA1、MRPS7、PPP6R3、SPACA1、XKR3、C16orf80、DNAJC25、HMGA2、MRPS9、PPRC1、SPACA3、XKR4、C16orf82、DNAJC25-GNG10、HMGB1、MRRF、PPT1、SPACA4、XKR5、C16orf86、DNAJC27、HMGB2、MRS2、PPT2、SPACA5、XKR6、C16orf87、DNAJC28、HMGB3、MRTO4、PPTC7、SPACA5B、XKR7、C16orf89、DNAJC3、HMGB4、MRVI1、PPWD1、SPACA7、XKR8、C16orf90、DNAJC30、HMGCL、MS4A1、PPY、SPAG1、XKR9、C16orf91、DNAJC4、HMGCLL1、MS4A10、PQBP1、SPAG11A、XKRX、C16orf92、DNAJC5、HMGCR、MS4A12、PQLC1、SPAG11B、XKRY、C16orf93、DNAJC5B、HMGCS1、MS4A13、PQLC2、SPAG16、XKRY2、C16orf95、DNAJC5G、HMGCS2、MS4A14、PQLC3、SPAG17、XPA、C16orf96、DNAJC6、HMGN1、MS4A15、PRAC1、SPAG4、XPC、C16orf97、DNAJC7、HMGN2、MS4A2、PRAC2、SPAG5、XPNPEP1、C17orf100、DNAJC8、HMGN3、MS4A3、PRADC1、SPAG6、XPNPEP2、C17orf102、DNAJC9、HMGN4、MS4A4A、PRAF2、SPAG7、XPNPEP3、C17orf103、DNAL1、HMGN5、MS4A4E、PRAM1、SPAG8、XPO1、C17orf104、DNAL4、HMGXB3、MS4A5、PRAME、SPAG9、XPO4、C17orf105、DNALI1、HMGXB4、MS4A6A、PRAMEF1、SPAM1、XPO5、C17orf107、DNASE1、HMHA1、MS4A6E、PRAMEF10、SPANXA1、XPO6、C17orf112、DNASE1L1、HMHB1、MS4A7、PRAMEF11、SPANXA2、XPO7、C17orf47、DNASE1L2、HMMR、MS4A8、PRAMEF12、SPANXB1、XPOT、C17orf49、DNASE1L3、HMOX1、MSANTD1、PRAMEF13、SPANXB2、XPR1、C17orf50、DNASE2、HMOX2、MSANTD2、PRAMEF14、SPANXC、XRCC1、C17orf51、DNASE2B、HMP19、MSANTD3、PRAMEF15、SPANXD、XRCC2、C17orf53、DND1、HMSD、MSANTD3-TMEFF1、PRAMEF16、SPANXN1、XRCC3、C17orf58、DNER、HMX1、MSANTD4、PRAMEF17、SPANXN2、XRCC4、C17orf59、DNHD1、HMX2、MSC、PRAMEF18、SPANXN3、XRCC5、C17orf62、DNLZ、HMX3、MSGN1、PRAMEF19、SPANXN4、XRCC6、C17orf64、DNM1、HN1、MSH2、PRAMEF2、SPANXN5、XRCC6BP1、C17orf66、DNM1L、HN1L、MSH3、PRAMEF20、SPARC、XRN1、C17orf67、DNM2、HNF1A、MSH4、PRAMEF21、SPARCL1、XRN2、C17orf70、DNM3、HNF1B、MSH5、PRAMEF22、SPAST、XRRA1、C17orf72、DNMBP、HNF4A、MSH6、PRAMEF23、SPATA1、XXYLT1、C17orf74、DNMT1、HNF4G、MSI1、PRAMEF25、SPATA12、XYLB、C17orf75、DNMT3A、HNMT、MSI2、PRAMEF3、SPATA13、XYLT1、C17orf77、DNMT3B、HNRNPA0、MSL1、PRAMEF4、SPATA16、XYLT2、C17orf78、DNMT3L、HNRNPA1、MSL2、PRAMEF5、SPATA17、YAE1D1、C17orf80、DNPEP、HNRNPA1L2、MSL3、PRAMEF6、SPATA18、YAF2、C17orf82、DNPH1、HNRNPA2B1、MSLN、PRAMEF7、SPATA19、YAP1、C17orf85、DNTT、HNRNPA3、MSMB、PRAMEF8、SPATA2、YARS、C17orf89、DNTTIP1、HNRNPAB、MSMO1、PRAMEF9、SPATA20、YARS2、C17orf96、DNTTIP2、HNRNPC、MSMP、PRAP1、SPATA21、YBEY、C17orf97、DOC2A、HNRNPCL1、MSN、PRB1、SPATA22、YBX1、C17orf98、DOC2B、HNRNPCP5、MSR1、PRB2、SPATA24、YBX2、C17orf99、DOCK1、HNRNPD、MSRA、PRB3、SPATA25、YBX3、C18orf21、DOCK10、HNRNPDL、MSRB1、PRB4、SPATA2L、YDJC、C18orf25、DOCK11、HNRNPF、MSRB2、PRC1、SPATA3、YEATS2、C18orf32、DOCK2、HNRNPH1、MSRB3、PRCC、SPATA31A1、YEATS4、C18orf42、DOCK3、HNRNPH2、MSS51、PRCD、SPATA31A2、YES1、C18orf54、DOCK4、
HNRNPH3、MST1、PRCP、SPATA31A3、YIF1A、C18orf56、DOCK5、HNRNPK、MST1L、PRDM1、SPATA31A4、YIF1B、C18orf63、DOCK6、HNRNPL、MST1R、PRDM10、SPATA31A5、YIPF1、C18orf8、DOCK7、HNRNPLL、MST4、PRDM11、SPATA31A6、YIPF2、C19orf10、DOCK8、HNRNPM、MSTN、PRDM12、SPATA31A7、YIPF3、C19orf12、DOCK9、HNRNPR、MSTO1、PRDM13、SPATA31C1、YIPF4、C19orf18、DOHH、HNRNPU、MSX1、PRDM14、SPATA31C2、YIPF5、C19orf24、DOK1、HNRNPUL1、MSX2、PRDM15、SPATA31D1、YIPF6、C19orf25、DOK2、HNRNPUL2、MT1A、PRDM16、SPATA31D3、YIPF7、C19orf26、DOK3、HOGA1、MT1B、PRDM2、SPATA31D4、YJEFN3、C19orf33、DOK4、HOMER1、MT1E、PRDM4、SPATA31E1、YKT6、C19orf35、DOK5、HOMER2、MT1F、PRDM5、SPATA32、YLPM1、C19orf38、DOK6、HOMER3、MT1G、PRDM6、SPATA33、YME1L1、C19orf40、DOK7、HOMEZ、MT1H、PRDM7、SPATA4、YOD1、C19orf43、DOLK、HOOK1、MT1HL1、PRDM8、SPATA5、YPEL1、C19orf44、DOLPP1、HOOK2、MT1M、PRDM9、SPATA5L1、YPEL2、C19orf45、DONSON、HOOK3、MT1X、PRDX1、SPATA6、YPEL3、C19orf47、DOPEY1、HOPX、MT2A、PRDX2、SPATA6L、YPEL4、C19orf48、DOPEY2、HORMAD1、MT3、PRDX3、SPATA7、YPEL5、C19orf52、DOT1L、HORMAD2、MT4、PRDX4、SPATA8、YRDC、C19orf53、DPAGT1、HOXA1、MTA1、PRDX5、SPATA9、YTHDC1、C19orf54、DPCD、HOXA10、MTA2、PRDX6、SPATC1、YTHDC2、C19orf55、DPCR1、HOXA11、MTA3、PREB、SPATC1L、YTHDF1、C19orf57、DPEP1、HOXA13、MTAP、PRELID1、SPATS1、YTHDF2、C19orf59、DPEP2、HOXA2、MTBP、PRELID2、SPATS2、YTHDF3、C19orf60、DPEP3、HOXA3、MTCH1、PRELP、SPATS2L、YWHAB、C19orf66、DPF1、HOXA4、MTCH2、PREP、SPC24、YWHAE、C19orf67、DPF2、HOXA5、MTCP1、PREPL、SPC25、YWHAG、C19orf68、DPF3、HOXA6、MTDH、PREX1、SPCS1、YWHAH、C19orf69、DPH1、HOXA7、MTERF、PREX2、SPCS2、YWHAQ、C19orf70、DPH2、HOXA9、MTERFD1、PRF1、SPCS3、YWHAZ、C19orf71、DPH3、HOXB1、MTERFD2、PRG2、SPDEF、YY1、C19orf73、DPH3P1、HOXB13、MTERFD3、PRG3、SPDL1、YY1AP1、C19orf77、DPH5、HOXB2、MTF1、PRG4、SPDYA、YY2、C19orf80、DPH6、HOXB3、MTF2、PRH1、SPDYC、ZACN、C19orf81、DPH7、HOXB4、MTFMT、PRH2、SPDYE1、ZADH2、C19orf82、DPM1、HOXB5、MTFP1、PRICKLE1、SPDYE2、ZAK、C1D、DPM2、HOXB6、MTFR1、PRICKLE2、SPDYE2B、ZAN、C1GALT1、DPM3、HOXB7、MTFR1L、PRICKLE3、SPDYE3、ZAP70、C1GALT1C1、DPP10、HOXB8、MTFR2、PRICKLE4、SPDYE4、ZAR1、C1orf100、DPP3、HOXB9、MTG1、PRIM1、SPDYE5、ZAR1L、C1orf101、DPP4、HOXC10、MTG2、PRIM2、SPDYE6、ZBBX、C1orf105、DPP6、HOXC11、MTHFD1、PRIMA1、SPECC1、ZBED1、C1orf106、DPP7、HOXC12、MTHFD1L、PRIMPOL、SPECC1L、ZBED2、C1orf109、DPP8、HOXC13、MTHFD2、PRKAA1、SPEF1、ZBED3、C1orf110、DPP9、HOXC4、MTHFD2L、PRKAA2、SPEF2、ZBED4、C1orf111、DPPA2、HOXC5、MTHFR、PRKAB1、SPEG、ZBED5、C1orf112、DPPA3、HOXC6、MTHFS、PRKAB2、SPEM1、ZBED6、C1orf115、DPPA4、HOXC8、MTHFSD、PRKACA、SPEN、ZBED6CL、C1orf116、DPPA5、HOXC9、MTIF2、PRKACB、SPERT、ZBP1、C1orf122、DPRX、HOXD1、MTIF3、PRKACG、SPESP1、ZBTB1、C1orf123、DPT、HOXD10、MTL5、PRKAG1、SPG11、ZBTB10、C1orf127、DPY19L1、HOXD11、MTM1、PRKAG2、SPG20、ZBTB11、C1orf131、DPY19L2、HOXD12、MTMR1、PRKAG3、SPG21、ZBTB12、C1orf137、DPY19L3、HOXD13、
MTMR10、PRKAR1A、SPG7、ZBTB14、C1orf141、DPY19L4、HOXD3、MTMR11、PRKAR1B、SPHAR、ZBTB16、C1orf146、DPY30、HOXD4、MTMR12、PRKAR2A、SPHK1、ZBTB17、C1orf158、DPYD、HOXD8、MTMR14、PRKAR2B、SPHK2、ZBTB18、C1orf159、DPYS、HOXD9、MTMR2、PRKCA、SPHKAP、ZBTB2、C1orf162、DPYSL2、HP、MTMR3、PRKCB、SPI1、ZBTB20、C1orf167、DPYSL3、HP1BP3、MTMR4、PRKCD、SPIB、ZBTB21、C1orf168、DPYSL4、HPCA、MTMR6、PRKCDBP、SPIC、ZBTB22、C1orf172、DPYSL5、HPCAL1、MTMR7、PRKCE、SPICE1、ZBTB24、C1orf173、DQX1、HPCAL4、MTMR8、PRKCG、SPIDR、ZBTB25、C1orf174、DR1、HPD、MTMR9、PRKCH、SPIN1、ZBTB26、C1orf177、DRAM1、HPDL、MTNR1A、PRKCI、SPIN2A、ZBTB3、C1orf185、DRAM2、HPGD、MTNR1B、PRKCQ、SPIN2B、ZBTB32、C1orf186、DRAP1、HPGDS、MTO1、PRKCSH、SPIN3、ZBTB33、C1orf189、DRAXIN、HPN、MTOR、PRKCZ、SPIN4、ZBTB34、C1orf192、DRC1、HPR、MTPAP、PRKD1、SPINK1、ZBTB37、C1orf194、DRD1、HPRT1、MTPN、PRKD2、SPINK13、ZBTB38、C1orf195、DRD2、HPS1、MTR、PRKD3、SPINK14、ZBTB39、C1orf198、DRD3、HPS3、MTRF1、PRKDC、SPINK2、ZBTB4、C1orf204、DRD4、HPS4、MTRF1L、PRKG1、SPINK4、ZBTB40、C1orf21、DRD5、HPS5、MTRNR2L10、PRKG2、SPINK5、ZBTB41、C1orf210、DRG1、HPS6、MTRNR2L2、PRKRA、SPINK6、ZBTB42、C1orf216、DRG2、HPSE、MTRNR2L3、PRKRIP1、SPINK7、ZBTB43、C1orf226、DRGX、HPSE2、MTRNR2L4、PRKRIR、SPINK8、ZBTB44、C1orf227、DROSHA、HPX、MTRNR2L5、PRKX、SPINK9、ZBTB45、C1orf228、DRP2、HR、MTRNR2L6、PRL、SPINT1、ZBTB46、C1orf229、DSC1、HRAS、MTRNR2L7、PRLH、SPINT2、ZBTB47、C1orf233、DSC2、HRASLS、MTRNR2L9、PRLHR、SPINT3、ZBTB48、C1orf234、DSC3、HRASLS2、MTRR、PRLR、SPINT4、ZBTB49、C1orf27、DSCAM、HRASLS5、MTSS1、PRM1、SPIRE1、ZBTB5、C1orf35、DSCAML1、HRC、MTSS1L、PRM2、SPIRE2、ZBTB6、C1orf43、DSCC1、HRCT1、MTTP、PRM3、SPN、ZBTB7A、C1orf50、DSCR3、HRG、MTURN、PRMT1、SPNS1、ZBTB7B、C1orf51、DSCR4、HRH1、MTUS1、PRMT10、SPNS2、ZBTB7C、C1orf52、DSE、HRH2、MTUS2、PRMT2、SPNS3、ZBTB8A、C1orf53、DSEL、HRH3、MTX1、PRMT3、SPO11、ZBTB8B、C1orf54、DSG1、HRH4、MTX2、PRMT5、SPOCD1、ZBTB8OS、C1orf56、DSG2、HRK、MTX3、PRMT6、SPOCK1、ZBTB9、C1orf61、DSG3、HRNR、MUC1、PRMT7、SPOCK2、ZC2HC1A、C1orf63、DSG4、HRSP12、MUC12、PRMT8、SPOCK3、ZC2HC1B、C1orf64、DSN1、HS1BP3、MUC13、PRND、SPON1、ZC2HC1C、C1orf65、DSP、HS2ST1、MUC15、PRNP、SPON2、ZC3H10、C1orf68、DSPP、HS3ST1、MUC16、PROB1、SPOP、ZC3H11A、C1orf74、DST、HS3ST2、MUC17、PROC、SPOPL、ZC3H12A、C1orf85、DSTN、HS3ST3A1、MUC19、PROCA1、SPP1、ZC3H12B、C1orf86、DSTNP4、HS3ST3B1、MUC2、PROCR、SPP2、ZC3H12C、C1orf87、DSTYK、HS3ST4、MUC20、PRODH、SPPL2A、ZC3H12D、C1orf94、DTD1、HS3ST5、MUC21、PRODH2、SPPL2B、ZC3H13、C1orf95、DTD2、HS3ST6、MUC22、PROK1、SPPL2C、ZC3H14、C1QA、DTHD1、HS6ST1、MUC3A、PROK2、SPPL3、ZC3H15、C1QB、DTL、HS6ST2、MUC4、PROKR1、SPR、ZC3H18、C1QBP、DTNA、HS6ST3、MUC5AC、PROKR2、SPRED1、ZC3H3、C1QC、DTNB、HSBP1、MUC5B、PROL1、SPRED2、ZC3H4、C1QL1、DTNBP1、HSBP1L1、MUC6、PROM1、SPRED3、ZC3H6、C1QL2、DTWD1、HSCB、MUC7、PROM2、SPRN、ZC3H7A、C1QL3、DTWD2、HSD11B1、MUC8、PROP1、SPRR1A、ZC3H7B、
C1QL4、DTX1、HSD11B1L、MUCL1、PROS1、SPRR1B、ZC3H8、C1QTNF1、DTX2、HSD11B2、MUL1、PROSC、SPRR2A、ZC3HAV1、C1QTNF2、DTX3、HSD17B1、MUM1、PROSER1、SPRR2B、ZC3HAV1L、C1QTNF3、DTX3L、HSD17B10、MUM1L1、PROSER2、SPRR2D、ZC3HC1、C1QTNF4、DTX4、HSD17B11、MURC、PROX1、SPRR2E、ZC4H2、C1QTNF5、DTYMK、HSD17B12、MUS81、PROX2、SPRR2F、ZCCHC10、C1QTNF6、DUOX1、HSD17B13、MUSK、PROZ、SPRR2G、ZCCHC11、C1QTNF7、DUOX2、HSD17B14、MUSTN1、PRPF18、SPRR3、ZCCHC12、C1QTNF8、DUOXA1、HSD17B2、MUT、PRPF19、SPRR4、ZCCHC13、C1QTNF9、DUOXA2、HSD17B3、MUTYH、PRPF3、SPRTN、ZCCHC14、C1QTNF9B、DUPD1、HSD17B4、MVB12A、PRPF31、SPRY1、ZCCHC16、C1QTNF9B-AS1、DUS1L、HSD17B6、MVB12B、PRPF38A、SPRY2、ZCCHC17、C1R、DUS2、HSD17B7、MVD、PRPF38B、SPRY3、ZCCHC18、C1RL、DUS3L、HSD17B8、MVK、PRPF39、SPRY4、ZCCHC2、C1S、DUS4L、HSD3B1、MVP、PRPF4、SPRYD3、ZCCHC24、C2、DUSP1、HSD3B2、MX1、PRPF40A、SPRYD4、ZCCHC3、C20orf112、DUSP10、HSD3B7、MX2、PRPF40B、SPRYD7、ZCCHC4、C20orf141、DUSP11、HSDL1、MXD1、PRPF4B、SPSB1、ZCCHC5、C20orf144、DUSP12、HSDL2、MXD3、PRPF6、SPSB2、ZCCHC6、C20orf166、DUSP13、HSF1、MXD4、PRPF8、SPSB3、ZCCHC7、C20orf173、DUSP14、HSF2、MXI1、PRPH、SPSB4、ZCCHC8、C20orf194、DUSP15、HSF2BP、MXRA5、PRPH2、SPTA1、ZCCHC9、C20orf195、DUSP16、HSF4、MXRA7、PRPS1、SPTAN1、ZCRB1、C20orf196、DUSP18、HSF5、MXRA8、PRPS1L1、SPTB、ZCWPW1、C20orf197、DUSP19、HSFX1、MYADM、PRPS2、SPTBN1、ZCWPW2、C20orf201、DUSP2、HSFX2、MYADML2、PRPSAP1、SPTBN2、ZDBF2、C20orf202、DUSP21、HSFY1、MYB、PRPSAP2、SPTBN4、ZDHHC1、C20orf203、DUSP22、HSFY2、MYBBP1A、PRR11、SPTBN5、ZDHHC11、C20orf24、DUSP23、HSH2D、MYBL1、PRR12、SPTLC1、ZDHHC11B、C20orf26、DUSP26、HSP90AA1、MYBL2、PRR13、SPTLC2、ZDHHC12、C20orf27、DUSP27、HSP90AB1、MYBPC1、PRR14、SPTLC3、ZDHHC13、C20orf62、DUSP28、HSP90B1、MYBPC2、PRR14L、SPTSSA、ZDHHC14、C20orf78、DUSP3、HSPA12A、MYBPC3、PRR15、SPTSSB、ZDHHC15、C20orf85、DUSP4、HSPA12B、MYBPH、PRR15L、SPTY2D1、ZDHHC16、C20orf96、DUSP5、HSPA13、MYBPHL、PRR16、SPZ1、ZDHHC17、C21orf140、DUSP6、HSPA14、MYC、PRR18、SQLE、ZDHHC18、C21orf2、DUSP7、HSPA1A、MYCBP、PRR19、SQRDL、ZDHHC19、C21orf33、DUSP8、HSPA1B、MYCBP2、PRR20A、SQSTM1、ZDHHC2、C21orf58、DUSP9、HSPA1L、MYCBPAP、PRR20B、SRA1、ZDHHC20、C21orf59、DUT、HSPA2、MYCL、PRR20C、SRBD1、ZDHHC21、C21orf62、DUXA、HSPA4、MYCN、PRR20D、SRC、ZDHHC22、C21orf91、DVL1、HSPA4L、MYCT1、PRR20E、SRCAP、ZDHHC23、C22orf15、DVL2、HSPA5、MYD88、PRR21、SRCIN1、ZDHHC24、C22orf23、DVL3、HSPA6、MYEF2、PRR22、SRCRB4D、ZDHHC3、C22orf24、DXO、HSPA8、MYEOV、PRR23A、SRD5A1、ZDHHC4、C22orf26、DYDC1、HSPA9、MYEOV2、PRR23B、SRD5A2、ZDHHC5、C22orf29、DYDC2、HSPB1、MYF5、PRR23C、SRD5A3、ZDHHC6、C22orf31、DYM、HSPB11、MYF6、PRR24、SREBF1、ZDHHC7、C22orf39、DYNAP、HSPB2、MYH1、PRR25、SREBF2、ZDHHC8、C22orf42、DYNC1H1、HSPB3、MYH10、PRR3、SREK1、ZDHHC9、C22orf43、DYNC1I1、HSPB6、MYH11、PRR30、SREK1IP1、ZEB1、C22orf46、DYNC1I2、HSPB7、MYH13、PRR4、SRF、ZEB2、C2CD2、DYNC1LI1、HSPB8、
MYH14、PRR5、SRFBP1、ZER1、C2CD2L、DYNC1LI2、HSPB9、MYH15、PRR5-ARHGAP8、SRGAP1、ZFAND1、C2CD3、DYNC2H1、HSPBAP1、MYH2、PRR5L、SRGAP2、ZFAND2A、C2CD4A、DYNC2LI1、HSPBP1、MYH3、PRR7、SRGAP2B、ZFAND2B、C2CD4B、DYNLL1、HSPD1、MYH4、PRR9、SRGAP2C、ZFAND3、C2CD4C、DYNLL2、HSPE1、MYH6、PRRC1、SRGAP3、ZFAND4、C2CD4D、DYNLRB1、HSPE1-MOB4、MYH7、PRRC2A、SRGN、ZFAND5、C2CD5、DYNLRB2、HSPG2、MYH7B、PRRC2B、SRI、ZFAND6、C2orf15、DYNLT1、HSPH1、MYH8、PRRC2C、SRL、ZFAT、C2orf16、DYNLT3、HTATIP2、MYH9、PRRG1、SRM、ZFC3H1、C2orf27A、DYRK1A、HTATSF1、MYL1、PRRG2、SRMS、ZFHX2、C2orf27B、DYRK1B、HTN1、MYL10、PRRG3、SRP14、ZFHX3、C2orf40、DYRK2、HTN3、MYL12A、PRRG4、SRP19、ZFHX4、C2orf42、DYRK3、HTR1A、MYL12B、PRRT1、SRP54、ZFP1、C2orf43、DYRK4、HTR1B、MYL2、PRRT2、SRP68、ZFP14、C2orf44、DYSF、HTR1D、MYL3、PRRT3、SRP72、ZFP2、C2orf47、DYTN、HTR1E、MYL4、PRRT4、SRP9、ZFP28、C2orf48、DYX1C1、HTR1F、MYL5、PRRX1、SRPK1、ZFP3、C2orf49、DZANK1、HTR2A、MYL6、PRRX2、SRPK2、ZFP30、C2orf50、DZIP1、HTR2B、MYL6B、PRSS1、SRPK3、ZFP36、C2orf53、DZIP1L、HTR2C、MYL7、PRSS12、SRPR、ZFP36L1、C2orf54、DZIP3、HTR3A、MYL9、PRSS16、SRPRB、ZFP36L2、C2orf57、E2F1、HTR3B、MYLIP、PRSS2、SRPX、ZFP37、C2orf61、E2F2、HTR3C、MYLK、PRSS21、SRPX2、ZFP41、C2orf62、E2F3、HTR3D、MYLK2、PRSS22、SRR、ZFP42、C2orf66、E2F4、HTR3E、MYLK3、PRSS23、SRRD、ZFP57、C2orf68、E2F5、HTR4、MYLK4、PRSS27、SRRM1、ZFP62、C2orf69、E2F6、HTR5A、MYLPF、PRSS3、SRRM2、ZFP64、C2orf70、E2F7、HTR6、MYNN、PRSS33、SRRM3、ZFP69、C2orf71、E2F8、HTR7、MYO10、PRSS35、SRRM4、ZFP69B、C2orf72、E4F1、HTRA1、MYO15A、PRSS36、SRRM5、ZFP82、C2orf73、EAF1、HTRA2、MYO16、PRSS37、SRRT、ZFP90、C2orf74、EAF2、HTRA3、MYO18A、PRSS38、SRSF1、ZFP91、C2orf76、EAPP、HTRA4、MYO18B、PRSS41、SRSF10、ZFP92、C2orf78、EARS2、HTT、MYO19、PRSS42、SRSF11、ZFPL1、C2orf80、EBAG9、HUNK、MYO1A、PRSS44、SRSF12、ZFPM1、C2orf81、EBF1、HUS1、MYO1B、PRSS45、SRSF2、ZFPM2、C2orf82、EBF2、HUS1B、MYO1C、PRSS46、SRSF3、ZFR、C2orf83、EBF3、HUWE1、MYO1D、PRSS48、SRSF4、ZFR2、C2orf88、EBF4、HVCN1、MYO1E、PRSS50、SRSF5、ZFX、C2orf91、EBI3、HYAL1、MYO1F、PRSS53、SRSF6、ZFY、C3、EBLN1、HYAL2、MYO1G、PRSS54、SRSF7、ZFYVE1、C3AR1、EBLN2、HYAL3、MYO1H、PRSS55、SRSF8、ZFYVE16、C3orf14、EBNA1BP2、HYAL4、MYO3A、PRSS56、SRSF9、ZFYVE19、C3orf17、EBP、HYDIN、MYO3B、PRSS57、SRXN1、ZFYVE20、C3orf18、EBPL、HYI、MYO5A、PRSS58、SRY、ZFYVE21、C3orf20、ECD、HYKK、MYO5B、PRSS8、SS18、ZFYVE26、C3orf22、ECE1、HYLS1、MYO5C、PRTFDC1、SS18L1、ZFYVE27、C3orf27、ECE2、HYOU1、MYO6、PRTG、SS18L2、ZFYVE28、C3orf30、ECEL1、HYPK、MYO7A、PRTN3、SSB、ZFYVE9、C3orf33、ECH1、IAH1、MYO7B、PRUNE、SSBP1、ZG16、C3orf35、ECHDC1、IAPP、MYO9A、PRUNE2、SSBP2、ZG16B、C3orf36、ECHDC2、IARS、MYO9B、PRX、SSBP3、ZGLP1、C3orf38、ECHDC3、IARS2、MYOC、PRY、SSBP4、ZGPAT、C3orf43、ECHS1、IBA57、MYOCD、PRY2、SSC5D、ZHX1、C3orf52、ECI1、IBSP、MYOD1、PSAP、SSFA2、ZHX1-C8ORF76、C3orf55、ECI2、IBTK、MYOF、PSAPL1、SSH1、ZHX2、C3orf56、ECM1、ICA1、MYOG、PSAT1、SSH2、
ZHX3、C3orf58、ECM2、ICA1L、MYOM1、PSCA、SSH3、ZIC1、C3orf62、ECSCR、ICAM1、MYOM2、PSD、SSMEM1、ZIC2、C3orf67、ECSIT、ICAM2、MYOM3、PSD2、SSNA1、ZIC3、C3orf70、ECT2、ICAM3、MYOT、PSD3、SSPN、ZIC4、C3orf72、ECT2L、ICAM4、MYOZ1、PSD4、SSPO、ZIC5、C3orf79、EDA、ICAM5、MYOZ2、PSEN1、SSR1、ZIK1、C3orf80、EDA2R、ICK、MYOZ3、PSEN2、SSR2、ZIM2、C3orf83、EDAR、ICMT、MYPN、PSENEN、SSR3、ZIM3、C3orf84、EDARADD、ICOS、MYPOP、PSG1、SSR4、ZKSCAN1、C4A、EDC3、ICOSLG、MYRF、PSG11、SSRP1、ZKSCAN2、C4B、EDC4、ICT1、MYRFL、PSG2、SSSCA1、ZKSCAN3、C4B_2、EDDM3A、ID1、MYRIP、PSG3、SST、ZKSCAN4、C4BPA、EDDM3B、ID2、MYSM1、PSG4、SSTR1、ZKSCAN5、C4BPB、EDEM1、ID3、MYT1、PSG5、SSTR2、ZKSCAN7、C4orf17、EDEM2、ID4、MYT1L、PSG6、SSTR3、ZKSCAN8、C4orf19、EDEM3、IDE、MYZAP、PSG7、SSTR4、ZMAT1、C4orf21、EDF1、IDH1、MZB1、PSG8、SSTR5、ZMAT2、C4orf22、EDIL3、IDH2、MZF1、PSG9、SSU72、ZMAT3、C4orf26、EDN1、IDH3A、MZT1、PSIP1、SSUH2、ZMAT4、C4orf27、EDN2、IDH3B、MZT2A、PSKH1、SSX1、ZMAT5、C4orf29、EDN3、IDH3G、MZT2B、PSKH2、SSX2、ZMIZ1、C4orf3、EDNRA、IDI1、N4BP1、PSMA1、SSX2B、ZMIZ2、C4orf32、EDNRB、IDI2、N4BP2、PSMA2、SSX2IP、ZMPSTE24、C4orf33、EEA1、IDNK、N4BP2L1、PSMA3、SSX3、ZMYM1、C4orf36、EED、IDO1、N4BP2L2、PSMA4、SSX4、ZMYM2、C4orf40、EEF1A1、IDO2、N4BP3、PSMA5、SSX4B、ZMYM3、C4orf45、EEF1A2、IDS、N6AMT1、PSMA6、SSX5、ZMYM4、C4orf46、EEF1B2、IDUA、N6AMT2、PSMA7、SSX7、ZMYM5、C4orf47、EEF1D、IER2、NAA10、PSMA8、ST13、ZMYM6、C4orf48、EEF1E1、IER3、NAA11、PSMB1、ST14、ZMYM6NB、C4orf50、EEF1G、IER3IP1、NAA15、PSMB10、ST18、ZMYND10、C4orf51、EEF2、IER5、NAA16、PSMB11、ST20、ZMYND11、C4orf6、EEF2K、IER5L、NAA20、PSMB2、ST20-MTHFS、ZMYND12、C5、EEFSEC、IFFO1、NAA25、PSMB3、ST3GAL1、ZMYND15、C5AR1、EEPD1、IFFO2、NAA30、PSMB4、ST3GAL2、ZMYND19、C5AR2、EFCAB1、IFI16、NAA35、PSMB5、ST3GAL3、ZMYND8、C5orf15、EFCAB11、IFI27、NAA38、PSMB6、ST3GAL4、ZNF10、C5orf20、EFCAB12、IFI27L1、NAA40、PSMB7、ST3GAL5、ZNF100、C5orf22、EFCAB13、IFI27L2、NAA50、PSMB8、ST3GAL6、ZNF101、C5orf24、EFCAB14、IFI30、NAA60、PSMB9、ST5、ZNF106、C5orf28、EFCAB2、IFI35、NAAA、PSMC1、ST6GAL1、ZNF107、C5orf30、EFCAB3、IFI44、NAALAD2、PSMC2、ST6GAL2、ZNF112、C5orf34、EFCAB4A、IFI44L、NAALADL1、PSMC3、ST6GALNAC1、ZNF114、C5orf38、EFCAB4B、IFI6、NAALADL2、PSMC3IP、ST6GALNAC2、ZNF117、C5orf42、EFCAB5、IFIH1、NAB1、PSMC4、ST6GALNAC3、ZNF12、C5orf45、EFCAB6、IFIT1、NAB2、PSMC5、ST6GALNAC4、ZNF121、C5orf46、EFCAB7、IFIT1B、NABP1、PSMC6、ST6GALNAC5、ZNF124、C5orf47、EFCAB8、IFIT2、NABP2、PSMD1、ST6GALNAC6、ZNF131、C5orf48、EFCAB9、IFIT3、NACA、PSMD10、ST7、ZNF132、C5orf49、EFCC1、IFIT5、NACA2、PSMD11、ST7L、ZNF133、C5orf50、EFEMP1、IFITM1、NACAD、PSMD12、ST8SIA1、ZNF134、C5orf51、EFEMP2、IFITM10、NACC1、PSMD13、ST8SIA2、ZNF135、C5orf52、EFHB、IFITM2、NACC2、PSMD14、ST8SIA3、ZNF136、C5orf55、EFHC1、IFITM3、NADK、PSMD2、ST8SIA4、ZNF138、C5orf58、EFHC2、IFITM5、NADK2、PSMD3、ST8SIA5、ZNF14、C5orf60、EFHD1、IFLTD1、NADSYN1、PSMD4、ST8SIA6、ZNF140、C5orf63、EFHD2、
IFNA1、NAE1、PSMD5、STAB1、ZNF141、C5orf64、EFNA1、IFNA10、NAF1、PSMD6、STAB2、ZNF142、C6、EFNA2、IFNA13、NAGA、PSMD7、STAC、ZNF143、C6orf1、EFNA3、IFNA14、NAGK、PSMD8、STAC2、ZNF146、C6orf10、EFNA4、IFNA16、NAGLU、PSMD9、STAC3、ZNF148、C6orf106、EFNA5、IFNA17、NAGPA、PSME1、STAG1、ZNF154、C6orf118、EFNB1、IFNA2、NAGS、PSME2、STAG2、ZNF155、C6orf132、EFNB2、IFNA21、NAIF1、PSME3、STAG3、ZNF157、C6orf136、EFNB3、IFNA4、NAIP、PSME4、STAM、ZNF16、C6orf141、EFR3A、IFNA5、NALCN、PSMF1、STAM2、ZNF160、C6orf15、EFR3B、IFNA6、NAMPT、PSMG1、STAMBP、ZNF165、C6orf163、EFS、IFNA7、NANOG、PSMG2、STAMBPL1、ZNF169、C6orf165、EFTUD1、IFNA8、NANOGNB、PSMG3、STAP1、ZNF17、C6orf183、EFTUD2、IFNAR1、NANOS1、PSMG4、STAP2、ZNF174、C6orf195、EGF、IFNAR2、NANOS2、PSORS1C1、STAR、ZNF175、C6orf201、EGFL6、IFNB1、NANOS3、PSORS1C2、STARD10、ZNF177、C6orf203、EGFL7、IFNE、NANP、PSPC1、STARD13、ZNF18、C6orf211、EGFL8、IFNG、NANS、PSPH、STARD3、ZNF180、C6orf222、EGFLAM、IFNGR1、NAP1L1、PSPN、STARD3NL、ZNF181、C6orf223、EGFR、IFNGR2、NAP1L2、PSRC1、STARD4、ZNF182、C6orf226、EGLN1、IFNK、NAP1L3、PSTK、STARD5、ZNF184、C6orf25、EGLN2、IFNL1、NAP1L4、PSTPIP1、STARD6、ZNF185、C6orf47、EGLN3、IFNL2、NAP1L5、PSTPIP2、STARD7、ZNF189、C6orf48、EGR1、IFNL3、NAPA、PTAFR、STARD8、ZNF19、C6orf52、EGR2、IFNL4、NAPB、PTAR1、STARD9、ZNF195、C6orf57、EGR3、IFNLR1、NAPEPLD、PTBP1、STAT1、ZNF197、C6orf58、EGR4、IFNW1、NAPG、PTBP2、STAT2、ZNF2、C6orf62、EHBP1、IFRD1、NAPRT1、PTBP3、STAT3、ZNF20、C6orf7、EHBP1L1、IFRD2、NAPSA、PTCD1、STAT4、ZNF200、C6orf89、EHD1、IFT122、NARF、PTCD2、STAT5A、ZNF202、C6orf99、EHD2、IFT140、NARFL、PTCD3、STAT5B、ZNF205、C7、EHD3、IFT172、NARG2、PTCH1、STAT6、ZNF207、C7orf10、EHD4、IFT20、NARR、PTCH2、STATH、ZNF208、C7orf25、EHF、IFT27、NARS、PTCHD1、STAU1、ZNF211、C7orf26、EHHADH、IFT43、NARS2、PTCHD2、STAU2、ZNF212、C7orf31、EHMT1、IFT46、NASP、PTCHD3、STBD1、ZNF213、C7orf33、EHMT2、IFT52、NAT1、PTCHD4、STC1、ZNF214、C7orf34、EI24、IFT57、NAT10、PTCRA、STC2、ZNF215、C7orf43、EID1、IFT74、NAT14、PTDSS1、STEAP1、ZNF217、C7orf49、EID2、IFT80、NAT16、PTDSS2、STEAP1B、ZNF219、C7orf50、EID2B、IFT81、NAT2、PTEN、STEAP2、ZNF22、C7orf55、EID3、IFT88、NAT6、PTER、STEAP3、ZNF221、C7orf55- LUC7L2、EIF1、IGBP1、NAT8、PTF1A、STEAP4、ZNF222、C7orf57、EIF1AD、IGDCC3、NAT8B、PTGDR、STH、ZNF223、C7orf60、EIF1AX、IGDCC4、NAT8L、PTGDR2、STIL、ZNF224、C7orf61、EIF1AY、IGF1、NAT9、PTGDS、STIM1、ZNF225、C7orf62、EIF1B、IGF1R、NAV1、PTGER1、STIM2、ZNF226、C7orf63、EIF2A、IGF2、NAV2、PTGER2、STIP1、ZNF227、C7orf65、EIF2AK1、IGF2BP1、NAV3、PTGER3、STK10、ZNF229、C7orf66、EIF2AK2、IGF2BP2、NBAS、PTGER4、STK11、ZNF23、C7orf69、EIF2AK3、IGF2BP3、NBEA、PTGES、STK11IP、ZNF230、C7orf71、EIF2AK4、IGF2R、NBEAL1、PTGES2、STK16、ZNF232、C7orf72、EIF2B1、IGFALS、NBEAL2、PTGES3、STK17A、ZNF233、C7orf73、EIF2B2、IGFBP1、NBL1、PTGES3L、STK17B、ZNF234、C7orf76、EIF2B3、IGFBP2、NBN、PTGES3L-AARSD1、STK19、ZNF235、C8A、
EIF2B4、IGFBP3、NBPF14、PTGFR、STK24、ZNF236、C8B、EIF2B5、IGFBP4、NBPF3、PTGFRN、STK25、ZNF239、C8G、EIF2D、IGFBP5、NBPF4、PTGIR、STK3、ZNF24、C8orf22、EIF2S1、IGFBP6、NBPF6、PTGIS、STK31、ZNF248、C8orf31、EIF2S2、IGFBP7、NBPF7、PTGR1、STK32A、ZNF25、C8orf33、EIF2S3、IGFBPL1、NBR1、PTGR2、STK32B、ZNF250、C8orf34、EIF3A、IGFL1、NCALD、PTGS1、STK32C、ZNF251、C8orf37、EIF3B、IGFL2、NCAM1、PTGS2、STK33、ZNF253、C8orf4、EIF3C、IGFL3、NCAM2、PTH、STK35、ZNF254、C8orf44、EIF3CL、IGFL4、NCAN、PTH1R、STK36、ZNF256、C8orf44-SGK3、EIF3D、IGFLR1、NCAPD2、PTH2、STK38、ZNF257、C8orf46、EIF3E、IGFN1、NCAPD3、PTH2R、STK38L、ZNF259、C8orf47、EIF3F、IGHMBP2、NCAPG、PTHLH、STK39、ZNF26、C8orf48、EIF3G、IGIP、NCAPG2、PTK2、STK4、ZNF260、C8orf58、EIF3H、IGJ、NCAPH、PTK2B、STK40、ZNF263、C8orf59、EIF3I、IGLL1、NCAPH2、PTK6、STMN1、ZNF264、C8orf74、EIF3J、IGLL5、NCBP1、PTK7、STMN2、ZNF266、C8orf76、EIF3K、IGLON5、NCBP2、PTMA、STMN3、ZNF267、C8orf82、EIF3L、IGSF1、NCCRP1、PTMS、STMN4、ZNF268、C8orf86、EIF3M、IGSF10、NCDN、PTN、STMND1、ZNF273、C8orf87、EIF4A1、IGSF11、NCEH1、PTOV1、STOM、ZNF274、C9、EIF4A2、IGSF21、NCF1、PTP4A1、STOML1、ZNF275、C9orf106、EIF4A3、IGSF22、NCF2、PTP4A2、STOML2、ZNF276、C9orf114、EIF4B、IGSF23、NCF4、PTP4A3、STOML3、ZNF277、C9orf116、EIF4E、IGSF3、NCK1、PTPDC1、STON1、ZNF28、C9orf117、EIF4E1B、IGSF5、NCK2、PTPLA、STON1-GTF2A1L、ZNF280A、C9orf129、EIF4E2、IGSF6、NCKAP1、PTPLAD1、STON2、ZNF280B、C9orf131、EIF4E3、IGSF8、NCKAP1L、PTPLAD2、STOX1、ZNF280C、C9orf135、EIF4EBP1、IGSF9、NCKAP5、PTPLB、STOX2、ZNF280D、C9orf139、EIF4EBP2、IGSF9B、NCKAP5L、PTPMT1、STPG1、ZNF281、C9orf142、EIF4EBP3、IHH、NCKIPSD、PTPN1、STPG2、ZNF282、C9orf152、EIF4ENIF1、IK、NCL、PTPN11、STRA13、ZNF283、C9orf153、EIF4G1、IKBIP、NCLN、PTPN12、STRA6、ZNF284、C9orf156、EIF4G2、IKBKAP、NCMAP、PTPN13、STRA8、ZNF285、C9orf16、EIF4G3、IKBKB、NCOA1、PTPN14、STRADA、ZNF286A、C9orf163、EIF4H、IKBKE、NCOA2、PTPN18、STRADB、ZNF286B、C9orf169、EIF5、IKBKG、NCOA3、PTPN2、STRAP、ZNF287、C9orf170、EIF5A、IKZF1、NCOA4、PTPN20A、STRBP、ZNF292、C9orf171、EIF5A2、IKZF2、NCOA5、PTPN20B、STRC、ZNF296、C9orf173、EIF5AL1、IKZF3、NCOA6、PTPN21、STRIP1、ZNF3、C9orf24、EIF5B、IKZF4、NCOA7、PTPN22、STRIP2、ZNF30、C9orf3、EIF6、IKZF5、NCOR1、PTPN23、STRN、ZNF300、C9orf37、ELAC1、INTERLEUKIN、IL10、NCOR2、PTPN3、STRN3、ZNF302、C9orf40、ELAC2、IL10RA、NCR1、PTPN4、STRN4、ZNF304、C9orf41、ELANE、IL10RB、NCR2、PTPN5、STS、ZNF311、C9orf43、ELAVL1、IL11、NCR3、PTPN6、STT3A、ZNF316、C9orf47、ELAVL2、IL11RA、NCR3LG1、PTPN7、STT3B、ZNF317、C9orf50、ELAVL3、IL12A、NCS1、PTPN9、STUB1、ZNF318、C9orf57、ELAVL4、IL12B、NCSTN、PTPRA、STX10、ZNF319、C9orf62、ELF1、IL12RB1、NDC1、PTPRB、STX11、ZNF32、C9orf64、ELF2、IL12RB2、NDC80、PTPRC、STX12、ZNF320、C9orf66、ELF3、IL13、NDE1、PTPRCAP、STX16、ZNF322、C9orf69、ELF4、IL13RA1、NDEL1、PTPRD、STX17、ZNF324、C9orf72、ELF5、IL13RA2、NDFIP1、PTPRE、STX18、ZNF324B、C9orf78、ELFN1、IL15、NDFIP2、PTPRF、STX19、
ZNF326、C9orf84、ELFN2、IL15RA、NDN、PTPRG、STX1A、ZNF329、C9orf85、ELK1、IL16、NDNF、PTPRH、STX1B、ZNF330、C9orf89、ELK3、IL17A、NDNL2、PTPRJ、STX2、ZNF331、C9orf9、ELK4、IL17B、NDOR1、PTPRK、STX3、ZNF333、C9orf91、ELL、IL17C、NDP、PTPRM、STX4、ZNF334、C9orf92、ELL2、IL17D、NDRG1、PTPRN、STX5、ZNF335、C9orf96、ELL3、IL17F、NDRG2、PTPRN2、STX6、ZNF337、CA1、ELMO1、IL17RA、NDRG3、PTPRO、STX7、ZNF33A、CA10、ELMO2、IL17RB、NDRG4、PTPRQ、STX8、ZNF33B、CA11、ELMO3、IL17RC、NDST1、PTPRR、STXBP1、ZNF34、CA12、ELMOD1、IL17RD、NDST2、PTPRS、STXBP2、ZNF341、CA13、ELMOD2、IL17RE、NDST3、PTPRT、STXBP3、ZNF343、CA14、ELMOD3、IL17REL、NDST4、PTPRU、STXBP4、ZNF345、CA2、ELMSAN1、IL18、NDUFA1、PTPRZ1、STXBP5、ZNF346、CA3、ELN、IL18BP、NDUFA10、PTRF、STXBP5L、ZNF347、CA4、ELOF1、IL18R1、NDUFA11、PTRH1、STXBP6、ZNF35、CA5A、ELOVL1、IL18RAP、NDUFA12、PTRH2、STYK1、ZNF350、CA5B、ELOVL2、IL19、NDUFA13、PTRHD1、STYX、ZNF354A、CA6、ELOVL3、IL1A、NDUFA2、PTS、STYXL1、ZNF354B、CA7、ELOVL4、IL1B、NDUFA3、PTTG1、SUB1、ZNF354C、CA8、ELOVL5、IL1F10、NDUFA4、PTTG1IP、SUCLA2、ZNF358、CA9、ELOVL6、IL1R1、NDUFA4L2、PTTG2、SUCLG1、ZNF362、CAAP1、ELOVL7、IL1R2、NDUFA5、PTX3、SUCLG2、ZNF365、CAB39、ELP2、IL1RAP、NDUFA6、PTX4、SUCNR1、ZNF366、CAB39L、ELP3、IL1RAPL1、NDUFA7、PUF60、SUCO、ZNF367、CABIN1、ELP4、IL1RAPL2、NDUFA8、PUM1、SUDS3、ZNF37A、CABLES1、ELP5、IL1RL1、NDUFA9、PUM2、SUFU、ZNF382、CABLES2、ELP6、IL1RL2、NDUFAB1、PURA、SUGP1、ZNF383、CABP1、ELSPBP1、IL1RN、NDUFAF1、PURB、SUGP2、ZNF384、CABP2、ELTD1、IL2、NDUFAF2、PURG、SUGT1、ZNF385A、CABP4、EMB、IL20、NDUFAF3、PUS1、SULF1、ZNF385B、CABP5、EMC1、IL20RA、NDUFAF4、PUS10、SULF2、ZNF385C、CABP7、EMC10、IL20RB、NDUFAF5、PUS3、SULT1A1、ZNF385D、CABS1、EMC2、IL21、NDUFAF6、PUS7、SULT1A2、ZNF391、CABYR、EMC3、IL21R、NDUFAF7、PUS7L、SULT1A3、ZNF394、CACFD1、EMC4、IL22、NDUFB1、PUSL1、SULT1A4、ZNF395、CACHD1、EMC6、IL22RA1、NDUFB10、PVALB、SULT1B1、ZNF396、CACNA1A、EMC7、IL22RA2、NDUFB11、PVR、SULT1C2、ZNF397、CACNA1B、EMC8、IL23A、NDUFB2、PVRIG、SULT1C3、ZNF398、CACNA1C、EMC9、IL23R、NDUFB3、PVRL1、SULT1C4、ZNF404、CACNA1D、EMCN、IL24、NDUFB4、PVRL2、SULT1E1、ZNF407、CACNA1E、EMD、IL25、NDUFB5、PVRL3、SULT2A1、ZNF408、CACNA1F、EME1、IL26、NDUFB6、PVRL4、SULT2B1、ZNF41、CACNA1G、EME2、IL27、NDUFB7、PWP1、SULT4A1、ZNF410、CACNA1H、EMG1、IL27RA、NDUFB8、PWP2、SULT6B1、ZNF414、CACNA1I、EMID1、IL2RA、NDUFB9、PWWP2A、SUMF1、ZNF415、CACNA1S、EMILIN1、IL2RB、NDUFC1、PWWP2B、SUMF2、ZNF416、CACNA2D1、EMILIN2、IL2RG、NDUFC2、PXDC1、SUMO1、ZNF417、CACNA2D2、EMILIN3、IL3、NDUFC2-KCTD14、PXDN、SUMO2、ZNF418、CACNA2D3、EML1、IL31、NDUFS1、PXDNL、SUMO3、ZNF419、CACNA2D4、EML2、IL31RA、NDUFS2、PXK、SUMO4、ZNF420、CACNB1、EML3、IL32、NDUFS3、PXMP2、SUN1、ZNF423、CACNB2、EML4、IL33、NDUFS4、PXMP4、SUN2、ZNF425、CACNB3、EML5、IL34、NDUFS5、PXN、SUN3、ZNF426、CACNB4、EML6、IL36A、NDUFS6、PXT1、SUN5、ZNF428、CACNG1、EMP1、IL36B、NDUFS7、PYCARD、SUOX、ZNF429、CACNG2、EMP2、IL36G、
NDUFS8、PYCR1、SUPT16H、ZNF43、CACNG3、EMP3、IL36RN、NDUFV1、PYCR2、SUPT20H、ZNF430、CACNG4、EMR1、IL37、NDUFV2、PYCRL、SUPT3H、ZNF431、CACNG5、EMR2、IL3RA、NDUFV3、PYDC1、SUPT4H1、ZNF432、CACNG6、EMR3、IL4、NEB、PYDC2、SUPT5H、ZNF433、CACNG7、EMX1、IL4I1、NEBL、PYGB、SUPT6H、ZNF436、CACNG8、EMX2、IL4R、NECAB1、PYGL、SUPT7L、ZNF438、CACTIN、EN1、IL5、NECAB2、PYGM、SUPV3L1、ZNF439、CACUL1、EN2、IL5RA、NECAB3、PYGO1、SURF1、ZNF44、CACYBP、ENAH、IL6、NECAP1、PYGO2、SURF2、ZNF440、CAD、ENAM、IL6R、NECAP2、PYHIN1、SURF4、ZNF441、CADM1、ENC1、IL6ST、NEDD1、PYROXD1、SURF6、ZNF442、CADM2、ENDOD1、IL7、NEDD4、PYROXD2、SUSD1、ZNF443、CADM3、ENDOG、IL7R、NEDD4L、PYURF、SUSD2、ZNF444、CADM4、ENDOU、IL8、NEDD8、PYY、SUSD3、ZNF445、CADPS、ENDOV、IL9、NEDD8-MDP1、PZP、SUSD4、ZNF446、CADPS2、ENG、IL9R、NEDD9、QARS、SUSD5、ZNF449、CAGE1、ENGASE、ILDR1、NEFH、QDPR、SUV39H1、ZNF45、CALB1、ENHO、ILDR2、NEFL、QKI、SUV39H2、ZNF451、CALB2、ENKD1、ILF2、NEFM、QPCT、SUV420H1、ZNF454、CALCA、ENKUR、ILF3、NEGR1、QPCTL、SUV420H2、ZNF460、CALCB、ENO1、ILK、NEIL1、QPRT、SUZ12、ZNF461、CALCOCO1、ENO2、ILKAP、NEIL2、QRFP、SV2A、ZNF462、CALCOCO2、ENO3、ILVBL、NEIL3、QRFPR、SV2B、ZNF467、CALCR、ENO4、IMMP1L、NEK1、QRICH1、SV2C、ZNF468、CALCRL、ENOPH1、IMMP2L、NEK10、QRICH2、SVEP1、ZNF469、CALD1、ENOSF1、IMMT、NEK11、QRSL1、SVIL、ZNF470、CALHM1、ENOX1、IMP3、NEK2、QSER1、SVIP、ZNF471、CALHM2、ENOX2、IMP4、NEK3、QSOX1、SVOP、ZNF473、CALHM3、ENPEP、IMPA1、NEK4、QSOX2、SVOPL、ZNF474、CALM1、ENPP1、IMPA2、NEK5、QTRT1、SWAP70、ZNF479、CALM2、ENPP2、IMPACT、NEK6、QTRTD1、SWI5、ZNF48、CALM3、ENPP3、IMPAD1、NEK7、R3HCC1、SWSAP1、ZNF480、CALML3、ENPP4、IMPDH1、NEK8、R3HCC1L、SWT1、ZNF483、CALML4、ENPP5、IMPDH2、NEK9、R3HDM1、SYAP1、ZNF484、CALML5、ENPP6、IMPG1、NELFA、R3HDM2、SYBU、ZNF485、CALML6、ENPP7、IMPG2、NELFB、R3HDM4、SYCE1、ZNF486、CALN1、ENSA、INA、NELFCD、R3HDML、SYCE1L、ZNF488、CALR、ENTHD1、INADL、NELFE、RAB10、SYCE2、ZNF490、CALR3、ENTHD2、INCA1、NELL1、RAB11A、SYCE3、ZNF491、CALU、ENTPD1、INCENP、NELL2、RAB11B、SYCN、ZNF492、CALY、ENTPD2、INF2、NEMF、RAB11FIP1、SYCP1、ZNF493、CAMK1、ENTPD3、ING1、NENF、RAB11FIP2、SYCP2、ZNF496、CAMK1D、ENTPD4、ING2、NEO1、RAB11FIP3、SYCP2L、ZNF497、CAMK1G、ENTPD5、ING3、NES、RAB11FIP4、SYCP3、ZNF500、CAMK2A、ENTPD6、ING4、NET1、RAB11FIP5、SYDE1、ZNF501、CAMK2B、ENTPD7、ING5、NETO1、RAB12、SYDE2、ZNF502、CAMK2D、ENTPD8、INHA、NETO2、RAB13、SYF2、ZNF503、CAMK2G、ENY2、INHBA、NEU1、RAB14、SYK、ZNF506、CAMK2N1、EOGT、INHBB、NEU2、RAB15、SYMPK、ZNF507、CAMK2N2、EOMES、INHBC、NEU3、RAB17、SYN1、ZNF510、CAMK4、EP300、INHBE、NEU4、RAB18、SYN2、ZNF511、CAMKK1、EP400、INIP、NEURL1、RAB19、SYN3、ZNF512、CAMKK2、EPAS1、INMT、NEURL1B、RAB1A、SYNC、ZNF512B、CAMKMT、EPB41、INO80、NEURL2、RAB1B、SYNCRIP、ZNF513、CAMKV、EPB41L1、INO80B、NEURL3、RAB20、SYNDIG1、ZNF514、CAMLG、EPB41L2、INO80C、NEURL4、RAB21、SYNDIG1L、ZNF516、CAMP、EPB41L3、INO80D、NEUROD1、RAB22A、SYNE1、ZNF517、CAMSAP1、EPB41L4A、
INO80E、NEUROD2、RAB23、SYNE2、ZNF518A、CAMSAP2、EPB41L4B、INPP1、NEUROD4、RAB24、SYNE3、ZNF518B、CAMSAP3、EPB41L5、INPP4A、NEUROD6、RAB25、SYNE4、ZNF519、CAMTA1、EPB42、INPP4B、NEUROG1、RAB26、SYNGAP1、ZNF521、CAMTA2、EPC1、INPP5A、NEUROG2、RAB27A、SYNGR1、ZNF524、CAND1、EPC2、INPP5B、NEUROG3、RAB27B、SYNGR2、ZNF526、CAND2、EPCAM、INPP5D、NEXN、RAB28、SYNGR3、ZNF527、CANT1、EPDR1、INPP5E、NF1、RAB2A、SYNGR4、ZNF528、CANX、EPG5、INPP5F、NF2、RAB2B、SYNJ1、ZNF529、CAP1、EPGN、INPP5J、NFAM1、RAB30、SYNJ2、ZNF530、CAP2、EPHA1、INPP5K、NFASC、RAB31、SYNJ2BP、ZNF532、CAPG、EPHA10、INPPL1、NFAT5、RAB32、SYNJ2BP-COX16、ZNF534、CAPN1、EPHA2、INS、NFATC1、RAB33A、SYNM、ZNF536、CAPN10、EPHA3、INSC、NFATC2、RAB33B、SYNPO、ZNF540、CAPN11、EPHA4、INSIG1、NFATC2IP、RAB34、SYNPO2、ZNF541、CAPN12、EPHA5、INSIG2、NFATC3、RAB35、SYNPO2L、ZNF543、CAPN13、EPHA6、INS-IGF2、NFATC4、RAB36、SYNPR、ZNF544、CAPN14、EPHA7、INSL3、NFE2、RAB37、SYNRG、ZNF546、CAPN15、EPHA8、INSL4、NFE2L1、RAB38、SYP、ZNF547、CAPN2、EPHB1、INSL5、NFE2L2、RAB39A、SYPL1、ZNF548、CAPN3、EPHB2、INSL6、NFE2L3、RAB39B、SYPL2、ZNF549、CAPN5、EPHB3、INSM1、NFIA、RAB3A、SYS1、ZNF550、CAPN6、EPHB4、INSM2、NFIB、RAB3B、SYT1、ZNF551、CAPN7、EPHB6、INSR、NFIC、RAB3C、SYT10、ZNF552、CAPN8、EPHX1、INSRR、NFIL3、RAB3D、SYT11、ZNF554、CAPN9、EPHX2、INTS1、NFIX、RAB3GAP1、SYT12、ZNF555、CAPNS1、EPHX3、INTS10、NFKB1、RAB3GAP2、SYT13、ZNF556、CAPNS2、EPHX4、INTS12、NFKB2、RAB3IL1、SYT14、ZNF557、CAPRIN1、EPM2A、INTS2、NFKBIA、RAB3IP、SYT15、ZNF558、CAPRIN2、EPM2AIP1、INTS3、NFKBIB、RAB40A、SYT16、ZNF559、CAPS、EPN1、INTS4、NFKBID、RAB40AL、SYT17、ZNF559-ZNF177、CAPS2、EPN2、INTS5、NFKBIE、RAB40B、SYT2、ZNF560、CAPSL、EPN3、INTS6、NFKBIL1、RAB40C、SYT3、ZNF561、CAPZA1、EPO、INTS7、NFKBIZ、RAB41、SYT4、ZNF562、CAPZA2、EPOR、INTS8、NFRKB、RAB42、SYT5、ZNF563、CAPZA3、EPPIN、INTS9、NFS1、RAB43、SYT6、ZNF564、CAPZB、EPPIN-WFDC6、INTU、NFU1、RAB44、SYT7、ZNF565、CARD10、EPPK1、INVS、NFX1、RAB4A、SYT8、ZNF566、CARD11、EPRS、IP6K1、NFXL1、RAB4B、SYT9、ZNF567、CARD14、EPS15、IP6K2、NFYA、RAB5A、SYTL1、ZNF568、CARD16、EPS15L1、IP6K3、NFYB、RAB5B、SYTL2、ZNF569、CARD17、EPS8、IPCEF1、NFYC、RAB5C、SYTL3、ZNF57、CARD18、EPS8L1、IPMK、NGB、RAB6A、SYTL4、ZNF570、CARD6、EPS8L2、IPO11、NGDN、RAB6B、SYTL5、ZNF571、CARD8、EPS8L3、IPO13、NGEF、RAB6C、SYVN1、ZNF572、CARD9、EPSTI1、IPO4、NGF、RAB7A、SZT2、ZNF573、CARF、EPT1、IPO5、NGFR、RAB7L1、T、ZNF574、CARHSP1、EPX、IPO7、NGFRAP1、RAB8A、TAAR1、ZNF575、CARKD、EPYC、IPO8、NGLY1、RAB8B、TAAR2、ZNF576、CARM1、EQTN、IPO9、NGRN、RAB9A、TAAR5、ZNF577、CARNS1、ERAL1、IPP、NHEJ1、RAB9B、TAAR6、ZNF578、CARS、ERAP1、IPPK、NHLH1、RABAC1、TAAR8、ZNF579、CARS2、ERAP2、IQCA1、NHLH2、RABEP1、TAAR9、ZNF580、CARTPT、ERAS、IQCB1、NHLRC1、RABEP2、TAB1、ZNF581、CASC1、ERBB2、IQCC、NHLRC2、RABEPK、TAB2、ZNF582、CASC10、ERBB2IP、IQCD、NHLRC3、RABGAP1、TAB3、ZNF583、CASC3、ERBB3、IQCE、NHLRC4、RABGAP1L、TAC1、ZNF584、CASC4、ERBB4、IQCF1、NHP2、
RABGEF1、TAC3、ZNF585A、CASC5、ERC1、IQCF2、NHP2L1、RABGGTA、TAC4、ZNF585B、CASD1、ERC2、IQCF3、NHS、RABGGTB、TACC1、ZNF586、CASK、ERCC1、IQCF5、NHSL1、RABIF、TACC2、ZNF587、CASKIN1、ERCC2、IQCF6、NHSL2、RABL2A、TACC3、ZNF587B、CASKIN2、ERCC3、IQCG、NICN1、RABL2B、TACO1、ZNF589、CASP1、ERCC4、IQCH、NID1、RABL3、TACR1、ZNF592、CASP10、ERCC5、IQCJ、NID2、RABL5、TACR2、ZNF593、CASP12、ERCC6、IQCJ-SCHIP1、NIF3L1、RABL6、TACR3、ZNF594、CASP14、ERCC6L、IQCK、NIFK、RAC1、TACSTD2、ZNF595、CASP16、ERCC6L2、IQGAP1、NIM1、RAC2、TADA1、ZNF596、CASP2、ERCC6-PGBD3、IQGAP2、NIN、RAC3、TADA2A、ZNF597、CASP3、ERCC8、IQGAP3、NINJ1、RACGAP1、TADA2B、ZNF598、CASP4、EREG、IQSEC1、NINJ2、RAD1、TADA3、ZNF599、CASP5、ERF、IQSEC2、NINL、RAD17、TAF1、ZNF600、CASP6、ERG、IQSEC3、NIP7、RAD18、TAF10、ZNF605、CASP7、ERGIC1、IQUB、NIPA1、RAD21、TAF11、ZNF606、CASP8、ERGIC2、IRAK1、NIPA2、RAD21L1、TAF12、ZNF607、CASP8AP2、ERGIC3、IRAK1BP1、NIPAL1、RAD23A、TAF13、ZNF608、CASP9、ERH、IRAK2、NIPAL2、RAD23B、TAF15、ZNF609、CASQ1、ERI1、IRAK3、NIPAL3、RAD50、TAF1A、ZNF610、CASQ2、ERI2、IRAK4、NIPAL4、RAD51、TAF1B、ZNF611、CASR、ERI3、IREB2、NIPBL、RAD51AP1、TAF1C、ZNF613、CASS4、ERICH1、IRF1、NIPSNAP1、RAD51AP2、TAF1D、ZNF614、CAST、ERICH2、IRF2、NIPSNAP3A、RAD51B、TAF1L、ZNF615、CASZ1、ERLEC1、IRF2BP1、NIPSNAP3B、RAD51C、TAF2、ZNF616、CAT、ERLIN1、IRF2BP2、NISCH、RAD51D、TAF3、ZNF618、CATSPER1、ERLIN2、IRF2BPL、NIT1、RAD52、TAF4、ZNF619、CATSPER2、ERMAP、IRF3、NIT2、RAD54B、TAF4B、ZNF620、CATSPER3、ERMARD、IRF4、NKAIN1、RAD54L、TAF5、ZNF621、CATSPER4、ERMN、IRF5、NKAIN2、RAD54L2、TAF5L、ZNF622、CATSPERB、ERMP1、IRF6、NKAIN3、RAD9A、TAF6、ZNF623、CATSPERD、ERN1、IRF7、NKAIN4、RAD9B、TAF6L、ZNF624、CATSPERG、ERN2、IRF8、NKAP、RADIL、TAF7、ZNF625、CAV1、ERO1L、IRF9、NKAPL、RAE1、TAF7L、ZNF626、CAV2、ERO1LB、IRG1、NKD1、RAET1E、TAF8、ZNF627、CAV3、ERP27、IRGC、NKD2、RAET1G、TAF9、ZNF628、CBFA2T2、ERP29、IRGM、NKG7、RAET1L、TAF9B、ZNF629、CBFA2T3、ERP44、IRGQ、NKIRAS1、RAF1、TAGAP、ZNF630、CBFB、ERRFI1、IRS1、NKIRAS2、RAG1、TAGLN、ZNF638、CBL、ERVFRD-1、IRS2、NKPD1、RAG2、TAGLN2、ZNF639、CBLB、ERVMER34-1、IRS4、NKRF、RAI1、TAGLN3、ZNF641、CBLC、ERVV-1、IRX1、NKTR、RAI14、TAL1、ZNF644、CBLL1、ERVV-2、IRX2、NKX1-2、RAI2、TAL2、ZNF645、CBLN1、ESAM、IRX3、NKX2-1、RALA、TALDO1、ZNF646、CBLN2、ESCO1、IRX4、NKX2-2、RALB、TAMM41、ZNF648、CBLN3、ESCO2、IRX5、NKX2-3、RALBP1、TANC1、ZNF649、CBLN4、ESD、IRX6、NKX2-4、RALGAPA1、TANC2、ZNF652、CBR1、ESF1、ISCA1、NKX2-5、RALGAPA2、TANGO2、ZNF653、CBR3、ESM1、ISCA2、NKX2-6、RALGAPB、TANGO6、ZNF654、CBR4、ESPL1、ISCU、NKX2-8、RALGDS、TANK、ZNF655、CBS、ESPN、ISG15、NKX3- 1、RALGPS1、TAOK1、ZNF658、CBWD1、ESPNL、ISG20、NKX3-2、RALGPS2、TAOK2、ZNF660、CBWD2、ESR1、ISG20L2、NKX6-1、RALY、TAOK3、ZNF662、CBWD3、ESR2、ISL1、NKX6-2、RALYL、TAP1、ZNF664、CBWD5、ESRP1、ISL2、NKX6-3、RAMP1、TAP2、ZNF664-FAM101A、CBWD6、ESRP2、ISLR、NLE1、RAMP2、TAPBP、ZNF665、CBWD7、ESRRA、ISLR2、NLGN1、RAMP3、TAPBPL、ZNF667、CBX1、ESRRB、
ISM1、NLGN2、RAN、TAPT1、ZNF668、CBX2、ESRRG、ISM2、NLGN3、RANBP1、TARBP1、ZNF669、CBX3、ESX1、ISOC1、NLGN4X、RANBP10、TARBP2、ZNF670、CBX4、ESYT1、ISOC2、NLGN4Y、RANBP17、TARDBP、ZNF671、CBX5、ESYT2、ISPD、NLK、RANBP2、TARM1、ZNF672、CBX6、ESYT3、IST1、NLN、RANBP3、TARP、ZNF674、CBX7、ETAA1、ISX、NLRC3、RANBP3L、TARS、ZNF675、CBX8、ETF1、ISY1、NLRC4、RANBP6、TARS2、ZNF676、CBY1、ETFA、ISY1-RAB43、NLRC5、RANBP9、TARSL2、ZNF677、CBY3、ETFB、ISYNA1、NLRP1、RANGAP1、TAS1R1、ZNF678、CC2D1A、ETFDH、ITCH、NLRP10、RANGRF、TAS1R2、ZNF679、CC2D1B、ETHE1、ITFG1、NLRP11、RAP1A、TAS1R3、ZNF680、CC2D2A、ETNK1、ITFG2、NLRP12、RAP1B、TAS2R1、ZNF681、CC2D2B、ETNK2、ITFG3、NLRP13、RAP1GAP、TAS2R10、ZNF682、CCAR1、ETNPPL、ITGA1、NLRP14、RAP1GAP2、TAS2R13、ZNF683、CCAR2、ETS1、ITGA10、NLRP2、RAP1GDS1、TAS2R14、ZNF684、CCBE1、ETS2、ITGA11、NLRP3、RAP2A、TAS2R16、ZNF687、CCBL1、ETV1、ITGA2、NLRP4、RAP2B、TAS2R19、ZNF688、CCBL2、ETV2、ITGA2B、NLRP5、RAP2C、TAS2R20、ZNF689、CCDC101、ETV3、ITGA3、NLRP6、RAPGEF1、TAS2R3、ZNF69、CCDC102A、ETV3L、ITGA4、NLRP7、RAPGEF2、TAS2R30、ZNF691、CCDC102B、ETV4、ITGA5、NLRP8、RAPGEF3、TAS2R31、ZNF692、CCDC103、ETV5、ITGA6、NLRP9、RAPGEF4、TAS2R38、ZNF695、CCDC104、ETV6、ITGA7、NLRX1、RAPGEF5、TAS2R39、ZNF696、CCDC105、ETV7、ITGA8、NMB、RAPGEF6、TAS2R4、ZNF697、CCDC106、EVA1A、ITGA9、NMBR、RAPGEFL1、TAS2R40、ZNF699、CCDC107、EVA1B、ITGAD、NMD3、RAPH1、TAS2R41、ZNF7、CCDC108、EVA1C、ITGAE、NME1、RAPSN、TAS2R42、ZNF70、CCDC109B、EVC、ITGAL、NME1-NME2、RARA、TAS2R43、ZNF700、CCDC11、EVC2、ITGAM、NME2、RARB、TAS2R46、ZNF701、CCDC110、EVI2A、ITGAV、NME3、RARG、TAS2R5、ZNF703、CCDC112、EVI2B、ITGAX、NME4、RARRES1、TAS2R50、ZNF704、CCDC113、EVI5、ITGB1、NME5、RARRES2、TAS2R60、ZNF705A、CCDC114、EVI5L、ITGB1BP1、NME6、RARRES3、TAS2R7、ZNF705B、CCDC115、EVL、ITGB1BP2、NME7、RARS、TAS2R8、ZNF705D、CCDC116、EVPL、ITGB2、NME8、RARS2、TAS2R9、ZNF705E、CCDC117、EVPLL、ITGB3、NME9、RASA1、TASP1、ZNF705G、CCDC12、EVX1、ITGB3BP、NMI、RASA2、TAT、ZNF706、CCDC120、EVX2、ITGB4、NMNAT1、RASA3、TATDN1、ZNF707、CCDC121、EWSR1、ITGB5、NMNAT2、RASA4、TATDN2、ZNF708、CCDC122、EXD1、ITGB6、NMNAT3、RASA4B、TATDN3、ZNF709、CCDC124、EXD2、ITGB7、NMRAL1、RASAL1、TAX1BP1、ZNF71、CCDC125、EXD3、ITGB8、NMRK1、RASAL2、TAX1BP3、ZNF710、CCDC126、EXO1、ITGBL1、NMRK2、RASAL3、TAZ、ZNF711、CCDC127、EXO5、ITIH1、NMS、RASD1、TBATA、ZNF713、CCDC129、EXOC1、ITIH2、NMT1、RASD2、TBC1D1、ZNF714、CCDC13、EXOC2、ITIH3、NMT2、RASEF、TBC1D10A、ZNF716、CCDC130、EXOC3、ITIH4、NMU、RASGEF1A、TBC1D10B、ZNF717、CCDC132、EXOC3L1、ITIH5、NMUR1、RASGEF1B、TBC1D10C、ZNF718、CCDC134、EXOC3L2、ITIH6、NMUR2、RASGEF1C、TBC1D12、ZNF720、CCDC135、EXOC3L4、ITK、NNAT、RASGRF1、TBC1D13、ZNF721、CCDC136、EXOC4、ITLN1、NNMT、RASGRF2、TBC1D14、ZNF726、CCDC137、EXOC5、ITLN2、NNT、RASGRP1、TBC1D15、ZNF727、CCDC138、EXOC6、ITM2A、NOA1、RASGRP2、TBC1D16、ZNF728、CCDC14、EXOC6B、ITM2B、NOB1、RASGRP3、TBC1D17、ZNF729、CCDC140、EXOC7、ITM2C、NOBOX、
RASGRP4、TBC1D19、ZNF730、CCDC141、EXOC8、ITPA、NOC2L、RASIP1、TBC1D2、ZNF732、CCDC142、EXOG、ITPK1、NOC3L、RASL10A、TBC1D20、ZNF735、CCDC144A、EXOSC1、ITPKA、NOC4L、RASL10B、TBC1D21、ZNF736、CCDC144NL、EXOSC10、ITPKB、NOD1、RASL11A、TBC1D22A、ZNF737、CCDC146、EXOSC2、ITPKC、NOD2、RASL11B、TBC1D22B、ZNF74、CCDC147、EXOSC3、ITPR1、NODAL、RASL12、TBC1D23、ZNF740、CCDC148、EXOSC4、ITPR2、NOG、RASSF1、TBC1D24、ZNF746、CCDC149、EXOSC5、ITPR3、NOL10、RASSF10、TBC1D25、ZNF747、CCDC15、EXOSC6、ITPRIP、NOL11、RASSF2、TBC1D26、ZNF749、CCDC150、EXOSC7、ITPRIPL1、NOL12、RASSF3、TBC1D27、ZNF750、CCDC151、EXOSC8、ITPRIPL2、NOL3、RASSF4、TBC1D28、ZNF75A、CCDC152、EXOSC9、ITSN1、NOL4、RASSF5、TBC1D29、ZNF75D、CCDC153、EXPH5、ITSN2、NOL6、RASSF6、TBC1D2B、ZNF76、CCDC154、EXT1、IVD、NOL7、RASSF7、TBC1D3、ZNF761、CCDC155、EXT2、IVL、NOL8、RASSF8、TBC1D30、ZNF763、CCDC157、EXTL1、IVNS1ABP、NOL9、RASSF9、TBC1D31、ZNF764、CCDC158、EXTL2、IWS1、NOLC1、RAVER1、TBC1D32、ZNF765、CCDC159、EXTL3、IYD、NOM1、RAVER2、TBC1D3B、ZNF766、CCDC160、EYA1、IZUMO1、NOMO1、RAX、TBC1D3C、ZNF768、CCDC166、EYA2、IZUMO2、NOMO2、RAX2、TBC1D3F、ZNF77、CCDC167、EYA3、IZUMO3、NOMO3、RB1、TBC1D3G、ZNF770、CCDC168、EYA4、IZUMO4、NONO、RB1CC1、TBC1D3H、ZNF771、CCDC169、EYS、JADE1、NOP10、RBAK、TBC1D4、ZNF772、CCDC169-SOHLH2、EZH1、JADE2、NOP14、RBAK-RBAKDN、TBC1D5、ZNF773、CCDC17、EZH2、JADE3、NOP16、RBBP4、TBC1D7、ZNF774、CCDC170、EZR、JAG1、NOP2、RBBP5、TBC1D8、ZNF775、CCDC171、F10、JAG2、NOP56、RBBP6、TBC1D8B、ZNF776、CCDC172、F11、JAGN1、NOP58、RBBP7、TBC1D9、ZNF777、CCDC173、F11R、JAK1、NOP9、RBBP8、TBC1D9B、ZNF778、CCDC174、F12、JAK2、NOS1、RBBP8NL、TBCA、ZNF780A、CCDC175、F13A1、JAK3、NOS1AP、RBBP9、TBCB、ZNF780B、CCDC176、F13B、JAKMIP1、NOS2、RBCK1、TBCC、ZNF781、CCDC177、F2、JAKMIP2、NOS3、RBFA、TBCCD1、ZNF782、CCDC178、F2R、JAKMIP3、NOSIP、RBFOX1、TBCD、ZNF783、CCDC179、F2RL1、JAM2、NOSTRIN、RBFOX2、TBCE、ZNF784、CCDC18、F2RL2、JAM3、NOTCH1、RBFOX3、TBCEL、ZNF785、CCDC180、F2RL3、JARID2、NOTCH2、RBKS、TBCK、ZNF786、CCDC181、F3、JAZF1、NOTCH2NL、RBL1、TBK1、ZNF787、CCDC19、F5、JDP2、NOTCH3、RBL2、TBKBP1、ZNF789、CCDC22、F7、JHDM1D、NOTCH4、RBM10、TBL1X、ZNF79、CCDC23、F8、JKAMP、NOTO、RBM11、TBL1XR1、ZNF790、CCDC24、F8A1、JMJD1C、NOTUM、RBM12、TBL1Y、ZNF791、CCDC25、F8A2、JMJD4、NOV、RBM12B、TBL2、ZNF792、CCDC27、F8A3、JMJD6、NOVA1、RBM14、TBL3、ZNF793、CCDC28A、F9、JMJD7、NOVA2、RBM14-RBM4、TBP、ZNF799、CCDC28B、FA2H、JMJD7-PLA2G4B、NOX1、RBM15、TBPL1、ZNF8、CCDC3、FAAH、JMJD8、NOX3、RBM15B、TBPL2、ZNF80、CCDC30、FAAH2、JMY、NOX4、RBM17、TBR1、ZNF800、CCDC33、FABP1、JOSD1、NOX5、RBM18、TBRG1、ZNF804A、CCDC34、FABP12、JOSD2、NOXA1、RBM19、TBRG4、ZNF804B、CCDC36、FABP2、JPH1、NOXO1、RBM20、TBX1、ZNF805、CCDC37、FABP3、JPH2、NOXRED1、RBM22、TBX10、ZNF806、CCDC38、FABP4、JPH3、NPAP1、RBM23、TBX15、ZNF808、CCDC39、FABP5、JPH4、NPAS1、RBM24、TBX18、ZNF81、CCDC40、FABP6、JRK、NPAS2、RBM25、TBX19、ZNF812、CCDC41、FABP7、JRKL、
NPAS3、RBM26、TBX2、ZNF813、CCDC42、FABP9、JSRP1、NPAS4、RBM27、TBX20、ZNF814、CCDC42B、FADD、JTB、NPAT、RBM28、TBX21、ZNF816、CCDC43、FADS1、JUN、NPB、RBM3、TBX22、ZNF816-ZNF321P、CCDC47、FADS2、JUNB、NPBWR1、RBM33、TBX3、ZNF821、CCDC50、FADS3、JUND、NPBWR2、RBM34、TBX4、ZNF823、CCDC51、FADS6、JUP、NPC1、RBM38、TBX5、ZNF827、CCDC53、FAF1、KAAG1、NPC1L1、RBM39、TBX6、ZNF829、CCDC54、FAF2、KAL1、NPC2、RBM4、TBXA2R、ZNF83、CCDC57、FAH、KALRN、NPDC1、RBM41、TBXAS1、ZNF830、CCDC58、FAHD1、KANK1、NPEPL1、RBM42、TC2N、ZNF831、CCDC59、FAHD2A、KANK2、NPEPPS、RBM43、TCAIM、ZNF835、CCDC6、FAHD2B、KANK3、NPFF、RBM46、TCAP、ZNF836、CCDC60、FAIM、KANK4、NPFFR1、RBM47、TCEA1、ZNF837、CCDC61、FAIM2、KANSL1、NPFFR2、RBM48、TCEA2、ZNF839、CCDC62、FAIM3、KANSL1L、NPHP1、RBM4B、TCEA3、ZNF84、CCDC63、FAM101A、KANSL2、NPHP3、RBM5、TCEAL1、ZNF841、CCDC64、FAM101B、KANSL3、NPHP4、RBM6、TCEAL2、ZNF843、CCDC64B、FAM102A、KARS、NPHS1、RBM7、TCEAL3、ZNF844、CCDC65、FAM102B、KAT2A、NPHS2、RBM8A、TCEAL4、ZNF845、CCDC66、FAM103A1、KAT2B、NPIPA1、RBMS1、TCEAL5、ZNF846、CCDC67、FAM104A、KAT5、NPIPA2、RBMS2、TCEAL6、ZNF85、CCDC68、FAM104B、KAT6A、NPIPA3、RBMS3、TCEAL7、ZNF850、CCDC69、FAM105A、KAT6B、NPIPA5、RBMX、TCEAL8、ZNF852、CCDC7、FAM105B、KAT7、NPIPA7、RBMX2、TCEANC、ZNF853、CCDC70、FAM107A、KAT8、NPIPA8、RBMXL1、TCEANC2、ZNF860、CCDC71、FAM107B、KATNA1、NPIPB11、RBMXL2、TCEB1、ZNF862、CCDC71L、FAM109A、KATNAL1、NPIPB15、RBMXL3、TCEB2、ZNF865、CCDC73、FAM109B、KATNAL2、NPIPB3、RBMY1A1、TCEB3、ZNF878、CCDC74A、FAM110A、KATNB1、NPIPB4、RBMY1B、TCEB3B、ZNF879、CCDC74B、FAM110B、KATNBL1、NPIPB5、RBMY1D、TCEB3C、ZNF880、CCDC77、FAM110C、KAZALD1、NPIPB6、RBMY1E、TCEB3CL、ZNF883、CCDC78、FAM110D、KAZN、NPIPB8、RBMY1F、TCEB3CL2、ZNF888、CCDC79、FAM111A、KBTBD11、NPIPB9、RBMY1J、TCERG1、ZNF891、CCDC8、FAM111B、KBTBD12、NPL、RBP1、TCERG1L、ZNF90、CCDC80、FAM114A1、KBTBD13、NPLOC4、RBP2、TCF12、ZNF91、CCDC81、FAM114A2、KBTBD2、NPM1、RBP3、TCF15、ZNF92、CCDC82、FAM115A、KBTBD3、NPM2、RBP4、TCF19、ZNF93、CCDC83、FAM115C、KBTBD4、NPM3、RBP5、TCF20、ZNF98、CCDC84、FAM117A、KBTBD6、NPNT、RBP7、TCF21、ZNF99、CCDC85A、FAM117B、KBTBD7、NPPA、RBPJ、TCF23、ZNFX1、CCDC85B、FAM118A、KBTBD8、NPPB、RBPJL、TCF24、ZNHIT1、CCDC85C、FAM118B、KCMF1、NPPC、RBPMS、TCF25、ZNHIT2、CCDC86、FAM120A、KCNA1、NPR1、RBPMS2、TCF3、ZNHIT3、CCDC87、FAM120AOS、KCNA10、NPR2、RBX1、TCF4、ZNHIT6、CCDC88A、FAM120B、KCNA2、NPR3、RC3H1、TCF7、ZNRD1、CCDC88B、FAM120C、KCNA3、NPRL2、RC3H2、TCF7L1、ZNRF1、CCDC88C、FAM122A、KCNA4、NPRL3、RCAN1、TCF7L2、ZNRF2、CCDC89、FAM122B、KCNA5、NPS、RCAN2、TCFL5、ZNRF3、CCDC9、FAM122C、KCNA6、NPSR1、RCAN3、TCHH、ZNRF4、CCDC90B、FAM124A、KCNA7、NPTN、RCBTB1、TCHHL1、ZP1、CCDC91、FAM124B、KCNAB1、NPTX1、RCBTB2、TCHP、ZP2、CCDC92、FAM126A、KCNAB2、NPTX2、RCC1、TCIRG1、ZP3、CCDC93、FAM126B、KCNAB3、NPTXR、RCC2、TCL1A、ZP4、CCDC94、FAM127A、KCNB1、NPVF、RCCD1、TCL1B、ZPBP、CCDC96、FAM127B、KCNB2、NPW、RCE1、TCN1、ZPBP2、
CCDC97、FAM127C、KCNC1、NPY、RCHY1、TCN2、ZPLD1、CCER1、FAM129A、KCNC2、NPY1R、RCL1、TCOF1、ZRANB1、CCHCR1、FAM129B、KCNC3、NPY2R、RCN1、TCP1、ZRANB2、CCIN、FAM129C、KCNC4、NPY4R、RCN2、TCP10、ZRANB3、CCK、FAM131A、KCND1、NPY5R、RCN3、TCP10L、ZRSR2、CCKAR、FAM131B、KCND2、NQO1、RCOR1、TCP10L2、ZSCAN1、CCKBR、FAM131C、KCND3、NQO2、RCOR2、TCP11、ZSCAN10、CCL1、FAM132A、KCNE1、NR0B1、RCOR3、TCP11L1、ZSCAN12、CCL11、FAM132B、KCNE1L、NR0B2、RCSD1、TCP11L2、ZSCAN16、CCL13、FAM133A、KCNE2、NR1D1、RCVRN、TCP11X1、ZSCAN18、CCL14、FAM133B、KCNE3、NR1D2、RD3、TCP11X2、ZSCAN2、CCL15、FAM134A、KCNE4、NR1H2、RD3L、TCTA、ZSCAN20、CCL16、FAM134B、KCNF1、NR1H3、RDH10、TCTE1、ZSCAN21、CCL17、FAM134C、KCNG1、NR1H4、RDH11、TCTE3、ZSCAN22、CCL18、FAM135A、KCNG2、NR1I2、RDH12、TCTEX1D1、ZSCAN23、CCL19、FAM135B、KCNG3、NR1I3、RDH13、TCTEX1D2、ZSCAN25、CCL2、FAM136A、KCNG4、NR2C1、RDH14、TCTEX1D4、ZSCAN26、CCL20、FAM13A、KCNH1、NR2C2、RDH16、TCTN1、ZSCAN29、CCL21、FAM13B、KCNH2、NR2C2AP、RDH5、TCTN2、ZSCAN30、CCL22、FAM13C、KCNH3、NR2E1、RDH8、TCTN3、ZSCAN31、CCL23、FAM149A、KCNH4、NR2E3、RDM1、TDG、ZSCAN32、CCL24、FAM149B1、KCNH5、NR2F1、RDX、TDGF1、ZSCAN4、CCL25、FAM150A、KCNH6、NR2F2、REC8、TDO2、ZSCAN5A、CCL26、FAM150B、KCNH7、NR2F6、RECK、TDP1、ZSCAN5B、CCL27、FAM151A、KCNH8、NR3C1、RECQL、TDP2、ZSCAN9、CCL28、FAM151B、KCNIP1、NR3C2、RECQL4、TDRD1、ZSWIM1、CCL3、FAM153A、KCNIP2、NR4A1、RECQL5、TDRD10、ZSWIM2、CCL3L1、FAM153B、KCNIP3、NR4A2、REEP1、TDRD12、ZSWIM3、CCL3L3、FAM154A、KCNIP4、NR4A3、REEP2、TDRD3、ZSWIM4、CCL4、FAM154B、KCNJ1、NR5A1、REEP3、TDRD5、ZSWIM5、CCL4L1、FAM155A、KCNJ10、NR5A2、REEP4、TDRD6、ZSWIM6、CCL4L2、FAM155B、KCNJ11、NR6A1、REEP5、TDRD7、ZSWIM7、CCL5、FAM156A、KCNJ12、NRAP、REEP6、TDRD9、ZSWIM8、CCL7、FAM156B、KCNJ13、NRARP、REG1A、TDRKH、ZUFSP、CCL8、FAM157A、KCNJ14、NRAS、REG1B、TDRP、ZW10、CCM2、FAM157B、KCNJ15、NRBF2、REG3A、TEAD1、ZWILCH、CCM2L、FAM159A、KCNJ16、NRBP1、REG3G、TEAD2、ZWINT、CCNA1、FAM159B、KCNJ18、NRBP2、REG4、TEAD3、ZXDA、CCNA2、FAM160A1、KCNJ2、NRCAM、REL、TEAD4、ZXDB、CCNB1、FAM160A2、KCNJ3、NRD1、RELA、TEC、ZXDC、CCNB1IP1、FAM160B1、KCNJ4、NRDE2、RELB、TECPR1、ZYG11A、CCNB2、FAM160B2、KCNJ5、NREP、RELL1、TECPR2、ZYG11B、CCNB3、FAM161A、KCNJ6、NRF1、RELL2、TECR、ZYX、CCNC、FAM161B、KCNJ8、NRG1、RELN、TECRL、ZZEF1、CCND1、FAM162A、KCNJ9、NRG2、RELT、TECTA、ZZZ3、CCND2、FAM162B、KCNK1、NRG、RHODOPSIN、RdCVF、RdCVFL、GIRK、DUX4、 And DBET (or DBET lncRNA).
Examples
The following examples are given for the purpose of illustrating various embodiments of the invention and are not meant to limit the invention in any way. The present examples and methods described herein presently represent preferred embodiments, are exemplary, and are not intended to limit the scope of the invention. Variations therein and other uses encompassed by the spirit of the invention as defined by the scope of the claims will occur to those skilled in the art.
Example 1 combinatorial modulator screening
The activation domains of human and viral transcriptional regulators identified in the high throughput screen were tested as single or tandem fusions targeting GFP reporter. As shown in fig. 1A, the single activator (top sub-panel) showed weak to moderate activation of GFP, while the combined modulator (bottom sub-panel) showed strong activation, independent of the orientation of the modulators. As shown in fig. 1B, the combination of modulators activated GFP to levels greater than the sum of the individual modulators, which is a marker of transcriptional synergy.
100 Putative modulators previously identified from 85aa blocks (tiles) across the human and viral proteomes were used as initial candidates. As shown in fig. 2A, a panel of 85 modulators (50 viruses, 35 humans) was selected from the candidate gene modulators for testing in a combinatorial screen along with positive and negative controls. The selected blocks represent weak, medium and strong activators identified in the initial screen (fig. 2B). As shown in FIG. 2C, homology clusters demonstrated a high degree of sequence diversity between candidate modulators and further confirmed that the library block contained multiple predicted protein domains (FIG. 2D).
As shown in fig. 3A, 100 candidate modulators were synthesized as a or B variants with unique codon usage and homology arms to facilitate targeted cloning into the grna+pyl1 lentiviral expression vector. The K562 cells harboring the unstable GFP reporter gene were engineered to stably express dCasMINI-ABI fusion (fig. 3B). Upon addition of the plant hormone abscisic acid (ABA), ABI dimerizes with PYL1, causing the regulator to be recruited to the reporter locus. As shown in FIG. 3C, recruitment and reversibility of the reporter system was verified using K562-GFP-dCasMINI-ABI cells transduced with lentiviruses encoding PYL 1-activator fusions and gRNAs targeting GFP reporter genes. Robust GFP activation was observed 24 hours after ABA treatment. After 7 days of ABA elution, GFP expression had recovered to levels comparable to DMSO controls.
Referring to FIG. 4, K562-GFP reporter cells expressing dCasMINI-ABI were transduced with a PYL1-ComboMod fusion library and treated with DMSO or ABA. GFPON cells were sorted by FACS into 4 discrete bins representing the quartiles of GFP expression. Genomic DNA was harvested and libraries were prepared from 4 GFPON intervals and GFPOFF intervals in 4 replicate samples. The modulator combinations in each interval were identified by double-ended sequencing and enrichment in the GFPON vs GFPOFF interval was determined using DESEQ.
As shown in fig. 5A, DESEQ2 analysis of ComboMod enrichments identified 1393 hits with FDR < 0.05. As shown in fig. 5B, most of the strong activators consisted of 1 or more viral activators (left side), and some human-human combinations were also identified as strong activators (right side). The heat map indicating the activation states of all 100×100 combination modulators shows that strong activation combinations tend to cluster along a row or column, highlighting the predictability of a few modulators in driving strong combinations (fig. 5C).
As shown in fig. 6, the biochemical and biophysical characteristics of the combination modulator were evaluated. The strong activation combination enriches the negative electrostatic potential (fig. 6A), and generally requires a beta factor (a measure of structural flexibility) of one of the partners between 35-60 (fig. 6B).
The selection of 100×100 human and virus-derived modulators demonstrates the relative strength of the activation domains between virus and human groups using a high throughput screening platform for combinations of transcriptional modulators. In some cases, the person-to-person combination is superior to its individual composition regulator. These strong combinatorial activation domains may prove useful in therapeutic settings where immunogenicity is a concern. Furthermore, combinations of weak or moderate activators may prove beneficial in achieving adjustable levels of activation. In some cases, combo Mod hits can modulate endogenous target genes in various cell types.
Example 2 synthetic reporter cell line validation screening
The regulatory effect of the combination modulator identified in example 1 was re-analyzed using synthetic reporter cell line validation screening. Synthetic reporter cells include GFP reporter gene sites and are engineered to express engineered gene effectors. To evaluate the modulation of the combination modulator, the expression of the non-expressed GFP reporter gene at baseline was determined as fold change (log 2 fold change). The results are provided in table 9. The fold change values for each of the combination modulators are contained in the log2 fold change column of table 9. In this screen, cells expressing the combination modulator were sorted into 5 discrete intervals representing the quartiles of gfp_off and gfp_on intensities. Each of these intervals has a specific GFP average fluorescence intensity value. During the analysis, the MFI value of each combination regulator is calculated by weighted summation of the average fluorescence intensities (MFI) over all intervals, e.g. if combo_75 has 90 reads in the OFF interval and 10 counts in the highest GFP interval, the MFI of combo_75 will be (0.9 x gfp_value_off) + (0) + (0) + (0) + (0.1 x gfp_value_highest_on). The MFI value in the bottom quartile is labeled as "weak" activator, the two middle quartiles are labeled as "medium" activator, and the top quartile is labeled as "strong" activator. These categories are shown in the "mfi_status" column of table 9.
Example 3 endogenous Gene verification screening
Modulation of the combination modulator identified in example 1 was analyzed using endogenous gene verification screening. HEK293T cells were engineered to express an engineered gene effector targeting endogenous CD45 genes. To evaluate the modulation of engineered gene effectors, CD45 gene expression was determined as fold change (log 2 FC) at three time points (day 2, day 8 and day 14) after transfection, the CD45 gene not being expressed at baseline. The results are provided in table 10. Fold change values for each engineered gene effector are included in the "day 2_log2fc", "day 8_log2fc", and "day 14_ logFC" columns of table 10. In table 10, the a_peptide and b_peptide sequences beginning with "×" represent the N-terminal stop codons that prevent translation of the remainder of the listed sequences encoded by the nucleic acid and serve as internal controls. The persistence of the modulation of the engineered gene effector was assessed based on fold change values on day 8 and day 14.
Example 4 comparison of engineered Gene effectors-activation of epigenetic resting loci
HEK293 cells were transfected to express engineered gene effectors targeting endogenous CD45 genes (epigenetic resting sites). Some of the engineered gene effectors detected comprise a first peptide and a second peptide. Some of the engineered gene effectors detected are encoded by functional biological sequences generated using computer-implemented methods. To assess the persistence of the modulation of engineered gene effectors, the expression of CD45 gene, which was not expressed at baseline, was assayed at day 9 as fold change values (fig. 7A-7B).
Example 5 comparison of engineered Gene effectors-reactivation of epigenetic silencing sites
HEK293 cells were transfected to express engineered gene effectors targeting epigenetic silencing sites, including GFP reporter gene sites. To evaluate the modulation of engineered gene effectors, the percentage of transfected cells expressing GFP (gfp+) was calculated on days 2 and 6 (fig. 8).
Example 6 Generation of functional biological sequences Using evolved Monte Carlo
In this embodiment, evolved monte carlo is a statistical sampling method/means for finding new functional biological sequences. As depicted in fig. 9A-9D, a list of random/predefined sequences is provided, iteratively randomly mutating in a computer, and evaluating based on a machine learning model at each iteration. Every few iterations, the sequence list is "shuffled" or "crossed" in an attempt to significantly increase the search space, and then evaluated again on the computer. If the algorithm encounters new sequences with high evaluation in the computer machine learning model, these sequences are stored (e.g., added to a list) and then tested at the wet laboratory level. Machine learning algorithms predict which sequences are functionally biologically active.
Using the above method, 4.6K new sequences predicted to have functional biological activity were generated. Each of these new sequences is then compared to each sequence in the training dataset and their associated edit distance calculated. Edit distance is a measure for determining the degree of similarity or dissimilarity between two sequences, a high edit distance means that two sequences are very dissimilar to each other. The minimum edit distance for each of the 4.6k new sequences was plotted onto the sequences in the original training dataset (fig. 10) and it was confirmed that these new sequences were significantly different from the training dataset (over 55 mismatches on average over a list of 85 sequences), indicating that the machine-learning generated sequences were indeed new and were not merely a reorganization of positive hits in the training dataset. FIG. 11 is a representation of a sequence generated in two dimensions, where it is shown that the new sequence occupies a different space in the 2D view.
Example 7-1 improved Small sample learning-based protein engineering
Design of new protein sequences remains a slow and expensive process due to various protein engineering challenges, and in particular, the number of protein variants that can be experimentally tested in a given experiment is dwarfed compared to the broad overall sequence space, resulting in low hit rates and expensive wet laboratory test cycles. In this example, a small sample learning method of new protein design is described, aimed at speeding up the expensive wet laboratory test cycle, and enabling the use of a small, skewed training dataset (about 10 5 data points, <1% positive hit). The method comprises a semi-supervised transfer learning method and a new evolutionary Monte Carlo Markov chain sampling algorithm, wherein the semi-supervised transfer learning method is used for generating a discrete fitness view for a desired protein function, so that the fitness view is explored more effectively. The performance of the method is proved by experimental screening of the predicted high-fitness gene activating factor, and compared with the existing method, the hit rate is obviously improved. The present method can be readily adapted to other protein engineering and design issues, particularly where the costs associated with obtaining the labeling data are very high.
Designing and optimizing proteins with specific functions is a long-sought goal in protein engineering. Since proteins are composed of amino acid sequences that ultimately determine their structure and function, the protein engineering problem can be re-evolved to find the best mapping from the amino acid sequence s of length L to the biological function f, s.fwdarw.f(s), where f is called the fitness function. Finding the optimal solution for f can be seen as a high-dimensional discrete combination optimization problem. The large size of the protein sequence space (e.g., 20 L possible peptides of length L; 3.84e110 at l=85) and the presence of sensitive and sporadic regions of high fitness in the fitness landscape makes the design of novel proteins an extremely difficult problem.
Traditional methods of novel protein design involve high-throughput, iterative laboratory methods such as directed evolution, deep mutation scanning, and semi-rational design. However, these methods typically require multiple rounds of engineering and analysis, making them cumbersome, expensive, and time consuming. Furthermore, the number of variants that can be tested even in the most advanced laboratories (10 5 to 10 6) is very small compared to the size of the total sequence space, and high throughput screening can be challenging for certain classes of proteins.
In the last decade, the application of machine learning methods to protein engineering has met with tremendous success. In this context, a machine learning model is trained to learn a mapping of sequences to functions (also called fitness functions) and then used to propose new sequences that maximize fitness (and thus the predicted functions). Typically, this is two distinct steps, where fitness is estimated using a machine learning model, and then the sequence-to-function mapping is used to explore the fitness landscape by methods such as Metropolis-Hastings Monte Carlo Search.
In recent years, other methods (e.g., generating models) have been proposed to address this problem, including deep generation of networks, generation of antagonism networks, and diffusion models. In these cases, the exploration problem is trivial, as the model creates an embedding in real space (typically low dimensional) and the computational cost of sampling from that space is low. However, these methods typically require a large amount of training data and a large amount of positive examples to ensure that model embedding is meaningful, and do not simply remember the positive examples, which has been widely observed in image GAN. Given the relatively small number of sequences in training data, and the extreme lack of positive examples, it is expected that training data that is small and skewed will prove insufficient for generating modeling methods. On the other hand, migration learning of Large Protein Language Models (LPLM) has been successful in modeling and designing new proteins whose fitness function is trained on a small number of positive hits. While transfer learning and ML-based sequence-to-function mapping are increasingly receiving widespread attention, model-guided fitness graph exploration remains an under-studied problem in the context of protein engineering.
The Metropolis-Hastings Monte Carlo Search (MHMCS) method is a standard method for exploring high-dimensional discrete views (including those generated by machine learning algorithms), however MHMCS suffers from the inability to get rid of deep local optimal solutions. Other methods of sampling sequence space include gradient-based sampling and modified gibbs sampling. While these methods are powerful, they require extensive computation near the local neighborhood of the fitness view and are therefore computationally too extensive for any significant length sequence (e.g., gradient-based methods require 19L computations per iteration, while Gibbs requires L computations).
Evolutionary Monte Carlo (EMC) is an advanced sampling method that is sensitive to MCMC technology from genetic recombination as well as physics-based. While EMC has previously been used for various sampling tasks, its potential as an efficient algorithm to explore fitness graphs of ML generation in the context of protein engineering has not been explored. In this example, EMC was modified as a search tool for exploring complex fitness patterns of protein sequences capable of gene regulation, referred to herein as EMC Search (EMCs). While EMCS is far less computationally intensive than gradient-based and gibbs samples (and only slightly more intense than MHMCS), it is expected to benefit from faster convergence (due to parallel tempering) and provide more comprehensive and efficient fitness view exploration (by allowing interpolation at the molecular level between chains).
In general, novel protein sequence design strategies using a small sample transfer learning-based approach have been proposed. While this approach is generally applicable to a wide variety of problems, it is applied to the design of protein-based gene activators. High throughput screening of protein sequences has been previously performed to find novel gene activators and to identify less than 200 sequences that are verified to be positive hits (resulting in hit rates of about 0.5%). The small number of positive examples presents a particular problem for ML-guided engineering, as it is difficult to ensure that the fitness function will extrapolate well outward from a small neighborhood of positive examples in the training set. In this study, it was demonstrated that EMCS not only increases the sequence diversity and novelty of the designed sequences, but it also significantly increases the hit rate of the proposed sequences compared to the original high-throughput screening. Furthermore, EMCS can be initialized from known hits and still identify candidate sequences that are distinct from any original molecule, while MHMCS has difficulty getting rid of the locally optimal solution of the known hit.
High throughput screening and independent validation was previously performed in which the ability of 85aa peptide to activate synthetic gene sites was tested using dCasMini gene expression regulatory subsystem (dCasMini-GEMS). This allowed 173 gene activators ("positive hits") to be identified from the training set of 34217 protein sequences (hit rate of 0.51%). With this data, an attempt is made to train a machine learning model that can predict only from the sequences proteins that can activate the genes. Since the dataset of peptide sequences essentially consists of amino acid strings, each peptide sequence needs to be numerically characterized for use as input to a training supervised classification model. OneHot codes were compared to the transfer learning, using LPLM (ESM-2 model) of 6.5 hundred million parameters as input features for both models, XGBoost model and CNN model, XGBoost model flattened the features by averaging. In the test phase, it was found that the transfer learning significantly improved the predictions for both models, and the proposed sequence for each model appeared to capture different features of the training data. In fact, this is not surprising, since for XGBoost, the flattening of the mean of feature embedding amounts to training the global features of these peptide sequences, whereas the CNN model is able to learn the local features. Thus, both models are used with transfer learning to design molecules (fig. 12A), respectively, and with integrated models based on transfer learning to take advantage of global and local features learned by XGBoost and CNN models, respectively.
The MHMCS algorithm operates by proposing a small number of mutations to modify the current molecule and then evaluating the fitness of the new molecule, accepting the scheme if fitness increases, but accepting the scheme with a probability weighted by the ratio of the proposed fitness to the current fitness if fitness decreases. The latter possibility ensures that sub-optimal movements can be made to ensure that the search can be free of locally optimal solutions, but MHMCS tends to be difficult to handle for extremely deep optimal solutions.
The Evolutionary Monte Carlo Search (EMCS) extends the traditional Metropolis-Hastings Monte Carlo search (MHMCS) by introducing gene crossover events in the parallel tempering setup. In parallel tempering, multiple MHMCS chains run simultaneously at different temperatures (referred to as temperature gradients) and are exchanged at two randomly selected temperatures after a predetermined number of iterations. The main advantage of parallel tempering is that it allows MHMCS to occur within a larger search radius without sacrificing resolution. EMCS is based on parallel tempering by adding gene crossover events (domain exchange by chain interpolation). This allows for even larger search radii (fig. 12B) while also increasing the likelihood of favorable protein domain aggregation, which is assumed to be critical for utilizing a small number of positive hits in the training data.
Algorithm 1 below describes in detail the implementation of evolved monte carlo as a search tool. EMCS is highly versatile and allows for distinct exploration behavior compared to traditional sampling techniques due to the implementation of custom temperature gradients and predefined crossover, mutation and exchange rates. These parameters can be adjusted for more efficient exploration based on specific design issues and the complexity of the discrete high-dimensional fitness view. Each major iteration in EMCS may potentially change the state of the algorithm in one of three ways, namely point mutations, exchanges, and crossings between different temperature chains. The likelihood that each of these three classes accepts sub-optimal movement depends on how the acceptance criteria are defined. Rmh, which is the standard Boltzmann Metropolis-Hastings acceptance criterion, is used for mutation-based movements, which, as previously described, accepts suboptimal movements with the probability of being weighted by the ratio of the proposed fitness to the current fitness. For the exchange between two consecutive chains rre is used as standard parallel tempering criterion. Using this criterion, any proposed exchange in which to propose moving the higher fitness sequence to a lower temperature chain will be accepted. In the exchange where it is proposed to move a higher fitness sequence to a higher temperature, the movement is accepted with a probability inversely proportional to the magnitude of the temperature difference of the two chains and the fitness of the two sequences. Finally, the crossover criterion r c accepts the crossover movement (taking into account both the difference in fitness of the generated new sequence set to the old sequence and the difference in temperature of the two chains involved in the crossover). Notably, in general, the crossover criterion penalizes the overall decrease in fitness when both chains are considered.
Protein fitness maps are known to be highly sensitive, multimodal, and rugged, reflecting the possibility that complete loss of function may occur due to relatively small numbers of point mutations (e.g., mutations in the catalytic domain, mutations that result in misfolding). The complexity of such space presents significant challenges for efficient exploration. Here, we compared how EMCS and MHMCS explored the discrete fitness view of the 85aa protein, respectively, that was able to activate the gene, and evaluated the prediction success rate, sequence diversity and convergence rate.
For experimental verification, EMCS and MHMCS were used to design novel proteins using all three models (XGBoost, CNN and integration). Together, EMCS and MHMCS designed 4600 new sequences, which were very different from the sequence space occupied by the original training data (fig. 13), confirming that both model-directed sampling techniques could propose different novel proteins. To ensure accurate identification of gene activators in the experimental validation, 300 previously validated negative controls (random sequences) were also included in the library. The ability of the peptide to activate the gene locus was then tested experimentally. In total 357 positive hits (hit rate 7.59%) were identified, where positive hits refer to the peptide found to be significantly above background fluorescence and activated the synthetic gene reporter. In contrast, the hit rate of the initial screening was only 0.51%. If the latter number is used as a representation of a portion of the naturally occurring 85aa peptide sequence that can act as a consensus gene activator, the designed peptide will have an increase in hit rate of about 15-fold. In fact, the best model guided sampling technique (ensemble model+emcs from known hits) increases hit rate by a factor of 45 (table 1). Even if initialized from known sequences, the proposed peptides are very different from any peptide in the training set, indicating that EMCS works very well in traversing fitness graphs and identifying high fitness peptides.
TABLE 1 Positive hit results for Integrated model
Searching method Initialization of Sequence # # Positive hit Percentage of hits
EMCS Is known to be 410 94 22.9%
EMCS Random arrangement 390 39 10%
MHMCS Random arrangement 200 2 1%
EMCS is unique in its ability to find the optimal solution corresponding to different sequences, even starting from known positive hit sequences (and thus already high fitness sequences). It is shown herein that the final edit distance of the sequence found by EMCS is significantly higher when starting from a known positive sequence compared to the sequence found by MHMCS using similar temperature conditions. Consistently, using entropy as a measure of information change, the average entropy change for each of the EMCS and MHMCS in 10 7 iterations was calculated (fig. 14A), and it was shown that the average entropy change for each iteration of EMCS (using a crossover rate of 0.5 and a default parameter for 4 total chains) was about 3 times higher than MHMCS (assuming the mutation rate is the same).
The proposed increase in sequence diversity and entropy increase per iteration observed with EMCS is inferred to be due to a genetic crossing step, in which functionally beneficial protein domains can be exchanged between known sequences, followed by further refinement by point mutation. Since the initial sequence already occupies a high fitness domain, the final sequence diversity is ensured by forcing the algorithm to run a predetermined number of iterations. The elimination of local minima is further facilitated by the introduction of a temperature gradient (which allows increasing the search radius). In contrast MHMCS is limited to a single temperature and only has access to domains in the fitness function that are accessible only by point mutations. This hampers MHMC's ability to converge to domains corresponding to different sequences when starting from a known positive sequence, as it requires many optimal movements to get rid of the locally optimal solution of the original sequence.
To assess how fast EMCS converges to a high fitness sequence relative to MHMCS, a computer simulated sampling experiment was performed in which 4000 heuristics were performed on fitness graphs with each algorithm using the same and controlled initial conditions.
When initialized with a random sequence, EMCS converges 1.25-5 times faster than MHMCS, depending on the choice of temperature and crossover rate, probably due to the increased versatility of the algorithm over MHMCS. Using default parameters, 1171 EMCS runs achieve convergence, where each run obtains at least one sequence with a fitness of 0.95 or greater. Furthermore, since 4 strands are involved, EMCS produces on average 2.322 sequences with fitness ≡0.5 per run, giving a total of n=2720 sequences with fitness ≡0.5 for 1171 runs. For MHMCS, the minimum failure rate of the chain starting at temperatures above 2.5X10 -2 or higher was 50% and was discarded from the experiment. When these sequences were excluded, a total of n=2571 sequences were obtained from 2571 runs. Wherein the adaptability of 2361 sequences is more than or equal to 0.95. The remaining 210 failed to converge, but still had a final fitness of 0.5 or more. Fig. 14B together shows the average number of iterations that converge.
In this embodiment, a two-step machine learning and sampling method is proposed for protein engineering problems with limited training data and sparse positive hits. The method involves estimating an adaptation map using a Large Protein Language Model (LPLM) with transfer learning, and then effectively sampling the adaptation map with an Evolved Monte Carlo Search (EMCS) to propose a novel high-adaptation protein sequence. As a proof of concept, this approach was applied to the problem of designing small gene activators and showed that this approach successfully designed new and diverse protein sequences with significantly higher validation rates as gene activators when compared to the more traditional sampling method (MHMCS) or baseline discovery of high throughput screening.
Given the complexity of the wet laboratory test cycle, the importance of this approach is amplified by single round screening involving library design, DNA synthesis, plasmid cloning, viral packaging, cell line infection, fluorescence Activated Cell Sorting (FACS), DNA library preparation, next generation DNA sequencing, and downstream bioinformatics analysis. Furthermore, in the field of rational protein engineering, multiple rounds of iterative screening are often required to discover and verify novel proteins with desired functions. In view of the financial, time and technical costs associated with wet laboratory test cycles, it would be of significant value to accelerate this process to ease the experimental burden of protein engineering. In addition to accelerating new protein designs, this approach opens up many new approaches to existing protein engineering methods (e.g., semi-rational engineering) due to the discovery of new and diverse sequences that can be used to gain a deeper understanding of the desired protein function. The vast protein sequence space, coupled with the computational cost of embedding proteins using LPLM (e.g., ESM-2), requires efficient sampling algorithms that can be freed from locally optimal solutions without losing resolution. EMC algorithms are well suited for this application scenario, as the introduction of temperature gradients allows for multiple acceptance rates to exist simultaneously. Furthermore, the genetic crossing step allows for more efficient exploration of fitness views, as shown by the sequence diversity of MHMCS VS EMCS and the average entropy change per iteration.
The strength of this approach is believed to combine LPLM-mediated transfer learning with EMCS. Since LPLM is trained on a large number of different protein sequences, modern LPLM embedding implicitly contains rich features describing the biochemical, biophysical, evolutionary and even three-dimensional information of the protein, and it is therefore reasonable to think that LPLM embedding of the newly proposed sequence is able to capture the predicted functional consequences of EMCS gene crossover, so that the exchange of inactive proteins that would lead to misfolding is designated as low-fitness and not selected by EMCS. In contrast, with this semi-supervised transfer learning-based model, potential exchanges and domains that can act synergistically will be assigned a high degree of fitness and selected by EMCS even though these domains are evolutionarily unrelated. In contrast, since GAN and diffusion models are sampled from a low dimensional subspace and then passed through the model to obtain the proposed sequence, only sequences in the subspace that are close to the training data can be designed by these methods, and furthermore, there is no guarantee that the high collaborative domains will be close in the subspace (especially if they are not evolutionarily related), which limits the potential diversity of sequences that can be proposed by the generation algorithm trained on limited and skewed training data.
This framework is believed to have many advantages over previous ML-guided protein design methods using traditional sampling techniques, as well as classical laboratory protein engineering methods. First, assays to screen peptides of specific function in diverse natural proteins typically have very low hit rates, while new sequences proposed by this approach have significantly higher hit rates in validation experiments. Furthermore, the small number of positive hits in the training data of protein engineering problems inherently limits the accuracy of fitness functions universality, and this approach is believed to alleviate these drawbacks by exploiting the information from LPLM and incorporating multiple positive hits into the proposed new sequence through EMCS domain exchange. Finally, while proof of concept involves the design of relatively small proteins, it is expected that this approach will be particularly applicable to protein engineering problems involving larger proteins having multiple characterized domains.
The methods described in this example will speed up the design and testing of novel proteins for a variety of purposes, including therapeutic drugs.
Example 7-2 fitness view exploration Using novel EMCS
This example provides additional illustration of example 7-1.
A generative AI platform was constructed that enabled de novo design of ultra-compact regulatory peptides with the ability to transcriptionally up-regulate a genetic locus. By performing high throughput screening, a large amount of training data was collected for the first time to discover novel transcriptional activators among peptides derived from the human, viral and archaeal proteomes. Machine learning ensemble models consisting of decision tree models and convolutional neural networks are trained that utilize transfer learning (embedded by large protein language models) to predict transcriptional activators based on peptide sequences alone. This machine learning platform is used to generate libraries of thousands of ultra-compact peptides predicted as transcriptional activators by using a novel sampling algorithm called the evolutionary monte carlo search to more efficiently traverse the predicted fitness graph.
Experimental screening of these peptides and verification that the generated AI method significantly improved the rate of discovery (up to 45-fold), thereby discovering hundreds of novel transcriptional activators with little sequence similarity to known naturally occurring peptides. Studies of the evolutionary, biochemical and biophysical properties of synthetic activator libraries revealed that validated activators consistently lack conserved functional domains, but do share certain biochemical features such as strong negative electrostatic potentials. The 10 top-ranked synthetic activators were then selected for further characterization and their activation intensity was assessed by screening them at the artificial GFP locus and endogenous human locus (CD 45), and their potency was compared to gold standard activators (e.g., vCD, VP 64). These results demonstrate the ability of machine learning to accelerate the discovery of novel functional peptides, expanding our epigenetic regulator toolkit for future therapeutic applications.
Gene expression regulatory System (GEMS) the gene expression and regulatory System (GEMS) consists of 1) dCasMini 1 compact, programmable DNA binding proteins, 2) one or more guide RNAs, 3) regulatory peptides capable of activating or repressing gene expression. See Xiaoshu Xu et al, ENGINEERED MINIATURE CRISPR-CAS SYSTEM for mammalian genome regulation AND EDITING, molecular Cell, vol.81, stage 20, DOI: 10.1016/j. Molcel.2021.08.008.2021.
After the protein fitness maps were generated as described in example 7-1, the fitness maps were explored using the EMCS algorithm and functional biological sequences were generated. A schematic diagram of the EMCS algorithm components is shown in fig. 15. The EMCS algorithm comprises the following components, 1) running parallel MHMCS, 2) temperature gradient implementation (parallel tempering), and 3) domain exchange between peptide chains running parallel (EMCS). The inspiration of EMCS comes from gene exchange events to allow domain exchange to occur between parallel MHMCS runs, enabling more efficient exploration of fitness views.
Comparison of EMCS with MHMCS the entropy change and convergence time of MHMCS and EMCS iterations are measured. EMCS allows for greater sequence diversity per iteration (as measured by entropy change per iteration (fig. 16A)) and faster convergence time (fig. 16B) when compared to MHMCS. For ablation studies, PTP (parallel tempering) and EMC-NPT (EMCs without parallel tempering) were run.
Experimental verification of functional biological sequences with high predictive fitness EMCS and MHCMS were used to design 4600 new sequences that are very different from the sequence space occupied by the training dataset. The ability of the engineered gene effectors encoded by 4600 new sequences to activate synthetic gene loci was tested. Fig. 17 depicts FACS histograms from engineered gene effector validation experiments. Using standard differential expression analysis procedure (DESEQ 2), 357 out of 4600 engineered gene effectors (hit rate 7.51%) significantly activated the gene reporter compared to background fluorescence. In contrast, the hit rate of the training dataset was 0.51%. A hit rate of 20.6% was achieved using EMCS. The statistics of the differential expression analysis are shown in table 2.
TABLE 2 Positive hit results of sampling algorithm
Algorithm Initialization of Total sequence Number of yang hits Percentage of hits
EMCS All of which 2600 338 13%
MHMCS All of which 2000 18 0.9%
EMCS Is known to be 1310 270 20.6%
EMCS Random arrangement 1290 68 5.3%
MHMCS Random arrangement 2000 18 0.9%
Negative control n/a 300 1 0.33%
Biochemical analysis of experimentally verified functional biological sequences Biochemical and structural characterization of experimentally verified functional biological sequences was studied using ESMFold (FIGS. 18A-18C). In general, functional biological sequences are rich in acidic residues that contribute to the solvent-accessible surface area, are moderately flexible (average β -factor near 45), lack β -sheet, and consist primarily of random coils and α -helices.
Verification screening of engineered gene effectors encoded using computer-implemented methods functional biological sequences 748 were analyzed for modulation of the engineered gene effectors using two verification screens. To evaluate the modulation of engineered gene effectors, the expression was measured as fold change values. The results are shown in Table 11. Fold change values for each engineered gene effector are contained in the "log2 fold change" column of table 11.
The modulation of 10 selected engineered gene effectors (SEQ ID NOs: 1495, 1592, 1595, 1634, 1654, 1665, 1677, 1686, 1689, 1716) encoded by functional biological sequences generated using computer-implemented methods was further analyzed using endogenous gene verification screening. HEK293T cells were engineered to express engineered gene effectors targeting the synthetic site (TRE 3G) and the endogenous site (CD 45). The activation potency of the engineered gene effectors was compared with that of the standard activators VP64 and vCD (fig. 19A-19B).
While preferred embodiments of the present disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. Many modifications, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. The following claims are intended to define the scope of the invention and their equivalents and methods and structures within the scope of these claims and their equivalents are thereby covered.
TABLE 9 synthetic reporter cell line validation screen
Combined bar code Base average_repeat Log2 multiple change_repeat LfcSC _repeat Stat_repeat P-value repeat Padj _repeat
AAAACAAGCATT _A_Concatenation ACTTGATGGTTT _B 7469.02185 -1.040403303 0.175382573 -5.932193182 2.99E-09 9.05E-09
AAAACAAGCATT _A_Concatenation AGAGTGCGCTCT _B 7387.267849 -1.086948505 0.194931444 -5.576055265 2.46E-08 6.83E-08
AAAACAAGCATT _A_Concatenation ATAACTCCACGC _B 6061.978221 1.180519208 0.367776982 3.209877906 0.001327914 0.002240016
AAAACAAGCATT _A_Concatenation ATGACCCCTTGT _B 2403.749781 0.682247077 0.222536964 3.065769681 0.002171105 0.003491089
AAAACAAGCATT _A_Concatenation CTACGTGGCCCC _B 8612.761707 -0.400581227 0.194525524 -2.059273354 0.03946806 0.054666068
AAAACAAGCATT _A_Concatenation GAGATTGTGTCC _B 6440.202817 -0.090413201 0.310217427 -0.291451069 0.770706361 0.800138309
AAACTTTGAGCG _A_Concatenation ACTTGATGGTTT _B 13323.61535 2.387530537 0.270533511 8.825267261 1.09E-18 7.09E-18
AAACTTTGAGCG _A_Concatenation AGAGTGCGCTCT _B 22476.20917 -1.141426426 0.106625969 -10.70495709 9.65E-27 1.19E-25
AAACTTTGAGCG _A_Concatenation ATAACTCCACGC _B 5482.672468 -0.789640258 0.255375687 -3.092072962 0.00198764 0.003222678
AAACTTTGAGCG _A_Concatenation ATGACCCCTTGT _B 2441.594848 1.208704527 0.921165047 1.312147623 0.18947033 0.232863979
AAACTTTGAGCG _A_Concatenation CTACGTGGCCCC _B 6556.699452 0.903854866 0.418728636 2.158569506 0.030883581 0.043340823
AAACTTTGAGCG _A_Concatenation GAGATTGTGTCC _B 9320.034414 -0.161969825 0.280907028 -0.576595845 0.564212507 0.611840835
AAGACCGGTGCC _A_Concatenation ACTTGATGGTTT _B 25897.64111 3.464806982 0.65886586 5.258744141 1.45E-07 3.74E-07
AAGACCGGTGCC _A_Concatenation AGAGTGCGCTCT _B 33374.15702 -1.003330918 0.122113515 -8.216378968 2.10E-16 1.12E-15
AAGACCGGTGCC _A_Concatenation ATAACTCCACGC _B 1643.231853 1.795202191 0.509664247 3.522323181 0.000427782 0.000771725
AAGACCGGTGCC _A_Concatenation ATGACCCCTTGT _B 4009.103478 -0.453029757 0.209174418 -2.165799055 0.030326546 0.04266153
AAGACCGGTGCC _A_Concatenation CTACGTGGCCCC _B 8409.892445 1.537955136 0.257238103 5.978722113 2.25E-09 6.96E-09
AAGACCGGTGCC _A_Concatenation GAGATTGTGTCC _B 13132.04427 2.593584989 0.411247838 6.306622788 2.85E-10 9.55E-10
AAGCAACGATAT _A_Concatenation ACTTGATGGTTT _B 3378.329397 -1.323743527 0.41128631 -3.218545075 0.001288427 0.002179698
AAGCAACGATAT _A_Concatenation AGAGTGCGCTCT _B 3940.909635 -1.986502561 0.450370562 -4.410817953 1.03E-05 2.17E-05
AAGCAACGATAT _A_Concatenation ATAACTCCACGC _B 2697.652523 -1.489541924 0.209024082 -7.126173742 1.03E-12 4.17E-12
AAGCAACGATAT _A_Concatenation ATGACCCCTTGT _B 869.8236778 -2.095312326 0.422387426 -4.960640862 7.03E-07 1.68E-06
AAGCAACGATAT _A_Concatenation CTACGTGGCCCC _B 6484.719312 -3.075695623 0.248034305 -12.40028318 2.60E-35 5.07E-34
AAGCAACGATAT _A_Concatenation GAGATTGTGTCC _B 2509.268634 -0.411111801 0.276606552 -1.486269206 0.137207928 0.173027042
AAGGGTAAATTG _A_Concatenation ACTTGATGGTTT _B 3262.476599 -0.355997737 0.422611495 -0.842375896 0.399577581 0.453501157
AAGGGTAAATTG _A_Concatenation AGAGTGCGCTCT _B 2887.363728 -1.810975681 0.447301992 -4.048664468 5.15E-05 0.000101034
AAGGGTAAATTG _A_Concatenation ATAACTCCACGC _B 978.4522766 2.353539847 0.91146247 2.582157714 0.00981847 0.014715118
AAGGGTAAATTG _A_Concatenation ATGACCCCTTGT _B 791.385993 1.665402238 0.465170025 3.580201109 0.00034333 0.000627113
AAGGGTAAATTG _A_Concatenation CTACGTGGCCCC _B 1003.98086 2.17909415 0.812043279 2.683470458 0.007286241 0.011134139
AAGGGTAAATTG _A_Concatenation GAGATTGTGTCC _B 1649.94748 1.424926374 0.406836948 3.502450747 0.000460999 0.000827815
AATCAACGAGCA _A_Concatenation ACTTGATGGTTT _B 12070.22463 2.576730106 0.120841558 21.32321148 6.91E-101 1.15E-98
AATCAACGAGCA _A_Concatenation AGAGTGCGCTCT _B 13133.94992 -0.844786717 0.200644792 -4.210359548 2.55E-05 5.11E-05
AATCAACGAGCA _A_Concatenation ATAACTCCACGC _B 1510.276102 0.615663624 0.673737859 0.913802921 0.360820411 0.415156556
AATCAACGAGCA _A_Concatenation ATGACCCCTTGT _B 1381.655393 -1.555051014 0.809373923 -1.921301107 0.054693758 0.074172857
AATCAACGAGCA _A_Concatenation CTACGTGGCCCC _B 2764.999347 2.745767132 0.75190951 3.651725501 0.000260484 0.000481052
AATCAACGAGCA _A_Concatenation GAGATTGTGTCC _B 3731.257045 4.224890191 0.455970213 9.265715337 1.94E-20 1.45E-19
AATCGAGACCAG _A_Concatenation ACTTGATGGTTT _B 2278.810307 4.842341098 0.497473125 9.733874768 2.16E-22 1.94E-21
AATCGAGACCAG _A_Concatenation AGAGTGCGCTCT _B 5190.119865 -0.203301362 0.245934378 -0.826648815 0.408436141 0.462208953
AATCGAGACCAG _A_Concatenation ATAACTCCACGC _B 507.8189228 3.808431435 0.731717765 5.204781977 1.94E-07 4.97E-07
AATCGAGACCAG _A_Concatenation ATGACCCCTTGT _B 1756.9988 -2.14025026 0.628488836 -3.405391052 0.000660694 0.001172004
AATCGAGACCAG _A_Concatenation CTACGTGGCCCC _B 882.0555616 2.424267755 0.982842294 2.46658876 0.013640692 0.020159252
AATCGAGACCAG _A_Concatenation GAGATTGTGTCC _B 824.0000993 2.200773775 1.458282534 1.509154587 0.131259282 0.166423103
AATCGATCCTAC _A_Concatenation ACTTGATGGTTT _B 17339.39708 2.426968449 0.255050785 9.515628212 1.81E-21 1.52E-20
AATCGATCCTAC _A_Concatenation AGAGTGCGCTCT _B 22599.66373 -1.414848875 0.176516654 -8.015384622 1.10E-15 5.51E-15
AATCGATCCTAC _A_Concatenation ATAACTCCACGC _B 2936.432464 0.928705312 0.52511793 1.76856523 0.076966458 0.101780304
AATCGATCCTAC _A_Concatenation ATGACCCCTTGT _B 1616.545275 0.232700454 0.828555812 0.280850668 0.778824939 0.807849471
AATCGATCCTAC _A_Concatenation CTACGTGGCCCC _B 4446.73136 2.32772015 0.511829202 4.547845536 5.42E-06 1.20E-05
AATCGATCCTAC _A_Concatenation GAGATTGTGTCC _B 7729.401642 0.444776222 0.308954857 1.439615568 0.149976193 0.187110106
AATGGCACTAGC _A_Concatenation ACTTGATGGTTT _B 10102.71102 -0.268726749 0.085880989 -3.129059804 0.001753666 0.002871198
AATGGCACTAGC _A_Concatenation AGAGTGCGCTCT _B 10306.49689 -1.186214715 0.292596039 -4.054103805 5.03E-05 9.89E-05
AATGGCACTAGC _A_Concatenation ATAACTCCACGC _B 2919.532481 1.860377656 0.510029589 3.647587703 0.000264714 0.000488092
AATGGCACTAGC _A_Concatenation ATGACCCCTTGT _B 2118.166198 0.600955949 0.664566427 0.904282739 0.365845497 0.419699103
AATGGCACTAGC _A_Concatenation CTACGTGGCCCC _B 6611.714743 2.735642296 0.286368727 9.552866776 1.26E-21 1.07E-20
AATGGCACTAGC _A_Concatenation GAGATTGTGTCC _B 5320.528702 1.864124959 0.376850975 4.946583887 7.55E-07 1.80E-06
ACAACGTGCATA _A_Concatenation ACTTGATGGTTT _B 21971.69836 2.529439324 0.433294204 5.837694809 5.29E-09 1.55E-08
ACAACGTGCATA _A_Concatenation AGAGTGCGCTCT _B 31332.33425 0.439669885 0.092301795 4.763394721 1.90E-06 4.37E-06
ACAACGTGCATA _A_Concatenation ATAACTCCACGC _B 2120.150699 0.160206921 0.455632511 0.351614332 0.725127516 0.762296822
ACAACGTGCATA _A_Concatenation ATGACCCCTTGT _B 595.738524 2.927637481 1.144503853 2.557997051 0.010527699 0.015717598
ACAACGTGCATA _A_Concatenation CTACGTGGCCCC _B 2344.026099 0.334332536 0.385376996 0.867546687 0.385642543 0.439820617
ACAACGTGCATA _A_Concatenation GAGATTGTGTCC _B 2222.672156 1.792509699 0.665283327 2.694355361 0.007052493 0.010833593
ACCAGTATTTAA _A_Concatenation ACTTGATGGTTT _B 6842.74488 2.911864238 0.225216677 12.92916793 3.08E-38 6.67E-37
ACCAGTATTTAA _A_Concatenation AGAGTGCGCTCT _B 11999.0185 -1.52760053 0.16119521 -9.476711712 2.62E-21 2.15E-20
ACCAGTATTTAA _A_Concatenation ATAACTCCACGC _B 893.4465851 1.950587978 0.739965447 2.636052785 0.008387668 0.012750565
ACCAGTATTTAA _A_Concatenation ATGACCCCTTGT _B 1820.930458 -0.442283758 0.502796338 -0.87964793 0.379050065 0.433147142
ACCAGTATTTAA _A_Concatenation CTACGTGGCCCC _B 3528.505796 1.127777138 0.508957386 2.215857691 0.02670125 0.037880778
ACCAGTATTTAA _A_Concatenation GAGATTGTGTCC _B 2324.79601 1.398548452 0.466655397 2.996961916 0.002726848 0.004342895
ACGACTCACCGC _A_Concatenation ACTTGATGGTTT _B 3960.030496 -0.642872873 0.280369732 -2.292946778 0.021851069 0.031458005
ACGACTCACCGC _A_Concatenation AGAGTGCGCTCT _B 1717.102401 0.424418205 0.323890002 1.310377607 0.190068125 0.233147575
ACGACTCACCGC _A_Concatenation ATAACTCCACGC _B 1894.570068 2.58861963 0.368778889 7.019435521 2.23E-12 8.74E-12
ACGACTCACCGC _A_Concatenation ATGACCCCTTGT _B 2044.618388 -0.340326224 0.468220124 -0.726850911 0.467317292 0.517814137
ACGACTCACCGC _A_Concatenation CTACGTGGCCCC _B 3619.785732 -0.260639191 0.238633403 -1.092215875 0.274738226 0.326723282
ACGACTCACCGC _A_Concatenation GAGATTGTGTCC _B 4933.04858 0.453573298 0.149682824 3.030229421 0.00244368 0.003907883
ACGGCGTAAGTA _A_Concatenation ACTTGATGGTTT _B 1811.61715 -0.487064438 0.382745341 -1.272554846 0.20317605 0.24715172
ACGGCGTAAGTA _A_Concatenation AGAGTGCGCTCT _B 1162.070148 -1.591657956 0.722312875 -2.203557504 0.027555479 0.038950852
ACGGCGTAAGTA _A_Concatenation ATAACTCCACGC _B 4381.024234 -3.299574658 0.32018985 -10.30505701 6.69E-25 7.17E-24
ACGGCGTAAGTA _A_Concatenation ATGACCCCTTGT _B 514.0760198 -0.457951366 0.669465927 -0.684054777 0.493940577 0.544219165
ACGGCGTAAGTA _A_Concatenation CTACGTGGCCCC _B 4299.137056 -3.323231079 0.230893709 -14.39290439 5.73E-47 1.92E-45
ACGGCGTAAGTA _A_Concatenation GAGATTGTGTCC _B 3104.860143 1.613015059 0.576126646 2.799757777 0.005114097 0.007949972
ACTCGTGTTGTC _A_Concatenation ACTTGATGGTTT _B 1050.574385 1.622547696 0.510102131 3.180829086 0.001468542 0.002451286
ACTCGTGTTGTC _A_Concatenation AGAGTGCGCTCT _B 1322.899137 3.942212278 1.368158789 2.881399667 0.003959133 0.006220734
ACTCGTGTTGTC _A_Concatenation ATAACTCCACGC _B 421.6768423 0.531799619 0.91774906 0.579460816 0.562278279 0.610309478
ACTCGTGTTGTC _A_Concatenation ATGACCCCTTGT _B 253.6992619 -1.977561462 1.263779197 -1.564799822 0.117629854 0.150612595
ACTCGTGTTGTC _A_Concatenation CTACGTGGCCCC _B 1102.362215 -2.401459546 0.358660446 -6.695635316 2.15E-11 7.87E-11
ACTCGTGTTGTC _A_Concatenation GAGATTGTGTCC _B 599.6244131 -1.692520339 0.704063249 -2.403932234 0.01621978 0.023701153
ACTTGATGGTTT _A_Concatenation AAAACAAGCATT _B 11348.04144 -0.631795346 0.147881135 -4.272318736 1.93E-05 3.93E-05
ACTTGATGGTTT _A_Concatenation AAACTTTGAGCG _B 4286.67209 3.621900716 0.700943248 5.167181121 2.38E-07 6.03E-07
ACTTGATGGTTT _A_Concatenation AAGACCGGTGCC _B 32843.58417 3.007151918 0.321073975 9.365916121 7.54E-21 5.80E-20
ACTTGATGGTTT _A_Concatenation AAGCAACGATAT _B 11069.81009 -1.589382789 0.214342634 -7.415150028 1.21E-13 5.32E-13
ACTTGATGGTTT _A_Concatenation AAGGGTAAATTG _B 16985.07741 -0.300132874 0.262953633 -1.141390864 0.253707305 0.305127407
ACTTGATGGTTT _A_Concatenation AATCAACGAGCA _B 14639.35486 0.774738674 0.327237448 2.367512271 0.017908126 0.026070485
ACTTGATGGTTT _A_Concatenation AATCGAGACCAG _B 12317.3212 3.662739142 0.527232637 6.947102445 3.73E-12 1.43E-11
ACTTGATGGTTT _A_Concatenation AATCGATCCTAC _B 23440.72691 1.578956967 0.197417818 7.998046905 1.26E-15 6.29E-15
ACTTGATGGTTT _A_Concatenation AATGGCACTAGC _B 16725.02771 -0.992954978 0.234060033 -4.24230898 2.21E-05 4.47E-05
ACTTGATGGTTT _A_Concatenation ACAACGTGCATA _B 12335.91093 2.652736942 0.590834304 4.489815371 7.13E-06 1.54E-05
ACTTGATGGTTT _A_Concatenation ACCAGTATTTAA _B 11720.82305 4.047652121 0.215492651 18.78324899 1.04E-78 1.34E-76
ACTTGATGGTTT _A_Concatenation ACGACTCACCGC _B 7791.576043 -1.869186077 0.268761385 -6.954816357 3.53E-12 1.37E-11
ACTTGATGGTTT _A_Concatenation ACGGCGTAAGTA _B 2913.817666 -1.369994186 0.171695678 -7.9792002 1.47E-15 7.26E-15
ACTTGATGGTTT _A_Concatenation ACTCGTGTTGTC _B 14530.75956 -1.363700034 0.299993635 -4.545763219 5.47E-06 1.21E-05
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B 23510.81628 2.238158842 0.248502199 9.006595734 2.13E-19 1.42E-18
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B 10521.75788 1.641436377 0.509596554 3.221050777 0.001277215 0.002163862
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B 23510.81628 2.238158842 0.248502199 9.006595734 2.13E-19 1.42E-18
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B 10521.75788 1.641436377 0.509596554 3.221050777 0.001277215 0.002163862
ACTTGATGGTTT _A_Concatenation ACTTTAGTAACA _B 12277.19587 1.900877328 0.311006317 6.112021592 9.84E-10 3.13E-09
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B 21276.8072 0.133140261 0.125115242 1.064141021 0.28726482 0.339349702
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B 11837.25375 -0.324609741 0.171120831 -1.896962157 0.057832933 0.078248494
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B 21276.8072 0.133140261 0.125115242 1.064141021 0.28726482 0.339349702
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B 11837.25375 -0.324609741 0.171120831 -1.896962157 0.057832933 0.078248494
ACTTGATGGTTT _A_Concatenation AGCCAGGTCACC _B 5978.047799 2.107638151 0.413324039 5.099239226 3.41E-07 8.52E-07
ACTTGATGGTTT _A_Concatenation AGCCATAAACCT _B 26726.26915 2.458905498 0.236799357 10.38391966 2.93E-25 3.27E-24
ACTTGATGGTTT _A_Concatenation AGCCGTGCTTGA _B 18594.33083 0.274028871 0.241248431 1.135878355 0.256007478 0.306958442
ACTTGATGGTTT _A_Concatenation AGGCTTCCCGCT _B 14155.14134 -1.986380753 0.191866543 -10.35292931 4.06E-25 4.48E-24
ACTTGATGGTTT _A_Concatenation AGTTAGGCTCTT _B 25624.83108 3.209779933 0.260791446 12.30784206 8.22E-35 1.53E-33
ACTTGATGGTTT _A_Concatenation AGTTGCTTTGCT _B 14935.50261 -1.727278751 0.127356031 -13.56259876 6.67E-42 1.90E-40
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B 3444.014411 0.795533404 0.514101589 1.547424517 0.121760913 0.155391384
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B 2366.118225 2.564433254 0.805630394 3.183138661 0.001456878 0.002436467
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B 3444.014411 0.795533404 0.514101589 1.547424517 0.121760913 0.155391384
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B 2366.118225 2.564433254 0.805630394 3.183138661 0.001456878 0.002436467
ACTTGATGGTTT _A_Concatenation ATCCAATCTGTG _B 2718.543771 2.237143787 0.570719735 3.919864074 8.86E-05 0.000169236
ACTTGATGGTTT _A_Concatenation ATCTCTCAGTTC _B 19385.26089 1.736645584 0.183084737 9.485474405 2.41E-21 1.99E-20
ACTTGATGGTTT _A_Concatenation ATGACAAAACGA _B 27241.70132 3.096207993 0.437322045 7.079926637 1.44E-12 5.72E-12
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B 2812.370378 1.450251748 0.273379604 5.304901051 1.13E-07 2.95E-07
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B 1623.761158 0.90573465 0.578114334 1.566705055 0.117183649 0.150205795
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B 2812.370378 1.450251748 0.273379604 5.304901051 1.13E-07 2.95E-07
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B 1623.761158 0.90573465 0.578114334 1.566705055 0.117183649 0.150205795
ACTTGATGGTTT _A_Concatenation ATGTGGCACGAC _B 58177.43919 0.206162766 0.158378429 1.301709877 0.193015582 0.236514901
ACTTGATGGTTT _A_Concatenation ATTGTATCTAAC _B 48074.83443 -0.344042899 0.151937482 -2.264371467 0.023551277 0.033739514
ACTTGATGGTTT _A_Concatenation ATTTGTAGACCG _B 53805.29134 -1.473380947 0.14756178 -9.984841271 1.78E-23 1.69E-22
ACTTGATGGTTT _A_Concatenation CAACGCATGCCT _B 47706.65595 -1.793839835 0.070898002 -25.3016979 3.06E-141 1.79E-138
ACTTGATGGTTT _A_Concatenation CACACCCGGCAG _B 23727.67497 3.822012097 0.467392341 8.177310071 2.90E-16 1.53E-15
ACTTGATGGTTT _A_Concatenation CAGTGACATAGT _B 60768.71893 -0.970787003 0.128294177 -7.566882792 3.82E-14 1.72E-13
ACTTGATGGTTT _A_Concatenation CATCTCACCTAG _B 27692.38789 2.256177685 0.209312674 10.77898267 4.33E-27 5.56E-26
ACTTGATGGTTT _A_Concatenation CATCTGAGTCGT _B 25.17218896 4.063728066 0.630791559 6.442267671 1.18E-10 4.06E-10
ACTTGATGGTTT _A_Concatenation CATGCTAACACC _B 7406.969113 1.297044065 0.217472678 5.964170203 2.46E-09 7.58E-09
ACTTGATGGTTT _A_Concatenation CATTAGTCCCGC _B 7157.117336 3.439595984 0.617479127 5.570384214 2.54E-08 7.04E-08
ACTTGATGGTTT _A_Concatenation CATTGTTTAAAT _B 23796.72389 2.635085182 0.142744837 18.46010856 4.32E-76 4.60E-74
ACTTGATGGTTT _A_Concatenation CCAGACGGTCTG _B 20848.11149 -2.447165503 0.139140853 -17.58768501 3.06E-69 2.56E-67
ACTTGATGGTTT _A_Concatenation CCCCTTAGGAAT _B 7011.335449 0.375257602 0.460126006 0.815553995 0.414755271 0.468324396
ACTTGATGGTTT _A_Concatenation CCCGTAGGGGCT _B 1330.549239 2.995010169 0.765340228 3.913305559 9.10E-05 0.000173617
ACTTGATGGTTT _A_Concatenation CCCTATGTTCTA _B 8024.750138 2.499547228 0.357836611 6.98516348 2.85E-12 1.11E-11
ACTTGATGGTTT _A_Concatenation CCCTGCCTCGGG _B 22680.77655 -1.178843609 0.184397354 -6.392952942 1.63E-10 5.56E-10
ACTTGATGGTTT _A_Concatenation CCGCCCTTATGT _B 5528.662203 0.737085845 0.32237339 2.286435134 0.022228815 0.031962466
ACTTGATGGTTT _A_Concatenation CCTAAATGCAAG _B 15691.99757 -1.359604525 0.096851137 -14.03808537 9.11E-45 2.88E-43
ACTTGATGGTTT _A_Concatenation CGATGGCGGAAT _B 17032.98407 -1.27848825 0.14830826 -8.620479068 6.67E-18 4.08E-17
ACTTGATGGTTT _A_Concatenation CGCCGGCATACA _B 12007.4946 2.76621001 0.321540148 8.603000351 7.77E-18 4.73E-17
ACTTGATGGTTT _A_Concatenation CGGCAATTGCTT _B 5866.518966 0.581497187 0.398491236 1.459247118 0.144497105 0.181047283
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B 0.281036486 0.726896877 1.850372904 0.392838047 0.694439102 0.732670857
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B 2894.765866 2.169723115 0.409504488 5.298411078 1.17E-07 3.05E-07
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B 0.281036486 0.726896877 1.850372904 0.392838047 0.694439102 0.732670857
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B 2894.765866 2.169723115 0.409504488 5.298411078 1.17E-07 3.05E-07
ACTTGATGGTTT _A_Concatenation CTCACGACAAGA _B 20348.57356 3.500106697 0.385612236 9.076752151 1.12E-19 7.74E-19
ACTTGATGGTTT _A_Concatenation CTCAGCGATATA _B 7364.727988 1.330774984 0.375434154 3.544629521 0.000393165 0.000712574
ACTTGATGGTTT _A_Concatenation CTCCATTAATGA _B 26787.33915 -1.241430066 0.164883427 -7.529137961 5.11E-14 2.26E-13
ACTTGATGGTTT _A_Concatenation CTCGCACCGAGG _B 27922.58349 -1.377034604 0.156877437 -8.77777349 1.67E-18 1.06E-17
ACTTGATGGTTT _A_Concatenation CTCGTTATGGCA _B 44007.63448 0.253869975 0.147561949 1.720429803 0.085354344 0.112363995
ACTTGATGGTTT _A_Concatenation CTGCCGGTTGCG _B 3169.376672 -0.307354238 0.207547301 -1.480887666 0.13863651 0.174640173
ACTTGATGGTTT _A_Concatenation CTGGTAGTTGTT _B 34890.36969 -1.771205156 0.107528352 -16.47198265 5.83E-61 3.25E-59
ACTTGATGGTTT _A_Concatenation EPICXV.47_B 5274.685908 4.07790674 0.412345065 9.889549029 4.62E-23 4.25E-22
ACTTGATGGTTT _A_Concatenation GACTGTGGGCGG _B 4105.529473 -1.332221678 0.332022169 -4.012447974 6.01E-05 0.000117275
ACTTGATGGTTT _A_Concatenation GAGACAGCTCTC _B 6959.073104 -1.499620211 0.299145729 -5.013008932 5.36E-07 1.30E-06
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 12355.41606 2.224680319 0.244339573 9.104871116 8.64E-20 6.05E-19
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 3364.574402 1.762887396 0.445733222 3.955028055 7.65E-05 0.000147136
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 12355.41606 2.224680319 0.244339573 9.104871116 8.64E-20 6.05E-19
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B 3364.574402 1.762887396 0.445733222 3.955028055 7.65E-05 0.000147136
ACTTGATGGTTT _A_Concatenation GAGTGACCTATT _B 60120.39869 -1.304353047 0.183748766 -7.098567648 1.26E-12 5.01E-12
ACTTGATGGTTT _A_Concatenation GATCGCATGTTC _B 24104.4819 -1.431446944 0.094146421 -15.20447536 3.30E-52 1.54E-50
ACTTGATGGTTT _A_Concatenation GATGGAGCTACA _B 12438.49421 3.589581259 0.675613269 5.313070986 1.08E-07 2.83E-07
ACTTGATGGTTT _A_Concatenation GCAACGAGGGTC _B 20877.35209 1.262246261 0.298683226 4.226036652 2.38E-05 4.78E-05
ACTTGATGGTTT _A_Concatenation GCAGATTCGAAT _B 16318.83158 -1.763688062 0.202064934 -8.728323235 2.58E-18 1.62E-17
ACTTGATGGTTT _A_Concatenation GCGAGGCAGACC _B 13351.03194 -1.219309005 0.267564765 -4.557061182 5.19E-06 1.15E-05
ACTTGATGGTTT _A_Concatenation GGAGGCACATCG _B 7052.202204 1.444556032 0.251446442 5.744984978 9.19E-09 2.63E-08
ACTTGATGGTTT _A_Concatenation GGCACAGCTCCA _B 37300.51109 -1.402077424 0.059796379 -23.44753044 1.40E-121 4.09E-119
ACTTGATGGTTT _A_Concatenation GGCACTGTCGAG _B 15054.5403 -1.255639499 0.230112926 -5.456623067 4.85E-08 1.33E-07
ACTTGATGGTTT _A_Concatenation GGCCTTTCGGGC _B 0
ACTTGATGGTTT _A_Concatenation GGGAGTATGTCG _B 17730.77851 -1.550267935 0.153286346 -10.11354225 4.81E-24 4.77E-23
ACTTGATGGTTT _A_Concatenation GGTGTCTCGATT _B 6428.058698 2.748999755 0.333980808 8.23101115 1.86E-16 9.95E-16
ACTTGATGGTTT _A_Concatenation GTACATACCGAA _B 10899.68223 -0.659691584 0.20505116 -3.217204843 0.001294461 0.002186742
ACTTGATGGTTT _A_Concatenation GTCACCCTCCTT _B 0
ACTTGATGGTTT _A_Concatenation GTTCCACGCCTG _B 21325.07703 0.450811167 0.10111479 4.458409759 8.26E-06 1.77E-05
ACTTGATGGTTT _A_Concatenation Stop_1_B 13804.32691 1.724857541 0.216278965 7.975151616 1.52E-15 7.48E-15
ACTTGATGGTTT _A_Concatenation Stop_2_B 3287.77979 1.135060665 0.635855491 1.785092181 0.074246366 0.098629548
ACTTGATGGTTT _A_Concatenation TAAAATTTATCA _B 26571.97322 2.383773214 0.414646225 5.748932625 8.98E-09 2.57E-08
ACTTGATGGTTT _A_Concatenation TAAGATTCAACG _B 4405.881968 1.477587592 0.290439702 5.087416015 3.63E-07 8.99E-07
ACTTGATGGTTT _A_Concatenation TACATGAGGCAC _B 13869.68321 -1.228355939 0.376449029 -3.263007327 0.001102367 0.001895098
ACTTGATGGTTT _A_Concatenation TACTTCTAGCGG _B 8109.089692 -0.645954652 0.292350479 -2.209521441 0.027138393 0.038407725
ACTTGATGGTTT _A_Concatenation TATCGCAAGAAC _B 13953.36033 -1.506506437 0.130941905 -11.50515138 1.24E-30 1.99E-29
ACTTGATGGTTT _A_Concatenation TATCTATCGTGT _B 14591.40113 0.973702501 0.20093994 4.845738976 1.26E-06 2.94E-06
ACTTGATGGTTT _A_Concatenation TATTAAAAGTGC _B 7367.937758 -0.460206712 0.284750329 -1.616176225 0.106056217 0.13751427
ACTTGATGGTTT _A_Concatenation TCAGCCGTAGGC _B 26876.3203 -1.527159603 0.130127275 -11.73589171 8.34E-32 1.39E-30
ACTTGATGGTTT _A_Concatenation TCTATCACAACT _B 2121.104892 2.1500662 0.603042776 3.565362669 0.000363353 0.00066162
ACTTGATGGTTT _A_Concatenation TCTGTACCAACG _B 36931.25192 3.147819184 0.37393897 8.418002491 3.83E-17 2.19E-16
ACTTGATGGTTT _A_Concatenation TCTTAGAGGGTG _B 23558.69828 1.330773448 0.227088731 5.860147449 4.62E-09 1.37E-08
ACTTGATGGTTT _A_Concatenation TCTTGGTTTTGA _B 50323.07205 0.186804223 0.10245253 1.82332465 0.068254235 0.091396564
ACTTGATGGTTT _A_Concatenation TGATGAGCGCCC _B 70545.70218 0.021496585 0.12639664 0.170072441 0.864953166 0.88470081
ACTTGATGGTTT _A_Concatenation TGCCAGACGGTC _B 12557.40674 1.401603217 0.256154991 5.471699822 4.46E-08 1.22E-07
ACTTGATGGTTT _A_Concatenation TGGCCCCCAGTT _B 6539.248006 -0.960982659 0.224072914 -4.288705141 1.80E-05 3.67E-05
ACTTGATGGTTT _A_Concatenation TTAAATGAGGGC _B 5347.465633 -0.512750316 0.220548595 -2.324885885 0.020078063 0.029012677
ACTTGATGGTTT _A_Concatenation TTAGTGCCATAC _B 0.183488719 0.044358346 2.991506868 0.014828094 0.988169326 0.98902379
ACTTGATGGTTT _A_Concatenation TTCTAATTTCCT _B 14127.8249 0.179146416 0.171883587 1.042254353 0.29729377 0.349282831
ACTTGATGGTTT _A_Concatenation TTTAACGGATTG _B 44304.43602 -0.213902089 0.109226121 -1.958341899 0.050189907 0.068382286
ACTTGATGGTTT _A_Concatenation Joint_ 1_B 31875.21698 1.800860555 0.084001977 21.43831166 5.87E-102 1.14E-99
ACTTGATGGTTT _A_Concatenation Joint_ 2_B 16691.71292 -0.379452151 0.17815157 -2.129939974 0.033176569 0.046170726
ACTTGATGGTTT _A_Concatenation Joint_ 3_B 32783.85568 -1.13412176 0.101996804 -11.1191892 1.01E-28 1.41E-27
ACTTTAGTAACA _A_Concatenation ACTTGATGGTTT _B 2963.038326 4.261019241 0.876643213 4.860608261 1.17E-06 2.74E-06
ACTTTAGTAACA _A_Concatenation AGAGTGCGCTCT _B 12110.49046 0.973659107 0.162304339 5.998971523 1.99E-09 6.19E-09
ACTTTAGTAACA _A_Concatenation ATAACTCCACGC _B 1287.293528 -0.626331121 0.845109051 -0.741124617 0.458617885 0.511081323
ACTTTAGTAACA _A_Concatenation ATGACCCCTTGT _B 849.0578421 2.664075567 1.397823684 1.905873822 0.056666578 0.076759247
ACTTTAGTAACA _A_Concatenation CTACGTGGCCCC _B 909.7153413 1.41470307 0.879518788 1.608496701 0.107726438 0.139459807
ACTTTAGTAACA _A_Concatenation GAGATTGTGTCC _B 735.9230973 -0.056867808 0.4665382 -0.121893145 0.902983645 0.916308924
AGAGTGCGCTCT _A_Concatenation AAAACAAGCATT _B 6033.159787 -0.155746341 0.277569475 -0.561107597 0.574724189 0.619218965
AGAGTGCGCTCT _A_Concatenation AAACTTTGAGCG _B 10228.12785 -0.932186953 0.207926676 -4.483248479 7.35E-06 1.58E-05
AGAGTGCGCTCT _A_Concatenation AAGACCGGTGCC _B 41191.54044 -0.737585113 0.163474443 -4.511929188 6.42E-06 1.39E-05
AGAGTGCGCTCT _A_Concatenation AAGCAACGATAT _B 5633.946736 -0.611670302 0.235169673 -2.600974408 0.009295939 0.013967805
AGAGTGCGCTCT _A_Concatenation AAGGGTAAATTG _B 10670.10963 -0.582342717 0.352807311 -1.650597077 0.098820876 0.129074418
AGAGTGCGCTCT _A_Concatenation AATCAACGAGCA _B 11504.19122 -1.770458353 0.254948573 -6.944374427 3.80E-12 1.46E-11
AGAGTGCGCTCT _A_Concatenation AATCGAGACCAG _B 20018.94385 -1.472047551 0.175603745 -8.382779948 5.17E-17 2.92E-16
AGAGTGCGCTCT _A_Concatenation AATCGATCCTAC _B 32594.28193 -1.592096108 0.126982141 -12.53795293 4.63E-36 9.33E-35
AGAGTGCGCTCT _A_Concatenation AATGGCACTAGC _B 17155.81157 -1.68078172 0.178021046 -9.441477584 3.68E-21 2.92E-20
AGAGTGCGCTCT _A_Concatenation ACAACGTGCATA _B 18144.4302 2.285378548 0.196534373 11.62839108 2.96E-31 4.80E-30
AGAGTGCGCTCT _A_Concatenation ACCAGTATTTAA _B 14196.27815 -1.225276331 0.208773688 -5.868921247 4.39E-09 1.30E-08
AGAGTGCGCTCT _A_Concatenation ACGACTCACCGC _B 5267.181227 -1.479379014 0.260925115 -5.669745556 1.43E-08 4.06E-08
AGAGTGCGCTCT _A_Concatenation ACGGCGTAAGTA _B 4592.246376 -1.915364087 0.432504585 -4.428540538 9.49E-06 2.02E-05
AGAGTGCGCTCT _A_Concatenation ACTCGTGTTGTC _B 8942.309464 -0.537136704 0.284731219 -1.886469306 0.059231741 0.079863789
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B 34027.9292 -0.095517739 0.12051819 -0.792558694 0.428034983 0.480204314
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B 21799.75304 -0.423040087 0.175371358 -2.412253022 0.015854275 0.023225122
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B 34027.9292 -0.095517739 0.12051819 -0.792558694 0.428034983 0.480204314
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B 21799.75304 -0.423040087 0.175371358 -2.412253022 0.015854275 0.023225122
AGAGTGCGCTCT _A_Concatenation ACTTTAGTAACA _B 22527.40492 2.782076534 0.312839739 8.892976786 5.95E-19 3.93E-18
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B 19878.82633 -0.90349002 0.165674946 -5.453389552 4.94E-08 1.34E-07
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B 27985.82161 -1.964934112 0.191794382 -10.2450035 1.25E-24 1.28E-23
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B 19878.82633 -0.90349002 0.165674946 -5.453389552 4.94E-08 1.34E-07
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B 27985.82161 -1.964934112 0.191794382 -10.2450035 1.25E-24 1.28E-23
AGAGTGCGCTCT _A_Concatenation AGCCAGGTCACC _B 11269.61954 -1.221124506 0.160186687 -7.623133532 2.48E-14 1.14E-13
AGAGTGCGCTCT _A_Concatenation AGCCATAAACCT _B 35439.58083 -1.332604906 0.128750667 -10.35027572 4.17E-25 4.56E-24
AGAGTGCGCTCT _A_Concatenation AGCCGTGCTTGA _B 15007.91566 -1.140028433 0.158417443 -7.196356744 6.18E-13 2.53E-12
AGAGTGCGCTCT _A_Concatenation AGGCTTCCCGCT _B 7270.753356 -1.198870815 0.107498983 -11.15239218 6.97E-29 9.94E-28
AGAGTGCGCTCT _A_Concatenation AGTTAGGCTCTT _B 35993.42697 -1.022946924 0.086948761 -11.76493958 5.92E-32 1.00E-30
AGAGTGCGCTCT _A_Concatenation AGTTGCTTTGCT _B 9600.80293 -2.095243758 0.491709745 -4.261139383 2.03E-05 4.12E-05
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B 4368.413323 2.600302768 0.351804199 7.391335216 1.45E-13 6.29E-13
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B 7292.580394 3.175435695 0.447265037 7.09967342 1.25E-12 4.99E-12
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B 4368.413323 2.600302768 0.351804199 7.391335216 1.45E-13 6.29E-13
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B 7292.580394 3.175435695 0.447265037 7.09967342 1.25E-12 4.99E-12
AGAGTGCGCTCT _A_Concatenation ATCCAATCTGTG _B 5674.111032 -2.115914765 0.325202807 -6.506446813 7.69E-11 2.72E-10
AGAGTGCGCTCT _A_Concatenation ATCTCTCAGTTC _B 23264.18324 -0.955893264 0.098245278 -9.729661142 2.25E-22 2.01E-21
AGAGTGCGCTCT _A_Concatenation ATGACAAAACGA _B 43856.32043 -1.063368337 0.103226538 -10.30130775 6.95E-25 7.39E-24
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B 3527.175329 1.106080995 0.269686899 4.101352345 4.11E-05 8.12E-05
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B 2580.14892 0.511089949 0.337069399 1.516275135 0.129449796 0.16430707
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B 3527.175329 1.106080995 0.269686899 4.101352345 4.11E-05 8.12E-05
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B 2580.14892 0.511089949 0.337069399 1.516275135 0.129449796 0.16430707
AGAGTGCGCTCT _A_Concatenation ATGTGGCACGAC _B 53298.10969 -1.432897875 0.12548039 -11.41929731 3.35E-30 5.15E-29
AGAGTGCGCTCT _A_Concatenation ATTGTATCTAAC _B 60235.27237 -2.070505821 0.139824513 -14.80788863 1.30E-49 5.64E-48
AGAGTGCGCTCT _A_Concatenation ATTTGTAGACCG _B 51736.93229 -1.719692502 0.123974788 -13.87130832 9.45E-44 2.91E-42
AGAGTGCGCTCT _A_Concatenation CAACGCATGCCT _B 32541.69383 -1.480344288 0.101276066 -14.61692136 2.19E-48 8.54E-47
AGAGTGCGCTCT _A_Concatenation CACACCCGGCAG _B 47072.06583 -0.031440973 0.133379504 -0.235725667 0.813645551 0.841083368
AGAGTGCGCTCT _A_Concatenation CAGTGACATAGT _B 64995.67641 -1.539689674 0.107804766 -14.28220409 2.83E-46 9.17E-45
AGAGTGCGCTCT _A_Concatenation CATCTCACCTAG _B 42083.29921 -1.43140827 0.169507167 -8.444529505 3.05E-17 1.77E-16
AGAGTGCGCTCT _A_Concatenation CATCTGAGTCGT _B 27.49821134 -0.109897937 0.376733709 -0.291712514 0.77050644 0.800138309
AGAGTGCGCTCT _A_Concatenation CATGCTAACACC _B 13533.50281 0.198004179 0.222031555 0.891783958 0.372508742 0.426506091
AGAGTGCGCTCT _A_Concatenation CATTAGTCCCGC _B 8939.484011 1.553096061 0.309808594 5.01308256 5.36E-07 1.30E-06
AGAGTGCGCTCT _A_Concatenation CATTGTTTAAAT _B 39141.013 -1.665536019 0.136497895 -12.20191723 3.04E-34 5.55E-33
AGAGTGCGCTCT _A_Concatenation CCAGACGGTCTG _B 11633.30779 -1.534820737 0.215847191 -7.110682007 1.15E-12 4.62E-12
AGAGTGCGCTCT _A_Concatenation CCCCTTAGGAAT _B 8786.667246 -1.559093298 0.210466493 -7.40779815 1.28E-13 5.60E-13
AGAGTGCGCTCT _A_Concatenation CCCGTAGGGGCT _B 902.3322126 -0.195429481 0.621573777 -0.314410756 0.753209088 0.786161986
AGAGTGCGCTCT _A_Concatenation CCCTATGTTCTA _B 18634.33211 2.191312285 0.19235793 11.39184793 4.59E-30 6.88E-29
AGAGTGCGCTCT _A_Concatenation CCCTGCCTCGGG _B 19758.83273 -1.086481182 0.240077496 -4.525543634 6.02E-06 1.32E-05
AGAGTGCGCTCT _A_Concatenation CCGCCCTTATGT _B 4752.680947 -1.009610709 0.284438949 -3.549481227 0.000385991 0.000700658
AGAGTGCGCTCT _A_Concatenation CCTAAATGCAAG _B 6912.916955 -0.399968294 0.277574525 -1.440940209 0.149601575 0.186842138
AGAGTGCGCTCT _A_Concatenation CGATGGCGGAAT _B 14320.7978 -0.902087734 0.109493871 -8.238705313 1.74E-16 9.38E-16
AGAGTGCGCTCT _A_Concatenation CGCCGGCATACA _B 15401.13123 -0.311855318 0.191105759 -1.631846778 0.102711768 0.133708304
AGAGTGCGCTCT _A_Concatenation CGGCAATTGCTT _B 4334.479328 0.009282783 0.30433099 0.03050226 0.975666491 0.980700024
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B 0.392086388 1.202792688 1.824531867 0.659233587 0.509745782 0.558474995
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B 9094.364505 2.228847579 0.128365729 17.36326039 1.57E-67 1.22E-65
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B 0.392086388 1.202792688 1.824531867 0.659233587 0.509745782 0.558474995
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B 9094.364505 2.228847579 0.128365729 17.36326039 1.57E-67 1.22E-65
AGAGTGCGCTCT _A_Concatenation CTCACGACAAGA _B 39072.8783 -1.173480079 0.155589187 -7.542169885 4.62E-14 2.05E-13
AGAGTGCGCTCT _A_Concatenation CTCAGCGATATA _B 14655.2068 1.04341514 0.368768678 2.82945706 0.004662706 0.007267604
AGAGTGCGCTCT _A_Concatenation CTCCATTAATGA _B 26399.92626 -1.617829749 0.213552312 -7.575800664 3.57E-14 1.62E-13
AGAGTGCGCTCT _A_Concatenation CTCGCACCGAGG _B 20772.61775 -0.961777507 0.128465103 -7.48668305 7.06E-14 3.12E-13
AGAGTGCGCTCT _A_Concatenation CTCGTTATGGCA _B 61065.0375 -1.533630176 0.09053593 -16.93946455 2.30E-64 1.58E-62
AGAGTGCGCTCT _A_Concatenation CTGCCGGTTGCG _B 3812.240519 -0.433579644 0.326956078 -1.326109753 0.184803346 0.228114312
AGAGTGCGCTCT _A_Concatenation CTGGTAGTTGTT _B 22305.38956 -0.973623661 0.155752498 -6.251095001 4.08E-10 1.35E-09
AGAGTGCGCTCT _A_Concatenation EPICXV.47_B 18187.41142 -0.365000024 0.155418429 -2.348498992 0.01884925 0.027304552
AGAGTGCGCTCT _A_Concatenation GACTGTGGGCGG _B 3815.059768 -2.260367427 0.267706514 -8.443453218 3.08E-17 1.77E-16
AGAGTGCGCTCT _A_Concatenation GAGACAGCTCTC _B 5994.40584 -1.698361982 0.203401247 -8.349811068 6.84E-17 3.79E-16
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B 24623.44113 1.928345443 0.218155072 8.839333528 9.63E-19 6.29E-18
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B 5810.881075 1.888395081 0.438249673 4.308948065 1.64E-05 3.36E-05
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B 24623.44113 1.928345443 0.218155072 8.839333528 9.63E-19 6.29E-18
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B 5810.881075 1.888395081 0.438249673 4.308948065 1.64E-05 3.36E-05
AGAGTGCGCTCT _A_Concatenation GAGTGACCTATT _B 33464.45212 -1.09992603 0.145300196 -7.570024393 3.73E-14 1.68E-13
AGAGTGCGCTCT _A_Concatenation GATCGCATGTTC _B 12611.75381 -1.446882813 0.203322207 -7.116206527 1.11E-12 4.46E-12
AGAGTGCGCTCT _A_Concatenation GATGGAGCTACA _B 14825.72328 0.179553735 0.177363072 1.01235129 0.31137012 0.364356026
AGAGTGCGCTCT _A_Concatenation GCAACGAGGGTC _B 19689.44194 -1.210544129 0.168055978 -7.203219675 5.88E-13 2.41E-12
AGAGTGCGCTCT _A_Concatenation GCAGATTCGAAT _B 11224.51351 -0.99079831 0.123720835 -8.008338371 1.16E-15 5.81E-15
AGAGTGCGCTCT _A_Concatenation GCGAGGCAGACC _B 12307.23558 -2.037732434 0.156364154 -13.0319666 8.05E-39 1.85E-37
AGAGTGCGCTCT _A_Concatenation GGAGGCACATCG _B 17523.94388 -1.352292005 0.159028454 -8.5034594 1.84E-17 1.08E-16
AGAGTGCGCTCT _A_Concatenation GGCACAGCTCCA _B 38114.95954 -1.424605032 0.114588467 -12.43235963 1.74E-35 3.46E-34
AGAGTGCGCTCT _A_Concatenation GGCACTGTCGAG _B 15478.49194 -1.374277972 0.161915262 -8.487637012 2.11E-17 1.23E-16
AGAGTGCGCTCT _A_Concatenation GGCCTTTCGGGC _B 0
AGAGTGCGCTCT _A_Concatenation GGGAGTATGTCG _B 13402.26725 -0.448251797 0.173540536 -2.582980361 0.00979509 0.014698922
AGAGTGCGCTCT _A_Concatenation GGTGTCTCGATT _B 9906.050271 -1.375539887 0.287930835 -4.777327472 1.78E-06 4.09E-06
AGAGTGCGCTCT _A_Concatenation GTACATACCGAA _B 9942.343107 -1.567353712 0.323058711 -4.8516064 1.22E-06 2.86E-06
AGAGTGCGCTCT _A_Concatenation GTCACCCTCCTT _B 0
AGAGTGCGCTCT _A_Concatenation GTTCCACGCCTG _B 19291.02111 -1.027585704 0.092184817 -11.14701684 7.40E-29 1.04E-27
AGAGTGCGCTCT _A_Concatenation Stop_1_B 11562.22341 -0.888052625 0.242613979 -3.660352249 0.000251869 0.000465907
AGAGTGCGCTCT _A_Concatenation Stop_2_B 3594.014932 -0.641940522 0.245137399 -2.618696802 0.008826636 0.013348432
AGAGTGCGCTCT _A_Concatenation TAAAATTTATCA _B 44324.43997 -1.330617426 0.16219277 -8.203925644 2.33E-16 1.24E-15
AGAGTGCGCTCT _A_Concatenation TAAGATTCAACG _B 6447.697441 -0.79471096 0.448399164 -1.772329263 0.076339916 0.101066095
AGAGTGCGCTCT _A_Concatenation TACATGAGGCAC _B 12786.01925 -0.12579104 0.27858184 -0.451540706 0.651599894 0.690589552
AGAGTGCGCTCT _A_Concatenation TACTTCTAGCGG _B 14283.38156 -1.893816271 0.208480499 -9.083901262 1.05E-19 7.29E-19
AGAGTGCGCTCT _A_Concatenation TATCGCAAGAAC _B 12734.728 -0.959094375 0.153574772 -6.245129745 4.23E-10 1.40E-09
AGAGTGCGCTCT _A_Concatenation TATCTATCGTGT _B 27074.27709 -1.649612488 0.202168425 -8.159595087 3.36E-16 1.76E-15
AGAGTGCGCTCT _A_Concatenation TATTAAAAGTGC _B 7560.08828 -0.961426089 0.274441765 -3.503206191 0.000459693 0.000826741
AGAGTGCGCTCT _A_Concatenation TCAGCCGTAGGC _B 24177.75019 -1.43246823 0.105753123 -13.54539878 8.44E-42 2.35E-40
AGAGTGCGCTCT _A_Concatenation TCTATCACAACT _B 1138.082118 0.109334762 0.730600143 0.149650617 0.881040272 0.898860105
AGAGTGCGCTCT _A_Concatenation TCTGTACCAACG _B 72199.37134 -0.569425391 0.119691147 -4.757456228 1.96E-06 4.48E-06
AGAGTGCGCTCT _A_Concatenation TCTTAGAGGGTG _B 39651.78642 -1.493538372 0.093683724 -15.94234629 3.22E-57 1.57E-55
AGAGTGCGCTCT _A_Concatenation TCTTGGTTTTGA _B 54432.41593 -0.789839699 0.109269794 -7.228344373 4.89E-13 2.01E-12
AGAGTGCGCTCT _A_Concatenation TGATGAGCGCCC _B 84684.34913 -1.21120068 0.132071122 -9.170821491 4.69E-20 3.37E-19
AGAGTGCGCTCT _A_Concatenation TGCCAGACGGTC _B 20810.05868 -0.266957474 0.13474745 -1.98116903 0.047572321 0.065119489
AGAGTGCGCTCT _A_Concatenation TGGCCCCCAGTT _B 7644.614745 -0.988209594 0.411444442 -2.401805669 0.016314373 0.023809615
AGAGTGCGCTCT _A_Concatenation TTAAATGAGGGC _B 6669.436554 -1.621601725 0.313105775 -5.179085959 2.23E-07 5.67E-07
AGAGTGCGCTCT _A_Concatenation TTAGTGCCATAC _B 0
AGAGTGCGCTCT _A_Concatenation TTCTAATTTCCT _B 14636.68577 -1.379627354 0.252187922 -5.470632151 4.48E-08 1.23E-07
AGAGTGCGCTCT _A_Concatenation TTTAACGGATTG _B 40738.6395 -1.640455413 0.163094087 -10.05833776 8.44E-24 8.26E-23
AGAGTGCGCTCT _A_Concatenation Joint_ 1_B 51591.96883 -1.588659298 0.152424038 -10.42262968 1.95E-25 2.20E-24
AGAGTGCGCTCT _A_Concatenation Joint_ 2_B 15751.06634 -1.240989184 0.247024803 -5.023743236 5.07E-07 1.23E-06
AGAGTGCGCTCT _A_Concatenation Joint_ 3_B 32307.69581 -1.361148548 0.190588443 -7.141821001 9.21E-13 3.75E-12
AGCCAGGTCACC _A_Concatenation ACTTGATGGTTT _B 3876.885542 2.933466412 0.648056021 4.526563007 6.00E-06 1.31E-05
AGCCAGGTCACC _A_Concatenation AGAGTGCGCTCT _B 4703.493753 -2.22371757 0.208892734 -10.64526048 1.83E-26 2.23E-25
AGCCAGGTCACC _A_Concatenation ATAACTCCACGC _B 2304.211004 -0.356341457 0.145827982 -2.443573937 0.014542589 0.021437941
AGCCAGGTCACC _A_Concatenation ATGACCCCTTGT _B 375.4169943 0.278787458 1.131464072 0.246395325 0.805376225 0.834649651
AGCCAGGTCACC _A_Concatenation CTACGTGGCCCC _B 1184.16876 -1.776778994 0.443670782 -4.004723919 6.21E-05 0.000120972
AGCCAGGTCACC _A_Concatenation GAGATTGTGTCC _B 217.461885 6.607094185 0.600837958 10.99646602 3.97E-28 5.34E-27
AGCCATAAACCT _A_Concatenation ACTTGATGGTTT _B 13375.49336 3.013120065 0.277859724 10.84403319 2.13E-27 2.80E-26
AGCCATAAACCT _A_Concatenation AGAGTGCGCTCT _B 10426.42815 -1.26195748 0.204589127 -6.168252926 6.90E-10 2.25E-09
AGCCATAAACCT _A_Concatenation ATAACTCCACGC _B 850.8256228 0.241845958 0.426090493 0.567592945 0.570311399 0.615639761
AGCCATAAACCT _A_Concatenation ATGACCCCTTGT _B 2429.247465 -1.159170523 0.29238966 -3.964471671 7.36E-05 0.000142132
AGCCATAAACCT _A_Concatenation CTACGTGGCCCC _B 5148.430924 2.092478249 0.324585885 6.446608881 1.14E-10 3.96E-10
AGCCATAAACCT _A_Concatenation GAGATTGTGTCC _B 6456.778216 2.340856302 0.354048729 6.611678313 3.80E-11 1.37E-10
AGCCGTGCTTGA _A_Concatenation ACTTGATGGTTT _B 5785.441922 1.35549112 0.374919281 3.615421211 0.00029986 0.000550292
AGCCGTGCTTGA _A_Concatenation AGAGTGCGCTCT _B 12690.92634 -1.607501816 0.161011377 -9.983777824 1.79E-23 1.69E-22
AGCCGTGCTTGA _A_Concatenation ATAACTCCACGC _B 2979.148998 3.683221787 0.721788828 5.102907724 3.34E-07 8.37E-07
AGCCGTGCTTGA _A_Concatenation ATGACCCCTTGT _B 2380.993825 1.572750431 0.40847404 3.850306939 0.00011797 0.000223875
AGCCGTGCTTGA _A_Concatenation CTACGTGGCCCC _B 7907.712105 1.48870649 0.270220152 5.509235627 3.60E-08 9.94E-08
AGCCGTGCTTGA _A_Concatenation GAGATTGTGTCC _B 8157.516526 1.363374052 0.315947971 4.315185337 1.59E-05 3.30E-05
AGGCTTCCCGCT _A_Concatenation ACTTGATGGTTT _B 632.1713156 -3.070103012 0.808141688 -3.798966267 0.000145301 0.000273082
AGGCTTCCCGCT _A_Concatenation AGAGTGCGCTCT _B 292.3239341 1.83039769 1.007941257 1.815976553 0.069373969 0.092366936
AGGCTTCCCGCT _A_Concatenation ATAACTCCACGC _B 901.050346 1.501746738 0.342917638 4.37932194 1.19E-05 2.49E-05
AGGCTTCCCGCT _A_Concatenation ATGACCCCTTGT _B 711.0428879 -0.780548841 0.770628094 -1.012873586 0.311120546 0.364356026
AGGCTTCCCGCT _A_Concatenation CTACGTGGCCCC _B 1176.063311 -2.150193866 0.561257627 -3.831028327 0.000127609 0.000241383
AGGCTTCCCGCT _A_Concatenation GAGATTGTGTCC _B 548.5680099 -3.156586506 0.848021575 -3.722295044 0.00019742 0.000367491
AGTTAGGCTCTT _A_Concatenation ACTTGATGGTTT _B 5605.730546 3.744424746 0.372288115 10.05786807 8.48E-24 8.26E-23
AGTTAGGCTCTT _A_Concatenation AGAGTGCGCTCT _B 10599.81573 -1.231003918 0.152568423 -8.068536685 7.11E-16 3.60E-15
AGTTAGGCTCTT _A_Concatenation ATAACTCCACGC _B 468.3853795 -0.374702354 0.771588973 -0.485624299 0.627233565 0.667184747
AGTTAGGCTCTT _A_Concatenation ATGACCCCTTGT _B 538.0462266 0.703649006 0.885463844 0.794667124 0.426807161 0.479286812
AGTTAGGCTCTT _A_Concatenation CTACGTGGCCCC _B 3014.381588 0.707144378 0.227689923 3.105734184 0.001898073 0.003094626
AGTTAGGCTCTT _A_Concatenation GAGATTGTGTCC _B 1547.517664 2.37279869 0.748634071 3.16950401 0.001526994 0.002539197
AGTTGCTTTGCT _A_Concatenation ACTTGATGGTTT _B 27530.83117 -1.300110378 0.213489037 -6.089822676 1.13E-09 3.58E-09
AGTTGCTTTGCT _A_Concatenation AGAGTGCGCTCT _B 24578.32868 -1.892474313 0.19285591 -9.812892514 9.91E-23 9.05E-22
AGTTGCTTTGCT _A_Concatenation ATAACTCCACGC _B 36025.27381 -1.011930499 0.13416273 -7.542560429 4.61E-14 2.05E-13
AGTTGCTTTGCT _A_Concatenation ATGACCCCTTGT _B 10147.04038 -0.811060388 0.261981445 -3.095869586 0.001962366 0.00318612
AGTTGCTTTGCT _A_Concatenation CTACGTGGCCCC _B 35065.60496 -1.827254606 0.126790701 -14.41158216 4.38E-47 1.50E-45
AGTTGCTTTGCT _A_Concatenation GAGATTGTGTCC _B 38128.24829 -1.272134012 0.237403833 -5.358523483 8.39E-08 2.24E-07
ATAACTCCACGC _A_Concatenation AAAACAAGCATT _B 5781.56218 -0.281977919 0.343625126 -0.8205975 0.411875572 0.465650429
ATAACTCCACGC _A_Concatenation AAACTTTGAGCG _B 721.8533858 9.562160094 0.849940824 11.25038334 2.31E-29 3.37E-28
ATAACTCCACGC _A_Concatenation AAGACCGGTGCC _B 497.2474192 0.853015176 0.999817168 0.853171163 0.393564392 0.447110568
ATAACTCCACGC _A_Concatenation AAGCAACGATAT _B 8353.976519 -0.632157946 0.223887962 -2.823545938 0.004749562 0.007393126
ATAACTCCACGC _A_Concatenation AAGGGTAAATTG _B 7013.123501 1.90571259 0.293380387 6.49570548 8.26E-11 2.91E-10
ATAACTCCACGC _A_Concatenation AATCAACGAGCA _B 2455.688828 3.851581517 1.220691517 3.155245584 0.001603631 0.002655303
ATAACTCCACGC _A_Concatenation AATCGAGACCAG _B 1570.999852 -0.376649214 0.64457388 -0.584338314 0.55899272 0.607872083
ATAACTCCACGC _A_Concatenation AATCGATCCTAC _B 4668.859977 2.182183543 0.295826024 7.376577331 1.62E-13 6.98E-13
ATAACTCCACGC _A_Concatenation AATGGCACTAGC _B 11791.06528 2.110045297 0.173591782 12.15521423 5.38E-34 9.68E-33
ATAACTCCACGC _A_Concatenation ACAACGTGCATA _B 1296.972968 1.102299843 0.734703847 1.500332205 0.133528371 0.168933621
ATAACTCCACGC _A_Concatenation ACCAGTATTTAA _B 835.1597014 0.035564837 0.552449978 0.064376574 0.948670382 0.957681932
ATAACTCCACGC _A_Concatenation ACGACTCACCGC _B 2291.736171 1.872071589 0.580311374 3.225977767 0.001255431 0.002130042
ATAACTCCACGC _A_Concatenation ACGGCGTAAGTA _B 2977.097332 -1.439258583 0.306993319 -4.688240735 2.76E-06 6.27E-06
ATAACTCCACGC _A_Concatenation ACTCGTGTTGTC _B 10399.6518 -0.701415956 0.16107289 -4.354649355 1.33E-05 2.78E-05
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B 6841.681085 2.247520544 0.494571734 4.544377267 5.51E-06 1.22E-05
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B 4455.372675 2.755782805 0.568485182 4.847589507 1.25E-06 2.92E-06
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B 6841.681085 2.247520544 0.494571734 4.544377267 5.51E-06 1.22E-05
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B 4455.372675 2.755782805 0.568485182 4.847589507 1.25E-06 2.92E-06
ATAACTCCACGC _A_Concatenation ACTTTAGTAACA _B 1756.52705 0.82079204 0.578444952 1.418963097 0.155909775 0.194098538
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B 6993.8095 2.911441993 0.469851865 6.196510455 5.77E-10 1.91E-09
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B 9247.926929 2.887955818 0.487359009 5.925725733 3.11E-09 9.37E-09
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B 6993.8095 2.911441993 0.469851865 6.196510455 5.77E-10 1.91E-09
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B 9247.926929 2.887955818 0.487359009 5.925725733 3.11E-09 9.37E-09
ATAACTCCACGC _A_Concatenation AGCCAGGTCACC _B 1809.954433 2.50899942 1.228434121 2.042437098 0.041108193 0.056736101
ATAACTCCACGC _A_Concatenation AGCCATAAACCT _B 1162.529938 0.668422986 0.563393149 1.186423703 0.235455016 0.284934693
ATAACTCCACGC _A_Concatenation AGCCGTGCTTGA _B 5702.968247 4.674611784 0.89918899 5.198697756 2.01E-07 5.12E-07
ATAACTCCACGC _A_Concatenation AGGCTTCCCGCT _B 13970.13395 -0.643955277 0.101978809 -6.314598927 2.71E-10 9.10E-10
ATAACTCCACGC _A_Concatenation AGTTAGGCTCTT _B 1041.716203 0.842129891 0.837817947 1.005146636 0.314826247 0.367664218
ATAACTCCACGC _A_Concatenation AGTTGCTTTGCT _B 10571.52727 -0.944544869 0.208420795 -4.531912808 5.85E-06 1.28E-05
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B 1807.962777 -1.477328676 0.456848505 -3.23373867 0.001221812 0.002085107
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B 1945.736553 -0.230079805 0.530743396 -0.43350479 0.664648087 0.702507788
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B 1807.962777 -1.477328676 0.456848505 -3.23373867 0.001221812 0.002085107
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B 1945.736553 -0.230079805 0.530743396 -0.43350479 0.664648087 0.702507788
ATAACTCCACGC _A_Concatenation ATCCAATCTGTG _B 670.4549452 3.617897238 1.834944381 1.971665886 0.04864776 0.066513722
ATAACTCCACGC _A_Concatenation ATCTCTCAGTTC _B 5115.335477 2.976746528 0.740686647 4.018901301 5.85E-05 0.000114492
ATAACTCCACGC _A_Concatenation ATGACAAAACGA _B 11998.35473 1.816229681 0.171140536 10.61250436 2.61E-26 3.05E-25
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B 493.2908764 1.096392366 1.209023925 0.906840918 0.364490928 0.418555889
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B 2952.299029 -2.700001225 0.369427593 -7.308607354 2.70E-13 1.13E-12
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B 493.2908764 1.096392366 1.209023925 0.906840918 0.364490928 0.418555889
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B 2952.299029 -2.700001225 0.369427593 -7.308607354 2.70E-13 1.13E-12
ATAACTCCACGC _A_Concatenation ATGTGGCACGAC _B 8343.777904 2.192709064 0.197431018 11.10620351 1.17E-28 1.59E-27
ATAACTCCACGC _A_Concatenation ATTGTATCTAAC _B 14073.49262 3.130382563 0.346211959 9.041809447 1.54E-19 1.05E-18
ATAACTCCACGC _A_Concatenation ATTTGTAGACCG _B 5217.056348 2.140235374 0.369918401 5.785695902 7.22E-09 2.09E-08
ATAACTCCACGC _A_Concatenation CAACGCATGCCT _B 39461.75006 -1.742323931 0.061184592 -28.47651452 2.29E-178 2.68E-175
ATAACTCCACGC _A_Concatenation CACACCCGGCAG _B 2512.823989 0.947075381 0.188059149 5.036050563 4.75E-07 1.16E-06
ATAACTCCACGC _A_Concatenation CAGTGACATAGT _B 6765.912554 0.737299013 0.392584173 1.878066064 0.060372128 0.081307624
ATAACTCCACGC _A_Concatenation CATCTCACCTAG _B 7005.809364 2.791173707 0.304577801 9.164074652 5.00E-20 3.56E-19
ATAACTCCACGC _A_Concatenation CATCTGAGTCGT _B 3.381350172 0.171617145 0.844943034 0.203110906 0.839048347 0.864182835
ATAACTCCACGC _A_Concatenation CATGCTAACACC _B 3580.025369 3.154691462 0.369953295 8.527269526 1.50E-17 8.89E-17
ATAACTCCACGC _A_Concatenation CATTAGTCCCGC _B 2333.383375 -1.346253508 0.264142132 -5.096701154 3.46E-07 8.61E-07
ATAACTCCACGC _A_Concatenation CATTGTTTAAAT _B 6192.8208 1.094617129 0.392402463 2.789526652 0.005278515 0.008194667
ATAACTCCACGC _A_Concatenation CCAGACGGTCTG _B 8419.710655 -0.253212435 0.261580139 -0.968010935 0.333038909 0.387000481
ATAACTCCACGC _A_Concatenation CCCCTTAGGAAT _B 2146.458394 3.581362527 0.818314309 4.376512165 1.21E-05 2.52E-05
ATAACTCCACGC _A_Concatenation CCCGTAGGGGCT _B 85.13781885 8.907804866 1.859484601 4.790469821 1.66E-06 3.86E-06
ATAACTCCACGC _A_Concatenation CCCTATGTTCTA _B 2636.302422 0.196741056 0.810125852 0.24285246 0.808119694 0.836751039
ATAACTCCACGC _A_Concatenation CCCTGCCTCGGG _B 26714.29552 -1.990321657 0.10114427 -19.6780466 3.33E-86 4.86E-84
ATAACTCCACGC _A_Concatenation CCGCCCTTATGT _B 702.9559443 4.146369367 0.991222779 4.183085231 2.88E-05 5.75E-05
ATAACTCCACGC _A_Concatenation CCTAAATGCAAG _B 4358.852137 -2.995845265 0.291004002 -10.29485934 7.43E-25 7.83E-24
ATAACTCCACGC _A_Concatenation CGATGGCGGAAT _B 14072.03242 -1.031916853 0.189252557 -5.452591325 4.96E-08 1.35E-07
ATAACTCCACGC _A_Concatenation CGCCGGCATACA _B 5744.325496 2.565332637 0.553808123 4.632168669 3.62E-06 8.13E-06
ATAACTCCACGC _A_Concatenation CGGCAATTGCTT _B 7126.389565 -0.870154534 0.39864285 -2.182792276 0.029051111 0.040965921
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B 0
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B 1817.585322 2.2821909 0.517260899 4.412069238 1.02E-05 2.16E-05
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B 0
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B 1817.585322 2.2821909 0.517260899 4.412069238 1.02E-05 2.16E-05
ATAACTCCACGC _A_Concatenation CTCACGACAAGA _B 3284.680729 1.212962939 0.466326496 2.60110234 0.009292473 0.013967805
ATAACTCCACGC _A_Concatenation CTCAGCGATATA _B 1563.906204 0.779959858 0.462978943 1.684655145 0.092055125 0.120777151
ATAACTCCACGC _A_Concatenation CTCCATTAATGA _B 23703.02195 -1.292269107 0.136860615 -9.442227842 3.65E-21 2.92E-20
ATAACTCCACGC _A_Concatenation CTCGCACCGAGG _B 21139.06244 -1.22153884 0.223940597 -5.454744956 4.90E-08 1.34E-07
ATAACTCCACGC _A_Concatenation CTCGTTATGGCA _B 22341.62302 2.355927816 0.177415423 13.27916018 3.06E-40 7.61E-39
ATAACTCCACGC _A_Concatenation CTGCCGGTTGCG _B 9169.143176 0.726762118 0.33826766 2.148482413 0.031675452 0.044345633
ATAACTCCACGC _A_Concatenation CTGGTAGTTGTT _B 24151.37555 -0.277450365 0.090276496 -3.07333998 0.002116772 0.003417826
ATAACTCCACGC _A_Concatenation EPICXV.47_B 3434.755523 1.095957501 0.408255253 2.684490873 0.007264037 0.011114737
ATAACTCCACGC _A_Concatenation GACTGTGGGCGG _B 3337.186679 -1.928753721 0.324456748 -5.944563435 2.77E-09 8.48E-09
ATAACTCCACGC _A_Concatenation GAGACAGCTCTC _B 5871.994573 -0.850703115 0.129432391 -6.572567393 4.95E-11 1.76E-10
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B 1629.650511 1.262729308 0.377937217 3.341108662 0.000834446 0.001457207
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B 755.9126702 1.887817562 0.589565162 3.202050738 0.00136453 0.002298466
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B 1629.650511 1.262729308 0.377937217 3.341108662 0.000834446 0.001457207
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B 755.9126702 1.887817562 0.589565162 3.202050738 0.00136453 0.002298466
ATAACTCCACGC _A_Concatenation GAGTGACCTATT _B 38898.01182 -0.835180179 0.140661781 -5.937506065 2.89E-09 8.79E-09
ATAACTCCACGC _A_Concatenation GATCGCATGTTC _B 4723.657359 1.670471877 0.368062121 4.538559606 5.66E-06 1.25E-05
ATAACTCCACGC _A_Concatenation GATGGAGCTACA _B 3376.88519 0.195980857 0.450526895 0.43500368 0.663559757 0.701992177
ATAACTCCACGC _A_Concatenation GCAACGAGGGTC _B 1533.503845 2.193515279 0.679686386 3.227246159 0.001249879 0.002123704
ATAACTCCACGC _A_Concatenation GCAGATTCGAAT _B 13594.38178 -1.309123082 0.16062805 -8.150027871 3.64E-16 1.90E-15
ATAACTCCACGC _A_Concatenation GCGAGGCAGACC _B 15382.73155 -1.546709208 0.2631532 -5.877599838 4.16E-09 1.24E-08
ATAACTCCACGC _A_Concatenation GGAGGCACATCG _B 879.1416484 -0.819358772 0.709275781 -1.15520478 0.248006578 0.299194726
ATAACTCCACGC _A_Concatenation GGCACAGCTCCA _B 11282.6646 3.164010295 0.29499216 10.72574367 7.71E-27 9.79E-26
ATAACTCCACGC _A_Concatenation GGCACTGTCGAG _B 8847.229774 2.325135133 0.347795244 6.685356324 2.30E-11 8.39E-11
ATAACTCCACGC _A_Concatenation GGCCTTTCGGGC _B 0
ATAACTCCACGC _A_Concatenation GGGAGTATGTCG _B 17584.03076 -1.218503724 0.226940195 -5.369272395 7.91E-08 2.11E-07
ATAACTCCACGC _A_Concatenation GGTGTCTCGATT _B 497.1435816 9.041134031 0.76585993 11.80520573 3.67E-32 6.31E-31
ATAACTCCACGC _A_Concatenation GTACATACCGAA _B 3938.166661 2.102611132 0.651451204 3.22758039 0.00124842 0.002123704
ATAACTCCACGC _A_Concatenation GTCACCCTCCTT _B 0
ATAACTCCACGC _A_Concatenation GTTCCACGCCTG _B 2317.552 1.271545824 0.843551282 1.507372285 0.131715259 0.1668203
ATAACTCCACGC _A_Concatenation Stop_1_B 1137.523105 0.798851374 0.749100475 1.066414187 0.286236451 0.339017641
ATAACTCCACGC _A_Concatenation Stop_2_B 233.468405 2.21612062 1.772497968 1.25028105 0.211196898 0.256641553
ATAACTCCACGC _A_Concatenation TAAAATTTATCA _B 4067.861015 1.440863509 0.333820026 4.316288412 1.59E-05 3.29E-05
ATAACTCCACGC _A_Concatenation TAAGATTCAACG _B 1185.199721 0.887624512 0.936333252 0.947979269 0.343140008 0.397947093
ATAACTCCACGC _A_Concatenation TACATGAGGCAC _B 10593.4815 -0.490193537 0.37106009 -1.321062408 0.186480556 0.229711033
ATAACTCCACGC _A_Concatenation TACTTCTAGCGG _B 2578.578079 1.640060548 0.156548042 10.47640407 1.11E-25 1.28E-24
ATAACTCCACGC _A_Concatenation TATCGCAAGAAC _B 12563.11996 -1.389234571 0.2375299 -5.848672393 4.96E-09 1.46E-08
ATAACTCCACGC _A_Concatenation TATCTATCGTGT _B 826.0902795 1.246599869 0.473453515 2.632993167 0.008463605 0.012832626
ATAACTCCACGC _A_Concatenation TATTAAAAGTGC _B 10622.84954 -1.544755007 0.15962284 -9.677531142 3.76E-22 3.30E-21
ATAACTCCACGC _A_Concatenation TCAGCCGTAGGC _B 8675.039943 3.049505983 0.573241137 5.319761241 1.04E-07 2.73E-07
ATAACTCCACGC _A_Concatenation TCTATCACAACT _B 171.3249742 9.193956794 1.505962955 6.105035161 1.03E-09 3.26E-09
ATAACTCCACGC _A_Concatenation TCTGTACCAACG _B 2192.57237 1.757219114 0.745079347 2.358432189 0.018352312 0.026650749
ATAACTCCACGC _A_Concatenation TCTTAGAGGGTG _B 2960.911399 1.698739618 0.489993821 3.466859263 0.000526578 0.000935515
ATAACTCCACGC _A_Concatenation TCTTGGTTTTGA _B 6725.138863 2.181554535 0.284761747 7.660981705 1.85E-14 8.59E-14
ATAACTCCACGC _A_Concatenation TGATGAGCGCCC _B 9292.107714 1.386807308 0.260273879 5.32826157 9.92E-08 2.62E-07
ATAACTCCACGC _A_Concatenation TGCCAGACGGTC _B 4651.943894 2.879498645 0.367929728 7.826219049 5.03E-15 2.43E-14
ATAACTCCACGC _A_Concatenation TGGCCCCCAGTT _B 6262.640698 -0.582854709 0.240531913 -2.423190761 0.015384846 0.022594076
ATAACTCCACGC _A_Concatenation TTAAATGAGGGC _B 4016.428413 -0.148634354 0.147244636 -1.009438156 0.312764547 0.365621756
ATAACTCCACGC _A_Concatenation TTAGTGCCATAC _B 0
ATAACTCCACGC _A_Concatenation TTCTAATTTCCT _B 4520.477976 1.890478758 0.29031195 6.51188749 7.42E-11 2.63E-10
ATAACTCCACGC _A_Concatenation TTTAACGGATTG _B 8109.13366 2.806610615 0.214137434 13.1065856 3.02E-39 7.06E-38
ATAACTCCACGC _A_Concatenation Joint_ 1_B 2209.767613 3.530393863 0.764919965 4.615376806 3.92E-06 8.80E-06
ATAACTCCACGC _A_Concatenation Joint_ 2_B 9346.095048 2.797108229 0.456043474 6.133424534 8.60E-10 2.78E-09
ATAACTCCACGC _A_Concatenation Joint_ 3_B 17594.74342 2.26213997 0.245779647 9.20393529 3.45E-20 2.51E-19
ATCCAATCTGTG _A_Concatenation ACTTGATGGTTT _B 5548.947268 2.872876554 0.403456794 7.120654801 1.07E-12 4.33E-12
ATCCAATCTGTG _A_Concatenation AGAGTGCGCTCT _B 9393.829084 -1.695062239 0.166799743 -10.16225932 2.92E-24 2.92E-23
ATCCAATCTGTG _A_Concatenation ATAACTCCACGC _B 2942.355362 -1.834673755 0.288438472 -6.360710969 2.01E-10 6.80E-10
ATCCAATCTGTG _A_Concatenation ATGACCCCTTGT _B 311.3967368 -0.154913263 1.036349518 -0.149479747 0.88117509 0.898860105
ATCCAATCTGTG _A_Concatenation CTACGTGGCCCC _B 2455.294276 -0.676441944 0.421373869 -1.605324853 0.108422333 0.140050505
ATCCAATCTGTG _A_Concatenation GAGATTGTGTCC _B 1602.806216 0.560867177 0.427537863 1.311853817 0.189569463 0.232863979
ATCTCTCAGTTC _A_Concatenation ACTTGATGGTTT _B 6054.812689 2.486938892 0.528306637 4.707377723 2.51E-06 5.72E-06
ATCTCTCAGTTC _A_Concatenation AGAGTGCGCTCT _B 12589.08411 -1.652175566 0.174706148 -9.456882812 3.17E-21 2.58E-20
ATCTCTCAGTTC _A_Concatenation ATAACTCCACGC _B 1832.88313 1.990900891 0.385509148 5.164341502 2.41E-07 6.11E-07
ATCTCTCAGTTC _A_Concatenation ATGACCCCTTGT _B 3537.004707 -1.266277279 0.588216121 -2.15274154 0.031338997 0.043927203
ATCTCTCAGTTC _A_Concatenation CTACGTGGCCCC _B 6236.194184 0.903016357 0.317594115 2.843303178 0.004464858 0.006977832
ATCTCTCAGTTC _A_Concatenation GAGATTGTGTCC _B 3970.136403 4.141558093 0.658901102 6.285553447 3.27E-10 1.09E-09
ATGACAAAACGA _A_Concatenation ACTTGATGGTTT _B 18237.28353 -1.650740096 0.187735162 -8.792919113 1.46E-18 9.31E-18
ATGACAAAACGA _A_Concatenation AGAGTGCGCTCT _B 9940.084266 -1.561917179 0.173246666 -9.015568469 1.96E-19 1.32E-18
ATGACAAAACGA _A_Concatenation ATAACTCCACGC _B 24868.31806 -1.613728728 0.121979932 -13.22946075 5.93E-40 1.44E-38
ATGACAAAACGA _A_Concatenation ATGACCCCTTGT _B 13406.54463 -0.908494188 0.238889741 -3.802985364 0.000142963 0.00026912
ATGACAAAACGA _A_Concatenation CTACGTGGCCCC _B 20209.35222 -1.496135662 0.161908491 -9.240625092 2.45E-20 1.80E-19
ATGACAAAACGA _A_Concatenation GAGATTGTGTCC _B 14692.50439 -1.319674258 0.220346731 -5.989080276 2.11E-09 6.56E-09
ATGACCCCTTGT _A_Concatenation AAAACAAGCATT _B 2287.515275 -0.525536449 0.20023098 -2.624651031 0.008673781 0.013134262
ATGACCCCTTGT _A_Concatenation AAACTTTGAGCG _B 5.735324621 5.019891207 2.01835398 2.487121316 0.012878148 0.019104765
ATGACCCCTTGT _A_Concatenation AAGACCGGTGCC _B 359.2014938 0.622596435 0.850207415 0.732287703 0.463992983 0.51510712
ATGACCCCTTGT _A_Concatenation AAGCAACGATAT _B 2767.974294 -0.083507557 0.282940013 -0.295142269 0.767885201 0.799317582
ATGACCCCTTGT _A_Concatenation AAGGGTAAATTG _B 1704.08132 2.47354945 0.685362907 3.609109019 0.00030725 0.000562971
ATGACCCCTTGT _A_Concatenation AATCAACGAGCA _B 203.4996826 5.858636294 0.835955168 7.008313983 2.41E-12 9.43E-12
ATGACCCCTTGT _A_Concatenation AATCGAGACCAG _B 996.684093 -1.715188982 0.914061994 -1.876447104 0.060593912 0.081512408
ATGACCCCTTGT _A_Concatenation AATCGATCCTAC _B 3062.76177 -3.489120362 0.393635252 -8.8638412 7.73E-19 5.08E-18
ATGACCCCTTGT _A_Concatenation AATGGCACTAGC _B 7692.62048 0.922655547 0.271802059 3.394586307 0.000687324 0.001215554
ATGACCCCTTGT _A_Concatenation ACAACGTGCATA _B 878.2026969 2.02689206 0.604880538 3.350896471 0.000805504 0.001413865
ATGACCCCTTGT _A_Concatenation ACCAGTATTTAA _B 216.6006839 1.894384791 1.352123072 1.401044646 0.161200723 0.200046332
ATGACCCCTTGT _A_Concatenation ACGACTCACCGC _B 2132.27275 -0.785126157 0.366168307 -2.14416743 0.032019474 0.044773642
ATGACCCCTTGT _A_Concatenation ACGGCGTAAGTA _B 1043.756628 -1.055441727 1.085283276 -0.972503447 0.330800151 0.385164718
ATGACCCCTTGT _A_Concatenation ACTCGTGTTGTC _B 4628.537822 -0.442695862 0.273699911 -1.617449783 0.105781227 0.137398061
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B 2168.577771 2.949146111 0.371861607 7.93076257 2.18E-15 1.07E-14
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B 1091.484458 -0.191776516 0.520528793 -0.368426336 0.712555363 0.749754473
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B 2168.577771 2.949146111 0.371861607 7.93076257 2.18E-15 1.07E-14
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B 1091.484458 -0.191776516 0.520528793 -0.368426336 0.712555363 0.749754473
ATGACCCCTTGT _A_Concatenation ACTTTAGTAACA _B 2367.858862 -1.264450711 0.375000206 -3.371866707 0.000746606 0.001318402
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B 1136.121391 0.364815874 0.58990782 0.618428611 0.536292843 0.587009675
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B 1362.242812 0.71221897 0.46655236 1.526557424 0.126871101 0.161384458
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B 1136.121391 0.364815874 0.58990782 0.618428611 0.536292843 0.587009675
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B 1362.242812 0.71221897 0.46655236 1.526557424 0.126871101 0.161384458
ATGACCCCTTGT _A_Concatenation AGCCAGGTCACC _B 785.8614332 -4.430915506 1.009266745 -4.390232343 1.13E-05 2.38E-05
ATGACCCCTTGT _A_Concatenation AGCCATAAACCT _B 402.9449401 1.058582834 1.2987902 0.815052988 0.41504198 0.468324396
ATGACCCCTTGT _A_Concatenation AGCCGTGCTTGA _B 307.0305729 1.458530704 0.907567516 1.607076806 0.108037521 0.139707812
ATGACCCCTTGT _A_Concatenation AGGCTTCCCGCT _B 3606.085282 -1.112953854 0.246859408 -4.508452261 6.53E-06 1.41E-05
ATGACCCCTTGT _A_Concatenation AGTTAGGCTCTT _B 1981.07778 3.623533786 1.089050337 3.327241786 0.000877102 0.001528067
ATGACCCCTTGT _A_Concatenation AGTTGCTTTGCT _B 7349.196536 -1.816119282 0.309895761 -5.860419885 4.62E-09 1.37E-08
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B 943.3389474 0.03640805 0.432512206 0.084178088 0.93291484 0.945041116
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B 524.4238384 -0.445256262 0.422703297 -1.053354125 0.292178721 0.344311416
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B 943.3389474 0.03640805 0.432512206 0.084178088 0.93291484 0.945041116
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B 524.4238384 -0.445256262 0.422703297 -1.053354125 0.292178721 0.344311416
ATGACCCCTTGT _A_Concatenation ATCCAATCTGTG _B 109.7031691 -1.247649688 1.208046522 -1.032782816 0.301705547 0.354110226
ATGACCCCTTGT _A_Concatenation ATCTCTCAGTTC _B 1058.096363 2.025692346 1.007609466 2.010394319 0.044389471 0.061120485
ATGACCCCTTGT _A_Concatenation ATGACAAAACGA _B 892.6541716 2.86904875 0.916476182 3.13052189 0.00174496 0.002860951
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B 577.6985671 0.675060337 0.780915191 0.864447695 0.387342001 0.440898539
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B 317.8317371 0.75742988 1.119093522 0.67682447 0.49851733 0.547540563
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B 577.6985671 0.675060337 0.780915191 0.864447695 0.387342001 0.440898539
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B 317.8317371 0.75742988 1.119093522 0.67682447 0.49851733 0.547540563
ATGACCCCTTGT _A_Concatenation ATGTGGCACGAC _B 2466.27402 -1.296948223 0.297017273 -4.366575081 1.26E-05 2.63E-05
ATGACCCCTTGT _A_Concatenation ATTGTATCTAAC _B 1232.366084 0.999686076 0.777524593 1.285729205 0.198537585 0.242012969
ATGACCCCTTGT _A_Concatenation ATTTGTAGACCG _B 1058.215722 0.422084209 0.694986449 0.60732725 0.543633757 0.593930159
ATGACCCCTTGT _A_Concatenation CAACGCATGCCT _B 14422.265 -1.675934858 0.220667353 -7.594847348 3.08E-14 1.41E-13
ATGACCCCTTGT _A_Concatenation CACACCCGGCAG _B 1928.143102 -0.185253019 0.563474082 -0.32876937 0.742330017 0.776887905
ATGACCCCTTGT _A_Concatenation CAGTGACATAGT _B 1301.517589 0.548568316 0.361077174 1.519255039 0.128698312 0.16353079
ATGACCCCTTGT _A_Concatenation CATCTCACCTAG _B 876.4882084 0.554139336 0.754077756 0.734857025 0.462426574 0.514002339
ATGACCCCTTGT _A_Concatenation CATCTGAGTCGT _B 0.365798231 1.018599464 1.87755126 0.542514863 0.587463871 0.631199693
ATGACCCCTTGT _A_Concatenation CATGCTAACACC _B 1566.459453 -0.991486495 0.3268363 -3.033587437 0.002416647 0.003875254
ATGACCCCTTGT _A_Concatenation CATTAGTCCCGC _B 295.2280029 1.06692514 1.42240895 0.750083258 0.453204562 0.506497259
ATGACCCCTTGT _A_Concatenation CATTGTTTAAAT _B 1307.809697 0.853441729 0.791323457 1.078499217 0.280811031 0.333267102
ATGACCCCTTGT _A_Concatenation CCAGACGGTCTG _B 1077.338153 -0.564467951 0.749696166 -0.752928956 0.451492607 0.505066849
ATGACCCCTTGT _A_Concatenation CCCCTTAGGAAT _B 509.3493107 5.04210857 0.660310705 7.635963696 2.24E-14 1.04E-13
ATGACCCCTTGT _A_Concatenation CCCGTAGGGGCT _B 7.121212879 5.330137676 2.706598741 1.969312109 0.048917262 0.066804064
ATGACCCCTTGT _A_Concatenation CCCTATGTTCTA _B 1486.11499 -0.350263211 0.404485887 -0.86594668 0.386519402 0.440391015
ATGACCCCTTGT _A_Concatenation CCCTGCCTCGGG _B 6951.478431 -1.497620447 0.151381786 -9.893002892 4.46E-23 4.14E-22
ATGACCCCTTGT _A_Concatenation CCGCCCTTATGT _B 329.6625888 0.90963677 1.650708968 0.551058235 0.581593755 0.625467433
ATGACCCCTTGT _A_Concatenation CCTAAATGCAAG _B 4187.183258 -0.705756567 0.197454626 -3.574272126 0.000351204 0.000640494
ATGACCCCTTGT _A_Concatenation CGATGGCGGAAT _B 7442.844876 -1.799683603 0.213924313 -8.412711869 4.01E-17 2.28E-16
ATGACCCCTTGT _A_Concatenation CGCCGGCATACA _B 2195.020751 0.376678768 0.216322731 1.741281496 0.081634248 0.107587865
ATGACCCCTTGT _A_Concatenation CGGCAATTGCTT _B 1190.925959 0.048174904 0.644225311 0.074779588 0.940390083 0.950965404
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B 0
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B 482.4955596 -0.478262205 0.957341742 -0.499573124 0.617375686 0.660303913
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B 0
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B 482.4955596 -0.478262205 0.957341742 -0.499573124 0.617375686 0.660303913
ATGACCCCTTGT _A_Concatenation CTCACGACAAGA _B 668.2556154 4.680127047 1.253381716 3.733999777 0.000188463 0.000351937
ATGACCCCTTGT _A_Concatenation CTCAGCGATATA _B 1360.519738 1.050055413 0.644611584 1.628973848 0.103318555 0.1343486
ATGACCCCTTGT _A_Concatenation CTCCATTAATGA _B 10519.93601 -0.811526433 0.242254393 -3.349893569 0.000808426 0.001416867
ATGACCCCTTGT _A_Concatenation CTCGCACCGAGG _B 14533.13671 -1.656139684 0.331105628 -5.001846974 5.68E-07 1.37E-06
ATGACCCCTTGT _A_Concatenation CTCGTTATGGCA _B 1698.340302 1.04464085 0.61697245 1.693172603 0.090422618 0.118902182
ATGACCCCTTGT _A_Concatenation CTGCCGGTTGCG _B 1782.029214 -0.898092237 0.364933969 -2.460971884 0.013856122 0.020451777
ATGACCCCTTGT _A_Concatenation CTGGTAGTTGTT _B 12224.64789 -1.075846971 0.202245607 -5.319507242 1.04E-07 2.73E-07
ATGACCCCTTGT _A_Concatenation EPICXV.47_B 303.3712049 -3.073761593 0.934682791 -3.288561234 0.001007009 0.001733716
ATGACCCCTTGT _A_Concatenation GACTGTGGGCGG _B 1298.324089 -0.793184337 0.227623987 -3.484625443 0.000492826 0.000882257
ATGACCCCTTGT _A_Concatenation GAGACAGCTCTC _B 4195.163522 -0.310077683 0.282390318 -1.098046439 0.272184211 0.324345916
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B 885.7464797 -0.454030441 0.992465168 -0.457477457 0.647327902 0.686684499
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B 521.8273892 4.008315602 0.855559544 4.685022371 2.80E-06 6.35E-06
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B 885.7464797 -0.454030441 0.992465168 -0.457477457 0.647327902 0.686684499
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B 521.8273892 4.008315602 0.855559544 4.685022371 2.80E-06 6.35E-06
ATGACCCCTTGT _A_Concatenation GAGTGACCTATT _B 11215.98758 -1.846011417 0.201625402 -9.155649047 5.40E-20 3.83E-19
ATGACCCCTTGT _A_Concatenation GATCGCATGTTC _B 815.4825475 1.205367865 1.309436343 0.920524217 0.357298884 0.412268472
ATGACCCCTTGT _A_Concatenation GATGGAGCTACA _B 807.7004617 0.547613876 0.473697235 1.156041952 0.247663997 0.299090096
ATGACCCCTTGT _A_Concatenation GCAACGAGGGTC _B 234.1471285 -0.832023953 1.026530525 -0.810520421 0.417641133 0.470802782
ATGACCCCTTGT _A_Concatenation GCAGATTCGAAT _B 7602.69883 -0.988303782 0.17105731 -5.777617935 7.58E-09 2.18E-08
ATGACCCCTTGT _A_Concatenation GCGAGGCAGACC _B 5500.668748 -0.313088304 0.334070013 -0.93719368 0.34865894 0.403546833
ATGACCCCTTGT _A_Concatenation GGAGGCACATCG _B 7462.625076 8.72098559 1.350370488 6.458216963 1.06E-10 3.69E-10
ATGACCCCTTGT _A_Concatenation GGCACAGCTCCA _B 4465.899797 1.790920936 0.597816081 2.995772436 0.002737506 0.004353939
ATGACCCCTTGT _A_Concatenation GGCACTGTCGAG _B 1562.659102 -0.617922189 0.282425217 -2.187914365 0.028675841 0.040485577
ATGACCCCTTGT _A_Concatenation GGCCTTTCGGGC _B 0
ATGACCCCTTGT _A_Concatenation GGGAGTATGTCG _B 2476.152138 -0.521711314 0.467127552 -1.116849802 0.264058551 0.315628268
ATGACCCCTTGT _A_Concatenation GGTGTCTCGATT _B 935.2907766 -2.079233258 0.724782835 -2.868767246 0.004120749 0.006457313
ATGACCCCTTGT _A_Concatenation GTACATACCGAA _B 933.5235491 1.654746947 0.739177606 2.238632412 0.025179843 0.035852907
ATGACCCCTTGT _A_Concatenation GTCACCCTCCTT _B 0
ATGACCCCTTGT _A_Concatenation GTTCCACGCCTG _B 500.1774776 2.765118216 1.142249488 2.420765556 0.015487862 0.022716827
ATGACCCCTTGT _A_Concatenation Stop_1_B 1555.398815 0.474188146 0.515456451 0.919938329 0.357604988 0.412268472
ATGACCCCTTGT _A_Concatenation Stop_2_B 0.612336762 0.5080298 1.467069773 0.346288779 0.729125701 0.765811271
ATGACCCCTTGT _A_Concatenation TAAAATTTATCA _B 1329.875155 0.332647337 1.000211388 0.332577035 0.739453581 0.774571
ATGACCCCTTGT _A_Concatenation TAAGATTCAACG _B 436.4478317 2.473855484 1.914782095 1.291977552 0.196364919 0.239864775
ATGACCCCTTGT _A_Concatenation TACATGAGGCAC _B 1511.794236 1.220926781 0.439402422 2.778607307 0.005459248 0.008452796
ATGACCCCTTGT _A_Concatenation TACTTCTAGCGG _B 478.5198881 0.693637766 0.944185972 0.734641041 0.462558136 0.514002339
ATGACCCCTTGT _A_Concatenation TATCGCAAGAAC _B 4510.443005 -1.161403709 0.365360146 -3.178791451 0.001478905 0.002462734
ATGACCCCTTGT _A_Concatenation TATCTATCGTGT _B 279.5307797 2.212810566 0.972081277 2.276363735 0.022824248 0.032738094
ATGACCCCTTGT _A_Concatenation TATTAAAAGTGC _B 3516.783324 -0.901112032 0.301380839 -2.989944663 0.00279028 0.004425831
ATGACCCCTTGT _A_Concatenation TCAGCCGTAGGC _B 948.4024368 0.368921854 0.832167168 0.443326615 0.657529508 0.696242749
ATGACCCCTTGT _A_Concatenation TCTATCACAACT _B 0.475159289 1.425091302 2.107082379 0.676333928 0.498828656 0.547540563
ATGACCCCTTGT _A_Concatenation TCTGTACCAACG _B 665.2200161 0.15339516 0.962822562 0.159318203 0.87341818 0.892505116
ATGACCCCTTGT _A_Concatenation TCTTAGAGGGTG _B 1950.570672 -0.987105007 0.293861742 -3.359079682 0.000782025 0.001374718
ATGACCCCTTGT _A_Concatenation TCTTGGTTTTGA _B 1129.693256 1.11859095 1.255485191 0.890963078 0.372948978 0.426592324
ATGACCCCTTGT _A_Concatenation TGATGAGCGCCC _B 1606.678065 -0.011157164 0.63925852 -0.01745329 0.986074996 0.98902379
ATGACCCCTTGT _A_Concatenation TGCCAGACGGTC _B 5651.774607 0.012569397 0.274901214 0.045723324 0.963530774 0.969335176
ATGACCCCTTGT _A_Concatenation TGGCCCCCAGTT _B 2714.044003 -0.595346355 0.402697842 -1.4783947 0.139302163 0.175289805
ATGACCCCTTGT _A_Concatenation TTAAATGAGGGC _B 1856.686977 0.147171198 0.282442378 0.521066277 0.602320602 0.646064294
ATGACCCCTTGT _A_Concatenation TTAGTGCCATAC _B 0
ATGACCCCTTGT _A_Concatenation TTCTAATTTCCT _B 11195.00094 1.098253682 0.664274794 1.653312291 0.098267323 0.128494967
ATGACCCCTTGT _A_Concatenation TTTAACGGATTG _B 1455.989183 -0.776645313 0.730019137 -1.063869799 0.287387686 0.339349702
ATGACCCCTTGT _A_Concatenation Joint_ 1_B 752.5198842 -1.444943888 1.263073794 -1.143990078 0.252627752 0.304141959
ATGACCCCTTGT _A_Concatenation Joint_ 2_B 1110.470857 0.637154071 0.480671907 1.325548802 0.184989194 0.228114312
ATGACCCCTTGT _A_Concatenation Joint_ 3_B 3908.820494 0.040268554 0.236898364 0.169982405 0.865023974 0.88470081
ATGTGGCACGAC _A_Concatenation ACTTGATGGTTT _B 5095.641866 2.448419689 0.421020313 5.815443132 6.05E-09 1.77E-08
ATGTGGCACGAC _A_Concatenation AGAGTGCGCTCT _B 6047.820627 -0.729866992 0.197292628 -3.699413405 0.000216098 0.000400983
ATGTGGCACGAC _A_Concatenation ATAACTCCACGC _B 773.6173093 0.431277631 0.598620834 0.72045209 0.471246688 0.521673654
ATGTGGCACGAC _A_Concatenation ATGACCCCTTGT _B 1540.556675 -1.285062464 0.49890564 -2.575762552 0.010001931 0.014970879
ATGTGGCACGAC _A_Concatenation CTACGTGGCCCC _B 1874.432834 2.300046892 0.579951321 3.965930947 7.31E-05 0.000141506
ATGTGGCACGAC _A_Concatenation GAGATTGTGTCC _B 6739.163558 -1.678747934 0.207190859 -8.102422761 5.39E-16 2.75E-15
ATTGTATCTAAC _A_Concatenation ACTTGATGGTTT _B 24139.71307 1.352191287 0.218567775 6.186599503 6.15E-10 2.02E-09
ATTGTATCTAAC _A_Concatenation AGAGTGCGCTCT _B 23994.9063 -1.466219367 0.153272298 -9.566108058 1.11E-21 9.47E-21
ATTGTATCTAAC _A_Concatenation ATAACTCCACGC _B 6980.824289 0.93531153 0.373705889 2.502801156 0.012321475 0.018302165
ATTGTATCTAAC _A_Concatenation ATGACCCCTTGT _B 1971.904722 0.642400748 0.246543575 2.605627624 0.00917061 0.013815004
ATTGTATCTAAC _A_Concatenation CTACGTGGCCCC _B 8719.546707 1.909931947 0.360623875 5.296188293 1.18E-07 3.08E-07
ATTGTATCTAAC _A_Concatenation GAGATTGTGTCC _B 10818.52818 0.638717896 0.125131406 5.104377208 3.32E-07 8.33E-07
ATTTGTAGACCG _A_Concatenation ACTTGATGGTTT _B 12449.504 -0.532255558 0.137232498 -3.878495005 0.000105105 0.000199784
ATTTGTAGACCG _A_Concatenation AGAGTGCGCTCT _B 10062.98179 -1.338155433 0.183240159 -7.302741059 2.82E-13 1.17E-12
ATTTGTAGACCG _A_Concatenation ATAACTCCACGC _B 814.6974146 0.532632847 0.902539208 0.590149261 0.555090585 0.604190777
ATTTGTAGACCG _A_Concatenation ATGACCCCTTGT _B 1115.059329 -1.549811212 0.463508512 -3.343652107 0.000826834 0.001446959
ATTTGTAGACCG _A_Concatenation CTACGTGGCCCC _B 5284.369087 0.678551148 0.157454497 4.309506299 1.64E-05 3.36E-05
ATTTGTAGACCG _A_Concatenation GAGATTGTGTCC _B 4476.743427 1.632420995 0.18068546 9.034600778 1.65E-19 1.12E-18
CAACGCATGCCT _A_Concatenation ACTTGATGGTTT _B 7419.813011 -1.421461103 0.219245394 -6.483425159 8.97E-11 3.14E-10
CAACGCATGCCT _A_Concatenation AGAGTGCGCTCT _B 7073.509416 -0.916010416 0.212461993 -4.311408371 1.62E-05 3.34E-05
CAACGCATGCCT _A_Concatenation ATAACTCCACGC _B 9827.361859 -1.034500129 0.161772056 -6.394801145 1.61E-10 5.51E-10
CAACGCATGCCT _A_Concatenation ATGACCCCTTGT _B 1105.304225 -0.966323948 0.307952527 -3.137899064 0.001701635 0.002797765
CAACGCATGCCT _A_Concatenation CTACGTGGCCCC _B 9528.378512 -2.157771629 0.253692395 -8.505464378 1.81E-17 1.07E-16
CAACGCATGCCT _A_Concatenation GAGATTGTGTCC _B 10490.33969 -2.857159599 0.220524115 -12.95622292 2.17E-38 4.78E-37
CACACCCGGCAG _A_Concatenation ACTTGATGGTTT _B 12305.95256 3.795537336 0.388192184 9.777469758 1.41E-22 1.27E-21
CACACCCGGCAG _A_Concatenation AGAGTGCGCTCT _B 28023.53979 0.013639503 0.142496975 0.09571784 0.923744672 0.936563332
CACACCCGGCAG _A_Concatenation ATAACTCCACGC _B 3076.259048 2.355161651 0.506682838 4.648197012 3.35E-06 7.56E-06
CACACCCGGCAG _A_Concatenation ATGACCCCTTGT _B 6359.834091 -1.808272323 0.213103852 -8.485404222 2.15E-17 1.25E-16
CACACCCGGCAG _A_Concatenation CTACGTGGCCCC _B 5164.160074 1.059695069 0.212060772 4.997129174 5.82E-07 1.40E-06
CACACCCGGCAG _A_Concatenation GAGATTGTGTCC _B 3805.636558 1.17417008 0.373102082 3.147047783 0.00164928 0.002719335
CAGTGACATAGT _A_Concatenation ACTTGATGGTTT _B 4360.546813 0.679023926 0.215814314 3.14633406 0.00165331 0.002722141
CAGTGACATAGT _A_Concatenation AGAGTGCGCTCT _B 6252.181442 -2.070393053 0.154656159 -13.38707148 7.20E-41 1.83E-39
CAGTGACATAGT _A_Concatenation ATAACTCCACGC _B 3918.571575 -1.722742868 0.326697176 -5.273210161 1.34E-07 3.47E-07
CAGTGACATAGT _A_Concatenation ATGACCCCTTGT _B 2241.253907 -0.072621927 0.146057456 -0.497214786 0.619037596 0.660918806
CAGTGACATAGT _A_Concatenation CTACGTGGCCCC _B 3266.05954 -0.297932812 0.160916868 -1.851470366 0.064101914 0.085934791
CAGTGACATAGT _A_Concatenation GAGATTGTGTCC _B 5017.665716 0.013951149 0.226044692 0.061718544 0.950786972 0.958990484
CATCTCACCTAG _A_Concatenation ACTTGATGGTTT _B 11946.44895 3.638856263 0.220133194 16.53024785 2.22E-61 1.37E-59
CATCTCACCTAG _A_Concatenation AGAGTGCGCTCT _B 16003.26959 -1.075382243 0.128420265 -8.373929479 5.57E-17 3.13E-16
CATCTCACCTAG _A_Concatenation ATAACTCCACGC _B 3859.034055 -0.566270041 0.420046936 -1.348111349 0.177622572 0.2197257
CATCTCACCTAG _A_Concatenation ATGACCCCTTGT _B 1986.242644 1.391414284 0.790325411 1.760558709 0.078313123 0.103444113
CATCTCACCTAG _A_Concatenation CTACGTGGCCCC _B 5844.460326 1.390616272 0.461339945 3.014298432 0.002575743 0.004113448
CATCTCACCTAG _A_Concatenation GAGATTGTGTCC _B 5403.645313 1.936581824 0.449375277 4.309497926 1.64E-05 3.36E-05
CATCTGAGTCGT _A_Concatenation ACTTGATGGTTT _B 7217.684202 3.943917037 0.384336997 10.26161173 1.05E-24 1.09E-23
CATCTGAGTCGT _A_Concatenation AGAGTGCGCTCT _B 18368.28068 -1.277099226 0.112859198 -11.315863 1.10E-29 1.62E-28
CATCTGAGTCGT _A_Concatenation ATAACTCCACGC _B 1588.843348 2.420260724 0.666019066 3.633921079 0.000279146 0.000513085
CATCTGAGTCGT _A_Concatenation ATGACCCCTTGT _B 897.3327052 -0.781425839 1.342679738 -0.581989745 0.560573588 0.609024651
CATCTGAGTCGT _A_Concatenation CTACGTGGCCCC _B 1782.523095 1.608260208 0.274976646 5.848715625 4.95E-09 1.46E-08
CATCTGAGTCGT _A_Concatenation GAGATTGTGTCC _B 2732.22942 0.746913135 0.89373031 0.835725415 0.403309394 0.457292611
CATGCTAACACC _A_Concatenation ACTTGATGGTTT _B 2149.110057 1.147353288 0.64736941 1.772331639 0.076339522 0.101066095
CATGCTAACACC _A_Concatenation AGAGTGCGCTCT _B 3918.207535 0.809065323 0.281976499 2.869265079 0.004114268 0.006455812
CATGCTAACACC _A_Concatenation ATAACTCCACGC _B 1301.954231 0.185729359 0.325528617 0.570546949 0.568306788 0.614570431
CATGCTAACACC _A_Concatenation ATGACCCCTTGT _B 1519.002401 -2.194968027 0.332003084 -6.611288062 3.81E-11 1.37E-10
CATGCTAACACC _A_Concatenation CTACGTGGCCCC _B 655.3439085 1.267264167 0.903566746 1.402513065 0.160762092 0.199714012
CATGCTAACACC _A_Concatenation GAGATTGTGTCC _B 357.3287364 2.234016085 1.245886488 1.793113665 0.072954751 0.097024009
CATTAGTCCCGC _A_Concatenation ACTTGATGGTTT _B 5373.385514 2.311393706 0.240209383 9.622412221 6.43E-22 5.57E-21
CATTAGTCCCGC _A_Concatenation AGAGTGCGCTCT _B 11785.31091 -0.133471811 0.188900808 -0.706570883 0.479833189 0.53067644
CATTAGTCCCGC _A_Concatenation ATAACTCCACGC _B 3205.555883 -1.053650808 0.391163138 -2.693635231 0.007067747 0.010842777
CATTAGTCCCGC _A_Concatenation ATGACCCCTTGT _B 3979.998108 -3.015937955 0.389643513 -7.740249361 9.92E-15 4.68E-14
CATTAGTCCCGC _A_Concatenation CTACGTGGCCCC _B 3403.644957 -0.924091644 0.288917896 -3.19845761 0.001381648 0.002320614
CATTAGTCCCGC _A_Concatenation GAGATTGTGTCC _B 2536.550192 -0.889240951 0.437382374 -2.03309736 0.042042688 0.057957431
CATTGTTTAAAT _A_Concatenation ACTTGATGGTTT _B 14303.36113 3.944358545 0.486788739 8.102813863 5.37E-16 2.75E-15
CATTGTTTAAAT _A_Concatenation AGAGTGCGCTCT _B 20511.17682 -1.552367967 0.145878579 -10.64150734 1.91E-26 2.30E-25
CATTGTTTAAAT _A_Concatenation ATAACTCCACGC _B 2181.724046 1.759846351 0.765198434 2.299856184 0.021456369 0.030927861
CATTGTTTAAAT _A_Concatenation ATGACCCCTTGT _B 732.5492685 0.817619209 0.890944483 0.91769939 0.358776267 0.413211287
CATTGTTTAAAT _A_Concatenation CTACGTGGCCCC _B 3285.122722 2.609998818 0.509403177 5.12364064 3.00E-07 7.53E-07
CATTGTTTAAAT _A_Concatenation GAGATTGTGTCC _B 2683.67616 1.184993478 0.287701286 4.118832748 3.81E-05 7.56E-05
CCAGACGGTCTG _A_Concatenation ACTTGATGGTTT _B 8223.252539 -1.340416534 0.233808013 -5.732979462 9.87E-09 2.81E-08
CCAGACGGTCTG _A_Concatenation AGAGTGCGCTCT _B 7451.368311 -1.563258225 0.325597588 -4.801197196 1.58E-06 3.67E-06
CCAGACGGTCTG _A_Concatenation ATAACTCCACGC _B 10467.93958 -0.191487445 0.262392227 -0.729775601 0.465527361 0.516320194
CCAGACGGTCTG _A_Concatenation ATGACCCCTTGT _B 1396.630034 -0.3794634 0.625593917 -0.606565041 0.544139606 0.593930159
CCAGACGGTCTG _A_Concatenation CTACGTGGCCCC _B 9536.830883 -1.493867682 0.221391423 -6.74763125 1.50E-11 5.56E-11
CCAGACGGTCTG _A_Concatenation GAGATTGTGTCC _B 12886.20754 -0.737983045 0.232029654 -3.180554865 0.001469933 0.002451286
CCCCTTAGGAAT _A_Concatenation ACTTGATGGTTT _B 12114.4044 1.640303072 0.397763095 4.123819159 3.73E-05 7.41E-05
CCCCTTAGGAAT _A_Concatenation AGAGTGCGCTCT _B 18800.88348 -1.895815142 0.146215244 -12.9659199 1.91E-38 4.29E-37
CCCCTTAGGAAT _A_Concatenation ATAACTCCACGC _B 2493.416299 0.17498565 0.514836053 0.339886162 0.733942258 0.770178187
CCCCTTAGGAAT _A_Concatenation ATGACCCCTTGT _B 979.7426449 0.57270296 0.716587962 0.799208178 0.424169711 0.477241955
CCCCTTAGGAAT _A_Concatenation CTACGTGGCCCC _B 4755.683253 0.6723488 0.489853627 1.372550417 0.169892155 0.210385518
CCCCTTAGGAAT _A_Concatenation GAGATTGTGTCC _B 4733.324715 2.229956362 0.877081145 2.542474405 0.011007066 0.016412322
CCCGTAGGGGCT _A_Concatenation ACTTGATGGTTT _B 3767.976556 1.941885961 0.427959132 4.537550001 5.69E-06 1.25E-05
CCCGTAGGGGCT _A_Concatenation AGAGTGCGCTCT _B 9017.133556 -0.624429884 0.211892602 -2.946916877 0.003209595 0.005084033
CCCGTAGGGGCT _A_Concatenation ATAACTCCACGC _B 1565.709678 0.631866807 0.436152941 1.448727608 0.147413667 0.184306499
CCCGTAGGGGCT _A_Concatenation ATGACCCCTTGT _B 644.7174587 -2.315772704 0.681709656 -3.397007338 0.000681271 0.001206676
CCCGTAGGGGCT _A_Concatenation CTACGTGGCCCC _B 1758.536277 2.379765813 1.472676465 1.615946115 0.106105964 0.13751427
CCCGTAGGGGCT _A_Concatenation GAGATTGTGTCC _B 2056.702882 1.555989071 0.528424946 2.944579138 0.003233942 0.005115668
CCCTATGTTCTA _A_Concatenation ACTTGATGGTTT _B 9585.621476 1.281474094 0.311264638 4.116992217 3.84E-05 7.61E-05
CCCTATGTTCTA _A_Concatenation AGAGTGCGCTCT _B 22519.92456 1.091908721 0.178459696 6.118517209 9.45E-10 3.02E-09
CCCTATGTTCTA _A_Concatenation ATAACTCCACGC _B 4476.378542 -0.473947881 0.32241803 -1.469979455 0.141567318 0.17775746
CCCTATGTTCTA _A_Concatenation ATGACCCCTTGT _B 3851.487833 -0.725885908 0.301105707 -2.410734477 0.015920434 0.02329285
CCCTATGTTCTA _A_Concatenation CTACGTGGCCCC _B 1961.488657 0.684350196 0.446388534 1.533081933 0.125255669 0.159677074
CCCTATGTTCTA _A_Concatenation GAGATTGTGTCC _B 5615.839237 -1.380460315 0.361795071 -3.815586301 0.00013586 0.000256162
CCCTGCCTCGGG _A_Concatenation ACTTGATGGTTT _B 13706.93685 -2.025843128 0.236361474 -8.570953173 1.03E-17 6.22E-17
CCCTGCCTCGGG _A_Concatenation AGAGTGCGCTCT _B 4395.746943 -0.720679689 0.396410415 -1.818014012 0.069061991 0.09220222
CCCTGCCTCGGG _A_Concatenation ATAACTCCACGC _B 10291.76466 -0.173235129 0.152515265 -1.135854366 0.25601752 0.306958442
CCCTGCCTCGGG _A_Concatenation ATGACCCCTTGT _B 1140.627367 -1.549313497 0.747535923 -2.072560594 0.038213192 0.053053707
CCCTGCCTCGGG _A_Concatenation CTACGTGGCCCC _B 10152.64704 -0.991194945 0.160500384 -6.175654653 6.59E-10 2.15E-09
CCCTGCCTCGGG _A_Concatenation GAGATTGTGTCC _B 14253.85409 -1.669580159 0.114409503 -14.59301993 3.11E-48 1.17E-46
CCGCCCTTATGT _A_Concatenation ACTTGATGGTTT _B 920.9953773 7.680540582 0.915405873 8.390311674 4.85E-17 2.75E-16
CCGCCCTTATGT _A_Concatenation AGAGTGCGCTCT _B 1514.582336 -0.391918069 0.365405424 -1.072556793 0.28347002 0.336081595
CCGCCCTTATGT _A_Concatenation ATAACTCCACGC _B 64.94454274 8.517170106 1.781455677 4.78101713 1.74E-06 4.02E-06
CCGCCCTTATGT _A_Concatenation ATGACCCCTTGT _B 46.47763298 -0.99230683 1.413958346 -0.701793538 0.482807929 0.533461691
CCGCCCTTATGT _A_Concatenation CTACGTGGCCCC _B 344.9398444 0.389994122 1.231620918 0.316651103 0.751508355 0.785087817
CCGCCCTTATGT _A_Concatenation GAGATTGTGTCC _B 279.830954 7.895667926 0.923688136 8.547980235 1.25E-17 7.47E-17
CCTAAATGCAAG _A_Concatenation ACTTGATGGTTT _B 2319.402208 -2.39325174 0.286203924 -8.362050757 6.16E-17 3.45E-16
CCTAAATGCAAG _A_Concatenation AGAGTGCGCTCT _B 1522.603878 -1.127836336 0.308123728 -3.660335872 0.000251885 0.000465907
CCTAAATGCAAG _A_Concatenation ATAACTCCACGC _B 1377.550391 -0.445092013 0.575862296 -0.772913969 0.439573297 0.492676112
CCTAAATGCAAG _A_Concatenation ATGACCCCTTGT _B 349.099442 1.716151465 0.965156408 1.778107103 0.075386266 0.10003013
CCTAAATGCAAG _A_Concatenation CTACGTGGCCCC _B 497.1558333 0.547142923 0.958769489 0.570672023 0.568221987 0.614570431
CCTAAATGCAAG _A_Concatenation GAGATTGTGTCC _B 891.1928175 -1.154494628 0.440968466 -2.618088861 0.008842378 0.01335496
CGATGGCGGAAT _A_Concatenation ACTTGATGGTTT _B 562.9838927 0.015138398 1.021654006 0.014817539 0.988177747 0.98902379
CGATGGCGGAAT _A_Concatenation AGAGTGCGCTCT _B 1130.648088 -1.163985128 0.698758477 -1.665790351 0.095755179 0.125490811
CGATGGCGGAAT _A_Concatenation ATAACTCCACGC _B 541.8691076 -0.823311671 1.236196407 -0.666003935 0.505408585 0.554242623
CGATGGCGGAAT _A_Concatenation ATGACCCCTTGT _B 1670.880527 -5.614902708 0.923164854 -6.082231885 1.19E-09 3.73E-09
CGATGGCGGAAT _A_Concatenation CTACGTGGCCCC _B 580.192006 1.716848853 1.093292956 1.570346578 0.116334503 0.149281048
CGATGGCGGAAT _A_Concatenation GAGATTGTGTCC _B 1787.592118 -0.178987543 0.258500038 -0.692408189 0.488681025 0.538932187
CGCCGGCATACA _A_Concatenation ACTTGATGGTTT _B 7349.671117 2.357755315 0.467889257 5.039131118 4.68E-07 1.15E-06
CGCCGGCATACA _A_Concatenation AGAGTGCGCTCT _B 14868.66965 -0.555204161 0.170695658 -3.252596849 0.001143556 0.001963021
CGCCGGCATACA _A_Concatenation ATAACTCCACGC _B 3363.201043 0.999578994 1.170468616 0.853998972 0.393105559 0.447023734
CGCCGGCATACA _A_Concatenation ATGACCCCTTGT _B 1171.087108 -0.424791624 0.569397984 -0.746036402 0.455645425 0.50873878
CGCCGGCATACA _A_Concatenation CTACGTGGCCCC _B 2362.373098 2.216825278 0.850690562 2.605912628 0.009162983 0.013815004
CGCCGGCATACA _A_Concatenation GAGATTGTGTCC _B 1060.909951 1.201347941 0.453279028 2.650349709 0.008040849 0.012271218
CGGCAATTGCTT _A_Concatenation ACTTGATGGTTT _B 0
CGGCAATTGCTT _A_Concatenation AGAGTGCGCTCT _B 0
CGGCAATTGCTT _A_Concatenation ATAACTCCACGC _B 0
CGGCAATTGCTT _A_Concatenation ATGACCCCTTGT _B 0
CGGCAATTGCTT _A_Concatenation CTACGTGGCCCC _B 0
CGGCAATTGCTT _A_Concatenation GAGATTGTGTCC _B 0
CTACGTGGCCCC _A_Concatenation AAAACAAGCATT _B 9995.714008 -0.428803015 0.143315429 -2.992022678 0.002771357 0.004401788
CTACGTGGCCCC _A_Concatenation AAACTTTGAGCG _B 1502.767915 2.305731445 0.740849225 3.112281644 0.001856473 0.003031029
CTACGTGGCCCC _A_Concatenation AAGACCGGTGCC _B 10785.24246 1.936743035 0.236167695 8.200711083 2.39E-16 1.26E-15
CTACGTGGCCCC _A_Concatenation AAGCAACGATAT _B 12038.37956 -1.355369685 0.355009618 -3.817839337 0.000134626 0.000254244
CTACGTGGCCCC _A_Concatenation AAGGGTAAATTG _B 12102.24364 -0.470531276 0.388087464 -1.212436165 0.225345464 0.27326644
CTACGTGGCCCC _A_Concatenation AATCAACGAGCA _B 7094.232407 3.419168586 0.519147532 6.586121234 4.51E-11 1.61E-10
CTACGTGGCCCC _A_Concatenation AATCGAGACCAG _B 2829.285345 4.678915085 0.778084069 6.013379876 1.82E-09 5.68E-09
CTACGTGGCCCC _A_Concatenation AATCGATCCTAC _B 9520.461054 3.254642759 0.284649602 11.43385671 2.83E-30 4.41E-29
CTACGTGGCCCC _A_Concatenation AATGGCACTAGC _B 18314.94031 -0.43592875 0.185330676 -2.352167267 0.01866438 0.027070298
CTACGTGGCCCC _A_Concatenation ACAACGTGCATA _B 5234.170228 1.728268959 0.466560772 3.704274046 0.000211997 0.000393998
CTACGTGGCCCC _A_Concatenation ACCAGTATTTAA _B 3425.963345 3.002088097 0.73482886 4.085424864 4.40E-05 8.67E-05
CTACGTGGCCCC _A_Concatenation ACGACTCACCGC _B 4670.921673 0.088456585 0.157437686 0.561851401 0.574217268 0.619218965
CTACGTGGCCCC _A_Concatenation ACGGCGTAAGTA _B 1552.079293 -2.399873102 0.415027572 -5.782442571 7.36E-09 2.13E-08
CTACGTGGCCCC _A_Concatenation ACTCGTGTTGTC _B 11944.92523 -0.736553968 0.147401584 -4.996920311 5.83E-07 1.40E-06
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B 11551.28423 1.587956181 0.382028571 4.156642462 3.23E-05 6.44E-05
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B 3972.048043 0.327161629 0.472084969 0.69301429 0.488300584 0.538932187
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B 11551.28423 1.587956181 0.382028571 4.156642462 3.23E-05 6.44E-05
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B 3972.048043 0.327161629 0.472084969 0.69301429 0.488300584 0.538932187
CTACGTGGCCCC _A_Concatenation ACTTTAGTAACA _B 5354.187064 -0.463095371 0.222088069 -2.085187977 0.037052241 0.051503055
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B 17085.73655 0.788539819 0.125401725 6.288109804 3.21E-10 1.07E-09
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B 12307.05036 0.535323344 0.16900728 3.167457296 0.001537783 0.002553506
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B 17085.73655 0.788539819 0.125401725 6.288109804 3.21E-10 1.07E-09
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B 12307.05036 0.535323344 0.16900728 3.167457296 0.001537783 0.002553506
CTACGTGGCCCC _A_Concatenation AGCCAGGTCACC _B 1136.55943 1.901178167 1.359680869 1.39825323 0.162037037 0.20087094
CTACGTGGCCCC _A_Concatenation AGCCATAAACCT _B 12830.46762 3.072085653 0.228314337 13.45550913 2.86E-41 7.59E-40
CTACGTGGCCCC _A_Concatenation AGCCGTGCTTGA _B 12343.5194 0.139613203 0.206013139 0.677690774 0.497967777 0.547540563
CTACGTGGCCCC _A_Concatenation AGGCTTCCCGCT _B 10356.15466 -0.843379431 0.214114828 -3.938911834 8.19E-05 0.000156604
CTACGTGGCCCC _A_Concatenation AGTTAGGCTCTT _B 11822.87803 2.246041671 0.304421612 7.378062469 1.61E-13 6.93E-13
CTACGTGGCCCC _A_Concatenation AGTTGCTTTGCT _B 12953.38622 -1.440908575 0.242460548 -5.942857866 2.80E-09 8.55E-09
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B 1831.525994 0.164864555 0.699763474 0.2356004 0.813742763 0.841083368
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B 2053.123089 0.045546261 0.316816823 0.143762128 0.885688309 0.901889925
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B 1831.525994 0.164864555 0.699763474 0.2356004 0.813742763 0.841083368
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B 2053.123089 0.045546261 0.316816823 0.143762128 0.885688309 0.901889925
CTACGTGGCCCC _A_Concatenation ATCCAATCTGTG _B 3114.842864 2.520287753 0.555501973 4.53695554 5.71E-06 1.25E-05
CTACGTGGCCCC _A_Concatenation ATCTCTCAGTTC _B 13802.3289 3.771621585 0.514650493 7.328510577 2.33E-13 9.86E-13
CTACGTGGCCCC _A_Concatenation ATGACAAAACGA _B 17734.13989 2.411483278 0.392766806 6.139732888 8.27E-10 2.68E-09
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B 950.1151898 2.652771902 0.339605108 7.811342751 5.66E-15 2.72E-14
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B 1050.446502 -0.370077905 0.287481368 -1.287310924 0.197985938 0.241592444
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B 950.1151898 2.652771902 0.339605108 7.811342751 5.66E-15 2.72E-14
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B 1050.446502 -0.370077905 0.287481368 -1.287310924 0.197985938 0.241592444
CTACGTGGCCCC _A_Concatenation ATGTGGCACGAC _B 20487.6608 1.887721083 0.257877736 7.320217371 2.48E-13 1.04E-12
CTACGTGGCCCC _A_Concatenation ATTGTATCTAAC _B 18959.20939 1.502551182 0.273416685 5.49546265 3.90E-08 1.07E-07
CTACGTGGCCCC _A_Concatenation ATTTGTAGACCG _B 19899.28901 0.973082807 0.193742925 5.022546269 5.10E-07 1.24E-06
CTACGTGGCCCC _A_Concatenation CAACGCATGCCT _B 44648.52901 -1.564371853 0.177549846 -8.810888261 1.24E-18 8.02E-18
CTACGTGGCCCC _A_Concatenation CACACCCGGCAG _B 11325.87665 3.493793282 0.680660735 5.132943777 2.85E-07 7.19E-07
CTACGTGGCCCC _A_Concatenation CAGTGACATAGT _B 30955.90969 1.839559611 0.075625436 24.32461492 1.08E-130 4.19E-128
CTACGTGGCCCC _A_Concatenation CATCTCACCTAG _B 9956.643988 1.281083965 0.262161517 4.88662096 1.03E-06 2.41E-06
CTACGTGGCCCC _A_Concatenation CATCTGAGTCGT _B 8.390306463 2.067232846 0.927075807 2.229842296 0.025757914 0.036631389
CTACGTGGCCCC _A_Concatenation CATGCTAACACC _B 7489.117357 0.560918063 0.271833533 2.06346162 0.039068791 0.054177244
CTACGTGGCCCC _A_Concatenation CATTAGTCCCGC _B 1198.593846 1.562291149 0.728816101 2.14360131 0.032064846 0.044783518
CTACGTGGCCCC _A_Concatenation CATTGTTTAAAT _B 8505.425357 1.467152345 0.202181437 7.256612502 3.97E-13 1.65E-12
CTACGTGGCCCC _A_Concatenation CCAGACGGTCTG _B 15173.55912 -1.62092495 0.174065653 -9.31214701 1.25E-20 9.51E-20
CTACGTGGCCCC _A_Concatenation CCCCTTAGGAAT _B 4490.436985 2.348146093 0.281108338 8.353171272 6.65E-17 3.70E-16
CTACGTGGCCCC _A_Concatenation CCCGTAGGGGCT _B 384.699154 6.410387998 0.933137288 6.86971583 6.43E-12 2.44E-11
CTACGTGGCCCC _A_Concatenation CCCTATGTTCTA _B 3021.105096 -0.252476773 0.371238411 -0.680093345 0.496445358 0.546463864
CTACGTGGCCCC _A_Concatenation CCCTGCCTCGGG _B 20559.72151 -0.955639078 0.143053369 -6.680297617 2.38E-11 8.66E-11
CTACGTGGCCCC _A_Concatenation CCGCCCTTATGT _B 923.4517292 2.588344419 0.694558287 3.726605048 0.000194076 0.000361842
CTACGTGGCCCC _A_Concatenation CCTAAATGCAAG _B 10541.40504 -0.569930632 0.252623063 -2.256051465 0.024067408 0.03443672
CTACGTGGCCCC _A_Concatenation CGATGGCGGAAT _B 19259.9293 -0.93143138 0.12738169 -7.312129216 2.63E-13 1.10E-12
CTACGTGGCCCC _A_Concatenation CGCCGGCATACA _B 3676.560558 2.873268172 0.825777767 3.479469035 0.000502408 0.000896665
CTACGTGGCCCC _A_Concatenation CGGCAATTGCTT _B 2816.903216 0.715022949 0.348739526 2.050306593 0.040334522 0.055734108
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B 0.050528823 -1.291682769 3.869099561 -0.333845834 0.738495895 0.774261616
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B 2607.57268 0.01359799 0.227015977 0.059898823 0.952236217 0.959624257
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B 0.050528823 -1.291682769 3.869099561 -0.333845834 0.738495895 0.774261616
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B 2607.57268 0.01359799 0.227015977 0.059898823 0.952236217 0.959624257
CTACGTGGCCCC _A_Concatenation CTCACGACAAGA _B 4727.438679 1.774155744 0.531047059 3.340863515 0.000835183 0.001457207
CTACGTGGCCCC _A_Concatenation CTCAGCGATATA _B 1870.982354 0.826140089 0.415758883 1.987065394 0.046915151 0.06437067
CTACGTGGCCCC _A_Concatenation CTCCATTAATGA _B 24021.04368 -1.408575352 0.116443974 -12.09659296 1.10E-33 1.95E-32
CTACGTGGCCCC _A_Concatenation CTCGCACCGAGG _B 21704.72651 -0.48556436 0.243236705 -1.996262693 0.045905337 0.063133339
CTACGTGGCCCC _A_Concatenation CTCGTTATGGCA _B 20955.78015 3.020424424 0.24421828 12.36772457 3.91E-35 7.49E-34
CTACGTGGCCCC _A_Concatenation CTGCCGGTTGCG _B 4601.952868 0.686146226 0.350866759 1.955574897 0.050515252 0.068745437
CTACGTGGCCCC _A_Concatenation CTGGTAGTTGTT _B 28428.95492 -0.995216768 0.20779412 -4.789436617 1.67E-06 3.87E-06
CTACGTGGCCCC _A_Concatenation EPICXV.47_B 1874.838933 1.745258852 0.595822233 2.929160335 0.003398791 0.005361925
CTACGTGGCCCC _A_Concatenation GACTGTGGGCGG _B 2696.178552 0.151572686 0.471584843 0.321411275 0.74789874 0.782015766
CTACGTGGCCCC _A_Concatenation GAGACAGCTCTC _B 5416.730231 -0.826601369 0.186697653 -4.427486659 9.53E-06 2.03E-05
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B 7353.088648 0.724214392 0.165160539 4.384911784 1.16E-05 2.44E-05
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B 1349.54807 -0.40789886 0.313629932 -1.300573762 0.193404393 0.236743178
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B 7353.088648 0.724214392 0.165160539 4.384911784 1.16E-05 2.44E-05
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B 1349.54807 -0.40789886 0.313629932 -1.300573762 0.193404393 0.236743178
CTACGTGGCCCC _A_Concatenation GAGTGACCTATT _B 50839.64172 -1.648692048 0.206319828 -7.990952996 1.34E-15 6.63E-15
CTACGTGGCCCC _A_Concatenation GATCGCATGTTC _B 10539.06178 -1.36682814 0.210577157 -6.490866153 8.53E-11 3.00E-10
CTACGTGGCCCC _A_Concatenation GATGGAGCTACA _B 3892.323996 1.652497221 0.307243551 5.378460231 7.51E-08 2.01E-07
CTACGTGGCCCC _A_Concatenation GCAACGAGGGTC _B 4001.257961 3.832368817 0.322159989 11.89585594 1.24E-32 2.17E-31
CTACGTGGCCCC _A_Concatenation GCAGATTCGAAT _B 17457.31343 -1.618131233 0.173888316 -9.305577689 1.33E-20 1.01E-19
CTACGTGGCCCC _A_Concatenation GCGAGGCAGACC _B 12287.75963 -1.483345806 0.255087307 -5.815051412 6.06E-09 1.77E-08
CTACGTGGCCCC _A_Concatenation GGAGGCACATCG _B 7113.897887 -0.068571362 0.535643644 -0.128016756 0.898135713 0.912181275
CTACGTGGCCCC _A_Concatenation GGCACAGCTCCA _B 30174.64994 0.100923308 0.111749975 0.90311706 0.366463771 0.419996223
CTACGTGGCCCC _A_Concatenation GGCACTGTCGAG _B 12685.3025 -0.944676466 0.250777254 -3.766994212 0.000165225 0.000310029
CTACGTGGCCCC _A_Concatenation GGCCTTTCGGGC _B 0
CTACGTGGCCCC _A_Concatenation GGGAGTATGTCG _B 12443.62218 -1.536299005 0.228968755 -6.709644754 1.95E-11 7.19E-11
CTACGTGGCCCC _A_Concatenation GGTGTCTCGATT _B 2190.706918 3.737178336 0.321297447 11.63152205 2.85E-31 4.69E-30
CTACGTGGCCCC _A_Concatenation GTACATACCGAA _B 8574.331557 0.843789417 0.289412463 2.915525499 0.003550901 0.005594344
CTACGTGGCCCC _A_Concatenation GTCACCCTCCTT _B 0
CTACGTGGCCCC _A_Concatenation GTTCCACGCCTG _B 7600.094339 1.963358831 0.253415918 7.747575002 9.37E-15 4.45E-14
CTACGTGGCCCC _A_Concatenation Stop_1_B 5522.844032 2.67554007 0.870832724 3.072392662 0.002123502 0.003423964
CTACGTGGCCCC _A_Concatenation Stop_2_B 1681.917788 1.435028374 1.246044484 1.151667049 0.249457918 0.300635367
CTACGTGGCCCC _A_Concatenation TAAAATTTATCA _B 8180.859527 1.935216539 0.380137656 5.090830933 3.56E-07 8.85E-07
CTACGTGGCCCC _A_Concatenation TAAGATTCAACG _B 3954.42675 2.273555948 0.684904609 3.319522044 0.000901717 0.001561877
CTACGTGGCCCC _A_Concatenation TACATGAGGCAC _B 10534.22985 -1.05049681 0.196904025 -5.335070268 9.55E-08 2.53E-07
CTACGTGGCCCC _A_Concatenation TACTTCTAGCGG _B 4153.796307 2.155327639 0.4861279 4.433663729 9.26E-06 1.98E-05
CTACGTGGCCCC _A_Concatenation TATCGCAAGAAC _B 15884.14259 -1.241494587 0.211192248 -5.87850452 4.14E-09 1.24E-08
CTACGTGGCCCC _A_Concatenation TATCTATCGTGT _B 398.3921212 -0.56349983 0.521921 -1.079664987 0.280291391 0.332988451
CTACGTGGCCCC _A_Concatenation TATTAAAAGTGC _B 8918.26617 -0.65918521 0.236435031 -2.788018368 0.005303154 0.008221998
CTACGTGGCCCC _A_Concatenation TCAGCCGTAGGC _B 26006.54953 -1.095796534 0.1151699 -9.514608737 1.82E-21 1.52E-20
CTACGTGGCCCC _A_Concatenation TCTATCACAACT _B 283.6516595 1.363817389 1.845951365 0.73881545 0.460019069 0.512154563
CTACGTGGCCCC _A_Concatenation TCTGTACCAACG _B 22581.14821 2.378339087 0.176941006 13.44142404 3.46E-41 8.98E-40
CTACGTGGCCCC _A_Concatenation TCTTAGAGGGTG _B 8432.5081 3.317190139 0.298511555 11.11243464 1.09E-28 1.50E-27
CTACGTGGCCCC _A_Concatenation TCTTGGTTTTGA _B 18540.96686 1.866446198 0.182522584 10.22583704 1.52E-24 1.53E-23
CTACGTGGCCCC _A_Concatenation TGATGAGCGCCC _B 40064.30796 0.910760623 0.229646806 3.965918968 7.31E-05 0.000141506
CTACGTGGCCCC _A_Concatenation TGCCAGACGGTC _B 11614.94098 1.512919413 0.290403076 5.209722402 1.89E-07 4.85E-07
CTACGTGGCCCC _A_Concatenation TGGCCCCCAGTT _B 9562.072512 -1.711727411 0.224103565 -7.638108802 2.20E-14 1.02E-13
CTACGTGGCCCC _A_Concatenation TTAAATGAGGGC _B 4456.141083 -0.806234392 0.207263217 -3.889905813 0.000100283 0.00019093
CTACGTGGCCCC _A_Concatenation TTAGTGCCATAC _B 0
CTACGTGGCCCC _A_Concatenation TTCTAATTTCCT _B 17717.96749 -0.207761382 0.202788232 -1.02452386 0.305587907 0.358307185
CTACGTGGCCCC _A_Concatenation TTTAACGGATTG _B 22952.00753 0.913913801 0.141577678 6.455211124 1.08E-10 3.75E-10
CTACGTGGCCCC _A_Concatenation Joint_ 1_B 9569.397796 2.449199321 0.343065543 7.139158598 9.39E-13 3.81E-12
CTACGTGGCCCC _A_Concatenation Joint_ 2_B 18612.82245 -0.64014566 0.084456667 -7.579575185 3.47E-14 1.58E-13
CTACGTGGCCCC _A_Concatenation Joint_ 3_B 23642.08197 -1.240754822 0.091948981 -13.49394853 1.70E-41 4.62E-40
CTCACGACAAGA _A_Concatenation ACTTGATGGTTT _B 6395.112661 3.880306999 0.524264206 7.40143415 1.35E-13 5.85E-13
CTCACGACAAGA _A_Concatenation AGAGTGCGCTCT _B 15677.68144 -0.741292505 0.089003166 -8.328832966 8.16E-17 4.50E-16
CTCACGACAAGA _A_Concatenation ATAACTCCACGC _B 2044.350463 -0.663463213 0.466478541 -1.422280244 0.154944912 0.193102987
CTCACGACAAGA _A_Concatenation ATGACCCCTTGT _B 640.6129647 1.368113686 0.86785586 1.576429622 0.114926826 0.147636769
CTCACGACAAGA _A_Concatenation CTACGTGGCCCC _B 1663.271467 2.300705548 1.53796282 1.495943542 0.134668368 0.1701917
CTCACGACAAGA _A_Concatenation GAGATTGTGTCC _B 1342.98382 5.14370171 0.350586819 14.67169165 9.79E-49 3.95E-47
CTCAGCGATATA _A_Concatenation ACTTGATGGTTT _B 2788.747391 2.628395138 0.338157994 7.772683719 7.68E-15 3.67E-14
CTCAGCGATATA _A_Concatenation AGAGTGCGCTCT _B 11567.21068 2.819340595 0.416066622 6.776175841 1.23E-11 4.62E-11
CTCAGCGATATA _A_Concatenation ATAACTCCACGC _B 1817.295306 -1.203752825 0.493744811 -2.438006027 0.014768528 0.021743588
CTCAGCGATATA _A_Concatenation ATGACCCCTTGT _B 2255.304424 0.644185286 0.662157851 0.972857584 0.330624089 0.385164718
CTCAGCGATATA _A_Concatenation CTACGTGGCCCC _B 2374.13525 0.237574093 0.392847196 0.604749366 0.545345542 0.594691174
CTCAGCGATATA _A_Concatenation GAGATTGTGTCC _B 4171.555389 -1.558281739 0.226563311 -6.87790858 6.07E-12 2.31E-11
CTCCATTAATGA _A_Concatenation ACTTGATGGTTT _B 10343.15364 -1.536355055 0.191592886 -8.018852292 1.07E-15 5.38E-15
CTCCATTAATGA _A_Concatenation AGAGTGCGCTCT _B 5399.455078 -1.490845992 0.220464101 -6.762307272 1.36E-11 5.04E-11
CTCCATTAATGA _A_Concatenation ATAACTCCACGC _B 6252.989448 1.865511556 0.304377256 6.128945309 8.85E-10 2.84E-09
CTCCATTAATGA _A_Concatenation ATGACCCCTTGT _B 564.3668627 0.458413688 0.883539221 0.518837962 0.603873743 0.647047118
CTCCATTAATGA _A_Concatenation CTACGTGGCCCC _B 6784.624693 -0.348081573 0.224431412 -1.550948551 0.120914014 0.154479216
CTCCATTAATGA _A_Concatenation GAGATTGTGTCC _B 5309.513453 0.372147324 0.33420378 1.113534154 0.265479088 0.317002098
CTCGCACCGAGG _A_Concatenation ACTTGATGGTTT _B 9109.229052 -2.145597155 0.200223369 -10.71601765 8.56E-27 1.08E-25
CTCGCACCGAGG _A_Concatenation AGAGTGCGCTCT _B 8108.284261 -2.084395884 0.239785435 -8.692754369 3.54E-18 2.19E-17
CTCGCACCGAGG _A_Concatenation ATAACTCCACGC _B 14278.85438 -1.265497881 0.132138307 -9.577070461 9.98E-22 8.58E-21
CTCGCACCGAGG _A_Concatenation ATGACCCCTTGT _B 2109.25225 -0.937628791 0.5936616 -1.579399428 0.114244468 0.146921654
CTCGCACCGAGG _A_Concatenation CTACGTGGCCCC _B 8371.120298 -0.763457493 0.248207208 -3.075887677 0.002098769 0.003393445
CTCGCACCGAGG _A_Concatenation GAGATTGTGTCC _B 11869.26941 -1.878171316 0.179813271 -10.44512067 1.54E-25 1.77E-24
CTCGTTATGGCA _A_Concatenation ACTTGATGGTTT _B 13379.89754 2.338265161 0.33631153 6.952676183 3.58E-12 1.38E-11
CTCGTTATGGCA _A_Concatenation AGAGTGCGCTCT _B 22164.57054 -1.335231707 0.071646755 -18.63631792 1.63E-77 1.91E-75
CTCGTTATGGCA _A_Concatenation ATAACTCCACGC _B 7477.211868 4.108411187 0.732055016 5.612161786 2.00E-08 5.61E-08
CTCGTTATGGCA _A_Concatenation ATGACCCCTTGT _B 2978.925046 0.087177747 0.598411611 0.145681912 0.884172501 0.901131346
CTCGTTATGGCA _A_Concatenation CTACGTGGCCCC _B 10865.87408 1.089079629 0.170947862 6.370829176 1.88E-10 6.39E-10
CTCGTTATGGCA _A_Concatenation GAGATTGTGTCC _B 9939.231949 2.433673767 0.332725829 7.314351806 2.59E-13 1.09E-12
CTGCCGGTTGCG _A_Concatenation ACTTGATGGTTT _B 2116.823763 -1.419560262 0.399411133 -3.554132931 0.000379227 0.000689451
CTGCCGGTTGCG _A_Concatenation AGAGTGCGCTCT _B 1268.373288 0.008975367 0.59913725 0.014980485 0.988047749 0.98902379
CTGCCGGTTGCG _A_Concatenation ATAACTCCACGC _B 1674.771176 -0.455347032 0.292928613 -1.554464166 0.120073738 0.153573522
CTGCCGGTTGCG _A_Concatenation ATGACCCCTTGT _B 7237.238196 0.045746629 0.675071288 0.067765626 0.945972207 0.955783501
CTGCCGGTTGCG _A_Concatenation CTACGTGGCCCC _B 1825.147016 -0.670191524 0.206441473 -3.246399649 0.001168746 0.002000387
CTGCCGGTTGCG _A_Concatenation GAGATTGTGTCC _B 1348.103589 -0.124497455 0.636114377 -0.195715518 0.844832837 0.869374636
CTGGTAGTTGTT _A_Concatenation ACTTGATGGTTT _B 20496.52659 -1.922070949 0.184084232 -10.44125796 1.61E-25 1.82E-24
CTGGTAGTTGTT _A_Concatenation AGAGTGCGCTCT _B 18227.85153 -2.052838972 0.111296972 -18.4446974 5.75E-76 5.60E-74
CTGGTAGTTGTT _A_Concatenation ATAACTCCACGC _B 19402.86345 -0.721009974 0.146392835 -4.925172558 8.43E-07 2.00E-06
CTGGTAGTTGTT _A_Concatenation ATGACCCCTTGT _B 2795.615337 -0.236062309 0.453140914 -0.520946799 0.602403833 0.646064294
CTGGTAGTTGTT _A_Concatenation CTACGTGGCCCC _B 18345.75204 -1.220000705 0.206146695 -5.918119165 3.26E-09 9.79E-09
CTGGTAGTTGTT _A_Concatenation GAGATTGTGTCC _B 34006.24859 -1.317749147 0.099837792 -13.19890108 8.90E-40 2.12E-38
EPICXV.47-A\u splice_ ACTTGATGGTTT _b 3370.110148 1.522363448 0.352974749 4.312952846 1.61E-05 3.33E-05
EPICXV.47-A\u splice_ AGAGTGCGCTCT _b 7256.173245 -0.453222459 0.249387941 -1.817339113 0.069165204 0.09220222
EPICXV.47-A\u splice_ ATAACTCCACGC _b 1219.294049 0.248513604 0.658994108 0.377110511 0.706091481 0.743622469
EPICXV.47-A\u splice_ ATGACCCCTTGT _b 131.4668592 4.428082018 1.333109449 3.321619257 0.000894967 0.001554557
EPICXV.47-A\u splice_ CTACGTGGCCCC _b 1341.717945 2.770964438 0.566320975 4.892922139 9.93E-07 2.34E-06
EPICXV.47-A\u splice_ GAGATTGTGTCC _b 529.1405845 0.210914372 0.689788465 0.305766743 0.759782252 0.792315301
GACTGTGGGCGG _A_Concatenation ACTTGATGGTTT _B 10126.88727 -1.334994621 0.23778751 -5.614233571 1.97E-08 5.56E-08
GACTGTGGGCGG _A_Concatenation AGAGTGCGCTCT _B 7684.439835 -1.434278817 0.165398846 -8.671637361 4.26E-18 2.62E-17
GACTGTGGGCGG _A_Concatenation ATAACTCCACGC _B 11520.48725 -1.662130684 0.092569715 -17.95544781 4.35E-72 3.91E-70
GACTGTGGGCGG _A_Concatenation ATGACCCCTTGT _B 1514.374437 -1.725293087 0.47455095 -3.635632989 0.000277299 0.000510492
GACTGTGGGCGG _A_Concatenation CTACGTGGCCCC _B 13332.76266 -1.807085252 0.217507876 -8.308137109 9.72E-17 5.34E-16
GACTGTGGGCGG _A_Concatenation GAGATTGTGTCC _B 11917.31455 -1.112830768 0.1975679 -5.632649682 1.77E-08 5.01E-08
GAGACAGCTCTC _A_Concatenation ACTTGATGGTTT _B 5809.274672 -1.007916626 0.310419338 -3.246951799 0.001166481 0.001999438
GAGACAGCTCTC _A_Concatenation AGAGTGCGCTCT _B 6879.124207 -1.695999894 0.253139974 -6.699850162 2.09E-11 7.67E-11
GAGACAGCTCTC _A_Concatenation ATAACTCCACGC _B 6869.928675 -2.045937039 0.277694274 -7.367588149 1.74E-13 7.41E-13
GAGACAGCTCTC _A_Concatenation ATGACCCCTTGT _B 3300.979054 -1.023436666 0.456447308 -2.242179213 0.024949792 0.035612096
GAGACAGCTCTC _A_Concatenation CTACGTGGCCCC _B 14935.04721 -1.638631431 0.232499192 -7.047901623 1.82E-12 7.15E-12
GAGACAGCTCTC _A_Concatenation GAGATTGTGTCC _B 7190.891971 -1.525652825 0.204313542 -7.467213448 8.19E-14 3.60E-13
GAGATTGTGTCC _A_Concatenation AAAACAAGCATT _B 5336.675738 0.40421582 0.384851971 1.050315059 0.293573284 0.345606413
GAGATTGTGTCC _A_Concatenation AAACTTTGAGCG _B 1459.363758 3.274600662 1.024407121 3.196581316 0.001390666 0.002329067
GAGATTGTGTCC _A_Concatenation AAGACCGGTGCC _B 4279.774372 2.851095796 0.411488986 6.928729305 4.25E-12 1.62E-11
GAGATTGTGTCC _A_Concatenation AAGCAACGATAT _B 9687.727397 -0.732791936 0.179614257 -4.079809421 4.51E-05 8.87E-05
GAGATTGTGTCC _A_Concatenation AAGGGTAAATTG _B 5539.172828 0.784199956 0.247823194 3.16435255 0.001554284 0.002577245
GAGATTGTGTCC _A_Concatenation AATCAACGAGCA _B 2866.516386 2.370936903 0.741661122 3.196792759 0.001389647 0.002329067
GAGATTGTGTCC _A_Concatenation AATCGAGACCAG _B 2471.244011 0.631123237 0.19546636 3.228807434 0.001243076 0.002118302
GAGATTGTGTCC _A_Concatenation AATCGATCCTAC _B 7735.003056 0.436237925 0.300115623 1.453566195 0.146066615 0.182817851
GAGATTGTGTCC _A_Concatenation AATGGCACTAGC _B 12169.38318 0.290513971 0.197503802 1.470928498 0.141310455 0.177625722
GAGATTGTGTCC _A_Concatenation ACAACGTGCATA _B 1816.591048 2.051709993 0.777095393 2.640229257 0.008284996 0.012610886
GAGATTGTGTCC _A_Concatenation ACCAGTATTTAA _B 2318.362759 2.530517049 0.306607549 8.253277041 1.54E-16 8.34E-16
GAGATTGTGTCC _A_Concatenation ACGACTCACCGC _B 2177.911514 0.385890205 0.653687308 0.590328435 0.55497048 0.604190777
GAGATTGTGTCC _A_Concatenation ACGGCGTAAGTA _B 1502.255208 -0.869935424 0.468775919 -1.855759624 0.063487828 0.085209266
GAGATTGTGTCC _A_Concatenation ACTCGTGTTGTC _B 6640.265781 0.10117339 0.207713557 0.487081306 0.626200714 0.666692745
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B 4095.253834 1.338298146 0.26750419 5.00290536 5.65E-07 1.36E-06
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B 2062.528412 2.117670792 0.423619812 4.99898903 5.76E-07 1.39E-06
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B 4095.253834 1.338298146 0.26750419 5.00290536 5.65E-07 1.36E-06
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B 2062.528412 2.117670792 0.423619812 4.99898903 5.76E-07 1.39E-06
GAGATTGTGTCC _A_Concatenation ACTTTAGTAACA _B 1655.016432 1.199127769 1.052733662 1.139060916 0.254677747 0.30597974
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B 6624.693834 1.33366152 0.404396008 3.297909704 0.000974075 0.001679488
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B 10297.21521 2.082788864 0.238568438 8.730362149 2.54E-18 1.60E-17
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B 6624.693834 1.33366152 0.404396008 3.297909704 0.000974075 0.001679488
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B 10297.21521 2.082788864 0.238568438 8.730362149 2.54E-18 1.60E-17
GAGATTGTGTCC _A_Concatenation AGCCAGGTCACC _B 244.4812552 0.441155599 2.01336616 0.219113447 0.826561676 0.853578268
GAGATTGTGTCC _A_Concatenation AGCCATAAACCT _B 7882.770077 2.682125058 0.394753816 6.79442465 1.09E-11 4.09E-11
GAGATTGTGTCC _A_Concatenation AGCCGTGCTTGA _B 6566.441048 2.21670191 0.223560652 9.915438572 3.57E-23 3.34E-22
GAGATTGTGTCC _A_Concatenation AGGCTTCCCGCT _B 8692.300135 -1.319663199 0.192277876 -6.863312746 6.73E-12 2.55E-11
GAGATTGTGTCC _A_Concatenation AGTTAGGCTCTT _B 4833.494264 3.053940999 0.684388252 4.462293135 8.11E-06 1.74E-05
GAGATTGTGTCC _A_Concatenation AGTTGCTTTGCT _B 12057.46837 -1.812094514 0.181276682 -9.996291282 1.58E-23 1.52E-22
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B 662.3080557 9.181911569 1.246691289 7.365024247 1.77E-13 7.53E-13
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B 2560.862639 0.033208854 0.433176783 0.076663513 0.938891232 0.950271731
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B 662.3080557 9.181911569 1.246691289 7.365024247 1.77E-13 7.53E-13
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B 2560.862639 0.033208854 0.433176783 0.076663513 0.938891232 0.950271731
GAGATTGTGTCC _A_Concatenation ATCCAATCTGTG _B 807.2746093 3.507832961 1.287492344 2.724546655 0.006438983 0.009943423
GAGATTGTGTCC _A_Concatenation ATCTCTCAGTTC _B 6245.453834 3.072569727 0.669890927 4.586671656 4.50E-06 1.01E-05
GAGATTGTGTCC _A_Concatenation ATGACAAAACGA _B 6512.791531 2.46608327 0.581322977 4.24219129 2.21E-05 4.47E-05
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B 808.5837658 0.759510522 0.837274784 0.907122173 0.364342194 0.418555889
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B 574.4835973 1.901952006 0.420635581 4.521614649 6.14E-06 1.34E-05
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B 808.5837658 0.759510522 0.837274784 0.907122173 0.364342194 0.418555889
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B 574.4835973 1.901952006 0.420635581 4.521614649 6.14E-06 1.34E-05
GAGATTGTGTCC _A_Concatenation ATGTGGCACGAC _B 12919.07171 1.774827467 0.122101684 14.53565106 7.20E-48 2.63E-46
GAGATTGTGTCC _A_Concatenation ATTGTATCTAAC _B 11238.66518 2.430304942 0.211274208 11.5030839 1.27E-30 2.01E-29
GAGATTGTGTCC _A_Concatenation ATTTGTAGACCG _B 12084.65494 2.234113777 0.293388344 7.614868899 2.64E-14 1.21E-13
GAGATTGTGTCC _A_Concatenation CAACGCATGCCT _B 32322.32251 -1.668511748 0.101017217 -16.51710265 2.76E-61 1.62E-59
GAGATTGTGTCC _A_Concatenation CACACCCGGCAG _B 6846.836394 1.978083324 0.383232672 5.161572771 2.45E-07 6.18E-07
GAGATTGTGTCC _A_Concatenation CAGTGACATAGT _B 20915.68626 2.199746831 0.195892237 11.22937216 2.93E-29 4.22E-28
GAGATTGTGTCC _A_Concatenation CATCTCACCTAG _B 3917.193211 1.824174603 0.670392345 2.721055237 0.006507389 0.010035801
GAGATTGTGTCC _A_Concatenation CATCTGAGTCGT _B 3.550869394 -0.899600572 0.672273122 -1.338147461 0.180848372 0.223479648
GAGATTGTGTCC _A_Concatenation CATGCTAACACC _B 2085.124923 1.925574775 0.334277327 5.760410943 8.39E-09 2.41E-08
GAGATTGTGTCC _A_Concatenation CATTAGTCCCGC _B 1330.338417 0.718933463 0.451892924 1.590937644 0.11162361 0.143867695
GAGATTGTGTCC _A_Concatenation CATTGTTTAAAT _B 5566.683453 1.864175902 0.347928416 5.35792943 8.42E-08 2.24E-07
GAGATTGTGTCC _A_Concatenation CCAGACGGTCTG _B 9309.548485 -1.553325001 0.150987049 -10.28780293 8.00E-25 8.35E-24
GAGATTGTGTCC _A_Concatenation CCCCTTAGGAAT _B 1646.590941 3.114869603 0.887547862 3.509522964 0.000448911 0.000808594
GAGATTGTGTCC _A_Concatenation CCCGTAGGGGCT _B 114.324098 0.281645139 1.501960445 0.187518346 0.851254241 0.874443064
GAGATTGTGTCC _A_Concatenation CCCTATGTTCTA _B 939.1830871 0.601977151 0.79321458 0.758908328 0.447907397 0.501536156
GAGATTGTGTCC _A_Concatenation CCCTGCCTCGGG _B 21776.99227 -2.189884868 0.134972243 -16.22470536 3.37E-59 1.79E-57
GAGATTGTGTCC _A_Concatenation CCGCCCTTATGT _B 2219.489338 2.34397758 0.545325538 4.298308843 1.72E-05 3.52E-05
GAGATTGTGTCC _A_Concatenation CCTAAATGCAAG _B 9186.061817 -0.501834755 0.275741955 -1.819943416 0.068767622 0.091978662
GAGATTGTGTCC _A_Concatenation CGATGGCGGAAT _B 14164.33153 -1.334820415 0.278967887 -4.784853296 1.71E-06 3.95E-06
GAGATTGTGTCC _A_Concatenation CGCCGGCATACA _B 3010.214688 1.447480652 0.325147731 4.451763036 8.52E-06 1.82E-05
GAGATTGTGTCC _A_Concatenation CGGCAATTGCTT _B 7661.268671 -1.242791132 0.245722055 -5.057710962 4.24E-07 1.05E-06
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B 0.057653027 -1.291685486 2.119037916 -0.609562234 0.542151835 0.592867629
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B 1730.527538 1.039453407 0.792476787 1.31165155 0.189637731 0.232863979
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B 0.057653027 -1.291685486 2.119037916 -0.609562234 0.542151835 0.592867629
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B 1730.527538 1.039453407 0.792476787 1.31165155 0.189637731 0.232863979
GAGATTGTGTCC _A_Concatenation CTCACGACAAGA _B 1712.701439 1.628481425 0.516733271 3.151493269 0.001624379 0.002685854
GAGATTGTGTCC _A_Concatenation CTCAGCGATATA _B 3342.705632 -0.849998546 0.511575254 -1.661531787 0.09660669 0.126464973
GAGATTGTGTCC _A_Concatenation CTCCATTAATGA _B 18014.6158 -0.801891214 0.08515548 -9.416789284 4.65E-21 3.65E-20
GAGATTGTGTCC _A_Concatenation CTCGCACCGAGG _B 17528.86335 -0.699910187 0.142989767 -4.894827104 9.84E-07 2.32E-06
GAGATTGTGTCC _A_Concatenation CTCGTTATGGCA _B 8403.138644 2.491850706 0.416343169 5.985088479 2.16E-09 6.71E-09
GAGATTGTGTCC _A_Concatenation CTGCCGGTTGCG _B 3604.594588 -0.040901727 0.308803908 -0.132452104 0.894626705 0.910007777
GAGATTGTGTCC _A_Concatenation CTGGTAGTTGTT _B 26536.15736 -0.584990614 0.128042923 -4.56870711 4.91E-06 1.09E-05
GAGATTGTGTCC _A_Concatenation EPICXV.47_B 1904.677425 -1.066978728 0.919621999 -1.160236193 0.245952655 0.297330562
GAGATTGTGTCC _A_Concatenation GACTGTGGGCGG _B 4126.808083 -0.476568317 0.262239943 -1.817298737 0.069171383 0.09220222
GAGATTGTGTCC _A_Concatenation GAGACAGCTCTC _B 34038.5331 -3.965311735 0.423368572 -9.36609848 7.53E-21 5.80E-20
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B 2980.186823 0.589599728 0.637115163 0.925420963 0.354746979 0.41018716
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B 1929.576791 0.327021536 0.657788863 0.497152741 0.619081345 0.660918806
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B 2980.186823 0.589599728 0.637115163 0.925420963 0.354746979 0.41018716
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B 1929.576791 0.327021536 0.657788863 0.497152741 0.619081345 0.660918806
GAGATTGTGTCC _A_Concatenation GAGTGACCTATT _B 29879.64409 -1.696783117 0.101623892 -16.69669485 1.39E-62 9.00E-61
GAGATTGTGTCC _A_Concatenation GATCGCATGTTC _B 8667.101634 -0.116139831 0.238166236 -0.48764188 0.625803526 0.666692745
GAGATTGTGTCC _A_Concatenation GATGGAGCTACA _B 1727.264409 0.744281536 0.314497427 2.366574325 0.017953569 0.026104132
GAGATTGTGTCC _A_Concatenation GCAACGAGGGTC _B 2910.042837 2.498823405 0.589846464 4.236396344 2.27E-05 4.58E-05
GAGATTGTGTCC _A_Concatenation GCAGATTCGAAT _B 9927.659867 -1.511057001 0.139599281 -10.82424629 2.64E-27 3.43E-26
GAGATTGTGTCC _A_Concatenation GCGAGGCAGACC _B 5389.369696 -0.803765379 0.317934658 -2.528083549 0.011468706 0.017074953
GAGATTGTGTCC _A_Concatenation GGAGGCACATCG _B 11011.52144 -1.926106727 0.580243561 -3.319479711 0.000901854 0.001561877
GAGATTGTGTCC _A_Concatenation GGCACAGCTCCA _B 18723.48113 0.766734611 0.308720878 2.483585226 0.013006721 0.019271047
GAGATTGTGTCC _A_Concatenation GGCACTGTCGAG _B 6256.479428 1.107213689 0.438029097 2.527717215 0.011480678 0.017074953
GAGATTGTGTCC _A_Concatenation GGCCTTTCGGGC _B 0
GAGATTGTGTCC _A_Concatenation GGGAGTATGTCG _B 7172.554958 -1.076649573 0.152398239 -7.064711364 1.61E-12 6.36E-12
GAGATTGTGTCC _A_Concatenation GGTGTCTCGATT _B 3242.926378 -1.276083839 0.674481515 -1.89194783 0.058497932 0.078965453
GAGATTGTGTCC _A_Concatenation GTACATACCGAA _B 3800.126493 1.382760014 0.31239479 4.426322267 9.59E-06 2.04E-05
GAGATTGTGTCC _A_Concatenation GTCACCCTCCTT _B 0
GAGATTGTGTCC _A_Concatenation GTTCCACGCCTG _B 5370.214087 3.661429717 0.547584871 6.686506344 2.29E-11 8.35E-11
GAGATTGTGTCC _A_Concatenation Stop_1_B 2185.779945 2.004603591 0.467876258 4.28447385 1.83E-05 3.73E-05
GAGATTGTGTCC _A_Concatenation Stop_2_B 588.0161036 3.23800693 0.924983126 3.50061189 0.000464191 0.000832269
GAGATTGTGTCC _A_Concatenation TAAAATTTATCA _B 3292.626297 2.46324368 0.498016272 4.946110834 7.57E-07 1.80E-06
GAGATTGTGTCC _A_Concatenation TAAGATTCAACG _B 2214.537491 3.701330118 1.518623491 2.437292811 0.014797692 0.021759121
GAGATTGTGTCC _A_Concatenation TACATGAGGCAC _B 10300.46707 -0.506762525 0.182672861 -2.774153329 0.005534559 0.008558067
GAGATTGTGTCC _A_Concatenation TACTTCTAGCGG _B 2633.119212 3.23757946 1.044560922 3.099464466 0.001938708 0.003152086
GAGATTGTGTCC _A_Concatenation TATCGCAAGAAC _B 8427.543488 -1.042180713 0.227350372 -4.584029057 4.56E-06 1.02E-05
GAGATTGTGTCC _A_Concatenation TATCTATCGTGT _B 681.5148978 -3.253391393 0.923010978 -3.524759155 0.000423868 0.000765845
GAGATTGTGTCC _A_Concatenation TATTAAAAGTGC _B 6420.025151 -2.017700031 0.371097623 -5.437113863 5.42E-08 1.47E-07
GAGATTGTGTCC _A_Concatenation TCAGCCGTAGGC _B 15038.04828 0.571697876 0.245394253 2.329711759 0.019821391 0.028677235
GAGATTGTGTCC _A_Concatenation TCTATCACAACT _B 305.4227279 7.245923899 0.846358948 8.561289414 1.12E-17 6.69E-17
GAGATTGTGTCC _A_Concatenation TCTGTACCAACG _B 11200.34075 1.762086203 0.402249802 4.380576933 1.18E-05 2.48E-05
GAGATTGTGTCC _A_Concatenation TCTTAGAGGGTG _B 4569.884844 2.022091562 0.503627264 4.015055794 5.94E-05 0.00011618
GAGATTGTGTCC _A_Concatenation TCTTGGTTTTGA _B 11834.91456 1.984629396 0.117081562 16.95082781 1.90E-64 1.39E-62
GAGATTGTGTCC _A_Concatenation TGATGAGCGCCC _B 21750.91788 1.713243827 0.223476867 7.666313964 1.77E-14 8.28E-14
GAGATTGTGTCC _A_Concatenation TGCCAGACGGTC _B 5098.935443 1.602001775 0.404186805 3.963518243 7.39E-05 0.000142466
GAGATTGTGTCC _A_Concatenation TGGCCCCCAGTT _B 9713.291081 -1.717455543 0.237420085 -7.233825816 4.70E-13 1.94E-12
GAGATTGTGTCC _A_Concatenation TTAAATGAGGGC _B 6260.971569 -1.01361349 0.201210809 -5.037569767 4.71E-07 1.16E-06
GAGATTGTGTCC _A_Concatenation TTAGTGCCATAC _B 0
GAGATTGTGTCC _A_Concatenation TTCTAATTTCCT _B 3938.013777 0.939190431 0.270525188 3.471730069 0.000517116 0.000921507
GAGATTGTGTCC _A_Concatenation TTTAACGGATTG _B 8439.47047 1.401241181 0.327326145 4.280871554 1.86E-05 3.78E-05
GAGATTGTGTCC _A_Concatenation Joint_ 1_B 7033.564145 1.60921915 0.407312008 3.950826682 7.79E-05 0.000149497
GAGATTGTGTCC _A_Concatenation Joint_ 2_B 13580.00704 -0.191414085 0.182603324 -1.048250824 0.29452306 0.346375711
GAGATTGTGTCC _A_Concatenation Joint_ 3_B 20988.50251 -0.968619845 0.145282366 -6.667153568 2.61E-11 9.44E-11
GAGTGACCTATT _A_Concatenation ACTTGATGGTTT _B 4697.773394 -1.784620704 0.299626504 -5.956151004 2.58E-09 7.92E-09
GAGTGACCTATT _A_Concatenation AGAGTGCGCTCT _B 2203.003598 -0.004534526 0.37040494 -0.012242079 0.990232478 0.990232478
GAGTGACCTATT _A_Concatenation ATAACTCCACGC _B 4738.45974 -1.653219507 0.251002117 -6.586476357 4.50E-11 1.61E-10
GAGTGACCTATT _A_Concatenation ATGACCCCTTGT _B 210.4204591 0.520192554 1.091455008 0.47660467 0.633643657 0.673390396
GAGTGACCTATT _A_Concatenation CTACGTGGCCCC _B 4904.828969 -1.407171281 0.242143112 -5.811320696 6.20E-09 1.80E-08
GAGTGACCTATT _A_Concatenation GAGATTGTGTCC _B 3175.115736 -0.890295066 0.457619496 -1.945491994 0.051715792 0.070297397
GATCGCATGTTC _A_Concatenation ACTTGATGGTTT _B 21926.63182 -1.1643545 0.123435064 -9.432931489 3.99E-21 3.15E-20
GATCGCATGTTC _A_Concatenation AGAGTGCGCTCT _B 17309.22293 -1.202089842 0.194290174 -6.187085095 6.13E-10 2.02E-09
GATCGCATGTTC _A_Concatenation ATAACTCCACGC _B 9774.241692 1.822935539 0.285335583 6.388742409 1.67E-10 5.70E-10
GATCGCATGTTC _A_Concatenation ATGACCCCTTGT _B 1303.120528 -0.120356953 0.310037765 -0.388200945 0.697867334 0.735623908
GATCGCATGTTC _A_Concatenation CTACGTGGCCCC _B 14679.17258 -0.674627074 0.215892587 -3.124827417 0.001779094 0.002908757
GATCGCATGTTC _A_Concatenation GAGATTGTGTCC _B 16097.81153 0.046021931 0.156391263 0.294274312 0.7685483 0.799317582
GATGGAGCTACA _A_Concatenation ACTTGATGGTTT _B 4553.568887 2.195814037 0.233215443 9.415388661 4.71E-21 3.67E-20
GATGGAGCTACA _A_Concatenation AGAGTGCGCTCT _B 8291.856374 -0.948994035 0.169952594 -5.583874967 2.35E-08 6.58E-08
GATGGAGCTACA _A_Concatenation ATAACTCCACGC _B 2064.702728 3.854252837 0.756647404 5.09385589 3.51E-07 8.73E-07
GATGGAGCTACA _A_Concatenation ATGACCCCTTGT _B 827.6917824 -1.444321425 0.875735192 -1.64926731 0.099092884 0.129285247
GATGGAGCTACA _A_Concatenation CTACGTGGCCCC _B 1274.880764 0.799181055 0.724730237 1.102729008 0.270144866 0.322244233
GATGGAGCTACA _A_Concatenation GAGATTGTGTCC _B 708.2943437 0.919186736 0.751331732 1.223409976 0.221174887 0.26848748
GCAACGAGGGTC _A_Concatenation ACTTGATGGTTT _B 13269.23238 2.211123582 0.324280159 6.818559566 9.20E-12 3.47E-11
GCAACGAGGGTC _A_Concatenation AGAGTGCGCTCT _B 20365.77872 -0.917134773 0.144835344 -6.332258048 2.42E-10 8.16E-10
GCAACGAGGGTC _A_Concatenation ATAACTCCACGC _B 4475.965961 -0.037496669 0.284700847 -0.131705505 0.895217231 0.910007777
GCAACGAGGGTC _A_Concatenation ATGACCCCTTGT _B 1475.282763 1.452162 0.585303114 2.48104267 0.013099869 0.01938449
GCAACGAGGGTC _A_Concatenation CTACGTGGCCCC _B 4202.603619 3.814915033 0.309101227 12.34196019 5.38E-35 1.01E-33
GCAACGAGGGTC _A_Concatenation GAGATTGTGTCC _B 8053.304043 1.747967233 0.227510524 7.683017043 1.55E-14 7.29E-14
GCAGATTCGAAT _A_Concatenation ACTTGATGGTTT _B 1681.303554 -1.929604695 0.213270677 -9.04767933 1.46E-19 1.00E-18
GCAGATTCGAAT _A_Concatenation AGAGTGCGCTCT _B 1005.136945 -0.722559377 0.413449554 -1.747636126 0.080527056 0.106248452
GCAGATTCGAAT _A_Concatenation ATAACTCCACGC _B 2415.102403 -1.583046187 0.292496086 -5.412196129 6.23E-08 1.68E-07
GCAGATTCGAAT _A_Concatenation ATGACCCCTTGT _B 2008.675555 -1.374543042 0.300679732 -4.571452261 4.84E-06 1.08E-05
GCAGATTCGAAT _A_Concatenation CTACGTGGCCCC _B 2163.854106 -1.207765506 0.346727309 -3.483329621 0.000495218 0.000885184
GCAGATTCGAAT _A_Concatenation GAGATTGTGTCC _B 3134.395892 -1.932223915 0.121062901 -15.96049576 2.41E-57 1.22E-55
GCGAGGCAGACC _A_Concatenation ACTTGATGGTTT _B 9998.460602 -1.608929324 0.270077497 -5.957287597 2.56E-09 7.89E-09
GCGAGGCAGACC _A_Concatenation AGAGTGCGCTCT _B 10229.28468 -0.41378564 0.281685872 -1.468961285 0.14184329 0.177912882
GCGAGGCAGACC _A_Concatenation ATAACTCCACGC _B 7481.38397 0.984753459 0.17775046 5.540089516 3.02E-08 8.35E-08
GCGAGGCAGACC _A_Concatenation ATGACCCCTTGT _B 2947.323423 -0.581244666 0.550826388 -1.055222987 0.291323353 0.343649849
GCGAGGCAGACC _A_Concatenation CTACGTGGCCCC _B 13945.66307 -0.657090684 0.229655841 -2.861197353 0.004220442 0.006604682
GCGAGGCAGACC _A_Concatenation GAGATTGTGTCC _B 13856.50588 -1.089375366 0.133772988 -8.143462896 3.84E-16 2.00E-15
GGAGGCACATCG _A_Concatenation ACTTGATGGTTT _B 10911.17753 0.130262569 0.683996076 0.190443445 0.848961656 0.872855036
GGAGGCACATCG _A_Concatenation AGAGTGCGCTCT _B 10449.92639 -1.247434251 0.169311162 -7.367702399 1.74E-13 7.41E-13
GGAGGCACATCG _A_Concatenation ATAACTCCACGC _B 1820.823073 1.000313211 0.288305058 3.469634627 0.000521167 0.000927312
GGAGGCACATCG _A_Concatenation ATGACCCCTTGT _B 736.3850518 1.665678435 1.488224186 1.119238923 0.263038224 0.314730486
GGAGGCACATCG _A_Concatenation CTACGTGGCCCC _B 2577.387099 -0.246567613 0.577620977 -0.426867484 0.66947585 0.706971336
GGAGGCACATCG _A_Concatenation GAGATTGTGTCC _B 2408.376071 0.171544071 0.342882893 0.500299299 0.616864348 0.660303913
GGCACAGCTCCA _A_Concatenation ACTTGATGGTTT _B 10168.40188 1.82754345 0.300879549 6.074003564 1.25E-09 3.92E-09
GGCACAGCTCCA _A_Concatenation AGAGTGCGCTCT _B 13809.01054 -1.319436508 0.128819826 -10.24249569 1.28E-24 1.30E-23
GGCACAGCTCCA _A_Concatenation ATAACTCCACGC _B 3891.890595 0.875741323 0.439679875 1.991770316 0.046396265 0.063733529
GGCACAGCTCCA _A_Concatenation ATGACCCCTTGT _B 665.7482723 2.529857075 1.170407546 2.161518083 0.030655342 0.043072229
GGCACAGCTCCA _A_Concatenation CTACGTGGCCCC _B 4210.973173 2.255188415 0.475051927 4.747246121 2.06E-06 4.71E-06
GGCACAGCTCCA _A_Concatenation GAGATTGTGTCC _B 7092.482291 0.309945391 0.203026896 1.526622322 0.126854953 0.161384458
GGCACTGTCGAG _A_Concatenation ACTTGATGGTTT _B 26937.44905 -0.484092301 0.145936065 -3.317153301 0.000909397 0.001572611
GGCACTGTCGAG _A_Concatenation AGAGTGCGCTCT _B 22237.80463 -1.634076587 0.168257112 -9.711783167 2.69E-22 2.38E-21
GGCACTGTCGAG _A_Concatenation ATAACTCCACGC _B 6876.088123 1.567885594 0.515860392 3.039360297 0.002370811 0.003806976
GGCACTGTCGAG _A_Concatenation ATGACCCCTTGT _B 5560.317685 -0.602936851 0.195686234 -3.081140856 0.002062091 0.003338759
GGCACTGTCGAG _A_Concatenation CTACGTGGCCCC _B 12239.14756 1.735512855 0.297827413 5.827243501 5.64E-09 1.65E-08
GGCACTGTCGAG _A_Concatenation GAGATTGTGTCC _B 15495.60877 0.447924345 0.198725186 2.253988807 0.024196872 0.034579638
GGCCTTTCGGGC _A_Concatenation ACTTGATGGTTT _B 12622.87776 -1.370937157 0.225191372 -6.087876027 1.14E-09 3.62E-09
GGCCTTTCGGGC _A_Concatenation AGAGTGCGCTCT _B 10465.7957 -1.517975013 0.301138982 -5.040778848 4.64E-07 1.14E-06
GGCCTTTCGGGC _A_Concatenation ATAACTCCACGC _B 11042.17768 0.699486794 0.228048976 3.067265671 0.002160268 0.003478447
GGCCTTTCGGGC _A_Concatenation ATGACCCCTTGT _B 5341.830827 -1.033980621 0.22857522 -4.523590171 6.08E-06 1.33E-05
GGCCTTTCGGGC _A_Concatenation CTACGTGGCCCC _B 13022.28958 -0.344613453 0.167997564 -2.051300298 0.040237712 0.055666136
GGCCTTTCGGGC _A_Concatenation GAGATTGTGTCC _B 15749.77559 -0.902085088 0.152874766 -5.90081093 3.62E-09 1.08E-08
GGGAGTATGTCG _A_Concatenation ACTTGATGGTTT _B 5734.817642 -0.44540666 0.281682904 -1.581234267 0.113824483 0.146542754
GGGAGTATGTCG _A_Concatenation AGAGTGCGCTCT _B 5406.206268 -1.400220305 0.259292108 -5.400165543 6.66E-08 1.79E-07
GGGAGTATGTCG _A_Concatenation ATAACTCCACGC _B 8760.106862 -1.501784784 0.227146455 -6.611526413 3.80E-11 1.37E-10
GGGAGTATGTCG _A_Concatenation ATGACCCCTTGT _B 1288.255927 1.582732061 0.398763117 3.96910344 7.21E-05 0.000140093
GGGAGTATGTCG _A_Concatenation CTACGTGGCCCC _B 12308.85067 -1.381244589 0.183001588 -7.547719152 4.43E-14 1.98E-13
GGGAGTATGTCG _A_Concatenation GAGATTGTGTCC _B 13024.98764 -1.213572927 0.360156739 -3.369568845 0.000752859 0.001327439
GGTGTCTCGATT _A_Concatenation ACTTGATGGTTT _B 3814.828771 1.464165491 0.336886557 4.346167748 1.39E-05 2.88E-05
GGTGTCTCGATT _A_Concatenation AGAGTGCGCTCT _B 8109.140606 -0.582246437 0.226659977 -2.568810098 0.010204836 0.015255056
GGTGTCTCGATT _A_Concatenation ATAACTCCACGC _B 1582.26442 -0.887264559 0.382376342 -2.320396066 0.020319462 0.029325248
GGTGTCTCGATT _A_Concatenation ATGACCCCTTGT _B 660.0491018 2.02701979 0.934524351 2.169039028 0.030079721 0.042365294
GGTGTCTCGATT _A_Concatenation CTACGTGGCCCC _B 1598.326951 1.211058913 0.42680666 2.83748832 0.004547001 0.007096721
GGTGTCTCGATT _A_Concatenation GAGATTGTGTCC _B 1317.17623 5.588341356 0.407807395 13.70338406 9.69E-43 2.85E-41
GTACATACCGAA _A_Concatenation ACTTGATGGTTT _B 6560.157472 0.149710419 0.179329014 0.83483657 0.403809732 0.457416257
GTACATACCGAA _A_Concatenation AGAGTGCGCTCT _B 4829.016109 -0.88093024 0.14367658 -6.131341942 8.71E-10 2.81E-09
GTACATACCGAA _A_Concatenation ATAACTCCACGC _B 1740.833154 1.160160012 0.518104636 2.239238817 0.025140381 0.035840373
GTACATACCGAA _A_Concatenation ATGACCCCTTGT _B 886.7533914 1.577162801 1.327084549 1.188441838 0.234659381 0.284266131
GTACATACCGAA _A_Concatenation CTACGTGGCCCC _B 1434.784202 3.118197979 1.160222501 2.687586197 0.007197052 0.011026677
GTACATACCGAA _A_Concatenation GAGATTGTGTCC _B 3638.304071 2.018138714 0.435258468 4.636644341 3.54E-06 7.98E-06
GTCACCCTCCTT _A_Concatenation ACTTGATGGTTT _B 2952.325444 3.977248973 0.587903056 6.765144236 1.33E-11 4.96E-11
GTCACCCTCCTT _A_Concatenation AGAGTGCGCTCT _B 2794.00108 -1.444032725 0.491776329 -2.936360782 0.003320878 0.00524609
GTCACCCTCCTT _A_Concatenation ATAACTCCACGC _B 343.8542156 6.720052053 0.588537442 11.4182235 3.39E-30 5.15E-29
GTCACCCTCCTT _A_Concatenation ATGACCCCTTGT _B 58.45196938 8.365760902 1.757942818 4.758835621 1.95E-06 4.46E-06
GTCACCCTCCTT _A_Concatenation CTACGTGGCCCC _B 976.812591 3.086674978 1.881831906 1.640250103 0.100953176 0.13156551
GTCACCCTCCTT _A_Concatenation GAGATTGTGTCC _B 427.5145602 1.539530063 1.58836413 0.969255119 0.332417916 0.386663228
GTTCCACGCCTG _A_Concatenation ACTTGATGGTTT _B 3541.702709 2.023938089 0.414889818 4.87825442 1.07E-06 2.51E-06
GTTCCACGCCTG _A_Concatenation AGAGTGCGCTCT _B 4180.581225 -1.658960556 0.286791818 -5.784546328 7.27E-09 2.10E-08
GTTCCACGCCTG _A_Concatenation ATAACTCCACGC _B 1642.066144 -0.381452 0.512562258 -0.744206181 0.456751749 0.5094874
GTTCCACGCCTG _A_Concatenation ATGACCCCTTGT _B 2531.307398 -1.891181968 0.241050358 -7.845588706 4.31E-15 2.09E-14
GTTCCACGCCTG _A_Concatenation CTACGTGGCCCC _B 11395.16067 -2.639856976 0.326663242 -8.081279555 6.41E-16 3.26E-15
GTTCCACGCCTG _A_Concatenation GAGATTGTGTCC _B 3410.426437 0.806907237 0.418935973 1.926087251 0.05409348 0.073443993
Stop-1_A/u splice_ ACTTGATGGTTT _b 6770.939871 -1.550289594 0.154376583 -10.04225878 9.94E-24 9.60E-23
Stop-1_A/u splice_ AGAGTGCGCTCT _b 3515.42995 -1.073168093 0.213288463 -5.031533723 4.87E-07 1.19E-06
Stop-1_A/u splice_ ATAACTCCACGC _b 3762.375634 -0.058833579 0.323078612 -0.182102983 0.855501908 0.878034882
Stop-1_A/u splice_ ATGACCCCTTGT _b 5292.419803 -1.232494879 0.220901192 -5.579394429 2.41E-08 6.72E-08
Stop-1_A/u splice_ CTACGTGGCCCC _b 4763.092809 -0.862005644 0.163658444 -5.267101534 1.39E-07 3.58E-07
Stop-1_A/u splice_ GAGATTGTGTCC _b 4167.210017 -0.845348565 0.3825702 -2.20965607 0.027129041 0.038407725
Stop 2_A splice ACTTGATGGTTT B 4651.827038 -1.345155133 0.31788373 -4.231594783 2.32E-05 4.67E-05
Stop 2_A splice AGAGTGCGCTCT B 5491.22895 -0.209680553 0.196807086 -1.065411603 0.286689708 0.339210798
Stop 2_A splice ATAACTCCACGC B 7206.40228 -1.432278801 0.204768335 -6.994630294 2.66E-12 1.04E-11
Stop 2_A splice ATGACCCCTTGT B 4264.663587 -1.301521091 0.228364187 -5.699322233 1.20E-08 3.42E-08
Stop 2_A splice CTACGTGGCCCC B 8777.825472 -0.958387811 0.223296821 -4.291990402 1.77E-05 3.62E-05
Stop 2_A splice GAGATTGTGTCC B 7440.622554 -1.787105409 0.123418402 -14.48005631 1.62E-47 5.74E-46
TAAAATTTATCA _A_Concatenation ACTTGATGGTTT _B 21275.58545 3.681447284 0.419828387 8.768933689 1.80E-18 1.14E-17
TAAAATTTATCA _A_Concatenation AGAGTGCGCTCT _B 35374.02741 -1.2744245 0.118961367 -10.71292748 8.85E-27 1.10E-25
TAAAATTTATCA _A_Concatenation ATAACTCCACGC _B 6986.696056 2.26992008 0.463393497 4.898472018 9.66E-07 2.29E-06
TAAAATTTATCA _A_Concatenation ATGACCCCTTGT _B 1571.431201 1.152400318 0.905355488 1.272870527 0.203063989 0.24715172
TAAAATTTATCA _A_Concatenation CTACGTGGCCCC _B 7003.422513 0.884461045 0.413024856 2.141423285 0.032239919 0.044974302
TAAAATTTATCA _A_Concatenation GAGATTGTGTCC _B 6062.194542 2.021145272 0.229834503 8.793915836 1.44E-18 9.28E-18
TAAGATTCAACG _A_Concatenation ACTTGATGGTTT _B 6759.542322 3.512228277 0.433036603 8.110696076 5.03E-16 2.60E-15
TAAGATTCAACG _A_Concatenation AGAGTGCGCTCT _B 5395.080264 -0.985697108 0.161075465 -6.119473945 9.39E-10 3.01E-09
TAAGATTCAACG _A_Concatenation ATAACTCCACGC _B 2099.298848 1.514948514 0.632862051 2.393805271 0.016674599 0.024304995
TAAGATTCAACG _A_Concatenation ATGACCCCTTGT _B 632.6750281 -0.220433001 0.746118033 -0.295439852 0.767657894 0.799317582
TAAGATTCAACG _A_Concatenation CTACGTGGCCCC _B 3468.070765 1.377322426 0.38881717 3.542339516 0.000396595 0.000717677
TAAGATTCAACG _A_Concatenation GAGATTGTGTCC _B 4582.259843 1.88792703 0.433941248 4.350651241 1.36E-05 2.82E-05
TACATGAGGCAC _A_Concatenation ACTTGATGGTTT _B 6149.76019 -1.347641882 0.154978138 -8.695690212 3.45E-18 2.14E-17
TACATGAGGCAC _A_Concatenation AGAGTGCGCTCT _B 5492.076135 -1.683303257 0.301492183 -5.583240142 2.36E-08 6.59E-08
TACATGAGGCAC _A_Concatenation ATAACTCCACGC _B 5919.33223 1.551789521 0.16325261 9.50544997 1.99E-21 1.65E-20
TACATGAGGCAC _A_Concatenation ATGACCCCTTGT _B 1309.199585 -1.026581297 0.523263491 -1.961882137 0.049776208 0.067897768
TACATGAGGCAC _A_Concatenation CTACGTGGCCCC _B 9934.852402 -0.76646939 0.169716852 -4.516165499 6.30E-06 1.37E-05
TACATGAGGCAC _A_Concatenation GAGATTGTGTCC _B 8728.309171 0.233306811 0.212863709 1.096038456 0.273061946 0.325060503
TACTTCTAGCGG _A_Concatenation ACTTGATGGTTT _B 4302.117559 1.652448374 0.441083263 3.746341136 0.000179433 0.000336148
TACTTCTAGCGG _A_Concatenation AGAGTGCGCTCT _B 4475.732991 -1.667434184 0.200975338 -8.296710438 1.07E-16 5.85E-16
TACTTCTAGCGG _A_Concatenation ATAACTCCACGC _B 477.7724878 2.437585897 1.639942812 1.486384695 0.137177396 0.173027042
TACTTCTAGCGG _A_Concatenation ATGACCCCTTGT _B 809.5944266 2.088643729 0.471961628 4.425452423 9.62E-06 2.04E-05
TACTTCTAGCGG _A_Concatenation CTACGTGGCCCC _B 2398.191231 3.13191843 0.628499884 4.98316469 6.26E-07 1.50E-06
TACTTCTAGCGG _A_Concatenation GAGATTGTGTCC _B 2284.807319 0.044261275 0.247928202 0.178524569 0.858311028 0.880145256
TATCGCAAGAAC _A_Concatenation ACTTGATGGTTT _B 12059.72269 -1.388513503 0.130785731 -10.61670483 2.49E-26 2.94E-25
TATCGCAAGAAC _A_Concatenation AGAGTGCGCTCT _B 15379.92089 -1.799208835 0.194469728 -9.251871009 2.21E-20 1.63E-19
TATCGCAAGAAC _A_Concatenation ATAACTCCACGC _B 18972.64165 -1.138822259 0.107213583 -10.6219961 2.35E-26 2.81E-25
TATCGCAAGAAC _A_Concatenation ATGACCCCTTGT _B 6480.891045 -1.495319759 0.284813575 -5.25017026 1.52E-07 3.91E-07
TATCGCAAGAAC _A_Concatenation CTACGTGGCCCC _B 16789.93099 -1.246919211 0.120838265 -10.31891023 5.79E-25 6.26E-24
TATCGCAAGAAC _A_Concatenation GAGATTGTGTCC _B 14644.22332 -0.488531461 0.057006493 -8.569751243 1.04E-17 6.25E-17
TATCTATCGTGT _A_Concatenation ACTTGATGGTTT _B 2811.948212 2.672265841 0.593125566 4.505396487 6.62E-06 1.43E-05
TATCTATCGTGT _A_Concatenation AGAGTGCGCTCT _B 10930.15846 -0.773125124 0.099871015 -7.741236252 9.85E-15 4.66E-14
TATCTATCGTGT _A_Concatenation ATAACTCCACGC _B 1384.03324 -2.232141475 0.538541488 -4.144790186 3.40E-05 6.77E-05
TATCTATCGTGT _A_Concatenation ATGACCCCTTGT _B 781.9986629 -0.89185393 0.400957593 -2.224309867 0.026127603 0.037111991
TATCTATCGTGT _A_Concatenation CTACGTGGCCCC _B 14.1790172 3.494903988 1.210382028 2.887438764 0.003883922 0.006110774
TATCTATCGTGT _A_Concatenation GAGATTGTGTCC _B 269.9342778 2.758841473 1.015950257 2.715528102 0.006617016 0.010191425
TATTAAAAGTGC _A_Concatenation ACTTGATGGTTT _B 1791.891217 -0.490610036 0.307945502 -1.593171626 0.111121695 0.143378875
TATTAAAAGTGC _A_Concatenation AGAGTGCGCTCT _B 6790.197544 -3.056231128 0.242437497 -12.60626417 1.95E-36 4.07E-35
TATTAAAAGTGC _A_Concatenation ATAACTCCACGC _B 2098.961786 0.262924991 0.458676551 0.573225272 0.566492175 0.613743607
TATTAAAAGTGC _A_Concatenation ATGACCCCTTGT _B 1293.63795 -0.035491092 0.660966192 -0.053695776 0.957177549 0.963773087
TATTAAAAGTGC _A_Concatenation CTACGTGGCCCC _B 6284.208552 -1.152659623 0.213556954 -5.397434272 6.76E-08 1.82E-07
TATTAAAAGTGC _A_Concatenation GAGATTGTGTCC _B 7027.473739 -2.02708462 0.219005973 -9.255841726 2.13E-20 1.58E-19
TCAGCCGTAGGC _A_Concatenation ACTTGATGGTTT _B 16554.34192 -1.453057618 0.064022099 -22.69618844 4.89E-114 1.14E-111
TCAGCCGTAGGC _A_Concatenation AGAGTGCGCTCT _B 20248.40323 -1.988435524 0.133368132 -14.90937526 2.86E-50 1.29E-48
TCAGCCGTAGGC _A_Concatenation ATAACTCCACGC _B 5684.077083 3.324799708 0.519910917 6.394941129 1.61E-10 5.51E-10
TCAGCCGTAGGC _A_Concatenation ATGACCCCTTGT _B 1689.960123 1.251569527 0.39928948 3.134491611 0.001721522 0.002826487
TCAGCCGTAGGC _A_Concatenation CTACGTGGCCCC _B 11740.81276 0.12600346 0.129210924 0.97517653 0.329472702 0.384384819
TCAGCCGTAGGC _A_Concatenation GAGATTGTGTCC _B 12895.42597 0.070861143 0.127183025 0.55715881 0.57741893 0.621549474
TCTATCACAACT _A_Concatenation ACTTGATGGTTT _B 4350.649725 4.136958603 1.244116147 3.325218962 0.000883491 0.001536907
TCTATCACAACT _A_Concatenation AGAGTGCGCTCT _B 6986.453878 -1.147688985 0.287304839 -3.994673361 6.48E-05 0.00012601
TCTATCACAACT _A_Concatenation ATAACTCCACGC _B 1160.085311 0.483088111 1.021330534 0.472998794 0.63621404 0.675507913
TCTATCACAACT _A_Concatenation ATGACCCCTTGT _B 1327.290744 4.198515234 0.75143292 5.587345356 2.31E-08 6.46E-08
TCTATCACAACT _A_Concatenation CTACGTGGCCCC _B 3739.709094 1.537223849 0.293426341 5.238874747 1.62E-07 4.15E-07
TCTATCACAACT _A_Concatenation GAGATTGTGTCC _B 3762.752041 -0.426854541 0.867553278 -0.492021127 0.622704409 0.664180158
TCTGTACCAACG _A_Concatenation ACTTGATGGTTT _B 6868.114756 3.852317439 0.427231429 9.016933621 1.93E-19 1.31E-18
TCTGTACCAACG _A_Concatenation AGAGTGCGCTCT _B 17789.47762 -0.916017932 0.154156057 -5.942146884 2.81E-09 8.56E-09
TCTGTACCAACG _A_Concatenation ATAACTCCACGC _B 305.1123518 -1.151692157 1.438008653 -0.80089376 0.423193153 0.476601923
TCTGTACCAACG _A_Concatenation ATGACCCCTTGT _B 980.9939975 -0.314616163 0.394106935 -0.798301515 0.424695539 0.477374121
TCTGTACCAACG _A_Concatenation CTACGTGGCCCC _B 1319.119465 4.970174484 0.838038212 5.930725369 3.02E-09 9.11E-09
TCTGTACCAACG _A_Concatenation GAGATTGTGTCC _B 1905.64523 1.193588807 0.707357765 1.687390548 0.09152828 0.120220853
TCTTAGAGGGTG _A_Concatenation ACTTGATGGTTT _B 8723.933313 2.576483961 0.380312211 6.774654847 1.25E-11 4.66E-11
TCTTAGAGGGTG _A_Concatenation AGAGTGCGCTCT _B 9934.008985 -1.373247148 0.150755253 -9.109116426 8.31E-20 5.85E-19
TCTTAGAGGGTG _A_Concatenation ATAACTCCACGC _B 1698.377181 3.254335631 0.608362516 5.349336204 8.83E-08 2.35E-07
TCTTAGAGGGTG _A_Concatenation ATGACCCCTTGT _B 1297.614228 1.932667118 0.516297332 3.743321916 0.000181603 0.000339671
TCTTAGAGGGTG _A_Concatenation CTACGTGGCCCC _B 2864.773702 1.887592686 0.356690179 5.291967087 1.21E-07 3.14E-07
TCTTAGAGGGTG _A_Concatenation GAGATTGTGTCC _B 2702.107475 1.069024949 0.339439251 3.149385185 0.001636144 0.002701486
TCTTGGTTTTGA _A_Concatenation ACTTGATGGTTT _B 5137.004474 2.817593061 0.190327188 14.80394413 1.38E-49 5.77E-48
TCTTGGTTTTGA _A_Concatenation AGAGTGCGCTCT _B 13199.65293 -1.589187701 0.177938744 -8.931094288 4.22E-19 2.80E-18
TCTTGGTTTTGA _A_Concatenation ATAACTCCACGC _B 1673.558599 -1.585776296 0.256590073 -6.180193471 6.40E-10 2.10E-09
TCTTGGTTTTGA _A_Concatenation ATGACCCCTTGT _B 1110.849951 2.601857597 0.86405465 3.011218788 0.002602013 0.004149731
TCTTGGTTTTGA _A_Concatenation CTACGTGGCCCC _B 4347.668806 -0.058402182 0.337794666 -0.172892554 0.862735885 0.883907318
TCTTGGTTTTGA _A_Concatenation GAGATTGTGTCC _B 3259.311251 1.003068615 0.37059663 2.706631776 0.00679696 0.010454798
TGATGAGCGCCC _A_Concatenation ACTTGATGGTTT _B 25515.2802 1.948544725 0.201628521 9.664033227 4.29E-22 3.74E-21
TGATGAGCGCCC _A_Concatenation AGAGTGCGCTCT _B 40174.9284 -1.78389791 0.14226947 -12.53886664 4.57E-36 9.33E-35
TGATGAGCGCCC _A_Concatenation ATAACTCCACGC _B 8264.16873 -0.729858195 0.389623817 -1.873238142 0.061035508 0.082012079
TGATGAGCGCCC _A_Concatenation ATGACCCCTTGT _B 2404.200024 -0.384694895 0.29607366 -1.299321581 0.193833591 0.237020364
TGATGAGCGCCC _A_Concatenation CTACGTGGCCCC _B 8870.018591 1.727602207 0.218279543 7.914631766 2.48E-15 1.21E-14
TGATGAGCGCCC _A_Concatenation GAGATTGTGTCC _B 12717.27295 1.750387064 0.185141296 9.454330849 3.25E-21 2.62E-20
TGCCAGACGGTC _A_Concatenation ACTTGATGGTTT _B 4510.886746 1.052444206 0.397232804 2.649439312 0.008062545 0.012288286
TGCCAGACGGTC _A_Concatenation AGAGTGCGCTCT _B 12203.32168 1.49634125 0.243676951 6.140676193 8.22E-10 2.67E-09
TGCCAGACGGTC _A_Concatenation ATAACTCCACGC _B 2226.029711 -0.289680951 0.304542795 -0.951199492 0.341503116 0.396442048
TGCCAGACGGTC _A_Concatenation ATGACCCCTTGT _B 783.9451893 -1.282710293 0.387401864 -3.311058648 0.000929437 0.001604892
TGCCAGACGGTC _A_Concatenation CTACGTGGCCCC _B 511.6013738 0.014106396 0.710152224 0.019863905 0.984151939 0.988379396
TGCCAGACGGTC _A_Concatenation GAGATTGTGTCC _B 7259.82545 -0.114018914 0.541302892 -0.210637917 0.833169822 0.859643003
TGGCCCCCAGTT _A_Concatenation ACTTGATGGTTT _B 1753.655345 -0.193364443 0.340707643 -0.567537734 0.570348898 0.615639761
TGGCCCCCAGTT _A_Concatenation AGAGTGCGCTCT _B 709.7995748 -0.456607033 0.487135992 -0.937329698 0.348588991 0.403546833
TGGCCCCCAGTT _A_Concatenation ATAACTCCACGC _B 1720.70992 -1.822296704 0.288489682 -6.316678953 2.67E-10 9.00E-10
TGGCCCCCAGTT _A_Concatenation ATGACCCCTTGT _B 875.4770592 -2.387857953 0.540489318 -4.417955864 9.96E-06 2.11E-05
TGGCCCCCAGTT _A_Concatenation CTACGTGGCCCC _B 4944.371412 -1.374462897 0.2652007 -5.182727265 2.19E-07 5.57E-07
TGGCCCCCAGTT _A_Concatenation GAGATTGTGTCC _B 2738.201366 -0.806594213 0.306036809 -2.635611765 0.008398576 0.012750566
TTAAATGAGGGC _A_Concatenation ACTTGATGGTTT _B 0
TTAAATGAGGGC _A_Concatenation AGAGTGCGCTCT _B 0
TTAAATGAGGGC _A_Concatenation ATAACTCCACGC _B 0
TTAAATGAGGGC _A_Concatenation ATGACCCCTTGT _B 0
TTAAATGAGGGC _A_Concatenation CTACGTGGCCCC _B 0
TTAAATGAGGGC _A_Concatenation GAGATTGTGTCC _B 0
TTAGTGCCATAC _A_Concatenation ACTTGATGGTTT _B 4089.882088 3.932440362 0.99665856 3.945624431 7.96E-05 0.00015253
TTAGTGCCATAC _A_Concatenation AGAGTGCGCTCT _B 7283.312021 -0.823307279 0.229314819 -3.590292525 0.000330307 0.000604271
TTAGTGCCATAC _A_Concatenation ATAACTCCACGC _B 1720.584853 -0.589194406 0.525843587 -1.120474644 0.262511552 0.314422135
TTAGTGCCATAC _A_Concatenation ATGACCCCTTGT _B 929.7064877 2.355566835 1.100306086 2.140828687 0.032287856 0.044987489
TTAGTGCCATAC _A_Concatenation CTACGTGGCCCC _B 1217.381374 0.555779736 0.394127845 1.410150901 0.15849513 0.197107242
TTAGTGCCATAC _A_Concatenation GAGATTGTGTCC _B 1245.260138 2.319633216 0.765456969 3.030390094 0.002442381 0.003907883
TTCTAATTTCCT _A_Concatenation ACTTGATGGTTT _B 12951.39885 -0.76441369 0.193074368 -3.95916713 7.52E-05 0.000144847
TTCTAATTTCCT _A_Concatenation AGAGTGCGCTCT _B 5536.76865 -0.847994458 0.149773074 -5.661861882 1.50E-08 4.24E-08
TTCTAATTTCCT _A_Concatenation ATAACTCCACGC _B 1112.148194 2.379131032 1.200826686 1.981244305 0.047563883 0.065119489
TTCTAATTTCCT _A_Concatenation ATGACCCCTTGT _B 1868.207944 -2.315285466 0.382277849 -6.056551462 1.39E-09 4.36E-09
TTCTAATTTCCT _A_Concatenation CTACGTGGCCCC _B 2491.36953 1.716095998 0.409061779 4.195200057 2.73E-05 5.46E-05
TTCTAATTTCCT _A_Concatenation GAGATTGTGTCC _B 5174.311018 0.665455092 0.197959458 3.361572619 0.000775 0.00136442
TTTAACGGATTG _A_Concatenation ACTTGATGGTTT _B 7102.883505 -1.925070194 0.152000796 -12.66486919 9.26E-37 1.97E-35
TTTAACGGATTG _A_Concatenation AGAGTGCGCTCT _B 9789.187406 -1.970966116 0.179422164 -10.98507607 4.51E-28 5.99E-27
TTTAACGGATTG _A_Concatenation ATAACTCCACGC _B 1635.59373 -2.09172008 0.415691771 -5.031901582 4.86E-07 1.19E-06
TTTAACGGATTG _A_Concatenation ATGACCCCTTGT _B 936.910252 -2.594250445 0.556343749 -4.663035124 3.12E-06 7.05E-06
TTTAACGGATTG _A_Concatenation CTACGTGGCCCC _B 3248.860038 -2.595650747 0.332769212 -7.800152945 6.18E-15 2.96E-14
TTTAACGGATTG _A_Concatenation GAGATTGTGTCC _B 1459.234146 0.109469422 0.525289403 0.208398307 0.834917978 0.860687052
Joint_1_A/u splice_ ACTTGATGGTTT _b 12602.18685 2.834282519 0.307436038 9.219096566 3.00E-20 2.19E-19
Joint_1_A/u splice_ AGAGTGCGCTCT _b 13857.04827 -1.407795098 0.151174797 -9.312366374 1.25E-20 9.51E-20
Joint_1_A/u splice_ ATAACTCCACGC _b 1414.540746 0.869479763 0.942569942 0.922456492 0.356290514 0.411564833
Joint_1_A/u splice_ ATGACCCCTTGT _b 1444.921777 1.821238188 0.961015395 1.895118639 0.058076686 0.078487452
Joint_1_A/u splice_ CTACGTGGCCCC _b 3161.063681 3.741360185 0.451001485 8.295671541 1.08E-16 5.87E-16
Joint_1_A/u splice_ GAGATTGTGTCC _b 4439.277102 2.339930864 0.288768714 8.103131495 5.36E-16 2.75E-15
Joint_ 2_A/u splice_ ACTTGATGGTTT _b 4764.362017 1.302185917 0.318553825 4.087804998 4.35E-05 8.60E-05
Joint_ 2_A/u splice_ AGAGTGCGCTCT _b 4641.014609 -0.858114368 0.268247719 -3.198962409 0.001379232 0.002319887
Joint_ 2_A/u splice_ ATAACTCCACGC _b 2312.608333 1.007811593 0.311329084 3.237126382 0.001207399 0.002063523
Joint_ 2_A/u splice_ ATGACCCCTTGT _b 869.5095131 -0.285909193 0.326995498 -0.874352078 0.381926547 0.436007942
Joint_ 2_A/u splice_ CTACGTGGCCCC _b 1693.242298 1.340402916 0.586921782 2.283784584 0.022384194 0.032146343
Joint_ 2_A/u splice_ GAGATTGTGTCC _b 2229.334468 3.897228193 1.017061007 3.831852923 0.000127182 0.000240965
Joint_ 3_A/u splice_ ACTTGATGGTTT _b 20972.98866 -0.931065923 0.174674411 -5.330293769 9.81E-08 2.59E-07
Joint_ 3_A/u splice_ AGAGTGCGCTCT _b 14106.2696 -1.077543821 0.230768459 -4.669372177 3.02E-06 6.84E-06
Joint_ 3_A/u splice_ ATAACTCCACGC _b 12209.13927 1.775272594 0.129554249 13.70292841 9.75E-43 2.85E-41
Joint_ 3_A/u splice_ ATGACCCCTTGT _b 2505.160791 1.677157426 0.259069398 6.473776666 9.56E-11 3.34E-10
Joint_ 3_A/u splice_ CTACGTGGCCCC _b 13055.01539 0.949972726 0.305937506 3.105120186 0.001902017 0.003096738
Joint_ 3_A/u splice_ GAGATTGTGTCC _b 13071.28484 1.250540812 0.13589717 9.202110778 3.51E-20 2.53E-19
TABLE 10 endogenous Gene validation screening
Bar code Class A Class B Day 2_log2FC Day 2_ padj Day 8_log2FC Day 8_ padj Day 14_log2FC Day 14_ padj Day 14 \u alt_log2FC Day 14_alt_ padj Category a extension Category B extension A_ID B_ID A peptide B peptide State of combination
ATTTGTAGACCG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -3.463642398 0.02903062 -3.003231346 0.30068271 -22.21918925 0.724533156 15.39702108 0.001689303 Human target neg_ctrl MYCN Random_1 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
Stop-1_A/u splice_ GAGATTGTGTCC _b neg_ctrl Target(s) -1.627436671 0.16187644 -3.692147517 0.03832777 11.16585017 0.007680143 11.57361012 0.000990799 neg_ctrl Virus_target Stop_1 hvTR_Q2HR71 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Termination_X_strong
ACTTGATGGTTT _A_Concatenation CATTGTTTAAAT _B Target(s) Target(s) 0.358368292 0.73268364 6.81368932 0.00415692 15.31139868 0.000615278 11.44124604 0.014888877 Virus_target Human target hvTR_Q2HR73 POLR1E RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV Mid_X_unknown
AAAACAAGCATT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.291212195 0.13710355 5.050991528 11.49531014 0.013697255 11.22173713 0.003533249 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268803.1_88 hvTR_Q2HR71 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
ACTTGATGGTTT _A_Concatenation AATCGATCCTAC _B Target(s) Target(s) 1.187220049 1.13E-05 5.424965834 0.01393042 2.099804209 0.959421461 11.06854166 5.73E-13 Virus_target Human target hvTR_Q2HR73 PRDM10 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA mid_X_mid
ATAACTCCACGC _A_Concatenation GGCACAGCTCCA _B pos_ctrl_A Target(s) 1.677766859 0.1798316 4.713241977 13.22676222 0.000615278 10.99197172 0.003038184 vIRF2_VP64_pos_ctrl Human target core_VP64_a DTX2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG VIRF2_VP64_pos ctrl_x_unknown
CTACGTGGCCCC _A_Concatenation GGAGGCACATCG _B Target(s) Target(s) 0.587962509 0.3372476 -2.336900595 0.05332773 9.829872553 0.010044477 10.6963194 0.000470015 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q87042 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL Strong_X_strong
ACGACTCACCGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -2.494227768 0.49156691 -4.981795165 0.14175911 12.10068427 0.006685168 10.69187257 0.014756797 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 core_VP64_a PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CCCTGCCTCGGG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.175471785 0.76651497 5.280125963 0.00638945 3.537992144 0.641409749 10.39508229 6.06E-08 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 Random_1 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CTACGTGGCCCC _A_Concatenation TATCTATCGTGT _B Target(s) Target(s) 0.806061304 0.12515795 2.752006162 0.23699814 10.68350197 0.01463682 10.25424652 0.021401372 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03259 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS Strong_X_strong
AGAGTGCGCTCT _A_Concatenation CCGCCCTTATGT _B neg_ctrl Target(s) 5.555496896 9.63E-75 12.37826167 1.30E-25 -0.408180374 0.998501722 10.19520655 0.000139482 neg_ctrl Human target Random_1 ZFX CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH Random_x_mid
GAGATTGTGTCC _A_Concatenation CCAGACGGTCTG _B Target(s) Target(s) 4.107593678 2.63E-50 13.29316804 -0.550800531 0.997690695 10.11839603 6.85E-05 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN Strong_x_weak
Stop 2_A splice ATGACCCCTTGT B neg_ctrl Target(s) 0.479260793 0.89056441 -0.02450146 0.98982201 10.64472131 0.037311924 10.09812267 0.040651866 neg_ctrl Human target Stop_2 LEUTX ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Termination_X_strong
ACTTGATGGTTT _A_Concatenation AATGGCACTAGC _B Target(s) Target(s) 4.136321997 7.26E-66 11.71054224 1.41E-26 0.013660142 0.998980766 10.09745322 3.41E-09 Virus_target Virus_target hvTR_Q2HR73 cds_NC_024711.1_cds_YP_009052522.2_44 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR Mid_X_weak
CTCACGACAAGA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.574870212 0.66469402 7.748547615 3.55E-13 1.411387083 0.997690695 9.902304554 3.12E-14 Virus_target vIRF2_VP64_pos_ctrl hvTR_G3G929 core_VP64_a ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CTCCATTAATGA _B neg_ctrl Target(s) 0.119959975 0.926639 -5.887757856 0.00109019 10.52305524 0.014416269 9.802964914 0.024978026 neg_ctrl Virus_target Random_1 hvTR_Q5IXR3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV Random_x_weak
ATGACCCCTTGT _A_Concatenation AAGACCGGTGCC _B Target(s) Target(s) 3.222871153 2.76E-21 10.16328231 1.61E-15 -0.413316708 0.997690695 9.786927446 3.70E-08 Human target Virus_target LEUTX hvTR_P08392 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF Strong_X_mid
CACACCCGGCAG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.313750639 0.04563877 1.527309891 0.33007775 9.13783676 0.013424967 9.785512442 0.001593463 Virus_target Virus_target hvTR_P19893 hvTR_Q2HR71 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TATCGCAAGAAC _A_Concatenation ATGACCCCTTGT _B neg_ctrl Target(s) -1.481547652 0.02824838 3.405197617 0.00200126 8.928669924 0.013697255 9.671232973 0.001188078 neg_ctrl Human target Random_5 LEUTX PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Random_x_strong
ATGACAAAACGA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.818128085 0.03302877 4.956537401 0.04840645 1.261699218 0.997690695 9.524994268 3.65E-13 Virus_target Human target cds_NC_025350.1_cds_YP_009094074.1_1 LEUTX HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
TAAGATTCAACG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.175895712 0.80641597 6.167436043 0.00322932 -12.41618445 0.005760567 9.524773773 0.024114563 Human target Human target ZNF292 LEUTX CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
CAGTGACATAGT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.03920268 0.97418471 0.818673193 0.62046092 10.71864212 0.01049685 9.520527873 0.017149591 Human target Virus_target EHMT2 hvTR_Q2HR73 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GAGATTGTGTCC _A_Concatenation TAAGATTCAACG _B Target(s) Target(s) 1.268705673 0.01521957 8.164086967 2.60E-05 -2.997341245 0.848773578 9.388949471 0.006225935 Virus_target Human target hvTR_Q2HR71 ZNF292 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP Strong_X_strong
AGTTGCTTTGCT _A_Concatenation CTACGTGGCCCC _B neg_ctrl Target(s) 0.577303779 0.1639263 6.003819587 0.00224128 -2.741546785 0.871241789 9.333613067 0.002371643 neg_ctrl Virus_target Random_2 hvTR_Q2HR71 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Random_x_strong
ACTTGATGGTTT _A_Concatenation AGTTGCTTTGCT _B Target(s) neg_ctrl -0.691496294 0.80286361 -4.818684306 0.07168116 10.46602713 0.005760567 9.274087677 0.006669939 Virus_target neg_ctrl hvTR_Q2HR73 Random_2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM Mid_X_random
AGAGTGCGCTCT _A_Concatenation TGCCAGACGGTC _B neg_ctrl Target(s) 1.029223434 0.16544842 -3.020111598 0.03888733 -12.3335532 0.00409751 9.166300733 0.042038866 neg_ctrl Virus_target Random_1 hvTR_P10541 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG Random_x_strong
GTACATACCGAA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -3.036940265 0.15677681 5.341118744 0.00095831 -4.401537116 9.123474948 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 hvTR_Q2HR71 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATGACCCCTTGT _A_Concatenation ATGTGGCACGAC _B Target(s) Target(s) 1.077030919 0.01879827 3.888636396 9.788818368 0.010044477 9.113663087 0.010118121 Human target Human target LEUTX ANKRD17 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC Strong_X_mid
ACTTGATGGTTT _A_Concatenation TACTTCTAGCGG _B Target(s) Target(s) 0.671713663 0.09741707 5.379370488 0.00542747 -2.023130364 0.973675561 9.076876056 0.000470015 Virus_target Virus_target hvTR_Q2HR73 cds_NC_025350.1_cds_YP_009094080.1_7 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ Mid_X_weak
ATGACCCCTTGT _A_Concatenation GATGGAGCTACA _B Target(s) Target(s) 0.347533738 0.72753065 -6.135580134 0.00220481 -11.00294679 0.008547707 9.018961759 0.031425518 Human target Virus_target LEUTX hvTR_P03209 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF Strong_X_strong
GGCACTGTCGAG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.552037234 1.14E-40 5.305129936 1.14E-07 -0.106562398 0.998501722 9.00482691 3.07E-07 Virus_target neg_ctrl hvTR_P17147 Random_1 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
GTTCCACGCCTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.312879507 3.17E-27 5.143585346 3.35E-06 -0.067195717 0.998501722 8.911865562 4.03E-10 Human target Virus_target ZFX hvTR_Q2HR73 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTACGTGGCCCC _A_Concatenation CTCGTTATGGCA _B Target(s) Target(s) 0.698570228 0.03930843 3.434874282 -11.64255281 0.002836034 8.813322585 0.020866488 Virus_target Human target hvTR_Q2HR71 NOVA1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF Strong_X_mid
ATGACCCCTTGT _A_Concatenation GTTCCACGCCTG _B Target(s) Target(s) 1.506779301 2.35E-30 5.90920395 2.85E-08 0.750660557 0.997690695 8.793027335 1.11E-09 Human target Human target LEUTX ZFX DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS Strong_X_strong
TTCTAATTTCCT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.460415026 0.29541092 -3.670780763 0.01608804 -10.40897683 0.013424967 8.752477206 0.045483182 Human target Virus_target WDR76 hvTR_Q2HR71 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CATCTGAGTCGT _A_Concatenation CTACGTGGCCCC _B pos_ctrl_A Target(s) 0.704464119 0.00364994 3.354553919 0.02127152 4.728102918 0.313599588 8.748179961 5.64E-05 vIRF2_VP64_pos_ctrl Virus_target RWY_VP16_vIRF2 hvTR_Q2HR71 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VIRF2_VP64_pos strong_ctrl_x_strong
CTACGTGGCCCC _A_Concatenation ATCTCTCAGTTC _B Target(s) Target(s) 5.109216358 0.00479435 -1.561058745 0.59689983 0.361292 8.708850861 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK Strong_X_mid
Joint_1_A/u splice_ CTACGTGGCCCC _b neg_ctrl Target(s) 0.922993713 0.001233 -2.440330191 0.09193957 -10.15768613 0.015403937 8.652352489 0.049730741 neg_ctrl Virus_target Joint_1 hvTR_Q2HR71 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Joint_X_strong
CCCTATGTTCTA _A_Concatenation ATAACTCCACGC _B pos_ctrl_A pos_ctrl_A -0.084286418 0.89056441 -4.59368123 0.00298056 10.27291873 0.005760567 8.641999877 0.013762881 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_b core_VP64_a DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
Joint_ 3_A/u splice_ ATGACCCCTTGT _b neg_ctrl Target(s) -0.266080949 0.38922543 5.779947485 0.01080066 10.01853312 0.007680143 8.637041421 0.013866638 neg_ctrl Human target Joint_3 LEUTX GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Joint_X_strong
ACTTGATGGTTT _A_Concatenation CGGCAATTGCTT _B Target(s) Target(s) -1.457822561 0.00041066 -5.793769636 9.74E-06 8.360448954 0.018200984 8.594749615 0.005938192 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q9QB93 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK mid_X_mid
GAGATTGTGTCC _A_Concatenation TTAAATGAGGGC _B Target(s) Target(s) -1.106091376 0.49700987 -3.5590441 0.23526565 -11.27049773 0.01068728 8.566673132 0.038129118 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ Strong_X_mid
GCGAGGCAGACC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.351249984 0.16544842 -5.384901622 0.02989572 -12.35765225 0.003914851 8.56354919 0.048359833 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 hvTR_Q2HR71 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
ATAACTCCACGC _A_Concatenation CCGCCCTTATGT _B pos_ctrl_A Target(s) 1.773949498 0.25106342 7.19197092 0.00554242 -3.781905878 8.547214128 vIRF2_VP64_pos_ctrl Human target core_VP64_a ZFX LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH vIRF2_VP64_pos_ctrl_X_mid
TGATGAGCGCCC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.344793988 2.55E-26 5.110659356 1.01E-07 0.40706954 0.997690695 8.518596789 1.60E-12 Human target neg_ctrl EEF1D Random_1 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
GTTCCACGCCTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.468747959 0.86342423 6.344131903 1.08E-06 -1.725473963 8.514359599 Human target Virus_target ZFX hvTR_Q2HR71 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
AGAGTGCGCTCT _A_Concatenation EPICXV.47_B neg_ctrl pos_ctrl_A 1.181055392 0.0391193 4.36387425 0.03785759 -10.50955972 0.010634211 8.47614677 0.032516708 neg_ctrl vIRF2_VP64_pos_ctrl Random_1 3x_vIRF2_core CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
TAAAATTTATCA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.267233163 0.3372476 5.060331334 0.01115993 -10.2774351 0.008462651 8.439808168 0.024114563 Human target Virus_target HSF1 hvTR_Q2HR73 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATCTCTCAGTTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.471641559 0.86936399 1.431186174 -5.702284475 8.438455131 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 hvTR_Q2HR71 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTGGTAGTTGTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.36051598 3.21E-05 5.305392728 0.04394168 9.343547687 0.010044477 8.427055297 0.013365535 Virus_target Virus_target hvTR_F6KS47 hvTR_Q2HR71 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
ATAACTCCACGC _A_Concatenation CAACGCATGCCT _B pos_ctrl_A Target(s) -0.159358465 0.90519313 -4.206018048 0.03359124 -10.64702017 0.010044477 8.380637875 0.052123751 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS VIRF2_VP64_pos ctrl_x_weak
GAGATTGTGTCC _A_Concatenation TCTGTACCAACG _B Target(s) Target(s) 0.88255408 0.00036657 4.942691993 0.00601804 -1.102235492 0.997690695 8.377224871 2.68E-08 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q6XA69 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY Strong_x_unknown
GAGATTGTGTCC _A_Concatenation CCCTGCCTCGGG _B Target(s) Target(s) 0.896690546 2.85E-08 4.962600718 0.00919494 9.365836563 0.010634211 8.368592116 0.016953736 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY Strong_X_mid
CTACGTGGCCCC _A_Concatenation AGTTAGGCTCTT _B Target(s) Target(s) -0.20177753 0.26404203 5.64449311 0.00347065 9.758019231 0.009384904 8.351454723 0.019312927 Virus_target Human target hvTR_Q2HR71 TOX4 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV Strong_X_mid
GTACATACCGAA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.112767526 0.11078803 0.461232116 -9.640315773 0.017128651 8.32131175 0.044930268 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 hvTR_Q2HR73 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GTACATACCGAA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.304629792 0.48768396 5.118584334 0.00880421 -9.008287886 0.013424967 8.313758907 0.014269501 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 hvTR_Q2HR71 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATGACCCCTTGT _A_Concatenation ACAACGTGCATA _B Target(s) Target(s) -0.026948112 0.9335834 2.919238068 2.303723965 0.878576426 8.292700867 1.11E-07 Human target Virus_target LEUTX hvTR_P10407 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET Strong_X_strong
CCAGACGGTCTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.113468401 0.93141616 -6.233352431 7.87E-05 -9.049712443 0.031301413 8.223203079 0.047594012 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 hvTR_Q2HR73 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
AGAGTGCGCTCT _A_Concatenation AGCCAGGTCACC _B neg_ctrl Target(s) 1.843274291 0.20318702 -2.776815549 0.4314257 8.042169107 0.032334459 8.175432008 0.013342514 neg_ctrl Human target Random_1 MESP1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK Random_x_strong
GAGATTGTGTCC _A_Concatenation CATGCTAACACC _B Target(s) Target(s) 0.677641024 0.07423608 5.68162752 0.00224462 -3.176838108 0.668601651 8.152243641 8.11E-05 Virus_target Virus_target hvTR_Q2HR71 hvTR_F5HCV3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL Strong_X_strong
ACTTGATGGTTT _A_Concatenation CACACCCGGCAG _B Target(s) Target(s) 0.56890209 0.47019918 -3.661843296 0.09821235 -10.03343679 0.010044477 8.135475876 0.020523923 Virus_target Virus_target hvTR_Q2HR73 hvTR_P19893 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS Mid_X_strong
TGCCAGACGGTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.849831622 1.11E-57 7.465320142 5.46E-10 -0.018339391 0.998980766 8.128216634 3.12E-14 Virus_target Virus_target hvTR_P10541 hvTR_Q2HR73 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
TGCCAGACGGTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.718555487 4.04E-57 7.040675547 3.03E-11 0.341850866 0.998501722 8.106708867 1.11E-266 Virus_target Virus_target hvTR_P10541 hvTR_Q2HR71 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
CTACGTGGCCCC _A_Concatenation TCTGTACCAACG _B Target(s) Target(s) 0.338397866 0.03952651 5.591561227 9.22E-06 3.798276949 0.51410768 8.105257146 4.75E-05 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q6XA69 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY Strong_x_unknown
ATAACTCCACGC _A_Concatenation TATTAAAAGTGC _B pos_ctrl_A Target(s) -0.428182491 0.91132524 -3.256249648 0.27742577 -8.211751493 0.121748841 8.047026287 0.072895441 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_202 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI VIRF2_VP64_pos ctrl_x_weak
GGGAGTATGTCG _A_Concatenation CTACGTGGCCCC _B neg_ctrl Target(s) 0.994372916 0.49872084 2.199227781 8.849871636 0.031031764 8.033227141 0.043327277 neg_ctrl Virus_target Random_4 hvTR_Q2HR71 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Random_x_strong
GAGATTGTGTCC _A_Concatenation GCAACGAGGGTC _B Target(s) Target(s) 0.69588737 0.01226337 6.035742044 0.00332847 -8.891116836 0.013697255 8.015841384 0.019312927 Virus_target Human target hvTR_Q2HR71 QRICH2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ Strong_X_mid
GGCACTGTCGAG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.844173271 0.003519 5.658536069 0.03543209 -9.099015922 0.013789042 7.995264513 0.02616872 Virus_target Virus_target hvTR_P17147 hvTR_Q2HR71 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
AATCAACGAGCA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.134355205 0.71346073 6.032674181 6.36E-07 -2.565410005 0.827084874 7.970322286 1.10E-10 Human target neg_ctrl C2orf81 Random_1 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ACTTGATGGTTT _A_Concatenation TATCGCAAGAAC _B Target(s) neg_ctrl -0.534561011 0.55849977 -6.250942664 0.00088003 -8.925504392 0.031438151 7.908569995 0.053375963 Virus_target neg_ctrl hvTR_Q2HR73 Random_5 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV Mid_X_random
GCAACGAGGGTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.93245887 0.10382793 -2.038043552 0.1185804 -8.558909782 0.040368381 7.888186097 0.057249341 Human target Virus_target QRICH2 hvTR_Q2HR71 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATAACTCCACGC _A_Concatenation ATGACAAAACGA _B pos_ctrl_A Target(s) 0.549699281 1.85E-05 5.694436396 0.00214973 -2.166437094 0.907768852 7.863100258 2.60E-07 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_025350.1_cds_YP_009094074.1_1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL VIRF2_VP64_pos strong_ctrl_x_strong
GAGATTGTGTCC _A_Concatenation GACTGTGGGCGG _B Target(s) Target(s) 0.699833746 0.92304294 3.248265669 0.2160146 3.182389004 7.820929841 Virus_target Human target hvTR_Q2HR71 MON2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL Strong_X_strong
ATGACCCCTTGT _A_Concatenation CGGCAATTGCTT _B Target(s) Target(s) -0.87344521 0.75207104 1.500155013 0.58424935 3.182389004 7.820929841 Human target Virus_target LEUTX hvTR_Q9QB93 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK Strong_X_mid
CTACGTGGCCCC _A_Concatenation AGGCTTCCCGCT _B Target(s) Target(s) -0.882861289 0.80641597 -3.354317901 0.33987316 3.182389004 7.820929841 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV Strong_X_mid
AGAGTGCGCTCT _A_Concatenation GACTGTGGGCGG _B neg_ctrl Target(s) -4.159852954 4.477125894 0.01342699 3.182389004 7.820929841 neg_ctrl Human target Random_1 MON2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL Random_x_strong
ATAACTCCACGC _A_Concatenation ATCTCTCAGTTC _B pos_ctrl_A Target(s) 1.021475407 0.05020483 -5.899487239 0.00055535 12.24207843 0.003293826 7.814648111 0.078603617 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CTACGTGGCCCC _B pos_ctrl_A Target(s) -0.970571596 0.00852049 -3.894333782 0.00108214 -8.998423902 0.010634211 7.765168345 0.021401372 vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q2HR71 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VIRF2_VP64_pos strong_ctrl_x_strong
AGAGTGCGCTCT _A_Concatenation GAGTGACCTATT _B neg_ctrl Target(s) 0.926559308 0.01063759 5.835977781 0.0030859 -2.480029226 0.846433413 7.712272998 1.03E-05 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI Random_x_unknown
CTACGTGGCCCC _A_Concatenation TCTTGGTTTTGA _B Target(s) Target(s) 0.78842733 0.10348588 1.696022505 0.44251631 -8.542847615 0.03963291 7.705659039 0.062240466 Virus_target Human target hvTR_Q2HR71 ZNF597 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL Strong_X_strong
ATGACCCCTTGT _A_Concatenation TGATGAGCGCCC _B Target(s) Target(s) 0.588388731 0.434377 3.986380963 0.0883946 -4.225558781 0.386178764 7.701181979 0.000249782 Human target Human target LEUTX EEF1D DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL Strong_x_weak
ATAACTCCACGC _A_Concatenation ACCAGTATTTAA _B pos_ctrl_A Target(s) 0.683576529 0.10163584 5.169983535 0.00843344 -8.251747267 0.023733264 7.697459543 0.022916896 vIRF2_VP64_pos_ctrl Human target core_VP64_a ARNTL2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL VIRF2_VP64_pos strong_ctrl_x_strong
TCTATCACAACT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.682639397 3.85E-16 6.89865852 1.35E-11 -4.754507413 0.282213439 7.685191765 0.000604952 Human target Virus_target TOX4 hvTR_Q2HR71 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATGACCCCTTGT _A_Concatenation EPICXV.47_B Target(s) pos_ctrl_A 0.541557247 0.75878608 -6.805037577 0.00458448 -8.538761837 0.037311924 7.665495834 0.057593693 Human target vIRF2_VP64_pos_ctrl LEUTX 3x_vIRF2_core DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TTCTAATTTCCT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.794177652 0.36774226 4.665974015 0.07669979 -2.474694974 7.656169272 Human target neg_ctrl WDR76 Random_1 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
GAGATTGTGTCC _A_Concatenation CCGCCCTTATGT _B Target(s) Target(s) -4.703491632 0.12519805 -4.609441729 0.13475115 -2.474694974 7.656169272 Virus_target Human target hvTR_Q2HR71 ZFX PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH Strong_X_mid
Joint_1_A/u splice_ ATGACCCCTTGT _b neg_ctrl Target(s) 0.394136657 0.88343822 -0.090714014 -2.474694974 7.656169272 neg_ctrl Human target Joint_1 LEUTX GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Joint_X_strong
ATAACTCCACGC _A_Concatenation ATCCAATCTGTG _B pos_ctrl_A Target(s) 0.650896034 0.91465995 -7.081509867 7.653290084 vIRF2_VP64_pos_ctrl Human target core_VP64_a GLI2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT vIRF2_VP64_pos_ctrl_X_mid
TCTGTACCAACG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.791455752 0.00785913 -1.88030649 0.26956633 8.646985846 0.014416269 7.640775335 0.024390507 Virus_target neg_ctrl hvTR_Q6XA69 Random_1 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
ATAACTCCACGC _A_Concatenation ATGTGGCACGAC _B pos_ctrl_A Target(s) 0.641890332 1.47E-06 4.899229592 0.00810947 -2.262580913 0.877171774 7.639289022 1.10E-06 vIRF2_VP64_pos_ctrl Human target core_VP64_a ANKRD17 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC vIRF2_VP64_pos_ctrl_X_mid
ACTTGATGGTTT _A_Concatenation CCAGACGGTCTG _B Target(s) Target(s) 0.196096406 0.7214152 3.190731301 0.05166799 7.184898497 0.037311924 7.58221318 0.008616869 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN Mid_X_weak
TGATGAGCGCCC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.170016441 0.85449119 4.574274272 8.99E-05 8.660383291 0.013697255 7.534873296 0.024390507 Human target Human target EEF1D LEUTX QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
ACTTGATGGTTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.71367231 5.18E-09 6.185353501 6.51E-14 -0.571016588 0.997690695 7.460628152 5.89E-12 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q2HR71 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
GGCCTTTCGGGC _A_Concatenation GAGATTGTGTCC _B neg_ctrl Target(s) -0.370682457 0.55882961 4.244894033 -9.412315455 0.010044477 7.451282675 0.038629478 neg_ctrl Virus_target Random_3 hvTR_Q2HR71 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Random_x_strong
ATGTGGCACGAC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.09503707 0.85386731 -5.658715328 0.00038416 -8.732218594 0.034652658 7.411667637 0.073394906 Human target Virus_target ANKRD17 hvTR_Q2HR73 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ACTCGTGTTGTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.933547888 0.04563877 5.663934779 0.01080066 -8.370596018 0.039318978 7.407951899 0.065462272 Virus_target neg_ctrl hvTR_Q9QB97 Random_1 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TCTTAGAGGGTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.447646944 0.33804652 6.230481486 0.00013177 -4.688684329 0.287920482 7.401409185 0.001244361 Human target Human target TRO LEUTX FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AGAGTGCGCTCT _A_Concatenation CATTAGTCCCGC _B neg_ctrl Target(s) 0.339728174 0.91324404 6.332730281 0.01752569 -10.55211288 0.008547707 7.391569974 0.047229778 neg_ctrl Virus_target Random_1 hvTR_P06930 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF Random_x_strong
GCAACGAGGGTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.127345911 1.21E-07 2.953757152 0.0883946 -8.540949516 0.01463682 7.382284708 0.030899299 Human target vIRF2_VP64_pos_ctrl QRICH2 core_VP64_a RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACAACGTGCATA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.367173601 0.0878806 4.339796723 0.01981257 0.713647901 0.997690695 7.365499688 4.82E-11 Virus_target Virus_target hvTR_P10407 hvTR_Q2HR71 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATGACCCCTTGT _A_Concatenation CAGTGACATAGT _B Target(s) Target(s) -0.638970843 0.19699665 2.666397192 0.20783496 -8.021739216 0.057076235 7.263387238 0.073394906 Human target Human target LEUTX EHMT2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT Strong_X_mid
ATTGTATCTAAC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.24455974 0.5252844 0.933374222 0.58407409 8.096956646 0.01463682 7.132367256 0.023033881 Human target Human target SMARCB1 LEUTX VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGATTGTGTCC _A_Concatenation EPICXV.47_B Target(s) pos_ctrl_A 0.84943315 0.01041536 5.292348308 0.01153845 -2.958427043 0.679495956 7.131041645 9.86E-05 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 3x_vIRF2_core PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AAACTTTGAGCG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.186648247 0.55752411 -4.495524771 0.00054941 -8.180239368 0.021245195 7.12411116 0.036860748 Human target Human target ATF6B LEUTX AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AATGGCACTAGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.393476475 0.51381092 3.555384926 0.03980823 8.600579785 0.012328448 7.110854791 0.029747778 Virus_target Virus_target cds_NC_024711.1_cds_YP_009052522.2_44 hvTR_Q2HR71 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
TGCCAGACGGTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.128067764 0.86727966 4.975760846 3.73E-06 -4.555968363 0.293964969 7.094708216 0.00206584 Virus_target Human target hvTR_P10541 LEUTX RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ATTGTATCTAAC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.577875132 0.00077305 5.853344778 8.42E-11 -8.23048064 0.024322502 7.047382471 0.045483182 Human target Virus_target SMARCB1 hvTR_Q2HR73 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATGACCCCTTGT _A_Concatenation CTCACGACAAGA _B Target(s) Target(s) 0.309333447 0.22734303 4.239753439 0.02675203 -9.235195175 0.017210958 6.988338799 0.084897118 Human target Virus_target LEUTX hvTR_G3G929 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP Strong_X_strong
GAGATTGTGTCC _A_Concatenation CACACCCGGCAG _B Target(s) Target(s) 3.018107924 0.00015112 -4.623899275 0.08162571 10.25436772 0.010044477 6.986014056 0.07764548 Virus_target Virus_target hvTR_Q2HR71 hvTR_P19893 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS Strong_X_strong
ATAACTCCACGC _A_Concatenation CATTAGTCCCGC _B pos_ctrl_A Target(s) -1.645745018 0.1490407 4.21019857 0.15762866 7.950859248 0.024855315 6.931958177 0.03998295 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P06930 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF VIRF2_VP64_pos strong_ctrl_x_strong
AGCCGTGCTTGA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.878123356 7.29E-24 4.526014459 3.45E-09 -2.22960665 0.872533238 6.893945788 1.25E-05 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 hvTR_Q2HR71 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
GAGATTGTGTCC _A_Concatenation CCCTATGTTCTA _B Target(s) pos_ctrl_A -0.178079772 0.4482293 -5.111208709 0.00542747 -7.535138383 0.037311924 6.809056893 0.04282005 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 core_VP64_b PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation TGATGAGCGCCC _B Target(s) Target(s) 0.157480431 0.61264864 5.765720157 0.00428769 -8.19539128 0.03963291 6.768714549 0.091603543 Virus_target Human target hvTR_Q2HR71 EEF1D TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL Strong_x_weak
GAGATTGTGTCC _A_Concatenation GAGTGACCTATT _B Target(s) Target(s) 1.730596028 2.15E-47 5.93173213 3.97E-07 7.168872131 0.044830089 6.739547537 0.040651866 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI Strong_x_unknown
CTACGTGGCCCC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.337180443 0.07843366 2.573158999 0.09959926 -5.414924903 0.14287914 6.706022541 0.00670365 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR71 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
AGAGTGCGCTCT _A_Concatenation TCTTGGTTTTGA _B neg_ctrl Target(s) -0.74503331 0.14064671 8.313550125 0.0011281 4.328708948 0.362786872 6.646346356 0.011977532 neg_ctrl Human target Random_1 ZNF597 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL Random_x_strong
AGAGTGCGCTCT _A_Concatenation ATGACAAAACGA _B neg_ctrl Target(s) 1.410529323 9.91E-05 -4.150328263 0.00350005 7.062387535 0.038202744 6.64251345 0.030287396 neg_ctrl Virus_target Random_1 cds_NC_025350.1_cds_YP_009094074.1_1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL Random_x_strong
ACTTGATGGTTT _A_Concatenation CGCCGGCATACA _B Target(s) Target(s) -0.197936285 0.68632574 -7.095341077 8.78E-05 -7.576259922 0.037311924 6.626304899 0.053105771 Virus_target Virus_target hvTR_Q2HR73 cds_NC_030656.1_cds_YP_009268729.1_14 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI Mid_X_strong
ACTTGATGGTTT _A_Concatenation GGAGGCACATCG _B Target(s) Target(s) -0.316675898 0.03929624 3.350519782 0.00159403 -4.420398391 0.293964969 6.602698477 0.003106839 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q87042 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL Mid_X_strong
TGCCAGACGGTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.717532585 4.59E-12 5.872358314 8.76E-11 2.464141404 0.812553322 6.584128232 6.86E-05 Virus_target Virus_target hvTR_P10541 hvTR_Q2HR71 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CCCTGCCTCGGG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.748257237 2.99E-20 6.318333506 2.79E-08 6.460286125 0.057907582 6.579359253 0.021459435 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 hvTR_Q2HR71 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATGACCCCTTGT _A_Concatenation TAAAATTTATCA _B Target(s) Target(s) 1.606770111 4.86E-27 6.057202717 4.82E-07 2.124398883 0.878578286 6.505290123 3.04E-05 Human target Human target LEUTX HSF1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY Strong_X_mid
ATGACCCCTTGT _A_Concatenation GGCACAGCTCCA _B Target(s) Target(s) 0.984042253 0.02618256 6.179487016 0.0022556 -8.45326965 0.032851053 6.497017144 0.103147793 Human target Human target LEUTX DTX2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG Strong_x_unknown
ATGACCCCTTGT _A_Concatenation TCAGCCGTAGGC _B Target(s) Target(s) -0.219410497 0.38922543 6.020366948 4.09E-05 3.857306341 0.412917083 6.393026493 0.002470579 Human target Virus_target LEUTX hvTR_Q805Y1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR Strong_X_mid
GGCCTTTCGGGC _A_Concatenation ACTTGATGGTTT _B neg_ctrl Target(s) 0.037292078 0.95906455 0.673923387 0.78843818 -6.958853731 0.122481178 6.35425437 0.112333694 neg_ctrl Virus_target Random_3 hvTR_Q2HR73 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Random_x_mid
ATGACCCCTTGT _A_Concatenation CATTGTTTAAAT _B Target(s) Target(s) -0.654300763 0.86442316 2.70043121 3.19850768 6.340800955 Human target Human target LEUTX POLR1E DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV Strong_x_unknown
ACTTGATGGTTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -3.506945951 0.5449371 4.276073406 0.10260338 3.19850768 6.340800955 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q2HR71 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CTACGTGGCCCC _A_Concatenation CATTGTTTAAAT _B Target(s) Target(s) 0.275748237 0.81782989 -5.130312394 0.00212861 3.19850768 6.340800955 Virus_target Human target hvTR_Q2HR71 POLR1E TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV Strong_x_unknown
ATGACCCCTTGT _A_Concatenation TGGCCCCCAGTT _B Target(s) Target(s) 1.35649995 0.5449371 3.909200449 2.734260423 6.319993521 Human target Virus_target LEUTX hvTR_Q3I8P6 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN Strong_x_weak
CTGCCGGTTGCG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.717439327 0.00016173 4.785605962 0.00061704 6.902284204 0.032851053 6.296303404 0.029747778 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 hvTR_Q2HR71 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATGACCCCTTGT _A_Concatenation ACTTTAGTAACA _B Target(s) Target(s) -0.341033992 0.00096247 3.391356102 0.0053351 -1.797926493 0.957192732 6.261209443 3.16E-05 Human target Virus_target LEUTX hvTR_Q69113 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES Strong_X_strong
CAACGCATGCCT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.850450243 0.03846116 3.365446093 0.1690105 7.805674581 0.019323708 6.251352533 0.051926957 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 LEUTX YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
GAGTGACCTATT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.606450251 0.93141616 1.978973151 0.33412121 -2.112805223 6.230613334 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 hvTR_Q2HR73 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
CAGTGACATAGT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.889230969 0.0994125 4.709347227 0.02096729 8.240196255 0.032851053 6.212455053 0.110875085 Human target neg_ctrl EHMT2 Random_1 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CTACGTGGCCCC _A_Concatenation ATGACAAAACGA _B Target(s) Target(s) -0.325532863 8.79E-05 2.987947112 0.01328684 -1.170979709 0.997690695 6.205129932 1.80E-06 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094074.1_1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL Strong_X_strong
AGAGTGCGCTCT _A_Concatenation TGATGAGCGCCC _B neg_ctrl Target(s) 0.674483413 2.99E-05 4.838687807 0.01658458 -1.776214416 0.95807515 6.193449665 2.74E-05 neg_ctrl Human target Random_1 EEF1D CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL Random_x_weak
AGAGTGCGCTCT _A_Concatenation ATTTGTAGACCG _B neg_ctrl Target(s) -0.418006152 5.99E-07 3.323023044 0.00572381 -1.480915271 0.987041604 6.188928586 6.07E-06 neg_ctrl Human target Random_1 MYCN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE Random_x_mid
CATTAGTCCCGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.372980136 0.79871251 2.284801106 0.23647027 -8.477170869 0.024008997 6.174686499 0.105019042 Virus_target Virus_target hvTR_P06930 hvTR_Q2HR71 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATAACTCCACGC _A_Concatenation TCTGTACCAACG _B pos_ctrl_A Target(s) 0.541422669 0.01223661 5.406175504 0.0045335 -4.445832933 0.282213439 6.161499231 0.007678852 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q6XA69 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY VIRF2_VP64_pos ctrl_x_unknown
CTACGTGGCCCC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.361809382 0.0008871 3.193455072 0.00893844 -1.427185386 0.995270541 6.159230965 1.06E-05 Virus_target neg_ctrl hvTR_Q2HR71 Random_1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
GATCGCATGTTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.200750408 0.76651497 5.031563292 0.02106667 7.535132388 0.063801421 6.119721715 0.119830883 Human target Human target INPP4A LEUTX NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ACTTGATGGTTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.336722369 0.01420547 3.245423411 0.00752982 -1.46942369 0.987041604 6.111315085 1.51E-05 Virus_target Human target hvTR_Q2HR73 LEUTX RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GGGAGTATGTCG _A_Concatenation GAGATTGTGTCC _B neg_ctrl Target(s) -4.044708888 0.26523132 3.522458847 0.17663032 -4.582413654 6.078038084 neg_ctrl Virus_target Random_4 hvTR_Q2HR71 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Random_x_strong
ATAACTCCACGC _A_Concatenation AAGCAACGATAT _B pos_ctrl_A Target(s) 0.126628979 0.85681444 -5.322501137 4.73E-05 -3.496584394 0.51410768 5.999225039 0.005938192 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL vIRF2_VP64_pos_ctrl_X_mid
CGCCGGCATACA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.658231269 5.51E-10 3.269640187 0.03888733 -7.049005246 0.034542461 5.967395052 0.053093786 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268729.1_14 hvTR_Q2HR71 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
TATCGCAAGAAC _A_Concatenation ACTTGATGGTTT _B neg_ctrl Target(s) -0.766526056 8.29E-12 3.02075261 0.01096213 6.278935959 0.057490126 5.963147582 0.037017444 neg_ctrl Virus_target Random_5 hvTR_Q2HR73 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Random_x_mid
AGAGTGCGCTCT _A_Concatenation TAAGATTCAACG _B neg_ctrl Target(s) 0.772902269 0.60833425 -0.461147923 9.550580489 0.011217716 5.941055263 0.12609254 neg_ctrl Human target Random_1 ZNF292 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP Random_x_strong
CTACGTGGCCCC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.134604353 0.72546447 -2.279760169 0.10968094 -6.760429115 0.070775661 5.880306523 0.079863392 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR73 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATAACTCCACGC _A_Concatenation AGAGTGCGCTCT _B pos_ctrl_A neg_ctrl 0.538570889 0.73293852 -2.56907084 0.16091351 5.339876323 0.157574718 5.810337233 0.038754945 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Random_1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VIRF2_VP64_pos ctrl_x_random
TCTATCACAACT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.725875387 3.422691034 0.09936833 -5.867398127 0.192744457 5.782993122 0.066071339 Human target Virus_target TOX4 hvTR_Q2HR73 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CTGGTAGTTGTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.014430461 0.9854934 2.555604322 -2.945954063 0.666430638 5.751999664 0.010952256 Virus_target Human target hvTR_F6KS47 LEUTX SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
TCTATCACAACT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 2.450058918 0.66445658 4.930459559 0.00659263 0.549259713 5.705303242 Human target neg_ctrl TOX4 Random_1 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CTACGTGGCCCC _A_Concatenation TCTTAGAGGGTG _B Target(s) Target(s) 0.047194761 0.88448623 3.34870272 0.094757 6.161150021 0.083289714 5.700507043 0.057793972 Virus_target Human target hvTR_Q2HR71 TRO TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL Strong_X_strong
TAAAATTTATCA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.005357119 0.434377 5.264303211 -6.177765888 5.696845592 Human target neg_ctrl HSF1 Random_1 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ACCAGTATTTAA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.625794165 0.34506942 -2.302547309 0.2598498 -6.479473449 0.143457767 5.659119579 0.14600867 Human target Human target ARNTL2 LEUTX NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
GAGATTGTGTCC _A_Concatenation CTCAGCGATATA _B Target(s) Target(s) -0.884299816 0.00030933 -2.44964826 0.15212449 4.214676328 0.294130007 5.650487209 0.013542366 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q5Y0Q4 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG Strong_X_strong
AGAGTGCGCTCT _A_Concatenation CTCGCACCGAGG _B neg_ctrl Target(s) -2.248100258 0.27973193 1.91838485 0.22163368 -3.992840406 5.642017357 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN Random_x_mid
TGATGAGCGCCC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.128203736 0.81690552 4.275579049 0.00201847 -4.232883325 0.293489659 5.641065858 0.013342514 Human target Virus_target EEF1D hvTR_Q2HR71 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
ACTTGATGGTTT _A_Concatenation CTCACGACAAGA _B Target(s) Target(s) 0.512788805 0.19856441 -6.761446257 9.73E-06 6.931387361 0.044830089 5.613397995 0.083931183 Virus_target Virus_target hvTR_Q2HR73 hvTR_G3G929 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP Mid_X_strong
ATGACAAAACGA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.767556927 0.59461055 -1.460186931 0.49066643 -8.786311219 0.0333998 5.602273733 0.141823166 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094074.1_1 hvTR_Q2HR73 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTACGTGGCCCC _A_Concatenation AGCCATAAACCT _B Target(s) Target(s) 0.604078556 0.01809398 3.294447575 0.0030859 -6.459509617 0.157574718 5.573094039 0.166180187 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03209 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ Strong_X_mid
ACTTGATGGTTT _A_Concatenation CAGTGACATAGT _B Target(s) Target(s) -0.868991746 0.00328682 2.434942105 0.24676367 -6.170530665 0.132193841 5.527243828 0.105824256 Virus_target Human target hvTR_Q2HR73 EHMT2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT mid_X_mid
AGAGTGCGCTCT _A_Concatenation GCAACGAGGGTC _B neg_ctrl Target(s) -0.73672478 0.28451306 2.960379355 0.2079163 4.622730402 0.219924157 5.518910518 0.020523923 neg_ctrl Human target Random_1 QRICH2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ Random_x_mid
CATCTGAGTCGT _A_Concatenation GAGATTGTGTCC _B pos_ctrl_A Target(s) 0.68144834 3.41E-08 4.647580585 7.29E-05 -1.749733886 0.953105946 5.493306706 0.000198815 vIRF2_VP64_pos_ctrl Virus_target RWY_VP16_vIRF2 hvTR_Q2HR71 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VIRF2_VP64_pos strong_ctrl_x_strong
TATCGCAAGAAC _A_Concatenation CTACGTGGCCCC _B neg_ctrl Target(s) -0.437809724 8.07E-05 3.990982145 0.00080811 2.941277101 0.624937505 5.478556335 0.003715881 neg_ctrl Virus_target Random_5 hvTR_Q2HR71 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Random_x_strong
ATGACCCCTTGT _A_Concatenation CATCTCACCTAG _B Target(s) Target(s) -0.616296947 0.00211261 -1.818350993 0.25412646 4.014576765 0.331166233 5.471178659 0.015867719 Human target Virus_target LEUTX hvTR_P03206 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA Strong_X_mid
GAGATTGTGTCC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.448570292 1.14E-36 2.198877341 0.06287927 -5.50748689 0.097296843 5.458903719 0.03571309 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR73 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
AGAGTGCGCTCT _A_Concatenation CATCTCACCTAG _B neg_ctrl Target(s) 0.57712913 0.27877822 3.749769461 0.06126631 -6.027259113 0.199492182 5.430632539 0.166180187 neg_ctrl Virus_target Random_1 hvTR_P03206 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA Random_x_mid
TATCGCAAGAAC _A_Concatenation GAGATTGTGTCC _B neg_ctrl Target(s) -0.62509196 4.51E-13 5.523717251 3.18E-05 -1.348705749 0.996490702 5.38039431 6.86E-05 neg_ctrl Virus_target Random_5 hvTR_Q2HR71 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Random_x_strong
ATAACTCCACGC _A_Concatenation Joint_ 3_B pos_ctrl_A neg_ctrl -1.301941061 0.61787605 -2.06316216 0.5129823 3.252755424 5.367283212 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Joint_3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD VIRF2_VP64_pos ctrl X joint
AGCCGTGCTTGA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.01277519 0.94601266 5.007069331 0.00039124 -0.437400967 0.997690695 5.324969817 1.08E-06 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 LEUTX GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
TCTGTACCAACG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 3.73784303 0.1062238 -1.714863241 2.692466488 5.307560891 Virus_target Virus_target hvTR_Q6XA69 hvTR_Q2HR73 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
GAGATTGTGTCC _A_Concatenation ATGACAAAACGA _B Target(s) Target(s) 0.414630977 0.04988573 -1.555194347 0.36688342 -6.922708071 0.038202744 5.307523378 0.096401552 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094074.1_1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL Strong_X_strong
CACACCCGGCAG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.721267944 3.00E-06 4.803838669 0.00021381 -2.259591951 0.827084874 5.302231035 0.002219785 Virus_target Virus_target hvTR_P19893 hvTR_Q2HR71 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CTACGTGGCCCC _A_Concatenation CCCCTTAGGAAT _B Target(s) Target(s) -0.415735662 0.00342394 4.10137785 0.01608804 5.772574839 0.094980347 5.25076635 0.064978326 Virus_target Human target hvTR_Q2HR71 SYNE1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD Strong_X_mid
ACTTGATGGTTT _A_Concatenation CCGCCCTTATGT _B Target(s) Target(s) -2.566978842 0.17011976 0.124097897 0.95625121 -1.872655101 5.246885141 Virus_target Human target hvTR_Q2HR73 ZFX RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH mid_X_mid
ACTTGATGGTTT _A_Concatenation CATTAGTCCCGC _B Target(s) Target(s) 0.872044681 4.58E-08 2.318146865 0.14351379 -1.723372103 0.953105946 5.119439014 0.000956228 Virus_target Virus_target hvTR_Q2HR73 hvTR_P06930 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF Mid_X_strong
ACTTGATGGTTT _A_Concatenation GCAACGAGGGTC _B Target(s) Target(s) 0.071133467 0.82429662 2.390838029 0.17619603 -5.15379121 0.132483935 5.098128037 0.049730741 Virus_target Human target hvTR_Q2HR73 QRICH2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ mid_X_mid
GCAACGAGGGTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 2.703348434 0.00517376 3.418632406 0.05506627 -5.72392502 5.068979119 Human target Virus_target QRICH2 hvTR_Q2HR71 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AGAGTGCGCTCT _A_Concatenation ACTCGTGTTGTC _B neg_ctrl Target(s) 0.568844848 0.11961968 5.1190242 0.01166474 -4.488430838 0.224079904 5.051946961 0.037329215 neg_ctrl Virus_target Random_1 hvTR_Q9QB97 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT Random_x_mid
AGAGTGCGCTCT _A_Concatenation AAGGGTAAATTG _B neg_ctrl Target(s) 2.25023623 8.92E-13 7.857077919 3.60E-10 -2.821647733 0.634646987 5.035129856 0.007557258 neg_ctrl Virus_target Random_1 hvTR_Q69513 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS Random_x_mid
Stop-1_A/u splice_ ACTTGATGGTTT _b neg_ctrl Target(s) -0.685381147 0.21779974 2.599760865 0.29336258 5.914021925 0.102621121 5.017089571 0.098722329 neg_ctrl Virus_target Stop_1 hvTR_Q2HR73 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Termination_X_mid
ACTTTAGTAACA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.978906718 0.29193745 -0.556849683 0.70600688 7.275519454 0.057907582 4.985211359 0.205419389 Virus_target Human target hvTR_Q69113 LEUTX NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ATAACTCCACGC _A_Concatenation CTCGTTATGGCA _B pos_ctrl_A Target(s) -0.620434447 0.7723094 1.942028427 0.37601 -3.802465784 4.906448941 vIRF2_VP64_pos_ctrl Human target core_VP64_a NOVA1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF vIRF2_VP64_pos_ctrl_X_mid
AGAGTGCGCTCT _A_Concatenation CTGCCGGTTGCG _B neg_ctrl Target(s) -1.006323764 0.00702337 2.40315768 -5.31606666 0.238404099 4.904881664 0.156071982 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL Random_x_mid
ACTTGATGGTTT _A_Concatenation CTCGCACCGAGG _B Target(s) Target(s) 0.027329402 0.93141616 3.6256305 0.10029123 5.262055987 0.147334557 4.903163109 0.083931183 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN mid_X_mid
CTACGTGGCCCC _A_Concatenation CTGGTAGTTGTT _B Target(s) Target(s) -0.642305999 5.44E-08 4.607495407 0.01080066 -2.171575653 0.834609851 4.881624466 0.00409792 Virus_target Virus_target hvTR_Q2HR71 hvTR_F6KS47 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT Strong_x_unknown
AGCCGTGCTTGA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.713838 8.42E-15 4.721963507 0.00010641 3.01746559 0.572967985 4.871259689 0.013542366 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 Random_1 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
TTAGTGCCATAC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.458630922 0.9002656 -4.311999143 0.19857061 -4.368937873 4.861460917 Human target neg_ctrl TET3 Random_1 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ACTTGATGGTTT _A_Concatenation AAGGGTAAATTG _B Target(s) Target(s) 0.236760216 0.41025376 2.484753585 0.08844163 6.327382813 0.049730355 4.854162722 0.103147793 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q69513 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS mid_X_mid
ACTTGATGGTTT _A_Concatenation Joint_ 1_B Target(s) neg_ctrl -0.448839266 1.61E-08 1.091281679 0.48142569 -0.769203873 0.997690695 4.846880808 4.80E-05 Virus_target neg_ctrl hvTR_Q2HR73 Joint_1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK Mid_X_joint
GCGAGGCAGACC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.495120472 0.33149001 -5.675425183 0.00016952 -4.048654583 0.293489659 4.840659795 0.032446636 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 core_VP64_a QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
TTTAACGGATTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.313455045 0.72633824 -5.046716429 0.00109019 6.867457986 0.032851053 4.829196686 0.114829927 Human target Virus_target TWIST2 hvTR_Q2HR71 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ACGGCGTAAGTA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.152739419 1.86107202 0.50348283 0.626306341 4.819065922 Virus_target Human target cds_NC_025350.1_cds_YP_009094078.1_5 LEUTX LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATCCAATCTGTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 3.225471538 0.1798316 -1.231031929 0.626306341 4.819065922 Human target Human target GLI2 LEUTX PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATGACCCCTTGT _A_Concatenation TACATGAGGCAC _B Target(s) Target(s) -1.29829054 0.78044287 -3.78994125 0.18898801 0.626306341 4.819065922 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL Strong_X_mid
GAGTGACCTATT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.766177256 0.91394475 8.061353226 0.0004211 0.626306341 4.819065922 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 hvTR_Q2HR71 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
ATCTCTCAGTTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.401493697 0.93141616 3.915678524 0.01832212 0.626306341 4.819065922 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 LEUTX VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CTACGTGGCCCC _A_Concatenation AGTTGCTTTGCT _B Target(s) neg_ctrl 0.024087327 0.99110789 1.320373251 0.62046092 0.626306341 4.819065922 Virus_target neg_ctrl hvTR_Q2HR71 Random_2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM Strong_X_random
GAGTGACCTATT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -2.008848691 0.50396632 2.531243998 0.09245164 0.626306341 4.819065922 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 hvTR_Q2HR71 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
ACTCGTGTTGTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 5.888128791 0.11942756 4.177353315 0.03154261 0.626306341 4.819065922 Virus_target Human target hvTR_Q9QB97 LEUTX VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGGCTTCCCGCT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.063695227 2.383584984 0.626306341 4.819065922 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 LEUTX VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATGACCCCTTGT _A_Concatenation GACTGTGGGCGG _B Target(s) Target(s) 0.400705299 0.94601266 2.660356044 0.2646259 0.626306341 4.819065922 Human target Human target LEUTX MON2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL Strong_X_strong
AAGGGTAAATTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.411486358 0.92304294 0.626306341 4.819065922 Virus_target Virus_target hvTR_Q69513 hvTR_Q2HR73 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CCGCCCTTATGT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -2.874592287 0.04755468 2.526318689 0.26334677 0.626306341 4.819065922 Human target Virus_target ZFX hvTR_Q2HR73 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATAACTCCACGC _A_Concatenation ATAACTCCACGC _B pos_ctrl_A pos_ctrl_A 2.872711071 0.58303182 3.88115205 0.626306341 4.819065922 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_a core_VP64_a LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
AAGGGTAAATTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.429753257 0.22734303 1.879481815 0.61804847 0.626306341 4.819065922 Virus_target Virus_target hvTR_Q69513 hvTR_Q2HR71 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CCAGACGGTCTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.677957275 0.52901596 -2.991824273 0.626306341 4.819065922 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 hvTR_Q2HR71 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
GTACATACCGAA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -5.753126713 0.02537355 3.348134157 0.12383329 0.626306341 4.819065922 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 Random_1 NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGATTGTGTCC _A_Concatenation Stop_1_B Target(s) neg_ctrl 3.924025072 0.08319708 3.437785233 0.06755234 0.626306341 4.819065922 Virus_target neg_ctrl hvTR_Q2HR71 Stop_1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK Strong_X_termination
ATGACCCCTTGT _A_Concatenation TACTTCTAGCGG _B Target(s) Target(s) 0.573167945 0.93141616 4.314475304 0.04580015 0.626306341 4.819065922 Human target Virus_target LEUTX cds_NC_025350.1_cds_YP_009094080.1_7 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ Strong_x_weak
ATGACCCCTTGT _A_Concatenation ATTGTATCTAAC _B Target(s) Target(s) 0.530604996 0.82433714 -2.554333293 0.37627633 0.626306341 4.819065922 Human target Human target LEUTX SMARCB1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD Strong_X_mid
GAGACAGCTCTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.042591165 0.99110789 1.753604194 0.42967531 0.626306341 4.819065922 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 core_VP64_a LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
TATTAAAAGTGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 4.513738497 0.09354755 3.180465029 0.008531041 4.817650637 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_202 hvTR_Q2HR71 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
GCAGATTCGAAT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.968965852 0.74622616 2.714505633 0.008531041 4.817650637 Human target Human target AK9 LEUTX KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AATCGAGACCAG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.01068637 0.99738617 1.674848326 0.52560211 0.008531041 4.817650637 Virus_target Virus_target hvTR_Q8BB47 hvTR_Q2HR71 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CCGCCCTTATGT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.048836165 3.597606602 0.23767729 0.008531041 4.817650637 Human target neg_ctrl ZFX Random_1 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AGAGTGCGCTCT _A_Concatenation TATTAAAAGTGC _B neg_ctrl Target(s) 4.210264422 0.02293696 -2.178779714 0.008531041 4.817650637 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_202 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI Random_x_weak
ATGACCCCTTGT _A_Concatenation GGGAGTATGTCG _B Target(s) neg_ctrl 5.719757257 0.09831818 3.376466819 0.24054738 0.008531041 4.817650637 Human target neg_ctrl LEUTX Random_4 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS Strong_X_random
CAGTGACATAGT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.633103951 0.23357818 6.961728472 0.00062691 6.014135995 0.070775661 4.806155772 0.098005281 Human target Virus_target EHMT2 hvTR_Q2HR71 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CATCTGAGTCGT _A_Concatenation ACTTGATGGTTT _B pos_ctrl_A Target(s) 0.514602955 0.00297531 -1.936876258 0.10515713 -5.433552896 0.157574718 4.778675958 0.120169648 vIRF2_VP64_pos_ctrl Virus_target RWY_VP16_vIRF2 hvTR_Q2HR73 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR vIRF2_VP64_pos_ctrl_X_mid
CTACGTGGCCCC _A_Concatenation GTTCCACGCCTG _B Target(s) Target(s) 1.746719742 5.37E-82 4.776255178 9.55E-07 -3.152487146 0.527905197 4.734938002 0.019978168 Virus_target Human target hvTR_Q2HR71 ZFX TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS Strong_X_strong
CGCCGGCATACA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.686467577 1.31E-10 4.62986365 0.01341224 -2.292600758 0.796218089 4.73466062 0.006506022 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268729.1_14 hvTR_Q2HR73 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
TAAAATTTATCA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.340331079 0.48494031 4.055025037 0.06313276 6.504154619 0.035023401 4.621186152 0.114829927 Human target Virus_target HSF1 hvTR_Q2HR71 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
AGCCGTGCTTGA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.575043791 5.18E-09 3.885897319 0.00081919 -4.869311352 0.159642332 4.580420429 0.07853916 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 hvTR_Q2HR71 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
Joint_ 3_A/u splice_ AGAGTGCGCTCT _b neg_ctrl neg_ctrl -0.514511925 0.0391193 4.36437024 0.09576001 -4.145313298 0.276778688 4.557544939 0.053093786 neg_ctrl neg_ctrl Joint_3 Random_1 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Joint_X_random
CTCAGCGATATA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.056408024 0.94791391 2.765175233 0.1690105 5.928892386 0.147189779 4.526731112 0.211428465 Virus_target Virus_target hvTR_Q5Y0Q4 hvTR_Q2HR71 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
CTGCCGGTTGCG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.083963244 1.96E-29 4.082111304 5.02E-05 1.009824996 0.997690695 4.521248356 0.000437289 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 LEUTX LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GCAGATTCGAAT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.522836526 0.82433714 2.714505633 -4.679438388 4.4847972 Human target neg_ctrl AK9 Random_1 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CTACGTGGCCCC _A_Concatenation GCGAGGCAGACC _B Target(s) Target(s) -0.581747506 0.37171203 -2.686570817 -7.910589289 0.037998675 4.43466331 0.218195585 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD Strong_x_unknown
CGCCGGCATACA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.592005368 0.7442977 2.482137414 0.14747479 -4.933713028 4.353122451 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268729.1_14 hvTR_Q2HR71 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ACTTGATGGTTT _A_Concatenation Joint_ 3_B Target(s) neg_ctrl -0.684616958 3.10E-07 2.71765269 0.13687823 -5.222847873 0.15759116 4.323339925 0.149075572 Virus_target neg_ctrl hvTR_Q2HR73 Joint_3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD Mid_X_joint
AGAGTGCGCTCT _A_Concatenation ACAACGTGCATA _B neg_ctrl Target(s) -0.119715682 0.88343822 -2.872423564 0.09697966 -4.945676675 0.291084293 4.287915513 0.218195585 neg_ctrl Virus_target Random_1 hvTR_P10407 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET Random_x_strong
ACTTTAGTAACA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.795019997 1.56E-13 2.337542575 0.05487336 4.553566667 0.190780897 4.267868557 0.084897118 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q69113 core_VP64_a NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CGATGGCGGAAT _B neg_ctrl Target(s) 2.379321465 0.29119771 -3.172461621 0.14019482 -1.529585356 4.249434651 neg_ctrl Human target Random_1 KIAA1586 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG Random_x_weak
ACTTGATGGTTT _A_Concatenation TCTTAGAGGGTG _B Target(s) Target(s) -0.816754023 4.10E-15 2.416506618 0.02599406 -1.00954376 0.997690695 4.234655018 0.001244361 Virus_target Human target hvTR_Q2HR73 TRO RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL Mid_X_strong
AAACTTTGAGCG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.896167783 0.00180764 -4.589662802 0.00780178 5.182701598 0.121748841 4.200855522 0.121024915 Human target Virus_target ATF6B hvTR_Q2HR71 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
Joint_1_A/u splice_ ACTTGATGGTTT _b neg_ctrl Target(s) -1.321759131 0.23382733 5.374561966 -4.805853555 0.293964969 4.13584173 0.219085969 neg_ctrl Virus_target Joint_1 hvTR_Q2HR73 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Joint_X_mid
CATTAGTCCCGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.001723186 0.99110789 3.840432029 0.00021381 5.192558181 0.102803489 4.135044146 0.114829927 Virus_target vIRF2_VP64_pos_ctrl hvTR_P06930 core_VP64_a AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
TGATGAGCGCCC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.5606855 3.81E-07 4.882370118 5.26E-05 5.202123424 0.118326543 4.132686783 0.129741023 Human target Virus_target EEF1D hvTR_Q2HR71 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
ATAACTCCACGC _A_Concatenation TAAAATTTATCA _B pos_ctrl_A Target(s) -2.017912696 0.00656289 -7.129459616 1.79E-05 -4.75325373 0.224041844 4.12666021 0.15725643 vIRF2_VP64_pos_ctrl Human target core_VP64_a HSF1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY vIRF2_VP64_pos_ctrl_X_mid
GTTCCACGCCTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.59342721 0.23227699 -4.850593119 4.70E-05 5.376784959 0.113496245 4.073639666 0.156896347 Human target vIRF2_VP64_pos_ctrl ZFX core_VP64_a VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation GAGTGACCTATT _B Target(s) Target(s) -1.212915259 0.00611211 -3.266828623 0.02354207 5.225630787 0.119647439 4.056596789 0.145553091 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI Strong_x_unknown
ATAACTCCACGC _A_Concatenation CGGCAATTGCTT _B pos_ctrl_A Target(s) -0.886433135 0.82490703 -9.186465444 2.47975804 3.986325535 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q9QB93 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK vIRF2_VP64_pos_ctrl_X_mid
CTCAGCGATATA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.7946659 0.3917492 -4.364284668 0.01471201 4.567585339 0.376738345 3.947140588 0.280453332 Virus_target neg_ctrl hvTR_Q5Y0Q4 Random_1 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation CTCGCACCGAGG _B pos_ctrl_A Target(s) 0.038259789 0.94973077 -6.286238265 5.17E-06 5.406406776 0.102782462 3.91850514 0.170652887 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN vIRF2_VP64_pos_ctrl_X_mid
TCTATCACAACT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.250851938 0.92181589 4.155766861 0.00258074 -3.587807756 0.396424471 3.906218412 0.082429628 Human target Virus_target TOX4 hvTR_Q2HR71 LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AGAGTGCGCTCT _A_Concatenation GTACATACCGAA _B neg_ctrl Target(s) -0.45901124 0.00909985 4.892300443 0.00064194 5.019730946 0.143201686 3.862706708 0.166769969 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE Random_x_mid
AATCGAGACCAG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -3.018473907 0.56300311 0.971871461 0.71938819 0.829279031 3.846699138 Virus_target neg_ctrl hvTR_Q8BB47 Random_1 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CCCGTAGGGGCT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.388035943 0.0391193 0.502144298 0.86913847 0.829279031 3.846699138 Human target Virus_target TET3 hvTR_Q2HR71 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ACTCGTGTTGTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.686695082 0.56169587 4.179516292 0.0883946 0.829279031 3.846699138 Virus_target Virus_target hvTR_Q9QB97 hvTR_Q2HR71 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATGACCCCTTGT _A_Concatenation TTCTAATTTCCT _B Target(s) Target(s) 0.513338906 0.84209077 -0.965969031 0.70600688 0.829279031 3.846699138 Human target Human target LEUTX WDR76 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS Strong_X_strong
CTACGTGGCCCC _A_Concatenation TAAGATTCAACG _B Target(s) Target(s) 2.388360107 0.24325835 -0.787007754 0.73748128 0.829279031 3.846699138 Virus_target Human target hvTR_Q2HR71 ZNF292 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP Strong_X_strong
GTTCCACGCCTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.877386641 0.75207104 3.063860648 0.04159867 0.829279031 3.846699138 Human target Human target ZFX LEUTX VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
GGCACAGCTCCA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.62755326 0.86342423 -0.693155365 0.80253436 0.281549106 3.818788286 Human target Human target DTX2 LEUTX TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
Stop-1_A/u splice_ CTACGTGGCCCC _b neg_ctrl Target(s) 0.683907776 0.86342423 1.168097043 0.66299712 0.281549106 3.818788286 neg_ctrl Virus_target Stop_1 hvTR_Q2HR71 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Termination_X_strong
AATCGAGACCAG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.995661307 0.80641597 -0.960540005 0.6995053 0.281549106 3.818788286 Virus_target Human target hvTR_Q8BB47 LEUTX EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AGCCATAAACCT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.362737289 0.86892322 -3.680463876 0.08080854 -4.985144597 0.235018356 3.815874487 0.240319952 Virus_target Human target hvTR_P03209 LEUTX PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGACAGCTCTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.107184315 0.00106928 2.222771652 -4.474190807 0.308007813 3.79551466 0.227978501 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 hvTR_Q2HR71 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATTTGTAGACCG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -2.95041379 0.12947864 -4.42781772 0.13874298 -0.232933756 3.790388015 Human target Virus_target MYCN hvTR_Q2HR73 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GAGTGACCTATT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -4.296574815 0.03859914 -0.980159177 0.63235965 -0.232933756 3.790388015 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 LEUTX HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
ACTTGATGGTTT _A_Concatenation TTCTAATTTCCT _B Target(s) Target(s) 0.191073585 0.53702369 5.958282976 3.50E-05 4.521245673 0.221075186 3.769588171 0.166769969 Virus_target Human target hvTR_Q2HR73 WDR76 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS Mid_X_strong
ACTTGATGGTTT _A_Concatenation TGCCAGACGGTC _B Target(s) Target(s) 1.143390021 1.09E-31 3.417055602 0.00415692 -2.992769055 0.519811123 3.765361464 0.057793972 Virus_target Virus_target hvTR_Q2HR73 hvTR_P10541 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG Mid_X_strong
GAGATTGTGTCC _A_Concatenation AAGGGTAAATTG _B Target(s) Target(s) 1.83968484 0.11077416 -0.927064695 0.56218914 -0.820792161 3.732955313 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q69513 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS Strong_X_mid
ATAACTCCACGC _A_Concatenation CATCTCACCTAG _B pos_ctrl_A Target(s) -0.57261756 0.00120671 4.85596544 0.00976768 3.710777737 0.312984371 3.705480381 0.097385987 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P03206 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA vIRF2_VP64_pos_ctrl_X_mid
ACCAGTATTTAA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.308151321 0.52243449 1.063741737 0.55594745 -3.714882772 3.702358872 Human target Virus_target ARNTL2 hvTR_Q2HR71 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
GAGATTGTGTCC _A_Concatenation ATTTGTAGACCG _B Target(s) Target(s) -0.139032733 0.77018502 4.981691653 0.00017155 4.592913464 0.199492182 3.641056831 0.185259163 Virus_target Human target hvTR_Q2HR71 MYCN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE Strong_X_mid
ATGACCCCTTGT _A_Concatenation CCAGACGGTCTG _B Target(s) Target(s) 2.027904962 9.34E-28 4.303894712 3.621857425 0.412917083 3.617434647 0.166769969 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN Strong_x_weak
GTACATACCGAA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.187917836 0.8328782 1.888354122 3.470299044 0.480471365 3.604969731 0.170492804 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 LEUTX NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGAGTGCGCTCT _A_Concatenation CCCTATGTTCTA _B neg_ctrl pos_ctrl_A 1.49162859 0.00035572 -4.847063634 0.09956659 -4.178763208 3.551565897 neg_ctrl vIRF2_VP64_pos_ctrl Random_1 core_VP64_b CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
CAGTGACATAGT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.993927375 0.00011273 3.68919542 0.03872928 4.22307018 0.240423389 3.52421743 0.161820004 Human target vIRF2_VP64_pos_ctrl EHMT2 core_VP64_a AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation GATGGAGCTACA _B Target(s) Target(s) -0.167795738 0.66495342 -2.557751966 0.2429528 -4.325988601 0.300125168 3.510619438 0.240710289 Virus_target Virus_target hvTR_Q2HR73 hvTR_P03209 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF Mid_X_strong
CTACGTGGCCCC _A_Concatenation AATCGATCCTAC _B Target(s) Target(s) 0.503436447 0.36968355 -3.866815764 0.02599858 4.334605399 0.248335977 3.327977707 0.226429785 Virus_target Human target hvTR_Q2HR71 PRDM10 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA Strong_X_mid
ACTTTAGTAACA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.538014265 0.28696374 -0.597156974 0.75729949 -3.732806137 0.396731112 3.323510759 0.22237192 Virus_target Virus_target hvTR_Q69113 hvTR_Q2HR73 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ACTTGATGGTTT _A_Concatenation AAGACCGGTGCC _B Target(s) Target(s) 0.175128128 0.81658896 -1.994365446 0.10327598 1.769793744 0.899906723 3.288423147 0.083931183 Virus_target Virus_target hvTR_Q2HR73 hvTR_P08392 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF mid_X_mid
TATTAAAAGTGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.400837758 0.93141616 -3.112909338 0.32315075 0.992478895 3.265158543 Virus_target Human target cds_NZ_WFIY01000004.1_cds_202 LEUTX MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
AATGGCACTAGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.58686097 0.81782989 -0.71508607 0.74513376 0.992478895 3.265158543 Virus_target vIRF2_VP64_pos_ctrl cds_NC_024711.1_cds_YP_009052522.2_44 core_VP64_a EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation GCGAGGCAGACC _B Target(s) Target(s) -1.11793553 3.54E-06 0.012481178 0.99447953 4.162533808 0.33827099 3.25963233 0.287660953 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD Mid_X_unknown
ACTTGATGGTTT _A_Concatenation TCTTGGTTTTGA _B Target(s) Target(s) 0.769503539 2.44E-06 3.629850618 0.04213749 2.712895042 0.580611989 3.258274325 0.098722329 Virus_target Human target hvTR_Q2HR73 ZNF597 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL Mid_X_strong
AGAGTGCGCTCT _A_Concatenation CATTGTTTAAAT _B neg_ctrl Target(s) -0.165624884 0.94311303 -4.768067996 0.17464411 -9.368168006 0.031295368 3.253545177 0.563387292 neg_ctrl Human target Random_1 POLR1E CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV Random_x_unknown
TACATGAGGCAC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -2.801505774 6.208085109 0.00919494 0.484522458 3.242086421 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 hvTR_Q2HR73 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATGACCCCTTGT _A_Concatenation CTCGCACCGAGG _B Target(s) Target(s) 5.334338063 4.83E-06 -4.745179292 0.13430239 0.039583014 3.225292967 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN Strong_X_mid
CTCGTTATGGCA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.26760315 0.28757726 -1.78658158 0.38421781 -2.568606871 3.219257258 Human target neg_ctrl NOVA1 Random_1 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ATGTGGCACGAC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -2.479464704 0.47019918 -2.181884034 -0.421182688 3.202325748 Human target Human target ANKRD17 LEUTX QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGATTGTGTCC _A_Concatenation AATCGAGACCAG _B Target(s) Target(s) 1.041691665 0.80641597 4.04599926 0.2646259 -0.421182688 3.202325748 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q8BB47 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH Strong_X_strong
AGCCGTGCTTGA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.976808483 2.44E-21 4.108451372 0.00127782 -1.146836102 0.997690695 3.116594452 0.040651866 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 hvTR_Q2HR73 GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
AATCAACGAGCA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.871013409 2.26E-29 5.450865513 7.96E-12 4.000179191 0.268392054 3.109190226 0.218195585 Human target Virus_target C2orf81 hvTR_Q2HR71 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATAACTCCACGC _A_Concatenation CGCCGGCATACA _B pos_ctrl_A Target(s) -0.073923687 0.9134757 -1.115277239 0.5778329 -6.991681549 0.037311924 3.085206611 0.415694411 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_030656.1_cds_YP_009268729.1_14 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI VIRF2_VP64_pos strong_ctrl_x_strong
CCCTGCCTCGGG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.16975091 0.938493 4.36045795 0.00212573 1.154823109 3.028913516 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 hvTR_Q2HR71 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTACGTGGCCCC _A_Concatenation TCAGCCGTAGGC _B Target(s) Target(s) -0.378082808 0.27973193 -1.381398754 0.31960259 -3.700201318 0.323795032 3.014726614 0.219163182 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q805Y1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR Strong_X_mid
AAGACCGGTGCC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.019631021 0.9335834 2.187166732 0.26683465 3.667940257 0.290224196 2.963586337 0.191530223 Virus_target vIRF2_VP64_pos_ctrl hvTR_P08392 core_VP64_a RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTCACGACAAGA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.819056338 1.89E-05 0.784394255 0.66285666 3.748714601 0.303126826 2.93857917 0.230308643 Virus_target Virus_target hvTR_G3G929 hvTR_Q2HR71 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GATGGAGCTACA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.284700578 1.64E-06 5.999006799 0.00020202 3.347160245 0.402627999 2.881948626 0.218195585 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR71 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CCCTGCCTCGGG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.064058234 0.98432612 4.099458591 0.00142037 1.129691601 2.851363423 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 LEUTX EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CCCTGCCTCGGG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -2.824601962 0.03143968 4.216421835 0.00977601 1.129691601 2.851363423 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 hvTR_Q2HR73 EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GAGATTGTGTCC _A_Concatenation GATCGCATGTTC _B Target(s) Target(s) 0.258232258 0.86342423 -5.123057585 0.053633 1.129691601 2.851363423 Virus_target Human target hvTR_Q2HR71 INPP4A PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER Strong_X_strong
ATGACCCCTTGT _A_Concatenation ACGGCGTAAGTA _B Target(s) Target(s) 4.044251705 0.47694407 1.129691601 2.851363423 Human target Virus_target LEUTX cds_NC_025350.1_cds_YP_009094078.1_5 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL Strong_X_mid
CCGCCCTTATGT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.026849433 0.85660869 3.135899123 0.13957468 1.129691601 2.851363423 Human target Human target ZFX LEUTX VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
TGCCAGACGGTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.49261486 0.18562008 3.427690683 0.00216231 -1.358256942 0.973675561 2.846787307 0.057793972 Virus_target neg_ctrl hvTR_P10541 Random_1 RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AATGGCACTAGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.406732819 0.16187644 0.040678586 0.98246422 0.655817521 2.832620479 Virus_target Human target cds_NC_024711.1_cds_YP_009052522.2_44 LEUTX EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
AGAGTGCGCTCT _A_Concatenation TTAAATGAGGGC _B neg_ctrl Target(s) 5.410922784 0.000332 0.788259302 0.73748128 0.655817521 2.832620479 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ Random_x_mid
ATAACTCCACGC _A_Concatenation ATTTGTAGACCG _B pos_ctrl_A Target(s) -0.732150776 0.00048411 2.326692873 0.27682961 4.371896356 0.190780897 2.810825069 0.292594387 vIRF2_VP64_pos_ctrl Human target core_VP64_a MYCN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE vIRF2_VP64_pos_ctrl_X_mid
ACTTGATGGTTT _A_Concatenation CAACGCATGCCT _B Target(s) Target(s) -0.491748446 0.00351418 5.908196143 0.00095831 0.795526954 0.997690695 2.78053389 0.04202036 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS Mid_X_weak
ACTTGATGGTTT _A_Concatenation GTTCCACGCCTG _B Target(s) Target(s) 0.765985135 0.00067751 4.739611242 0.00557968 -0.238324519 0.998501722 2.756572008 0.021401372 Virus_target Human target hvTR_Q2HR73 ZFX RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS Mid_X_strong
ACTTGATGGTTT _A_Concatenation TATCTATCGTGT _B Target(s) Target(s) -0.820576812 0.00329937 1.869527104 0.48071279 1.245288911 0.983636094 2.755132062 0.065462272 Virus_target Virus_target hvTR_Q2HR73 hvTR_P03259 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS Mid_X_strong
GAGATTGTGTCC _A_Concatenation AATGGCACTAGC _B Target(s) Target(s) -0.298460187 0.58143595 2.179858975 0.27272364 3.335867578 0.421461462 2.677790533 0.286685424 Virus_target Virus_target hvTR_Q2HR71 cds_NC_024711.1_cds_YP_009052522.2_44 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR Strong_x_weak
CATCTGAGTCGT _A_Concatenation ATGACCCCTTGT _B pos_ctrl_A Target(s) -0.619702452 0.22269773 1.222061042 0.33790827 -3.995654902 2.628440811 vIRF2_VP64_pos_ctrl Human target RWY_VP16_vIRF2 LEUTX GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VIRF2_VP64_pos strong_ctrl_x_strong
GGCACTGTCGAG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.908077241 0.09354755 -5.459935066 0.00071797 1.743595917 0.871241789 2.609300945 0.103147793 Virus_target vIRF2_VP64_pos_ctrl hvTR_P17147 core_VP64_a YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation CTCAGCGATATA _B Target(s) Target(s) 0.590311393 0.20781041 -5.328256146 0.00020202 3.463146968 0.412917083 2.582093793 0.341601869 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q5Y0Q4 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG Mid_X_strong
CTACGTGGCCCC _A_Concatenation CATTAGTCCCGC _B Target(s) Target(s) -0.486997868 0.78182953 -0.454112557 0.76446307 0.806372848 2.514522765 Virus_target Virus_target hvTR_Q2HR71 hvTR_P06930 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF Strong_X_strong
ATAACTCCACGC _A_Concatenation CATCTGAGTCGT _B pos_ctrl_A pos_ctrl_A -0.553105502 0.93202829 0.806372848 2.514522765 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_a RWY_VP16_vIRF2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
ATCTCTCAGTTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.142176433 2.86E-08 0.068937282 0.96917964 -4.412719073 0.199492182 2.501595252 0.389216175 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 Random_1 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ATGACCCCTTGT _A_Concatenation TCTTAGAGGGTG _B Target(s) Target(s) 1.15879215 0.25462682 -0.379752003 0.78843818 3.56081307 2.498363969 Human target Human target LEUTX TRO DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL Strong_X_strong
ACTTGATGGTTT _A_Concatenation CATCTGAGTCGT _B Target(s) pos_ctrl_A 1.131305308 0.44862406 3.959888493 0.00369217 3.129052409 2.413612329 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR73 RWY_VP16_vIRF2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS mid_X_vIRF2_VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation CTCCATTAATGA _B pos_ctrl_A Target(s) 0.244894981 0.43060539 -2.907944626 0.0451167 2.455805436 0.665320115 2.386820795 0.285422498 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q5IXR3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV VIRF2_VP64_pos ctrl_x_weak
TCTGTACCAACG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.12663374 0.09584462 -1.731980137 0.45128914 2.989888407 2.342631693 Virus_target Virus_target hvTR_Q6XA69 hvTR_Q2HR71 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
AATCAACGAGCA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.464791992 0.57948176 1.373632359 0.30699207 -4.903080206 0.113496245 2.306590113 0.405632218 Human target Virus_target C2orf81 hvTR_Q2HR71 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
GAGATTGTGTCC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.586179689 0.18896329 -1.582569745 0.23647027 2.919168286 0.527905197 2.299913468 0.346191512 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR71 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATGACCCCTTGT _A_Concatenation CGATGGCGGAAT _B Target(s) Target(s) -5.742396689 0.05235126 3.88115205 1.353397139 2.267432068 Human target Human target LEUTX KIAA1586 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG Strong_x_weak
ACGGCGTAAGTA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 7.198969509 1.353397139 2.267432068 Virus_target neg_ctrl cds_NC_025350.1_cds_YP_009094078.1_5 Random_1 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGTGACCTATT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.02687482 0.93141616 1.416967634 0.43395215 3.221274552 0.480471365 2.223844035 0.415614042 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 Random_1 HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
ATGACCCCTTGT _A_Concatenation CTGCCGGTTGCG _B Target(s) Target(s) -1.35934585 3.63E-12 -0.314377633 0.86457198 3.524896295 0.382791636 2.215090579 0.420144123 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL Strong_X_mid
ATGACCCCTTGT _A_Concatenation GTACATACCGAA _B Target(s) Target(s) 4.286139842 0.10711266 4.398446337 0.00732226 2.780471454 2.165172508 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE Strong_X_mid
TATCTATCGTGT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.303506842 0.39544886 -5.333119814 5.67E-05 3.673767162 0.518122906 2.131368901 0.570993876 Virus_target Virus_target hvTR_P03259 hvTR_Q2HR71 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
EPICXV.47-A\u splice_ CTACGTGGCCCC _b pos_ctrl_A Target(s) -0.26277574 0.38876209 1.855481223 0.45128914 2.021471391 0.742836237 2.130010123 0.219085969 vIRF2_VP64_pos_ctrl Virus_target 3x_vIRF2_core hvTR_Q2HR71 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VIRF2_VP64_pos strong_ctrl_x_strong
AGAGTGCGCTCT _A_Concatenation CACACCCGGCAG _B neg_ctrl Target(s) -1.13841797 3.66E-15 0.469241478 0.78352705 0.595014322 0.997690695 2.104947116 0.098005281 neg_ctrl Virus_target Random_1 hvTR_P19893 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS Random_x_strong
AGAGTGCGCTCT _A_Concatenation CCAGACGGTCTG _B neg_ctrl Target(s) 2.511508718 0.19157528 5.135854067 0.02389526 -2.724151175 2.098570724 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN Random_x_weak
ACTTGATGGTTT _A_Concatenation TACATGAGGCAC _B Target(s) Target(s) -0.787774507 0.02419595 -6.09582715 7.36E-07 2.723003001 0.580611989 2.095151716 0.392765825 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL mid_X_mid
ATAACTCCACGC _A_Concatenation TCTTAGAGGGTG _B pos_ctrl_A Target(s) 0.285280071 0.11078803 1.26449008 0.41507208 0.769865197 0.997690695 2.073066391 0.100152293 vIRF2_VP64_pos_ctrl Human target core_VP64_a TRO LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation ACTTGATGGTTT _B pos_ctrl_A Target(s) -0.8757395 0.01468232 -4.34284846 0.00016753 -2.929516439 0.492214467 2.021085227 0.391508113 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q2HR73 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR vIRF2_VP64_pos_ctrl_X_mid
TCTTGGTTTTGA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.370424575 0.93141616 1.785220715 0.56053958 -5.240903526 1.962881718 Human target Human target ZNF597 LEUTX ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AGAGTGCGCTCT _A_Concatenation CCCCTTAGGAAT _B neg_ctrl Target(s) 0.038170676 0.9854934 -1.374515567 -1.845139475 1.948298595 neg_ctrl Human target Random_1 SYNE1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD Random_x_mid
Stop-1_A/u splice_ ATGACCCCTTGT _b neg_ctrl Target(s) 2.628901666 0.07185248 -2.225976796 0.35259645 0.771280844 1.940135541 neg_ctrl Human target Stop_1 LEUTX ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Termination_X_strong
ATGACCCCTTGT _A_Concatenation GCAACGAGGGTC _B Target(s) Target(s) -0.81637252 0.54381256 1.960958855 -7.307013263 0.038338443 1.935711077 0.624474901 Human target Human target LEUTX QRICH2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ Strong_X_mid
GAGATTGTGTCC _A_Concatenation GTTCCACGCCTG _B Target(s) Target(s) 0.322560522 0.81077083 0.226623032 0.89420679 -1.492746351 1.9290389 Virus_target Human target hvTR_Q2HR71 ZFX PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS Strong_X_strong
AGAGTGCGCTCT _A_Concatenation CTCAGCGATATA _B neg_ctrl Target(s) -0.496102858 0.29509007 -5.62000668 0.00022359 -2.636250542 0.772599368 1.854026899 0.590310725 neg_ctrl Virus_target Random_1 hvTR_Q5Y0Q4 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG Random_x_strong
TCAGCCGTAGGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.330611836 0.91324404 -5.186059459 0.09535537 1.559834265 1.852843232 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q805Y1 core_VP64_a RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation Joint_ 3_B Target(s) neg_ctrl 0.74798315 0.28354489 1.009977447 0.70600688 0.801134684 1.834237307 Virus_target neg_ctrl hvTR_Q2HR71 Joint_3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD Strong-X-joint
ATGACCCCTTGT _A_Concatenation ATGACAAAACGA _B Target(s) Target(s) -0.840770349 0.66495342 -1.487358169 0.41956458 -1.006281085 1.816027803 Human target Virus_target LEUTX cds_NC_025350.1_cds_YP_009094074.1_1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL Strong_X_strong
ATAACTCCACGC _A_Concatenation ATTGTATCTAAC _B pos_ctrl_A Target(s) -0.296462859 0.81690552 -4.56775436 0.02400653 -1.987769694 0.934700311 1.804004147 0.592785717 vIRF2_VP64_pos_ctrl Human target core_VP64_a SMARCB1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD vIRF2_VP64_pos_ctrl_X_mid
TTAGTGCCATAC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.043136136 0.9854934 -0.235060283 0.9450104 -1.058941627 1.784784298 Human target Human target TET3 LEUTX PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CTACGTGGCCCC _A_Concatenation GAGACAGCTCTC _B Target(s) Target(s) 1.594312617 -5.606961687 0.1133452 -1.058941627 1.784784298 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM Strong_X_mid
CTACGTGGCCCC _A_Concatenation Joint_ 2_B Target(s) neg_ctrl -1.026554704 1.20E-07 0.811184789 -2.286872555 0.856321281 1.774863257 0.592109514 Virus_target neg_ctrl hvTR_Q2HR71 Joint_2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA Strong-X-joint
GACTGTGGGCGG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.766231411 0.57362051 2.27049263 -2.261842316 0.871139325 1.769498729 0.596765654 Human target Virus_target MON2 hvTR_Q2HR71 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
AGAGTGCGCTCT _A_Concatenation AGCCGTGCTTGA _B neg_ctrl Target(s) -1.320950096 0.00047194 -2.415765785 0.25150303 2.430783956 0.653128487 1.76537836 0.454989974 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY Random_x_weak
ACTTGATGGTTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.095051795 0.00593454 6.082345537 0.00062813 -2.820291451 0.622197442 1.757865626 0.550384214 Virus_target neg_ctrl hvTR_Q2HR73 Random_1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ATAACTCCACGC _A_Concatenation CCAGACGGTCTG _B pos_ctrl_A Target(s) -1.17156476 0.32392216 -3.341482748 0.03888733 -6.6462754 1.724130566 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN VIRF2_VP64_pos ctrl_x_weak
CAACGCATGCCT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -5.147769739 0.13847984 -0.449298295 1.651611873 1.683059042 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 hvTR_Q2HR71 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
AGAGTGCGCTCT _A_Concatenation CAGTGACATAGT _B neg_ctrl Target(s) 0.252101225 0.57019691 2.043095801 0.31917493 2.04868352 0.733105087 1.676660391 0.403335945 neg_ctrl Human target Random_1 EHMT2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT Random_x_mid
AGAGTGCGCTCT _A_Concatenation GCAGATTCGAAT _B neg_ctrl Target(s) 1.569816337 1.09E-06 5.862600612 4.46E-05 -3.833339407 0.396731112 1.671872655 0.627531443 neg_ctrl Human target Random_1 AK9 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP Random_x_mid
TCTATCACAACT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.005722971 0.76595865 4.951823357 6.38E-05 0.907014646 1.666358022 Human target Human target TOX4 LEUTX LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AAAACAAGCATT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.678116244 0.82936061 3.124973587 0.01033701 0.597180359 1.659069326 Virus_target Human target cds_NC_030656.1_cds_YP_009268803.1_88 LEUTX STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
TGCCAGACGGTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.293118193 0.50685675 3.971010231 0.00057613 0.923529399 0.997690695 1.657865275 0.290813651 Virus_target vIRF2_VP64_pos_ctrl hvTR_P10541 core_VP64_a RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CATGCTAACACC _B neg_ctrl Target(s) -0.832451113 0.09668495 -3.049324274 0.02473656 -3.215563428 0.572967985 1.642138635 0.624474901 neg_ctrl Virus_target Random_1 hvTR_F5HCV3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL Random_x_strong
CGCCGGCATACA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.750271117 0.66087705 -1.172515467 0.54837089 -0.238223872 1.638165971 Virus_target neg_ctrl cds_NC_030656.1_cds_YP_009268729.1_14 Random_1 DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation GAGATTGTGTCC _B pos_ctrl_A Target(s) -0.453424726 0.15482312 -3.734576005 0.00062712 2.61206827 0.570766083 1.63695875 0.472641252 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q2HR71 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VIRF2_VP64_pos strong_ctrl_x_strong
CTACGTGGCCCC _A_Concatenation Joint_ 1_B Target(s) neg_ctrl 0.422626019 0.10004367 5.341492676 0.00732226 2.614882056 0.551036114 1.631748809 0.464627066 Virus_target neg_ctrl hvTR_Q2HR71 Joint_1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK Strong-X-joint
GAGATTGTGTCC _A_Concatenation ATCCAATCTGTG _B Target(s) Target(s) 7.278730575 0.09988704 -1.159549692 1.614731197 Virus_target Human target hvTR_Q2HR71 GLI2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT Strong_X_mid
TGGCCCCCAGTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.904613291 0.0471767 -1.873737773 -1.813911177 0.877171774 1.587469807 0.530917523 Virus_target Virus_target hvTR_Q3I8P6 hvTR_Q2HR71 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
ATGACCCCTTGT _A_Concatenation GATCGCATGTTC _B Target(s) Target(s) -1.596864885 0.22734303 -5.78884547 0.09686773 -2.441116307 0.803891096 1.582442664 0.626950374 Human target Human target LEUTX INPP4A DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER Strong_X_strong
ACTTGATGGTTT _A_Concatenation TAAGATTCAACG _B Target(s) Target(s) -0.805030025 2.80E-07 2.121866298 0.21085724 2.150072676 0.703227418 1.559220448 0.464627066 Virus_target Human target hvTR_Q2HR73 ZNF292 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP Mid_X_strong
GCGAGGCAGACC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 5.642547402 0.0003937 -2.914520999 0.34689881 -4.611193359 1.51879336 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 Random_1 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
CCCTATGTTCTA _A_Concatenation ACTTGATGGTTT _B pos_ctrl_A Target(s) 0.578763744 0.3372476 -1.822983312 0.40695666 -0.083392029 1.494846927 vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q2HR73 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR vIRF2_VP64_pos_ctrl_X_mid
AGAGTGCGCTCT _A_Concatenation AATCGAGACCAG _B neg_ctrl Target(s) -2.136686854 0.36741802 3.88115205 -3.576130805 1.480121133 neg_ctrl Virus_target Random_1 hvTR_Q8BB47 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH Random_x_strong
AGAGTGCGCTCT _A_Concatenation GTTCCACGCCTG _B neg_ctrl Target(s) 1.643484139 0.01377271 5.233804227 0.03194613 -1.276640569 0.997690695 1.476252889 0.635919588 neg_ctrl Human target Random_1 ZFX CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS Random_x_strong
ACCAGTATTTAA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.05269266 0.97860685 -0.806002169 0.68982791 -1.716422214 0.973675561 1.437745384 0.67077053 Human target neg_ctrl ARNTL2 Random_1 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation GCAACGAGGGTC _B pos_ctrl_A Target(s) -2.788699662 0.09831989 -4.779729483 0.13221889 -3.257361161 1.435611111 vIRF2_VP64_pos_ctrl Human target core_VP64_a QRICH2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ vIRF2_VP64_pos_ctrl_X_mid
AGAGTGCGCTCT _A_Concatenation ATTGTATCTAAC _B neg_ctrl Target(s) -0.430525729 0.54236082 -2.615777984 0.08383374 -4.244127073 1.433600852 neg_ctrl Human target Random_1 SMARCB1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD Random_x_mid
GATCGCATGTTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.606613267 0.00328682 -0.175093739 0.9356323 1.908274962 0.799861834 1.388557076 0.510873375 Human target neg_ctrl INPP4A Random_1 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
TGGCCCCCAGTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.353844748 0.92647166 -3.820627393 0.07411668 1.093111598 1.379923139 Virus_target Human target hvTR_Q3I8P6 LEUTX GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
ACTTGATGGTTT _A_Concatenation AGCCATAAACCT _B Target(s) Target(s) 0.252568733 0.08334253 3.052988166 0.07667512 2.205545125 0.693965883 1.378849411 0.550384214 Virus_target Virus_target hvTR_Q2HR73 hvTR_P03209 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ mid_X_mid
CTACGTGGCCCC _A_Concatenation CTGCCGGTTGCG _B Target(s) Target(s) 0.085749267 0.71380818 5.292689342 5.90E-05 2.253234571 0.662355132 1.360765443 0.539565192 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL Strong_X_mid
CATTGTTTAAAT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.205949217 0.45505484 1.371913546 0.29899332 2.482402356 0.634646987 1.347132723 0.590310725 Human target Virus_target POLR1E hvTR_Q2HR71 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
ATGACCCCTTGT _A_Concatenation GCGAGGCAGACC _B Target(s) Target(s) 0.69807194 0.49872084 -5.906454068 0.00109019 -4.042323036 1.344013415 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD Strong_x_unknown
CTACGTGGCCCC _A_Concatenation AAGACCGGTGCC _B Target(s) Target(s) 0.203400124 0.70546103 -0.396322981 0.7912345 -1.814459373 0.870358589 1.301893593 0.585780662 Virus_target Virus_target hvTR_Q2HR71 hvTR_P08392 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF Strong_X_mid
TCAGCCGTAGGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.893242587 0.74390166 -5.324706757 0.07672334 -3.351472084 1.298388752 Virus_target Virus_target hvTR_Q805Y1 hvTR_Q2HR71 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
AGAGTGCGCTCT _A_Concatenation GGCACAGCTCCA _B neg_ctrl Target(s) 0.053532743 0.94919394 -5.131921353 0.04653661 -1.45795715 1.278378412 neg_ctrl Human target Random_1 DTX2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG Random_x_unknown
GGGAGTATGTCG _A_Concatenation ATGACCCCTTGT _B neg_ctrl Target(s) -0.414059452 0.82490703 -3.465513926 0.1566422 1.889855634 1.268290505 neg_ctrl Human target Random_4 LEUTX LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Random_x_strong
TCTTGGTTTTGA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.748975568 0.63450624 -0.576022665 0.84184156 1.889855634 1.268290505 Human target Virus_target ZNF597 hvTR_Q2HR71 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATCCAATCTGTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.276785654 0.92803153 1.677139367 0.52530556 1.527207005 1.261510004 Human target vIRF2_VP64_pos_ctrl GLI2 core_VP64_a PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation CATCTGAGTCGT _B Target(s) pos_ctrl_A 1.103046079 0.75590178 6.003290442 0.00156378 0.155435055 1.233550505 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 RWY_VP16_vIRF2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS Strong_X_vIRF2\u VP64_pos_ctrl
GGCACTGTCGAG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.164326593 0.87740996 -0.139158719 0.93504853 -2.130469701 0.868917591 1.2121708 Virus_target Virus_target hvTR_P17147 hvTR_Q2HR71 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
GGCACAGCTCCA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.931126466 0.45350915 -3.299251973 0.17069892 -2.028819151 0.889535073 1.205638706 0.709382773 Human target Virus_target DTX2 hvTR_Q2HR73 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
ACGACTCACCGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.626838612 0.77928478 -1.780368738 0.50177205 5.393327339 1.189503102 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 hvTR_Q2HR71 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ACTTGATGGTTT _A_Concatenation CTCGTTATGGCA _B Target(s) Target(s) -0.48463759 0.03338662 4.819915517 0.02400653 2.570597492 0.580611989 1.179282647 0.630780636 Virus_target Human target hvTR_Q2HR73 NOVA1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF mid_X_mid
ACTTGATGGTTT _A_Concatenation TTTAACGGATTG _B Target(s) Target(s) -0.72796167 0.09988704 -4.322471153 0.0016933 1.739365194 0.81934659 1.178331688 0.518744839 Virus_target Human target hvTR_Q2HR73 TWIST2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN mid_X_mid
GAGATTGTGTCC _A_Concatenation CGCCGGCATACA _B Target(s) Target(s) -1.841517505 0.18334753 -3.012320806 0.29907275 0.131118399 1.162103585 Virus_target Virus_target hvTR_Q2HR71 cds_NC_030656.1_cds_YP_009268729.1_14 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI Strong_X_strong
GGCACTGTCGAG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.392592625 0.81690552 -1.713889714 0.25252688 -1.200648001 1.151663367 Virus_target Human target hvTR_P17147 LEUTX YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
CATCTCACCTAG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.709763226 0.38922543 1.683764643 0.29264921 -5.327828084 1.147081827 Virus_target Virus_target hvTR_P03206 hvTR_Q2HR73 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TGATGAGCGCCC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.239696748 0.4737404 3.174959234 2.998050861 0.418960654 1.14289441 0.64853834 Human target vIRF2_VP64_pos_ctrl EEF1D core_VP64_a QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation AAGACCGGTGCC _B pos_ctrl_A Target(s) 0.626467073 1.20E-05 5.370664311 1.17E-05 1.6634717 0.877171774 1.101933069 0.608816077 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P08392 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF vIRF2_VP64_pos_ctrl_X_mid
CTACGTGGCCCC _A_Concatenation GTACATACCGAA _B Target(s) Target(s) 0.865349217 0.28726554 0.256244147 0.85890913 1.382022645 1.097597554 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE Strong_X_mid
ATGACCCCTTGT _A_Concatenation TTAAATGAGGGC _B Target(s) Target(s) 0.384212959 0.85884337 -4.437953639 0.05552114 -0.913137641 1.095276065 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ Strong_X_mid
ACAACGTGCATA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.659961567 6.37E-23 -0.57384925 0.74513376 2.168025058 0.703227418 1.084864637 0.638579701 Virus_target Human target hvTR_P10407 LEUTX RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
GAGATTGTGTCC _A_Concatenation Joint_ 1_B Target(s) neg_ctrl 0.17739503 0.82429662 5.299274559 0.00940756 -1.684738002 0.976975145 1.079151833 0.760804824 Virus_target neg_ctrl hvTR_Q2HR71 Joint_1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK Strong-X-joint
Joint_ 2_A/u splice_ ATAACTCCACGC _b neg_ctrl pos_ctrl_A 0.987791077 0.04916082 4.018063296 0.09317132 0.352906907 1.015782106 neg_ctrl vIRF2_VP64_pos_ctrl Joint_2 core_VP64_a GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Linker_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation CTGCCGGTTGCG _B Target(s) Target(s) -0.992557612 4.96E-10 2.120892066 0.23767729 1.242663962 0.95807515 0.937064178 0.566175704 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL mid_X_mid
ATAACTCCACGC _A_Concatenation GGCACTGTCGAG _B pos_ctrl_A Target(s) -0.248775848 0.82544101 -5.978185565 0.00029864 -2.094607356 0.798947013 0.912854627 0.72964861 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P17147 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA VIRF2_VP64_pos ctrl_x_unknown
CTGCCGGTTGCG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.11970577 7.45E-18 3.136645504 0.01719167 2.253016506 0.665320115 0.908504475 0.700036819 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 hvTR_Q2HR71 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ACTTGATGGTTT _A_Concatenation ATGTGGCACGAC _B Target(s) Target(s) -0.256266476 0.36743075 -3.778187384 0.00259259 1.169081055 0.983636094 0.888162231 0.655523139 Virus_target Human target hvTR_Q2HR73 ANKRD17 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC mid_X_mid
CCTAAATGCAAG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -5.010681713 -1.629306057 0.877388115 Virus_target vIRF2_VP64_pos_ctrl cds_NC_025410.1_cds_YP_009094498.1_7 core_VP64_a SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
AGTTGCTTTGCT _A_Concatenation ATAACTCCACGC _B neg_ctrl pos_ctrl_A 0.960333021 0.00488169 4.179882546 0.06702671 2.282660606 0.704006661 0.855220278 0.745418229 neg_ctrl vIRF2_VP64_pos_ctrl Random_2 core_VP64_a DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Random_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation Joint_ 2_B neg_ctrl neg_ctrl 1.299477087 0.12232448 -4.409678395 0.14518028 2.145491451 0.853443827 neg_ctrl neg_ctrl Random_1 Joint_2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA Random_X_joint
ATAACTCCACGC _A_Concatenation AGCCGTGCTTGA _B pos_ctrl_A Target(s) 0.539193025 0.81666774 -2.90652737 0.42930651 1.456980834 0.843830764 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY VIRF2_VP64_pos ctrl_x_weak
CTCACGACAAGA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.514506396 0.57736265 2.139412733 0.04463366 -0.346964688 0.841973254 Virus_target Human target hvTR_G3G929 LEUTX ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AAAACAAGCATT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.742604782 2.42E-51 4.464018864 5.18E-06 2.54870469 0.574934744 0.839688459 0.734717606 Virus_target vIRF2_VP64_pos_ctrl cds_NC_030656.1_cds_YP_009268803.1_88 core_VP64_a STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation GATGGAGCTACA _B Target(s) Target(s) 1.879945463 0.60940534 1.895753735 0.61426979 -3.877100388 0.822271134 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03209 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF Strong_X_strong
GGCACAGCTCCA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.779855302 0.48649146 -5.062641363 0.15596426 -2.334841399 0.812865607 Human target vIRF2_VP64_pos_ctrl DTX2 core_VP64_a TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation ATTTGTAGACCG _B Target(s) Target(s) -0.533841416 0.42460236 7.563147751 0.00295321 -1.889450945 0.770864826 Virus_target Human target hvTR_Q2HR71 MYCN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE Strong_X_mid
CTGGTAGTTGTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.76744908 0.46430133 2.09584397 0.4070014 0.563208724 0.748168124 Virus_target Virus_target hvTR_F6KS47 hvTR_Q2HR73 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
GACTGTGGGCGG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.520269235 0.65676761 -4.732085826 0.04424815 -4.065153722 0.723617846 Human target Virus_target MON2 hvTR_Q2HR73 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTCGTTATGGCA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.243346555 0.82429662 -2.934683909 0.0608294 -1.101450716 0.997690695 0.631504535 0.780883772 Human target vIRF2_VP64_pos_ctrl NOVA1 core_VP64_a AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CATTAGTCCCGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.170138862 0.8780525 4.042905152 0.10604816 -1.478634957 0.997690695 0.628296376 0.886599957 Virus_target neg_ctrl hvTR_P06930 Random_1 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CTACGTGGCCCC _A_Concatenation Stop_1_B Target(s) neg_ctrl -2.099795004 0.07250322 4.820861222 0.00113014 -7.036155021 0.562653541 Virus_target neg_ctrl hvTR_Q2HR71 Stop_1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK Strong_X_termination
AATCGATCCTAC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.884048604 0.11925871 -0.474312793 0.75805561 -0.919399756 0.997690695 0.554388365 0.896063592 Human target Virus_target PRDM10 hvTR_Q2HR73 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GAGATTGTGTCC _A_Concatenation Joint_ 3_B Target(s) neg_ctrl 0.089154207 0.96035322 -5.797785961 0.01745071 -4.0227046 0.508387958 Virus_target neg_ctrl hvTR_Q2HR71 Joint_3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD Strong-X-joint
ATAACTCCACGC _A_Concatenation AGCCATAAACCT _B pos_ctrl_A Target(s) 1.68124465 1.87E-16 1.245080785 0.53051558 2.136151821 0.72394284 0.506541254 0.844904201 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P03209 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ vIRF2_VP64_pos_ctrl_X_mid
GTACATACCGAA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.162739642 0.00974837 -5.809727442 0.02989572 -1.373836194 0.469619153 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 core_VP64_a NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation TACATGAGGCAC _B pos_ctrl_A Target(s) -1.716170231 0.49700987 -5.58970336 0.03888733 -5.358663263 0.4397358 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL vIRF2_VP64_pos_ctrl_X_mid
GGTGTCTCGATT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.160633748 0.95787851 1.414931716 0.65265206 -0.421640427 0.427016738 Human target neg_ctrl E2F5 Random_1 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CAGTGACATAGT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.611972011 0.0863072 5.915686221 1.23E-07 3.17543956 0.570391548 0.410116426 0.914216659 Human target Virus_target EHMT2 hvTR_Q2HR71 AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ACAACGTGCATA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.261835701 0.16937431 1.139745633 0.52277462 -2.699154352 0.574934744 0.366152286 0.902237344 Virus_target Virus_target hvTR_P10407 hvTR_Q2HR71 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
TCAGCCGTAGGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.061666538 0.92304294 4.087446849 0.05018079 2.63742964 0.572967985 0.354484472 0.902200589 Virus_target Human target hvTR_Q805Y1 LEUTX RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGATTGTGTCC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.113009742 0.90763615 -5.019686845 0.00062712 -1.710539097 0.335420581 Virus_target Human target hvTR_Q2HR71 LEUTX PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
TCAGCCGTAGGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.360042349 0.86342423 -3.157670546 0.2632007 2.159648733 0.326515198 Virus_target neg_ctrl hvTR_Q805Y1 Random_1 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ATAACTCCACGC _A_Concatenation TGGCCCCCAGTT _B pos_ctrl_A Target(s) -0.84476964 0.04064654 -5.862623718 0.00628323 -5.840837533 0.324426966 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q3I8P6 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN VIRF2_VP64_pos ctrl_x_weak
TATCGCAAGAAC _A_Concatenation AGAGTGCGCTCT _B neg_ctrl neg_ctrl -0.757692055 0.3372476 2.584470563 0.05968663 -0.90213542 0.997690695 0.32012067 0.90076366 neg_ctrl neg_ctrl Random_5 Random_1 PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Random_x_random
Stop 2_A splice ATAACTCCACGC B neg_ctrl pos_ctrl_A 0.094261317 0.97488974 -2.450279355 0.35394249 0.703495589 0.318382535 neg_ctrl vIRF2_VP64_pos_ctrl Stop_2 core_VP64_a ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Termination_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CAACGCATGCCT _B neg_ctrl Target(s) -0.480443301 0.06765358 3.100307847 0.08071342 1.54805731 0.874190583 0.30577931 0.886599957 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS Random_x_weak
AGAGTGCGCTCT _A_Concatenation GGAGGCACATCG _B neg_ctrl Target(s) 1.081687677 1.43E-11 -2.285993278 0.06080289 -2.376746642 0.679495956 0.257043229 0.935730193 neg_ctrl Virus_target Random_1 hvTR_Q87042 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL Random_x_strong
AGAGTGCGCTCT _A_Concatenation Joint_ 3_B neg_ctrl neg_ctrl -0.898845443 0.45084025 -2.115347339 0.32894908 -0.228118001 0.243328485 neg_ctrl neg_ctrl Random_1 Joint_3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD Random_X_joint
AGTTGCTTTGCT _A_Concatenation GAGATTGTGTCC _B neg_ctrl Target(s) 0.080791763 0.89675803 4.036457438 0.04816017 -1.12100915 0.222740188 neg_ctrl Virus_target Random_2 hvTR_Q2HR71 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Random_x_strong
CTGCCGGTTGCG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.737752278 0.02638198 1.831779963 0.12197421 -1.051036938 0.217808708 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 Random_1 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGATTGTGTCC _A_Concatenation ACGACTCACCGC _B Target(s) Target(s) -0.47922872 0.81690552 -0.44799202 0.89519343 -0.934007198 0.213884808 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK Strong_X_mid
CATCTGAGTCGT _A_Concatenation ATAACTCCACGC _B pos_ctrl_A pos_ctrl_A -0.689048446 0.04776592 4.327088726 0.04160686 1.204366126 0.987041604 0.212151908 0.93924838 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl RWY_VP16_vIRF2 core_VP64_a GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
AATCGAGACCAG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.195139581 0.94919394 -4.040795247 0.20134294 -2.139599531 0.14022633 Virus_target Virus_target hvTR_Q8BB47 hvTR_Q2HR71 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
AGCCATAAACCT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.335474845 0.00662615 0.413686103 0.82720717 1.76117014 0.851559334 0.128474535 0.965824722 Virus_target neg_ctrl hvTR_P03209 Random_1 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
Joint_1_A/u splice_ AGAGTGCGCTCT _b neg_ctrl neg_ctrl -1.397055722 0.00091978 -3.090284227 0.11336437 -1.836287723 0.125706013 neg_ctrl neg_ctrl Joint_1 Random_1 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Joint_X_random
AATCGAGACCAG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.392271569 1.18E-09 -4.967798742 0.00095831 -2.001250125 0.934152898 0.122258859 0.9828278 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q8BB47 core_VP64_a EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation AGAGTGCGCTCT _B neg_ctrl neg_ctrl -0.652574893 0.00010169 5.475209909 0.00011897 1.462798649 0.934700311 0.117179463 0.967254711 neg_ctrl neg_ctrl Random_1 Random_1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Random_x_random
CTGGTAGTTGTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.391395516 0.35312215 4.386002441 0.09576001 -0.141091018 0.112611664 Virus_target neg_ctrl hvTR_F6KS47 Random_1 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
ACTTGATGGTTT _A_Concatenation TCAGCCGTAGGC _B Target(s) Target(s) -0.059231035 0.86342423 3.911686692 0.02595648 -0.26616897 0.997690695 0.098479384 0.945626709 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q805Y1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR mid_X_mid
ACTTTAGTAACA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.059370341 8.92E-05 -0.743398989 0.66003866 -2.506505199 0.529778264 0.092808267 0.976901733 Virus_target neg_ctrl hvTR_Q69113 Random_1 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AGAGTGCGCTCT _A_Concatenation ACTTTAGTAACA _B neg_ctrl Target(s) -0.193717483 0.86892322 -4.87696204 0.00080119 -0.802988013 0.997690695 0.0914227 0.984564542 neg_ctrl Virus_target Random_1 hvTR_Q69113 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES Random_x_strong
GAGATTGTGTCC _A_Concatenation TTTAACGGATTG _B Target(s) Target(s) 0.276464599 0.31988807 1.676533763 0.42930651 -1.439547247 0.976975145 0.023950238 0.995060651 Virus_target Human target hvTR_Q2HR71 TWIST2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN Strong_X_mid
CTACGTGGCCCC _A_Concatenation CAGTGACATAGT _B Target(s) Target(s) -0.31658724 0.12953115 3.97371437 0.05973494 0.769564213 0.997690695 0.02075425 0.994766086 Virus_target Human target hvTR_Q2HR71 EHMT2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT Strong_X_mid
ATAACTCCACGC _A_Concatenation CATGCTAACACC _B pos_ctrl_A Target(s) 1.068462573 4.06E-07 -0.742993487 0.63897552 -4.09225671 0.013589008 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_F5HCV3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL VIRF2_VP64_pos strong_ctrl_x_strong
AGAGTGCGCTCT _A_Concatenation ATGTGGCACGAC _B neg_ctrl Target(s) -0.783154462 5.17E-09 5.748255934 6.38E-05 1.080153366 0.995270541 0.002579356 0.999421067 neg_ctrl Human target Random_1 ANKRD17 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC Random_x_mid
GGCCTTTCGGGC _A_Concatenation ATAACTCCACGC _B neg_ctrl pos_ctrl_A 0.517537679 0.50380671 2.711006308 0.25363982 -5.348540709 0.097296843 0.00182921 0.999421067 neg_ctrl vIRF2_VP64_pos_ctrl Random_3 core_VP64_a GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Random_X_vIRF2\u VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation EPICXV.47_B pos_ctrl_A pos_ctrl_A -0.332998303 0.91326182 4.420985537 0.00989082 -0.813179645 -0.010122676 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_a 3x_vIRF2_core LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation ATGTGGCACGAC _B Target(s) Target(s) 0.691722946 5.51E-10 5.89700202 3.58E-05 0.613011962 0.997690695 -0.017156043 0.994766086 Virus_target Human target hvTR_Q2HR71 ANKRD17 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC Strong_X_mid
CTACGTGGCCCC _A_Concatenation TGCCAGACGGTC _B Target(s) Target(s) 0.549526056 4.88E-05 4.63418439 0.01582874 0.983350079 0.997690695 -0.029300373 0.994766086 Virus_target Virus_target hvTR_Q2HR71 hvTR_P10541 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG Strong_X_strong
ATAACTCCACGC _A_Concatenation ACGACTCACCGC _B pos_ctrl_A Target(s) 6.286576616 0.02699935 -4.342195826 -0.030366742 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK vIRF2_VP64_pos_ctrl_X_mid
ACCAGTATTTAA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.112221946 0.84320905 -4.749563692 0.00157575 -2.445065888 -0.036281135 Human target Virus_target ARNTL2 hvTR_Q2HR73 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GAGATTGTGTCC _A_Concatenation CTCACGACAAGA _B Target(s) Target(s) 0.931010243 0.23382858 -3.016944751 0.31825647 -1.742040783 0.834609851 -0.04110902 0.991112506 Virus_target Virus_target hvTR_Q2HR71 hvTR_G3G929 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP Strong_X_strong
GGTGTCTCGATT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.541075193 0.85681444 -3.27242354 0.10515713 -0.571425044 -0.052307818 Human target Virus_target E2F5 hvTR_Q2HR73 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
AGAGTGCGCTCT _A_Concatenation CTCACGACAAGA _B neg_ctrl Target(s) -0.178603315 0.92304294 -1.517718004 0.56218914 0.335921396 -0.083908871 neg_ctrl Virus_target Random_1 hvTR_G3G929 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP Random_x_strong
GGAGGCACATCG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.714079894 0.03148891 2.108369794 -0.82241016 0.997690695 -0.106665771 0.9828278 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q87042 core_VP64_a RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation CGATGGCGGAAT _B Target(s) Target(s) 0.600807126 0.38377703 -4.950844438 0.0050302 -4.863285093 -0.127964546 Virus_target Human target hvTR_Q2HR71 KIAA1586 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG Strong_x_weak
ATAACTCCACGC _A_Concatenation CGATGGCGGAAT _B pos_ctrl_A Target(s) -1.754898375 0.21779974 -0.920756691 -0.168883587 vIRF2_VP64_pos_ctrl Human target core_VP64_a KIAA1586 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG VIRF2_VP64_pos ctrl_x_weak
CTCCATTAATGA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.385870044 2.554413976 -2.362808482 -0.170160959 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q5IXR3 core_VP64_a FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
CCCCTTAGGAAT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 4.484705094 0.32767348 8.501950322 -2.362808482 -0.170160959 Human target neg_ctrl SYNE1 Random_1 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGATTGTGTCC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.486517931 0.72790325 0.156060376 0.9459521 -0.107144781 0.998501722 -0.200237849 0.965824722 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 core_VP64_a PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGTTAGGCTCTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.141306938 0.63664962 -1.490740112 0.36520524 1.457711023 0.953105946 -0.201144741 0.942416038 Human target Virus_target TOX4 hvTR_Q2HR73 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CCCTGCCTCGGG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.68966899 4.07E-06 5.897720999 8.78E-05 0.85935825 0.997690695 -0.207686513 0.926981938 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 core_VP64_a EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATGTGGCACGAC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.364933259 0.05848158 -3.126889817 0.15035859 -0.613171562 -0.258277926 Human target neg_ctrl ANKRD17 Random_1 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ATGACCCCTTGT _A_Concatenation ACTCGTGTTGTC _B Target(s) Target(s) -2.442645515 0.01342291 -1.478045212 0.54946476 -2.428403109 -0.260370546 Human target Virus_target LEUTX hvTR_Q9QB97 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT Strong_X_mid
ACTTGATGGTTT _A_Concatenation TGGCCCCCAGTT _B Target(s) Target(s) 1.387311323 0.36774226 -2.981801614 -2.428403109 -0.260370546 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q3I8P6 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN Mid_X_weak
GTCACCCTCCTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -2.097944489 0.74855254 3.88115205 2.904704451 -0.275553032 Human target neg_ctrl E2F5 Random_1 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AGAGTGCGCTCT _A_Concatenation ACCAGTATTTAA _B neg_ctrl Target(s) -0.73590366 0.22130449 -2.300065395 0.24997751 -2.748593362 -0.279935607 neg_ctrl Human target Random_1 ARNTL2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL Random_x_strong
CCCCTTAGGAAT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.347779795 0.91132524 -2.490667222 0.48071279 -1.522913448 -0.295047465 Human target Virus_target SYNE1 hvTR_Q2HR73 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATGACCCCTTGT _A_Concatenation AAGGGTAAATTG _B Target(s) Target(s) -0.360341085 0.90321028 -0.628257587 0.66139704 2.933317005 -0.31618873 Human target Virus_target LEUTX hvTR_Q69513 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS Strong_X_mid
ATTGTATCTAAC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.087801008 0.05235126 6.106084435 0.00302307 -0.865377123 -0.336151454 Human target Virus_target SMARCB1 hvTR_Q2HR71 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CAACGCATGCCT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -2.03842623 0.51097628 1.354700976 0.50111097 -1.988184763 -0.341075036 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 hvTR_Q2HR71 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
GATCGCATGTTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.624424336 0.40044407 0.161667261 0.93840513 0.396431551 -0.345782092 Human target Virus_target INPP4A hvTR_Q2HR71 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TAAAATTTATCA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.08880035 0.81666774 -3.067747497 0.01582874 -0.629781979 0.997690695 -0.351769659 0.911245907 Human target vIRF2_VP64_pos_ctrl HSF1 core_VP64_a LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTCACGACAAGA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.963899763 7.63E-05 5.256478517 2.17E-05 0.793255316 0.997690695 -0.359903757 0.843761696 Virus_target Virus_target hvTR_G3G929 hvTR_Q2HR71 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
GAGATTGTGTCC _A_Concatenation ATGTGGCACGAC _B Target(s) Target(s) -0.00554909 0.9798201 5.397531899 0.00017431 0.394962514 0.997690695 -0.362974985 0.8188018 Virus_target Human target hvTR_Q2HR71 ANKRD17 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC Strong_X_mid
CTACGTGGCCCC _A_Concatenation ACAACGTGCATA _B Target(s) Target(s) 0.430920452 0.0015992 -1.086534391 -1.106282821 0.997690695 -0.371166087 0.90076366 Virus_target Virus_target hvTR_Q2HR71 hvTR_P10407 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET Strong_X_strong
GATCGCATGTTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.869099538 0.00611211 -4.11735085 0.00047339 -0.80604411 0.997690695 -0.398308047 0.884436889 Human target Virus_target INPP4A hvTR_Q2HR71 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
EPICXV.47-A\u splice_ ATAACTCCACGC _b pos_ctrl_A pos_ctrl_A -0.21974086 0.76101183 -6.281369053 0.00151048 -0.454561794 -0.424127461 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl 3x_vIRF2_core core_VP64_a GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation GCAACGAGGGTC _B Target(s) Target(s) 1.128526586 5.43E-07 -3.886049763 0.00162533 -1.657071747 0.968300879 -0.453067541 0.90076366 Virus_target Human target hvTR_Q2HR71 QRICH2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ Strong_X_mid
GAGACAGCTCTC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.081207344 0.9335834 -0.317415485 0.8725678 -2.686682065 -0.457430856 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 hvTR_Q2HR71 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
GAGATTGTGTCC _A_Concatenation GGGAGTATGTCG _B Target(s) neg_ctrl -0.834909843 0.53101873 -1.472381741 0.65881785 -2.079525128 -0.460070369 Virus_target neg_ctrl hvTR_Q2HR71 Random_4 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS Strong_X_random
Joint_1_A/u splice_ GAGATTGTGTCC _b neg_ctrl Target(s) 0.098007972 0.81690552 2.382205906 -0.493379806 0.997690695 -0.481056749 0.805740236 neg_ctrl Virus_target Joint_1 hvTR_Q2HR71 GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Joint_X_strong
ACTTGATGGTTT _A_Concatenation AGTTAGGCTCTT _B Target(s) Target(s) -0.322296954 0.06488456 3.557915354 -1.277257877 0.983636094 -0.491245385 0.855822345 Virus_target Human target hvTR_Q2HR73 TOX4 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV mid_X_mid
Joint_ 2_A/u splice_ ATGACCCCTTGT _b neg_ctrl Target(s) 1.730229933 0.0030808 -3.967531682 0.13991271 -4.345904336 -0.501470332 neg_ctrl Human target Joint_2 LEUTX GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Joint_X_strong
CGCCGGCATACA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.08342302 0.94819089 0.793507026 0.66003866 -1.118935679 -0.523927823 Virus_target Human target cds_NC_030656.1_cds_YP_009268729.1_14 LEUTX DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ACTTGATGGTTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.689560504 1.76E-14 3.691474516 0.00947379 1.385903403 0.934700311 -0.52795616 0.78871534 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q2HR73 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATGACCCCTTGT _A_Concatenation AATGGCACTAGC _B Target(s) Target(s) -0.088918236 0.97418471 0.215391642 5.221163403 -0.554234082 Human target Virus_target LEUTX cds_NC_024711.1_cds_YP_009052522.2_44 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR Strong_x_weak
CTACGTGGCCCC _A_Concatenation AATGGCACTAGC _B Target(s) Target(s) 1.070267369 0.3372476 3.038721408 0.07417127 -5.616861225 -0.566896372 Virus_target Virus_target hvTR_Q2HR71 cds_NC_024711.1_cds_YP_009052522.2_44 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR Strong_x_weak
CTACGTGGCCCC _A_Concatenation TTCTAATTTCCT _B Target(s) Target(s) -1.208628829 0.62724977 0.86836222 -1.932944605 -0.568297856 Virus_target Human target hvTR_Q2HR71 WDR76 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS Strong_X_strong
CCCGTAGGGGCT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 2.737695209 0.00214501 -4.511290649 0.08521419 3.002873982 -0.572720507 Human target Human target TET3 LEUTX APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ATGACCCCTTGT _A_Concatenation CATTAGTCCCGC _B Target(s) Target(s) 1.072168252 0.01046123 -2.533877112 0.18368446 -3.851308885 -0.577423979 Human target Virus_target LEUTX hvTR_P06930 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF Strong_X_strong
TACATGAGGCAC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.402852754 0.51381092 7.307747202 -0.138343082 0.998501722 -0.609543754 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 hvTR_Q2HR71 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CATCTGAGTCGT _A_Concatenation AGAGTGCGCTCT _B pos_ctrl_A neg_ctrl -0.538858599 0.00021163 2.314796768 0.05613091 1.925242388 0.786741685 -0.615744855 0.78871534 vIRF2_VP64_pos_ctrl neg_ctrl RWY_VP16_vIRF2 Random_1 GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VIRF2_VP64_pos ctrl_x_random
Joint_ 3_A/u splice_ GAGATTGTGTCC _b neg_ctrl Target(s) 0.914604159 0.22734303 4.815385865 0.02204107 -1.576672657 -0.626241051 neg_ctrl Virus_target Joint_3 hvTR_Q2HR71 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Joint_X_strong
ATGACCCCTTGT _A_Concatenation AATCAACGAGCA _B Target(s) Target(s) -5.669448975 4.097397656 0.01063608 -1.385614368 -0.65097305 Human target Human target LEUTX C2orf81 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE Strong_X_strong
Joint_ 3_A/u splice_ ATAACTCCACGC _b neg_ctrl pos_ctrl_A -0.39349819 0.31287443 -3.508590097 0.03215821 -0.765871365 0.997690695 -0.657046877 0.766038698 neg_ctrl vIRF2_VP64_pos_ctrl Joint_3 core_VP64_a GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Linker_X_vIRF2\u VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation AAAACAAGCATT _B pos_ctrl_A Target(s) 1.804487019 0.39131086 -1.73865155 -2.733800907 -0.675452747 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_030656.1_cds_YP_009268803.1_88 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS VIRF2_VP64_pos ctrl_x_weak
ATAACTCCACGC _A_Concatenation CCCCTTAGGAAT _B pos_ctrl_A Target(s) 0.839023641 0.8027561 2.414347173 0.23997073 -4.516383077 -0.757921072 vIRF2_VP64_pos_ctrl Human target core_VP64_a SYNE1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD vIRF2_VP64_pos_ctrl_X_mid
GGAGGCACATCG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.059934018 0.91324404 4.364228877 0.04515966 1.266872898 0.976975145 -0.765924102 0.732968602 Virus_target Virus_target hvTR_Q87042 hvTR_Q2HR71 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TACATGAGGCAC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 2.672962453 0.25403675 -0.937537655 0.68844106 -2.340182969 -0.797955314 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 Random_1 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TAAAATTTATCA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.269060835 0.43839885 1.225308514 0.44919764 -0.508753594 0.997690695 -0.808403392 0.725625971 Human target Virus_target HSF1 hvTR_Q2HR71 LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTACGTGGCCCC _A_Concatenation CTCGCACCGAGG _B Target(s) Target(s) 0.550957365 0.01910255 1.319338256 0.51411622 -1.516019058 0.944312715 -0.817640401 0.732968602 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN Strong_X_mid
AGCCAGGTCACC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 2.540194857 0.33149001 4.962885921 0.01226596 -1.9435047 -0.824936823 Human target Virus_target MESP1 hvTR_Q2HR71 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATAACTCCACGC _A_Concatenation TGCCAGACGGTC _B pos_ctrl_A Target(s) -2.04917354 0.04492916 -4.213686875 0.01328684 -2.972487272 -0.826380727 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P10541 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation Joint_ 2_B Target(s) neg_ctrl -0.211928221 0.65898029 -5.867676393 1.27E-05 -0.775452408 0.997690695 -0.83154227 0.688672326 Virus_target neg_ctrl hvTR_Q2HR73 Joint_2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA Mid_X_joint
AGTTAGGCTCTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.723996645 0.5677544 4.116801314 3.312842368 -0.846634177 Human target neg_ctrl TOX4 Random_1 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGATTGTGTCC _A_Concatenation CGATGGCGGAAT _B Target(s) Target(s) 1.146138953 0.07185248 3.207461799 0.37526174 -2.996809762 -0.85952028 Virus_target Human target hvTR_Q2HR71 KIAA1586 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG Strong_x_weak
GGCCTTTCGGGC _A_Concatenation ATGACCCCTTGT _B neg_ctrl Target(s) -0.400541896 0.04573484 4.381584373 0.06270338 0.564111354 0.997690695 -0.875074478 0.592109514 neg_ctrl Human target Random_3 LEUTX GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Random_x_strong
ACTTGATGGTTT _A_Concatenation EPICXV.47_B Target(s) pos_ctrl_A 0.494321833 0.17175141 -4.44439168 0.00263683 -0.612752693 0.997690695 -0.878427045 0.788744967 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR73 3x_vIRF2_core RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GCAGATTCGAAT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 2.120648797 8.34E-10 -3.902420942 0.12217257 -1.466572137 -0.885948177 Human target Virus_target AK9 hvTR_Q2HR71 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTACGTGGCCCC _A_Concatenation GCAGATTCGAAT _B Target(s) Target(s) 0.811892488 0.62420761 -3.7416591 0.04116014 -1.846880318 -0.909683882 Virus_target Human target hvTR_Q2HR71 AK9 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP Strong_X_mid
CTACGTGGCCCC _A_Concatenation TGGCCCCCAGTT _B Target(s) Target(s) -0.494669917 0.9128227 -6.429569599 0.05144176 -2.913507046 -0.916480965 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q3I8P6 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN Strong_x_weak
ATGACCCCTTGT _A_Concatenation CGCCGGCATACA _B Target(s) Target(s) -3.687054174 0.01760626 -3.60411821 -4.979090518 -0.947124322 Human target Virus_target LEUTX cds_NC_030656.1_cds_YP_009268729.1_14 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI Strong_X_strong
AAGACCGGTGCC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.085516966 0.94114356 2.783741197 0.13013389 -2.2657589 -0.993498609 Virus_target Virus_target hvTR_P08392 hvTR_Q2HR71 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTACGTGGCCCC _A_Concatenation ATTGTATCTAAC _B Target(s) Target(s) 0.590080729 4.02E-05 1.990784051 0.07667512 -0.443107974 0.997690695 -1.004313695 0.719038091 Virus_target Human target hvTR_Q2HR71 SMARCB1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD Strong_X_mid
AATCAACGAGCA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.980585546 0.61505192 3.899811011 0.00628323 2.064058395 -1.006704138 Human target Human target C2orf81 LEUTX SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ACTTGATGGTTT _A_Concatenation TCTGTACCAACG _B Target(s) Target(s) -0.11382315 0.8649368 2.678961277 0.08025707 -0.607800977 0.997690695 -1.010498117 0.757792132 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q6XA69 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY Mid_X_unknown
GAGATTGTGTCC _A_Concatenation TGCCAGACGGTC _B Target(s) Target(s) 0.826626805 0.01801161 -5.188256674 8.78E-05 -2.492841999 -1.031494173 Virus_target Virus_target hvTR_Q2HR71 hvTR_P10541 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG Strong_X_strong
ACTTGATGGTTT _A_Concatenation GTACATACCGAA _B Target(s) Target(s) 2.774341471 0.11494945 3.918909602 0.04835416 3.140844427 -1.040385367 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE mid_X_mid
Stop-1_A/u splice_ AGAGTGCGCTCT _b neg_ctrl neg_ctrl -0.378530815 0.9237149 -6.461330284 0.00909169 -4.991686999 -1.040751749 neg_ctrl neg_ctrl Stop_1 Random_1 ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Termination_X_random
AGAGTGCGCTCT _A_Concatenation TTTAACGGATTG _B neg_ctrl Target(s) -0.553050077 0.06230516 4.271412911 0.07496967 -0.342746968 0.997690695 -1.056103356 0.74332815 neg_ctrl Human target Random_1 TWIST2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN Random_x_mid
AGCCAGGTCACC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.262070012 0.00738617 5.693277462 0.00384211 -1.231121382 0.987041604 -1.068983275 0.651954269 Human target Virus_target MESP1 hvTR_Q2HR73 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATAACTCCACGC _A_Concatenation AATCAACGAGCA _B pos_ctrl_A Target(s) -1.183345249 0.3372476 -3.206646883 0.12067699 -4.733725612 -1.073277813 vIRF2_VP64_pos_ctrl Human target core_VP64_a C2orf81 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE VIRF2_VP64_pos strong_ctrl_x_strong
ATGACCCCTTGT _A_Concatenation TATTAAAAGTGC _B Target(s) Target(s) -1.698386659 0.24013675 1.752334443 0.46758098 -0.707224663 -1.077559948 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_202 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI Strong_x_weak
AAGGGTAAATTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.642548493 0.45084025 1.649250306 0.59759538 -3.061227687 -1.098696638 Virus_target Human target hvTR_Q69513 LEUTX NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GTCACCCTCCTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.481471862 0.47572323 -5.43138779 0.05047949 -3.061227687 -1.098696638 Human target Virus_target E2F5 hvTR_Q2HR73 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TGGCCCCCAGTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.830732458 0.5512515 -3.262305118 0.07113173 -0.082093905 -1.110130765 Virus_target Virus_target hvTR_Q3I8P6 hvTR_Q2HR71 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
TTTAACGGATTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.232194054 0.00599572 -4.702210671 0.00133574 -1.51716174 -1.122529507 Human target neg_ctrl TWIST2 Random_1 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AGAGTGCGCTCT _A_Concatenation Joint_ 1_B neg_ctrl neg_ctrl -0.690665027 0.5040267 -4.242238155 0.01469877 -3.62880795 -1.133291251 neg_ctrl neg_ctrl Random_1 Joint_1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK Random_X_joint
GAGATTGTGTCC _A_Concatenation TCAGCCGTAGGC _B Target(s) Target(s) 1.400000224 0.00086312 9.194258414 5.99E-06 -0.737175654 0.997690695 -1.134471708 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q805Y1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR Strong_X_mid
TTAGTGCCATAC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.414165855 0.93141616 5.143894465 0.07669979 -3.29232035 -1.137645961 Human target Virus_target TET3 hvTR_Q2HR71 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GACTGTGGGCGG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.23397005 0.91324404 -5.357411474 0.02354207 3.531501053 -1.146163925 Human target neg_ctrl MON2 Random_1 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
GAGATTGTGTCC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.402463196 0.82490703 -1.142345969 0.4983473 0.685612219 -1.148541528 Virus_target neg_ctrl hvTR_Q2HR71 Random_1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
GATGGAGCTACA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.720771518 0.38424667 -6.654018347 0.00092549 -1.170940493 -1.155680211 Virus_target neg_ctrl hvTR_P03209 Random_1 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
Joint_ 3_A/u splice_ ACTTGATGGTTT _b neg_ctrl Target(s) -0.312437299 0.82544101 1.293745977 0.50792395 -4.798331251 -1.164748571 neg_ctrl Virus_target Joint_3 hvTR_Q2HR73 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Joint_X_mid
ATGACCCCTTGT _A_Concatenation AATCGAGACCAG _B Target(s) Target(s) -1.876497825 0.23717823 1.885843983 0.42878603 -3.1287446 -1.170254333 Human target Virus_target LEUTX hvTR_Q8BB47 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH Strong_X_strong
ATTGTATCTAAC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.676361145 0.16619678 2.070979426 0.06790309 -0.576424272 -1.178957286 Human target Virus_target SMARCB1 hvTR_Q2HR71 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATGACCCCTTGT _A_Concatenation CCCTGCCTCGGG _B Target(s) Target(s) -4.915335629 0.00309758 0.335590485 0.88038733 -1.938327017 -1.186645099 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY Strong_X_mid
GGAGGCACATCG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.664609429 0.00443535 5.309696582 0.00156378 -1.784748802 0.953105946 -1.203388423 0.709382773 Virus_target Human target hvTR_Q87042 LEUTX RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
CAACGCATGCCT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.194983769 0.93141616 5.70838569 -3.171964635 -1.216060719 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 Random_1 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
GAGATTGTGTCC _A_Concatenation TGGCCCCCAGTT _B Target(s) Target(s) -0.633434576 0.92885234 -6.079958704 0.11085332 1.349299666 -1.218768567 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q3I8P6 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN Strong_x_weak
TAAGATTCAACG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.618316917 0.37291537 -2.712224398 0.08545263 0.024823774 -1.224777916 Human target Virus_target ZNF292 hvTR_Q2HR71 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATGACCCCTTGT _A_Concatenation CTCGTTATGGCA _B Target(s) Target(s) -0.337309731 0.56627226 5.580687498 -0.574579268 0.997690695 -1.243522131 0.590310725 Human target Human target LEUTX NOVA1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF Strong_X_mid
AGCCAGGTCACC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.469802137 0.6257227 -4.588019927 0.00562514 -4.505666369 -1.249186776 Human target Virus_target MESP1 hvTR_Q2HR71 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CTACGTGGCCCC _A_Concatenation TACATGAGGCAC _B Target(s) Target(s) 0.951393057 0.07182709 -2.193355556 0.20612557 -2.912012875 0.624937505 -1.254992502 0.700713833 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL Strong_X_mid
AGAGTGCGCTCT _A_Concatenation AAGACCGGTGCC _B neg_ctrl Target(s) 0.263197881 0.48649146 4.362150732 0.04317516 -0.41567701 0.997690695 -1.260092603 0.585780662 neg_ctrl Virus_target Random_1 hvTR_P08392 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF Random_x_mid
ATCCAATCTGTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.207701323 0.93606303 -7.302124202 0.02483009 6.737934355 -1.292106645 Human target Virus_target GLI2 hvTR_Q2HR71 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GAGATTGTGTCC _A_Concatenation GGCACAGCTCCA _B Target(s) Target(s) -1.019394625 6.90E-06 -5.337569441 0.00051097 -1.344303758 0.976975145 -1.30506071 0.590310725 Virus_target Human target hvTR_Q2HR71 DTX2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG Strong_x_unknown
ACTTGATGGTTT _A_Concatenation TAAAATTTATCA _B Target(s) Target(s) -0.236987671 0.72546447 -2.808554146 0.03117913 0.462950466 0.997690695 -1.305537631 0.683950209 Virus_target Human target hvTR_Q2HR73 HSF1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY mid_X_mid
AGCCAGGTCACC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.071350579 4.95E-17 -0.519895018 -0.560485241 0.997690695 -1.31982972 0.581500394 Human target vIRF2_VP64_pos_ctrl MESP1 core_VP64_a GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation ACTTTAGTAACA _B Target(s) Target(s) -0.59868988 0.16937431 1.968833576 0.23689969 -0.098878819 -1.325485885 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q69113 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES Strong_X_strong
AGAGTGCGCTCT _A_Concatenation ATGACCCCTTGT _B neg_ctrl Target(s) -0.051339931 0.95452661 -0.842480429 0.57034604 0.040324128 -1.345280033 neg_ctrl Human target Random_1 LEUTX CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Random_x_strong
ATAACTCCACGC _A_Concatenation AATGGCACTAGC _B pos_ctrl_A Target(s) 0.003300817 0.99110789 1.956345028 0.36653395 -0.188317863 0.998501722 -1.358020785 0.548443218 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_024711.1_cds_YP_009052522.2_44 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR VIRF2_VP64_pos ctrl_x_weak
ATGACCCCTTGT _A_Concatenation TAAGATTCAACG _B Target(s) Target(s) 2.993728693 0.12001523 1.721485604 0.2646259 0.861736055 -1.359126778 Human target Human target LEUTX ZNF292 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP Strong_X_strong
CATTAGTCCCGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.404938826 1.616105388 0.22849993 3.419125043 -1.370894521 Virus_target Human target hvTR_P06930 LEUTX AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
CTCCATTAATGA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.501919533 0.61505192 3.854044641 0.15241597 0.655071563 0.997690695 -1.379828556 0.641868474 Virus_target Virus_target hvTR_Q5IXR3 hvTR_Q2HR73 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
ACTTTAGTAACA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.485823829 0.7246252 3.424520929 0.02230835 -1.524999847 -1.380908539 Virus_target Virus_target hvTR_Q69113 hvTR_Q2HR71 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TTCTAATTTCCT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.320473009 0.45084025 -2.955972296 0.23767729 -1.759217618 -1.392957897 Human target Virus_target WDR76 hvTR_Q2HR71 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ACTTGATGGTTT _A_Concatenation CCCTATGTTCTA _B Target(s) pos_ctrl_A -0.57413273 0.12232448 -1.938414189 0.3017597 -1.039330028 0.997690695 -1.403891342 0.535512412 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR73 core_VP64_b RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CATGCTAACACC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.914882633 0.64865022 1.432179596 0.52840079 -3.156853676 -1.404876493 Virus_target vIRF2_VP64_pos_ctrl hvTR_F5HCV3 core_VP64_a TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation ACTCGTGTTGTC _B Target(s) Target(s) 0.156561328 0.68024065 -2.044018389 0.29772089 -0.862335606 0.997690695 -1.405500297 0.575478509 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q9QB97 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT Strong_X_mid
ATGACCCCTTGT _A_Concatenation ACGACTCACCGC _B Target(s) Target(s) -3.130422085 0.17986762 -3.346803733 -1.405895358 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK Strong_X_mid
CTACGTGGCCCC _A_Concatenation CGGCAATTGCTT _B Target(s) Target(s) 0.458480564 0.39898368 7.739036206 0.01385828 -2.576052844 -1.425199802 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q9QB93 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK Strong_X_mid
ATGACCCCTTGT _A_Concatenation Stop_1_B Target(s) neg_ctrl -0.71048149 0.18077802 5.762022346 0.00052905 -0.315943885 0.997690695 -1.433616011 0.485515241 Human target neg_ctrl LEUTX Stop_1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK Strong_X_termination
GAGATTGTGTCC _A_Concatenation AAGACCGGTGCC _B Target(s) Target(s) -1.502566843 0.09668495 8.608880569 -0.057058423 -1.447491287 Virus_target Virus_target hvTR_Q2HR71 hvTR_P08392 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF Strong_X_mid
ATAACTCCACGC _A_Concatenation GCAGATTCGAAT _B pos_ctrl_A Target(s) 0.46078564 0.89748071 -1.393432938 0.62046092 -5.28859147 -1.458440777 vIRF2_VP64_pos_ctrl Human target core_VP64_a AK9 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP vIRF2_VP64_pos_ctrl_X_mid
ATAACTCCACGC _A_Concatenation AGTTAGGCTCTT _B pos_ctrl_A Target(s) 0.209226693 0.81658896 -3.869650017 0.0038591 -2.047607054 -1.468468127 vIRF2_VP64_pos_ctrl Human target core_VP64_a TOX4 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV vIRF2_VP64_pos_ctrl_X_mid
GGCACAGCTCCA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.423468769 0.84967001 -5.796035749 0.01582874 -1.081408981 -1.468587795 Human target Virus_target DTX2 hvTR_Q2HR71 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
GGCCTTTCGGGC _A_Concatenation CTACGTGGCCCC _B neg_ctrl Target(s) 0.588772206 0.72546447 -2.145075882 0.29075112 -2.843661949 -1.472764939 neg_ctrl Virus_target Random_3 hvTR_Q2HR71 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Random_x_strong
ATAACTCCACGC _A_Concatenation TATCGCAAGAAC _B pos_ctrl_A neg_ctrl -0.222624222 0.90519313 -5.853449744 0.13957468 -0.593060379 -1.48123928 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Random_5 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV VIRF2_VP64_pos ctrl_x_random
CTACGTGGCCCC _A_Concatenation ACTCGTGTTGTC _B Target(s) Target(s) -0.424776086 0.5677544 0.068952382 0.96542848 -5.799855803 -1.504025224 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q9QB97 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT Strong_X_mid
GAGACAGCTCTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.783689814 0.6161502 -1.323523101 0.61908967 -1.902982934 -1.511914004 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 LEUTX LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CTCACGACAAGA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.754996002 0.18334753 -1.252801728 0.43792599 -0.77340744 -1.526512169 Virus_target Virus_target hvTR_G3G929 hvTR_Q2HR73 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
AATGGCACTAGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.462215585 0.00037251 6.14117484 -0.423953789 0.997690695 -1.534893921 0.4051583 Virus_target Virus_target cds_NC_024711.1_cds_YP_009052522.2_44 hvTR_Q2HR73 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
TATTAAAAGTGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 2.122770816 0.02838239 -5.2194232 0.17840605 -3.467551469 -1.538004976 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_202 Random_1 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
AGCCGTGCTTGA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.748251812 0.0007241 5.002751278 0.0029459 -0.714227476 0.997690695 -1.543872779 0.446668493 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 core_VP64_a GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation GAGACAGCTCTC _B neg_ctrl Target(s) 0.126940751 0.92929931 -4.749550792 -6.860744528 -1.564192362 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM Random_x_mid
ATAACTCCACGC _A_Concatenation ACAACGTGCATA _B pos_ctrl_A Target(s) 0.882671467 0.25462682 2.657280244 3.254550073 -1.565592784 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P10407 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET VIRF2_VP64_pos strong_ctrl_x_strong
GAGATTGTGTCC _A_Concatenation TCTTAGAGGGTG _B Target(s) Target(s) 0.39830039 0.27923268 1.601564011 0.692935403 -1.570919949 Virus_target Human target hvTR_Q2HR71 TRO PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL Strong_X_strong
CATTGTTTAAAT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.368056682 0.61264864 7.236679287 0.00064757 -1.718382566 -1.582944188 Human target Virus_target POLR1E hvTR_Q2HR71 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Unknown_X_strong
EPICXV.47-A\u splice_ AGAGTGCGCTCT _b pos_ctrl_A neg_ctrl -0.388802177 0.72633824 -6.391157771 3.10E-05 0.778466665 0.997690695 -1.587143842 0.472641252 vIRF2_VP64_pos_ctrl neg_ctrl 3x_vIRF2_core Random_1 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VIRF2_VP64_pos ctrl_x_random
GGCACTGTCGAG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.36305039 0.23304751 5.324118927 0.02116379 -0.586088657 0.997690695 -1.612838325 0.518376375 Virus_target Virus_target hvTR_P17147 hvTR_Q2HR73 YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
CTACGTGGCCCC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.726792141 2.29E-06 3.043343726 0.14649582 -0.443282723 0.997690695 -1.616257529 0.280957837 Virus_target Human target hvTR_Q2HR71 LEUTX TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
GAGATTGTGTCC _A_Concatenation AGCCGTGCTTGA _B Target(s) Target(s) -0.50711417 0.28757726 -4.558879821 0.01096213 0.115953096 0.998501722 -1.617280117 0.585780662 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY Strong_x_weak
CTACGTGGCCCC _A_Concatenation AAACTTTGAGCG _B Target(s) Target(s) 1.271730072 0.7442977 13.65581628 0.00015285 3.925342192 -1.621858471 Virus_target Human target hvTR_Q2HR71 ATF6B TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ Strong_X_strong
GAGATTGTGTCC _A_Concatenation CTCGCACCGAGG _B Target(s) Target(s) -0.67028454 0.07718954 0.900820929 -1.304407356 0.997690695 -1.636234168 0.590310725 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN Strong_X_mid
TCAGCCGTAGGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.42417603 0.8649368 -4.902742015 0.03818366 -4.602118995 -1.640648452 Virus_target Virus_target hvTR_Q805Y1 hvTR_Q2HR73 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
AAGCAACGATAT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 2.894668905 0.00345227 4.516546955 -3.562946978 -1.64234607 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 hvTR_Q2HR73 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CTGCCGGTTGCG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.063907298 2.04E-05 6.032562203 0.00143454 -0.604239271 0.997690695 -1.642801946 0.243715546 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 core_VP64_a LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
GGAGGCACATCG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.572873755 0.04092758 -3.479695208 0.00451841 -0.157714734 0.998501722 -1.650850724 0.445938924 Virus_target Virus_target hvTR_Q87042 hvTR_Q2HR73 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTACGTGGCCCC _A_Concatenation ACTTTAGTAACA _B Target(s) Target(s) 0.733134025 0.24325835 4.158005905 0.0451167 0.615771666 -1.651783857 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q69113 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES Strong_X_strong
CCGCCCTTATGT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.790226474 0.01206131 -4.464166686 0.0178468 -0.566136385 0.997690695 -1.660573902 0.570993876 Human target Virus_target ZFX hvTR_Q2HR71 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CACACCCGGCAG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.195109347 0.86342423 1.493292461 0.48138613 0.11866136 0.998501722 -1.670331246 0.433279853 Virus_target Human target hvTR_P19893 LEUTX VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
TACTTCTAGCGG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.784841274 0.63450624 -1.064653633 0.57110568 -3.117120781 -1.673882311 Virus_target Human target cds_NC_025350.1_cds_YP_009094080.1_7 LEUTX TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
ACTTGATGGTTT _A_Concatenation ATGACAAAACGA _B Target(s) Target(s) 0.913056242 0.01809398 0.526036262 0.77751675 0.499158773 0.997690695 -1.687160384 0.585780662 Virus_target Virus_target hvTR_Q2HR73 cds_NC_025350.1_cds_YP_009094074.1_1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL Mid_X_strong
ATAACTCCACGC _A_Concatenation GAGTGACCTATT _B pos_ctrl_A Target(s) 0.110313531 0.78371714 -4.560661081 0.00277922 -0.449309135 0.997690695 -1.687879513 0.415099063 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI VIRF2_VP64_pos ctrl_x_unknown
TTTAACGGATTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.169013307 0.00229408 7.330801724 1.10544762 -1.688418597 Human target vIRF2_VP64_pos_ctrl TWIST2 core_VP64_a SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation CCAGACGGTCTG _B Target(s) Target(s) 0.328468287 0.6679075 -1.436414283 0.20783496 1.225878654 0.983636094 -1.707489963 0.361838437 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN Strong_x_weak
ACTTGATGGTTT _A_Concatenation CATGCTAACACC _B Target(s) Target(s) 1.710593625 3.78E-75 4.657317809 1.13E-06 0.039457423 0.998501722 -1.729358705 0.405632218 Virus_target Virus_target hvTR_Q2HR73 hvTR_F5HCV3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL Mid_X_strong
Joint_ 2_A/u splice_ ACTTGATGGTTT _b neg_ctrl Target(s) -0.461872651 0.80545181 -5.383685315 0.15616264 0.40853789 -1.730223487 neg_ctrl Virus_target Joint_2 hvTR_Q2HR73 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Joint_X_mid
ACGACTCACCGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.47526773 0.53101873 -3.49903507 0.24883971 -3.458910944 -1.738847318 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 LEUTX PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CCCCTTAGGAAT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 4.659860773 0.0188851 -3.173081868 0.36653395 -3.651954814 -1.739646685 Human target Virus_target SYNE1 hvTR_Q2HR71 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
GAGATTGTGTCC _A_Concatenation TACTTCTAGCGG _B Target(s) Target(s) 0.051776607 0.9854934 1.039707877 0.6950775 -5.511509453 -1.758609214 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094080.1_7 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ Strong_x_weak
ATAACTCCACGC _A_Concatenation ACTTTAGTAACA _B pos_ctrl_A Target(s) 0.166828144 0.82429662 -0.400599966 0.85606566 1.0459779 0.997690695 -1.76431366 0.557484003 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q69113 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES VIRF2_VP64_pos strong_ctrl_x_strong
AGAGTGCGCTCT _A_Concatenation TATCGCAAGAAC _B neg_ctrl neg_ctrl 0.077618259 0.93141616 5.211493031 0.07650549 -6.43656676 -1.77418533 neg_ctrl neg_ctrl Random_1 Random_5 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV Random_x_random
GCGAGGCAGACC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.118132135 0.29193745 -4.7347035 0.14027063 -3.096955612 -1.804408019 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 hvTR_Q2HR73 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
TATCGCAAGAAC _A_Concatenation ATAACTCCACGC _B neg_ctrl pos_ctrl_A -0.893661978 0.01058498 2.820458751 0.06099515 0.805997478 0.997690695 -1.810729438 0.280364708 neg_ctrl vIRF2_VP64_pos_ctrl Random_5 core_VP64_a PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Random_X_vIRF2\u VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation Stop_2_B Target(s) neg_ctrl -0.175960152 0.9238208 5.537449911 0.16379541 -3.721847523 -1.81600017 Human target neg_ctrl LEUTX Stop_2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE Strong_X_termination
ACTTGATGGTTT _A_Concatenation GATCGCATGTTC _B Target(s) Target(s) 0.574415704 0.09356214 -6.589762935 1.01E-07 -0.343454987 0.997690695 -1.834572976 0.37951222 Virus_target Human target hvTR_Q2HR73 INPP4A RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER Mid_X_strong
CTACGTGGCCCC _A_Concatenation AGCCGTGCTTGA _B Target(s) Target(s) 0.77193196 0.78044287 -3.356763777 0.12707002 -1.059465518 -1.84197453 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY Strong_x_weak
EPICXV.47-A\u splice_ ATGACCCCTTGT _b pos_ctrl_A Target(s) -2.931314649 0.43839885 -2.301199062 0.28348131 -1.093673031 -1.870898068 vIRF2_VP64_pos_ctrl Human target 3x_vIRF2_core LEUTX GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VIRF2_VP64_pos strong_ctrl_x_strong
GACTGTGGGCGG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.835179305 0.86010689 -3.385362422 -3.580736385 -1.873576209 Human target vIRF2_VP64_pos_ctrl MON2 core_VP64_a TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation AATCGAGACCAG _B Target(s) Target(s) -2.308340043 0.37525592 -0.076247704 -3.775248939 -1.874300662 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q8BB47 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH Mid_X_strong
GAGATTGTGTCC _A_Concatenation AAAACAAGCATT _B Target(s) Target(s) -1.102525805 0.66445658 -5.643740376 0.04032533 -0.797450345 -1.875352064 Virus_target Virus_target hvTR_Q2HR71 cds_NC_030656.1_cds_YP_009268803.1_88 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS Strong_x_weak
GGAGGCACATCG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.144870239 0.0476953 4.542389949 0.03932477 -0.703453694 0.997690695 -1.921568235 0.447639814 Virus_target neg_ctrl hvTR_Q87042 Random_1 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation CACACCCGGCAG _B pos_ctrl_A Target(s) 1.042063125 3.86E-06 6.880474115 5.64E-11 -0.376437208 0.997690695 -1.924257012 0.280453332 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P19893 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation GGCACAGCTCCA _B Target(s) Target(s) 0.104819291 0.86342423 -1.496750801 0.3793835 -0.191747038 0.998501722 -1.942893444 0.509470315 Virus_target Human target hvTR_Q2HR73 DTX2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG Mid_X_unknown
GAGATTGTGTCC _A_Concatenation AGTTGCTTTGCT _B Target(s) neg_ctrl 1.204079194 0.35721727 -5.634491704 0.02709971 -2.440687278 -1.952353931 Virus_target neg_ctrl hvTR_Q2HR71 Random_2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM Strong_X_random
ATGACAAAACGA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.351001561 0.71346073 -3.61100703 0.07226686 0.92028625 0.997690695 -1.956465917 0.360767917 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094074.1_1 hvTR_Q2HR71 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATAACTCCACGC _A_Concatenation Joint_ 1_B pos_ctrl_A neg_ctrl -0.261725403 0.76101183 -3.972751417 0.00156378 -0.119908033 0.998501722 -1.974284598 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Joint_1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK VIRF2_VP64_pos ctrl X joint
AGAGTGCGCTCT _A_Concatenation CGGCAATTGCTT _B neg_ctrl Target(s) -1.507279606 0.27923268 -6.668416317 0.02011974 -4.09003386 -1.976355916 neg_ctrl Virus_target Random_1 hvTR_Q9QB93 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK Random_x_mid
CAGTGACATAGT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.406085157 0.37291537 3.528871636 0.01939364 3.963397906 -1.97969583 Human target Human target EHMT2 LEUTX AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AATCAACGAGCA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.317086427 0.81690552 -3.834792733 0.02550846 -0.292858768 -1.982265083 Human target vIRF2_VP64_pos_ctrl C2orf81 core_VP64_a SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation ACCAGTATTTAA _B Target(s) Target(s) 0.870290428 0.04649857 6.057755475 0.00040535 0.916883495 0.997690695 -1.989612997 0.213367908 Virus_target Human target hvTR_Q2HR73 ARNTL2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL Mid_X_strong
GTCACCCTCCTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 2.501462385 0.49700987 1.687073928 0.53133427 4.260069206 -1.991590924 Human target Human target E2F5 LEUTX DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGAGTGCGCTCT _A_Concatenation ACGACTCACCGC _B neg_ctrl Target(s) -0.874822372 0.27923268 -4.226350992 0.11075086 -2.71536985 -1.993643846 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK Random_x_mid
CATCTCACCTAG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.650312974 0.00010198 6.428866094 0.00108214 -1.163978859 -2.014637743 Virus_target Virus_target hvTR_P03206 hvTR_Q2HR71 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTGGTAGTTGTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.182432988 0.09988704 2.407166562 0.33543664 -5.138530846 -2.030076169 Virus_target vIRF2_VP64_pos_ctrl hvTR_F6KS47 core_VP64_a SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
GCAACGAGGGTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.715265396 0.36968355 -0.815252808 -4.908479493 -2.033995172 Human target neg_ctrl QRICH2 Random_1 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TCTGTACCAACG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.011782545 0.27923268 -5.827280848 0.00104225 0.480938784 -2.038462525 Virus_target Human target hvTR_Q6XA69 LEUTX ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
GAGATTGTGTCC _A_Concatenation AATCAACGAGCA _B Target(s) Target(s) -0.937772811 6.28E-10 0.923364823 -1.264547408 0.976975145 -2.044713834 0.218195585 Virus_target Human target hvTR_Q2HR71 C2orf81 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE Strong_X_strong
GATGGAGCTACA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.824540175 0.23357818 -1.913333917 0.29772089 -0.376599198 -2.045450749 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR73 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
AGAGTGCGCTCT _A_Concatenation AGGCTTCCCGCT _B neg_ctrl Target(s) 0.786791801 0.7246252 -5.900124845 0.13219999 -4.931782681 -2.054669552 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV Random_x_mid
ATAACTCCACGC _A_Concatenation AAACTTTGAGCG _B pos_ctrl_A Target(s) 0.116399123 0.97914877 5.523543725 0.13526627 4.337398052 -2.076833276 vIRF2_VP64_pos_ctrl Human target core_VP64_a ATF6B LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation Stop_1_B Target(s) neg_ctrl 1.328434196 3.67E-15 5.450879913 0.00057184 -1.241199176 0.976975145 -2.07848395 0.166242467 Virus_target neg_ctrl hvTR_Q2HR73 Stop_1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK Mid_X_termination
CATGCTAACACC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.444373572 0.80691093 7.687319261 0.00026118 -3.795047491 -2.110304387 Virus_target Virus_target hvTR_F5HCV3 hvTR_Q2HR71 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TGATGAGCGCCC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.366385995 0.01662228 -0.409429918 0.74529559 -0.619370617 0.997690695 -2.110912066 0.260152305 Human target Virus_target EEF1D hvTR_Q2HR73 QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
ATGACAAAACGA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.146081921 0.93202829 2.191024386 0.19018098 2.554561485 -2.117730862 Virus_target neg_ctrl cds_NC_025350.1_cds_YP_009094074.1_1 Random_1 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation GCGAGGCAGACC _B pos_ctrl_A Target(s) -0.781894849 0.09842699 -6.455380163 3.89E-06 -3.024620924 -2.11829931 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD VIRF2_VP64_pos ctrl_x_unknown
CTACGTGGCCCC _A_Concatenation TTTAACGGATTG _B Target(s) Target(s) 0.286615487 0.32367186 6.31278232 0.00033095 0.176741699 0.998501722 -2.119345883 0.214928132 Virus_target Human target hvTR_Q2HR71 TWIST2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN Strong_X_mid
TCTTAGAGGGTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.536087357 0.27973193 -6.136270964 0.00036095 -3.307702299 -2.12122524 Human target neg_ctrl TRO Random_1 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AAACTTTGAGCG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.08222404 0.91324404 0.344188269 0.89007548 0.966278311 0.997690695 -2.125851811 0.205419389 Human target Virus_target ATF6B hvTR_Q2HR71 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
CTACGTGGCCCC _A_Concatenation AATCGAGACCAG _B Target(s) Target(s) -0.276567538 0.8410287 0.796178088 0.77340766 -1.752742327 -2.129471962 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q8BB47 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH Strong_X_strong
ATGACCCCTTGT _A_Concatenation AGCCGTGCTTGA _B Target(s) Target(s) 1.073034753 0.65676761 4.400296147 -2.146092094 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY Strong_x_weak
AGCCATAAACCT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.703893146 0.2062555 10.18787157 0.00011919 -0.256995675 0.998501722 -2.147117461 0.411526549 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR71 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GAGATTGTGTCC _A_Concatenation AGCCATAAACCT _B Target(s) Target(s) -0.486899537 0.5449371 -5.564379368 0.00710378 -4.16221969 -2.148279884 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03209 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ Strong_X_mid
ACAACGTGCATA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.387306973 0.00125477 3.709797297 0.0478769 -1.353535051 -2.155272728 Virus_target neg_ctrl hvTR_P10407 Random_1 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CATGCTAACACC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.423524178 0.29119771 3.377280653 0.03336571 0.937763798 -2.156749779 Virus_target Human target hvTR_F5HCV3 LEUTX TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
TACTTCTAGCGG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -1.652334024 0.61787605 5.512740473 0.00109019 2.922156751 -2.161319299 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094080.1_7 hvTR_Q2HR71 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
ATAACTCCACGC _A_Concatenation TAAGATTCAACG _B pos_ctrl_A Target(s) -0.418214463 0.76970415 -1.702104054 0.49899652 1.3428249 -2.1652639 vIRF2_VP64_pos_ctrl Human target core_VP64_a ZNF292 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP VIRF2_VP64_pos strong_ctrl_x_strong
TCTTGGTTTTGA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.306373142 0.77086475 -4.778064282 0.02389526 0.65423091 0.997690695 -2.17056458 0.218195585 Human target Virus_target ZNF597 hvTR_Q2HR73 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GAGATTGTGTCC _A_Concatenation GGAGGCACATCG _B Target(s) Target(s) -0.62525844 0.75592103 -3.708128685 0.00448128 -5.115835852 -2.170838839 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q87042 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL Strong_X_strong
GCGAGGCAGACC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.621708401 0.03731663 -0.11925813 0.96542848 -1.501399203 -2.172105589 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 hvTR_Q2HR71 QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
ATAACTCCACGC _A_Concatenation CCTAAATGCAAG _B pos_ctrl_A Target(s) 0.682806739 0.59926862 -5.906280331 0.01927929 -3.037905766 -2.190606412 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_025410.1_cds_YP_009094498.1_7 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL vIRF2_VP64_pos_ctrl_X_mid
CTACGTGGCCCC _A_Concatenation CCGCCCTTATGT _B Target(s) Target(s) 3.06726255 0.48198592 8.72095708 1.08E-06 -4.078700875 -2.204782525 Virus_target Human target hvTR_Q2HR71 ZFX TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH Strong_X_mid
AGAGTGCGCTCT _A_Concatenation TAAAATTTATCA _B neg_ctrl Target(s) -0.20300756 0.926639 -0.931926182 0.73748128 -3.393950498 -2.214449527 neg_ctrl Human target Random_1 HSF1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY Random_x_mid
AGTTGCTTTGCT _A_Concatenation AGAGTGCGCTCT _B neg_ctrl neg_ctrl -0.087536792 0.94626231 4.1949744 0.410153126 -2.22013316 neg_ctrl neg_ctrl Random_2 Random_1 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Random_x_random
GATGGAGCTACA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.887384943 0.29698201 3.296477797 0.08125073 1.702949842 -2.230260513 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR71 EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
TTTAACGGATTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.174022666 0.9335834 -4.760184858 0.03096016 -3.91081508 -2.237911962 Human target Virus_target TWIST2 hvTR_Q2HR73 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TCTTAGAGGGTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.290610473 0.00442279 -4.665711543 0.00059041 -1.837223793 -2.244081419 Human target Virus_target TRO hvTR_Q2HR71 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CATGCTAACACC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.631396667 0.14446217 2.547201248 0.15596426 -1.765278122 0.953105946 -2.271078994 0.410501623 Virus_target Virus_target hvTR_F5HCV3 hvTR_Q2HR73 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATCCAATCTGTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 2.241931715 0.33007877 -3.331293553 0.26509579 0.413339493 -2.273726897 Human target neg_ctrl GLI2 Random_1 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AAACTTTGAGCG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.096584118 0.94020164 -5.695800149 0.00855636 -2.527967424 -2.281588967 Human target Virus_target ATF6B hvTR_Q2HR73 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GATCGCATGTTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.845452262 0.38265522 -1.92334639 0.31502412 -0.709287021 -2.303356666 Human target Virus_target INPP4A hvTR_Q2HR73 NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATGACCCCTTGT _A_Concatenation GGAGGCACATCG _B Target(s) Target(s) 1.83951509 0.21129976 -0.09648376 0.95339043 3.039571002 -2.310562021 Human target Virus_target LEUTX hvTR_Q87042 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL Strong_X_strong
AGCCATAAACCT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.23329157 0.92304294 3.123626982 0.21612015 2.269054435 -2.314081628 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR71 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATCTCTCAGTTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.506326202 0.00031847 6.180269477 0.00071096 1.851584313 0.933317674 -2.314930064 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 core_VP64_a VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation CAGTGACATAGT _B pos_ctrl_A Target(s) 0.689968443 0.03846116 -6.522655629 5.25E-07 0.809519735 -2.316066855 vIRF2_VP64_pos_ctrl Human target core_VP64_a EHMT2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT vIRF2_VP64_pos_ctrl_X_mid
ATGACCCCTTGT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.984744494 0.197473 4.395721525 0.04017978 0.04216418 -2.324458443 Human target Virus_target LEUTX hvTR_Q2HR73 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GAGATTGTGTCC _A_Concatenation CTCGTTATGGCA _B Target(s) Target(s) 1.467396299 0.26992692 -6.566344077 0.0001379 -0.280644693 -2.340477918 Virus_target Human target hvTR_Q2HR71 NOVA1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF Strong_X_mid
ACTTGATGGTTT _A_Concatenation GACTGTGGGCGG _B Target(s) Target(s) 0.345234247 0.93114725 -0.923832766 0.70352371 4.582024468 -2.345754002 Virus_target Human target hvTR_Q2HR73 MON2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL Mid_X_strong
ATAACTCCACGC _A_Concatenation CCCTGCCTCGGG _B pos_ctrl_A Target(s) 1.3247522 4.75E-07 6.129584023 0.00055535 -0.182736971 0.998501722 -2.347019609 0.203901573 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY vIRF2_VP64_pos_ctrl_X_mid
GAGATTGTGTCC _A_Concatenation TAAAATTTATCA _B Target(s) Target(s) 0.380376232 0.01727004 3.792260526 0.06790309 -0.595644588 0.997690695 -2.359856949 0.146899569 Virus_target Human target hvTR_Q2HR71 HSF1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY Strong_X_mid
ATAACTCCACGC _A_Concatenation AGTTGCTTTGCT _B pos_ctrl_A neg_ctrl -0.117883532 0.91394475 2.567459099 0.98072853 -2.367654772 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Random_2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM VIRF2_VP64_pos ctrl_x_random
ATGACCCCTTGT _A_Concatenation ACCAGTATTTAA _B Target(s) Target(s) 1.26268393 0.85449119 0.73914719 -4.236506875 -2.375972335 Human target Human target LEUTX ARNTL2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL Strong_X_strong
ATGACCCCTTGT _A_Concatenation AGCCATAAACCT _B Target(s) Target(s) 2.966371354 0.17722959 -3.484670686 0.18368446 -3.768250082 -2.385046463 Human target Virus_target LEUTX hvTR_P03209 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ Strong_X_mid
AATCAACGAGCA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.93220789 0.35187283 -2.349232204 0.04528497 0.075912288 0.998501722 -2.391219803 0.383504477 Human target Virus_target C2orf81 hvTR_Q2HR73 SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
EPICXV.47-A\u splice_ GAGATTGTGTCC _b pos_ctrl_A Target(s) -1.150809606 0.22884458 0.918991272 0.6645866 -2.515006557 -2.400241664 vIRF2_VP64_pos_ctrl Virus_target 3x_vIRF2_core hvTR_Q2HR71 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VIRF2_VP64_pos strong_ctrl_x_strong
TGGCCCCCAGTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.467636579 4.10E-21 -0.238324673 0.89582846 0.355614964 0.997690695 -2.40899104 0.412491923 Virus_target Virus_target hvTR_Q3I8P6 hvTR_Q2HR73 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
AAGACCGGTGCC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.033905786 0.95058117 2.677340621 0.18898801 2.04848687 -2.416078356 Virus_target Virus_target hvTR_P08392 hvTR_Q2HR73 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CAACGCATGCCT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.507019845 0.63402488 7.341065793 0.04902037 -4.282304325 -2.425555184 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 core_VP64_a YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation ATTTGTAGACCG _B Target(s) Target(s) -0.678633553 0.00020662 1.047632389 0.60710891 -0.998476402 0.997690695 -2.441086679 0.405632218 Human target Human target LEUTX MYCN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE Strong_X_mid
ACTTGATGGTTT _A_Concatenation GGTGTCTCGATT _B Target(s) Target(s) 1.146595834 0.42503544 -4.009423488 0.06073326 -4.098673044 -2.444445304 Virus_target Human target hvTR_Q2HR73 E2F5 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV Mid_X_strong
TAAGATTCAACG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.651548081 0.73379786 -2.854050345 -0.752642529 -2.458418062 Human target Virus_target ZNF292 hvTR_Q2HR71 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
Stop 2_A splice ACTTGATGGTTT B neg_ctrl Target(s) -0.799530923 0.00019667 2.376046442 0.14649582 -0.940138832 0.997690695 -2.467030038 0.205658608 neg_ctrl Virus_target Stop_2 hvTR_Q2HR73 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Termination_X_mid
ACTTGATGGTTT _A_Concatenation ACTCGTGTTGTC _B Target(s) Target(s) 0.504417835 0.60751198 5.025130913 3.128750176 -2.478619765 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q9QB97 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT mid_X_mid
ATAACTCCACGC _A_Concatenation TCAGCCGTAGGC _B pos_ctrl_A Target(s) -0.925985484 0.0062098 5.631731145 -1.521299279 -2.500327335 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q805Y1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR vIRF2_VP64_pos_ctrl_X_mid
TCTATCACAACT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.928985807 0.00599572 -3.882399261 0.02389526 -4.794463601 -2.509937684 Human target vIRF2_VP64_pos_ctrl TOX4 core_VP64_a LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation GGCACAGCTCCA _B Target(s) Target(s) -0.307615332 0.24458972 2.246974344 0.28102262 -1.584379902 0.983066962 -2.512443507 0.384778867 Virus_target Human target hvTR_Q2HR71 DTX2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG Strong_x_unknown
ACTTGATGGTTT _A_Concatenation GAGTGACCTATT _B Target(s) Target(s) -0.671129848 0.00228681 -2.182955645 0.10930697 0.091416746 -2.51732128 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI Mid_X_unknown
ATGACCCCTTGT _A_Concatenation CATGCTAACACC _B Target(s) Target(s) 0.594642942 0.81666774 3.919508603 0.01582874 -3.908267003 -2.537154046 Human target Virus_target LEUTX hvTR_F5HCV3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL Strong_X_strong
CTCGTTATGGCA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.273607922 0.23167242 5.612175114 2.32E-05 0.580736127 0.997690695 -2.540269078 0.260152305 Human target Virus_target NOVA1 hvTR_Q2HR71 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATGACCCCTTGT _A_Concatenation TATCGCAAGAAC _B Target(s) neg_ctrl -2.714682385 0.32731981 4.473977221 -2.547431136 Human target neg_ctrl LEUTX Random_5 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV Strong_X_random
ATAACTCCACGC _A_Concatenation GATGGAGCTACA _B pos_ctrl_A Target(s) 0.495464255 0.64978462 -2.005317482 0.29907275 -2.752933464 -2.560168569 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P03209 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation TGATGAGCGCCC _B Target(s) Target(s) 0.075230765 0.89748071 4.653745674 0.00970884 -0.393787656 0.997690695 -2.56223596 0.384778867 Virus_target Human target hvTR_Q2HR73 EEF1D RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL Mid_X_weak
AATCGATCCTAC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.079841774 0.92647166 -2.290427844 0.23568834 -0.941067789 -2.562480729 Human target Virus_target PRDM10 hvTR_Q2HR71 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GCAACGAGGGTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.363847444 6.90E-42 6.402369057 2.81E-05 -0.633380028 0.997690695 -2.570070018 0.218195585 Human target Virus_target QRICH2 hvTR_Q2HR73 RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CTACGTGGCCCC _A_Concatenation AGCCAGGTCACC _B Target(s) Target(s) -0.758054252 0.72790325 -3.414493567 0.11534543 -2.690424561 -2.577403421 Virus_target Human target hvTR_Q2HR71 MESP1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK Strong_X_strong
AGGCTTCCCGCT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.513624209 0.15404651 -5.628835159 0.02654179 1.480877791 -2.595460425 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 core_VP64_a VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
Stop 2_A splice AGAGTGCGCTCT B neg_ctrl neg_ctrl 0.963586094 0.63402488 -2.018303147 0.41209595 -4.2557923 -2.616565868 neg_ctrl neg_ctrl Stop_2 Random_1 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Termination_X_random
ATGACCCCTTGT _A_Concatenation AAGCAACGATAT _B Target(s) Target(s) -2.46187331 0.36954563 -3.202659801 0.22915921 -4.271318662 -2.633544451 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL Strong_X_mid
ATGACCCCTTGT _A_Concatenation Joint_ 3_B Target(s) neg_ctrl -0.801163172 0.03425538 5.613509686 0.02064758 -0.178426148 -2.671038686 Human target neg_ctrl LEUTX Joint_3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD Strong-X-joint
ATGTGGCACGAC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.592403851 0.47019918 -1.998318256 0.51665233 -0.475536434 -2.681127964 Human target vIRF2_VP64_pos_ctrl ANKRD17 core_VP64_a QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATCCAATCTGTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.495489344 0.43839885 0.616216156 0.78985304 0.024706282 -2.684664195 Human target Virus_target GLI2 hvTR_Q2HR71 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CATCTCACCTAG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.277546042 0.68922551 -1.309807802 0.41356717 -5.86947971 -2.686239646 Virus_target neg_ctrl hvTR_P03206 Random_1 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GAGATTGTGTCC _A_Concatenation TCTTGGTTTTGA _B Target(s) Target(s) 0.811215165 0.51346664 -3.535184037 0.22849993 2.486192349 -2.697595489 Virus_target Human target hvTR_Q2HR71 ZNF597 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL Strong_X_strong
CACACCCGGCAG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.909237772 0.09988704 -0.191391092 0.92368263 1.293548845 0.983636094 -2.715470866 0.159618156 Virus_target Virus_target hvTR_P19893 hvTR_Q2HR73 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GAGATTGTGTCC _A_Concatenation CATTAGTCCCGC _B Target(s) Target(s) -0.62190184 0.02410705 2.43158143 0.1714026 -0.346047949 0.997690695 -2.743563182 0.068273719 Virus_target Virus_target hvTR_Q2HR71 hvTR_P06930 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF Strong_X_strong
CTACGTGGCCCC _A_Concatenation GATCGCATGTTC _B Target(s) Target(s) 2.917691612 0.02319299 -1.727426017 -4.587976911 -2.755265402 Virus_target Human target hvTR_Q2HR71 INPP4A TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER Strong_X_strong
CTACGTGGCCCC _A_Concatenation GATGGAGCTACA _B Target(s) Target(s) -1.280498165 0.04259596 -5.020768955 0.01271193 1.389927861 -2.759970942 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03209 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF Strong_X_strong
CACACCCGGCAG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.342490136 0.46615283 -3.051442368 0.02709971 -0.810071219 0.997690695 -2.785211575 0.280453332 Virus_target vIRF2_VP64_pos_ctrl hvTR_P19893 core_VP64_a VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation CTCCATTAATGA _B Target(s) Target(s) 1.188324125 4.48E-11 -4.295331756 0.02754099 -0.485951148 0.997690695 -2.803612047 0.258683222 Human target Virus_target LEUTX hvTR_Q5IXR3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV Strong_x_weak
AGTTAGGCTCTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.23280273 0.82490703 -2.802725841 0.16145271 -3.271339259 -2.813173301 Human target Virus_target TOX4 hvTR_Q2HR71 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AGAGTGCGCTCT _A_Concatenation TTCTAATTTCCT _B neg_ctrl Target(s) 1.411170205 1.79E-12 5.315720744 0.00258074 0.003109429 0.998980766 -2.816877029 0.086892875 neg_ctrl Human target Random_1 WDR76 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS Random_x_strong
AGAGTGCGCTCT _A_Concatenation AAAACAAGCATT _B neg_ctrl Target(s) -0.484999467 0.82490703 -5.351655277 0.05636543 -2.781124907 -2.819684396 neg_ctrl Virus_target Random_1 cds_NC_030656.1_cds_YP_009268803.1_88 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS Random_x_weak
ACTTGATGGTTT _A_Concatenation GGGAGTATGTCG _B Target(s) neg_ctrl -0.497669976 0.16187644 4.55070641 0.12742137 0.124446763 0.998501722 -2.829345751 0.073394906 Virus_target neg_ctrl hvTR_Q2HR73 Random_4 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS Mid_X_random
AGAGTGCGCTCT _A_Concatenation CGCCGGCATACA _B neg_ctrl Target(s) 0.117584584 0.93141616 -7.003859706 1.015253663 -2.830095935 neg_ctrl Virus_target Random_1 cds_NC_030656.1_cds_YP_009268729.1_14 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI Random_x_strong
GAGATTGTGTCC _A_Concatenation CAGTGACATAGT _B Target(s) Target(s) 0.127786254 0.68918478 -1.67649194 0.36927388 -0.781371005 0.997690695 -2.831372634 0.161343204 Virus_target Human target hvTR_Q2HR71 EHMT2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT Strong_X_mid
Stop-1_A/u splice_ ATAACTCCACGC _b neg_ctrl pos_ctrl_A -2.143264575 0.22130449 4.869029951 6.896949722 -2.839304524 neg_ctrl vIRF2_VP64_pos_ctrl Stop_1 core_VP64_a ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Termination_X_vIRF2\u VP64_pos_ctrl
TATCTATCGTGT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.843580843 0.10365221 -5.426290439 0.00382558 0.99635433 -2.847863449 Virus_target vIRF2_VP64_pos_ctrl hvTR_P03259 core_VP64_a RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation TTAAATGAGGGC _B Target(s) Target(s) -1.13633224 2.01E-11 -3.22787056 0.04126764 1.162386514 0.997690695 -2.85345889 0.098005281 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ mid_X_mid
CATTAGTCCCGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.048247658 0.94601266 0.934546223 0.52954752 1.771994733 -2.854624847 Virus_target Virus_target hvTR_P06930 hvTR_Q2HR71 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ACTTGATGGTTT _A_Concatenation CGATGGCGGAAT _B Target(s) Target(s) -0.308847024 0.8076353 -6.088207699 0.00494879 5.05377877 -2.860304321 Virus_target Human target hvTR_Q2HR73 KIAA1586 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG Mid_X_weak
AGAGTGCGCTCT _A_Concatenation TCTTAGAGGGTG _B neg_ctrl Target(s) -0.848396207 0.12195933 -1.383589548 0.15230837 -0.979860418 -2.866564914 neg_ctrl Human target Random_1 TRO CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL Random_x_strong
CTCCATTAATGA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 2.335009095 0.68153204 -3.748197923 0.14927325 -3.085450818 -2.898929433 Virus_target Virus_target hvTR_Q5IXR3 hvTR_Q2HR71 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
ATGACAAAACGA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.144011117 0.91324404 -3.510311754 0.03000685 -0.257512374 -2.900221639 Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094074.1_1 core_VP64_a HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ATCTCTCAGTTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.271740283 0.82433714 -0.650581306 0.76446307 0.86694187 -2.909138674 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 hvTR_Q2HR73 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ACTTGATGGTTT _A_Concatenation AGCCGTGCTTGA _B Target(s) Target(s) 0.50318991 0.78044287 -4.279880806 0.15534321 -4.265174781 -2.922804022 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY Mid_X_weak
AGAGTGCGCTCT _A_Concatenation CTGGTAGTTGTT _B neg_ctrl Target(s) -0.183699853 0.52901596 2.005003881 0.19050027 -0.831108259 0.997690695 -2.934145051 0.064747465 neg_ctrl Virus_target Random_1 hvTR_F6KS47 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT Random_x_unknown
CTACGTGGCCCC _A_Concatenation AATCAACGAGCA _B Target(s) Target(s) -0.964038414 0.04988573 -1.265804476 0.58359133 1.765071327 -2.946414793 Virus_target Human target hvTR_Q2HR71 C2orf81 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE Strong_X_strong
CTCACGACAAGA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.431113823 0.35403027 -1.386833841 0.18271773 -1.52861824 -2.951658494 Virus_target neg_ctrl hvTR_G3G929 Random_1 ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATGACCCCTTGT _A_Concatenation CTGGTAGTTGTT _B Target(s) Target(s) 1.716816365 2.73E-21 5.061750345 0.02500981 -0.54619243 0.997690695 -2.954588117 0.054928629 Human target Virus_target LEUTX hvTR_F6KS47 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT Strong_x_unknown
TTAGTGCCATAC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.967237371 0.73293852 -4.064036569 0.18034078 -4.795333136 -2.977658898 Human target Virus_target TET3 hvTR_Q2HR73 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ACTTGATGGTTT _A_Concatenation CTCCATTAATGA _B Target(s) Target(s) 1.519270995 0.00011961 -4.878494401 0.01993997 -1.742377961 -3.00218938 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q5IXR3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV Mid_X_weak
Stop 2_A splice CTACGTGGCCCC B neg_ctrl Target(s) 1.218406888 0.03952651 3.537343208 0.09700536 -0.544321389 -3.005570369 neg_ctrl Virus_target Stop_2 hvTR_Q2HR71 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Termination_X_strong
ATAACTCCACGC _A_Concatenation CCCTATGTTCTA _B pos_ctrl_A pos_ctrl_A -1.202985609 0.12526619 -5.178078371 0.0014569 -0.82052246 -3.008285978 vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_a core_VP64_b LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation CTGGTAGTTGTT _B Target(s) Target(s) -0.825142393 0.05692167 2.720373731 0.18438168 -1.48747893 0.983636094 -3.020029545 0.205419389 Virus_target Virus_target hvTR_Q2HR73 hvTR_F6KS47 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT Mid_X_unknown
ATGACCCCTTGT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.304532773 0.59969203 0.801636924 0.70172052 0.814126051 -3.037549519 Human target neg_ctrl LEUTX Random_1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
GAGATTGTGTCC _A_Concatenation CATCTCACCTAG _B Target(s) Target(s) 0.686023462 0.45505484 6.966409082 -1.759358179 0.957192732 -3.050692888 0.211428465 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03206 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA Strong_X_mid
GATCGCATGTTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.982882923 4.06E-08 4.183099174 3.89E-05 0.422317832 0.997690695 -3.052933094 0.054928629 Human target vIRF2_VP64_pos_ctrl INPP4A core_VP64_a NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation TTAAATGAGGGC _B Target(s) Target(s) -1.298638283 0.00217071 -5.705827603 0.0007807 1.354254213 0.997690695 -3.057312812 0.211428465 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ Strong_X_mid
AGTTGCTTTGCT _A_Concatenation ACTTGATGGTTT _B neg_ctrl Target(s) -1.350380742 1.40E-07 0.355584788 0.85606566 -0.293709075 -3.059777805 neg_ctrl Virus_target Random_2 hvTR_Q2HR73 DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Random_x_mid
CTACGTGGCCCC _A_Concatenation AAGCAACGATAT _B Target(s) Target(s) 1.094009999 5.52E-06 6.494286403 0.01006861 0.811125418 -3.065469968 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL Strong_X_mid
TACTTCTAGCGG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.151596557 0.938493 -7.862570519 0.00036537 -1.029667825 -3.067912365 Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094080.1_7 core_VP64_a TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
CATTGTTTAAAT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.218479038 1.21E-16 6.552754655 8.16E-10 0.345160713 0.997690695 -3.068474652 0.023419491 Human target neg_ctrl POLR1E Random_1 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
ATGACCCCTTGT _A_Concatenation AGTTGCTTTGCT _B Target(s) neg_ctrl -1.861610725 0.07185248 -4.563821822 0.00839444 -4.646610972 -3.071920647 Human target neg_ctrl LEUTX Random_2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM Strong_X_random
ACTTGATGGTTT _A_Concatenation CATCTCACCTAG _B Target(s) Target(s) -1.285502444 0.01709348 -5.095976669 0.00032685 1.103695803 0.997690695 -3.072739532 0.110683896 Virus_target Virus_target hvTR_Q2HR73 hvTR_P03206 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA mid_X_mid
CCAGACGGTCTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.615200079 0.85449119 -3.308899494 0.19711662 5.253949757 -3.076787965 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 hvTR_Q2HR71 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
AGTTAGGCTCTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.12078446 0.90192409 -7.672694834 3.95E-07 0.648914138 0.997690695 -3.077019686 0.134152071 Human target vIRF2_VP64_pos_ctrl TOX4 core_VP64_a EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation ATCTCTCAGTTC _B Target(s) Target(s) -1.811352055 0.00049548 4.069358341 -0.222888413 -3.079298905 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK Strong_X_mid
GAGATTGTGTCC _A_Concatenation CCTAAATGCAAG _B Target(s) Target(s) -1.018508068 0.80528278 -5.727663605 0.00295126 -0.404926024 -3.081335489 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025410.1_cds_YP_009094498.1_7 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL Strong_X_mid
ATAACTCCACGC _A_Concatenation ACGGCGTAAGTA _B pos_ctrl_A Target(s) -0.530880658 0.63990277 -5.489713721 0.08135871 1.725701765 -3.08417375 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_025350.1_cds_YP_009094078.1_5 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL vIRF2_VP64_pos_ctrl_X_mid
CATGCTAACACC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -2.492169409 0.00050712 -5.646672883 6.47E-05 -0.241498479 0.998501722 -3.111346665 Virus_target Virus_target hvTR_F5HCV3 hvTR_Q2HR71 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CTACGTGGCCCC _A_Concatenation GGTGTCTCGATT _B Target(s) Target(s) 0.834420633 3.08E-06 0.713809255 1.427687102 0.972639657 -3.111461165 0.083931183 Virus_target Human target hvTR_Q2HR71 E2F5 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV Strong_X_strong
ATAACTCCACGC _A_Concatenation AATCGAGACCAG _B pos_ctrl_A Target(s) 0.590019143 0.77397082 -5.10389896 0.130578301 -3.12022939 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q8BB47 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH VIRF2_VP64_pos strong_ctrl_x_strong
GAGATTGTGTCC _A_Concatenation GCGAGGCAGACC _B Target(s) Target(s) 0.492449462 0.4982603 -2.902394156 0.10515713 -1.633450088 -3.126196665 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD Strong_x_unknown
CTGCCGGTTGCG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.844235861 1.89E-05 3.592881517 0.00139746 0.339318031 0.997690695 -3.129815647 0.054928629 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 hvTR_Q2HR73 LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CTACGTGGCCCC _A_Concatenation TAAAATTTATCA _B Target(s) Target(s) -0.267250699 0.48822994 -3.025519692 0.03676889 0.934511497 0.997690695 -3.136670141 0.191530223 Virus_target Human target hvTR_Q2HR71 HSF1 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY Strong_X_mid
GGTGTCTCGATT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.04415163 8.14E-07 3.394559837 0.11423327 -4.484412352 -3.158066027 Human target Virus_target E2F5 hvTR_Q2HR71 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATGACCCCTTGT _A_Concatenation CTCAGCGATATA _B Target(s) Target(s) 1.097030107 6.26E-17 2.804685847 0.16314825 -0.267587193 0.998501722 -3.186515624 0.086749068 Human target Virus_target LEUTX hvTR_Q5Y0Q4 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG Strong_X_strong
ATTTGTAGACCG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.749842212 0.09842699 -0.635733599 0.70334265 -0.125147804 0.998501722 -3.188892811 0.193267146 Human target Virus_target MYCN hvTR_Q2HR71 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ATGACCCCTTGT _A_Concatenation Joint_ 2_B Target(s) neg_ctrl 0.943193695 0.37874388 -5.349415963 0.173433765 -3.190062317 Human target neg_ctrl LEUTX Joint_2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA Strong-X-joint
ATTGTATCTAAC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.521136808 0.28403057 -5.91187531 0.00022359 -0.109334128 0.998501722 -3.192965838 0.058225543 Human target vIRF2_VP64_pos_ctrl SMARCB1 core_VP64_a VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation CTCAGCGATATA _B pos_ctrl_A Target(s) -0.127635881 0.88732067 -6.329275937 0.00062712 0.800093013 0.997690695 -3.194500067 0.174791142 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q5Y0Q4 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation ATGACCCCTTGT _B pos_ctrl_A Target(s) 1.196056268 0.00013723 4.821823203 0.00297318 -0.298519192 0.997690695 -3.210595196 0.054928629 vIRF2_VP64_pos_ctrl Human target core_VP64_b LEUTX DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation ATTTGTAGACCG _B Target(s) Target(s) -0.379243617 0.42479369 6.324512747 0.01057624 -0.774921043 0.997690695 -3.212016361 0.107214215 Virus_target Human target hvTR_Q2HR73 MYCN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE mid_X_mid
CTCGCACCGAGG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -3.323846571 0.01109663 -4.514996773 0.0362745 0.419970377 -3.225452222 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 hvTR_Q2HR73 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TTCTAATTTCCT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.667813081 0.61505192 -3.965846202 0.09576001 -1.01450497 -3.260085709 Human target Human target WDR76 LEUTX IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ATGTGGCACGAC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.135636892 8.72E-05 4.505772863 0.08025707 -5.060456231 -3.260502868 Human target Virus_target ANKRD17 hvTR_Q2HR71 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ACAACGTGCATA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.45029505 0.13870201 2.215866379 0.26366269 -0.335065007 0.997690695 -3.264892208 0.110773865 Virus_target Virus_target hvTR_P10407 hvTR_Q2HR73 RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CCCGTAGGGGCT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.089603055 0.18459129 -6.079059187 0.02861473 1.039295762 -3.265186069 Human target Virus_target TET3 hvTR_Q2HR73 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
GAGATTGTGTCC _A_Concatenation CGGCAATTGCTT _B Target(s) Target(s) -6.052597244 0.00073643 -6.828004345 0.03413661 -4.584875173 -3.265443042 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q9QB93 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK Strong_X_mid
CCAGACGGTCTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.165809915 0.04327849 1.159888611 -4.380086588 -3.270620434 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 core_VP64_a PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
ATCTCTCAGTTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.01568368 0.96871881 -5.239788742 0.00021381 0.649623342 0.997690695 -3.274385262 0.229030263 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 hvTR_Q2HR71 VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
Joint_ 3_A/u splice_ CTACGTGGCCCC _b neg_ctrl Target(s) 0.164852824 0.8912779 4.600553386 0.03687635 0.003348726 0.998980766 -3.278363081 0.131471919 neg_ctrl Virus_target Joint_3 hvTR_Q2HR71 GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Joint_X_strong
ATAACTCCACGC _A_Concatenation Stop_1_B pos_ctrl_A neg_ctrl -3.197038046 4.21E-08 -4.555037765 0.00375262 -0.173801051 0.998501722 -3.289675393 0.049877065 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Stop_1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK VIRF2_VP64_pos ctrl X termination
CTCGCACCGAGG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.45448228 0.88732067 -2.685901243 0.41999825 5.452786141 -3.290701355 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 LEUTX AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGATTGTGTCC _A_Concatenation AAGCAACGATAT _B Target(s) Target(s) -0.669449747 0.1798316 6.907790607 0.00666775 0.529309322 0.997690695 -3.296605862 0.05862272 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL Strong_X_mid
CTACGTGGCCCC _A_Concatenation CTCAGCGATATA _B Target(s) Target(s) -0.076016779 0.83433135 -5.130763938 0.00446269 -0.024053504 -3.315756982 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q5Y0Q4 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG Strong_X_strong
ATGACCCCTTGT _A_Concatenation Joint_ 1_B Target(s) neg_ctrl 0.813103962 0.03658197 4.939173144 0.04947198 2.605045529 0.641409749 -3.32587358 0.12609254 Human target neg_ctrl LEUTX Joint_1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK Strong-X-joint
AATCGATCCTAC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.963963532 0.49700987 -0.160156745 0.95325468 2.250737289 -3.336153261 Human target Human target PRDM10 LEUTX VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATGACAAAACGA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.313327005 0.88092648 -0.912905208 0.75669327 3.967333369 -3.345384948 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094074.1_1 hvTR_Q2HR71 HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATAACTCCACGC _A_Concatenation GGGAGTATGTCG _B pos_ctrl_A neg_ctrl 0.573628312 0.19477654 -8.044820843 2.81E-06 -1.196891664 0.997690695 -3.361903051 0.080335482 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Random_4 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS VIRF2_VP64_pos ctrl_x_random
ATAACTCCACGC _A_Concatenation TCTTGGTTTTGA _B pos_ctrl_A Target(s) 0.086234301 0.82429662 1.100267813 -0.012188578 0.998980766 -3.368062312 0.194369637 vIRF2_VP64_pos_ctrl Human target core_VP64_a ZNF597 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL VIRF2_VP64_pos strong_ctrl_x_strong
AGTTAGGCTCTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.731059336 0.55882961 -3.79691003 0.06620989 0.761174149 -3.368710217 Human target Virus_target TOX4 hvTR_Q2HR71 EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CTACGTGGCCCC _A_Concatenation EPICXV.47_B Target(s) pos_ctrl_A 1.007126726 0.01046123 3.707241978 0.13230718 -0.192815431 0.998501722 -3.370810444 0.171371711 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 3x_vIRF2_core TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation TACATGAGGCAC _B neg_ctrl Target(s) 1.192247794 0.3372476 -3.427227671 0.13890003 -3.217737725 -3.377194595 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL Random_x_mid
ATGACCCCTTGT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.762926013 0.17494383 -2.844999363 0.01939364 1.797972435 -3.379320574 Human target Virus_target LEUTX hvTR_Q2HR71 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
AGAGTGCGCTCT _A_Concatenation GGCACTGTCGAG _B neg_ctrl Target(s) -0.042917269 0.94626231 -3.015525805 0.06816441 -0.928456295 -3.384933635 neg_ctrl Virus_target Random_1 hvTR_P17147 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA Random_x_unknown
CTACGTGGCCCC _A_Concatenation CACACCCGGCAG _B Target(s) Target(s) 0.216736719 0.16187644 -2.577040087 0.06073326 0.036781643 0.998501722 -3.397764828 0.016202253 Virus_target Virus_target hvTR_Q2HR71 hvTR_P19893 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS Strong_X_strong
CCCTATGTTCTA _A_Concatenation GAGATTGTGTCC _B pos_ctrl_A Target(s) -0.314379072 0.42479369 0.650841616 0.219982754 0.998501722 -3.398785988 0.052103508 vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q2HR71 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VIRF2_VP64_pos strong_ctrl_x_strong
AAAACAAGCATT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.688905208 0.00025938 -3.360172907 0.01063608 -1.670358502 -3.404751699 Virus_target neg_ctrl cds_NC_030656.1_cds_YP_009268803.1_88 Random_1 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
AAACTTTGAGCG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.278542152 0.84320905 -1.408856941 0.45340354 -1.50689216 -3.432815574 Human target neg_ctrl ATF6B Random_1 AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AAGGGTAAATTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.17786516 0.93141616 -6.293008667 0.01357904 1.182260225 -3.436016481 Virus_target Virus_target hvTR_Q69513 hvTR_Q2HR71 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AGAGTGCGCTCT _A_Concatenation GCGAGGCAGACC _B neg_ctrl Target(s) 0.718903331 0.73105591 -2.732505785 0.27878253 3.837640285 -3.443009776 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD Random_x_unknown
CTCGCACCGAGG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.412091989 0.53947462 0.157331695 0.95625121 -0.143428538 -3.464063231 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 core_VP64_a AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
TCTTAGAGGGTG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.2876255 0.89850726 -2.415935648 0.03743675 0.630481138 0.997690695 -3.469121613 0.122424827 Human target Virus_target TRO hvTR_Q2HR71 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
GAGACAGCTCTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -3.342400535 0.09772997 1.594059168 0.47823298 1.831897655 -3.472937637 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 hvTR_Q2HR73 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
AGAGTGCGCTCT _A_Concatenation ATCTCTCAGTTC _B neg_ctrl Target(s) -0.864333359 0.4351538 -6.713042351 0.02599858 0.602283101 -3.474009871 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK Random_x_mid
CTACGTGGCCCC _A_Concatenation CATGCTAACACC _B Target(s) Target(s) -0.818417629 0.1377177 -2.30486022 0.08674401 -0.058724582 -3.474168444 Virus_target Virus_target hvTR_Q2HR71 hvTR_F5HCV3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL Strong_X_strong
TACATGAGGCAC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.692110308 0.32544126 -0.577254422 -4.584638333 -3.486914102 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 core_VP64_a PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CTCGTTATGGCA _B neg_ctrl Target(s) 0.173779365 0.83608193 -1.051383851 0.56865938 -1.376003423 0.976975145 -3.489524749 0.048359833 neg_ctrl Human target Random_1 NOVA1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF Random_x_mid
AATCGATCCTAC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.049958356 0.00126147 9.487271902 1.175916714 0.997690695 -3.509388405 0.042736939 Human target vIRF2_VP64_pos_ctrl PRDM10 core_VP64_a VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTCGTTATGGCA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -4.637421856 0.03648591 -0.39615249 0.83938517 5.368640331 -3.525668097 Human target Human target NOVA1 LEUTX AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATAACTCCACGC _A_Concatenation GTTCCACGCCTG _B pos_ctrl_A Target(s) -0.773343219 0.1062238 -4.298906826 0.00036537 -1.132292516 0.997690695 -3.535330836 0.161906702 vIRF2_VP64_pos_ctrl Human target core_VP64_a ZFX LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS VIRF2_VP64_pos strong_ctrl_x_strong
ATGACCCCTTGT _A_Concatenation TGCCAGACGGTC _B Target(s) Target(s) -0.580246927 0.17468281 -4.446449782 0.00124469 1.696276859 0.933317674 -3.547839857 0.075672673 Human target Virus_target LEUTX hvTR_P10541 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG Strong_X_strong
CTACGTGGCCCC _A_Concatenation ACCAGTATTTAA _B Target(s) Target(s) -0.512887799 0.68632574 0.236257054 0.92289832 -3.533417813 -3.550591276 Virus_target Human target hvTR_Q2HR71 ARNTL2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL Strong_X_strong
CGCCGGCATACA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.283743784 0.32731981 -1.462077707 0.50612999 5.700280002 -3.555425566 Virus_target vIRF2_VP64_pos_ctrl cds_NC_030656.1_cds_YP_009268729.1_14 core_VP64_a DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation AATCAACGAGCA _B Target(s) Target(s) 0.576792467 0.1062238 4.500577866 1.084154525 -3.564313779 Virus_target Human target hvTR_Q2HR73 C2orf81 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE Mid_X_strong
AGAGTGCGCTCT _A_Concatenation ACTTGATGGTTT _B neg_ctrl Target(s) 0.530221894 0.12515795 -2.717551591 0.02599858 1.324298725 0.997690695 -3.578364752 0.155492795 neg_ctrl Virus_target Random_1 hvTR_Q2HR73 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Random_x_mid
CATTGTTTAAAT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.835465514 0.49700987 0.339564461 0.87357617 -4.674698983 -3.581817625 Human target vIRF2_VP64_pos_ctrl POLR1E core_VP64_a AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation ACAACGTGCATA _B Target(s) Target(s) -1.244494939 0.00704318 1.178333642 0.62151298 1.77196288 0.95807515 -3.58973331 Virus_target Virus_target hvTR_Q2HR71 hvTR_P10407 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET Strong_X_strong
ATGACCCCTTGT _A_Concatenation GAGTGACCTATT _B Target(s) Target(s) -0.400364151 0.2999327 5.61748582 0.707787252 0.997690695 -3.60294566 0.217836052 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI Strong_x_unknown
ATGACCCCTTGT _A_Concatenation AATCGATCCTAC _B Target(s) Target(s) 0.232010284 0.926639 -6.645734503 0.00323548 -1.936911862 -3.604186115 Human target Human target LEUTX PRDM10 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA Strong_X_mid
ATGACCCCTTGT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.480100889 0.43839885 -1.611388088 0.33839065 1.146522549 -3.618098542 Human target Human target LEUTX LEUTX DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
GAGATTGTGTCC _A_Concatenation CCCCTTAGGAAT _B Target(s) Target(s) -1.3128433 0.47888775 -1.293041477 0.52828339 -0.544043511 -3.626863716 Virus_target Human target hvTR_Q2HR71 SYNE1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD Strong_X_mid
TCTTAGAGGGTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.637131899 0.30031597 0.241076509 0.92185818 0.407037687 0.997690695 -3.633545219 0.040091246 Human target vIRF2_VP64_pos_ctrl TRO core_VP64_a FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
TATCTATCGTGT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.689609008 0.17494383 -6.342547418 0.00384211 1.822257758 -3.655908322 Virus_target Virus_target hvTR_P03259 hvTR_Q2HR73 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
TAAGATTCAACG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.448014329 0.13204496 -0.880708036 -1.952075864 0.871241789 -3.678840321 0.080335482 Human target vIRF2_VP64_pos_ctrl ZNF292 core_VP64_a CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CACACCCGGCAG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.266530591 0.70051124 -3.990169607 0.00617968 0.050584658 0.998501722 -3.686987915 0.099717208 Virus_target neg_ctrl hvTR_P19893 Random_1 VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation CTCACGACAAGA _B pos_ctrl_A Target(s) 0.980727933 0.49700987 6.097640698 0.03872928 -1.336229445 -3.689236542 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_G3G929 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP VIRF2_VP64_pos strong_ctrl_x_strong
AGAGTGCGCTCT _A_Concatenation AATCAACGAGCA _B neg_ctrl Target(s) -0.646476721 2.35E-06 5.656919127 0.00011919 -0.426309354 0.997690695 -3.694463267 0.047067196 neg_ctrl Human target Random_1 C2orf81 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE Random_x_strong
GAGATTGTGTCC _A_Concatenation GTACATACCGAA _B Target(s) Target(s) -0.643526712 0.29409568 -1.189378627 0.50792395 1.03524989 0.997690695 -3.705909936 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE Strong_X_mid
GAGATTGTGTCC _A_Concatenation TGATGAGCGCCC _B Target(s) Target(s) -1.352781981 5.86E-06 -4.049104912 0.0015566 -0.691425996 0.997690695 -3.718711591 0.099717208 Virus_target Human target hvTR_Q2HR71 EEF1D PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL Strong_x_weak
ATAACTCCACGC _A_Concatenation TTAAATGAGGGC _B pos_ctrl_A Target(s) -0.823409028 0.42955596 -5.06348277 0.11429869 2.662552567 -3.723408855 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ vIRF2_VP64_pos_ctrl_X_mid
TTCTAATTTCCT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.457972733 0.58023965 -0.161317346 0.94116917 -0.938580974 0.997690695 -3.725932057 Human target vIRF2_VP64_pos_ctrl WDR76 core_VP64_a IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation TATCTATCGTGT _B neg_ctrl Target(s) -1.950881502 0.50140379 -5.125638167 0.10980349 0.861729516 -3.734216621 neg_ctrl Virus_target Random_1 hvTR_P03259 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS Random_x_strong
CCCCTTAGGAAT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.120319609 0.47726148 -4.614605373 0.16671372 -1.358125308 -3.740512405 Human target vIRF2_VP64_pos_ctrl SYNE1 core_VP64_a SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation AAGCAACGATAT _B Target(s) Target(s) -0.801175474 0.125695 4.55752475 0.04902037 0.628001121 0.997690695 -3.743860036 0.097385987 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL mid_X_mid
GACTGTGGGCGG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.772620133 0.00054963 -4.973040189 0.00022359 -3.114821294 -3.746590357 Human target Human target MON2 LEUTX TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
CTACGTGGCCCC _A_Concatenation CAACGCATGCCT _B Target(s) Target(s) 0.053284434 0.93141616 -3.943712612 0.00744155 0.269043499 -3.75414495 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS Strong_x_weak
TTTAACGGATTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.54843765 0.82067579 -3.428579929 5.889814059 -3.757053539 Human target Human target TWIST2 LEUTX SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
GAGATTGTGTCC _A_Concatenation GCAGATTCGAAT _B Target(s) Target(s) -0.220587267 0.7246252 -5.562168437 0.00045446 1.558851595 -3.77615265 Virus_target Human target hvTR_Q2HR71 AK9 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP Strong_X_mid
CGATGGCGGAAT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -2.138343323 0.31988807 -4.262742693 0.08285043 0.050015017 0.998501722 -3.777927404 Human target neg_ctrl KIAA1586 Random_1 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
ATAACTCCACGC _A_Concatenation CTGGTAGTTGTT _B pos_ctrl_A Target(s) -0.427407666 0.37756408 -2.719237711 0.07789462 0.341208374 0.997690695 -3.778427356 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_F6KS47 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT VIRF2_VP64_pos ctrl_x_unknown
CTCGCACCGAGG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.16944363 0.60940534 -5.358788402 0.12197421 0.427880614 -3.783271352 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 hvTR_Q2HR71 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATAACTCCACGC _A_Concatenation TATCTATCGTGT _B pos_ctrl_A Target(s) 1.58312524 5.60E-11 -4.376514787 0.00989082 -0.224946785 0.998501722 -3.796688035 0.099589021 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_P03259 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS VIRF2_VP64_pos strong_ctrl_x_strong
ACTTGATGGTTT _A_Concatenation ACTTTAGTAACA _B Target(s) Target(s) 0.996242005 0.03520409 1.444173994 0.48796824 -1.70599625 0.968300879 -3.797034319 Virus_target Virus_target hvTR_Q2HR73 hvTR_Q69113 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES Mid_X_strong
CCCGTAGGGGCT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.905319337 0.36741802 -1.692605498 0.56400389 -5.357696162 -3.804657816 Human target neg_ctrl TET3 Random_1 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ACTTGATGGTTT _A_Concatenation ACGACTCACCGC _B Target(s) Target(s) -0.800443739 0.31287443 -5.202049668 0.00062712 0.734507729 -3.810916499 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK mid_X_mid
CTACGTGGCCCC _A_Concatenation CATCTCACCTAG _B Target(s) Target(s) -0.179641186 0.67964161 -2.155426746 0.11790489 -0.302240454 0.998501722 -3.819152178 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03206 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA Strong_X_mid
ATGACCCCTTGT _A_Concatenation TCTTGGTTTTGA _B Target(s) Target(s) 0.620289785 0.13290849 -3.927459134 0.01736975 3.689092628 0.527905197 -3.824635546 Human target Human target LEUTX ZNF597 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL Strong_X_strong
TGGCCCCCAGTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.015673306 0.99110789 0.320553487 0.91072633 3.613227319 -3.828897252 Virus_target neg_ctrl hvTR_Q3I8P6 Random_1 GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
GGCACAGCTCCA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.202206027 0.00018332 0.09390313 0.95756877 -0.380359541 0.997690695 -3.841579357 0.097929575 Human target Virus_target DTX2 hvTR_Q2HR71 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
AATGGCACTAGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.906448177 1.18E-08 5.533529207 0.00015784 -0.315447609 0.997690695 -3.858380444 0.018312917 Virus_target neg_ctrl cds_NC_024711.1_cds_YP_009052522.2_44 Random_1 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
ATAACTCCACGC _A_Concatenation GAGACAGCTCTC _B pos_ctrl_A Target(s) -0.481687015 0.4982603 -3.650428503 0.02198039 5.993644316 -3.867124733 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM vIRF2_VP64_pos_ctrl_X_mid
AAAACAAGCATT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.798498462 7.13787243 0.00081729 -3.659571355 -3.87502489 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268803.1_88 hvTR_Q2HR73 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
TGGCCCCCAGTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.106423079 0.93202829 -5.530031353 0.00062354 -0.602435632 -3.879971756 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q3I8P6 core_VP64_a GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation GAGATTGTGTCC _B neg_ctrl Target(s) -0.674097894 0.19037284 2.79170925 0.16081235 0.23263875 0.998501722 -3.902234996 0.074608262 neg_ctrl Virus_target Random_1 hvTR_Q2HR71 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Random_x_strong
CTACGTGGCCCC _A_Concatenation AAGGGTAAATTG _B Target(s) Target(s) 1.076529483 0.32392216 -0.899522646 0.56137619 2.253647148 -3.91002615 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q69513 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS Strong_X_mid
ACTTGATGGTTT _A_Concatenation GGCACTGTCGAG _B Target(s) Target(s) -0.591566378 0.17962689 0.926423412 0.62046092 1.873513353 0.87819447 -3.915044125 0.044551217 Virus_target Virus_target hvTR_Q2HR73 hvTR_P17147 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA Mid_X_unknown
ATGACCCCTTGT _A_Concatenation TATCTATCGTGT _B Target(s) Target(s) 1.067172311 0.26797115 -4.353100344 -2.531223211 -3.918412794 Human target Virus_target LEUTX hvTR_P03259 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS Strong_X_strong
ATAACTCCACGC _A_Concatenation GGAGGCACATCG _B pos_ctrl_A Target(s) -0.805834731 0.02784752 -4.110977358 0.00325151 -0.53407454 0.997690695 -3.920799938 0.073394906 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q87042 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AGAGTGCGCTCT _B pos_ctrl_A neg_ctrl -0.714238038 0.5677544 -3.93081177 0.03336571 -2.850743032 0.61634714 -3.943523579 0.098005281 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Random_1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VIRF2_VP64_pos ctrl_x_random
GAGATTGTGTCC _A_Concatenation AGCCAGGTCACC _B Target(s) Target(s) -0.49631591 0.80691093 -6.008724151 -5.707374122 -3.944028045 Virus_target Human target hvTR_Q2HR71 MESP1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK Strong_X_strong
AGAGTGCGCTCT _A_Concatenation AGCCATAAACCT _B neg_ctrl Target(s) 0.4608457 0.34346149 5.284951234 0.00476047 -2.230991344 0.871241789 -3.945381817 0.118786226 neg_ctrl Virus_target Random_1 hvTR_P03209 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ Random_x_mid
AGAGTGCGCTCT _A_Concatenation TGGCCCCCAGTT _B neg_ctrl Target(s) -0.301685533 0.80366344 -8.984482903 6.86E-05 1.287036266 0.997690695 -3.964122716 0.096401552 neg_ctrl Virus_target Random_1 hvTR_Q3I8P6 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN Random_x_weak
Joint_1_A/u splice_ ATAACTCCACGC _b neg_ctrl pos_ctrl_A 1.582635864 0.00060993 -5.966827963 0.00304453 1.035310647 0.997690695 -3.972185084 0.086227003 neg_ctrl vIRF2_VP64_pos_ctrl Joint_1 core_VP64_a GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Linker_X_vIRF2\u VP64_pos_ctrl
ATAACTCCACGC _A_Concatenation TGATGAGCGCCC _B pos_ctrl_A Target(s) -0.6126788 0.04988573 1.070975148 -0.38193355 0.997690695 -3.98390621 0.070764812 vIRF2_VP64_pos_ctrl Human target core_VP64_a EEF1D LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL VIRF2_VP64_pos ctrl_x_weak
ATAACTCCACGC _A_Concatenation TTTAACGGATTG _B pos_ctrl_A Target(s) 1.115675665 0.02103552 -6.186166974 0.00041064 0.703026163 0.997690695 -4.00151587 0.084897118 vIRF2_VP64_pos_ctrl Human target core_VP64_a TWIST2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN vIRF2_VP64_pos_ctrl_X_mid
ATGACCCCTTGT _A_Concatenation TTTAACGGATTG _B Target(s) Target(s) -0.176088537 0.80366344 -4.93455098 0.02336695 -1.320481332 -4.004270859 Human target Human target LEUTX TWIST2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN Strong_X_mid
ACTTGATGGTTT _A_Concatenation AGCCAGGTCACC _B Target(s) Target(s) 0.338832768 0.32167475 3.470136275 0.08834294 0.587589748 0.997690695 -4.010689266 0.123348654 Virus_target Human target hvTR_Q2HR73 MESP1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK Mid_X_strong
GAGATTGTGTCC _A_Concatenation TACATGAGGCAC _B Target(s) Target(s) 0.492050281 0.43798565 -7.407708202 0.00015636 0.942806653 0.997690695 -4.034469668 0.079863392 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL Strong_X_mid
GGTGTCTCGATT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.590327615 0.25403675 -4.558827492 0.10327598 5.849586965 -4.041024196 Human target Human target E2F5 LEUTX LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
CTCAGCGATATA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.62827002 0.10744393 2.083278556 0.3594206 0.668893632 0.997690695 -4.050059979 0.025922119 Virus_target Virus_target hvTR_Q5Y0Q4 hvTR_Q2HR71 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATGACCCCTTGT _A_Concatenation CACACCCGGCAG _B Target(s) Target(s) -0.285192534 0.26523132 1.935991004 -0.139665739 0.998501722 -4.052188304 0.017149591 Human target Virus_target LEUTX hvTR_P19893 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS Strong_X_strong
ACCAGTATTTAA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.275105314 0.92466263 -1.402860244 0.59652223 -3.893101576 -4.052936796 Human target vIRF2_VP64_pos_ctrl ARNTL2 core_VP64_a NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation ATTGTATCTAAC _B Target(s) Target(s) 0.160830338 0.93141616 -3.808096085 0.10327598 1.623182714 -4.053970079 Virus_target Human target hvTR_Q2HR71 SMARCB1 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD Strong_X_mid
ATAACTCCACGC _A_Concatenation TACTTCTAGCGG _B pos_ctrl_A Target(s) -0.056309314 0.95906455 -2.000026211 0.44249238 -0.430709651 -4.059116306 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NC_025350.1_cds_YP_009094080.1_7 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ VIRF2_VP64_pos ctrl_x_weak
CCCGTAGGGGCT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.2796052 0.00086021 -5.792539794 0.00151048 -2.692799123 -4.066418712 Human target Virus_target TET3 hvTR_Q2HR71 APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
ATGACCCCTTGT _A_Concatenation TCTGTACCAACG _B Target(s) Target(s) 0.188482346 0.89748071 -2.302223788 0.07868483 0.484522639 -4.074166357 Human target Virus_target LEUTX hvTR_Q6XA69 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY Strong_x_unknown
CTACGTGGCCCC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.24198567 0.60940534 -4.202328901 0.01582874 -0.160284125 -4.098115076 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 core_VP64_a TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
GATGGAGCTACA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.201843432 0.93141616 5.537111208 -0.265451173 -4.100241184 Virus_target Human target hvTR_P03209 LEUTX EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ATGACCCCTTGT _A_Concatenation GCAGATTCGAAT _B Target(s) Target(s) -0.734452978 0.36774226 6.313016762 -0.776312246 -4.109608132 Human target Human target LEUTX AK9 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP Strong_X_mid
AGAGTGCGCTCT _A_Concatenation TCTGTACCAACG _B neg_ctrl Target(s) 0.073847045 0.88448623 2.299814527 0.16351831 0.395707328 0.997690695 -4.122965374 0.00459902 neg_ctrl Virus_target Random_1 hvTR_Q6XA69 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY Random_x_unknown
CTACGTGGCCCC _A_Concatenation GGGAGTATGTCG _B Target(s) neg_ctrl -2.792199679 0.0009267 -5.139429832 0.00108214 0.620735162 -4.130764343 Virus_target neg_ctrl hvTR_Q2HR71 Random_4 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS Strong_X_random
GAGATTGTGTCC _A_Concatenation CTGCCGGTTGCG _B Target(s) Target(s) -0.280741111 0.32544126 -0.833291201 0.6234516 0.543504842 0.997690695 -4.131180594 0.010952256 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL Strong_X_mid
ATGACCCCTTGT _A_Concatenation CAACGCATGCCT _B Target(s) Target(s) -1.51172316 0.34674225 -1.004390453 0.68250789 6.252443388 -4.140371676 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS Strong_x_weak
AGTTAGGCTCTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.040760281 0.5677544 -2.063096035 0.34736481 4.532116611 -4.155895166 Human target Human target TOX4 LEUTX EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGAGTGCGCTCT _A_Concatenation CCCTGCCTCGGG _B neg_ctrl Target(s) 0.18186396 0.78182953 5.080091682 0.04336342 -0.108493246 0.998501722 -4.162954253 0.012582029 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY Random_x_mid
AATGGCACTAGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.845733921 0.70546103 0.637125934 0.75004294 -5.928841198 -4.176108208 Virus_target Virus_target cds_NC_024711.1_cds_YP_009052522.2_44 hvTR_Q2HR71 EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
CTACGTGGCCCC _A_Concatenation CTCACGACAAGA _B Target(s) Target(s) 0.544075071 0.49700987 -5.74881905 0.00370107 0.114722233 -4.204987887 Virus_target Virus_target hvTR_Q2HR71 hvTR_G3G929 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP Strong_X_strong
GAGATTGTGTCC _A_Concatenation CTCCATTAATGA _B Target(s) Target(s) -0.912131903 0.26992692 -6.499557261 8.78E-05 2.571038725 -4.213550419 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q5IXR3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV Strong_x_weak
TATCTATCGTGT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.467924227 0.56644316 -5.719936407 0.01229306 0.600347348 -4.220508632 Virus_target Virus_target hvTR_P03259 hvTR_Q2HR71 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GGCCTTTCGGGC _A_Concatenation AGAGTGCGCTCT _B neg_ctrl neg_ctrl -0.267805507 0.57362051 2.087424848 0.33875688 0.106428205 0.998501722 -4.223374225 0.054917192 neg_ctrl neg_ctrl Random_3 Random_1 GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Random_x_random
AAAACAAGCATT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.941606828 0.07087739 4.250601266 0.02389526 -0.249476299 -4.225114733 Virus_target Virus_target cds_NC_030656.1_cds_YP_009268803.1_88 hvTR_Q2HR71 STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
ATAACTCCACGC _A_Concatenation TCTATCACAACT _B pos_ctrl_A Target(s) -1.568674958 0.73268364 -4.212893889 0.26956633 1.567091002 -4.236926482 vIRF2_VP64_pos_ctrl Human target core_VP64_a TOX4 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH vIRF2_VP64_pos_ctrl_X_mid
AATCGATCCTAC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.489945159 0.19343964 0.202873674 0.92643324 -0.171853256 -4.238368242 Human target Virus_target PRDM10 hvTR_Q2HR71 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
TACTTCTAGCGG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.771044327 0.00589775 1.802669461 0.40271471 0.227166966 0.998501722 -4.242732501 0.006225935 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094080.1_7 hvTR_Q2HR71 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
ACTTGATGGTTT _A_Concatenation ACAACGTGCATA _B Target(s) Target(s) -0.692514716 0.04576547 -0.864133442 0.64850969 0.182135139 0.998501722 -4.244068346 0.018758746 Virus_target Virus_target hvTR_Q2HR73 hvTR_P10407 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET Mid_X_strong
AATCGATCCTAC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.179529056 0.1062238 -1.690226772 0.54837089 2.407313806 -4.248810284 Human target neg_ctrl PRDM10 Random_1 VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TACATGAGGCAC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.110188313 4.25E-05 2.281086245 0.28160529 1.088317446 0.997690695 -4.303372063 0.065462272 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 LEUTX PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CATCTCACCTAG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.197025042 0.52243449 -2.182788257 0.07235545 0.660085508 0.997690695 -4.304705587 0.017149591 Virus_target Virus_target hvTR_P03206 hvTR_Q2HR71 ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CCAGACGGTCTG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.171199216 0.93141616 -1.059009138 0.73516079 1.06733435 -4.307503416 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 LEUTX PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
CTACGTGGCCCC _A_Concatenation TATCGCAAGAAC _B Target(s) neg_ctrl -0.543683368 0.45350915 -6.560404894 1.86E-05 -0.13555782 0.998501722 -4.311285817 0.057793972 Virus_target neg_ctrl hvTR_Q2HR71 Random_5 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV Strong_X_random
AGAGTGCGCTCT _A_Concatenation AATCGATCCTAC _B neg_ctrl Target(s) -0.064062926 0.94601266 -6.7713786 0.03611553 -0.111814107 -4.314207934 neg_ctrl Human target Random_1 PRDM10 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA Random_x_mid
GATGGAGCTACA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.03097096 0.2685694 4.900245925 2.074265421 0.856321281 -4.315742445 0.041678294 Virus_target vIRF2_VP64_pos_ctrl hvTR_P03209 core_VP64_a EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
GTCACCCTCCTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.466800633 0.24013675 -6.020931608 9.74E-06 -0.206537113 0.998501722 -4.320134993 0.023467952 Human target vIRF2_VP64_pos_ctrl E2F5 core_VP64_a DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
TAAGATTCAACG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.048402176 0.95906455 -5.686562047 0.00025562 1.000422586 0.997690695 -4.32626098 0.020523923 Human target neg_ctrl ZNF292 Random_1 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CAACGCATGCCT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.628288976 0.70817197 4.47606778 6.127730728 -4.335994205 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 hvTR_Q2HR73 YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
AGCCATAAACCT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.257278817 0.68024065 0.09908855 0.96438971 -0.359758648 0.997690695 -4.358553073 0.050540158 Virus_target Virus_target hvTR_P03209 hvTR_Q2HR73 PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GCGAGGCAGACC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.360275424 0.32392216 -6.439546691 0.00062712 2.324548791 -4.358781485 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 LEUTX QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
GGTGTCTCGATT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.624188523 0.21779974 -4.069767293 0.00560113 0.716245491 0.997690695 -4.374873417 0.006225935 Human target vIRF2_VP64_pos_ctrl E2F5 core_VP64_a LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
TATCTATCGTGT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.730309581 0.67511461 -6.044735209 0.0180249 0.020872066 -4.382773153 Virus_target Human target hvTR_P03259 LEUTX RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
ACTTGATGGTTT _A_Concatenation ATCTCTCAGTTC _B Target(s) Target(s) 0.354565573 0.83094496 -6.292758405 9.06E-05 1.655966316 0.976975145 -4.401995538 0.07455661 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK mid_X_mid
ACTTGATGGTTT _A_Concatenation GCAGATTCGAAT _B Target(s) Target(s) -0.503625638 0.03648591 2.403574915 0.22915921 0.224431312 0.998501722 -4.421684072 0.006225935 Virus_target Human target hvTR_Q2HR73 AK9 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP mid_X_mid
TCTGTACCAACG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.117258881 0.88561073 7.07722467 0.00648081 6.529618253 -4.431413231 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q6XA69 core_VP64_a ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
ACTTGATGGTTT _A_Concatenation CCCTGCCTCGGG _B Target(s) Target(s) -0.790397321 0.3372476 -3.686964909 0.00950075 3.106307851 -4.433438099 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY mid_X_mid
AAGACCGGTGCC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.515793835 0.42503544 0.073904941 0.96782217 1.167830491 -4.436015135 Virus_target Human target hvTR_P08392 LEUTX RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGAGTGCGCTCT _A_Concatenation AGTTGCTTTGCT _B neg_ctrl neg_ctrl -0.730028947 0.48317945 -4.093175689 0.00471042 -1.094140684 -4.439776185 neg_ctrl neg_ctrl Random_1 Random_2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM Random_x_random
AGAGTGCGCTCT _A_Concatenation AGTTAGGCTCTT _B neg_ctrl Target(s) -0.439136018 0.09988704 2.220966371 0.33910718 1.207243507 0.997690695 -4.441176819 0.009395708 neg_ctrl Human target Random_1 TOX4 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV Random_x_mid
ATTGTATCTAAC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 0.127305933 0.91324404 -2.72751786 0.10406346 2.947022023 -4.443082246 Human target neg_ctrl SMARCB1 Random_1 VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TAAAATTTATCA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.868226949 6.68E-05 -2.768624097 0.06080289 -0.011067718 -4.452910853 Human target Human target HSF1 LEUTX LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATTTGTAGACCG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.433474412 0.49285741 2.718416294 0.149883058 -4.457452304 Human target Virus_target MYCN hvTR_Q2HR71 PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATGTGGCACGAC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.098235848 0.93545833 -2.111269769 0.22915921 6.568806835 -4.472439525 Human target Virus_target ANKRD17 hvTR_Q2HR71 QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
GAGATTGTGTCC _A_Concatenation CTGGTAGTTGTT _B Target(s) Target(s) 0.320980764 0.71346073 -5.050426907 0.00058527 -1.542496583 0.987041604 -4.479704305 0.059573762 Virus_target Virus_target hvTR_Q2HR71 hvTR_F6KS47 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT Strong_x_unknown
AAGCAACGATAT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.88346018 0.04090853 -5.792604137 0.0030859 -4.026688107 -4.480472219 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 hvTR_Q2HR71 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
TTCTAATTTCCT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.540273244 0.78182953 -3.985362564 0.2155389 6.579502443 -4.483631681 Human target Virus_target WDR76 hvTR_Q2HR73 IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ACTTGATGGTTT _A_Concatenation TATTAAAAGTGC _B Target(s) Target(s) 0.139393599 0.938493 2.006899938 0.24589402 3.957487218 -4.484349324 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_202 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI Mid_X_weak
ATAACTCCACGC _A_Concatenation AGGCTTCCCGCT _B pos_ctrl_A Target(s) 2.040915165 0.00211261 -6.668501501 5.91E-05 2.382234226 -4.484998981 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV vIRF2_VP64_pos_ctrl_X_mid
ATAACTCCACGC _A_Concatenation GATCGCATGTTC _B pos_ctrl_A Target(s) 0.082978051 0.9335834 -6.376605882 5.67E-05 -1.02920312 0.997690695 -4.526994085 0.049730741 vIRF2_VP64_pos_ctrl Human target core_VP64_a INPP4A LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER VIRF2_VP64_pos strong_ctrl_x_strong
TCTTGGTTTTGA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.451300431 0.55517332 -4.689001206 0.02606586 2.076550966 -4.52740439 Human target vIRF2_VP64_pos_ctrl ZNF597 core_VP64_a ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CTCAGCGATATA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.624795577 0.48649146 -4.440579854 0.0042665 1.530626222 0.976975145 -4.544138798 0.031425518 Virus_target Virus_target hvTR_Q5Y0Q4 hvTR_Q2HR73 FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATAACTCCACGC _A_Concatenation CATTGTTTAAAT _B pos_ctrl_A Target(s) -0.004777326 0.99110789 4.326682043 0.13219999 2.290049946 -4.5449934 vIRF2_VP64_pos_ctrl Human target core_VP64_a POLR1E LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV VIRF2_VP64_pos ctrl_x_unknown
GCAACGAGGGTC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.023332314 3.78E-11 -0.850362862 0.60710891 0.435178525 0.997690695 -4.56764221 0.003310148 Human target Human target QRICH2 LEUTX RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
TCTTAGAGGGTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.149560475 0.90029575 4.520447417 -1.029284335 -4.577139067 Human target Virus_target TRO hvTR_Q2HR73 FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ATAACTCCACGC _A_Concatenation ACTCGTGTTGTC _B pos_ctrl_A Target(s) 0.296043249 0.72790325 -1.761697233 0.49541295 2.508671595 -4.610891336 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q9QB97 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT vIRF2_VP64_pos_ctrl_X_mid
ACTTGATGGTTT _A_Concatenation ATTGTATCTAAC _B Target(s) Target(s) -0.018695922 0.95135448 -2.643588952 0.08137967 0.026529149 0.998980766 -4.611490935 0.006225935 Virus_target Human target hvTR_Q2HR73 SMARCB1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD mid_X_mid
GAGATTGTGTCC _A_Concatenation Joint_ 2_B Target(s) neg_ctrl -0.197893967 0.92359363 1.453565367 0.52277462 3.107333606 -4.634189797 Virus_target neg_ctrl hvTR_Q2HR71 Joint_2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA Strong-X-joint
ACAACGTGCATA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.060447182 0.93881926 -4.573919945 0.00909169 -4.900126016 -4.664300495 Virus_target vIRF2_VP64_pos_ctrl hvTR_P10407 core_VP64_a RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AAGACCGGTGCC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.591159752 0.001233 5.218320955 0.00297318 -0.574921152 0.997690695 -4.67892565 0.044862171 Virus_target Virus_target hvTR_P08392 hvTR_Q2HR71 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ACCAGTATTTAA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.132991384 0.93141616 -4.958182575 0.00228119 1.487683456 0.995270541 -4.682025328 0.054917192 Human target Virus_target ARNTL2 hvTR_Q2HR71 NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CATTGTTTAAAT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.042359123 0.96035322 4.074295196 0.04515966 1.567252447 0.983636094 -4.695642793 Human target Virus_target POLR1E hvTR_Q2HR73 AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Unknown_X_mid
GAGATTGTGTCC _A_Concatenation AGTTAGGCTCTT _B Target(s) Target(s) 0.936025174 0.00341741 -3.936354906 0.00579804 0.671114606 -4.696133418 Virus_target Human target hvTR_Q2HR71 TOX4 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV Strong_X_mid
AGAGTGCGCTCT _A_Concatenation GATCGCATGTTC _B neg_ctrl Target(s) -0.015836742 0.9854934 -6.107048064 0.0038283 -0.116480357 0.998501722 -4.696930001 0.035570865 neg_ctrl Human target Random_1 INPP4A CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER Random_x_strong
GAGATTGTGTCC _A_Concatenation ACCAGTATTTAA _B Target(s) Target(s) 0.617764052 0.51346664 -3.734336377 0.03818366 0.048940146 -4.701853986 Virus_target Human target hvTR_Q2HR71 ARNTL2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL Strong_X_strong
Stop 2_A splice GAGATTGTGTCC B neg_ctrl Target(s) -1.006526263 0.06075194 -1.855058029 0.25363982 0.240419385 -4.709227495 neg_ctrl Virus_target Stop_2 hvTR_Q2HR71 ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Termination_X_strong
ATGACCCCTTGT _A_Concatenation AGTTAGGCTCTT _B Target(s) Target(s) 3.009551364 1.30E-05 0.669192288 0.76446307 3.013928466 -4.724606407 Human target Human target LEUTX TOX4 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV Strong_X_mid
GTTCCACGCCTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.17213267 0.38922543 0.491500432 0.82062053 4.134002823 0.412917083 -4.731036848 Human target Virus_target ZFX hvTR_Q2HR71 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
TACATGAGGCAC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.075683209 0.93141616 -3.359850802 0.05968663 3.581867401 -4.733546803 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 hvTR_Q2HR71 PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CATTGTTTAAAT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.61670292 0.02838239 5.120394901 0.00542747 -0.065469769 0.998501722 -4.750851108 0.007269653 Human target Human target POLR1E LEUTX AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Unknown_X_strong
ATAACTCCACGC _A_Concatenation CTGCCGGTTGCG _B pos_ctrl_A Target(s) -0.974189351 0.00589775 -0.846696447 0.62805366 0.589387686 0.997690695 -4.752187921 0.002219785 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL vIRF2_VP64_pos_ctrl_X_mid
CATTAGTCCCGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.056339111 0.93141616 -3.614817111 0.00212573 -0.050134546 0.998501722 -4.753277601 0.035451075 Virus_target Virus_target hvTR_P06930 hvTR_Q2HR73 AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
ACGACTCACCGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.193313147 0.88133884 -5.590261485 0.0957563 2.226551152 -4.768538394 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 hvTR_Q2HR71 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
AGGCTTCCCGCT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -1.246599141 0.04064654 -5.850706836 0.02716395 1.051683032 -4.773700255 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 hvTR_Q2HR73 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TATCTATCGTGT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.24186904 0.3372476 -5.104786204 0.03676994 6.879812674 -4.797054281 Virus_target neg_ctrl hvTR_P03259 Random_1 RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
ATAACTCCACGC _A_Concatenation Joint_ 2_B pos_ctrl_A neg_ctrl 0.423245033 0.82196175 -6.925253057 0.00881075 6.908812568 -4.827240333 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Joint_2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA VIRF2_VP64_pos ctrl X joint
GAGATTGTGTCC _A_Concatenation TATTAAAAGTGC _B Target(s) Target(s) 1.323637823 1.86E-05 5.919891448 1.10E-14 0.486571997 0.997690695 -4.841618153 0.049877065 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_202 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI Strong_x_weak
ACTTGATGGTTT _A_Concatenation CCCCTTAGGAAT _B Target(s) Target(s) -0.074659428 0.93141616 -4.597888157 0.00062712 0.045908153 0.998501722 -4.858585044 0.031298274 Virus_target Human target hvTR_Q2HR73 SYNE1 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD mid_X_mid
CTACGTGGCCCC _A_Concatenation CCCGTAGGGGCT _B Target(s) Target(s) -0.818709091 0.12732726 -6.521214944 0.00113924 0.586946859 -4.901511591 Virus_target Human target hvTR_Q2HR71 TET3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS Strong_X_strong
CTCGCACCGAGG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.596136237 0.00048803 -5.22495664 0.00156378 1.86474922 0.95807515 -4.952515892 0.050540158 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 Random_1 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
GGTGTCTCGATT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.71708475 0.37850503 -5.956180795 0.00109079 -0.285349999 -4.959200203 Human target Virus_target E2F5 hvTR_Q2HR71 LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Strong_X_strong
CTCAGCGATATA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.063821297 0.89491762 -5.738623465 3.70E-05 0.475790468 0.997690695 -4.96663111 0.001244361 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q5Y0Q4 core_VP64_a FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
EPICXV.47-A\u splice_ ACTTGATGGTTT _b pos_ctrl_A Target(s) -0.402908865 0.76232875 -2.710973367 0.20622537 3.259352174 -4.987920795 vIRF2_VP64_pos_ctrl Virus_target 3x_vIRF2_core hvTR_Q2HR73 GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR vIRF2_VP64_pos_ctrl_X_mid
CTACGTGGCCCC _A_Concatenation CCCTATGTTCTA _B Target(s) pos_ctrl_A 0.485138676 0.29695525 1.93677332 0.36067721 1.360890619 0.997690695 -4.994725112 0.044797652 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 core_VP64_b TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation TCAGCCGTAGGC _B neg_ctrl Target(s) 0.712239358 0.09988704 1.994360846 0.61718307 0.997690695 -5.001859257 0.044930268 neg_ctrl Virus_target Random_1 hvTR_Q805Y1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR Random_x_mid
ACGGCGTAAGTA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.440957901 0.75770773 -5.267406288 0.00062712 1.151328375 0.997690695 -5.057401522 0.049877065 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094078.1_5 hvTR_Q2HR73 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TCAGCCGTAGGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 0.391192155 0.69733379 -3.579724517 0.06553086 0.547437872 -5.058243009 Virus_target Virus_target hvTR_Q805Y1 hvTR_Q2HR71 RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CATCTCACCTAG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.351939043 0.52901596 0.222124096 0.91035624 0.062090878 -5.09443553 Virus_target Human target hvTR_P03206 LEUTX ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
ATTTGTAGACCG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.656975642 0.4482293 -3.271410603 0.01939364 3.06121137 0.665412691 -5.101365259 0.086227003 Human target vIRF2_VP64_pos_ctrl MYCN core_VP64_a PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation CCCTATGTTCTA _B Target(s) pos_ctrl_A 0.840486421 0.34471177 -2.192609225 0.31960259 -3.620552389 -5.103522048 Human target vIRF2_VP64_pos_ctrl LEUTX core_VP64_b DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TCTTGGTTTTGA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.76021617 0.37525592 -4.690895046 0.04158147 0.193477589 -5.123854273 Human target Virus_target ZNF597 hvTR_Q2HR71 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GGCACAGCTCCA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.382298774 0.19431573 -5.846903452 0.00989082 1.741784317 -5.161491347 Human target neg_ctrl DTX2 Random_1 TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Unknown_X_random
TAAGATTCAACG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.568125793 0.36878074 -2.892171441 0.08285043 2.427193341 -5.173101919 Human target Virus_target ZNF292 hvTR_Q2HR73 CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTACGTGGCCCC _A_Concatenation CCCTGCCTCGGG _B Target(s) Target(s) -1.487579562 0.07185248 -3.197615954 0.05166799 -3.398202267 -5.180846526 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY Strong_X_mid
TTTAACGGATTG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.018838252 0.570298 -3.127128404 -0.965608977 -5.195299598 Human target Virus_target TWIST2 hvTR_Q2HR71 SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GAGACAGCTCTC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.887538375 0.3372476 4.745789853 -0.916888787 -5.205950127 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 Random_1 LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ACTTGATGGTTT _A_Concatenation AAAACAAGCATT _B Target(s) Target(s) 1.191964895 0.02320276 -1.060619257 1.306681057 -5.223605812 Virus_target Virus_target hvTR_Q2HR73 cds_NC_030656.1_cds_YP_009268803.1_88 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS Mid_X_weak
ACTTGATGGTTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.131125839 0.94239857 -5.647380666 0.00791621 1.443241929 -5.239741979 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR73 core_VP64_a RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation CAACGCATGCCT _B Target(s) Target(s) -0.512214239 0.71346073 -1.803269258 0.48762892 3.836164752 -5.277961857 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS Strong_x_weak
CTCGTTATGGCA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -0.071037944 0.9335834 -0.826197308 0.56137619 2.699232299 -5.30571031 Human target Virus_target NOVA1 hvTR_Q2HR71 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AGAGTGCGCTCT _A_Concatenation ATAACTCCACGC _B neg_ctrl pos_ctrl_A 0.330164685 0.81666774 -4.056944834 0.03478224 0.572663702 -5.363605979 neg_ctrl vIRF2_VP64_pos_ctrl Random_1 core_VP64_a CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Random_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation AAGCAACGATAT _B neg_ctrl Target(s) -2.175492411 0.44779842 -0.958233703 0.70352371 7.455397967 -5.393917529 neg_ctrl Virus_target Random_1 cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL Random_x_mid
ATGACCCCTTGT _A_Concatenation CCCCTTAGGAAT _B Target(s) Target(s) -0.556667594 0.49783248 -2.321875682 0.16019145 -2.928002943 -5.455261016 Human target Human target LEUTX SYNE1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD Strong_X_mid
CTGGTAGTTGTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.083074874 0.89748071 -1.23687068 0.50023889 0.358149991 0.998501722 -5.491915626 0.026617437 Virus_target Virus_target hvTR_F6KS47 hvTR_Q2HR71 SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
CCGCCCTTATGT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.060129109 0.24458972 3.402400654 1.879885692 -5.50025502 Human target Virus_target ZFX hvTR_Q2HR71 VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ACTTGATGGTTT _A_Concatenation GAGACAGCTCTC _B Target(s) Target(s) -0.209734714 0.75467612 -0.270868577 0.89582846 4.581660478 -5.507927576 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM mid_X_mid
GAGATTGTGTCC _A_Concatenation GGCACTGTCGAG _B Target(s) Target(s) -2.12334167 3.21E-05 -4.702557832 0.01719167 -1.602540214 -5.599821069 Virus_target Virus_target hvTR_Q2HR71 hvTR_P17147 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA Strong_x_unknown
GGGAGTATGTCG _A_Concatenation ATAACTCCACGC _B neg_ctrl pos_ctrl_A -2.208814368 0.16187644 -3.776665452 0.14747479 2.960548103 -5.616940031 neg_ctrl vIRF2_VP64_pos_ctrl Random_4 core_VP64_a LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Random_X_vIRF2\u VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation TACTTCTAGCGG _B Target(s) Target(s) -0.141022638 0.92647166 -7.160911356 0.00057613 -0.125933095 -5.625285367 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094080.1_7 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ Strong_x_weak
AGAGTGCGCTCT _A_Concatenation AAACTTTGAGCG _B neg_ctrl Target(s) -0.340808887 0.62528738 -4.895000121 0.0363895 0.546714267 0.997690695 -5.656190678 0.023621511 neg_ctrl Human target Random_1 ATF6B CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ Random_x_strong
AGAGTGCGCTCT _A_Concatenation Stop_1_B neg_ctrl neg_ctrl -1.224087981 0.48857008 -0.753723052 0.73513388 7.712048094 -5.658725253 neg_ctrl neg_ctrl Random_1 Stop_1 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK Random_x_termination
CTCCATTAATGA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 5.404503701 0.00589775 -5.877830928 0.02663924 1.119790989 -5.681347684 Virus_target neg_ctrl hvTR_Q5IXR3 Random_1 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
AGCCATAAACCT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.735122894 0.22884458 2.995358338 0.24589402 0.344876681 -5.68630643 Virus_target vIRF2_VP64_pos_ctrl hvTR_P03209 core_VP64_a PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CTACGTGGCCCC _A_Concatenation GGCACTGTCGAG _B Target(s) Target(s) -0.308396943 0.47815206 -6.137944266 0.00109019 -1.156379204 -5.727758564 Virus_target Virus_target hvTR_Q2HR71 hvTR_P17147 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA Strong_x_unknown
AGAGTGCGCTCT _A_Concatenation GATGGAGCTACA _B neg_ctrl Target(s) -1.33621996 0.02664091 -2.391722128 0.18668384 1.152177441 0.997690695 -5.782618445 0.015867719 neg_ctrl Virus_target Random_1 hvTR_P03209 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF Random_x_strong
TACTTCTAGCGG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.189787144 0.04628107 5.829358188 0.03155841 1.545703361 -5.831396042 Virus_target neg_ctrl cds_NC_025350.1_cds_YP_009094080.1_7 Random_1 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
GTTCCACGCCTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.769981604 0.86342423 -2.886813019 0.18770774 1.994466765 -5.840797432 Human target neg_ctrl ZFX Random_1 VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AATCGAGACCAG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -2.347860519 1.46E-14 -5.215253713 0.00075558 2.168129085 0.913394463 -5.842093368 0.038629478 Virus_target Virus_target hvTR_Q8BB47 hvTR_Q2HR73 EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Strong_X_mid
CTCGTTATGGCA _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.494353708 0.62724977 -3.236670463 0.0084926 1.91307286 -5.871878995 Human target Virus_target NOVA1 hvTR_Q2HR73 AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CTACGTGGCCCC _A_Concatenation CTCCATTAATGA _B Target(s) Target(s) 1.179654668 0.17168108 4.358595417 0.09769005 0.116110257 -5.884099132 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q5IXR3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV Strong_x_weak
ATAACTCCACGC _A_Concatenation AATCGATCCTAC _B pos_ctrl_A Target(s) 0.153725912 0.91324404 -4.870495372 0.0030859 -0.461338496 -5.895507635 vIRF2_VP64_pos_ctrl Human target core_VP64_a PRDM10 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA vIRF2_VP64_pos_ctrl_X_mid
ATAACTCCACGC _A_Concatenation ATGACCCCTTGT _B pos_ctrl_A Target(s) -0.424943429 0.44779842 -1.280595371 0.44159106 0.805067743 -5.943417085 vIRF2_VP64_pos_ctrl Human target core_VP64_a LEUTX LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation AAGGGTAAATTG _B pos_ctrl_A Target(s) -0.769843751 0.05287662 -4.288035434 0.00046299 0.084207744 -5.958619092 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q69513 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS vIRF2_VP64_pos_ctrl_X_mid
GGAGGCACATCG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.615425531 0.39800543 6.868327583 0.577456831 -5.997110012 Virus_target Virus_target hvTR_Q87042 hvTR_Q2HR71 RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GACTGTGGGCGG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -3.698113905 0.00599572 -5.194429706 0.00063386 0.901901075 -5.999739644 Human target Virus_target MON2 hvTR_Q2HR71 TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
TCTTGGTTTTGA _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 2.589077657 0.0005918 -5.088682444 0.00382558 -1.109166747 -6.005234499 Human target neg_ctrl ZNF597 Random_1 ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AAGACCGGTGCC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -0.522591911 0.59725044 -4.104785351 0.06099515 2.660729441 0.812553322 -6.025279443 0.040091246 Virus_target neg_ctrl hvTR_P08392 Random_1 RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ACTTTAGTAACA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.226775341 0.80528278 -0.288717664 0.8725678 1.612048606 -6.12645012 Virus_target Virus_target hvTR_Q69113 hvTR_Q2HR71 NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GAGATTGTGTCC _A_Concatenation CATTGTTTAAAT _B Target(s) Target(s) -2.265185586 0.00014167 -5.983440255 0.00141375 -1.604256553 0.997690695 -6.152999877 Virus_target Human target hvTR_Q2HR71 POLR1E PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV Strong_x_unknown
CTACGTGGCCCC _A_Concatenation AAAACAAGCATT _B Target(s) Target(s) -0.493048929 0.86342423 -43.28085123 -0.457230807 0.997690695 -6.162906773 Virus_target Virus_target hvTR_Q2HR71 cds_NC_030656.1_cds_YP_009268803.1_88 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS Strong_x_weak
CTCGCACCGAGG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.76552309 0.21159296 -5.643704007 0.0252298 1.063802185 0.997690695 -6.186463038 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 hvTR_Q2HR71 AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CTACGTGGCCCC _A_Concatenation TATTAAAAGTGC _B Target(s) Target(s) -0.07415744 0.93141616 -4.198302004 0.00604998 1.733461308 0.995270541 -6.229205387 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_202 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI Strong_x_weak
CATCTCACCTAG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.4416704 0.32544126 -2.041886392 0.24776135 1.012415533 0.997690695 -6.270797064 Virus_target vIRF2_VP64_pos_ctrl hvTR_P03206 core_VP64_a ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
AGTTGCTTTGCT _A_Concatenation ATGACCCCTTGT _B neg_ctrl Target(s) 0.301355238 0.62528738 -2.534783802 0.11387209 1.866959184 0.983636094 -6.38164972 0.095807415 neg_ctrl Human target Random_2 LEUTX DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Random_x_strong
ATAACTCCACGC _A_Concatenation GTACATACCGAA _B pos_ctrl_A Target(s) -1.171288747 0.67964161 -3.153192584 0.07956968 2.451510292 0.913394463 -6.516748909 0.097929575 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE vIRF2_VP64_pos_ctrl_X_mid
ATTTGTAGACCG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.366475595 0.73379786 -4.811401615 0.01033665 0.495767171 0.997690695 -6.640794238 0.067705317 Human target Human target MYCN LEUTX PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AGAGTGCGCTCT _A_Concatenation AATGGCACTAGC _B neg_ctrl Target(s) -0.623623953 0.52540625 -1.157186972 0.50023889 2.196145729 0.95807515 -6.686152164 0.085225681 neg_ctrl Virus_target Random_1 cds_NC_024711.1_cds_YP_009052522.2_44 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR Random_x_weak
CTACGTGGCCCC _A_Concatenation CGCCGGCATACA _B Target(s) Target(s) -0.946148243 0.01727004 2.99290184 0.23526565 -0.059486678 0.998501722 -6.747982296 0.062504997 Virus_target Virus_target hvTR_Q2HR71 cds_NC_030656.1_cds_YP_009268729.1_14 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI Strong_X_strong
CTCAGCGATATA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 1.748245318 6.77E-14 1.721792601 0.948350533 0.997690695 -6.749330014 0.067193726 Virus_target Human target hvTR_Q5Y0Q4 LEUTX FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
TACTTCTAGCGG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.363812452 0.26992692 4.232834683 0.06073326 1.276260349 0.997690695 -6.806346096 0.068273719 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094080.1_7 hvTR_Q2HR73 TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
ACTTGATGGTTT _A_Concatenation AGGCTTCCCGCT _B Target(s) Target(s) -0.027710303 0.96724179 -4.83793617 0.01741316 -0.825336777 0.997690695 -6.863460269 0.062240466 Virus_target Virus_target hvTR_Q2HR73 cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV mid_X_mid
AAACTTTGAGCG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.549842464 0.42955596 -5.729682993 0.00087405 1.155328609 0.997690695 -7.011659883 0.059746019 Human target vIRF2_VP64_pos_ctrl ATF6B core_VP64_a AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
AAGCAACGATAT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 7.944976712 7.585911696 0.02127152 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 Random_1 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AAGCAACGATAT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.664710356 0.59461055 -4.348682237 0.16881711 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 core_VP64_a GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
AAGCAACGATAT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.309483322 0.95787851 -1.478927037 Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 LEUTX GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
AAGCAACGATAT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -2.346593436 0.53232005 2.361801184 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 hvTR_Q2HR71 GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
AAGGGTAAATTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.617316133 1.391426822 0.67033502 Virus_target neg_ctrl hvTR_Q69513 Random_1 NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AAGGGTAAATTG _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.50744682 0.67511461 0.213570338 0.94993175 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q69513 core_VP64_a NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACGACTCACCGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -0.585300332 0.91102312 3.314306961 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 hvTR_Q2HR73 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ACGACTCACCGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.061283242 0.71346073 -16.01374614 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 Random_1 PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
ACGGCGTAAGTA _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.646268355 0.87986969 3.238855078 Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094078.1_5 core_VP64_a LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACGGCGTAAGTA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.568603773 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094078.1_5 hvTR_Q2HR71 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ACGGCGTAAGTA _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 3.09768615 3.238855078 Virus_target Virus_target cds_NC_025350.1_cds_YP_009094078.1_5 hvTR_Q2HR71 LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ACTCGTGTTGTC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.21059429 0.86342423 5.52800102 0.03526053 Virus_target Virus_target hvTR_Q9QB97 hvTR_Q2HR73 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ACTCGTGTTGTC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.380605892 3.182352799 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q9QB97 core_VP64_a VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
ACTCGTGTTGTC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.08639898 4.511437744 0.14019482 Virus_target Virus_target hvTR_Q9QB97 hvTR_Q2HR71 VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
ACTTGATGGTTT _A_Concatenation AAACTTTGAGCG _B Target(s) Target(s) 0.635247668 0.91324404 -0.120838354 0.96917964 Virus_target Human target hvTR_Q2HR73 ATF6B RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ Mid_X_strong
ACTTGATGGTTT _A_Concatenation ACGGCGTAAGTA _B Target(s) Target(s) -0.355250833 0.95038187 3.88115205 Virus_target Virus_target hvTR_Q2HR73 cds_NC_025350.1_cds_YP_009094078.1_5 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL mid_X_mid
ACTTGATGGTTT _A_Concatenation ATCCAATCTGTG _B Target(s) Target(s) -2.274709102 0.03561302 Virus_target Human target hvTR_Q2HR73 GLI2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT mid_X_mid
ACTTGATGGTTT _A_Concatenation CCCGTAGGGGCT _B Target(s) Target(s) -1.68102392 0.8027561 -2.22903373 Virus_target Human target hvTR_Q2HR73 TET3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS Mid_X_strong
ACTTGATGGTTT _A_Concatenation CCTAAATGCAAG _B Target(s) Target(s) -0.345572905 0.93767673 4.074562184 0.22666043 Virus_target Virus_target hvTR_Q2HR73 cds_NC_025410.1_cds_YP_009094498.1_7 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL mid_X_mid
ACTTGATGGTTT _A_Concatenation GGCCTTTCGGGC _B Target(s) neg_ctrl Virus_target neg_ctrl hvTR_Q2HR73 Random_3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ Mid_X_random
ACTTGATGGTTT _A_Concatenation GTCACCCTCCTT _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR73 E2F5 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN mid_X_mid
ACTTGATGGTTT _A_Concatenation Stop_2_B Target(s) neg_ctrl -1.424116901 0.40068284 -5.059430759 0.13874298 Virus_target neg_ctrl hvTR_Q2HR73 Stop_2 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE Mid_X_termination
ACTTGATGGTTT _A_Concatenation TCTATCACAACT _B Target(s) Target(s) -6.09199378 -5.135016247 Virus_target Human target hvTR_Q2HR73 TOX4 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH mid_X_mid
ACTTGATGGTTT _A_Concatenation TTAGTGCCATAC _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR73 TET3 RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP mid_X_mid
AGAGTGCGCTCT _A_Concatenation ACGGCGTAAGTA _B neg_ctrl Target(s) 3.28719166 0.58023965 neg_ctrl Virus_target Random_1 cds_NC_025350.1_cds_YP_009094078.1_5 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL Random_x_mid
AGAGTGCGCTCT _A_Concatenation ATCCAATCTGTG _B neg_ctrl Target(s) 1.733168452 0.752551 3.170857119 neg_ctrl Human target Random_1 GLI2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT Random_x_mid
AGAGTGCGCTCT _A_Concatenation CATCTGAGTCGT _B neg_ctrl pos_ctrl_A 0.166025685 neg_ctrl vIRF2_VP64_pos_ctrl Random_1 RWY_VP16_vIRF2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS Random_X_vIRF2\u VP64_pos_ctrl
AGAGTGCGCTCT _A_Concatenation CCCGTAGGGGCT _B neg_ctrl Target(s) -0.962615391 neg_ctrl Human target Random_1 TET3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS Random_x_strong
AGAGTGCGCTCT _A_Concatenation CCTAAATGCAAG _B neg_ctrl Target(s) 0.950595768 0.89056441 4.511448967 0.14019482 neg_ctrl Virus_target Random_1 cds_NC_025410.1_cds_YP_009094498.1_7 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL Random_x_mid
AGAGTGCGCTCT _A_Concatenation CTACGTGGCCCC _B neg_ctrl Target(s) 5.229533761 0.15190336 neg_ctrl Virus_target Random_1 hvTR_Q2HR71 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Random_x_strong
AGAGTGCGCTCT _A_Concatenation GGCCTTTCGGGC _B neg_ctrl neg_ctrl neg_ctrl neg_ctrl Random_1 Random_3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ Random_x_random
AGAGTGCGCTCT _A_Concatenation GGGAGTATGTCG _B neg_ctrl neg_ctrl 0.967797242 0.11077416 -4.51458373 0.10777474 neg_ctrl neg_ctrl Random_1 Random_4 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS Random_x_random
AGAGTGCGCTCT _A_Concatenation GGTGTCTCGATT _B neg_ctrl Target(s) 0.047609976 0.99110789 3.88115205 neg_ctrl Human target Random_1 E2F5 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV Random_x_strong
AGAGTGCGCTCT _A_Concatenation GTCACCCTCCTT _B neg_ctrl Target(s) neg_ctrl Human target Random_1 E2F5 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN Random_x_mid
AGAGTGCGCTCT _A_Concatenation Stop_2_B neg_ctrl neg_ctrl 3.072851399 0.24424642 neg_ctrl neg_ctrl Random_1 Stop_2 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE Random_x_termination
AGAGTGCGCTCT _A_Concatenation TACTTCTAGCGG _B neg_ctrl Target(s) -7.853662346 0.0065342 3.320391013 0.27528996 neg_ctrl Virus_target Random_1 cds_NC_025350.1_cds_YP_009094080.1_7 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ Random_x_weak
AGAGTGCGCTCT _A_Concatenation TCTATCACAACT _B neg_ctrl Target(s) neg_ctrl Human target Random_1 TOX4 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH Random_x_mid
AGAGTGCGCTCT _A_Concatenation TTAGTGCCATAC _B neg_ctrl Target(s) neg_ctrl Human target Random_1 TET3 CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP Random_x_mid
AGCCAGGTCACC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -6.366830417 2.868800308 0.15019428 Human target neg_ctrl MESP1 Random_1 GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
AGCCAGGTCACC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -3.474457124 0.47868657 2.37307374 0.14562931 Human target Human target MESP1 LEUTX GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Strong_X_strong
AGGCTTCCCGCT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.289944801 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 Random_1 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
AGGCTTCCCGCT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.115550616 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 hvTR_Q2HR71 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
AGGCTTCCCGCT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.823085941 0.78251797 3.238855078 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 hvTR_Q2HR71 VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
ATAACTCCACGC _A_Concatenation AGCCAGGTCACC _B pos_ctrl_A Target(s) 3.046493862 0.57362051 vIRF2_VP64_pos_ctrl Human target core_VP64_a MESP1 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation CCCGTAGGGGCT _B pos_ctrl_A Target(s) -1.156993199 0.73105591 4.058761015 0.17610913 vIRF2_VP64_pos_ctrl Human target core_VP64_a TET3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation CTACGTGGCCCC _B pos_ctrl_A Target(s) 1.289944801 vIRF2_VP64_pos_ctrl Virus_target core_VP64_a hvTR_Q2HR71 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation GACTGTGGGCGG _B pos_ctrl_A Target(s) 3.80171056 0.18222191 3.880482673 vIRF2_VP64_pos_ctrl Human target core_VP64_a MON2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation GGCCTTTCGGGC _B pos_ctrl_A neg_ctrl vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Random_3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ VIRF2_VP64_pos ctrl_x_random
ATAACTCCACGC _A_Concatenation GGTGTCTCGATT _B pos_ctrl_A Target(s) -0.27806378 0.9335834 2.406570592 vIRF2_VP64_pos_ctrl Human target core_VP64_a E2F5 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV VIRF2_VP64_pos strong_ctrl_x_strong
ATAACTCCACGC _A_Concatenation GTCACCCTCCTT _B pos_ctrl_A Target(s) vIRF2_VP64_pos_ctrl Human target core_VP64_a E2F5 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN vIRF2_VP64_pos_ctrl_X_mid
ATAACTCCACGC _A_Concatenation Stop_2_B pos_ctrl_A neg_ctrl -8.17897764 3.238855078 vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_a Stop_2 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE VIRF2_VP64_pos ctrl X termination
ATAACTCCACGC _A_Concatenation TTAGTGCCATAC _B pos_ctrl_A Target(s) vIRF2_VP64_pos_ctrl Human target core_VP64_a TET3 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP vIRF2_VP64_pos_ctrl_X_mid
ATAACTCCACGC _A_Concatenation TTCTAATTTCCT _B pos_ctrl_A Target(s) -0.485384513 0.87986969 2.802166251 0.22513253 vIRF2_VP64_pos_ctrl Human target core_VP64_a WDR76 LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS VIRF2_VP64_pos strong_ctrl_x_strong
ATCCAATCTGTG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 1.129263174 0.4610642 0.192620333 0.94571421 Human target Virus_target GLI2 hvTR_Q2HR73 PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
ATGACCCCTTGT _A_Concatenation AAAACAAGCATT _B Target(s) Target(s) 3.124427211 0.5677544 Human target Virus_target LEUTX cds_NC_030656.1_cds_YP_009268803.1_88 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS Strong_x_weak
ATGACCCCTTGT _A_Concatenation AAACTTTGAGCG _B Target(s) Target(s) -0.152739419 3.182352799 Human target Human target LEUTX ATF6B DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ Strong_X_strong
ATGACCCCTTGT _A_Concatenation AGCCAGGTCACC _B Target(s) Target(s) 0.360100154 5.977289987 0.02707288 Human target Human target LEUTX MESP1 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK Strong_X_strong
ATGACCCCTTGT _A_Concatenation AGGCTTCCCGCT _B Target(s) Target(s) -4.698834126 0.33149001 -4.353885607 0.11677552 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV Strong_X_mid
ATGACCCCTTGT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.795557194 Human target vIRF2_VP64_pos_ctrl LEUTX core_VP64_a DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation ATCCAATCTGTG _B Target(s) Target(s) -1.103048635 0.86892322 Human target Human target LEUTX GLI2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT Strong_X_mid
ATGACCCCTTGT _A_Concatenation ATCTCTCAGTTC _B Target(s) Target(s) 1.346083555 0.62528738 -43.28085123 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK Strong_X_mid
ATGACCCCTTGT _A_Concatenation CATCTGAGTCGT _B Target(s) pos_ctrl_A -3.023585795 0.570298 -0.945884629 0.76087181 Human target vIRF2_VP64_pos_ctrl LEUTX RWY_VP16_vIRF2 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS Strong_X_vIRF2\u VP64_pos_ctrl
ATGACCCCTTGT _A_Concatenation CCCGTAGGGGCT _B Target(s) Target(s) 3.040916671 Human target Human target LEUTX TET3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS Strong_X_strong
ATGACCCCTTGT _A_Concatenation CCGCCCTTATGT _B Target(s) Target(s) 0.077749843 0.98703959 7.94102658 0.00337399 Human target Human target LEUTX ZFX DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH Strong_X_mid
ATGACCCCTTGT _A_Concatenation CCTAAATGCAAG _B Target(s) Target(s) 2.610465196 0.28726554 3.880482673 Human target Virus_target LEUTX cds_NC_025410.1_cds_YP_009094498.1_7 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL Strong_X_mid
ATGACCCCTTGT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) Human target Virus_target LEUTX hvTR_Q2HR71 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
ATGACCCCTTGT _A_Concatenation GAGACAGCTCTC _B Target(s) Target(s) 4.475910609 Human target Virus_target LEUTX cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM Strong_X_mid
ATGACCCCTTGT _A_Concatenation GGCACTGTCGAG _B Target(s) Target(s) -3.603196445 1.025530185 0.72115884 Human target Virus_target LEUTX hvTR_P17147 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA Strong_x_unknown
ATGACCCCTTGT _A_Concatenation GGCCTTTCGGGC _B Target(s) neg_ctrl Human target neg_ctrl LEUTX Random_3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ Strong_X_random
ATGACCCCTTGT _A_Concatenation GGTGTCTCGATT _B Target(s) Target(s) -3.075478141 2.810937394 Human target Human target LEUTX E2F5 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV Strong_X_strong
ATGACCCCTTGT _A_Concatenation GTCACCCTCCTT _B Target(s) Target(s) Human target Human target LEUTX E2F5 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN Strong_X_mid
ATGACCCCTTGT _A_Concatenation TCTATCACAACT _B Target(s) Target(s) 3.880482673 Human target Human target LEUTX TOX4 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH Strong_X_mid
ATGACCCCTTGT _A_Concatenation TTAGTGCCATAC _B Target(s) Target(s) Human target Human target LEUTX TET3 DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP Strong_X_mid
CATGCTAACACC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -7.336940953 5.402315006 0.00370107 Virus_target neg_ctrl hvTR_F5HCV3 Random_1 TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Strong_X_random
CCAGACGGTCTG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl 1.108875481 0.76651497 -1.484006877 0.60245729 Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 Random_1 PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Weak_X_random
CCCCTTAGGAAT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) 0.078423388 0.96692351 -3.939831584 0.21401893 Human target Human target SYNE1 LEUTX SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CCCCTTAGGAAT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -1.524525408 0.12502988 -3.929579096 0.1287179 Human target Virus_target SYNE1 hvTR_Q2HR71 SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CCCGTAGGGGCT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 3.273902741 0.34507859 -4.56873309 0.17686603 Human target vIRF2_VP64_pos_ctrl TET3 core_VP64_a APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Strong_X_vIRF2\u VP64_pos_ctrl
CCGCCCTTATGT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 0.189224022 0.95448989 0.954544663 0.72115884 Human target vIRF2_VP64_pos_ctrl ZFX core_VP64_a VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CCTAAATGCAAG _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) -6.206799848 0.19361934 2.704493055 Virus_target Virus_target cds_NC_025410.1_cds_YP_009094498.1_7 hvTR_Q2HR73 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CCTAAATGCAAG _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl -1.50970093 3.243191603 Virus_target neg_ctrl cds_NC_025410.1_cds_YP_009094498.1_7 Random_1 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CCTAAATGCAAG _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -0.152739419 Virus_target Human target cds_NC_025410.1_cds_YP_009094498.1_7 LEUTX SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CCTAAATGCAAG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -4.986651789 0.21173203 3.170857119 Virus_target Virus_target cds_NC_025410.1_cds_YP_009094498.1_7 hvTR_Q2HR71 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CCTAAATGCAAG _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 3.88115205 Virus_target Virus_target cds_NC_025410.1_cds_YP_009094498.1_7 hvTR_Q2HR71 SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CGATGGCGGAAT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 3.870749594 3.880482673 Human target Virus_target KIAA1586 hvTR_Q2HR73 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
CGATGGCGGAAT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -0.152739419 Human target vIRF2_VP64_pos_ctrl KIAA1586 core_VP64_a DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
CGATGGCGGAAT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.197295569 3.880482673 Human target Human target KIAA1586 LEUTX DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
CGATGGCGGAAT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 1.96454545 Human target Virus_target KIAA1586 hvTR_Q2HR71 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
CGATGGCGGAAT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.737842502 Human target Virus_target KIAA1586 hvTR_Q2HR71 DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Weak_X_Strong
CGGCAATTGCTT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) Virus_target Virus_target hvTR_Q9QB93 hvTR_Q2HR73 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
CGGCAATTGCTT _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl Virus_target neg_ctrl hvTR_Q9QB93 Random_1 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
CGGCAATTGCTT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A Virus_target vIRF2_VP64_pos_ctrl hvTR_Q9QB93 core_VP64_a SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
CGGCAATTGCTT _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) Virus_target Human target hvTR_Q9QB93 LEUTX SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
CGGCAATTGCTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) Virus_target Virus_target hvTR_Q9QB93 hvTR_Q2HR71 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
CGGCAATTGCTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) Virus_target Virus_target hvTR_Q9QB93 hvTR_Q2HR71 SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
CTACGTGGCCCC _A_Concatenation ACGACTCACCGC _B Target(s) Target(s) -0.931032727 0.89748071 2.714505633 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK Strong_X_mid
CTACGTGGCCCC _A_Concatenation ACGGCGTAAGTA _B Target(s) Target(s) 2.281871687 0.64263557 -1.459937014 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094078.1_5 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL Strong_X_mid
CTACGTGGCCCC _A_Concatenation ATCCAATCTGTG _B Target(s) Target(s) -1.940780491 0.37442227 5.229533761 0.15190336 Virus_target Human target hvTR_Q2HR71 GLI2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT Strong_X_mid
CTACGTGGCCCC _A_Concatenation CCTAAATGCAAG _B Target(s) Target(s) -3.439922401 3.880482673 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025410.1_cds_YP_009094498.1_7 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL Strong_X_mid
CTACGTGGCCCC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -0.241272287 3.909200449 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR71 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
CTACGTGGCCCC _A_Concatenation GACTGTGGGCGG _B Target(s) Target(s) 1.239067547 0.56627226 3.450354443 0.04816017 Virus_target Human target hvTR_Q2HR71 MON2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL Strong_X_strong
CTACGTGGCCCC _A_Concatenation GGCCTTTCGGGC _B Target(s) neg_ctrl Virus_target neg_ctrl hvTR_Q2HR71 Random_3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ Strong_X_random
CTACGTGGCCCC _A_Concatenation GTCACCCTCCTT _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR71 E2F5 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN Strong_X_mid
CTACGTGGCCCC _A_Concatenation Stop_2_B Target(s) neg_ctrl 1.456107457 0.72790325 3.552235532 0.17398177 Virus_target neg_ctrl hvTR_Q2HR71 Stop_2 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE Strong_X_termination
CTACGTGGCCCC _A_Concatenation TCTATCACAACT _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR71 TOX4 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH Strong_X_mid
CTACGTGGCCCC _A_Concatenation TTAGTGCCATAC _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR71 TET3 TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP Strong_X_mid
CTCCATTAATGA _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) -1.337812988 0.83885356 0.488145629 0.87178775 Virus_target Human target hvTR_Q5IXR3 LEUTX FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Weak_X_Strong
CTCCATTAATGA _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.885988728 0.45505484 -3.464840821 Virus_target Virus_target hvTR_Q5IXR3 hvTR_Q2HR71 FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
GAGATTGTGTCC _A_Concatenation AAACTTTGAGCG _B Target(s) Target(s) 2.27374001 0.68922551 -4.301297729 0.17069892 Virus_target Human target hvTR_Q2HR71 ATF6B PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ Strong_X_strong
GAGATTGTGTCC _A_Concatenation AATCGATCCTAC _B Target(s) Target(s) -1.580928906 0.52243449 -3.483762303 Virus_target Human target hvTR_Q2HR71 PRDM10 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA Strong_X_mid
GAGATTGTGTCC _A_Concatenation ACGGCGTAAGTA _B Target(s) Target(s) 0.568603773 Virus_target Virus_target hvTR_Q2HR71 cds_NC_025350.1_cds_YP_009094078.1_5 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL Strong_X_mid
GAGATTGTGTCC _A_Concatenation AGGCTTCCCGCT _B Target(s) Target(s) -1.421503567 0.82433714 3.170857119 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV Strong_X_mid
GAGATTGTGTCC _A_Concatenation CATCTGAGTCGT _B Target(s) pos_ctrl_A 0.261814006 0.96306008 3.688929723 0.23657467 Virus_target vIRF2_VP64_pos_ctrl hvTR_Q2HR71 RWY_VP16_vIRF2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS Strong_X_vIRF2\u VP64_pos_ctrl
GAGATTGTGTCC _A_Concatenation CCCGTAGGGGCT _B Target(s) Target(s) -1.866462843 3.909200449 Virus_target Human target hvTR_Q2HR71 TET3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS Strong_X_strong
GAGATTGTGTCC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 5.944790166 3.909200449 Virus_target Virus_target hvTR_Q2HR71 hvTR_Q2HR71 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Strong_X_strong
GAGATTGTGTCC _A_Concatenation GAGACAGCTCTC _B Target(s) Target(s) -0.488412088 0.938493 4.794341901 0.15241597 Virus_target Virus_target hvTR_Q2HR71 cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM Strong_X_mid
GAGATTGTGTCC _A_Concatenation GGCCTTTCGGGC _B Target(s) neg_ctrl Virus_target neg_ctrl hvTR_Q2HR71 Random_3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ Strong_X_random
GAGATTGTGTCC _A_Concatenation GGTGTCTCGATT _B Target(s) Target(s) 3.057939775 0.3372476 -3.350042954 0.34364709 Virus_target Human target hvTR_Q2HR71 E2F5 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV Strong_X_strong
GAGATTGTGTCC _A_Concatenation GTCACCCTCCTT _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR71 E2F5 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN Strong_X_mid
GAGATTGTGTCC _A_Concatenation Stop_2_B Target(s) neg_ctrl -1.035843022 Virus_target neg_ctrl hvTR_Q2HR71 Stop_2 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE Strong_X_termination
GAGATTGTGTCC _A_Concatenation TATCGCAAGAAC _B Target(s) neg_ctrl -2.13615428 0.72546447 3.865581172 0.14649582 Virus_target neg_ctrl hvTR_Q2HR71 Random_5 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV Strong_X_random
GAGATTGTGTCC _A_Concatenation TATCTATCGTGT _B Target(s) Target(s) 0.810644487 2.700278593 Virus_target Virus_target hvTR_Q2HR71 hvTR_P03259 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS Strong_X_strong
GAGATTGTGTCC _A_Concatenation TCTATCACAACT _B Target(s) Target(s) -6.207145093 Virus_target Human target hvTR_Q2HR71 TOX4 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH Strong_X_mid
GAGATTGTGTCC _A_Concatenation TTAGTGCCATAC _B Target(s) Target(s) Virus_target Human target hvTR_Q2HR71 TET3 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP Strong_X_mid
GAGATTGTGTCC _A_Concatenation TTCTAATTTCCT _B Target(s) Target(s) -4.73677695 0.01468232 4.776309369 0.00426233 Virus_target Human target hvTR_Q2HR71 WDR76 PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS Strong_X_strong
GAGTGACCTATT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A -1.078919928 0.68024065 1.974215267 0.23654449 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 core_VP64_a HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Unknown_X_vIRF2\u VP64_pos_ctrl
GCAGATTCGAAT _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 3.635690607 0.48868143 3.880482673 Human target Virus_target AK9 hvTR_Q2HR73 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
GCAGATTCGAAT _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 2.455930966 0.66495342 Human target vIRF2_VP64_pos_ctrl AK9 core_VP64_a KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
GCAGATTCGAAT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 2.243394991 0.70817197 Human target Virus_target AK9 hvTR_Q2HR71 KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GGGAGTATGTCG _A_Concatenation ACTTGATGGTTT _B neg_ctrl Target(s) 1.328900573 0.49700987 7.513622093 0.03498942 neg_ctrl Virus_target Random_4 hvTR_Q2HR73 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Random_x_mid
GGGAGTATGTCG _A_Concatenation AGAGTGCGCTCT _B neg_ctrl neg_ctrl 3.366014058 0.10382793 3.502206768 0.17906841 neg_ctrl neg_ctrl Random_4 Random_1 LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Random_x_random
GTCACCCTCCTT _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) -2.989132428 0.49700987 3.180465029 Human target Virus_target E2F5 hvTR_Q2HR71 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
GTCACCCTCCTT _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) -3.332327621 0.28630375 3.238855078 Human target Virus_target E2F5 hvTR_Q2HR71 DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
TATTAAAAGTGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) 0.152486738 0.91324404 -6.570098977 0.07359621 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_202 hvTR_Q2HR73 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR Weak_X_mid
TATTAAAAGTGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 5.637376422 0.20405616 -5.184750061 Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_202 core_VP64_a MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML Weak_X_vIRF2\u VP64_pos_ctrl
TATTAAAAGTGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.091579619 0.96692351 1.302114269 0.64161874 Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_202 hvTR_Q2HR71 MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Weak_X_Strong
TCTGTACCAACG _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) 0.748208589 0.78182953 5.921461068 0.00956535 Virus_target Virus_target hvTR_Q6XA69 hvTR_Q2HR71 ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Unknown_X_strong
TTAAATGAGGGC _A_Concatenation ACTTGATGGTTT _B Target(s) Target(s) Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 hvTR_Q2HR73 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ RPRVPLVSTYAGDRYAVGGYGPEQSLVPPPLGLPLTLSNLQGEDICTWEEGLGNILSELQEEPSSSTRQATDRRRPRSRSPHGRR mid_X_mid
TTAAATGAGGGC _A_Concatenation AGAGTGCGCTCT _B Target(s) neg_ctrl Virus_target neg_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 Random_1 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Mid_X_random
TTAAATGAGGGC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 core_VP64_a LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
TTAAATGAGGGC _A_Concatenation ATGACCCCTTGT _B Target(s) Target(s) Virus_target Human target cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 LEUTX LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ DLREPSGIKNPGGASASARVSSWDSQSYDIEQICLGASNPPWASTLFEIDEFVKIYDLPGEDDTSSLNQYLFPVCLEYDQLQSSV Mid_X_strong
TTAAATGAGGGC _A_Concatenation CTACGTGGCCCC _B Target(s) Target(s) Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 hvTR_Q2HR71 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Mid_X_strong
TTAAATGAGGGC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) Virus_target Virus_target cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 hvTR_Q2HR71 LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
TTAGTGCCATAC _A_Concatenation ATAACTCCACGC _B Target(s) pos_ctrl_A 1.001593425 0.2685694 -5.981198825 0.12383329 Human target vIRF2_VP64_pos_ctrl TET3 core_VP64_a PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP LDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML mid_X_vIRF2_VP64_pos_ctrl
TTAGTGCCATAC _A_Concatenation GAGATTGTGTCC _B Target(s) Target(s) 1.437576689 0.31988807 -4.347452738 0.26334677 Human target Virus_target TET3 hvTR_Q2HR71 PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Mid_X_strong
Joint_ 2_A/u splice_ AGAGTGCGCTCT _b neg_ctrl neg_ctrl -17.95244707 3.880482673 neg_ctrl neg_ctrl Joint_2 Random_1 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA CPESRKGFGTSHKKKMRMPGCHEVRVQPCEDQESQHNRGIGLLGSFENLSKIAEHARARSGYFLTTAHPGYATGMAEQRGYLIKP Joint_X_random
Joint_ 2_A/u splice_ CTACGTGGCCCC _b neg_ctrl Target(s) -0.192334425 0.95452661 -2.866008684 neg_ctrl Virus_target Joint_2 hvTR_Q2HR71 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA TPTDLCLPTGGLPSPVIFPHETQGLLAPPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELL Joint_X_strong
Joint_ 2_A/u splice_ GAGATTGTGTCC _b neg_ctrl Target(s) -0.097806694 0.9798201 -1.837097072 0.55818445 neg_ctrl Virus_target Joint_2 hvTR_Q2HR71 GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA PPAGQSQTPFSPEGPVPSHVSGLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGELLCETASPQGPMQSEGGEEGSTESVSVLP Joint_X_strong
AAAACAAGCATT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_030656.1_cds_YP_009268803.1_88 core_VP64_b STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
AAACTTTGAGCG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ATF6B core_VP64_b AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AAGACCGGTGCC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P08392 core_VP64_b RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AAGCAACGATAT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 core_VP64_b GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AAGGGTAAATTG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q69513 core_VP64_b NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AATCAACGAGCA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl C2orf81 core_VP64_b SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AATCGAGACCAG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q8BB47 core_VP64_b EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AATCGATCCTAC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl PRDM10 core_VP64_b VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AATGGCACTAGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_024711.1_cds_YP_009052522.2_44 core_VP64_b EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
ACAACGTGCATA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P10407 core_VP64_b RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
ACCAGTATTTAA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ARNTL2 core_VP64_b NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
ACGACTCACCGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 core_VP64_b PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ACGGCGTAAGTA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094078.1_5 core_VP64_b LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ACTCGTGTTGTC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q9QB97 core_VP64_b VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ACTTTAGTAACA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q69113 core_VP64_b NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AGCCAGGTCACC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl MESP1 core_VP64_b GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
AGCCATAAACCT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P03209 core_VP64_b PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AGCCGTGCTTGA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 core_VP64_b GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
AGGCTTCCCGCT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 core_VP64_b VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AGTTAGGCTCTT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TOX4 core_VP64_b EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
AGTTGCTTTGCT _A_Concatenation CCCTATGTTCTA _B neg_ctrl vIRF2_VP64_pos_ctrl Random_2 core_VP64_b DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
ATCCAATCTGTG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl GLI2 core_VP64_b PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ATCTCTCAGTTC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 core_VP64_b VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ATGACAAAACGA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094074.1_1 core_VP64_b HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
ATGTGGCACGAC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ANKRD17 core_VP64_b QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ATTGTATCTAAC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl SMARCB1 core_VP64_b VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
ATTTGTAGACCG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl MYCN core_VP64_b PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CAACGCATGCCT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 core_VP64_b YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
CACACCCGGCAG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P19893 core_VP64_b VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CAGTGACATAGT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl EHMT2 core_VP64_b AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CATCTCACCTAG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P03206 core_VP64_b ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CATCTGAGTCGT _A_Concatenation CCCTATGTTCTA _B vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl RWY_VP16_vIRF2 core_VP64_b GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CATGCTAACACC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_F5HCV3 core_VP64_b TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CATTAGTCCCGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P06930 core_VP64_b AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CATTGTTTAAAT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl POLR1E core_VP64_b AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
CCAGACGGTCTG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 core_VP64_b PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
CCCCTTAGGAAT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl SYNE1 core_VP64_b SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CCCGTAGGGGCT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TET3 core_VP64_b APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CCCTATGTTCTA _A_Concatenation AAAACAAGCATT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_030656.1_cds_YP_009268803.1_88 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL STFVRQMYLPIMMDPHELTFEVWENIKRIVMDTYLHKETSGIMAINITVYKDKELPLGELINNQIMVQVPCNVVFKYYKMGDIVS VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation AAACTTTGAGCG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ATF6B DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AELMLLSEIADPTRFFTDNLLSPEDWGLQNSTLYSGLDEVAEEQTQLFRCPEQDVPFDGSSLDVGMDVSPSEPPWELLPIFPDLQ VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AAGACCGGTGCC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P08392 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RGALGWGAETEEGGDDPDHDPDHPHDLDDARRDGRAPAAGTDAGEDAGDAVSPRQLALLASMVEEAVRTIPTPDPAASPPRTPAF vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AAGCAACGATAT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863712.1_1573 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GHTVKDSIESTRILKDSGFKIVYHIMLGLPGADPDKDLETIKEIFANPDFMPDMLKIYPTLVVETAPLAELWKRGLYTPYDTDTL vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AAGGGTAAATTG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q69513 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL NKNSETLRSRSRSSESDCKKRSERSCSNHSWSSGSTQKSKRVDIDFTHGAKSIPNEYEQKKSTNKFFKNKRRNARKRFSDDSDSS vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AATCAACGAGCA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b C2orf81 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SAFPQDPGGVDRIPLGRSWMGRGSQEQMESWEPSPQLRVTSAPPPTSELFQEAGPGGPVEEADGQSRGLSSAGSLSASFQLSVEE VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AATCGAGACCAG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q8BB47 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL EPAKPSGNNMGSNDERMQDYRPDPMMEESIQQILEDSLMCDTSFDDLILPGLESFGLIIPESSNNIESNNVEEGSNEDLKTLAEH VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AATCGATCCTAC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b PRDM10 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VTTDLLTQAMTELSQTLTTDYRTPQGDYQRIQYIPVSQSASGLQQPQHIQLQVVQVASATSPHQSQQSTVDVGQLHDPQPYPQHA vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AATGGCACTAGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_024711.1_cds_YP_009052522.2_44 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL EGNLSSDVINRSFITNIAKIINDDKSTVEEQNAMVEAYAKQKFASHQYDYSNLLLEHRDSNGNIINYGLFRRVGNGTPKLTEYAR VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation ACAACGTGCATA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P10407 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RCYEECLPPSDDEDEQAIQNAASHGVQAVSESFALDCPPLPGHGCKSCEFHRINTGDKAVLCALCYMRAYNHCVYSPVSDADDET VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation ACCAGTATTTAA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ARNTL2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL NCRSMSNKELFPPSPSEMGELEATRQNQSTVAVHSHEPLLSDGAQLDFDALCDNDDTAMAAFMNYLEAEGGLGDPGDFSDIQWTL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation ACGACTCACCGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863933.1_1849 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PGKIYESNLHFLISRLKNDGYNVVGHKLVGDNKDKIEEAIEEASKIADVIIITGGTSAGEKDLVHQIIREKGEIIVHGLKFKPGK vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ACGGCGTAAGTA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_025350.1_cds_YP_009094078.1_5 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LWLTILIALVGNHESTCMNINFLQSLGQINSQKRFLNFYTQQPPSYMVIRLVPTLQLSANNCTLGSIVRYRNAIKELIQPMDENL vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ACTCGTGTTGTC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q9QB97 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VAINLFNSEYILENIFIHSTTSFYSLNYNDNVINYNGLSRYNNAIFIIDEAHNIFGNNTGELMTVIKNKNKIPFLLLSGSPITNT vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ACTTTAGTAACA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q69113 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL NNYGSTIEGLLDLPDDDDAPAEAGLVAPRMSFLSAGQRPRRLSTTAPITDVSLGDELRLDGEEVDMTPADALDDFDLEMLGDVES VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AGCCAGGTCACC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b MESP1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GQGRGLGLVSAVRAGASWGSPPACPGARAAPEPRDPPALFAEAACPEGQAMEPSPPSPLLPGDVLALLETWMPLSPLEWLPEEPK VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation AGCCATAAACCT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P03209 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPLDPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQ vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AGCCGTGCTTGA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863280.1_1236 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGGDKIIAFNSLSWDREDYVISPFELPNSQKTDEGYLARIKIPSIGYSECNPERVDDKVRINDLTLENSILKVSLNNDGDIISIY VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation AGGCTTCCCGCT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863028.1_914 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VKGQDMSFSGLLTAALRAAKKYPIPDICYSIRENAFDMLLEATERALALTEKKEIMVVGGVAASVSLRSKLNLLAKDWNVEIKIV vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AGTTAGGCTCTT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TOX4 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL EFPGGNDNYLTITGPSHPFLSGAETFHTPSLGDEEFEIPPISLDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMPV vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation AGTTGCTTTGCT _B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Random_2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DYDHESVHSCWKAKAAVYLYVSRTVRYKLVKIPKDSKEVKEVFGSDLSPTEFKKYLGYEKTSMLSQIIKSPRSYVIMSPSILYLM VIRF2_VP64_pos ctrl_x_random
CCCTATGTTCTA _A_Concatenation ATCCAATCTGTG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b GLI2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PSPGVNQVSSTVDSQLLEAPQIDFDAIMDDGDHSSLFSGALSPSLLHSLSQNSSRLTTPRNSLTLPSIPAGISNMAVGDMSSMLT vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ATCTCTCAGTTC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863839.1_1735 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VKVKIVATGMCHSDVNVFEGKTPVPPPVIAGHEIAGIVEEVGPGVTRVKPGDRVVSAFIHPCGKCRNCISGHENLCETFSAVRLK vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ATGACAAAACGA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_025350.1_cds_YP_009094074.1_1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL HFRTAERSPDCQIERIEVDGFEPGTYRIRTNARTPLTHGEVVALEEMADDIPEALANHTPFVDAQTELEECDEMEKFLEAIYSTL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation ATGTGGCACGAC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ANKRD17 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL QQGLMVASPAQTLNDTLDDIMAAVSGRASAMSNTPTHSIAASISQPQTPTPSPIISPSAMLPIYPAIDIDAQTESNHDTALTLAC vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ATTGTATCTAAC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b SMARCB1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VIHENASQPEVLVPIRLDMEIDGQKLRDAFTWNMNEKLMTPEMFSEILCDDLDLNPLTFVPAIASAIRQQIESYPTDSILEDQSD vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation ATTTGTAGACCG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b MYCN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PSCSTSTMPGMICKNPDLEFDSLQPCFYPDEDDFYFGGPDSTPPGEDIWKKFELLPTPPLSPSRGFAEHSSEPPSWVTEMLLENE vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CAACGCATGCCT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155862292.1_5 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL YPAEYGEEEREVKIEVSFDTYLRSKILIGSYLGISLPISRTLMLSRVKAVARQDILSISKVPSLFPQENPSGIVTPLIITVELLS VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation CACACCCGGCAG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P19893 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VTKATTFLQTMLRKEVNSQLSLGDPLFPELAEESLKTFEQVTEDCNENPEKDVLAELGDILAQAVNHAGIDSSSTGPTLTTHSCS VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CAGTGACATAGT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b EHMT2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AAAAGAAAAAAAEGEAPAEMGALLLEKETRGATERVHGSLGDTPRSEETLPKATPDSLEPAGPSSPASVTVTVGDEGADTPVGAT vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CATCTCACCTAG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P03206 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ATRVYQDLGGPSQAPLPCVLWPVLPEPLPQGQLTAYHVSTAPTGSWFSAPQPAPENAYQAYAAPQLFPVSDITQNQQTNQAGGEA vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CATCTGAGTCGT _B vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_b RWY_VP16_vIRF2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGGGSGGGGSGGSDALDDFDLDMLGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGSGGGGSGGGGSGGGGSGGGGSGGGGS vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CCCTATGTTCTA _A_Concatenation CATGCTAACACC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_F5HCV3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TTPSTTTPGTSLGSITTPQDVHATDVATSEGPSEAQPPLLSLPPPLDVDQSLFALLDEAGPETWDVGSPLSPTDDALLSSILQGL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CATTAGTCCCGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P06930 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AEGAEHQQKLTEKDKAELPLSIRDLAEALGIPVIDCLIPCNFCGNFLNYLEACEFDYKRLSLIWKDYCVFACCRVCCGATATYEF VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CATTGTTTAAAT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b POLR1E DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AETIIDTKGVTALVSDAIHNDLQDDSLYLPPCYDDAAKPEDVYKFEDLLSPAEYEALQSPSEAFRNVTSEEILKMIEENSHCTFV VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation CCAGACGGTCTG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155864195.1_2158 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PFASLYMFNRNLRDSLINANYTSLLKSIDATNIQIIFQYILSYLLLKRAINHQPTCIDDTRNNNCEELLRDLINEAYNYLNNLNN VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation CCCCTTAGGAAT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b SYNE1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SLAEELVSESCEADPAEQLALQSTLTVLAERMSTIRMKASGKRQLLEEKLNDQLEEQRQEQALQRYRCEADELDSWLLSTKATLD vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CCCGTAGGGGCT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TET3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL APFRSPQSYLRAPSWPVVPPEEHSSFAPDSSAFPPATPRTEFPEAWGTDTPPATPRSSWPMPRPSPDPMAELEQLLGSASDYIQS VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CCCTATGTTCTA _B vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_b core_VP64_b DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CCCTATGTTCTA _A_Concatenation CCCTGCCTCGGG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CCGCCCTTATGT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ZFX DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CCTAAATGCAAG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_025410.1_cds_YP_009094498.1_7 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CGATGGCGGAAT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b KIAA1586 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation CGCCGGCATACA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_030656.1_cds_YP_009268729.1_14 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CGGCAATTGCTT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q9QB93 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CTCACGACAAGA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_G3G929 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CTCAGCGATATA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q5Y0Q4 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation CTCCATTAATGA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q5IXR3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation CTCGCACCGAGG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CTCGTTATGGCA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b NOVA1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CTGCCGGTTGCG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation CTGGTAGTTGTT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_F6KS47 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation EPICXV.47_B vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl core_VP64_b 3x_vIRF2_core DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
CCCTATGTTCTA _A_Concatenation GACTGTGGGCGG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b MON2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation GAGACAGCTCTC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation GAGTGACCTATT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation GATCGCATGTTC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b INPP4A DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation GATGGAGCTACA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P03209 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation GCAACGAGGGTC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b QRICH2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation GCAGATTCGAAT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b AK9 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation GCGAGGCAGACC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation GGAGGCACATCG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q87042 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation GGCACAGCTCCA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b DTX2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation GGCACTGTCGAG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P17147 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation GGCCTTTCGGGC _B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Random_3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ VIRF2_VP64_pos ctrl_x_random
CCCTATGTTCTA _A_Concatenation GGGAGTATGTCG _B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Random_4 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS VIRF2_VP64_pos ctrl_x_random
CCCTATGTTCTA _A_Concatenation GGTGTCTCGATT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b E2F5 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation GTACATACCGAA _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation GTCACCCTCCTT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b E2F5 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation GTTCCACGCCTG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ZFX DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation Stop_1_B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Stop_1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK VIRF2_VP64_pos ctrl X termination
CCCTATGTTCTA _A_Concatenation Stop_2_B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Stop_2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE VIRF2_VP64_pos ctrl X termination
CCCTATGTTCTA _A_Concatenation TAAAATTTATCA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b HSF1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TAAGATTCAACG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ZNF292 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TACATGAGGCAC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TACTTCTAGCGG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NC_025350.1_cds_YP_009094080.1_7 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation TATCGCAAGAAC _B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Random_5 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV VIRF2_VP64_pos ctrl_x_random
CCCTATGTTCTA _A_Concatenation TATCTATCGTGT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P03259 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TATTAAAAGTGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_202 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation TCAGCCGTAGGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q805Y1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TCTATCACAACT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TOX4 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TCTGTACCAACG _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q6XA69 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY VIRF2_VP64_pos ctrl_x_unknown
CCCTATGTTCTA _A_Concatenation TCTTAGAGGGTG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TRO DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TCTTGGTTTTGA _B vIRF2_VP64_pos_ctrl Human target core_VP64_b ZNF597 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TGATGAGCGCCC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b EEF1D DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation TGCCAGACGGTC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_P10541 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TGGCCCCCAGTT _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b hvTR_Q3I8P6 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN VIRF2_VP64_pos ctrl_x_weak
CCCTATGTTCTA _A_Concatenation TTAAATGAGGGC _B vIRF2_VP64_pos_ctrl Virus_target core_VP64_b cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TTAGTGCCATAC _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TET3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation TTCTAATTTCCT _B vIRF2_VP64_pos_ctrl Human target core_VP64_b WDR76 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS VIRF2_VP64_pos strong_ctrl_x_strong
CCCTATGTTCTA _A_Concatenation TTTAACGGATTG _B vIRF2_VP64_pos_ctrl Human target core_VP64_b TWIST2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN vIRF2_VP64_pos_ctrl_X_mid
CCCTATGTTCTA _A_Concatenation Joint_ 1_B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Joint_1 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK VIRF2_VP64_pos ctrl X joint
CCCTATGTTCTA _A_Concatenation Joint_ 2_B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Joint_2 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA VIRF2_VP64_pos ctrl X joint
CCCTATGTTCTA _A_Concatenation Joint_ 3_B vIRF2_VP64_pos_ctrl neg_ctrl core_VP64_b Joint_3 DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD VIRF2_VP64_pos ctrl X joint
CCCTGCCTCGGG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863778.1_1657 core_VP64_b EVNWLKAIKVENTIRGIALAGKLEEFLDNIQDIVYLLSSERTLYFDELLLSNSIEDSIKKILEDGRKAIKNEINNEFSKDVNYIY DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CCGCCCTTATGT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ZFX core_VP64_b VYMTVNDSQPEDEDLNVAEIADEVYMEVIVGEEDAAAAAAAAAVHEQQMDDNEIKTFMPIAWAAAYGNNSDGIENRNGTASALLH DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CCTAAATGCAAG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_025410.1_cds_YP_009094498.1_7 core_VP64_b SVVYKNLQLGIPSDNGNVQEFKPLWSGNSVQTDLSTKECVDYIISQVNAHTLSLITLDIEEPTPEGVHDVSPALINSFLIAHFAL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CGATGGCGGAAT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl KIAA1586 core_VP64_b DTENNEVSKNHCRLSKAKEPHFEYIEQPIIEEKPSLSSKKEIDNLVLPDCWNEKQAFMFTEQYKWLEIKEGKLGCKDCSAVRHLG DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
CGCCGGCATACA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_030656.1_cds_YP_009268729.1_14 core_VP64_b DGNVEIPLVQQNQQPVQPVLLPPPNQLVDDVQDQADSPMLAQSSNHMNTQEDVLSSDMLNLFDDIEWNDEFIQQVLNNPETIFPI DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CGGCAATTGCTT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q9QB93 core_VP64_b SRQHAAYIDYALNRMKKMPIEMLGSDTITLKPYQHFVAKVFLGLDTMHSILLFHDTGVGKTITTVFILKHLKDIYTNWTILLLVK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CTCACGACAAGA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_G3G929 core_VP64_b ADDLFDFIETEGNFSQLLAAAAAAAEEEGIASGPDGGSQGSRRRGSSGEDLLFGPGGLFSDDAAEAEAEAAVLAAAAGATRPPRP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CTCAGCGATATA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q5Y0Q4 core_VP64_b FATGSPRRRRSTVPRTDVSLGDELCLDASPSMTPSEALDDFDLTLLGDDDHAGTLDAAAYASLDMADFEFEQMFTDALGIDEFGG DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
CTCCATTAATGA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q5IXR3 core_VP64_b FCQENTPSKTINYSFFLDKIFDITSVTKNLKPQTVKNYTKNNSNQLVWENFLAHMRSKKRVTMVEDYGHEYVFVDERFSTCSLEV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
CTCGCACCGAGG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863808.1_1697 core_VP64_b AEGLKVEVKYSGYAQPLIFSSLKSFSSFNEILKGMYNHTVIESFIIDYGNRDVILDIFHNSKKVLECLELIPFIRFSILIEIDNN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CTCGTTATGGCA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl NOVA1 core_VP64_b AFPAVLSGFTGNDLVAITSALNTLASYGYNLNTLGLGLSQAAATGALAAAAASANPAAAAANLLATYASEASASGSTAGGTAGTF DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CTGCCGGTTGCG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864303.1_2298 core_VP64_b LIKIPYKTEKIFPSDVKGKYAYMKDTVIIIRNQSKVLYIDCAHCNLANYKPPSFLSNYIFEYEIMEGGEYCECIAKTLQEQLKPL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
CTGGTAGTTGTT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_F6KS47 core_VP64_b SSGGVLQAKNALQIAKREMVLAHPLLGYLTKNASTFEESNKVAQLFSLNGYNPVDAAWYFAAWARGVWPKRRAIWLWGPASTGKT DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
EPICXV.47-A\u splice_ CCCTATGTTCTA _b vIRF2_VP64_pos_ctrl vIRF2_VP64_pos_ctrl 3x_vIRF2_core core_VP64_b GGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDLGGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL vIRF2_VP64_pos_ctrl_X_vIRF2_VP64_pos_ctrl
GACTGTGGGCGG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl MON2 core_VP64_b TWHTVIWKVLFHLLDRVRESSTTADKEKIESGGGNILIHHSRDTAEKQWAETWVLTLAGVARIFNTRRYLLQPLGDFSRAWDVLL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
GAGACAGCTCTC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863833.1_1727 core_VP64_b LREVKFMGMITDTFIHVGKYGKLKEFLGDKDGKSLLFVNVRSSVKFDKAIVRKNIIRDLITSESLRYPDNYSLLSYVFGDGNIMM DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GAGTGACCTATT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862583.1_356 core_VP64_b HKIFPDGESYIRIPESVRGEEVAIVQSTYYPQDKHLIELLLMIEAVKNLGASKIISIIPYLAYARQDRRFKDGEALSLKIVLNLI DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
GATCGCATGTTC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl INPP4A core_VP64_b NSIHGLNAARPDYIASKASPTSTEEEQVMLRNDQDTLMARWTGRNSRSSLQVDWHEEEWEKVWLNVDKSLECIIQRVDKLLQKER DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
GATGGAGCTACA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P03209 core_VP64_b EMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLESMTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
GCAACGAGGGTC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl QRICH2 core_VP64_b RGLVQPGAVQRGLVQPGVDQRGLVQPGAVQRGLVQPGAVQHGLVQPGADQRGLVQPGVDQRGLVQPGVDQRGLVQPGMDQRGLIQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GCAGATTCGAAT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl AK9 core_VP64_b KPLVENRASIFEKCHPIPAPLAQKMLTFTYKYISSFGYWDPVKLSEGETIKPVENAENPIYPVIHRQYIYFLSSKETKEKFMKNP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GCGAGGCAGACC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863098.1_1014 core_VP64_b QWGDRSGALLSYWASTKENNFPRRIFTAVSSSVLVQDYLKKYNIEVVWTKVGSVDIAHKLMEEKGIAGFEENGGFIFPLHQYVRD DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
GGAGGCACATCG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q87042 core_VP64_b RPKRSRKRSIDHESCASSGDTVANESGPLCTNTFWTPGPVLQGLLGESSNLPDLEVHMSGGPFWKEVYGDSILGPPSGSGEHSVL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
GGCACAGCTCCA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl DTX2 core_VP64_b TTNAWGAAPPSLGSQPLYRSSLSHLGPQHLPPGSSTSGAVSASLPSGPSSSPGSVPATVPMQMPKPSRVQQALAGMTSVLMSAIG DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
GGCACTGTCGAG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P17147 core_VP64_b YALEAYMPEADRVPTDLAALCAAVGCQASETTVHVVVGNGLKEFLFAGQLIPCVEEATTVRLHGGEAVRVPLYPPTLFNSLQLDA DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
GGCCTTTCGGGC _A_Concatenation CCCTATGTTCTA _B neg_ctrl vIRF2_VP64_pos_ctrl Random_3 core_VP64_b GAKNQLKISDPRGGDMILLYKNRGCMSETGSSVEVIQVSAVWRITLDNVNDPNLITEYGQKLPERTESDLGFQTERSQSENKKKQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
GGGAGTATGTCG _A_Concatenation CCCTATGTTCTA _B neg_ctrl vIRF2_VP64_pos_ctrl Random_4 core_VP64_b LLLDRNSHLTCHIIKVNKLVVSAYSFINQFQKSQAQDCVEDVFDQWVINALLVTTNGGDAAAIFKSAQEGNSEGISDHWLHMDDS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
GGTGTCTCGATT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl E2F5 core_VP64_b LTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDNEGVCDLFDV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
GTACATACCGAA _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155863816.1_1708 core_VP64_b NENKLRDIKEKIEVKKRELEDLQNRIARLDQEIKNKESLASPEIRQSYERQLQEINAQLQKITAQRNEAEIEIRLLEKVLDQIRE DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GTCACCCTCCTT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl E2F5 core_VP64_b DLTQPSSQSLTPVTPQKSSMATQNLPEQHVSERSQALQQTSATDISSAGSISGDIIDELMSSDVFPLLRLSPTPADDYNFNLDDN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
GTTCCACGCCTG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ZFX core_VP64_b VVDSDITVHNFVPDDPDSVVIQDVIEDVVIEDVQCPDIMEEADVSETVIIPEQVLDSDVTEEVSLAHCTVPDDVLASDITSASMS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
Stop-1_A/u splice_ CCCTATGTTCTA _b neg_ctrl vIRF2_VP64_pos_ctrl Stop_1 core_VP64_b ***IIVNSNDQIILGYIHELKGPIRRYKNNKVVSDQCICSVEMLDRDTFEDLIFQLGHKKKEAYDTGISLQGVTVICVTKYEVSK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Termination_X_vIRF2\u VP64_pos_ctrl
Stop 2_A splice CCCTATGTTCTA B neg_ctrl vIRF2_VP64_pos_ctrl Stop_2 core_VP64_b ***LIVTSDNSKFKPEELLIAEETSYQDKINSDSPENDNLMDVCSHILYAEDPGNKKNTSGWYSEPLKINKPMMCINAILMCVLE DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Termination_X_vIRF2\u VP64_pos_ctrl
TAAAATTTATCA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl HSF1 core_VP64_b LSDHLDAMDSNLDNLQTMLSSHGFSVDTSALLDLFSPSVTVPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHY DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TAAGATTCAACG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ZNF292 core_VP64_b CFKDPSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIKQALETAGIPSTFEGAEMLSHVSTGCVSDASQVNATVMPNP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TACATGAGGCAC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155864086.1_2031 core_VP64_b PRKKIYTVKVLLPDFSTVADNIIGIRKARILPEFSEYIDADVIKALNRNLNAGLEESEGGMLLISIEEDQLQDLLKVLKGKVEVL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TACTTCTAGCGG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NC_025350.1_cds_YP_009094080.1_7 core_VP64_b TWLAAPNTENEEDISVHLHTGSSCCIRPVDMCVLNETEAEVPHLLVPRANKFVFDDNPLDNEQIEIIDNVMIQSRLSGVDCLTIQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
TATCGCAAGAAC _A_Concatenation CCCTATGTTCTA _B neg_ctrl vIRF2_VP64_pos_ctrl Random_5 core_VP64_b PNYTEGNKKSRRKPKRDVKKSASHMKDLKTIIIKVKYFYNTSETIEQYESHPEYRIVAWDADIILDGFNFTDKAVSVEGSFKYMV DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Random_X_vIRF2\u VP64_pos_ctrl
TATCTATCGTGT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P03259 core_VP64_b RTEMTPLVLSYQEADDILEHLVDNFFNEVPSDDDLYVPSLYELYDLDVESAGEDNNEQAVNEFFPESLILAASEGLFLPEPPVLS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TATTAAAAGTGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_202 core_VP64_b MHVHVNLDKAAFLLENNMYEPEQFPGLIFRMDDPRVVLLIFSSGKMVITGAKREEEVYKAVKKIFDKLNELDCIKPIEEEEELEI DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
TCAGCCGTAGGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q805Y1 core_VP64_b RGAAVAPARLREINYVRERLNLPLVRSAATEEPGAALTAAPALQTQRARSSGYFMTLIRVKLDAYSEGSGSEGAEVLREHAYSRR DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TCTATCACAACT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TOX4 core_VP64_b LSGGLTMDLDHSIGTQYSANPPVTIDVPMTDMTSGLMGHSQLTTIDQSELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLH DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TCTGTACCAACG _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q6XA69 core_VP64_b ESAKDTTSNSMFILGKPSGNNMESNEERMQNYHPDPVVEESIKEILEESLKCDVSFESLLFPELEAFDLFIPESSNDIASKNVSY DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Unknown_X_vIRF2\u VP64_pos_ctrl
TCTTAGAGGGTG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TRO core_VP64_b FGSVLNTSTGFGGAMSTSADFGGTLSTSVCFGGSPGTSVSFGSALNTNAGYGGAVSTNTDFGGTLSTSVCFGGSPSTSAGFGGAL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TCTTGGTTTTGA _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl ZNF597 core_VP64_b ASMPPTPEAQGPILFEDLAVYFSQEECVTLHPAQRSLSKDGTKESLEDAALMGEEGKPEINQQLSLESMELDELALEKYPIAAPL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TGATGAGCGCCC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl EEF1D core_VP64_b QHVSPMRQVEPPAKKPATPAEDDEDDDIDLFGSDNEEEDKEAAQLREERLRQYAEKKAKKPALVAKSSILLDVKPWDDETDMAQL DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
TGCCAGACGGTC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_P10541 core_VP64_b RMLPDFFTGNWDDMFQGLLETEYVFDFPEPSEASEEISLHDLFDVEVDGFEEDANQEAVDGMFPERLLSEAESAAESGSGDSGVG DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TGGCCCCCAGTT _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl hvTR_Q3I8P6 core_VP64_b GFYIDGKLFNEDLSSKYVTFTKNVIASDAVTKYFSPLFKYVISEAKDRFIKTWMINIMIHMNVDPNNIIPTLEKYYPNSGRAQIN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Weak_X_vIRF2\u VP64_pos_ctrl
TTAAATGAGGGC _A_Concatenation CCCTATGTTCTA _B Virus_target vIRF2_VP64_pos_ctrl cds_NZ_WFIY01000004.1_cds_WP_155862466.1_219 core_VP64_b LLFIDEINRRHPVWYLGKINATNPCGEEPLLPWESCNLGSINLEKFVKETENGKYIDWDGLAETIKYAVRLLDNVIDANKYPLKQ DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TTAGTGCCATAC _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TET3 core_VP64_b PLPASQGSAVPLPPEPSLALFAPSPSRDSLLPPTQEMRSPSPMTALQPGSTGPLPPADDKLEELIRQFEAEFGDSFGLPGPPSVP DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
TTCTAATTTCCT _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl WDR76 core_VP64_b IEKRQPPKSKRKKPKRENGIGCRRSMRLLKVDPSGVSLPAAPTPPTLVADETPLLPPGPLEMTSENQEDNNERFKGFLHTWAGMS DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Strong_X_vIRF2\u VP64_pos_ctrl
TTTAACGGATTG _A_Concatenation CCCTATGTTCTA _B Human target vIRF2_VP64_pos_ctrl TWIST2 core_VP64_b SPTPGKRGKKGSPSAQSFEELQSQRILANVRERQRTQSLNEAFAALRKIIPTLPSDKLSKIQTLKLAARYIDFLYQVLQSDEMDN DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL mid_X_vIRF2_VP64_pos_ctrl
Joint_1_A/u splice_ CCCTATGTTCTA _b neg_ctrl vIRF2_VP64_pos_ctrl Joint_1 core_VP64_b GGSGSGAGMSPSLAGMSPGAGAMAGMGGSAGAAGVAGMGPHLSPSLSPLGGQAAGAMGGLAPYANMNSMSPMYGQAGLSRARDPK DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Linker_X_vIRF2\u VP64_pos_ctrl
Joint_ 2_A/u splice_ CCCTATGTTCTA _b neg_ctrl vIRF2_VP64_pos_ctrl Joint_2 core_VP64_b GGSGSSDSMEDTGHYSINDENRVHDRSEEEEEEEEEEEEEQPRRRVQRKRANRDQDSSDDERALEDWVSSETSALPRPRWQALPA DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Linker_X_vIRF2\u VP64_pos_ctrl
Joint_ 3_A/u splice_ CCCTATGTTCTA _b neg_ctrl vIRF2_VP64_pos_ctrl Joint_3 core_VP64_b GGSGSNKQQGDKSPKNKFQPANKFNKKRKFQPDGRSDESAAKKPKWDDFKKKKKELKQSRQLSDKTNYDIVVRAKQMWEILRRKD DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGGSLDDCLPMVDHIEGCLLDLLSDVGQELPDLGDL Linker_X_vIRF2\u VP64_pos_ctrl
TABLE 11 validated screening of engineered Gene effectors encoded by functional biological sequences
Bar code First_screening_l2FC First_screening_ padj Search_method Ml_type Start_Point Peptides Base average Log2 fold change lfcSE stat P value padj
AAACCAATTGTA 5.119999321 8.35E-43 EMC xgb Is known to be ICPSTATPGNSGSDALDEFELDMEWGMLLDDFDLDMLDLPAEEKHCMDTCTDFAALDGDCLFDDMPDHSPSPDDDCLFENFPPLQ 4526.575317 5.954022431 0.407373466 14.61563632 2.23E-48 8.51E-46
TAAGCTGCCCAC 4.323888801 1.70E-38 EMC cnn Is known to be PPFPATQVSGDTPEINNDIDQGDGVNFDCDMLESDAHDDFWLDMLGAMQQPEFDLADLDLVHGGWVEGFNWMNGDDMPEPGPGAG 2701.590351 5.564089658 1.311157407 4.243647352 2.20E-05 0.000217981
ACGGATTACTGG 4.081262297 5.07E-38 EMC Both of them Is known to be AIPDVDQFNSFEDFFQDMWDTSLPIDFDTDEFNTATTDFDDFLKLLGDNQAGIPSTPEGAEMQFHVHTGCPPVASQVSSHDTIKL 31536.81878 5.84757675 0.276898957 21.11808876 5.42E-99 2.69E-95
CCCAAACGCCCA 4.043357552 3.19E-29 EMC xgb Is known to be PDDPDEPVSPNNLATTLMYHQLGTDGNVSGGGPCGDSPLDLTLFSLHDDLSAEWPEFDPATLDMADFEFDQWKWDDLGVSDLDDV 5829.540689 7.431268254 0.407779489 18.22374211 3.34E-74 5.53E-71
ACCTTCGCCTAT 4.627459008 2.16E-26 EMC Both of them Is known to be DQPPSEEGFPPHWKEPTEPSDQGYDIPSLDEFLDDMGLVSPMTNMMSGLMGYNDLGTIDQSELAELCMLDIEQYTSDAGEGGLWF 7122.193991 6.505983853 0.529034468 12.29784493 9.30E-35 1.25E-32
TTAACTTCATTC 4.215728925 4.59E-24 EMC cnn Is known to be SPDSDWPHFSGRSDPIPDDVPVGWELPDLNFYIPDFALDDFDLDMLGASFACDEQNDMLHNPEQEETPLQTPRPTEWFYGAYNID 8326.709328 6.145640058 0.518191907 11.85977622 1.91E-32 2.16E-30
AATGCGGTACGG 3.351681816 7.70E-24 EMC cnn Is known to be GWHNEDMPDYTHMLLNSMPAHETQLAWFPPSPDTWINTNPWTDMDLPDLDINDASIQELLNLEVPVQDPDGWDSISPGFSSMEPE 9837.306217 4.340889006 0.368288226 11.78666245 4.57E-32 5.04E-30
ATGTTCCTCGTT 3.428566629 1.89E-22 EMC xgb Is known to be TPGETEYGPRLTPSPMPTDLGPDTFNFGFDMFAGLDTSPEEPIPSLASDLDDYLHTVDHQEGCCPDALDDVSYILEDLFDFMDRN 8213.662548 4.499396353 0.57951825 7.76402875 8.23E-15 2.09E-13
GATACCATTTCG 3.583504325 1.16E-21 EMC Both of them Is known to be GQSEPSMSCDFFNPFTMGAEPEYSSNDPPRFSFAPAKFTTPECFQELLYDDLDLNPSTDGEDVIMAMADALDDFDLDAWPEWPSP 3640.935737 6.106570803 1.498131723 4.076124089 4.58E-05 0.000438971
AGCAGATACTGC 2.843425118 1.45E-21 EMC Both of them Random arrangement HAQRNASMYTGFDCDSDIGPIPDCFDTSPFMDTCSDGWCHTLPHDTEDDIWMQLDFPVFLGDAPIEGPELTPDFISLLDDAIDQW 8519.847519 4.226007071 0.479086816 8.82096298 1.13E-18 4.02E-17
ATCGATGCACGA 4.716581793 2.72E-21 EMC Both of them Is known to be AHPSQPPEEYWDDFLLDMLMSDDHWIGTQYPELMHALDDFDLDMLGSDTYDMSCLTTEDFQAGGSFGGLSALCGSEGPDSPGHGD 8069.509948 6.74353574 0.944896049 7.136801714 9.55E-13 2.06E-11
TCGTCTGCGCGG 3.330713788 6.25E-21 EMC xgb Is known to be SSPASFEIPILETPDYALPSDGPGVNDDEDIIEMLLASAPPTGLVLSDQLEGQPPSGSCDEAGSLSMPHTYWGNMFEDMFGDSMT 5404.119194 4.162820957 0.381208385 10.9200666 9.24E-28 7.27E-26
ATACCAGTGCTA 3.154160636 7.37E-21 EMC cnn Is known to be VGTLPSDSENFPRSDTPPQFGMSDDNLCEMTPSDTGDSFDLHLNLPDDQAALHALLQEAGPLDWDELVPLDDFDMNFTLNGMYDQ 4805.577306 5.016679734 0.52645192 9.529226787 1.58E-21 7.55E-20
ATGCAACGATCT 3.265181601 1.05E-20 EMC Both of them Is known to be GQMDDSNTDITDDLLAPHLVSMMDEWLGDCDHIEGSVLDLFNDHEQTLPDLGILEELLIGDATAPEYQAGQCGNIMTVDDPPHLQ 2819.264236 5.032015858 0.605503943 8.310459272 9.53E-17 2.88E-15
CGGTGGAGACAT 3.055986445 1.16E-19 EMC cnn Is known to be PPVGSMPDYPHELNHRAWELLFGEEAHNGDNFPSHPSPELVHPSMLESASWDDFDQDMLKFMGYDDLWSGPVQSQILDDLHLFED 13677.24098 4.987848349 0.315029298 15.8329666 1.84E-56 1.14E-53
CGGGGCTACCAC 3.167911667 1.37E-19 EMC Both of them Is known to be IGECQLPSFDGFPALQPPVEAEFVFDFPEPSEGGEEMCYHDLPDYDVFGFEASTPTNTPRLAPGDNPEWEDYDSLFPDHFGFGWD 3300.124699 4.716206537 0.459201795 10.27044447 9.58E-25 6.41E-23
CCCATGCGTTCT 2.804166835 1.93E-19 EMC Both of them Is known to be SSENPSNLMSPGLDETGGSSEPYRDHPFPEWDTFIDDLMLDDDLPDPFTWTPGVEDTGDCTDQPLSPLWVDTYFTWDWPGENYAC 6491.65921 4.213714539 0.429936178 9.8007908 1.12E-22 5.89E-21
CTCGCAGTTCAA 3.12800265 2.13E-19 EMC Both of them Is known to be CEAPMEQFDPGCNIALVAGPLDGLGEYDFNDLDLNWFEGYHLTDHSCEQLIDFMDPDPTDFDWDDLADPSFGTSPLTPDPPIYPA 3966.283148 5.265035131 0.577915539 9.110388587 8.21E-20 3.36E-18
TCTAATGCTGCC 2.729384692 2.21E-19 EMC xgb Is known to be SGGSSPSPQYGSPEDSLLDLVDHEEHVCPSDDWLDDFNDLFSDTAELWIPTELNNWEPYGRASPSHGSSPKEASQDDPIGWIPDM 18350.7544 4.227883191 0.304669771 13.8769369 8.74E-44 2.55E-41
GAGAGGGGATGA 3.10774722 2.68E-19 EMC xgb Is known to be GHGDELMQSPDPRAQMPFLFADPLGTPALGDCAVPDALSDFNLDTLGSGNEHMFIWDMNDVDAFDHMLDDMAKDMYDCEQLGLDI 7056.450094 4.845202056 0.539042438 8.988535432 2.51E-19 9.48E-18
AAAACGGCGAGG 4.066138534 6.45E-18 EMC xgb Is known to be AAFSDLLSPEELWSDTHFDAEFTEDWWDEINCLENPAAPGGAAMDSGSGLDDCLPPVDHLEINWPSPLQDEIQEFLDLGDDCDLS 2783.459791 4.946102784 0.442942307 11.16647181 5.95E-29 5.00E-27
TGGTGTTGCGCG 2.98162189 1.53E-17 EMC xgb Is known to be PDAAQSVEDDNMTLGGEWFTGWGYDFFAGFDHIEGALLDDPPTVEYDLDINELDSILLALSAMDMPRPKEDAAECGSTQSGATTK 14490.43403 5.559255815 0.453908623 12.24752193 1.73E-34 2.20E-32
ACACATTGAGTG 3.203441202 1.59E-16 EMC xgb Is known to be LCGSELSLDSLDALEPSGVSMPSNSDECALWDPLYDLPDNFDWLEEIDRSLGIPPLPPTSMGPDLCGFIWSLEHENTTQDWGHHE 4458.715243 5.056965826 0.523655592 9.657045396 4.59E-22 2.34E-20
AACTGTGCATGA 2.880537949 2.89E-16 EMC xgb Is known to be QCTGIDEMSTWDDVMTHEYDTQTQFDFPEPLEGSVSPFRPNLGPLQYQELMAEPAEDPSWSWPYDQFDLDALGLDALDVCWYPPN 7382.676254 5.035426693 0.47729093 10.55001547 5.08E-26 3.70E-24
TCGCGTGGTCGG 2.947195769 5.28E-16 EMC Both of them Random arrangement DLTQYEVPSCSDQFIGEHFAWPVPPDLEFDQPFEFDQMEYFPAGDTDATNAGDDWPPDLSGIEEIFTIVPEDHVDMFAGLDMLFS 8658.178596 5.04575372 0.397944076 12.6795548 7.68E-37 1.27E-34
AAGCGCTTAATT 2.68966923 5.93E-16 EMC Both of them Is known to be GGGGSHPDEDLNIATDGSIVHPLCVHDELDDLTYPGALDTYDGDQLGIFDLFDFDLDMLGGTIPNTADPVISYLLQQVTHDTNDI 11013.84417 5.419354249 0.358795093 15.1043154 1.52E-51 6.26E-49
CCCTCCAAGTCC 2.865739411 8.85E-15 EMC cnn Is known to be IDATRDLIGPLDEIASDGWDTSYPIDFDFDQFNTETTDFDTLHELYDLDYTAPEWPNEEAVSGSDPIACLPDPSVGIIVDTQEWQ 4427.018617 5.301234582 0.384541921 13.78584308 3.10E-43 7.54E-41
ACGAGCCTCTGG 2.628355527 2.04E-14 EMC cnn Is known to be ANDNAASAEIIYWDTLSPCHKWLPDDFDLNMLPEDMLDEFDCDLLVSDAIHQFDLEMLGYGQGGPEMRHGLNIYSDDYVIIPSGE 13604.29819 4.344654087 0.374666137 11.59606822 4.31E-31 4.36E-29
TGCTGTGCTGAA 2.805230688 4.74E-14 EMC Both of them Is known to be TTTASGGSSAGPPMDLDNLGSLDLDFFDLDYQHEPAHPFFDLDMHGLDDLYDCLAMYEDLCKADEGDCNDAGFMDPMGGSLGSNS 4669.428431 5.40637343 0.603104015 8.96424712 3.12E-19 1.17E-17
CGGAAGGGATTC 2.578421581 5.38E-14 EMC Both of them Is known to be DNPPHSHVDPHQGCFLDTALPTIVGCQTIEPTVCSDILDLFDPYALGSEALDTDNLDTLGIDDLLRECCAGVASWQQEPEPPQCD 5871.215801 4.834202421 0.988770769 4.889103293 1.01E-06 1.18E-05
GTACATCTCTAC 2.66285405 6.66E-14 EMC xgb Is known to be LSEPEGSPGTSLDTWPVNYVTWDQVFALQGTPAEALPHDVNMTDDLDVDQGLMEMLMTDESELETLLDCLLPVETQLPPAYGPQD 4066.873224 3.115188943 0.472894893 6.587486964 4.47E-11 8.33E-10
TACGGATTCAGT 2.250071279 6.69E-14 EMC xgb Is known to be STGAPEGSLDENFNMIPGPLLPLPMQEDPWQLLPDMGTYPSGDLLGDLPDLGELSELLCPEVSMAIESQHDFFGEGSELELFASG 9579.904441 4.608126833 0.462518818 9.963112105 2.21E-23 1.29E-21
TGCACCAGGATC 2.310260534 6.78E-14 EMC Both of them Is known to be PDVDNLTGCDFDTLLNHLPGIMDQNIHDLMEPSTQPNQGGASASLLSDGSQLDFDSLYEWDADASAGEWQHLEAEAWEFVCHTFS 7842.133206 3.790652903 0.426763899 8.882318566 6.55E-19 2.37E-17
ACTCTAACCCGG 2.702462319 1.15E-12 EMC Both of them Is known to be ECPGGDDNIPAPEGYPLNNENDDCLDNHLMMDYSNIDDLLTLCGVDLSDIPSPGPPVGMSFGPFDEWFDDIDGGQNSRTAGSLLG 5051.781196 3.936630884 0.458083393 8.593699188 8.42E-18 2.80E-16
TGATACCAGCGG 2.216500899 1.74E-12 EMC Both of them Is known to be DTPMHSEMQAQHEGEMDLFHDVGWEWPDLGFAWYEDALDPFDWKILGADALYDADILASPTPASDYNLLDEHGGFMDDDFYFDFN 9456.127685 4.279782435 0.429531954 9.963827827 2.19E-23 1.29E-21
CCGGCTGTGGAT 2.750477544 2.14E-12 EMC xgb Is known to be CDSMDNDKNSETMDGLGTLPVVCDFDLDYNFDSSQLQEFLGNAGPDFDALWDMLDTYMSEDDRALFTEGAGPVCEEYGQPASWHC 5183.521362 4.444169636 0.686567256 6.473028819 9.61E-11 1.73E-09
TTTTCAAGGTCT 2.376180384 2.63E-12 EMC cnn Is known to be HLCLDCTIVETLGSGDLMLEQHPSLECPQHSGESHNLTFPATDLLAQDALDHFDLDMTMWLEEWSFNIDSADWDLAGMESGGSGI 6104.022365 3.478543963 0.396589166 8.771152278 1.77E-18 6.22E-17
AAGTCTGCCCTA 2.722695953 3.76E-12 EMC xgb Is known to be VGALLGLDSYPKTTPSLMDLSVCEYDDPDGSLGTKQVNDLDGLDMLGSRLCDWMDLDNTAFDNLMPAEGAMPDEGGDIDWFMMDL 3401.096216 4.007689605 0.320074839 12.52110169 5.72E-36 8.60E-34
CGCCAGGTCAAT 3.422410728 4.22E-12 EMC Both of them Is known to be TENNLSPMANWWDEQGLMDNDQYSDWPDLQELPPDLDLMTEALDLGSPLTPELHEILDMFGIDECGNHDSAAGDGGVVYMTKGCQ 1794.392624 3.83666413 0.981855364 3.907565484 9.32E-05 0.000850924
GACCCTGTGTTG 2.159322453 5.90E-12 EMC Both of them Random arrangement ATAQDDEAPLQVHLDEVIIPNHEPCADDSPAQEWDNWFDMLEWTDAYNHAFSSMPSFPWFEDVALWSDCNIDTVEQPDEPPCFDI 21497.78239 4.543120079 0.488102452 9.307718213 1.31E-20 5.73E-19
CTGAGAAGGACG 2.928688534 1.21E-11 EMC xgb Random arrangement QIYRFTADCTPIQCDEGQYELDFDWDFDLLEWCDPTHMTPNDEWTPEPPQGCSSSVHGFTSEFEEEYLMDQWEEFMPTPPPHHDL 2319.866342 4.481402153 0.324984114 13.78960372 2.94E-43 7.54E-41
ATGTCGGCTAAC 2.763198266 2.28E-11 EMC xgb Is known to be GWEISGGAGDADGDPHPGQAFSGPMDFDAALPNIPTFDFDDFPLFADLESFMEPLGQEGLDCLGYDASLMKAVQENLGDLTQSSW 3109.53508 5.10638125 0.457186064 11.16915332 5.77E-29 4.93E-27
CCTTCCCCAAGG 2.366018466 2.88E-11 EMC cnn Random arrangement AILENWVIPDSSSAVVCGDFSTDELQPTHVMHAWEEPPHQMDISLEEFDNFWQEQAQHDFDAEMDEWLSSEILSIFNDAPPLGTF 7098.031021 3.939302993 0.35008659 11.25236758 2.25E-29 2.07E-27
GTTGTGTGCGGC 2.837886627 5.02E-11 MH Both of them Random arrangement PLCPLGAEPQHGVGWEPCPLSCEPQHAAPPIHCYLWPDNADFLDNGAALDEVPFCEIDIFDGDEFPEIMDLGVNWLEEWDPFSES 6806.653545 3.462881162 0.448271471 7.724964411 1.12E-14 2.77E-13
AGATGGCGCGCG 2.31643741 6.61E-11 EMC xgb Is known to be GHADNPAGSSLLPSPVEPDPTQQDLLAPWSLPDLDGFNPSLWFDMESISDSETQLGDHLDVDVYALLETMWNLPDPDEIPELPKD 5995.386258 3.865825672 0.482492241 8.012202777 1.13E-15 3.14E-14
CTGATTTGGCTC 2.319825022 9.13E-11 EMC Both of them Is known to be ECEICMALDSLDEMFEQWHYVDRELIQGPSNEVIAPLITLDLMSEATYPAPGVWGTTLGMTGETVDDIFDWNIEDYLEIDFSELM 4439.357544 3.025815306 0.347732722 8.701554716 3.27E-18 1.11E-16
CTGGACTGCTGT 2.533045923 1.14E-10 EMC xgb Is known to be TYAIWPEPPPLDLHLEDLCACLEQEANGEDTPFESDAAATFDAMLDFDAALEDCDTGMADSPASLNFSLDMGDSDYFSNWMTMML 4588.405827 3.107137775 0.364739068 8.518796171 1.61E-17 5.19E-16
GCGGCTCCCTCT 2.74392385 1.43E-10 EMC cnn Is known to be APPVFDDICETLDEAEPQHGYPDHTDGLVHQVLEPELEAFDLGMLGDDSLDNFDLTHMWAMLEEEAWGESPAAECNQSVQEGQGV 4197.168894 3.46885743 0.427285237 8.118364806 4.73E-16 1.38E-14
AGATTAAAGCCT 2.751187468 2.26E-10 EMC xgb Is known to be TEAEEEASTPTEPPGSEFTDIMCDWLPCEGDTPPSDALDGTLLDILGQALDPDPWPHMNGSIAHLEFDLDMGYWCSPVQHQPEMP 3593.698148 4.393520926 0.275196928 15.96500718 2.24E-57 1.59E-54
CAAGGCTAAGAC 2.69982613 3.81E-10 EMC Both of them Is known to be CSHDRNHSTFEMFPLSESTWDVICPDAHDACLPTTTEMLPEDLYLDSPLEPELMEILDDFLGAECQLNDGFIADFLLGAGDENEP 2731.405984 4.755116421 0.442425166 10.74784345 6.07E-27 4.63E-25
TGCGCGTCGGGC 2.281598856 4.20E-10 EMC Both of them Is known to be CDVTSEYAYNQLAEEHLIPMEPHRPLCTSGPLSDTFAMDDFDSPDLNDQALDDVPALWDFFDALNNAPNAMPLLDMLTDFDLHNA 19826.41598 4.606464851 0.342756109 13.43948284 3.55E-41 8.00E-39
TTGGTGGCATTC 2.215076721 4.40E-10 EMC Both of them Is known to be FDQMWLYSFNSIMMANHEWDNQPSHDGGPNTPPPDHETMIPDLNLDWLDTPEEPDDVDMFLNDWGLVSQMLDLQDCVDWPMPYDG 4873.884194 3.216486583 0.32216115 9.984092082 1.79E-23 1.07E-21
AATGGACTGGTT 2.033061136 5.67E-10 EMC xgb Random arrangement EHTEFMATGMDQGESNDYCWMYDLACECMADSGSPNISPSLTLSEYEEFCRLLPDMDTVDVGGPWAAQEPLFMHEIYESDEPQDL 7592.360319 3.845906972 0.314701379 12.22081386 2.41E-34 2.98E-32
AAATACTGCAGG 2.494192536 1.04E-09 EMC Both of them Is known to be LGDFDLDSMLLPAVEPPCVAEPAEPPESELSSIYRPADWLAQADWHMNTTEDALSSDVNQFLSELFEPDNWENWVGENINYGLDE 3690.590661 4.873338259 0.342450862 14.23076653 5.90E-46 2.09E-43
CTCGGGCAGGTA 2.604935509 1.15E-09 EMC cnn Is known to be QDMDDAEAASTGEFPVTDWLEGPLPRQHGIASGIEGFGSTGPSNLDEDSWLDYLPMVDQILSDEYLPIADVFNELDDLLDFSCTM 6507.295553 5.120077565 0.544311251 9.406525321 5.13E-21 2.27E-19
AATTTCGACAGG 2.587277291 1.31E-09 EMC Both of them Is known to be VDWPDEGTEFEHTDYFQNYVLPDVDIDDLLALLGDASPSKEQMGESTVPRTDMTLWGYLSDGAMPEENWKDGFCDWEMGGGEGTR 3826.28169 5.079305712 0.322580517 15.74585396 7.33E-56 4.04E-53
CCCGTAACGATG 3.003944082 1.49E-09 EMC xgb Is known to be AREAIGIKTPDGAGAMACAEVFGSELFPTGDLGDNDALDDFDLDDLLNDEFVMPDMLPDILDTQFLTTLQPHSFDFMDGHGHGHL 4889.82076 5.527868326 1.429443823 3.867146254 0.000110116 0.000992249
GATATTGCGTTG 2.418220494 1.60E-09 EMC xgb Is known to be NEAFCSTICETTGEAWIDPDPRGPQPWEYAFEELCASPMFDTSFDDLIATPWYLEGLQPDHLLSIDPVCSEILGDTEENKTMQHE 2952.157287 3.696882182 0.498807356 7.411442794 1.25E-13 2.92E-12
ATTGACGGCTCC 2.163176182 1.97E-09 EMC xgb Is known to be FDDQSMLGLLDNFTPECLSSPSEQQDLCPEEDPYAFNPAGSISGDIIDELMLCEIRMDPWSYHTPADEPWWHPHCDLPQPSYEMP 6979.376905 3.851480205 0.370503752 10.3952529 2.61E-25 1.87E-23
TGTGCAGCAACA 1.982487689 2.13E-09 EMC cnn Is known to be QPYVYPSVNTNISPSEPAHEPNENEWLGWDNSLAWNVGSLNDEFWLDLLTAGQPCFELPDVGGEPMNTGLADLMEFWAASEESMS 5305.44634 2.899576542 0.385370452 7.524127828 5.31E-14 1.28E-12
GAGAGATTCCGA 2.551333283 2.57E-09 EMC xgb Is known to be AQAAPAAVEQLDDTISLLLQFGDDFDLYKGGLPAGGPWHGGPYYSTDSGGDDFDLDCIEPWVMEDSELSDVGLLDTDIADLILQG 1991.521231 3.532971756 0.404654784 8.730829087 2.53E-18 8.80E-17
ATCGTGAGTGTG 2.262308478 2.85E-09 EMC xgb Is known to be KTPVTVDTVSTCYPLERCNTVAELFDAPMAQDYTQWDNALELCQELFLDWDTEFNLHHAFEDDIMAGFDMLDQFEGTAEFLPTSP 6234.613598 3.814696977 0.372463012 10.24181422 1.29E-24 8.51E-23
TGCGTTATTACA 2.647816103 3.30E-09 EMC Both of them Is known to be PWVCPRPMGDSDAVAPEIDTVGEWHFEIPDVDIINSPPVLHDLLGESSTLPDWALPWMDDQFCELDLMGQLGGPPAEFDEHSVPY 2575.609906 4.597906052 0.495412721 9.280960814 1.68E-20 7.24E-19
GGACACAGCAAT 2.294005924 3.42E-09 EMC xgb Is known to be HTPRGSMQFSTFCELEAVDFVCPASGETPCLHDEEELLDWKDFDDLPQLAPMMDVGDLDSIGPGWSWLDSYTSPIEEFLGCFDAE 3320.657289 4.600019242 0.473741745 9.709972336 2.73E-22 1.41E-20
TATACCCTCACT 1.868296462 3.78E-09 EMC Both of them Random arrangement EGGPAWFTQPPSQDQGLPCCTCSAPECMIMPVAEWECLSQPIADFEQFTQAYLDNLNRDMSGEDNMSEQWCWDFMNSFDDTAWDT 11554.24409 3.364970384 0.270474183 12.44100395 1.57E-35 2.22E-33
AATAATCACGGA 2.544483495 4.16E-09 EMC cnn Is known to be VMQMDVEATGCCDTTDTNENWLAGCEWPLPEPYAHQDMSSLDNLLHSDLESIDTIEMWAGGDTLALLETFMETMEMPHDYHDGPY 2479.653659 4.652697059 0.59547403 7.813434047 5.57E-15 1.44E-13
ACGATGATTTAT 2.141150192 4.91E-09 EMC Both of them Is known to be LDEKCSMGFSDFIPTCPLNPMCGFMEWSSEPTDVEFDIPPTWLDDLTSLDVSDVVGIFDDLAFPPPAQTPEDQAHYSAGGGGTQG 3402.944369 3.435930164 0.362052739 9.490137204 2.31E-21 1.07E-19
GCCGGGTATACA 3.489565564 5.20E-09 EMC xgb Is known to be SPLRAPDIEFALPAWEDGLDLDWCFDATFDDFNFDTTVEDAFLDIFNWMEGPCNILSELGQGEPPPTRCQPEQGEDQYETPVSYP 542.8187585 4.939402207 1.400626722 3.526565736 0.000420987 0.003365177
GGGGGTTGAAAA 2.560598509 8.35E-09 EMC Both of them Is known to be WSYPCGIEWPGPYSSSGRNSSQTQPRSDMTDLDLDNFLDLLKTDVESIDLGDFNDDLLGDSDTLDLNQYLFPVDEEDDTCMQPDV 2204.636885 5.224970759 0.46637011 11.20348548 3.92E-29 3.47E-27
TTGTCCCCCGGT 1.808854031 8.64E-09 EMC xgb Is known to be PGLPDHVTPCPDDEFNLYDNTQHPFDMPEPELCFEEMPPLDLFEILEISLTADINCPAVDTNSLEAFDLEDIQQTDNNPFTYFNE 9957.513956 3.441297694 0.282201216 12.19448216 3.33E-34 4.02E-32
ATTGCATTTGAT 2.066328081 9.47E-09 EMC cnn Is known to be MATNSPVDSIRDETSPNKLEGSHIDPGDLDEFLDHMDLDDLWKTLWKDDQIMSHNGSINSCLENDMDVLFMHCGDFEIPLEHEEW 5277.930376 5.124824125 0.369942108 13.85304354 1.22E-43 3.36E-41
AGGTCTGACGAA 3.074632322 1.26E-08 EMC cnn Is known to be GTRVNSQPITTEAMDSVAQSQAHELYCTPSEPFDTPCLPMYSLDSDDSLMGPTSDEAIGALLDIESFFDSILGGPSLGTTDDDPM 1207.401103 4.525256318 0.584285374 7.744941976 9.56E-15 2.42E-13
TTGGGAGGAGAG 2.56784145 1.26E-08 EMC Both of them Random arrangement KQYPQGVHGAIPQDPDAPAWPEFEDAEMFFACEACDLSPPETNTEDPELQPHSWEPLLAYEPSFISETIDTMWGCWDNWPWPDLA 2008.069279 3.00813829 0.498879331 6.029791376 1.64E-09 2.60E-08
TGAATGGTCCCA 2.009243552 1.28E-08 EMC Both of them Is known to be DWWEHFCPTTDWGDPLQGELYTPWSDVPLDQIQPIEPTAGFSDADLFNMLFHSDPSSQFDISHEDWLVNEGGSMLPTTPQESENV 4160.500342 3.197160301 0.378186936 8.453915228 2.82E-17 8.95E-16
ATTTCTATATTT 2.494322665 1.32E-08 EMC Both of them Is known to be AEPSGSGMPTSQGWITGPQTPHLIDWITHDDIPGWPTDPSKDMSTLELDSSKFSLLDMISDEWLTQGSFEWPTDGEGMTGQGGGE 2358.52868 5.034214528 0.396416383 12.69930997 5.97E-37 1.02E-34
CTTTCGACGGCC 3.558736338 1.61E-08 EMC Both of them Is known to be TAEVIPYVDCPEASANAPTGANALEQPDLYDLASGSTWPLYASDMEMSDFLGDFCLDMLGMHDDIMWHNTLLDMDVLDDWDLSEP 1161.258135 4.778532339 0.882487466 5.414844427 6.13E-08 8.26E-07
AATTTTCATATT 2.247778994 1.91E-08 EMC Both of them Is known to be QEQVDLEGFALEQPIDDSNLEMNSPDGPVPQSAPRISEIGSNWFDIIDELGSPDMFDWLWCPPTPWDDICFTLDDLITDPQFEDH 3759.934356 4.584710178 0.453036696 10.11995324 4.51E-24 2.83E-22
CCGGTAAATGAG 2.475600864 2.03E-08 EMC xgb Is known to be PPSDTDQWEFASLDFFDYDPEGPGGDGGGEPTHSTLLGMWSDLHLGDSNLTESMPSPELPGPVGALLETWDISTMAEWLDIPIDG 6081.704663 4.964694822 0.287881085 17.24564439 1.21E-66 1.20E-63
CTCGGTGATCTC 1.907001695 2.04E-08 EMC Both of them Is known to be AQTEEIQTDCAVGSIPQLTISPLDWHLDMDPHQPMFSGSLSQSHPTEIDDLGFLPEDLDNLAADVDAFTEIQNGMSLAEYYLWME 6613.611521 4.213494922 0.550889685 7.648527537 2.03E-14 4.96E-13
AAAAGCGTCGCA 2.134682329 2.07E-08 EMC xgb Is known to be DPSLGCLQPASLPIDPASELINPSPSRDSNYEPENHGSDLSPMVIQEMPDLGGLWMNASELAMDWREFTACDSDLWDLLGDGSDL 3787.236222 3.691808033 0.358778117 10.28994762 7.82E-25 5.31E-23
TGGCTCTCGCGG 2.103824635 2.07E-08 EMC Both of them Is known to be GAAYQDDTLSYLSELVEQQLQQYHMEQWGSLWDSPATGHSTMSSDLELPDLGDLTELLCPPADPQLSSHFTGLSNWIQQVDGCNE 8089.092324 4.623577399 0.423171264 10.92601929 8.66E-28 6.92E-26
TGGGGGGTGAGC 1.817920948 2.07E-08 EMC cnn Is known to be GGAMTTRDDTSAISMPCFGGSQHAGDHPPSETPNEALADFDLHLLGDSDFAGVNFDWMNKGCDVADFFEICDAAFGDIRGFPEQS 10869.03137 4.181253231 0.580617637 7.201388597 5.96E-13 1.32E-11
ATAACCCATTTC 2.448547857 2.24E-08 EMC Both of them Random arrangement ALNVNAIEGSEHGDMPLMSELEHHVTHCPEWWYNDEAADFLQWFEETTNEYDNILGDHEFDPFQEDYNMPVCWPNPDFQGDINSP 7573.912503 4.600962998 0.576603402 7.97942395 1.47E-15 4.07E-14
CTGCTTCATTGC 1.938567473 2.37E-08 EMC Both of them Random arrangement ALQQHTTPVPSHSEIADIMQDWPTGLIQEDLTISPPPPETQAFWQTLINEFFNNTAMWGEYGDHNPDEWAPCGSGGDTDQNPITQ 7178.99148 3.314809881 0.216570867 15.30589007 6.98E-53 3.15E-50
AACTCCCATTAA 2.577766341 3.31E-08 EMC Both of them Is known to be VNPYFSYEDITPMTGFAPTQTCNDELPELFLEPQVAESFDEAELEMPSPHMNDFDLDMLLPDEWPPFDDWLTELMHFMDVQHAHL 3016.204519 4.523971905 0.459272214 9.85030612 6.83E-23 3.64E-21
TGCTGATTTTGT 1.811972777 3.49E-08 EMC Both of them Random arrangement MDPDNMLFGVEAWDHCFMGVEHMDNVQEFAETLGFNDMDFDDMPNDIELLNCLGSPDSPAHTSLWCPGYMEQTHDPEPTSTDHSG 7774.705425 3.233147015 0.256304436 12.61447936 1.76E-36 2.81E-34
CGCCTCTGAGTA 2.13798058 4.33E-08 EMC cnn Is known to be YNGCHPGSSNWFGDQMYTQWEEGMDLDDILGSIGEITPPTSHSLDFFTDFFALSPMNSETDLEWADFFEGQEWHGQDGCTHTEWE 3470.035309 4.747080906 0.368741237 12.87374568 6.33E-38 1.12E-35
TCTACGCGGACT 2.655393578 4.93E-08 EMC Both of them Random arrangement ELGLYGELPHRAIGSELEEYPCVWSGQTPNTQHKDANSLMSCQDFQDILDAGFEDFTQGVDFDDFTYSVLDDAWLDQGGATNQWM 2056.983884 3.268110773 0.599765711 5.448979012 5.07E-08 6.94E-07
ATCAAACGTCAA 2.144529239 6.48E-08 EMC cnn Is known to be PPDSMDCIGSPGVGVAGPSSPHSESQDYQGWETSTHEYNLEDDLQLLDLETYSPGSPLTEIEPDMLARNWFGECLFGDWVEWPPE 3372.299367 4.32565456 0.655259831 6.601434051 4.07E-11 7.62E-10
ACAGTCATTTGC 2.024450252 7.00E-08 EMC xgb Is known to be YNANHMEPPTPPTTFFTDNLLSNEDWGCWNGPNDSAVNELFSVWGPQAAPPEQSPSFIGPSFDELMDWSYGWPCGGEWSLNWEDW 3302.487431 2.744518629 0.379299743 7.235751346 4.63E-13 1.04E-11
TATTCGATCCCC 2.436808735 7.03E-08 EMC cnn Is known to be EMPEADQQSGSPWSNLLFEEVGHEPDRSPADLDQDMFSGGVTLGLNWYDLNDLMDFDCMDFNDLMNGEGETFPTLEGFLTGICSP 1806.514894 3.260025097 0.50318379 6.478795939 9.25E-11 1.68E-09
CCAATGGTGGTC 1.922808608 7.76E-08 EMC xgb Is known to be ATGRDCAEFQCDSPGSVSTSNWLQLVEDMLDPDPEGLGDFDWREILEPDCVMELDDPAFASLDMASQEMPMPGSVGFHDDLGQQE 15420.14925 4.001409923 0.229334815 17.44789567 3.57E-68 4.43E-65
CGAAAGACTTAA 2.262321474 9.87E-08 EMC Both of them Random arrangement PSSNYDGTGHSIITINPDLDDCQSQDEPGWLWDTMTHAESQEPIPFSSVEDVFRPFPCDDLGQFQEDFYTFPGDFFDYIAWGEMQ 1991.724518 3.64291212 0.493232958 7.385784068 1.52E-13 3.48E-12
AATTGGCTTAAT 1.777729074 1.13E-07 EMC xgb Random arrangement LPEADDDFCPSESELFSQIMSLWDDLQYDDPGADLCQCILDDCIIDNDIFVNAFITSGGDSPSHQSGGRFNHNDSPQAFNPCESH 8726.604729 4.04323507 0.28580354 14.14690338 1.95E-45 6.45E-43
ATGCTCCAACCG 1.784880055 1.96E-07 EMC xgb Is known to be PGPEPEDVGMGALWGETSGLGDSEQGDVFSDLDLDFWIGSLACDMDESLAMLDSVGHKLCSIYPNLPPSPLSLPDAWGQGMDQEM 6851.794893 3.919629579 0.333516693 11.75242397 6.86E-32 7.39E-30
CATTGGCAAACT 3.345768534 1.96E-07 EMC Both of them Is known to be NWFDELWAEIMDDELPNLHDVRWPQGDYQSIEAQEPLSDDCFQEMLCDNALMGNDGLPDDANLLTWLESEWSASGSTDEWPAQPR 987.2371565 4.45969524 0.540535131 8.250518769 1.58E-16 4.72E-15
GTGAGTTGCAGA 2.168095575 1.96E-07 EMC xgb Random arrangement NPNDDDYNPGDDNISWDPSDDDTMHTSPTSPVNPFDAPLGYDFNLDDFMFAQDFTDWPVFEEWMGHFEMPEWCFDPMYEDISFQM 2269.113821 3.659358342 0.481179063 7.604982481 2.85E-14 6.92E-13
TCTTTTTTACTT 1.903270723 1.96E-07 EMC xgb Is known to be ADMADFSGMECVDMFTEVELSPDVFDFELPEYTTQEMSVSEDLFLDADTFGTPQPAPAQIPGDPYIQASVRNLAAWMLEDPWDFV 7590.062409 3.965355189 0.421199489 9.414434951 4.76E-21 2.12E-19
AAGGAGACAAGG 1.811236875 2.01E-07 EMC Both of them Is known to be AVPVTQEFPCQAYDMLPEQHVEAMDSHDSTLPPGQTGSMTCPMLALDDLMIDDILPGLRLSPNLCDPFECEFGTSFGFPVTPLDQ 5676.289028 3.259391301 0.339405223 9.603244374 7.75E-22 3.88E-20
AACCATTCATCA 2.049910382 2.40E-07 EMC cnn Is known to be TNIPGKHINNWADDPLWDELLDMVPTHPCWFPMADHLEGANLFDVTMDDLEHPDLGDLGFNIFWNDALDSNHGEGQGQNESRVND 2441.593596 3.062256948 0.381310155 8.030882231 9.68E-16 2.73E-14
CGAAACCGGACA 2.851536203 2.40E-07 EMC Both of them Is known to be SFAATNMASPCSLDSTSCSHPERGHEPMPAWLNLDMFSMTEDLTTRSPLTWDLDNLLDTFDKDPWGNDAMNDVLGGDGCYTEAPQ 919.9137865 2.89632772 0.68415404 4.233443861 2.30E-05 0.000226754
TCCGCTAACCCA 2.379026112 2.40E-07 EMC Both of them Is known to be YPPVPSTTESATCTPHDGWPTETPEDFNFDEFNTDYTGFDDFLMLLCDNSNGIQSPWEGAQMPEFVDSDPVCDAPEVEGNGWIWL 3486.114268 4.578939803 0.483513381 9.470140822 2.79E-21 1.28E-19
CAAGATTGGATT 1.646640029 2.75E-07 EMC cnn Random arrangement CEAHHCSHEGDHSVGMPHMGELHDAYAEEGVNLPNSPVIDSVDCDSASSYLDLGSSQWLKEDYLDDNLLAPFAWDTIWDFEVPDW 10125.18577 3.659044994 0.275316601 13.29031734 2.63E-40 5.68E-38
GAACGCGCCAGC 4.040319734 3.02E-07 EMC xgb Is known to be EPTDLYDVPGGLPCDIDAEWNISGCLGDSDLESSGLFSGSTGSPMDWNVDMKDFDMDMLIPPCFPPADSDDGQEPPDLGYPLFLL 344.5465224 8.723555848 2.96993445 2.937289019 0.003310954 0.021705141
CTCTCACAAACG 1.672319365 3.78E-07 EMC cnn Is known to be GWGNLDATTSVWYDLPSEYWSCPDFEDVSWYCDIPLESPDLVDASLFDLFNSLDDPYAIANPQIDPESSEPFNTDIPGAEYHMLG 6522.24041 2.66580763 0.280332086 9.509463094 1.92E-21 9.05E-20
GATTGGTAACCG 2.384617935 3.88E-07 EMC xgb Is known to be QLPEWQLTADQDPLLELPDAAQYDLDALCTRPLKSDDDTLGSAWYNFSDFQSIWGEVSGMWDAIPSSDYYEILDDSLMTLLGDDD 1806.550522 3.282744095 0.520666485 6.304888422 2.88E-10 4.96E-09
AAGACGTTCATT 2.02767736 4.14E-07 EMC xgb Random arrangement LDPGLVPMGDIQALQQDRALEHARGAGQVGQPFQSLYNTPDSLHDLWYGDPATVEPWDTYDPCDFFYSLLENEPEIPHSQNTPHN 5873.792872 3.161547422 0.411187695 7.688818179 1.49E-14 3.64E-13
GCTACTGTGATT 1.740767815 4.16E-07 EMC xgb Is known to be LAWTEHPLHDFCSIHNDGFDTSEPIDWDFDEFNDCFTDFDDFLKLLGDNSPSKGGGGGGGGEPLEAFDLFIPESNGDDVTVLDAY 6048.366555 3.859590593 0.489319911 7.88766308 3.08E-15 8.20E-14
GCGGCGACGCAC 1.963052381 6.44E-07 EMC xgb Is known to be TETDGVPGPAIMTLELHPADASYATDALPPPFDLDMLGSDADDNFILDMLSGANLPCLPCLDQQPWMDQCIDFDDFDKFDDFISK 4029.311531 3.377272986 0.42626877 7.922872199 2.32E-15 6.25E-14
CGATCTTTTCGA 2.258605242 7.44E-07 EMC cnn Is known to be ASDPPIPYLPGAENRCPAFLPNTFPDSGFTEFNPLDLNNLQIEDIHPWEEGLGNILSELEEWPKHFDTFECDESMDPMWCPGGGI 1555.363598 3.967149536 0.623695058 6.360719853 2.01E-10 3.53E-09
CCTACAAGATGA 2.223179167 9.55E-07 EMC xgb Is known to be LDNHEGADEPSDEFPLEGFAHHINDYLNYVYEKECPTGPSSASLPDLYDLEFDMPPLDGMPFWPEVNFTSVLGPTAESANPPSSD 1921.043527 3.614663023 0.301099982 12.00485966 3.35E-33 3.86E-31
GAAAGCCCTCTT 1.707842058 1.43E-06 EMC xgb Random arrangement CCNMSQEPIWEDCIWQEARATVHFVAPQQDALSNANLWPDTPYDTEQISFPSLGDTPWDWFEPFPEFLDTTATPPLDIPQFTEPL 3735.639747 2.827623102 0.255103393 11.0842238 1.50E-28 1.24E-26
TCTGGCCTTCAT 1.895517456 1.66E-06 EMC xgb Is known to be GGGGSCIDTEGFPDWLDDAMGQISELPPPSNLESSAPTALVEQMGMELPCLSDLSELLCEHDEFMAGSAQLDDLDLLGQGEFKYP 7153.626361 3.75502671 0.362827957 10.34933124 4.21E-25 2.98E-23
ACCTCACTATCG 1.813800909 2.33E-06 EMC xgb Random arrangement DPFPDMSDPLLDMWICNPDAICHESLSTNQQQGQNFPDPEDNPHDMPFLEFVLNELDFYPICEDVKMQEALYLDDPVANEKNFDH 3495.016113 3.313142576 0.530493141 6.24540134 4.23E-10 7.20E-09
AGAACACGCGGT 2.078129918 3.22E-06 EMC cnn Is known to be RPGSSNGAVAWPPAAEAPMENVPPHLCTPDELDLDFFSGNLGLALDSQLLSEDVLNLNTFLNMEPEDLGFVLDDAAGVCDLSYIL 5930.616091 3.838680335 0.551811911 6.956501403 3.49E-12 7.23E-11
CCAGAGCGCACT 2.117722525 3.86E-06 EMC xgb Is known to be GSLPDVADLEDHEWGGFLSDWEYTGDCPKGLEAWDEKSLGDLFDQYVDGCENDFIEDDGSGMGLVDLLLQMDASSGEDGSMLGGS 2970.72502 5.262293891 0.534158341 9.851561774 6.75E-23 3.64E-21
GAAACTCGAGGA 1.911802186 4.59E-06 EMC Both of them Is known to be ADALDTSGPVENWLNWLPDDACYYPDEPWPPPLESFAVEPTIKEILEESLPADGFEPLCLTPDAEAFDADLTMEFGWLPNMVEGE 3607.494045 3.691619614 0.49832931 7.407992144 1.28E-13 2.98E-12
CAAGGGGCTACT 2.014298187 5.45E-06 EMC xgb Is known to be EEEATALLWGMSPMLLNPDGADTGPGGPPPIQDLDDFNLSLGWLDEQKAVPILEVLCSEDPCWEESAAPGWTLDDFLLNAHGPLD 5525.023769 3.67104978 0.368716217 9.95630138 2.37E-23 1.36E-21
ACCAATTTCCTC 1.84509684 5.51E-06 EMC Both of them Random arrangement IENATWDACECYDCDFLDNLWPYEDSFQWTPTPDVSHNFDNPRDSPIFEELFAGPTFFGDMLCRDLEDLDEYNQEYELPPPHIDL 5595.417127 3.488281304 0.298527427 11.68496085 1.52E-31 1.60E-29
CCAATATTGGAA 1.785236464 6.29E-06 EMC xgb Is known to be AVGSSDSWGLLDMPYQLDVMDHARLVESGALLDLDGFDDFDLTLPTDRDAADDMLSDLIDFKGQQALPPPPMDTYALGEDGDPCF 2537.812213 3.603052718 0.446779993 8.064489847 7.35E-16 2.08E-14
CATTGCACCAAC 1.901821121 7.23E-06 EMC Both of them Random arrangement CDIDSSLWAFEHFLEDDQFATCSNMPVDPLLDMTGDEFSGGDLFDWFDWDIAPCIHPMTYDQPDGPHHPYEWVFPEELTPPWSGP 3219.445846 3.473876631 0.342449125 10.14421232 3.52E-24 2.23E-22
AACGCGCTTCTA 1.837774515 7.36E-06 EMC Both of them Random arrangement VGIVPTWPLENEAQPCEPMAIHESQACPADSTAYNPTPSEWDFDTAINWDVWNYSSLLDFSDENMWGSLYDWDPAPCATCMGDFS 3434.004471 3.884353619 0.339526366 11.44050657 2.62E-30 2.55E-28
AACGGTAAGCTG 1.563895116 8.44E-06 EMC Both of them Is known to be PDQTAQDFWLANDFDFDFFNIEYTDLDWWQWLSPPAGPTDEQQSNEYMASDYFMDLPFSPLSADDYTWATAHYPFNLGDANEQEG 12072.30137 3.193746273 0.345438024 9.245497162 2.34E-20 1.00E-18
CGCTGGAGTGGG 1.818887982 8.75E-06 EMC xgb Is known to be CHDNVCPGPMVEEPPKENLPEHDGLDNLLGMDYYNIDTLLTQPLTDLDPPLCPGAFGDTWQVEEQASEGGLGDWGDFSDIQWTLG 4681.507255 3.887523955 0.248714316 15.63047926 4.51E-55 2.24E-52
CCAGACTCGTCG 1.832910189 9.09E-06 EMC Both of them Random arrangement DGHVDTDAEPPMENWIPMLFDTWWEQCHQLPPNHEDLTTYDPGFNLLDQVLEDDICAQMFDMPWGPEARPSDQFMLMGWPMQETD 6604.85817 3.735657694 0.417514701 8.947368043 3.64E-19 1.36E-17
TCTAACTCAGGG 2.628891059 9.46E-06 EMC Both of them Is known to be EFRHTEPDPESEAQSWDEATNANEDWPLWELTPWTPSPEFTFLMGEDYDLEDLEVHLFSGPFQDIDFMHSEDLLAGEDASSPLNE 1526.057761 2.426025006 0.454585637 5.33678323 9.46E-08 1.24E-06
CGAAAACACCAG 2.358184889 9.63E-06 EMC Both of them Is known to be FEHVPHYLHSGAPRVDMDVLPTLCYDASPGEVHTYALDDFDLNALDFSALWDYDDPAPMDPMDTYILAPFCGDGGSPSWPIPFEG 2126.907921 3.906690322 0.497832154 7.847404577 4.25E-15 1.12E-13
AGGTCGCGATCG 2.798191137 9.91E-06 EMC xgb Is known to be DATWHSLTTIGNDNEGPCNEWDDGLDNLLGMDESDFFDTITQEPRDLDIWLPYIHGVMMPQPTEGEGPGEDPCPGYLDIPLSESQ 971.6006904 3.518934311 0.747656682 4.706617885 2.52E-06 2.79E-05
CTCAACAGCTTC 1.666562722 1.17E-05 EMC xgb Is known to be GGGTDALVDLLSDYIPPLAPSLGNAEPLSEPPTWVTEMLLENMLWLSAATPDQDDPGNIGGYTEEMHGDSICMWSELGFDGGQGT 4116.017498 2.78820114 0.352623126 7.907028587 2.64E-15 7.06E-14
TGTATGTCAGGT 1.411624758 1.17E-05 EMC Both of them Is known to be AQMDHACFWETNGFDTWTDTGADDLPQVSWGSIPATPGATPDGPSHAGILDDILPMYDHLDGCLQDWPSPMEPMDVWAHLWYQLE 10295.74889 3.859525977 0.345169488 11.18153865 5.02E-29 4.37E-27
GAGCAACCTTGA 2.063704342 1.35E-05 EMC xgb Is known to be DDPFNSYAPYEWTVDELPDHPGGQSFATQPHPVLSGETDAMDDDPWSPLMGDVDLFGLLTCSPTPADTPQEALDFLEYPCDLGDG 4575.864432 4.02122563 0.348429397 11.54100562 8.20E-31 8.12E-29
ATTCACTTCGGA 1.88972354 1.54E-05 EMC cnn Is known to be ANKGHTTHLGESHSNDKDAPAVPGLTYPWCTMHSQAGSPSPDIWDELMSPVSLGDGLLSNPIGDDTNHMCWDMLFDFDSYITWDS 2979.313261 4.064341316 0.425829422 9.544529124 1.37E-21 6.58E-20
CTGAGCGGAGTA 2.026204578 2.20E-05 MH cnn Random arrangement AGVCHSDTQISGDGPGIMDVGPDASMQHCEQSPQGPVAGDWEQPTAEFNRMEQEWTGEMHAELMDFLLAWIQCDDLGATDMPWVG 1632.715796 3.570991347 1.457797228 2.449580283 0.014302282 0.075406501
CGTGAACCCCTA 1.558909953 2.27E-05 EMC xgb Is known to be WHNYDEAMPQSWTDPDLKTVLTTLDFWDLSVESFFAPSMDEQTQAPPTDESLGTTGNTPGEDVGMFPRTELWDFLLDDMGEPTSP 9781.280304 2.839488753 0.287713457 9.869155182 5.66E-23 3.08E-21
TCATTTCGATGT 2.314362348 2.29E-05 EMC Both of them Is known to be THGPEHFDTVASGFAIDDFDMGMLFDSAPCWWDCHALGSDQYDLFDLNLGDPSPDAAMYDFAWDTFCLDHEAGMAYQDPAGTDQT 2033.948036 5.571513968 1.937173934 2.876104138 0.004026169 0.025746699
TCTTTCCCCCCG 1.565888767 2.37E-05 EMC xgb Is known to be SADTEPVPAEPGSQCPKWPGEWLMDDWLDVSAFPDFDLDMLGDGLLMETVSEPDPYAPSKELDDMLDNFGNHFMNALRADDWMYG 6520.857242 3.51278242 0.271717002 12.92809206 3.12E-38 5.74E-36
GACAAGGCCTCT 1.55475138 2.65E-05 EMC xgb Is known to be TQIPPVEMPGMGFDCSLDDDLPPLGSLSPDESDWLFGGPHSTHPWWDCGDLCPDYMDPTTGGWGSTDGCADLEQLLQGSMWLENE 7034.610545 4.052621254 0.454234926 8.92186184 4.59E-19 1.68E-17
CTCAGCGAAGCG 1.903295497 2.70E-05 EMC xgb Is known to be PPELLEGMHRTNDNTQTQHDYCGFGYDTSAFDDLFAPDGDWAGTLFEWLDSSHASIQELEDCVPWPYPPFTGNSETDLEMEPIWP 1256.753073 2.772910306 0.480485027 5.771064962 7.88E-09 1.17E-07
TGCCGCGGAACG 1.550552008 2.70E-05 EMC Both of them Random arrangement GGSDSQAEDHGELPQLGYAGSIYDVDDRDLAIIGLNPASFHLCDEFSESNSPAPADCLDPGHEWAEQFLDVWDFVDDMFDFSSGM 6852.212856 3.221406373 0.242709107 13.27270499 3.33E-40 6.74E-38
CCTGACTAAGAG 1.70795474 2.91E-05 EMC cnn Is known to be AIHVAPTGMGDMQEWLYAECQAEEAPWRPVNCLEMALDDFDLDQLGMDFLWDVDLDPAAHSMLPSIANICVSDPFQGGAQGMDGY 3070.144892 2.798586276 0.30983766 9.032427743 1.68E-19 6.58E-18
GTGCGTGATCGG 1.680160079 3.32E-05 MH cnn Random arrangement PGPMSFVEEHDNLPMLHLGIPMDNVVTPPQMGEDHEWPAPNLTEYSEQILEQIASNNTLFDEQFSELLETWDLYDNSEPLGDPFV 3645.562259 4.165582067 0.523691177 7.954272001 1.80E-15 4.93E-14
GCTTTGTCACAG 1.773945171 3.87E-05 EMC xgb Random arrangement DDIVPPHTPMTFCPYWMPWEDSSGPGCVSDQCIIGDPLPVAAMQDGIDEDLSFLGMMYDEDWECPPFCVNLLSSMDQDSLPPWDM 3436.428329 3.020404911 0.38314662 7.883156866 3.19E-15 8.46E-14
AGTTTATTCTGG 1.56091292 4.00E-05 EMC Both of them Is known to be PAPQPAYFAKMIDGQLNEDPLWDFMDFDLDIPWDSFFHGAASPIDLEDYSPPPHDQDWQSLDASLSMEGVSQLDDFDLQDMYTVV 6508.593345 3.401720274 0.481398143 7.066334428 1.59E-12 3.37E-11
ACTACCTAAGTG 1.610510245 5.77E-05 EMC Both of them Is known to be GLIDTTVDIPGSPQIDEMFYEHNFEPEHVSNWGWQPGQVLQQLLGESSNLPSLEVHDAGQDLPTLMEDLGYLGLLPPLEYDEDFC 4208.271112 3.033807383 0.362785508 8.362537411 6.14E-17 1.90E-15
TCTTGCAGTGAC 1.388775892 6.72E-05 EMC cnn Is known to be GQGGTGIPMTFGNMSQTDLSIGGANDHDHTEPPSTANPGCWMSTWFPDVDSILQSDWPLVDMFQPVPDDEVHDPSLQTHEFDEWW 6647.544681 2.642347074 0.297827207 8.872080892 7.18E-19 2.58E-17
CGCTCCTGTGTT 1.424430879 7.08E-05 EMC xgb Is known to be GPTSDPGPKSEYLDVFDLFDLDMLGSDDEEGADWDMQAPKVNEDLSLDMLATSEDDCLPPVQDPSDDLCQYNNWMGLGSPMGSLS 5753.615376 2.631191121 0.289519825 9.088120727 1.01E-19 4.09E-18
CGTCCACGAGAA 1.986976204 7.82E-05 EMC xgb Is known to be GGGAERFTQPETPDGPYMDLASWLQGLGQEFPLAPPMPLLPGLLEEWIDIEIPDTSGSGDNMWEEVHFCSPPLDNSFDLWGPPLD 1300.566011 2.873242445 0.768330608 3.739591284 0.00018432 0.001594221
CCTCCGCCGCGC 1.496304604 8.09E-05 EMC cnn Is known to be GGGDLGPPISFTPTLEEIEDYTTNFNVPIDDHPDAFLPTVWDLAAVWEDIDLITGPEEELSWEPYPWQIWGDGGGESSSSPPGFK 6065.915292 2.986526468 0.246894762 12.09635408 1.10E-33 1.30E-31
GAGCACCATCCA 1.770117122 8.98E-05 EMC xgb Is known to be TDPVVPCASGPQNQCSTPTHQCESGHIEDDSQCMDMFDGWLEMDIINELESALRDADWLDDEWTLPVIPAGLDDLCGLGDPGDFQ 2171.113064 4.862623763 2.04822165 2.37407107 0.017593164 0.089888371
ATTCTCGAATCA 1.89451365 9.51E-05 EMC xgb Random arrangement EPDFDGFIEHALNFLNSMDTNLAADYEHPQIEPDTNAQLTAATQPAFHAILPDFYWDAPGECFSDFDWLMSITEHNNFESDGMVN 2598.07551 4.135865313 0.406564698 10.17271135 2.62E-24 1.69E-22
CATGGATATAAG 1.621173747 0.000100345 EMC xgb Is known to be AESSNTTPGPLAGSTFIFDSYCIGWPQYMPNVPPEDLRESQFPQYSEEFQDLDELICDQDLGAEPWTWPWSPTYDALLSDIREDW 2768.487444 3.03666882 0.499391675 6.080735769 1.20E-09 1.94E-08
CGATGTACAGGT 1.956041694 0.000107179 EMC cnn Is known to be YGGQTNTTTQAQIQELGFPMEPQRPPVAENISLDSGNQLVHYTLNSLGLDDPWGYNEGARDEPIHYYLDWGAYEMDLDDFLDWSG 3889.355596 2.921636635 0.32377667 9.023616903 1.82E-19 6.99E-18
GTCTTTTCGCTA 2.214140505 0.000122125 EMC xgb Is known to be HYDMSAEEPMEIIPNFDKEFDELFPCFPFHEGDFAIDVALDACQDLFWLDPPDGPWQPPLASQEWLQEDFAWPHGARSPGQVTQE 1177.804586 3.959176575 0.431830783 9.168351889 4.80E-20 2.02E-18
AGCTCAGGCCTA 1.825671902 0.000126691 EMC Both of them Is known to be LDTGEQQPPTQESLQGWFNPPPWTNQIPFFDALDDFDLDMLSNADSPGPSFIDVHPLWDALPWPMCKPPVGVGQDVPWGDFPDFL 3067.451939 3.775756643 0.453104285 8.333085264 7.88E-17 2.39E-15
AGAGACCGTTGT 1.467695509 0.00013304 EMC Both of them Is known to be QPAQTYQETDKQFDNVGPGIRPRAGALQPGPEAITTEMLCMEDMTQLSLGDYAELCLPEGDSSDMAELSMLMDWWLSGDADNTFG 5671.127841 2.49485807 0.416711201 5.987019455 2.14E-09 3.35E-08
AAACTGTGGCGC 1.325417524 0.000133481 EMC xgb Is known to be PQLSEYDGPLAASQDSAVSIPDMVLLPATYDEQWNTDLSFDYNDLQELDDLGDLCDLHFQLLPPMDDTEGELWRCFETEFGMSMG 7418.118402 2.852774502 0.385627616 7.397744313 1.39E-13 3.21E-12
AATTTGTGTCTG 1.8751855 0.000134213 EMC cnn Is known to be PDDCCWGDDIPFDVSQVPSLEATTECLGLSQGDFEGLLDLPDPIALPDESGPVADWMSFLNGCADLEPLHTGGMGQSSNLEDPYA 3129.552826 3.89429416 1.452141107 2.681760156 0.007323595 0.043624681
CTTAAGGTATTC 1.633637673 0.000140831 EMC xgb Random arrangement ELGASTPEMDPVSANHAADVMQERSIPDNTCNMCPQADSFLRQWLGDLADPFLEEFAQSFNPDDWLPQEAWDWHLCPGQYGENNG 5049.950134 3.609228936 0.318590719 11.3287322 9.46E-30 8.84E-28
GATATGGGGCTA 1.390468858 0.000153842 EMC xgb Is known to be GGSEPLNEINENWLNFPLCAEFSTSDEHMLLDPLYALDDFDLTLMGIDDLAPTYPAISDTHRIENTNHLYHQPQTPCHICGLGWC 5462.468111 2.949783165 0.381514571 7.731770657 1.06E-14 2.64E-13
GGATACGCACTA 1.871554369 0.000165393 MH cnn Random arrangement SVFYDLTPCMIMMMAMSLRMEFGMDFETETWWCPEQIKPPPWTLRPLNSSICGYHCHADANWYKPHIGHFDYETSSQIWMDTEKF 487.9275826 -1.11453848 0.534105974 -2.086736591 0.036911952 0.15990877
AGATATGTGGGA 1.717663604 0.000189751 EMC cnn Is known to be EDPWACDLEWLMEFINADAWGHYVDLATLDEMLDPGMLDDLWLDKWAVAASTDVVGHIDRPADPGSSQMGMHDGCMHNSANDWEQ 2840.659252 3.061593616 0.374501644 8.175113958 2.96E-16 8.72E-15
GTCACTTCGCCA 2.715535677 0.000192989 EMC Both of them Random arrangement DDWASSVLECVLSDAEDCTHGAPYSQCETQSHWAVGNHLAPPDMLECIIEEAPPLWAPMCSGGDEDLWGKALFGEWDFNNDNSST 800.2249121 3.469960705 0.614550253 5.646341673 1.64E-08 2.39E-07
TTGACGATCTGT 2.733741002 0.000213069 EMC Both of them Random arrangement AFSGADWDNLDEFLCTCHFNEGRDVADHAQADPLPEISEPTSTLVPLQFNMQEDNANPPPIYTFNAPDLMAFDFDLDDWICEQPL 660.1667638 4.998215248 0.894400445 5.58834164 2.29E-08 3.28E-07
AATGTTGCGTAA 3.48286852 0.000227314 EMC xgb Is known to be DLMVYELGFPPEDDTPAPPMSGPETGPTPSNMDMEFEIPPISLYSDWDELMSAEVNDLLRLSPPSEQPTVDLYQESAPQYSPVSH 319.1642388 6.826853115 1.156354691 5.903770846 3.55E-09 5.43E-08
TCATATTATGAA 1.418730781 0.000228602 EMC Both of them Random arrangement PPRDGSDGEDINLLMENMEECMDWSTPLRFPNSWDVDEMDLPLQLDSYDPEGFDITPYPEAFASWEEEPPDSNGWDPQFHVWYPS 3940.766957 3.414590734 0.343544046 9.939309897 2.81E-23 1.58E-21
GCCGAGAATGAG 1.72190481 0.000237397 EMC Both of them Is known to be IHPDAIDDWNQEETWNNVQPCSLEYDFDFDEFNTETTDFDDFLSLLLDNSPSKCHKPPELCGVFEIPENWRHGGYKAEGVDIPFD 2418.857757 3.123365755 0.393883679 7.929665333 2.20E-15 5.98E-14
AGTCTCAGTCTT 1.477207925 0.000273402 EMC cnn Is known to be SQARDGGAESDMMDAFDIIDLDWLGLLDDCTAPESHNMMFPDDDALMAEAGKVAANGATPHGEPYEMGPMMTLDVVLGSLDGLFA 7894.846016 3.066362091 0.379059676 8.089391425 6.00E-16 1.72E-14
AAGATGTGCAGT 1.39170327 0.000277108 EMC cnn Is known to be LEFLDPTMHMCEYGDQPLLEIVLDDLPPHDIGTFSSWEDFVQDFFDSLGPLDDLGMSIDEPSNWISQSNSEETQNEDTEDPLMSF 6239.195896 3.363275966 0.402757658 8.350619539 6.79E-17 2.09E-15
CTAATCTACCTC 1.419099967 0.000277108 EMC xgb Random arrangement PIVGPMCSAFDCFCDAIYEMTFDDPATNHNSPARLPPMDESLLNPYECDWAEFLLEDTPCGCETMGWTCDNMLAHQENQPNQHDP 4740.187145 3.023531892 0.354129126 8.537936213 1.37E-17 4.48E-16
GGTCTGTTACTG 1.434289926 0.000278986 EMC Both of them Is known to be AYAGLPIVSDEPGMDCIVPLSLAVRDTPEMMWSPPPPAYTAAANDSEIALMDQLPAMDEGLADEPLMDLDVVMWLDDLGFGPWEL 6126.177566 3.520280659 0.317773716 11.07794785 1.61E-28 1.30E-26
AAGTTCTTATGG 1.676776333 0.000295765 EMC Both of them Is known to be ETTCLPINPHDGDSVEETVTEILEPILSVGECVTLPPAPTTLSKDGTHEFLEDWGWPGIESIPPEWDDYNFVWSDVDQTLDFGYP 2831.870088 3.623003274 0.451763712 8.019686356 1.06E-15 2.97E-14
CCCGCACAATGT 1.866878787 0.000317008 EMC Both of them Is known to be MEYTSPWNPAQWLETNVTLDDMWVRFPAHDEILEDFHLHMFADDNFEGDLFQDLLDVPSELWTWLDMEEETTIPLPWSPPESGSS 1203.799202 1.271448349 0.57433101 2.213790177 0.026843216 0.124915473
TGCACTTGACCC 1.802452149 0.000395747 EMC cnn Is known to be DCELYEGFPPYPAGFNVVPPEELTIDNDFDEFFASATEFDDFSFDHALMTAGATQDSSEKMCRDFQDIMEELDILDPEWTCIPFM 3445.792073 3.814494869 0.384168714 9.929217891 3.11E-23 1.73E-21
GGGGATAACCCT 2.120371842 0.000409673 EMC Both of them Random arrangement PVPGFDMSPEQHGMEFPTTDFDFEWSEDTLWGWVTTPTGSETSFWSQPDCLFDFWSLCGQPNPETMPELPDHAVSDPQQESNDMD 4102.657829 3.716203078 0.572892902 6.486732621 8.77E-11 1.60E-09
TACCAAACTCTA 1.502655786 0.000426879 EMC xgb Is known to be AQEPADTPNHAFLPETSDLLPLMPDPWFPAWAPGSSPLMSLEQLMESWDDLELMDADFATSTGMHMSPSESGPSEAKNGEHGATT 1991.060262 4.487862313 0.566069867 7.928106708 2.23E-15 6.03E-14
AGAGTATCGGTT 1.885417237 0.000459126 EMC Both of them Random arrangement MNEGDREDPDEFAQIVGGSLDNTLDVDLFQHYIQSLLWPGNGDPLDPTPPTESPLFPPMYEPWVQYDFQPFSEYDAPELEGNFDP 1367.410173 2.979034115 0.415636024 7.167410763 7.64E-13 1.68E-11
GCAACGAAGGGG 2.119319042 0.000466378 EMC xgb Is known to be QAAPLPQIPFLENLDQPEAPQGLPLGAFPSMAPSTMEDACDYTSEGSGPNLCDLSELLGGPFWEADDEKEGMFEDFLPLHLGSTG 619.2735377 0.916067626 1.187632465 0.771339327 0.440505828 0.724816362
CCAGAGCGGTAC 1.510895283 0.00047889 EMC xgb Random arrangement IYKPQETDPTQESYIFYNQDTSSWTEPFIDGFIPQDSNAHDLGGFMLDMMPALTSDMSRDEWEFPIWLDPLAPQNMHHEWEFFDW 2666.07127 1.337007681 0.377750236 3.539396022 0.000401044 0.003231826
CTCAGTCGGCCT 1.504905037 0.000490978 EMC cnn Is known to be CQNDIASQFNLMDNVTLWNEDPPHSAQALWGSPMNESLMTPEMFWELLCDDFAYAATPEDQTMENSYASPGDPLDWFAHLETWDG 5633.585021 3.026941935 0.316033973 9.57790047 9.90E-22 4.91E-20
GCCCACCGAACA 2.138085549 0.000490978 EMC cnn Is known to be RPNDPPQQMQMPIDEQNEDPNVEDCACNPMHDDGLTVDFRFDLYGANLPNEDLVPPPLGLPLTDLEWNEEFLVDLVDFLGEGNLS 2086.0966 2.48479893 0.459097767 5.412352459 6.22E-08 8.33E-07
CAAGCATGGCAC 1.42269117 0.000565062 EMC Both of them Is known to be GIGGSGGGGSGGVDALDDFDLEYVFDMDLYCEASEMISMHDLFGSDVDGFETEANLEAFYGNFCYEPTNHAGDPIDPMCHGWTSQ 5017.966702 2.475926883 0.240420651 10.29831205 7.17E-25 4.94E-23
TCGTAAATATGC 1.74202649 0.000565062 EMC Both of them Random arrangement TQWCMYPPDENQIQPMLANEEHGQPHDDSSADVARQPCVLKTAWEGAMCLDETLEDLDLADFDLAIDTWACDAPITYDFSVNSPW 6447.832416 3.459934707 0.702778742 4.923220497 8.51E-07 1.00E-05
CGGTTGGGGCCG 1.503441525 0.00057571 EMC xgb Is known to be DGPYMPTPQSSGTLSKLWADEDYDADQPMPPPHDDYIVWESVPPGLELPGLNDLWMDELFFDASDFDWPSAALDVDDFDKLWFEH 2682.021982 2.486479024 0.4228392 5.8804364 4.09E-09 6.24E-08
ACGACGGTGTTC 1.651797702 0.000648535 EMC xgb Is known to be MAENSAYWPQIEGLDEQCFFNDTALEGDGDLPALSFLSMPEGLPSDFPPTFNTPEILDTFLNAWPSFTWSDCLDDQPWPDTGDGE 2779.909388 3.628263174 0.612072176 5.927835497 3.07E-09 4.74E-08
TTGTTCACACCG 1.989658257 0.00065162 EMC Both of them Random arrangement DWSGFSPFDEYSDWFIGIAEDDHGSPQVDWDTPTAMPTYVAPHSPDTQFDMDTQQADNWPPHWNVDFDYMLHAYGMGTTFPQLED 1736.332111 3.719430371 0.507150421 7.333978675 2.23E-13 5.08E-12
CTTTGGTTGTTC 1.633864723 0.000655269 EMC cnn Is known to be SNVMPLHDDFDITDISVECPGIHDTFLTTESGDTFWSAPEFSEFSTNILSLEDPYLDSLDLASPDLEPYWPGALIQQDEGMIDIV 2571.101089 2.493925143 0.350193898 7.121555108 1.07E-12 2.29E-11
CTACTCCAACCA 1.30030528 0.000686799 EMC Both of them Is known to be YPQDVGEAEADSVIDFQLGDCSEHWGFALPGPPPAPQHALLSYNGQNQIELDELLRQMFDYPDVNLLEEPIPPNLCCNSLLDTDL 5832.442028 2.516945674 0.300200075 8.384227335 5.11E-17 1.59E-15
CAAGTTCCACCG 1.336712612 0.000688962 EMC Both of them Is known to be QAAQEAAGPGTDAVEQWHEEPCDLGPFASLDDPNMGPGGVQMEPLWQTPGLDDDMEQWLGPIGTYCPEQADLMLGGLDEISISDW 4934.087442 3.229188743 0.353813818 9.126802231 7.06E-20 2.94E-18
AGTTCCGTATCA 1.550650617 0.000869141 EMC xgb Is known to be ASMPPTPEAQGPQLFEDLAVYGQSDERALLDQLHTLLSNTDGEGPKTCTHALEFGRELTGMQSCQSDDGWWDPDIMLGLDFGQED 5861.186403 2.782148754 0.590510007 4.711433708 2.46E-06 2.73E-05
CTACCTTTCGGT 1.219762145 0.000892994 EMC cnn Random arrangement ERTPGFPHTPLPEELDVDCWHEYVPEGFPQGHHAANQPFFHPLAEENYTDMDNCNMGLDFWGESLQHIEPEFWSISPWDFNDDEI 6334.314357 2.726797074 0.367868446 7.412424479 1.24E-13 2.91E-12
AGATTCGCTCAT 1.303123863 0.00091372 EMC Both of them Random arrangement SQPQVTAPHDFSPQIMPLPWWAPSFEQIMGPEPHLDMTPPSEATFFTTTMGDPHDQENWYRELTLEVSNWASMNLDDLDAAFLDD 4713.944467 2.653912881 0.294047414 9.025458993 1.79E-19 6.93E-18
ATGCTAATATAT 1.619607505 0.00091372 EMC xgb Is known to be GPDEDISDGQLIPGDPVAMIQPMDGTLEHWGLDLDTSMGSLSFDLHSPAEEDIFGAWGLGGGLDNPISTQDCDGTGPWGGPWQQQ 2786.63416 4.251761789 0.444024869 9.575503735 1.01E-21 4.97E-20
TTGGTGCTCCTA 1.74395297 0.000950712 EMC cnn Is known to be FSNSNLQIPDWAFETCNDNYVWEQSYFPDPLGLPSTLSALPGETLLEWEELVLTPEPAHSSFYWEGHTSGTGAGIGGGEGPVDQA 904.9740884 2.478949843 0.610761835 4.058783147 4.93E-05 0.000470144
GGCTTGTTTCTA 1.441193887 0.000997992 EMC xgb Is known to be YTPPHGQWMDLDYFFAGLPETETFFDFLEPQEASEEISISSQSIDELMGNDVFFAQPGCDPDVDPYPFNLDMGPVVSDHGGSSAQ 4417.208309 2.9152646 0.377686778 7.718736177 1.17E-14 2.90E-13
GAGTGAGCGTCT 1.855284054 0.001058625 EMC cnn Is known to be FDWHSQNAMSWEPGQNDNALFAPWPSNDNLPPPEYGLIYPSPIAQLDELLSHDVPMPADKLAELADVYNFGQDDLVGVEDLFMFM 1244.189074 3.052705039 0.681592887 4.47878066 7.51E-06 7.85E-05
TGGGTTGCCATG 1.317004679 0.0010876 EMC xgb Is known to be DVYNTWPAASCQESIKPIPPESSPGLVYDFDSVFNSFEDNFQDFFNSLECLTEFMHDAGPSATGCRIGPTDGGGGVFSTTGDWHD 4389.204355 2.63183759 0.265199015 9.924009679 3.27E-23 1.80E-21
CTACCCGGGATG 1.362642782 0.001090398 EMC xgb Is known to be PPNVNHAVVVHANTFNDVYDVCETAPEPFHPIPPMFDAFQDLLGMNLDDCMSLVPEFDQLPLTPSAVMDSFHDETHGDVLWESIG 2978.579574 2.472216011 0.479772405 5.152893305 2.56E-07 3.19E-06
GGTACGCTCGGC 1.264249642 0.001090398 MH cnn Random arrangement KTAPEGTEFTIPHPQEVEAEQCFVQPHVRDPWTPSLETKNADKDTEFMTCISTEPSMDQITNPLEDVQDDDMFYHWMDAWCDNVF 6523.173616 3.04025623 0.361593197 8.40794644 4.17E-17 1.32E-15
GGTATATGTATC 1.326587566 0.001118002 MH xgb Random arrangement HQPCWVGDGNCPETPPAPLMMGGDVDMEPLWDGDFEPFSFDEPCTWEPSNIIMELDELPFTHWDNDNNCFKPCVNWEFFAEFNNE 5531.146059 1.872695943 0.277018218 6.760190567 1.38E-11 2.73E-10
CCCACAATAAAT 1.637315472 0.001169298 EMC Both of them Is known to be MDAAEFPLAGTSSCNPDNCDGLCHQEYHQSDETPPFVERPSDDMESIILTDLMDVDDMDFTAFPDGLDYIADGDWGLWVVPGFFE 2446.155676 3.657706453 0.552620774 6.61883633 3.62E-11 6.80E-10
GATCCCAGCTAC 1.445186936 0.001198058 EMC xgb Is known to be HETITVSSISMEGPSGDSLTAEDDSDYAPMPSFLIGGHTEYPTLFGDDDHAPWAEMRVCASLDAMDFADPQLFMDWLAIPDFGSE 2958.860308 2.190439493 0.368914943 5.937519022 2.89E-09 4.48E-08
AGTGAAGAAGGG 1.408072357 0.001247189 EMC Both of them Is known to be GCITPWLSPRPAEVEWPEIPFSEEFILDILGQPHPPTPFGTPSLHDLYMVEFDTPVNDPNTSSIDDDHLELLGMKMNGGAGGQSG 1748.61217 1.694311602 0.452706958 3.742623284 0.000182109 0.001580618
ACGTCATCATTT 1.660448128 0.00133078 MH cnn Random arrangement WDFDWVGDSIMDIIPIEDMFDEVAPRNSCATWLAETCALSNIDTETLLDMTAGEDLCPSNDAGILSNLDEMEANTVWQWPPYQDE 2184.165608 3.474992893 0.53955168 6.440519084 1.19E-10 2.13E-09
AGGGCTCCCCCT 1.722086381 0.001460876 EMC Both of them Is known to be DWCTLINMSPTPFYDGPNLVEDYGSLQDPEETYQTPFWPEVLSGDFTEESLEDLALMGMPAMPNLDCLQDVDCEQPDWGDLGELP 1540.276386 2.902239373 0.629138427 4.613037841 3.97E-06 4.28E-05
TCCCGTTTTTAT 1.232146978 0.001460876 EMC cnn Is known to be GRGVSTCCSIIPGLYDETTNHWADVLMAPLNALDDFDLDFLSALPSVDSMEPQSWSDLLDLDLMAWIGSMDPPECGRPADEKTSQ 4825.288799 4.241293911 0.325477294 13.03099782 8.15E-39 1.55E-36
CCGCGTAATATC 1.455545343 0.00157129 EMC Both of them Is known to be DWEAHCFERFPPPPFNAMWDHGDVQPNPEPTDPSADLDGFDLTSLDCMLHDSIWDTPMIEFMNELEPGNRENPTCALGYEHTHKM 3828.059483 2.756195775 0.396552221 6.950398027 3.64E-12 7.52E-11
TATAATGGGAGG 1.794749727 0.001609759 EMC cnn Is known to be DHFLAAFVQPEPNEYIPIDDGGQFVDDPPPPEPLSFATGGPWNIDEFISSDVFNLEHFLAEGGSPWMMPDCALMDDDLTDLSDHR 1799.28851 3.524148519 0.49650177 7.09795762 1.27E-12 2.70E-11
GTGTCCCCATTC 1.286267461 0.001655036 EMC Both of them Is known to be LSHAPTMDEMPSIGTVMFDDAVEEPGSHCTDMEYDLMGLLSILLEESSEPMSQLGLFTGGTTINPLAQLDEYLMSTLENDASPGN 7877.4285 3.853848837 0.275004971 14.01374243 1.28E-44 3.98E-42
TCCCTGCCTCCT 1.134230603 0.001704887 EMC xgb Is known to be PEEFPWWSSFEPLELEVPEIDDVTLPWPMTNLGVSTIFPQDVYLTDLHFMLDLMGNTDHGPEMDTAMDAMCDQDVQEPDGWPATP 5860.41339 2.175256706 0.406465964 5.351633096 8.72E-08 1.16E-06
TTGTACGATTTT 1.244252005 0.0017694 EMC Both of them Is known to be PTNPSSNPGAHIIATGSEWLDWSWGDNMGDPLEHGPTPCGTPSLIDLWGMEYDVPTHDNSEGFCNDLFGVPDDFWQQQFQWCQEY 5501.425104 3.013968307 0.298245098 10.10567592 5.21E-24 3.23E-22
CTTATTTTACCC 1.463490911 0.001776126 EMC cnn Is known to be GQMIIDTWDVPSLDSPNIPLLQFEDLGTLDEFLAWGPCMLNDEFWTDLFMNDSYPALLGGGELDRDGNHMSGDICMHGGRHPGSD 2574.033948 2.860771614 0.316617747 9.035411453 1.63E-19 6.48E-18
GTAAATTTGCAG 1.630434296 0.001867301 EMC xgb Is known to be GMTDSELSDEPALWGPGSQVSPPPPHFDAHDLMLDYQDLFDLDCLGRLGLDELDSNMLFSESGGGINWPGQHQVIDGCGEIGPSQ 2771.015608 3.505506205 0.568714626 6.163910769 7.10E-10 1.17E-08
TAATCCTGTTCG 1.302654628 0.001867301 EMC xgb Random arrangement EMWPFPEANDEESLVEFWPWRDPSPNEPFNPCNTDSFGDMCEALDLDPLPAYVCNMWNNDVDTDWNPDISLAPWVDAELEIDETF 3262.98615 3.331570329 0.27180499 12.25720813 1.54E-34 2.00E-32
TAATCTAAGCAT 1.587707452 0.001950146 EMC cnn Is known to be VEWGSSSGGTWDPPHDWVDPIQHMWDCPPADLDGMFIERLFHDELWSDSLMPAPSNMMGYDFDAYEAEHGITDPWDPTDIGNDME 1914.479347 2.78681023 0.407975881 6.830821043 8.44E-12 1.70E-10
CTTTAATTACGC 2.029869614 0.001982389 EMC xgb Is known to be DDISMPGSHVDECNGDSVDSPTNPIPFDPDMANWETAIDDTSDEKLLDNLPSMSGTGPMWGLTLSNLQGEDICTWEEGLGHWCFE 1121.511367 3.82427212 0.438116358 8.728895991 2.57E-18 8.85E-17
GCAAAATATGGG 1.736157518 0.001982389 EMC xgb Is known to be DTFCPTPSAMGPGLSEDLTPGCYQDALPMLDHILGCPAVSDLLGENPKDEGMGIFELLPESWNPSLEQVDLDGYLDTWELGGGEG 1695.491372 3.973423731 0.699716047 5.678623133 1.36E-08 1.99E-07
GCCTCCGTTCTA 1.507270079 0.001997295 EMC Both of them Is known to be TDDGLELDDFLQPIVYDFLAETNDFDAPPQDYGAGHMLDGSDLEMLTSLLDDCHFMGPWIDDAFCFMPSPEPQDMTQDDFALDRQ 5708.506798 3.721184199 0.365092137 10.19245233 2.14E-24 1.40E-22
AACTTGGGACGG 1.73431518 0.002083903 EMC cnn Is known to be ENNAAPGHAQSPIAAVNPLDLPDPLGDHWFVWNNDSDAFWDILGEILSDIPSLGHNDMDNWFDSLAEQPSTPGHWTWHPEPPGFI 1566.681268 2.962139305 0.496217698 5.969435025 2.38E-09 3.72E-08
AATCATTCCATT 1.284729673 0.002164959 EMC xgb Is known to be EDDCYPPSHSQPPVDAVWVYGPQSAPIDENDEAVSDRADCNVWSPSPGPEGLGNILSELQSLDESGNFFTDMFQSDTWLDLPHML 4254.174998 3.190295875 0.395297121 8.070627657 6.99E-16 1.99E-14
GCGGGTCCGGCG 1.649396782 0.002283092 EMC cnn Is known to be QESMDHVIGTPFGTPSPHEPYDPEIDVPEGDPWELAHCDLFHDIDLLADYMVSMIGDEQLASKGVNNHGWGMQGEACSWNNHEQQ 1724.105464 2.465605702 0.504149061 4.890628376 1.01E-06 1.18E-05
TATTGCGAATGA 1.548101263 0.002321803 EMC Both of them Is known to be SSAFPEPWTNLGLLEDLLPSEYWGFCKAESTAIQGSLLDTSEDFLAMLPELGDLGSLLYETAHQQVDAVPDEGADSTQGGGQDVH 2647.452879 2.562959965 0.380888505 6.728898179 1.71E-11 3.34E-10
CTATTATGTCCA 1.412306436 0.002332137 EMC cnn Is known to be WEWFPLDPQAACPSDPNEHMEAQEHLTFQCDHTVPFCPFGTHSLHDLFDLDVDVPEMDNMLQEFWQDELAMDANDHDEQAAPAWE 2778.200845 3.144909607 0.563121841 5.584776467 2.34E-08 3.34E-07
TGCCCTAACAAT 1.224292292 0.002382444 EMC cnn Is known to be LPGEKRQKTDAPPPYIFFEEGPCHYNEPCWDMDLMSPDTVEWMMDIPDLSPAMAPMNHDMQEYWDELFPLFDIGDTECVGDLWDA 4231.105437 3.692277701 0.341528882 10.81102622 3.05E-27 2.36E-25
GAAGAACGGCTA 1.246884588 0.0025028 EMC Both of them Random arrangement PSDVLQPAEEMFDDMLWCLPNEINEWWPHEPGWPMGGISPWHALHGENEDVNVEYLQIYEEDCATANMMDSTDVNQDMDIPQPQW 4302.908545 2.752842865 0.445219963 6.183107438 6.29E-10 1.05E-08
TTATGTATGCAC 1.588943144 0.00263566 EMC Both of them Is known to be DWRTDFYCVTDDIDIESVMSGLGLELAEFQACAWAASPPFQPVQECYPGRDMGEEFNLDAPLSDGNNWELDDLLNIDEFASLPDD 2552.337876 3.765321422 0.623288571 6.04105642 1.53E-09 2.45E-08
GACCTTATTCGA 1.906039541 0.002673618 EMC Both of them Is known to be PDMNDSMGEACPPDTGPSILVPRPDTPDILGSPHPFFDPAIYGWSTMELFDPLFPELHEIGGLFEHGGSRDCTTYTEGGGRPGGD 1335.657405 1.900409962 0.440770727 4.311561189 1.62E-05 0.000163292
TTCTTCGTGGCG 1.150708474 0.002673618 EMC cnn Is known to be PDAGYSQVPMPPEGGEDHEYLEESPYMIPDYHYEGIYLDNFDNFAEDWFWPGIFDELLCVHGPPVGPMLPTTGNSFPPDIDGSQE 6191.365346 2.557105031 0.485264266 5.269510262 1.37E-07 1.77E-06
TGGCTCACATTT 1.204488061 0.00276662 EMC Both of them Is known to be HVQGADRMWDIPEWLQGMTSEHWYVVDATEPMESSQPGPTWGGSELEDWDLGGLTSLLEMGEYEPAHSPIPDTWSLDSGYNFIGQ 5349.778951 2.679110586 0.398826814 6.717478589 1.85E-11 3.55E-10
CCGAAAGAGTTT 1.228617297 0.002798046 EMC xgb Is known to be TEDDSPSQSTIEPALQDILLHPIITLDDDCLQNMDHAPGDFLDPFDRVDALPPPDPDLLEFGKAWGQLDDLGILLPTTSMHWNYF 4123.557293 2.220291542 0.315595602 7.035242342 1.99E-12 4.18E-11
GAGTCGAACGCT 1.532353989 0.002854478 EMC xgb Is known to be SPIGAPTLMFPDDDNCIEIANWCYSDLGGLPWFEPRTPFGTPSLQELIDMENDVEPPSQIAQACPDSCEGMDSNIMLEPSAWNSG 1424.123524 3.684632733 0.636383294 5.789958292 7.04E-09 1.05E-07
TGTGATAATGTC 1.246102183 0.002854478 EMC xgb Is known to be DDLEHLEGPCPPGTPEAPLPATYGRVADLWNNHSSTPSPWLLDFELNDFFLGSSPSAKETSEPVDFLPADYLPDFDLSPTMDETQ 8268.74593 3.752845527 0.272266356 13.78372848 3.19E-43 7.54E-41
ACTCAGACGCGT 1.976739229 0.002878942 MH cnn Random arrangement SMMDVMDPIMDIVMGSSQGTGYSCDVYQHPQPPNPEVAPVTEPMELGNNPFSEAWVTATWDEEVHHWVNGLNDPCDFDMCHWWVD 2534.465185 4.745649309 0.52240857 9.084172001 1.04E-19 4.21E-18
GAACCTAAAGCG 1.686149433 0.00325843 EMC xgb Is known to be DDDVNRGPWYIPDMFQGGLEDLYHLGPPEPSWTSTESWLGDLFDVEVDGNAWGASQFAGDNWLPEEALLEAYCEDIESQFCRKIQ 1317.386009 2.547775373 0.353204212 7.213321046 5.46E-13 1.22E-11
CCTCCAACCCTG 1.319682614 0.003272352 EMC xgb Is known to be DETTGGFDDMLIEPPISPVLLPTDSSCQLGDLLRPPTPITEDLLHNLFGLDVFPPESHYMLWDDFWVNDEFINRADPGGAKPFTG 2226.100627 2.619286117 0.42953265 6.097990734 1.07E-09 1.75E-08
AGCATGGTGGGT 1.765289187 0.003450569 EMC cnn Random arrangement GDQSPEALLPEGGDDPMWPVVSELCQPQTYHYETADVDLQPDMWISTTNIAKTVEGNEDATPMNFYQQEEDIIWAWLYDAMMCAC 1150.798879 1.987562997 0.55610087 3.574105174 0.000351428 0.002907639
AGGGAAAGATAA 2.141619099 0.003554675 MH cnn Random arrangement EEDHVKYDIWSIQNAFHEIHNMSNTTGQGLVWWFHRHETGLWHLKLMACYIQAAYVKTSPDVWPFNQYGRPRADDINNFMCFIKG 255.1645377 0.304045755 1.297869855 0.234265211 0.814779092 0.924037799
GCACACTGGCCA 1.343849325 0.003558755 EMC Both of them Is known to be TMDDFMPDHINDPPKDYDAADPVLDNETLPWALDHCDLDDLWATIWTDVYTSQPNLSLSHLPADEETPNDTPGTWEMPLFAQSPM 3354.161338 3.404903166 0.373682889 9.111744914 8.11E-20 3.35E-18
GCACTGCTAAAA 1.512329133 0.003589647 EMC xgb Is known to be YTPDTNMGPLMSAPGDDSSTSEHTGGRTCGGLVSGAKGMSRLHTAPPTLDLGDLFELLCETANPQGPMQSLDDFDLQGLLDGDVM 1781.676223 2.426119326 0.431084024 5.627949982 1.82E-08 2.64E-07
ACTTTTCAAAAC 1.461221901 0.003660278 EMC Both of them Is known to be ALLDDTQPPPLHSYALDGDYQGGHFPSAIDTFNSSVSSVGSEIDDWLLSDLGDLSELLFETACSEDWLESMYGGWGDVLPDQALL 3501.3409 3.362827872 0.378407854 8.886781375 6.29E-19 2.29E-17
CCCTTGTACTAC 1.461174459 0.00371128 EMC Both of them Random arrangement AFDSQQGPHDQAGLSMHTGIPNDVLPDVWLEMIAETIGQGIQYPCHDSTEMYNLPSPEDISSNDGMCLEDWWDEIMGFTPDFFPY 3360.348742 3.298042181 0.405944387 8.124369457 4.50E-16 1.32E-14
ACATCACTGCTG 1.173331243 0.003770324 EMC xgb Is known to be FPNDPWDTSLPIDYDFDYFNAIEVPDPDFLKLDSPEMSSLDLFAVECDQWELDHNMNAVPRMFPEHPSLEAEEGSMAAPTTGCGT 4854.808818 2.126511412 0.347762163 6.114844111 9.67E-10 1.58E-08
CAGGTTAAGTCC 1.269971762 0.003770324 EMC Both of them Is known to be PLLPQVPPPTPTDEGDELPLPGWDVIGDSADAHCVFELPMPGDIDQVFEDMLPCMSNPYGWLDEADFEFDIQEQMPPGVHFEGTL 6963.150138 4.046759523 0.463526082 8.730381481 2.54E-18 8.80E-17
CGGTGTTGAGCA 1.302416984 0.004141148 EMC Both of them Is known to be ADMIENLYAQECPAMGDIAEMYHWPTDQTPHPPGWEQYTDIYGLCYDVDLVPPSTSLGGMEPLPPDWECNLFELSSDWFEGTLSF 4013.140912 2.193734231 0.338695225 6.477015527 9.36E-11 1.69E-09
ATCTTCTCGGAA 1.543808173 0.004352367 EMC xgb Random arrangement QMRGDDITDPFMFWGFADSNTLSPNPRRCLEQLTNPEPAPTWSYVSDGVEDEIAQFMDNLENCWHDWDDDANQPSVDPWSWLELP 4149.574481 3.965768031 0.371570289 10.67299553 1.36E-26 1.02E-24
CGTATACCGAGG 1.29844132 0.00457581 MH xgb Random arrangement DLSGHRTSDDLGEAQPHTWDSGWSSHACPCKLPFYDEEDGPEIPAISALDTSLENVSNHEEDWSTLMEFFFPDMTCVMEDNRPFF 1813.184248 1.792415644 0.382942754 4.680636011 2.86E-06 3.15E-05
GGCGCAATCACA 1.440425115 0.004903342 EMC xgb Is known to be EMAGQSPETSVALEEEVGGLWPEQSNVPPPIGNNLTLSDLDSYVDQDLKEGLENELEFPVAPFFMEDIFTWASEGQIQPLPSMDM 3233.922268 2.461643172 0.457322217 5.382732524 7.34E-08 9.77E-07
GGAGAGGGTGGC 1.421472392 0.005156555 MH xgb Random arrangement NEAHGNCLDNAMQRFNDAFECFDVTDHAEPWRSENEMTHPFFSEDYFLGLDFNDLPFTCDYEDWEPIEECVCQGWMDPCAFTPDI 3505.860882 2.614001898 0.477582329 5.473405814 4.41E-08 6.09E-07
CGCGAACTGTCT 1.835221808 0.005225627 EMC xgb Is known to be NHDFDFNLNAWADIPDLNTVLPIWELDDAAVESFFFSSEDSTPMTEGYSKQTKHTFPPIQGTDDPYWGENMQTVTFIGGTWPPCC 647.5695441 0.545441354 0.510447068 1.06855615 0.285269716 0.591634761
AAAACAATGGAT 1.940511594 0.005698395 EMC Both of them Random arrangement VVENSIQSVHWSSDEFITWLFDTDLTEVTPAVATWELPVEEPGMDLPDWTPWPMLFPNPSEEGTWNMADLPHDQLFDMCDQDLWT 1046.096028 3.521284875 0.526185506 6.692097814 2.20E-11 4.19E-10
CGGAGGCGTATT 1.422296081 0.005919641 EMC xgb Is known to be WGDGCDSSNPMDLSPPLMDEHFSEDEIFPRDWDLDSFGLLDLPDLDIPEEAELMDARLSFLMSGGLGKNYEGPGVDLSLGSELAQ 2231.260098 2.725057627 0.643826717 4.232594816 2.31E-05 0.00022716
CAGACACGTCCA 1.48070294 0.006047264 EMC Both of them Is known to be TQDLCPTGADPIDIFEDIFATQTLDFDDALQLGIEGLLDLPMDDMWPSETGLVYLDELLRQYNLMEAKESPIAPHDNVSLYPGEQ 1781.602821 2.746020568 0.488359506 5.622948944 1.88E-08 2.71E-07
CAGCGCGGGCGG 1.317076394 0.006047264 MH Both of them Random arrangement QFDDPAFDELWNAPPSLPPEYVNETMPDFLHQESGSIFAPCELTMPSLGEMLLILDIDSQENMEGFEFFMQPNIEESNIHTLGGE 1700.870041 0.991302797 0.466369804 2.125572428 0.033538877 0.148941466
CATCGCTGTCTG 1.262909244 0.006047264 EMC xgb Is known to be WQGLLPPSHLEPPTLLEDGLTYEVEALDQMLHDCNTLSPSQLMPPAAPAQPVYGVIPLGQLSVLPIGPTIDIDSWEEFLQNILST 3595.947332 1.36125473 0.339416598 4.010572073 6.06E-05 0.000567474
TTTCGCGCTGCC 1.155383569 0.006487806 EMC xgb Is known to be NQPDNAGNYVMDDLDLFMGLNLFPQKIIPDIGDVEFLWAPAQCLPDPSALDDSDLDLLDGQAIPSSSQSWGLSFDVTEHTETDRF 4113.187327 2.142200878 0.326803932 6.555003374 5.56E-11 1.03E-09
CGTAAGCCGCGA 1.634662914 0.006959411 EMC cnn Is known to be DLDGNYFHSSPTGMWWDDFFNEMPVMELPVNCTSNELAIDPDGLEDLDMLGMIPELYSFIASLDNFDPAQQASDVVYEEGGHSGM 2505.988677 4.174856174 0.332854988 12.54256757 4.37E-36 6.76E-34
ATCTGTAAAGTC 1.062295811 0.007347236 EMC Both of them Random arrangement PWAHAQHDDLMDWLMEYFDHDATNFEPVPRTQDQWQPQCFSTADTDLSIYLYDDIMPNVDWELVPCTSADQDWCWNQVTPQPGDK 4881.246364 2.857218788 0.300596523 9.505162465 2.00E-21 9.34E-20
GTTAAGGTGGCT 1.286083245 0.007493775 EMC Both of them Is known to be DNTDSPLNLTYTDMEPWQPDEEAFAWGSNHALPHELVDFDLDITNLDFEEGGLAYYLAATNMSFWDFEFEDWYFNAHGMDMDFLE 2094.349913 0.662534342 0.508036787 1.304107023 0.192197095 0.479060318
CGCCGCGCCGGT -0.898915683 0.007570357 EMC xgb Random arrangement FLKAETPTESQISMCVGTDPTVHASIMQPILGDEPHPTPTMGEEGPPLMSPWTEDDPFNFWEVADTRIWDVLTFPDMSSAEEWWQ 2731.62001 1.845110367 0.517288742 3.566886761 0.000361247 0.002983904
GGGGCCACACCG 1.091984689 0.007570357 EMC Both of them Random arrangement LEQADRAPNALIEGLEDFESDHCIQDTPDFCFTSTPDITQDTMYPWIYDPYLGPGFMEEFVYDMNPPEMLSIIADTCQEGPSTNH 5014.363267 3.127441422 0.303043449 10.32010899 5.72E-25 3.99E-23
CCCTCTTTCACT 1.457688916 0.008218771 EMC cnn Is known to be SWYEIHVHPDMAISAPPQTSPREVNEQRSDNYEPDPLEPFYLPWPEESNDLAEDMLGALNFPEIECFDWDEQLRSNDIDSCGGHY 2360.424338 3.20525403 0.469733778 6.823554491 8.88E-12 1.77E-10
CGCGTATCTACT 1.160156084 0.008223658 EMC Both of them Random arrangement SEGPTSPYVMGTPSAAEGAHPEIGNAMTQPAQWDCQCQLFEYIDANNMCHEAPEGLLCAFGFEDLNDWINETGEGLEWLLGDFTY 1995.254972 2.394469569 0.467982874 5.116575203 3.11E-07 3.83E-06
ACCACACCCAGG 1.613777206 0.008224683 EMC xgb Is known to be VDTDLLAQAMTELSQTVSQIPPPPDLPCDGCMSEALDDFDLTLLGDDEHWDGLDMFAALFSAVNEHGKCAHMETPVWDFPSMISW 2476.992259 3.434909034 0.535244183 6.417461682 1.39E-10 2.46E-09
AGGTCTGGTGCG 1.376487762 0.008224683 EMC xgb Random arrangement DGPTTYYVLPQVPIPFEVSQEEFRCYFEMMPDPSPAQDNAFGSWMYQSDDWYEDFMQQPTSNDLDTEVSDWVDFDPNTTNSEALG 2166.794323 1.631106474 0.490490617 3.325459073 0.000882731 0.006598511
TCACAGGACCAC 1.199366681 0.008358218 EMC xgb Random arrangement SPTHPIFGPTPAFEEWSPECLGAAHHDVWWQCFDYTPDHDITQSLPQNNWDVNLEMFIQHGDPEMYGGIEMYEELVHGDTPESQP 4798.552063 2.538892681 0.40786765 6.22479542 4.82E-10 8.16E-09
ATTCGGCGAAGA 1.511099946 0.008364826 EMC cnn Is known to be ETENDVEVYVANHHGGIFEPEQEFNCADMPSDFPPETPRTMSIVGQELLDLGSLAEWSWDMLDGSPDYDQEWEILLGWGDKFMLH 755.1836636 1.49274664 0.458604844 3.254973559 0.001134029 0.008277244
GACTGGACCCTC 1.132555122 0.00867817 EMC cnn Is known to be TDDTPANEPDMWWISIIDGPLPMEPQQTVQNDMSSAGPVAEDLFETLHSRDVWNVGPDTYLNDECLLQMGGDPEYGGIGDGFLFN 4549.676459 3.251758007 0.672777854 4.833330928 1.34E-06 1.55E-05
CCGGCATATCTA 1.116149854 0.009227394 EMC xgb Is known to be HRFTTPSPSPSILHNTPEQTVGEGLGFKPTTENSEALDAGSISEDISDELMSSHDAAAYEWFDWEENHEIGMFLDMLVNPHYGIN 5383.287568 2.870743615 0.414198361 6.930842521 4.18E-12 8.60E-11
GCGCAGTCTAAA 1.495654888 0.00932812 MH xgb Random arrangement NAMPLDCNTDMGDGETLNINLPSHWCPHHGVDRNAGWATSESAHLDMEDFFPDMWFENVYAFAPIQPVSDQDSINTGLDDFMADM 1541.676253 3.194100626 0.655018431 4.876352291 1.08E-06 1.26E-05
GCAATGGCCTCA 1.446920494 0.009397213 EMC Both of them Is known to be KELDDFEMDMEDFLPGELHDLMFTHFWAEDTVWLDMLESSAFPDFDLDMLGAQLPMPEDAGNLHAPDSKTNMIDSWSELWALSDD 1538.96417 2.251471792 0.389483721 5.780656978 7.44E-09 1.10E-07
GTGCGCGTGGTA 1.559194847 0.009397213 EMC xgb Is known to be DPMCLTTNCYTDLSAIQQELPPQWQECDDLDDLMTWLSNSDAELGLEDAMRFSPLELWGQVALSDFGGDLFDGWGGFGDMYTPTV 817.4336439 4.178843929 1.256175204 3.326640994 0.000878995 0.006580515
AGTCGAAGCGTT 1.103969493 0.009428423 EMC Both of them Random arrangement PDEPTQDPGDSWFQPLDWDLVDLEYETGPYEPPLNLEDQGYHIFDSVMDEPMSQWWMDWDSWLLSGDQHPQLPWFVNENAAGIDF 4397.397595 3.71074156 0.299490372 12.3901865 2.95E-35 4.07E-33
AGACCAGCTCTG 0.991622136 0.009542547 EMC xgb Is known to be GGRTEPAPDLASSLEDILFADQSPSDEGALLDQLTCALKDDPAESLTDMALNGEDGEPEITLWDCEPFDEWGELAMLEQGCPNSP 7303.747895 2.654345163 0.266860345 9.946570212 2.61E-23 1.49E-21
AGGCCACTCGCG 1.623772181 0.010031693 EMC Both of them Random arrangement SPQEGAGPQCSHGPPLDSQQVADEFAPEALVSPCTWSNNDQIWLEEICNDQFPIDDEWILNLINPIWAFDLHAGVTEAEATDFHI 1845.41485 2.05634695 0.390007419 5.27258419 1.35E-07 1.75E-06
AGGTGGAAGTCT 1.319977899 0.010245278 EMC xgb Random arrangement GTRSSDEQGQIPEDASWALGLADEMGSEFNIFVPPELPDFGAQWDDLDFGEIFNDPWEMYSYIRDFQLSPSALPHHPNYAPTHDA 3328.013173 2.46247093 0.376661471 6.537623618 6.25E-11 1.15E-09
CATTTGCACCAT 0.955280449 0.01030946 EMC xgb Random arrangement DQDKDMYSTLMDDPWIFTSHDVELDSYDDFLSEWIWEGGTYWSFAEYSNWQDPANLTAPEQHPDNVQSGIGSQPEQPASHPFYES 6966.796334 2.641627687 0.314227942 8.406724338 4.22E-17 1.32E-15
CTAAGCACTCAT 1.312414234 0.010348854 EMC cnn Is known to be DLYDFDGIFDNIQMIIDDGDMWRPDVDPSWTPPEPLSDFDWTLLNDPAVQLPLMDPLNGLEWHQSIPSEEVFMSKIEDSNFAHDC 2433.562117 3.406892806 0.349764604 9.740530545 2.02E-22 1.06E-20
TGTGGGCCACGT 1.686825599 0.010559774 EMC Both of them Random arrangement DDDKNEFLHWLPLTDPWDLELECMCEYGAPFPPCQPTLHATQDIEDSQNMSTELCWELEQWAESSIDSFDFPAEALPDIHDAWPQ 559.5695347 2.949918542 0.878382711 3.358352238 0.000784086 0.00593272
ACAAAATTACGT 1.184821088 0.010584388 EMC Both of them Is known to be SMPPHANGDLICPPEQQWEELQEWFNRDWCLPPLEDPVEHQLSEFSMSEQTLPGLPDDLLLYELVHQFEAIFDDMSYDPVGPMGI 1992.18041 2.155853461 0.559520605 3.85303676 0.000116662 0.001045527
CGTGCTCTATCA 1.127125893 0.010721594 EMC xgb Is known to be RTPPHPQGPTCELPGEPMDLEPQSDGNMHVSSACDIRDLMDGLGIIDELMSTDGQCSSYLALDCPWWEDDPMFTDPLGQMHLGGW 5226.732262 2.780390902 0.470311197 5.911810994 3.38E-09 5.19E-08
GCCTTGCTTTCT 1.12558143 0.011302143 EMC cnn Is known to be CAQQNNPIPAIYHQLLDEVSDVANFNFDSLGWWTTPFSPHSTDWAQQMEPLEDFDMATLNTCDLDLFDADYGEIWTDDPMWTEAP 8941.401463 3.087946956 0.407783307 7.57251928 3.66E-14 8.85E-13
CTGAGGTCGGAA 1.561730326 0.011365282 EMC Both of them Random arrangement PYAVDAMIQPGFPGPIPYDLEPPADWDNVAWMEEHINPSYCADLFLPTDDLLPYLQREDVEAAIWEPFFHNLAPNQDMDQDAPYN 1527.503273 3.681994527 0.475663506 7.740754713 9.88E-15 2.49E-13
AGCTCACACACA 1.129649873 0.011895446 EMC Both of them Is known to be AQDDDSEDDHAEWDICHVELNAQAAAADLVSPVMQGDHPELNAEDLYWCFPELNEAWMNFLNDEFLTADPTNVINPEGTYSQVQH 3705.125677 3.467448825 0.442140503 7.842413889 4.42E-15 1.16E-13
CCCTCGGTACGA 1.455891879 0.011902135 EMC xgb Is known to be DITSTEQDSSTPEATGPILFEDLAVYANQSSEDNNCNPFYDLSPDGPWESMSLDKFMMARDSPTPAIDLSLESRMLWDIMDVFDP 1923.241584 2.927257661 0.564200771 5.188326231 2.12E-07 2.70E-06
ATTTCCATACCC 1.760869546 0.01251167 EMC Both of them Random arrangement YEGFSTIHMDLWQTNVDHPCTLPDVPAAPMHCEDPDTWMCGLEPLFEDMFMDMDIDFANWDDWALEPINNLGATRDEHPGASTKV 1522.492942 3.027641 0.351975036 8.601862896 7.84E-18 2.63E-16
TTGTCGCCGGAC 1.034022901 0.01251167 EMC cnn Is known to be TSMRQSQIMTDVHNGQTLLNECDFSAPWHIDLDLMGPSIADWSFDDPDLDILWSFGMMEDVPQVSTACNNDEECIPDWGTTQSAQ 9108.758479 2.951815292 0.377409972 7.8212435 5.23E-15 1.36E-13
AACGCTATGCGG 1.327310863 0.012969679 MH xgb Random arrangement LEACAGCQAETQGENWADMFSSYADTPWEPDLLPTWDGADLNNELLWTEPWTMPVDGLNAVPMFRQAEYPHHWDPMAESPEWQTP 2037.900043 2.701353121 0.619738172 4.358861922 1.31E-05 0.000132505
TATTTCCCTGGT 1.322184296 0.012969679 EMC Both of them Is known to be DMLRPGTHTSMQDGDPAGQPPALPHCVDPPSNASEWMSLHDFAFGENMAGEDINPMTMPLGDVLELLETWMPLSSWNWHLDEPNL 1711.6467 3.181158609 0.607952337 5.232578967 1.67E-07 2.14E-06
TCTCGCCCACCT 1.717018958 0.013411334 EMC xgb Is known to be FDDWDEAMTPRSGAFDLFNQLSPEDWSLQISCPLEDSLTVMGELGDLFGVLEQDFPLEHPYCGLLGGEASSEPPGGDGGSGLQLQ 538.2960854 2.329368416 0.607016958 3.837402536 0.000124343 0.001106359
CAGACTGTCATC 1.139557704 0.013635021 EMC cnn Is known to be PGVNMRPMQGEAFELSDDFKMLEAGAFAPPLSPDICTPWGTGSWTEPSDDTVSGWAGLTSEDLYMADPDWLDNLDWGEPLGWVCK 7046.966194 3.031625654 0.302325495 10.02768772 1.15E-23 7.05E-22
ATACTACATGCG 1.476860182 0.013671917 EMC xgb Is known to be FDHFWDITLPPAANDPPSHCSETTGPQDFPDLLTDPTMSHQGMFIQELPDWRDLAELLGETTSPLMPGTPLGMLNGDGENDFSCY 2010.419948 3.874257573 0.430567494 8.998026166 2.30E-19 8.76E-18
AAGATGCTGAAC 1.576052933 0.015004589 EMC xgb Is known to be HSILDQNAPDGSCPNTMLEFDSPPGEGSGPELDMDIFMHDNWDSVEDIWHPWWEVALNEDFAGTLALPPIPDGLAATPWGTSPNW 514.1384993 2.31695255 0.793004123 2.921740863 0.00348081 0.022728452
GATTAATAAGTT 1.11822062 0.01625888 EMC xgb Is known to be TSQEYLGSMEDVCENPGLPDTYFDLARFDLDFPRTPTPGILDSWIEDIWGGSDPMMTPPLLPNRDFTNSTVEYLAFCFNTLGSSE 2584.262615 1.41448368 0.434855651 3.252766009 0.001142876 0.00830512
GAACCTAATAAT 1.410626294 0.016324156 EMC cnn Is known to be TETGSRGSSSPDDSLDLLPEAEWLSEWCPFDDQLNFWEALDMDMESLIYNDLSPGLVLDFIDDNWGGPALGGIDSGYEDDAGTSS 1416.381787 3.90487506 0.4323414 9.031924914 1.69E-19 6.58E-18
TCGCTGACATTG 1.123269509 0.016768555 EMC xgb Is known to be SDGPVIHQPGTTMLNEMDEHLNLWPQLQLEQCDLFWDNPFYEHPDLNAEDPWEPLTFDMADNSEMESLLAHACNEWDMPFQWNTP 5032.918063 3.233806936 0.336175534 9.619400021 6.62E-22 3.35E-20
CATTCCAGATAA 1.012571839 0.01711985 EMC cnn Is known to be QPSWGELPLGMPINEWFQTENDQLFDPLLDWSPMSETGSGLCGLSKQGTFDEGLADLLRDALDGYCVGDGSKQQGSCIDLGGGGV 2986.106351 2.007973984 0.407299056 4.929974554 8.22E-07 9.75E-06
GTCTTAATTTAA 1.257984957 0.01711985 EMC xgb Random arrangement WPDPPDDANINYGLTFQHPNCLEANFGLDIIEDTDPQVPILDEGISQRWLNDLNSIDELDIMQCMYDCYAPAEPEMQPMSQDDCP 2264.313462 2.667781721 0.414199267 6.440817104 1.19E-10 2.13E-09
TCGGCGTTAACT 1.003143771 0.01711985 EMC Both of them Is known to be ADPGTSDHGGPDALDTASMWESCLHPWDVMDTPSMWPFPCDVLSDPEVQSAWEDPLWEEQQETFPYFSGAEAWETSYACGSHGGI 1760.343596 1.646696854 0.484712842 3.397262691 0.000680636 0.005246084
TTCTACGTCCAC 1.428007407 0.01711985 EMC Both of them Is known to be SPIEDTGDGTREGSISQAADCGGINLGTLDLGGSPGLSSDLYDVLDSDLYHDPDFCGGNGRPGWDEGSDLSYQMFELLGMPPSPF 3241.759604 2.301850716 0.61535669 3.740677159 0.000183525 0.001590123
AATGGGTTTGCT 1.149943774 0.017290882 EMC Both of them Random arrangement QAFVDTMDGPMEWPMICVEPMPDVDPWIECSAPATDLSWETFMVDLPQMSHNLMNDLENMELTFHPTPLGTSAWSSSKRSEENEP 1845.309149 1.610641127 0.557907956 2.886929841 0.00389021 0.025038804
TGAACGCACTTG 1.137142632 0.017314827 EMC Both of them Is known to be LLGGDSMLEGLPDFCVFPEFEESELFAPDLMNYQFATPHSEPWTWEITDTVVNPPGSNAGQPTDDEVFSPYDYWSYGPCDILHSD 4594.223939 2.313571267 0.430206019 5.37782171 7.54E-08 1.00E-06
ACATCGTACATA 1.592888162 0.018593655 EMC xgb Is known to be GTPSPTTAQTCSDHIATPEWPGLYDVPTLDEPDEALPDLYSGMPLGSCLYSDAGPMLEAMMGGEGCWAPLMTTQDALDWAIWAGQ 511.1722915 2.292092682 1.210213061 1.893957979 0.058230588 0.217968877
CTCTATATGCCT 1.521113047 0.018593655 EMC cnn Is known to be FEDFGPAMSWLWEFDFDEFHLVPEDPDTFLGPLGDNMFTPGAVDQNLFECPELNFNEQAVNLEFPWGAMPKDDEAEQGPPIDLHI 3736.025783 3.656939227 0.382670283 9.55637108 1.22E-21 5.93E-20
TGGGAGGGTCTC 1.059892856 0.018593655 EMC Both of them Random arrangement DQDPLDSFPHTMENSDAEQCDETINYAADQPDHNNPHGCEFDMALNTVDCFGILYDTWDPLDFAYILGLGSDPPSSPVEPDGQYQ 4369.677474 2.464301965 0.438895846 5.614776235 1.97E-08 2.84E-07
GTATATTACGCA 1.514128901 0.018704263 MH xgb Random arrangement LCNMTTFNPVLHRVTQECTEFPTEGSWDTASDAPHTVPASELENDPDICFETGWLDDMNSACFWPEVLTNTWDDVQNHLPNEWDT 1114.229819 2.077388228 0.51147197 4.061587636 4.87E-05 0.000465426
CCTTGATCCCCT 1.088586817 0.019204379 EMC Both of them Is known to be IPDHMQMLVEQITDSQSCEWSWGSPPPDDPSICHESITPILGEFGWQTPPPEESTSTDPLFNNLVDNMFACWPESELDLMAREAD 4460.218681 2.787562245 0.31182604 8.939478718 3.91E-19 1.45E-17
TCGCCGATGTCT 2.256041935 0.019492645 MH xgb Random arrangement WFSWEWYPSYKSMFCAPPDEGMYLMFVAEDDWAMIWNALHSEAEAPPAEFSAMVFHGTCEGCQWDGCMDWEWYHYITFCGEPDWF 308.5796053 -0.37528899 1.270640806 -0.295354114 0.767723382 0.910417912
AAGAGTTATTCC 1.234874603 0.019495522 EMC xgb Is known to be FDWFEQKPCLQEQTPYTPYPLQGPEACVDDVLDLVCSDMLEQMSNTDMDSSLPALLDMLRDDTWNLNQILVPNELENNPETGWGQ 1699.55695 2.874384199 0.592620356 4.850296098 1.23E-06 1.43E-05
AACGTGGTCAAA 1.56311041 0.01963273 EMC Both of them Is known to be YDNTNPPSGFENPFVVELCDWRDTIDTDALIGPVPHGAPDLSMLAFQEFIDMLTMQLAEGSGSQDMQEWGSDWLEQPATGPWIIP 261.3047295 0.559036747 0.876993677 0.637446725 0.523833901 0.780820201
AAAGCAAGATGT 1.05678338 0.019701327 EMC Both of them Is known to be TPDTWCPQAPEGNIDEGPSTHPSDDFPPLPEEPTTYWSDALRDLSLDPLTPELCTTSFQAADDMIDFEQWFMNTYCANMYAMNSS 3856.706694 2.76374481 0.353797043 7.811667343 5.64E-15 1.45E-13
AACATCTACGAT 1.16075404 0.020000803 EMC xgb Random arrangement SPDDNQPHPPSRDVDAQDIVPYVYDPFMDVQFLDQLGHDWRDAASDAVYDFNPVDDQYFTQMTQTTCWLPTMEMIQHPWEGTDTP 2072.120597 1.779952662 0.450393657 3.951993181 7.75E-05 0.000717953
TTGAGGGCATGT 0.958858093 0.020529901 EMC cnn Is known to be CHQDQCNADMPGSGDLWMSGHCYSRARTPAWEMSAIEGLSMWPPDDAQSVYSQVEFESLLFPIPEDMRECLDMDLFDLSLDELPW 5095.254024 2.191667995 0.482000376 4.547025489 5.44E-06 5.77E-05
ATAATTAAATGA 1.249387678 0.020534923 EMC cnn Is known to be DGPPKDTFGQDWALRPPLCDENECFADDILGSPSALFDMFDLSFSDLILPGYESFFGLGNTPADDAPFNMDDNEVWNEGPGGGAW 1302.602306 2.809842224 0.383575549 7.325394524 2.38E-13 5.39E-12
GCGAGATCTGCG 1.008379131 0.021779882 EMC Both of them Random arrangement EAGIIIPDSQPLIPETPVKILPCAQDAHDLSCDCINCSPEGGAQDSTIPQLDMWGGSGILSELDMLDLPGWPEYIDPAELDVFWA 4347.788744 2.954247263 0.392674885 7.523392451 5.34E-14 1.28E-12
TGCTTTTCTTTT 1.100986273 0.022754242 EMC xgb Is known to be ATMLTFPAAFPQEPFPLLDEMSDNEFGEVPSMADWAPGSTAETDSWFSSPQPAPEYAAQAVAPPQLDDVSDMFVTEYNPCLWWLS 2682.03257 2.465837164 0.462810835 5.327959023 9.93E-08 1.30E-06
CCAACAGAGGTA 1.125909189 0.022880236 EMC Both of them Random arrangement HWPPMLGPEVEEPPFLEMPPEEWYSDEIHTCSYDEFHSGAWVEIDMLEAEDGWNDSWLDGQFNDNNDTPLEELVGSLLHDFIPMD 2232.518059 1.995885189 0.399050683 5.001583188 5.69E-07 6.89E-06
CGGAAGTTCGCG 1.135318653 0.022962335 EMC Both of them Is known to be SMLDPSACAQLFDISMWDPLCEAEADWAEWWGIAQPGAVMYGNTQLDPLDWLMPSAMQESEDSMECPQFWESGEGTSQWPGWEAL 2709.969879 2.694220053 0.394707663 6.825862037 8.74E-12 1.75E-10
GACCATCAGTCT 1.599335124 0.022962335 EMC xgb Is known to be FCDQNHENSHLANDLLELLEPGWGDPYPTPSEAPEEMSLHDIFDVMVDGFESLMPPPLGEPTTQCELQGMTICTWEEGLGYATPS 1441.159795 2.722638532 0.499483021 5.450913084 5.01E-08 6.88E-07
AGCCTCGTTGAG -1.045857279 0.023372582 EMC xgb Random arrangement PEWSTNDLYFMDDPNQWWDLDWQRPEPHEDDDHMMEDHPFHEMLGSFINPPPAPDSCCDHFFPISDSYIFAHDNWETCTLPDMFP 2214.791448 1.204754332 0.451092578 2.670747403 0.007568258 0.044920101
TCATATACTGTA 1.097995541 0.023556481 MH cnn Random arrangement WDDSIEMWPPYDLMEDMIMQNIEAVHASQGWCWDDSAHSQAFTHPFCGAPTDLMVRDTQNNEAPASEWFEDIFSDMCSNLLTTNF 1842.348034 1.549201199 0.326165614 4.749737958 2.04E-06 2.30E-05
TGATCCAAGTAT 1.614411445 0.023932675 EMC Both of them Is known to be DWQLMDQMSHPDDWLCPELIGGGFSDEGALLDQLYEALVEFDCTEEPDLDLSLPELEDQWKFPAPPCPPDGDSNWYTGGPTEEPA 1385.451928 3.057010888 0.356594453 8.572794282 1.01E-17 3.34E-16
GTCACAGGATGG 0.981063302 0.024371418 EMC Both of them Is known to be FTDVHTTSYPIVPWIPCLQDLRTYDVPWIPSPIEFSPEPLWLPNMLDDLAWNEANLDFKDLYDVFYEVDGSFHAQQFPSSGNGST 5797.133801 2.755914508 0.27533456 10.00933012 1.39E-23 8.38E-22
TCGACCTGTGAG 1.407610753 0.025506233 EMC xgb Is known to be MNESCNQCYPNDDMLYFYEAPGYDIWAHWWPPGSSNATGIFDWLDLEPSSSSGSVPASIPPQTPQPSTASEALSDFELTFLGLEG 1074.733753 2.272360324 0.440297141 5.160969974 2.46E-07 3.08E-06
GGCGCTTTATCT 1.400560938 0.026607366 EMC Both of them Is known to be RDVEALEGLLDGIEDMLDYAMSWEPCQDCTPPDWNPGPLLEPLLAPSSNLPNLHDGGGLELDHGFSGWLDDGGLPESGDEQWLIG 2267.456471 2.920803821 0.625522766 4.669380521 3.02E-06 3.31E-05
TCAATTATTATA 1.237898968 0.026659198 EMC xgb Is known to be PYGMIAQTVTDISLPELPDWAQDPFANTIPLDDDMCHPSMYTELWGSTEFWDESGAGMWHGLIPNPQELQDWMDDFPLASEASVI 641.0475175 1.511216726 0.789190822 1.914893945 0.05550603 0.211932115
TACGGGCTCCTC 1.149021449 0.02679397 EMC xgb Random arrangement PDIDNALWQLVWDFNDLEPWPDPMYDFLEDQLGSVGYPVESTGTNHPQQSPVENSYTDLDAVCWTLGPVMYSDEFANLPVHGPAW 1408.772568 0.596089764 0.561698054 1.06122811 0.288586248 0.59444449
GTGCAGGGCAGC 0.955762756 0.027110878 EMC xgb Is known to be MDEENSCDPTISENMPNKDAHMAGPDGSFWSLLFTELDAFDEFIPESWTTPELNEILDTFLNSECLSEEFNALPYGLGLCSHRGL 1492.999185 -0.263691802 0.591626793 -0.445706321 0.655809398 0.849802177
CAGCCGAAGCTT 1.77082623 0.027707016 EMC cnn Is known to be IFDDGMWGSSMQFLNLPPQHVSPNSQGHCEPPRESTSPANSEYPPQDELWSAFDMPDLWGPEVWASDCPLSPEDDALIMHALWDL 809.5576008 3.856108293 0.620458088 6.214937585 5.13E-10 8.63E-09
TACCGTGCATTA 1.502856356 0.027847637 EMC xgb Random arrangement DLCPFFRTADATELGEIQCGCWNREEPSETYVQADAMFQNLPPSFPPAFDQPDWHNMPATPFELDNDDWEPPMCIFADTIDYMMP 403.5247589 0.984965748 0.766455255 1.285092302 0.198760031 0.488052659
CGTACATTTATT 1.273836191 0.029215088 EMC cnn Is known to be FECMTPLNISCSDDEFDWFNMYNTEANCVPVMEPTDQWRLCELIDLDGCMAEDFDAFAGMNEVVGMIPSPAEPIPEPHPINDEYP 1247.385731 2.342135392 0.535869043 4.370723447 1.24E-05 0.000126282
GGCGGAGTCTAA 1.661830274 0.03039262 EMC Both of them Is known to be GGGVARPMPPTPENAPNMEAVCMAQGCSPTLDLTLEAWEVSQENLGSGPEQSAAALFGELSAPEHCLQGWDDSMFLDWLGNIIEE 2807.031028 3.755407141 0.545360396 6.886101682 5.73E-12 1.16E-10
GTAATTGATTAA 1.325647744 0.033007892 EMC cnn Is known to be ASDDVFMDPLFANLLNFDQNTTSDVLAPHECQAHAPFNPVMPVYEPHSGLRDTNNMHDLLEDLLKDCEGNLIAWLPEDIGYSPMG 1787.738652 2.73489735 0.530172406 5.158505646 2.49E-07 3.12E-06
CAAGGTCTTGTG 0.820705836 0.033234016 EMC Both of them Is known to be AIAAADPTDTNEYVAEWTENLSSPHFYIPDVDQFNSFEDFFAADPEAQAMEPSDPFPLLPCDFLALFESEVQQGVIYSIFQDPLD 13539.16734 2.139092304 0.256388064 8.343182093 7.23E-17 2.21E-15
CGCTTTTGACTT 1.635760883 0.03325475 EMC xgb Is known to be LVEMSCECPQMHTSMEDVPLPSEPMFDEPITWEEPAGSSDLNTWPSSAADLMGELMGLLEDDALSLTCMELDELAAHGATRTGGP 559.2276037 3.210821998 1.160056556 2.767815052 0.005643347 0.034486348
CATCCCGCGACG 0.999728313 0.033257774 EMC Both of them Is known to be TDGPMIDLEGLGAIPLNDPKPITIEAYYAATHETNSAAGLALGSPLMHAPSDVVSSMMENLFDDISWNDEFWFATIMPDPEDPWQ 4157.703523 2.349848348 0.333285376 7.050559425 1.78E-12 3.76E-11
CTATTTTCCCCG 0.941594944 0.033551133 EMC Both of them Is known to be GCEDPLMALIQDDLLCPAQPASEPFDEYELLRELQMFSSQDLEDVFSQFDNDLYVMYDVGPNFDGDLDPPPSWLVGQLSAEVEGP 4726.505973 2.314271768 0.343661311 6.734164402 1.65E-11 3.23E-10
TTCAAAGGCCAG 0.951367939 0.033968364 EMC Both of them Is known to be PSMWEPGPPGDVFSTIDESPNIDPDAWAEWTDNILDFLDPLSPQWLYALFDNSGDPYPARDHPTVDVNNEFISPNSPGEMSSQAG 4177.135811 2.347757107 0.334800851 7.012398864 2.34E-12 4.90E-11
CCGGTCATATCG 1.601923214 0.033996957 EMC xgb Is known to be PPMETNTVHEWAPDLPDDPHVSSPYQALEQTEQTHSSWSGSTPRLSPDELMNLDITPLDGSSPTAWLEGNAGPPEYFCLDWHDGL 926.7316799 1.795602458 1.138355972 1.577364641 0.114711647 0.344970219
CGTTCCAGATTC 1.015521135 0.033996957 EMC xgb Is known to be GCHCSAPTNMHDLLQGTLFGVDESDVSDQVFNISPMPAIWSESPPLPVDEMLFDLPLLAAEFTLDEWDVFNPSMDALANAQSPEQ 6552.272037 3.051544667 0.330734639 9.226565075 2.79E-20 1.18E-18
GAACGAACTGTG 1.425181324 0.035049083 EMC xgb Random arrangement LANNEHSNWDDDPNLCNLNYAYTEEDNIWPNLTTEHACVDCWLQGWEEIDNMDLAWGIANAWSLMDWKQDTNPVEEGPSTFELPN 514.9300725 0.914022988 1.206146123 0.757804523 0.448568014 0.729383472
GGTCTGAAGATA 1.421839047 0.035135205 EMC Both of them Is known to be GWDLFDDIIQESNNEQLLYAGLLWQPALPSDLDGMPIPGSASLFDLEIDEFNKGQDLPCFDDAARLGDGSQASEASEDWFFDTLE 841.7170555 3.194802401 0.699971325 4.564190398 5.01E-06 5.34E-05
ATATAAAGTGCT 0.918552722 0.038975766 EMC xgb Is known to be DAATDFTSNPNELIELDDDTLDNEQSYDMDDFWLDAERLLALEDFWWGNLPSLGDTELSMPVHTETMQQLNAMVCSGAWDIPQDM 4067.584844 2.24834894 0.398805586 5.637706733 1.72E-08 2.50E-07
TCCAAGTGGGGA 0.989124001 0.039109349 EMC xgb Is known to be SGGGSVFGSIPTNDLLNMTDPDMLGSGETDLWQPDVLGHDDLSWFDIDYKFGSLLDSLIAVDASEGCKLALDDDMPLVGSEPGDP 4447.665827 3.593838668 0.381518268 9.419833782 4.52E-21 2.04E-19
CTCGGCGAAGGC 1.235213341 0.039495304 EMC Both of them Is known to be PNGTDLDSFMGGLGAQLQHPMGANPPEFPADLDLDYAEADMEMDTMSLILNDFQDSSLCSFAFDDELGGEGGGLDQPWSNGGPWS 2816.659385 2.071811426 0.486070669 4.262366687 2.02E-05 0.000201298
TTCTGCGCCGAC 1.302432054 0.039656979 EMC xgb Is known to be DGCTETGSQPMEHNMDQLDMVHNLFGDDLSQLLEDLSNPDHGGGHCGNGCWDKWMSDMLNLFDDIEWNDDFIRHGQPVVPGGSGE 2025.235526 3.048397037 0.588745928 5.177780245 2.25E-07 2.84E-06
TTCCTTGAAAGT 1.400807483 0.039795333 EMC xgb Is known to be YLGVPNPSTYADMQELGNPATMFVPVLSFDDLLFPELEAFDTFIPDSNNSPLGNWLSEAQSDECSNTGQTLDRWLWGYDWPWAWQ 839.3191173 2.252574248 0.49744246 4.528311174 5.95E-06 6.30E-05
TTCTTGTTTGGA 1.11251109 0.04006833 EMC xgb Is known to be QEPSWLAVSSPFLPGLDDDPLSMQHPMIPDFTYDDFHEIDFNDIFDGLRSHLADEAFGRNSGGPAPGVGSGPQEPYGLGPVGGME 1312.005464 2.331425043 1.097865397 2.123598257 0.033703751 0.149405896
TTTTTATGTGTG 1.065216684 0.04010311 EMC Both of them Is known to be GGQDGVENGKAPRPQVVQPEPFVAIADYFASDMTQLSPHRDLFMGDFLLDNLDAPDWSFDAMWGWQVPQPAPFQPYQHPDDDGNA 3277.534429 3.168252356 0.431570396 7.341217993 2.12E-13 4.83E-12
ACTGACGAGGGC 1.64329437 0.041140518 EMC Both of them Is known to be NRDSFGPPVTGVFDQLLGNDYLYELQAPAPWDASSAGPTSPYPRSSTNLFEDVSLGDLWNDDASDPPFNLDDLWHVIDHNMPLEP 1723.012354 3.684075384 1.421416507 2.591833826 0.009546587 0.054445208
ACTGTTAAGGGA 1.16546944 0.041428386 EMC cnn Is known to be AGGGSPFAMSEGWADPPHELELQQWTFNIFGSTIEGMLDLPDFDDAPAEIGASILGLDNAGGTGAGGFDPGAGQEGEVDMYHDKR 638.5501762 1.359046496 1.18761498 1.144349406 0.252478762 0.555386037
TTCAATCCAGCC 0.907892794 0.042665943 EMC cnn Is known to be TCAELNMEHLPAIWEDAWPPPTDSEFAPYDSQLNPFTPRLPDMCMPSNDTPPATLRPSWDMPGVESADLCALFQALGAFEPDFGL 3028.433187 2.071745478 0.446388446 4.64112702 3.47E-06 3.77E-05
CAGGGTGTACGA 1.11388747 0.042832033 EMC xgb Random arrangement WENTPNHTGEFDEGPDNAFYDPTWNGFQEEFVWPTTDHTPDNMDVVHPMFMIDMDEADSSDVWAHFCECSGDTLHDIWQLPFDAL 3224.146209 2.900242918 0.451883148 6.418125865 1.38E-10 2.46E-09
TACCACGAGCGT 1.327328868 0.043335962 EMC xgb Random arrangement MPLQLNEGEYTWHADMEDHHMDDLPHTFADHPSGYDTYYINPPPFNQLPQMEGPILNANMFRPFLHEWEAANDDPWVDWCWSSDF 352.6491311 0.490203073 1.268014772 0.38659098 0.699059037 0.874220688
CAGATGGAGTGC 0.976122114 0.043724344 EMC xgb Is known to be GGGPSGGSPPPDDPTEPKWMCLNHNKLSDDMDGSFALDRLFTDMLGWDPSDALDVDEFGFELGAPPQQGGDTEDEPMMFNMFEYH 4031.348751 2.453836007 0.408519218 6.00665991 1.89E-09 2.98E-08
CTGCAACTTTGG 0.984864968 0.044564947 EMC Both of them Is known to be HESPDLWCDPLPDQYELFEPGLDGLDHLLGMDQYNIDDLLTQELRDMDIPLVPGASVIDEPLGAHEGGSGSEGDEECHGEKISRP 2675.219106 2.318572176 0.469871262 4.9344839 8.04E-07 9.55E-06
GTCTACGGCCAG 1.223741511 0.046727067 MH xgb Random arrangement THSQDTMYPNPFCVPYMGLWGTDQDADMSPINDQDEGFPSEVDFWITNYVSDNNIFTFNLGQIANEWDQELNSCFETPPGEDMDD 1058.526105 1.623644138 0.786424653 2.064589573 0.038961851 0.166317771
GTCCTCAATACG 0.826300721 0.048516747 EMC xgb Is known to be EPAKPSGNGMWDHLEDQLFEFDTVFDFPEPWEISEELFAYPSLQTQNALPELTYMDWNFVESFAEPMQELCRWGLNQDENPHDVG 6464.708 2.792655406 0.359515197 7.767836883 7.98E-15 2.04E-13
CAATTCCTTTGC 0.831227396 0.048686142 EMC Both of them Random arrangement PIPAQPEDTYTALIDDDFMPEIFPGIFMDVEINSDEAPWDGCDQLCMDLEDWCHVLDNQIPMPWPGWDVEEFCNFMNLEWFGDPV 5618.669025 1.450448693 0.266773018 5.437014222 5.42E-08 7.36E-07
CTGGCGATCCGG 1.509261177 0.049857543 EMC Both of them Is known to be IALGDFPQVLQPIQVDLSPPNNMDTPEWMYETVFAASVHPSNSGTMDHSLCPDYIEDLLNWFDDIENPTGASDPVMPSPMWWWYE 506.1159552 2.448081075 0.467755573 5.233675912 1.66E-07 2.13E-06
TTAAAGATTCGA 1.057173885 0.050012032 EMC cnn Is known to be GCMFPDTPWPQNNGIMLEPEPLSEDCGAWWVTTPAPLDAVMCDEPSELPWSPGSLELLVPHLSPAYFMHDEQGWEDCTEINMWLD 2196.392731 2.081658767 0.392747232 5.300250632 1.16E-07 1.51E-06
TTGACATCGACA 1.59960326 0.050486623 EMC Both of them Is known to be ALDVLPHDPWGQWFAEDFTAISEELALNELLDNHNFDALWDLLGDNLSDTPSLGDGQPGGGSSCSWQFGFGYEAGLPAGGYEGPH 1045.805825 2.306082466 0.58880701 3.916533649 8.98E-05 0.000822928
CCACGTTTAAGT 1.062422042 0.052772993 EMC cnn Is known to be CDNAMTDDDYICFDYLMDPVDYNLFGWYDGPDYANIPDLIEQELRDVDILALLDPRLDPGVHEFGEMSGAEGSPQGGGGGEHYAM 2431.151176 2.648346079 0.607260542 4.361136445 1.29E-05 0.000131404
GTTTCGCAGCTC 1.459966262 0.054275878 EMC Both of them Random arrangement WFSVPEQEAWDTLANLYWGDDGINDHFPVPVDIQPDVLEFEWAMPCFDPIIGPENEHGPCIPDGELPLELAWTLDPGNDDAMHPP 890.7711605 2.575647313 0.767302721 3.356755088 0.000788629 0.005957999
ACCGCGAGGCCC 1.081160711 0.054628753 EMC xgb Is known to be FPVPEMIDLGTIWPDLTTIPEREEEGEPFVREWFPLDCPEFDLNLDFSCEQHLHEILDTILPDEEPYLPMHMSTGASIFETDFWG 3702.869437 3.415053974 0.441511907 7.734907981 1.03E-14 2.59E-13
CCTAAATCGTCG 0.895234697 0.055394289 EMC Both of them Random arrangement DIDWEHLPGDDDSSMAFDSEWVEHWCSRSRPTCINIPMPAYAGQCSHDDKPLEPGVEAMQFPNDFLESQLPEALMDFEETFMFPW 4137.848285 2.022660692 0.427938578 4.726521039 2.28E-06 2.55E-05
ACTCAAAAGAAA 1.45110571 0.055488103 EMC Both of them Random arrangement ITPPDVDMIDIDPMEDPFMGCYWPETDECSAIHVSAPQPYTPEADLGNFWRNDFSNTDPYHGNSLEEDLCQDLLDMVALDPDGWW 570.8229429 2.435772694 0.601549251 4.049165869 5.14E-05 0.000486297
AACGTGGCCTCG 1.063125422 0.059568712 EMC Both of them Is known to be RGAVAPQIEIPCSAPQDEQAPGPVLEYRPMSCTLEHFSLPDLADLQDEDFDGLSNEMQVIGNWLSILENGGMPEFEHFWDLMDKE 4663.068086 3.029107746 0.407391258 7.435377377 1.04E-13 2.46E-12
GTGTCAGCGTTA 0.860504926 0.059568712 EMC Both of them Is known to be VEECMWEDVIDKIMPPMDHVWGIGPDALCTDYNPGPGPVLQGWLGEVSPLPSVSLWDHGDDFHLSSMDPSSKLDDFDYTNLCDMG 3531.710966 1.916214268 0.427152258 4.486021633 7.26E-06 7.62E-05
GCCTGTGAGGCC 2.487869892 0.063693781 EMC xgb Is known to be LLDIPTDFDFAENLPVLDDDGQSFEDAVSAWYDLPFPPPRSWWDLLEQEPYDYSCFDELCLDASPMVTPELALADPDLGWMGDDD 200.4394615 4.046262688 1.287057103 3.143809764 0.001667638 0.011673467
TGCTACAGACAA 1.586007168 0.064155108 EMC Both of them Is known to be NEAQPTLEDQAWIQFQGLLMDEWLFDNPDPLEHIQCMSTCDLADGCAPPFTPLNMPELWNPLTPEWFMGYDPCEGALTDPQWFDF 1193.609723 2.781958286 0.800088348 3.477063869 0.000506937 0.003975285
AATACGCTTCAA 1.198788473 0.071261037 EMC Both of them Is known to be ARIHHVMQPTPHEMSCLEQLPWMGPGSMDDVDWSSLGPGADMPDIFDELMSSDVFRVLDLSPLPASGVGTVLEADIDVCQTYDSQ 1117.216265 2.27352054 0.386734617 5.878761398 4.13E-09 6.28E-08
ACGTCCTCGGCG 0.999863547 0.071707829 EMC Both of them Is known to be PDPQIGPWDALDDMDLLEAPNIDFDAHLLDWLQDAYLDDLSNPSVESNVLDDGLLTTTLGFMGQYIPSHSGSHRGMGGAPYFKPM 4920.800709 2.364701694 0.278886903 8.479070449 2.27E-17 7.26E-16
TGATCGGTCTAC 1.01135485 0.072269609 EMC cnn Is known to be CSDSPHSANDFQGWFDNNDYADDTPFACPWPDFASFAMGMPAMPESLSPRNDVPAGDELLLSALPDMTGSEAFDEFMYELLTIIE 3240.033364 2.742149155 0.33791799 8.114836243 4.86E-16 1.41E-14
TCGTCCTGATGA 0.99170622 0.072844177 EMC xgb Is known to be PSGGPSGEEDVVSRNLEDILGAVTSDEQAIQQIFEDSLMCDTSFDDLALPLSQLGIHLGGGTICPFAQDTENLDDNEWTCMLFDK 3363.877851 2.644819965 0.321612394 8.223625748 1.97E-16 5.86E-15
TCATTAAGAGAA 1.510648425 0.073054865 EMC Both of them Is known to be FPGTFGNWDQAMTKDLPPVLWGPSLSDEWPMWDQLDMPMWADLSDAMNSTEDVLSTDFLNTLWGILDNDNTHCGVGGVCGTQPSD 706.0666055 2.60496855 0.501874889 5.190473971 2.10E-07 2.67E-06
GTTTTACATGAG 1.39728983 0.074994939 MH cnn Random arrangement ELAQLMPYMTTSQAGVQPEDHLFVDPHVHNVCYEPELPSQNTPVDHENSPLDLLSGGFIEELWDEILAFPVDWIFSDWEEVGTEA 678.9787266 2.533953486 0.605899527 4.182134783 2.89E-05 0.000282849
GCGTATTAAGTA 1.063329795 0.077061106 EMC Both of them Random arrangement PWLEELQYWDWYLDQNEGDTEALPEVLHDEPQPRHEPPNPPAMEDWNPLSEYISWDFGYCDAGDMWGIMSDLINQDDGAFSPQST 2952.376268 3.275913405 0.604208431 5.421826702 5.90E-08 7.97E-07
ACGCCTTGCACT 1.238974983 0.078455385 EMC xgb Random arrangement SSDPEPSNIECGASYMDYFEFPAFWDQSVFEVPLDCRPAEFSMERRPHDDSLAFGEVIHDAIQALMMGPQPYWDPHDDDPPPCCF 1161.311284 2.343123557 0.560160566 4.182949856 2.88E-05 0.000282395
GACAATCCTTTT 1.087568544 0.078965637 EMC xgb Random arrangement VDNPPTDSTVNMPLHPEHSHQAQCFHEPNSCNMILDYINAVGITEPMEQWLHNDFFCPDAASLCEQGDCIPNWFEYVDDPLGWPV 2136.974915 1.330170116 0.376497503 3.533011792 0.000410854 0.003300151
TAAGGCAAATAA 1.354463512 0.080273044 EMC Both of them Random arrangement GTADPMQESKPKGHCQQVATQPDQPDWTMSAGSPKSFNIDESWFEHLFPIDLDPFAFLDIQNGLADLYFEDDQVELSDNPVGYPH 1281.63918 3.105100014 1.067662827 2.908315187 0.003633819 0.023665186
CCATACCGCGAC 1.238833894 0.085536334 EMC xgb Is known to be CDDFEIQFRTPPWVGPPMFDQLLEEVLADNTPPPSHAEPQTLDNCYDLDMTLPTWTQLAAFMNIEHAFGGLSDNLNFDDIDIGMG 1693.381549 1.509936012 0.401190571 3.763637835 0.000167459 0.001458574
TAAGGGGTATTC 0.882160586 0.085805723 EMC Both of them Is known to be FDADASPLADSLFTDIAGNGLPFFNAHDLDASWDPVPMSGWSSWELNDLALSGLWELGDILADYPKSMEGSPDAGGPADFADLMF 3492.335706 2.123958736 0.34950881 6.076981962 1.22E-09 1.98E-08
CATTATTTGATC 1.242334654 0.088998747 EMC Both of them Is known to be NGGTPETQCGSWSGKQGITCEPSFYDWYDWYEFPTAMLQHELPELWGFDTPDVTPNDSIGDLRISPDPMEELEMLLGLPEPWSPT 1609.776922 2.275132397 0.338560151 6.720024172 1.82E-11 3.50E-10
ATGACCCTACTT 1.120298384 0.089644903 EMC Both of them Random arrangement GDDNHLGNNGVLNDSNNLIEELLGWDYWGIPQPESPLLYGHEIQIDCWMPCPRYSPEDIGEPMDMPEMDPPHDVPAIDLMSDLQQ 1773.18272 2.827133615 0.70112301 4.032293301 5.52E-05 0.000519441
TTGAGGATATGA 1.005348594 0.091696334 EMC xgb Is known to be SDTPNNAGSIMEDSNCNIPTLEGYDTLTLDEFLAGSDFGDLWKPIWSDPYDLDLNPFVFVPAITSATDQQIESEPTAVMMDTFAT 1601.642966 2.984039958 0.443719387 6.725061027 1.76E-11 3.40E-10
TCTGAAGAGAAA 1.22535406 0.097816237 EMC xgb Is known to be RESVGPQTSLWGWPNDPHQWALHDVTDHPQNDATTALSANKPSGWTIDELMSSDLGSWMCLTALPDMDYNFVLDPNDIDLLGFDD 549.1390752 2.351847801 0.76176278 3.087375574 0.002019323 0.013880394
GTTTGACCGTCG 0.97334251 0.102062656 EMC xgb Random arrangement SVPGMEPPLPPTYWASLGCNADCMVECMPHEAPTTDYDDRVLYEEFDIYNFVDDMMQFTLWPPEMQDDEDMMHAGLDASIPDPGD 1516.721171 1.894818755 0.443668898 4.270794649 1.95E-05 0.000194229
CCCATTGACCTT 0.754869028 0.104788444 EMC Both of them Random arrangement PWDDVNWFVQEEGTQIEDDQPEQPFHPADYITAPTHVRTAAPTFDTYNDLDDLNNPMLDWWNTENLHTWCEFFPLIMEPANQSPE 8858.531222 2.855228564 0.322595503 8.850800878 8.69E-19 3.10E-17
GTACCAAAACAC 1.381719665 0.111266904 EMC Both of them Is known to be GNMTCDLAVDAPGGPVWETHLTAPDELPSDPMLDDFWLDLLDHVGQELPPLGVLSKDMLNLEDWAECTDWFSQSGTDGGDARNDK 1184.704884 9.345969355 1.882505198 4.964644648 6.88E-07 8.26E-06
CCATGGGATCTA 0.916129186 0.111612694 EMC Both of them Is known to be DNMTPLLEDLANIDWGAVPLQPEESHCQWVMPPQFSLMYPSIDEMRNPSPMHLDCWGLFSPLQEELEDWELLEFDLNVIPGDPPY 1270.948453 2.081030678 0.540756561 3.8483688 0.000118907 0.001062463
TACGCACGGGCT 1.215069064 0.111612694 EMC cnn Is known to be EDEEMWETRSDYMSAAVFREHAIEEVVYDNLWPPEGLEPPNLHELYDLDVNAMDDLHWEEVGQILIPDDFLPLSLNCYEEFFPDP 1694.365972 3.176125038 0.620376315 5.119674885 3.06E-07 3.77E-06
TTTGGTTACATA 1.339695443 0.113127173 EMC Both of them Is known to be FDFCLEPLDIPLCWDWDAWEQDVNAEQNWMPPGESGLQQPLDPYDLDDFALALDEAARTNDWQIYMDDGTQMFLDALEIDDMSGH 729.721501 2.14502596 0.505748046 4.241293621 2.22E-05 0.00021984
CGGGGGCAAGTG 1.081667821 0.114203045 EMC Both of them Is known to be DTGKVLQAPNQEQIPKLPLFEDLAAYFDQEYCPMWPGPLLSGELEGWKESGEDAATMLENGDMLDNVMFSLESWPLGPEDFHGGP 1539.741525 3.596535406 0.688770006 5.221678317 1.77E-07 2.26E-06
GAGCTGAGGACG 1.880670894 0.114203045 EMC xgb Is known to be TFFDTDNIGLSNDIPDLFYEIAYMGWPPLADETTQHNDTCELHTTHPETDILLGGVGDMLGDGSWFPFGDGSDDLFDLLGECWAQ 451.6239885 2.733873653 0.858988506 3.182666165 0.001459258 0.010346324
TCGGCGCGGAGG 0.982952819 0.116039372 EMC cnn Is known to be GHAIEELDMLLSLTEIDMTQDVAADHVRHSPPYSTTGGAMNLVLGNQYSNTQAAPAEWDGMASPQQEIHPLLELMPTIYDFEPYT 1379.701448 1.371728602 0.456385472 3.005635996 0.002650261 0.017725629
TCCTAATATGGC 0.91640232 0.117603625 EMC xgb Random arrangement FPQEAEDTFASVHTCCRIQDASDAPPFMPSEWGAVHDEIPDASQVLWPNCTDWQDFMDRFLGDPTYDCFEMPFGADEQYDYWPMM 3895.466532 2.3838148 0.417232455 5.713397339 1.11E-08 1.62E-07
GGGAGGTGTGCC 1.050747733 0.117911342 EMC xgb Is known to be GQGGRSQPGAGPPGPSAPESQAMEFDALPSMMPDDALDDFDLTDSDPSLAQPTWDHFTYALADPHSSQDWQMFTDALACMFFDMG 2225.309254 2.709289583 0.571290693 4.742401053 2.11E-06 2.38E-05
GCGCGAAACTGA 0.95914936 0.11912013 EMC Both of them Random arrangement DGMDGWLTEWVDTLLLDQPIDDDALCNTLPWFNSCNSWDFPVDCDCICPWNNYDVHDELFAGGMINGPLDNFDVYPISGPSNTSS 2802.450263 1.861924915 0.308376728 6.037825646 1.56E-09 2.49E-08
GAAGCATAAGAT 0.859240401 0.119318766 EMC cnn Is known to be RDMPGPQYYGTMDMDDVIPITEHLSTDFLMGFLDDMDLDMGFLEALWTDTAPANPGSQLPGDLLMPLAYGGSGMALGVASDGSTE 2148.497233 2.726834013 0.341757215 7.978863054 1.48E-15 4.07E-14
TGTGCACTCTGG 1.524932109 0.119767579 EMC cnn Is known to be GWGSSYSSPPVVTGPSHPFLSGVEQMHTVFRNDYEFEIPPDLYDIDAPEEALLAAPDVLDWMFQIGISTDTTPESQDSDQDLNDY 419.4714928 1.473969395 0.512070328 2.878451095 0.003996332 0.025588919
ATCCAATCGCAC 1.137545328 0.120571206 EMC Both of them Is known to be FAELADIPCLNACIPCDEDEPMVQEAEWFWNICSMPLPMAEGALPERQAMEPSQLSPLDPGDLMALLETWDPYSSDWGGPEHPAM 760.5860083 2.230861367 0.722297322 3.088563809 0.002011265 0.013863462
ACCCGATCAATG 1.714480525 0.122353608 EMC xgb Is known to be ETCDLLDIDWDALLDPSDTLTGGVGLDDAGPESFFWSSIDSTPAFEYENLEDNGGGEGGCDHFQGMHGGDDGLQGEGGPGNGPWV 439.443782 4.302052098 1.154562996 3.726130245 0.000194442 0.001675921
TGCCTTACCGGG 1.086437767 0.122906307 EMC Both of them Random arrangement DPYSAFEPPADAHETSEHDIMDLLGTLHMNLWPPIDSQGTIGQSGCLADAPDGASEQAGIDELLQYDWDLYCEDCFGIYGDTEQW 2023.030842 4.013622178 0.431849319 9.294033825 1.49E-20 6.46E-19
AGGGTTGTCCGT 0.924608517 0.126140328 MH cnn Random arrangement KQSTVGPSLMPQNPQQAWPVVSCDSDDFPQSPYVNHYEDYCPLASGLQTDDSPYTADWFEDLVSKAWQENEEAYQYEFGLPIFES 1853.417923 1.740973686 0.62281384 2.795335577 0.005184587 0.031995007
AGAATGACGCCG 0.81634067 0.127273243 EMC cnn Is known to be TQPPVTTPNMMAGTILTLNDELVFDCIDLDNWRNPNIELLSLESPMDVDEWLFALLDEAENMDVPVGSYGGCGMDAEGGSECGDR 1845.18406 2.571139028 0.550886889 4.667272136 3.05E-06 3.33E-05
GCCTCCGGGAAA 1.379460279 0.129668642 EMC cnn Is known to be NFAASPPSMEAALMDIENEAYEFPADEDQQVFLLPPTPFQTPPLDMTHDLLMDSPEDTGHPHDLPMLFSFVLGEDGFNVAYNEFY 947.8530223 2.196132798 0.49477287 4.438668595 9.05E-06 9.40E-05
AATACGCCATGC 0.776969229 0.13009284 EMC cnn Is known to be HDDSVIDGSIMDDSNPNIPLLQGYDLGTLPEFPDINDLDDLWKQAWHECIGDPSTTNTNFDSMLDALDPDAGTNLDAQTECYYAM 4553.881125 1.596327744 0.323888289 4.928636816 8.28E-07 9.77E-06
CAACCTGATGTC 0.690968709 0.131316944 EMC xgb Is known to be WQWPPDELWSINMQPHQPFLGSGWCFNPFGGVDFADNPNDMIMHDLFNTQESLDSQDMLNLFDDILSLIPAEPEGAIDPEPINPI 5292.507496 1.761620623 0.254737571 6.915433065 4.66E-12 9.51E-11
TATTGATCATTC 1.051778683 0.132745006 EMC xgb Is known to be TFDEPFMSNEFTALLNNLLTQGPSLQCEWVDWLGASEDTTLEFWEPWTEDIEQGLVGLGFGGGGGGGPSGDHGDTCYVKDPTLQR 610.5050665 1.392790475 0.513682205 2.711385485 0.006700268 0.040201608
GGTTCATAAAGG 1.973158256 0.134443376 EMC Both of them Random arrangement VFDAWLDVVDEENEIFADIMPSCPDIPEPQWREIHHTFTMPMQEEDLEVGQFTFPNVEEANMHSGAPLSTGYIWPQGQDEWNFLD 446.827619 2.645554067 0.773582663 3.419872489 0.000626505 0.004866706
CCCGCTCCCTTC 0.884508089 0.137123412 EMC xgb Is known to be NAYFTNSTYTDDARALPEQHTNLDDYILQQTWAISDLELGSLSAEIIDELMCSDVFPEMYLWPTPAMDPNGPLERTEWTPEKPPV 1509.242267 1.499147989 0.468586479 3.199298433 0.001377625 0.009852105
TCTCCCGACAGC 1.370895717 0.137123412 EMC cnn Is known to be YHRTGRDLGMDGNSSEPLWPSDIEEVLASQLPPPNPGDLLQGHELYMPDHAAPELPNETMIHQMFWDYVALPEYEFDDPTQWPLP 720.828702 1.64531359 0.488533237 3.367864182 0.000757529 0.005749332
CATGGTGAGAGT 0.750039511 0.138495474 EMC Both of them Is known to be QPAHTDDGEICLPFLAGGLLGPFVDGDNDMTPYEIMDMMDLPDLLSTDHDQLDFDALTDNDDVTMSAFLAWLESVVTFNDNAYNF 6493.443173 2.069861111 0.46753342 4.427193914 9.55E-06 9.84E-05
CCGTGCCTCGCG 0.841813767 0.139967567 EMC cnn Random arrangement AMNPPSEFMWDMFSDLCALDSTSDQHLMHGQTEFSTPSTVLMSGYKEQIMGVGPTDFDQVDDPSWDSLARWFIEYTDNDPSEHWE 1783.2821 2.060278246 0.391226449 5.266203885 1.39E-07 1.80E-06
CTTCGTCTATTA 1.092757219 0.140466832 EMC Both of them Random arrangement HRDNRDTDPNSIKHPHWDGAPDECFNELEIESLSYHSAGTTILCPGLSQRDQGILPGDLFEFDDPIEFDFNFWNDLATELYMCPE 654.1056356 1.75043238 0.574931544 3.044592699 0.002329956 0.015818168
CGAGTGGTTATG 0.788674218 0.141114962 EMC xgb Is known to be LDYPSSPSDYVGSIDLDDFDIDGPGATVAPGGPLRQDEPPFDSDKSDWLQMLFDLGDWTEDATSSWNQTEFPVCAEYDQPQSVWF 2851.125071 1.896178979 0.400723652 4.731886854 2.22E-06 2.49E-05
GCGATCATTGCT 0.777328062 0.141114962 EMC xgb Random arrangement DTPFWPDFPQDLSNTNAVARTPVDVQVVDMCDPDYIALNFPYTNVPCDEEWLSEFDNQDLPDYEWFAQWYMSEVPLDYQDAMSTE 4794.111117 2.442797869 0.281561209 8.675903467 4.10E-18 1.38E-16
ACCACCACATAT 0.865334629 0.143952846 MH Both of them Random arrangement DLGDYPSDGSDGCTMTFFEPLEGLWDGICGCDNWSDVETPEDHIFDFEYLQRMDEFSYQEFDVSQFLGIGVPFGPQYDTQDIIEE 2174.457799 1.385691435 0.393247099 3.523716864 0.000425539 0.003396087
GGGGCCGGTGAA 0.942643042 0.145063809 EMC cnn Is known to be FDDHINDWDELVGQPLMDPLMEFAWLDYECVYEQAPLTLPQSQIPSDPPNMWHEGTCMDSLLPEPYMTEPIMFADVGGDAHESPN 911.582063 2.604193938 0.661843186 3.934759766 8.33E-05 0.000765743
TAAACAACTTGT 0.822855448 0.147496774 EMC xgb Is known to be RPAFEGMWTWMSNDWSPGMSKELAMVEDSFQYFPVNDDFDMTLLGDTELLDDEDLATAYAWPNMDPDAYLNQWTDETWFDFDAHQ 3150.937235 2.012230767 0.473667302 4.248194375 2.16E-05 0.000214032
GCGCGCCAGCCC 0.905511852 0.147988204 EMC cnn Is known to be GDGGEAYMPPIPKCNVSYLGDECCLGDECQGPLWCGLDDFSGFLGENSDSWAFADASWIDAMLEEQFGSSLDMIEEDLLACMITP 3557.271502 3.732694471 0.332945224 11.21113684 3.60E-29 3.24E-27
ACCTCACCCCAT 1.649034885 0.149677315 EMC xgb Is known to be ELSACECPSQQNPDEGPPSKVPHLLDQTQDTDISNAGSLSNDITDALMYLDVEPVPFSDIGLDVEMDDWPLEPPCELLDVLAELG 603.6420232 2.955048036 0.789298829 3.743890056 0.000181193 0.001575425
CTCGCTGTGCGC 0.877769998 0.149677315 EMC Both of them Random arrangement APPCHPGPSSEEHDPLIIFDMYPGDLPMYTPHWQSLYDDLFWEEHCGDLHNMAEVMNNHVNDQFHTDELGQALDDGLLLNEFLPP 4820.78272 1.708532464 0.256901782 6.650527873 2.92E-11 5.50E-10
TATCGTCTTACG 0.827346521 0.149752932 MH cnn Random arrangement TMFDANTTNTMSTCIDPNPVALDCAGTCCCPGERPANAHPVVFDNKLCFEQFELGHLNELCEVDPTCACEVSPEALGWDWWDVLG 3921.041679 2.525138622 0.360561879 7.003343306 2.50E-12 5.20E-11
GTACTTGGCAGG 0.774319956 0.156685928 EMC cnn Is known to be GDGTSEQPGARIPGDPQAMSQHLLHYLTDINSLAEDVNGLNDEFWTDLFTNDIEIGSWEIKEFIEGMNFEQYGQGGCGPQEGSLG 5182.697259 3.306274335 0.609589677 5.423770221 5.84E-08 7.90E-07
TTTCTGGTTTTC 0.81132468 0.158312006 EMC xgb Is known to be TCEGPSSNDKFAPWLQDLHFWPPPQVLDFPNEDEGISPILAQPQNHMNTDTESPPWHHLNYSLYPELEEWDHNDFLSGPPSPPPE 3130.614206 1.52351491 0.435799518 3.49590774 0.000472452 0.003722531
ACCATCAGGGGA 0.789124891 0.159109547 EMC xgb Is known to be ATVAYPQSASSDSSWLPEDVDWMHVLDDAFDPSWATLLAMSMSGDIIDEEPDAHDVPLAPISPGNADGLQKCWDFFDCWWDEYMQ 3922.346339 2.05213335 0.377427458 5.437159664 5.41E-08 7.36E-07
CCAATAACCAGC 0.883991653 0.159672983 EMC cnn Is known to be LNDNTMGGVAPMDYNFGDLPDSHKDDFLLDSDVYSDLSDFDLGMLDSELAQDSVDAAMHISELMEDLGMFGEWFMPVFDSSCWRQ 2384.834045 2.327009575 0.368899535 6.30797643 2.83E-10 4.88E-09
GAAACTGCTCAG 0.888296552 0.164356207 EMC Both of them Is known to be DAYVDSMVTILNDLECPLFDFPEPETPDILGLPTVFFDTAISGWSNHSLGDPHFPDLMEESGAPPDGMTCDGGGGGGNGGGQGGI 1265.416188 3.282965396 0.600269781 5.469149873 4.52E-08 6.23E-07
TGATCGCTCAAG 1.047420386 0.167242841 EMC xgb Is known to be VPDGPDLDAITDIYDDSLLGMSVVDDDDLWGLGSDALDGFDKDMFQSDWWLEEIWGDVGNPALEPSPGQSAGPAQCNGLWGHMFP 2184.452332 3.83036588 0.614101868 6.237346084 4.45E-10 7.55E-09
AGACAGAAAAGG 1.061364816 0.167624297 EMC Both of them Is known to be SPGMTPADSSASIFELLCPIEPCPPPLYECHEPDLYCPSFWELFDPVNANYQGIMDALDENWDNVLPNQDMNASDRGDGTFEMVQ 1505.276674 3.598449421 1.052197999 3.419935625 0.00062636 0.004866706
CTCTACGTTGCA 0.785360417 0.167624297 EMC xgb Random arrangement MDNPYMSLLQACLESDHDETAYTDSQATYLPACPPWPFDFLIPDSPQWLGDTPDIVGDACPPGSEWCRADFDFNSEFPIMSPWFE 1803.702602 1.796158361 0.388047627 4.628705952 3.68E-06 3.99E-05
GGGCCAACAGAC 0.963312452 0.16855821 EMC xgb Random arrangement LIECHPHMMGVQPSPPWLGDDIPCVLDWPPLSYTEPRANQRHYATMQSMEEAIDLDQFFENIDQPCFDLQDPGCDCEGEAGCGFT 955.0232539 1.9197796 0.493573533 3.88955135 0.00010043 0.000908266
TTCTCGAAGCCC 1.228397665 0.174234659 EMC cnn Random arrangement YDEKPEQIGVWSPRAEANDDICHDHTYMECPHAETTSLYPLSLFDSETPELGPGDDWGHFYNGVIPSPLNSMIEQPRWETWDTMS 1052.602746 1.332118272 0.503339646 2.646559398 0.008131522 0.047579486
TACGTCCTATTC 0.824773638 0.17615156 EMC xgb Is known to be RFSEAFGGWQWYDMFAGMLDTSFYGLFPELLETAEPTSMQPEVQSTELSDLGDLSELLCETPSTETLANFAQHEWSHGPCDSGSD 2479.238709 2.446205101 0.407149284 6.008128211 1.88E-09 2.96E-08
GGACCGTCCCGA 0.938027023 0.17693705 EMC xgb Random arrangement NLDRWFADHLMDFWDEFDCAGFSFMMSENMNMHGYANVDWDMDNHAFEDILADWVDGPVISELCEELMGPDDPQPLLSSQDGPDD 1872.750874 2.143943732 0.483451856 4.434658188 9.22E-06 9.56E-05
ACATGATATTGT -0.718346654 0.184302798 EMC cnn Is known to be PEPGPTNASMPSQHLQMILYPEMPCVFPSDCDMSSRGSLHDLFDFEVDNWESTVNMELSPAPRWDLFGPMDNPAAEVEMSGDAAD 3167.798943 1.38857364 0.324203735 4.283027895 1.84E-05 0.000184591
AAATCACATCCT 0.966085658 0.187914817 EMC xgb Is known to be FSSQPSWHSSIMNLDDLPELPGSACDESLVNMVPPTDLSDLDNKLWWDLWIPSVVSPLLPGDVDALLEPWMPGDPLEQLAGMGLQ 1422.799477 1.996174128 0.420974801 4.741790064 2.12E-06 2.39E-05
CAGCAGATCCCG 0.726010188 0.188652742 EMC xgb Is known to be GGQGSLLQSETDLPAPSLDTREPSDGIVPDAPQEDPFWADIECILGFDGDGLRDLTFPFASLLMGDDFFEQMIENTDLCQMWGGQ 4025.422455 3.101427564 0.477351712 6.497153957 8.19E-11 1.50E-09
GGAGTGCGTAAG 0.72994988 0.190380927 EMC Both of them Is known to be WHRYSQQKTVVDVYPEGSGSKWESEQEPPCDSMNFPVVEHSIAEILENALKSDGSLEALLQDMDWLMRLDCDPPTDGPLGDDLGL 2701.452007 1.363041208 0.338687591 4.024479327 5.71E-05 0.000535976
ACTAATTAACCA 0.750381799 0.19071445 EMC cnn Is known to be EPDPSSWAPPCYDWEPVLHMPPIFPECMIMNPSETLDMELEDMDLGHPAQGDDFNALIDASLDKPDATFEQMFTICISGPDDPED 1582.003327 1.682985896 0.443220599 3.797174364 0.000146355 0.001283778
TGGCCAAGGTCC 1.105403881 0.193843295 EMC Both of them Is known to be LSVSDPFFVAQMDLFPEILGAIPEEWLADLLPPLAYHPDFPLDMLAEPRDPGADLQDLGSTGMWWESMDDLGGPSGPDLVCFPNK 1151.661159 1.287777828 0.33370124 3.859074148 0.000113817 0.001021882
GATTGTCAAAAG 0.731315186 0.19953724 EMC Both of them Is known to be VPSDSTPSGFTDNEGDDDKPNEAWSHAPRYSYMPEWIALHSHEPLISDGAQLDFDCLRDLDLTGDAIFLDLLFDFDNEMLGPVES 3399.020691 3.136484112 0.367700422 8.529998676 1.46E-17 4.77E-16
AATACTTATAAG 0.881799129 0.199616048 EMC cnn Is known to be GHDASSDGSLLPDLFDELHQEHETIDVPYMPQLEGFTLDMLGSHDSQSEWSLPWDHAPYYGHILPPALGDEQWLTTTQSDEDFDI 1528.373989 3.245029898 0.480569421 6.752468548 1.45E-11 2.86E-10
ACTGCAGTGCAC 1.018304543 0.199616048 EMC Both of them Random arrangement DPADADSMSAWDPFFHELLGDDMAIADQEPEDIIHFHPCPHDTCMEPPLHEPCCPICDIFRDLTEHCNCFAELEQFFQDYVTAFN 1102.156268 1.701588922 0.478341795 3.557265828 0.000374735 0.003074812
GCTTCGGCTGCT 0.864103099 0.199616048 EMC Both of them Is known to be GSSAMDLDDDVSEFLSAMDVAPDPVEDPQGLHLDINPLGIMTWEDAIWLCADDSSFHNIPGLDPGLCEALADDDTDFVWWDNWDA 3775.169679 3.389468583 0.470296985 7.207081257 5.72E-13 1.27E-11
TATATCTCGCAC 0.738304078 0.199634827 EMC cnn Is known to be ADSSDFNPNPSMMAIEDLWCSEMSEFAPDLRAFPWATPRTEPTETYNLDWPSGVEGTYEPNDQLSFEKDILDELYCSPKIPPDTI 1958.09611 1.155048208 0.353353664 3.268816275 0.001079984 0.007941248
CTAAGGCAATAT 0.85677454 0.205982771 EMC Both of them Is known to be ASQPSGPEAGTGPEFHELYDSANVGHQASLWEALSQMDQDGLNLINIEDIDLEMFMHDPFSLHIGEAWTVPLEDGLGVGMGTLPL 2340.591618 2.447845957 0.649723336 3.767520455 0.000164877 0.00143861
CACAGGGGCGCA 0.908792714 0.210483376 EMC Both of them Is known to be TEDFPHEPMEYPSCLPDTLWSDYFSGEHLSDCDFDNLCDDTPHDGLDVPASMNLLPTPVDTPLGESWEHLFAQIIGMNPWMDDIV 2207.87661 2.135603302 0.445187316 4.797089281 1.61E-06 1.85E-05
GCAAGGGGGGCT 0.776215386 0.211748347 EMC cnn Is known to be DWPDEFSKLLSQWMDDMMDNCEFPHDLPDPPDNYIRFCDPFDGLEVYESHHSILGTDHVMPDPFQIDLPGDGYGADNMGVMHAWY 1776.57976 2.218119515 0.547806408 4.049093771 5.14E-05 0.000486297
GGCCCGGGGTTT 0.817717846 0.222655721 EMC cnn Is known to be DFEVDASQPQYEILQTYKDSDMDEQCYCPSLCPQGPSNPPPASTAPELDEMVRDELHLTELDNIQLWMYLLDDMFEQMSGDGSES 1225.043979 1.770111195 0.667937666 2.650114353 0.008046453 0.047193162
AGTCGAACACCC 0.727634829 0.223162259 EMC Both of them Random arrangement ERQPDIQAYMDQCNPMQPEMHDDAPQPLNPETPEPSPSCWSQYDQDEVFNFIGSLHSMYMEMFTPNPNLDDDAILSLHWGEHFDQ 4171.659716 2.647979277 0.419613628 6.310517823 2.78E-10 4.82E-09
GGTCCGATCACC 1.101864779 0.228381404 EMC xgb Is known to be GRADPTQSGDAWIWEMMGMLPPLDSLKCFSDMYSTDIPCAPPIPTWHDTPSTEGVTDGLDPLPPSIESLQELEHLLGIASDGWKQ 1044.003636 2.85368111 1.0726055 2.660513217 0.007802167 0.046142646
TTAACGCGGATA 1.870729484 0.229564004 EMC Both of them Random arrangement IDYEATNLDHLDWMCDEPFGFCIHEALATANPWPPCDDEIDDNAMLDYMCEETPNHTFVPWEGSDWLGAEQEMVEPEHMPWQDEI 202.7059562 2.183549628 0.825269736 2.645861751 0.008148311 0.047621497
ACACCTATTTGG 0.980666988 0.231289336 EMC xgb Is known to be TNVGFSDHTSIPIICNLPSIPGPYHGGTLDAPVCYTDFEDLMNELPSDMCFDEGGQGDGQPFWEEVWGDSFPISPETTWPSWVSG 847.066082 1.608249803 0.507938808 3.166227461 0.0015443 0.010899739
CCCGTTTTGGTA -0.634221562 0.231654794 EMC cnn Is known to be AAPAELPCDIEPCFCITDNLLEPCVVQHQPLTPYSGLGLLDLDMESLVQPDFLLGDTADDGWDDMWQASPSWPPWEFLTNMCDLF 2522.78001 1.33651969 0.324454141 4.11928689 3.80E-05 0.000367156
CATATCCTACCG 0.771618896 0.237927275 EMC xgb Is known to be QMEPASKEAQPLCDDPAGFSTFFVDDFPAPSEASFEAQLELNTPELFDNWIEWMRGTFNGQGGSGRALSEESNMAEHGLDWWMVG 3883.020538 2.396531204 0.485440567 4.936816917 7.94E-07 9.46E-06
CCTGTGACCATA 0.670036229 0.241524211 EMC xgb Random arrangement EPSNSPQDDEPYNTDMGPLGFTPIEFCHFWWDDMYQGIPPMDIGQEGTTEGQTASEILPMNYDSVGDDTFDQMLQLVDIPAWMFY 2563.093698 1.481670756 0.367154226 4.035554138 5.45E-05 0.000514231
TTGTATGTACAG 0.826851364 0.243605834 EMC cnn Is known to be GGGGSAAPGECFDTVLMGELEAMLGHHSDKNPDVDSLEDNRLDLLSDDGAPLPDLGDLGFGFFAAGMDGSVWPMGAVCDQDTTGM 2380.406986 3.076254463 0.410099984 7.501230395 6.32E-14 1.50E-12
ATTGATCTTGGG 0.83934367 0.245107056 EMC xgb Is known to be GNDDVWPDDTPMDFCQDWWPDNLFDHTAGAKGFGKGDCWLELGLNTDLDLRAEPLDLDGLGMLMDWHALLPDLGPVGGASPNWEN 1271.856581 1.524570638 0.486393357 3.134439682 0.001721826 0.011985073
ACCGCAAGTCTA 0.899533559 0.248186901 EMC Both of them Is known to be DTQSPFQFYNDDDLFGPVELESMTFPHVGLNYPPFELPDVELPPPAASDPPLPGVINSNGGFGHLGYELDWHRMFDCPEPLPSSQ 1924.158799 2.281281913 0.507083574 4.498828262 6.83E-06 7.19E-05
CGATTCTTGCAG 0.608898928 0.250374722 EMC cnn Is known to be DWNGIDILFDDPCGNIEHSLLEPSETAPTKASYFCATMFPLEELDQLFDCPEQTSPFDMNMLDPDMDGWSGQQLMDFAQDYDNGA 5863.041743 2.138178245 0.329232565 6.494431209 8.33E-11 1.52E-09
ATTCAGGCCGTG 1.075535235 0.255890949 EMC Both of them Is known to be CPGGSSGANNALEFNIDLVDTHIWEDFCTLETTSLPGSPWAWQDTWEDELRMEPDHLMGLLDLEYPADLMPDSNAFELAPDAQPA 2727.814349 2.383112206 0.526351405 4.527606807 5.97E-06 6.30E-05
CCTCGCCGTCGA 0.889816636 0.266667261 EMC Both of them Is known to be PTDGADNAGFSTSWSPSDVGQGPWDGSALYDCLSGLMGDVLWEVVPELSDLDQMGLLMAETDPMLPAQTDEGMEPGNIEQDSNGE 1219.507193 2.190723058 0.70957249 3.087384429 0.002019263 0.013880394
TGAGGAGATTGT 1.093675198 0.2710033 MH Both of them Random arrangement IFIDPPETHSCPEEPPIPPSHWLPIPDDLLDQDVLDWMGEDSTHDNAFDPWSPTFPNLADMSYNLPYFVDVEMCQFMEDSGPGWP 899.1035928 2.182703471 0.544946663 4.005352486 6.19E-05 0.000579058
CGACGCTCTCGG 4.686639397 0.274115429 EMC xgb Is known to be AEMDFFHLTMPQSGYPLMGGRGPQPRGALDELMWSAPPYGLDLNLDSPDTSASCNMLDQFLNDPEMDRAMCDEDLFMLLGTLEWD 72.6377251 9.41060398 3.344589759 2.813679602 0.004897802 0.030379857
ACCCACATCCAA 0.718279287 0.277580839 EMC cnn Is known to be DTEDFGWEGMPSNDINGLVYDEDGFIGIPDVADYPTPWLTPMPDMDSILPLMYENIFALPNIGGSQEGHQTGATQQMHEQGPFYE 1606.858897 1.81728031 0.354491372 5.126444405 2.95E-07 3.65E-06
GTGGCTACTACT 0.827400643 0.28182294 EMC Both of them Random arrangement NTDFFNFDGECSDWEAWANTIAEPDFNTIYYPIESPEPEWLGQMDLPDVETAYAPFCSNNLMDSDPTPTGRGSEVEELRWWIHDC 1223.290926 2.453152364 0.360370525 6.807305795 9.94E-12 1.98E-10
CCGAGTGGGGCT 0.688587451 0.282567609 EMC cnn Is known to be VWRAHIGSPSPFMLGEHQPLFFPMPDVDNDELTTYEESMMPTLNLLSPGSEGVGYAMDDLLEELESMDGFWYGSTLIDEETPWLG 5695.718253 1.316300758 0.37561303 3.504406538 0.000457626 0.003611457
TGCCGATCGGTT 1.130064678 0.284327559 EMC Both of them Random arrangement ISEIQHFLQQHFNTCDDYALEPELDVGDSSPGWEAPVQSASGDLSIQAMEYTLADFFQDLNMWEWPNPDAGSWTLVDAMDGDCHT 810.0614043 3.366390864 0.609827573 5.520233936 3.39E-08 4.74E-07
TCGCCCAGCGCA 0.884188333 0.285054981 EMC xgb Is known to be ATILPIWADEPNLFTWAYTPRMEALQQTCATDISSAEMGLPGTYANEPSSEDFWGLQGPNPYTDSETEGWDTNAFWDILGNMLSH 849.5343651 1.484018786 0.527132835 2.815265313 0.004873698 0.030283245
ATTACATCGCAG 0.846304373 0.290832309 EMC Both of them Random arrangement QTWDFWDFWQGIGNISQEQEIDEIETWVPETSWTNPLHSPVAIPSHMPAPLVMDDLYLQNDLPPPVDDDFLINYPGWALEDINLT 1104.889343 1.8572555 0.394026443 4.713530097 2.43E-06 2.71E-05
CGATTTTACCAA 0.68176782 0.293505835 EMC xgb Is known to be PDCEDLLQKPEPDQPPSDYPFPEDWGPCDSSKLETDSLMCDTSVDDLILQGLEHDPLLEMVTGGLGDTGYADFPDGEDLCLMIGE 2126.216361 2.053888869 0.564297247 3.639728672 0.000272925 0.002300372
GGTTCTAACAGA 0.737498295 0.297840467 EMC xgb Is known to be VMTDCEFLAFSSADEGPSSFVPRCDVLAYDLWPPPHTMFPNLHELCESLTDPPTQWEPPEVGEDLFDESMVAHNVFDLMRGDFAY 887.2641127 1.399483623 0.509677275 2.745823075 0.00603593 0.03661453
TGAGTATTCGGA 0.740718909 0.302635669 EMC Both of them Is known to be QDQGSTLPDYPTFEDTISSWEPVWQECDQGWRGGSTAPDDFQGTSEHHPMTGITQPDMLGSDALDDFDLHMCGEWFAWINSHWMA 4838.185999 2.995752845 0.343560561 8.719722767 2.79E-18 9.53E-17
CTATCGCTGAAC 0.671213483 0.302640813 EMC Both of them Random arrangement DTPDEECPIAAHVVNSWHAEEHWGWDAWPPENETSWNTLFLGNGINPDDPIAGLDLDLMTFDWMLEMPDLWRMSQLSMSPTMDWD 4318.900939 1.700863478 0.395058453 4.305346373 1.67E-05 0.000167604
CTCTGTAAAAGC 1.303717536 0.313507356 EMC Both of them Is known to be FDTWKSDLFTALCDDMAGFCEALEKSFDLDNWWAPEATSGAGQTSLEDIFLDSPGDHWGLEQAPVVEGGGSSLDQVMEAHAPSGL 309.6516881 2.051569964 0.753066141 2.724289213 0.006444005 0.038852176
AACATCATCTTC 0.961064201 0.318525631 EMC xgb Random arrangement AHDCCYTALCPHPCCEASGGASDFGDILTSMPYEPPLESDIEWQVYEHCFTPDGIYTLFDSEQYIDMGSVQEWLMPWGMDGNAEQ 2734.436204 1.884451 0.398470056 4.72921609 2.25E-06 2.52E-05
AGTCTTGCTGAA -0.627394352 0.319126862 MH cnn Random arrangement CCPNCLLEYAWIIDICQNNWDDWDPFPDPYEIIWMMLITSQCHDEGDRDIGRTCVVHWDDMLCADCENNDEMNMTNMIVYQNNWW 953.3182969 1.526672344 0.576406769 2.648602388 0.008082536 0.047348758
TATATGGTCCCC 0.814198401 0.322057603 EMC Both of them Is known to be SVSDYSVATSNGMSVMEEDGVDWFDDFDLDWLGSDALDLLDLFDEMPALGASRAATMSFLSWQDYMGSPVGNMVGGGVSPGDDLA 2461.809222 2.541314482 0.343766574 7.392558427 1.44E-13 3.32E-12
CAACAATGTGCG 1.161817453 0.326211185 EMC cnn Is known to be IWDIPPLDCNWVGITDDSSAACGQELPGMELPEPMHPYPPTLHPLFDLQNVAPPDLLWLFPDENCQMWLDFLVDEGSAPCNGPMC 591.6430595 1.313529656 0.498521542 2.634850341 0.008417439 0.04890601
TTATATGCGTCC 0.588641541 0.327793336 EMC xgb Is known to be YPNNLLEDYWPLENIMVDDCRNDEDVGLSQQEPWTELIELTLHELYDFDVTAPEFPFEEAVSQAFWDLDEPNSYYQACGGSGVGD 3234.682381 1.241697054 0.349945585 3.548257517 0.000387789 0.00316099
ATCACTTCTACA 0.557961861 0.327909582 EMC Both of them Random arrangement DGGPHCVCLPGACQMPCPEPCRTNFASQWALYDIPMMIYEHPPDWASLPSPQPQDSNSWIEDAATMLASDWSTSDFVDIWSILDA 3638.333877 0.977918067 0.289828883 3.374122192 0.000740515 0.005637468
ATCTTGTGTATT 0.625604153 0.330816936 EMC cnn Random arrangement IGEDTPPCDMIMTQMIEAPPSDGPGEFTPCNWRGSATDTDVPHSQEDFNLATEIMLDALMSNPQWGPNTNCFPDDMLVNLDISHL 2675.816032 1.056669974 0.265115837 3.985691636 6.73E-05 0.000627983
ACTTTGGGGAGA 0.976978052 0.335376699 EMC xgb Is known to be VTDPPSQDTSSYNGPVLPEIPCGTNNCDLPEFVWNFAETCQLDLSSECAILMGEEGAFPGLHGQKYGDNDIWFPSLEGGMPGGTD 1600.044252 5.105784128 0.564110804 9.051030567 1.42E-19 5.66E-18
CTATCGGGGTCT 0.7997343 0.336648317 EMC xgb Is known to be LDTQPSPEAAEHVSLVPEDNYYTTDFDDVLKLLGEGLLDLPDLLQAPDWTLGIPEMVAQHQKRQDSNVSSQLDLLMDGGGDGPGP 4109.407625 3.287593373 0.564673805 5.822110652 5.81E-09 8.73E-08
CTGCAACTATTC 0.567200119 0.340247909 EMC cnn Is known to be HPDMEVIDLPILIPEVTDFFDQDVAFTFGDDGPEPPTPWQTPSHCDLYDNNVDVGALDYLEEMTTIGMTFAPPVESGLMNMTPFE 5020.042545 2.234948084 0.461810758 4.839532309 1.30E-06 1.51E-05
GTACGTTAAGTA 0.828811083 0.346536882 EMC Both of them Is known to be TTPGTYDHDMYKLLDNIIFQNPPAFEDLSDLDALWMHGPVIGTDHDTWHDISEMTLEPAKFPAAFEGSSLHANFLDDLSNAWDGE 1367.714446 2.128566199 0.699472967 3.04310002 0.002341545 0.015853411
TCTGCAGGGCTT -0.995673789 0.349490811 EMC Both of them Is known to be APPVDPQKSDQAHQNLPEMHDMLWVDLTDTSSDAWGPNQLNDEFWADLFMMYDDLWPGGWSGGFGQDEYEMPCVMGTEGYPSDPP 1456.627299 3.051930118 0.911163471 3.349486908 0.000809614 0.006107224
GACGCGGGCTCG 0.751155859 0.357076179 EMC Both of them Is known to be ETCQGGWQHMTELWPYDIFLGEATEHNVTDVWTFNPWNDLSPDAAGLAAEPLSHFWDDSLNWLDQLELMEDFCGDGEDSNWLTDI 2091.555501 2.055959265 0.330405997 6.222524063 4.89E-10 8.25E-09
GCTCTGTCGTTT 0.55319048 0.360260723 EMC cnn Is known to be STESNGEQTNEDLGETHLDYLCDTDTHDQFWTDHEFYFPPDSLDDLDLYSAPDVWGHFSALYAPGCPDGGAQFHEGSNPDQYWAM 5125.903758 2.465082427 0.397479536 6.201784496 5.58E-10 9.35E-09
GCTCGCCAGCTA 0.475918939 0.362948388 EMC Both of them Is known to be PDDQPGHPEPHASPPTTPGSPEPEPSHFPWDPFVYARSESWEPPLSSNGIFDIFQALEAIDMVDFMLPTEPQNWAPAEPQSRNFS 8362.731949 1.564790475 0.280764751 5.573315281 2.50E-08 3.56E-07
CTGATGCAGTCG 1.164383598 0.366500604 EMC xgb Random arrangement NDSNFEDPDWAIGMPDEYNCLGSPFMSEEYTGSDREAYLADLDDLLGLWDFMDEDLYVEEMAGLPEFEPGTNNHIWFTDMCAENP 745.957691 1.532250368 0.495037927 3.095218132 0.001966682 0.013596756
TCTTTTTATTGC 0.539307844 0.37400568 EMC cnn Is known to be EPCPETRDLDEWIDDYFHEFMAITMDVPMLPPTAGQTPSLDGLFETSENVSTEELWFLGGGTPYMSMPSSILDPWDHIGLGWSMD 5904.427796 2.02845355 0.366143922 5.54004431 3.02E-08 4.27E-07
CATTGGCACGGT 1.318288602 0.375368086 EMC Both of them Is known to be FNCDGPDTPPIIDQITEILEMCLQGDVSFESLDDLVLESLDCFSPEPSLDSMWCDVNDSSIVPEGPSEGFMNSEIVAAPQMDRHD 419.025239 1.617145476 0.577794138 2.798826381 0.005128871 0.031694118
AACGCCTCGCCT 0.582258975 0.377104419 EMC Both of them Random arrangement EAESWQPSIIGEPMAEIMDWPCPQEACCNCSEQVNMGSCCTSFNLDPVEGHSGPDIFLGLDNDEVEELFDWHVESPNIMPPSDGP 4596.953595 1.319788248 0.443791564 2.973892151 0.002940484 0.019430719
GGAAACGAGTCC 0.716344302 0.377104419 EMC xgb Is known to be GGGGSGSMPPTPEDQGPIWWNPFADYTDSDLPSGPMRVTSDDAILDFDALDYKPDPPCVDFQWYDMTPADASDWFDLLMLFDEEP 2425.819228 2.113032256 0.457828863 4.615332116 3.92E-06 4.24E-05
TCCCGTAACCGA 0.694900841 0.377104419 MH cnn Random arrangement DAQWPPPTEDMDIHMESWVPENSFDEWHPVDWEYAAETSGMMAKDMLYSSPEDNGFGFLHGIMPSPITDWNMEFGIPEFVQTDQV 4242.192157 2.029380968 0.380251696 5.336941268 9.45E-08 1.24E-06
CAGATATGACAG 1.340066593 0.388840489 EMC xgb Is known to be DEIFNETLAMGLLDYLDDFDGDWPLSLPDSTPEVDQIFGCRMPLSLNDASELKDLCWTGPPNDEGDGPGSWGTIPINVDPYPMGS 450.644316 1.841726853 0.567732744 3.244003224 0.001178624 0.008552359
ATTCCAGTTGTG 0.53200409 0.393177312 EMC Both of them Is known to be CPGPLPHVADLDILGLMFALEEYAPCAECDNFSAAVTLVDSSGLWDSDQDPARDVPFDGSFLDWIMEDFDVWPPSIGHEEDSALL 3607.455516 1.183901876 0.429118298 2.758917251 0.005799322 0.035352323
AAGGCATACAGA 1.023927198 0.394949124 EMC xgb Is known to be NWFASSIDLTAHWDLDLPSDDEEEEGAFNSGSTTEYLLDLPDDDDLLAITWGDPIGDELNLDASPPMTPSEPLDYFLSTFHGPLP 798.8873511 1.82641535 0.441758917 4.134416487 3.56E-05 0.000345133
AGCATACTTCTC 0.731642406 0.394949124 EMC xgb Random arrangement DMTSGQPLRLQNINHGTNEIADEWFPNIEPDCVWNLPCDDSPSPAWGMPTQLWSIWTCSEMHDSVDSFDMDINLFDPNAAVSPPD 1475.223784 1.254217673 0.390811404 3.209265804 0.001330744 0.009544382
GAAAGAGTGGAA 0.684581719 0.394949124 EMC xgb Is known to be DGTPNDDLFANITQNLPWHHEEPPLNALFDTSATQDSSMAWDLWLDSPESQGCVVPLGGQPTGGLLPYMNIGIDCGDIFDLGGDG 1486.64407 1.646946258 0.512511176 3.213483597 0.001311353 0.009432605
AATGACCTTGTA 0.503838558 0.395574142 EMC cnn Random arrangement HVMPNDPDMDNETICEHIQVYNDYQGYDHQYHNDPLMDTPGSSLQGFRDPGAQIEWVPMDYESFINWLDVPAWDPFDLCSPLWYD 8101.505131 1.462376599 0.289517054 5.051089668 4.39E-07 5.38E-06
CCCGGGTCGCAG 0.692624786 0.395574142 EMC Both of them Random arrangement NMDEDPNTDNEWDNPWIGPPNNWHIWEDLQCSTPFQLSDYDKECEFNGMADNHLAAQICDDLDWAPFMQEQEMWTEPLQDEQGGS 1261.232989 2.08144046 0.578778189 3.596266234 0.000322817 0.002684368
TCTCCACCTGGC 0.869250634 0.395574142 MH xgb Random arrangement DDILFNMYEPANPGSVIHNELGGHNPDLESNTEPDTPIEWTVNTGPLLNDCCEGYNLGNDMQHEMLNTVPIPWFEGCNVDDLPWV 919.346268 1.309547605 0.48728163 2.687455313 0.007199874 0.042939318
TTGTCCGGCTTG 0.533106781 0.395574142 EMC Both of them Is known to be TSSGAAQSTPESLYDEVRVEVPEPHQDLPTNLTFTTFDGVLVCDLESIINSMLMPPMQLDANFQDDFVTDLWGWGDPDGPDEGAP 3028.166948 1.675309937 0.422197331 3.968073257 7.25E-05 0.000672457
TGCAGTTGCCGT 0.80864563 0.403004453 EMC xgb Is known to be DFDDFQELLCADTLWPDLLEGGGDVGGQPHVCPMGPFADVPMTDWTSGLCGHSQLTCCDLSELLSQLGDSSDMGPFAPGDSCYWA 1216.612326 2.491532779 0.484892681 5.138317978 2.77E-07 3.44E-06
TGAATCCTTATA 0.821400683 0.406486341 EMC Both of them Random arrangement GNAMENDATTCVTMCTDEGPIDNGPAMPADALDNPSDWSDFGTSSAITPDNYWSDLSFEETIFNRMAIEPLDWYDILVGLMNGPQ 967.2924678 2.107409705 0.578936827 3.64013759 0.000272492 0.002300372
AAGCAGATCATG 0.631251013 0.407576383 MH xgb Random arrangement DAIPPTEEEMRDTHNLPAPEEFSMDEQAAIWGSGTNVTMPACALNFPDWEDMMTHGSLFVADTMEVNEEELDCLLQVAEPLCYRH 2620.498121 1.212503358 0.343635038 3.528462539 0.000417981 0.003351965
ATTCTAAGGCCC 0.824155205 0.409397986 EMC Both of them Random arrangement FPPEDPPGNACLDMAPHILGPNGCMPEGTPWPVPFPMSDETNQLGLEFCDLDIPIEMLFDWPHDIEFWWTEFPNELDIAWDPNMI 940.028485 1.559900617 0.501885885 3.108078278 0.001883082 0.013070803
AGAGAGATCTGC 0.782393593 0.410406656 EMC xgb Is known to be NDDYFCADWDMGNTAEEFFNIYEMFDLSHLPPIDSTAYPVPTSLPSGPSSSDGSAPSFGWIYGGTDMQVQEWTAGQPIGALNPTR 2050.522398 2.26188107 0.388792672 5.817705002 5.97E-09 8.93E-08
AGCGTGGCCTAC 0.800111231 0.410406656 EMC xgb Is known to be LSDGGTMDLDHSSLLWLQDLPQATIDLPPTDFYDGLMGHSDWSTGDFSEGYTPHMHDELCLDDLLSMTTSESWGAVCPPPPAKRP 1293.965525 3.757718153 0.631379418 5.951600645 2.66E-09 4.13E-08
TACACCCCCTGT 0.505717901 0.41135458 EMC xgb Is known to be SCNWSREGRMTPSDYRHTYFDDQHALTPLAHLVSSAPAQFDCLDLIDWPDFGDLSDKPYGMGDSLDFEAIFGITGHEMDWNQLWA 4460.966415 1.56046783 0.399559087 3.905474511 9.40E-05 0.000855166
GTCTTTTAAAGA 0.64033749 0.424048039 EMC cnn Is known to be DDPPVGQWDPTFYDDTDAFLPPLDHLENWGFDIQENMYLFFQSMFASHDDLQTPSPNVYDQTPAAEWICQAASPQEEEGNNPYGK 1445.969136 1.508417592 0.413503468 3.647895866 0.000264397 0.002236093
GTCGCCATAACC 0.55631528 0.429033019 MH cnn Random arrangement HGKNTLMWPGHCGSTGIEPASLELPFDGTYNALPHIEPPHIDGWLPHSLPEDSAALDWMDILEGMDFTPTDWFNLNDGPTGTIWN 6641.616251 2.458771297 0.298023835 8.250250497 1.58E-16 4.72E-15
CTACGTACCTAA 0.760742326 0.431376453 EMC cnn Is known to be PMADGGCSRHGCLWHWEDALLPGYDLGTLDEFLDPIDLGDLWKTIESGAGDFGQGDMMDPSDTWMQAHDSSQCTEHTCALDDPVG 1821.458871 2.978456683 0.493862186 6.030947027 1.63E-09 2.59E-08
TGTTTCCTATCG 0.62101963 0.43519224 EMC xgb Random arrangement PEGPTQPTHDDEYATSSAHGLEAHQDGAEMDPCMLDWMDYVYPSIDDWMCDWMHWFEAPDPAPNSPQMCMGTNCNLQWYPDHCDI 3984.753622 3.510246194 0.508193364 6.907304275 4.94E-12 1.00E-10
TGCCCGCGCGCC 0.595738791 0.438213467 MH Both of them Random arrangement ADDLCDIDDTNTKEDDDWSAFDLGVYPDWGWPDPEHIQMHFQPFHTNFIEDQHILDHVNNPWPLDNGAQGSAPQLPEMLWEAWYA 4384.221394 1.156084147 0.40038201 2.887452778 0.003883749 0.025029727
CGCTTAACCGCG 0.748777284 0.44024373 EMC cnn Is known to be ADDATTWFDMLPGDLLDDFDLTEGWAAMPWAGPANLLGQLNLDDFDASDISMGAFFSESEDAATVESTGETSHPHTWLYLTTQQW 1996.049847 1.913346409 0.680291431 2.812539337 0.004915201 0.030449671
CTGGTTTCCCTA 0.66257801 0.44024373 MH cnn Random arrangement QNPIDMALYNADFMDIFDTIPWQEGELFNLATDPGFESYQPEIIEQWTNVPTAEPLFDMCDLNHLAQQAQNTDNISQSSFPSPCQ 1509.178168 2.716887112 0.42971678 6.322506449 2.57E-10 4.51E-09
TCTGCTTGGTAA 0.817591229 0.440687506 EMC xgb Is known to be GADTLMVSHISLGQEGSHPGSEGCLCAAPSMDPEEALDFFPPEPLTSDGAFWDFDCLRCNDDTMMDFPVETMFTDALGCDIGDGT 980.0817897 1.640931537 0.582488824 2.817103897 0.004845885 0.030225608
TCGACATGGTCA 0.494615077 0.440749324 EMC xgb Random arrangement LDLSGLIPPDDNYQHCNCYCEPCKPMFCFTPQDPWGTDEDALIDSPWFASVAEDTDYYNGDNMFWDGFNPMEGVLDHILDFHQNE 2215.608345 1.421260747 0.431078609 3.296987413 0.000977279 0.007239752
AAGCGCATCCCG 0.640697077 0.448140479 EMC xgb Random arrangement DSIVGDSSHEQEGMAFRCPSLSHPWPEDMFDPSQYNDLDLPVIDFPDIELKSMERPPTMLWMHSEGYGPEAMYTHNSVEDWLMQV 1249.193839 1.657353959 0.48559224 3.413056932 0.000642385 0.004958974
CTCCCATACCTT 0.746566694 0.448140479 EMC Both of them Is known to be DYGGDALFPADVSLMDLYAAEPVDIDLTDTDCFDDFDLDLQLKTEDRHELLGMLDNAAYGWDGIPDRGFPDMSSDMTDWFATNTS 1739.3264 3.117394088 0.584439644 5.33398807 9.61E-08 1.26E-06
TGATTTCACGCA 0.48462041 0.448435646 EMC cnn Is known to be VAPATPQGPYNQHWDEVPDWPVFDHQVPCDFPVPTWPNCLTTLGELNNGLTDLDDLMNGSPNCMQDDTASPEEQEPDLYFYFALS 3093.728803 0.885526599 0.259134315 3.417249468 0.000632573 0.004890844
CCCCAGCATGTC 0.488808601 0.448770742 MH xgb Random arrangement PGHNPSNRDDFANFFNDLNACEAEISQHNWIVDWDADAWTEVLRTGPEEFDIPSAVWNMEWDDVLWWDCEDNLDSPWWNQPCDGS 5748.993938 2.567401676 0.357204047 7.187493251 6.60E-13 1.45E-11
ACTCAGTTGGGG 0.559510895 0.451501894 EMC xgb Random arrangement ELNMLEMPQDQPGAPNHTDQCEFADPLAHSQIAIPLDWMGWFHPPPSPPYMDHLNPNDYTEPWDLAWDYFLPLNHMSVPEDGADD 1683.180327 2.733334883 0.394880669 6.921926294 4.46E-12 9.12E-11
GAAACAACAGCG 0.583596123 0.461128641 EMC Both of them Is known to be MRPSPGTLFDMDTLFNMAPEDDEHLYLDLGDLTASDTVNIFDGYLAGSTCLDEFMMDMLGFAALGDMDTDALGSPDGNFPEPVGH 2351.244826 1.59931903 0.345584056 4.627872734 3.69E-06 4.00E-05
TGTCTGACCGCG 0.7896687 0.461128641 EMC xgb Is known to be LNLDDFCDWPPLGDQNENEDVCMCPEPPDAGCYLDMSSSDFLKGFDLDSLGGSMDDLIDIAAHWNTIQEGWSEAHSPELPSVPEP 1043.18878 1.764160518 0.577816228 3.05315156 0.002264515 0.015416124
GCAGTCACAGAG 0.52325986 0.469091288 MH cnn Random arrangement SCQLQEGAEQWRYHTENQMDCEHPVVTHEFDPIFVNPEFPSQDTRLSHEEDLGDPMYMPPNLDMFFYAWDFDYAPPTEWMTDGWW 2402.84172 1.107275713 0.315685764 3.507525009 0.000452296 0.003586525
CCTAAAGGGCAC 0.87504306 0.471497319 MH xgb Random arrangement YDVNESWTELTTFWDEVSEILLAPSGQSHADAHEEFHPLFQADGPAETPQPVPCVDEEWVWLHCNYPMAWFEVGPPQPEAGNAMD 668.3473905 1.681326715 0.557797196 3.014225829 0.00257636 0.017254646
TGGATCTATGGG 0.501950942 0.477295073 EMC cnn Is known to be DTDYFDAMSWFDNILAQLPQHGPHLPEHAYVAPPAFDPVMAPRRGWFCEDLLASPGLSGEDLATIMEANESVFPWEGLWCNTTDM 3273.266621 1.029318627 0.291793533 3.527558055 0.000419412 0.003358004
ATCGACCTGTTT 0.476148656 0.47845634 EMC Both of them Is known to be TIPQYGPNIDLTCDTSCGELLPPSDALAPVQGFEVVSMIEGDIIDELLWETDKLSSLEGPTPLADPWVWTDNDELRELMGARNQW 2710.254757 2.043372314 0.470796903 4.34024162 1.42E-05 0.000143953
ACGAAATAAGCC 0.538780603 0.479433422 EMC Both of them Is known to be DAQACGVTEETLETVTDLDFPEEGSVISPPFEFTPAEHTTAPLGLWFSAPMPAPELAYLYSDVALGMVNMDIDFEPWHYAGQELP 3350.105822 1.461849145 0.370834483 3.942052892 8.08E-05 0.000745589
GCAATCACGTCG 0.625319263 0.480874952 EMC Both of them Is known to be YDPGLYHGFPPPADETQWMDHIFDNSIPPDWWLTAEAAPSAPSSHDLLASGSLVPSNDRPQGDPAVWASFQCWLGPDFGGNPQMD 1871.341006 2.17792776 0.476530013 4.570389482 4.87E-06 5.21E-05
ACTGCGCTCGCG 0.62838198 0.486493981 EMC cnn Is known to be YEERLEVKQETMPVDSDGPACELWIDFDFDEFDTGLALLNGDAIDGLFSSDVTPLLRDYNTDAEQTKHDWCLNEEINVEPWGPAP 687.6330456 2.230130637 0.70276327 3.173373926 0.001506783 0.010668026
ATTTTTATTGCT 0.746227409 0.486493981 EMC Both of them Is known to be WDDMMTSTPEERLLWSCMDPLAIPLLGAPDQDTTQEMLAPYPMTNSQPMSTGDAVTAPDELETLIRHWCALEDFDLGLPDPLSSG 4217.380528 1.758442634 0.304074947 5.782925075 7.34E-09 1.09E-07
GTTTTTGCTCCA 0.492067442 0.487434115 MH cnn Random arrangement TMNQARDNIAWSCASPESEVLVPGDLVQHPCPWITPDLSDIALYDPITFDESFDPYFIDAWYFSQDDGDVGTMFIMGDDIEHMNE 3348.337306 1.041595761 0.308789872 3.373153902 0.000743124 0.005648655
GGGACGAATGCC 0.887001837 0.491580914 MH xgb Random arrangement NACTWHPEMDQLCSQLWTYDDVYFPSFDNTSDQGMELQDAMLDPAHIQWIKEMDQLLVESDYEVAMDLQDDFPSFGECNDFGDSE 1315.913761 2.458100118 0.606184126 4.055038746 5.01E-05 0.000476822
AGTGACCCGGTA 0.420291871 0.494883255 EMC Both of them Is known to be QQDLVIPTDWLMPDYILCNEPDDGLDELLGMDYTNIDHLLTPELRDLDIPLVPWGNGDPQVHEFLERSYAHFGFDGMFGDADFDG 5893.191024 1.745505741 0.288160087 6.057416756 1.38E-09 2.23E-08
GTACTCCTCCCA 0.44790399 0.501400523 EMC cnn Is known to be VDTASDMFNLCKQPADNDLWTPYCWDHMDTAPPASNAHPAPTEFSQFNEDDDMLCDLFIAGTSQAGFDLDDWMSMYSGITCREPV 3531.141461 1.161081793 0.252877262 4.591483561 4.40E-06 4.73E-05
TTCTTACATGCA 0.620904702 0.501400523 EMC xgb Random arrangement PFDIFQEVNPHPCPPSILMQPDPAWEEENQNDIIWFSGDQIDFPDMMNWSDPDVPEGEFHDFPPLTSNMYPEPLSWAAPPPELSE 3480.877895 2.450502761 0.388266836 6.31138829 2.77E-10 4.81E-09
GCGCACATAAGG 0.805295033 0.505806941 EMC xgb Is known to be DDIWWPMTHNWDDTFQGLLEYEASIDFDTSFPPMPSDSEWPLPDVMVDPIAWEPLGKLWNGMFTDTPEDGAESAAEEPKDEAECG 1461.270907 2.328405607 0.523817314 4.445071872 8.79E-06 9.15E-05
AGGTCGAACTGT 0.479654857 0.511497983 EMC cnn Is known to be STCHPCHNGWTIMEPGNMDTQESEADEWGFACGPLWGSSGISLDSDWLAANNMDQNDLGSDFAYYMEAESMNLADGSGATGPIWE 2032.31707 1.892425845 0.544238015 3.477202607 0.000506675 0.003975285
TGTTCGCAGCGG 0.4496138 0.513182699 EMC xgb Random arrangement EPSKPMDTDGDAEHHCTVVEVPAFWGADPAGAQLTFDLWPESLMNMCSEDFDVLQPQGAGMIPLDGWGFNEMFQDYIGRSHQGSY 4504.344563 1.165227509 0.328805876 3.543815953 0.00039438 0.003198935
CTAAATCCGAAG 0.735004791 0.516192204 EMC Both of them Is known to be CEDYLSTLVPPGRPDEHGPSSRGFLPQFDMNLDPGFDFDLGLDMALDDIDRLLCFIGREMPFQTVTQSGSAECNQGIPQQATHDV 1026.179351 2.007493398 0.423907039 4.735692535 2.18E-06 2.45E-05
CCCTCCCAGTTT 0.490685994 0.517655712 EMC xgb Random arrangement EAAQMSEAAQWWEDLEPLPLPLDYLTYETFDMCQGTYQWIATPQEPQYNEQIPFEVHIDRHQPFKEFYWPEPMASDDTCDQGDGP 1531.059167 1.070379084 0.369502782 2.896809271 0.003769789 0.024484747
CATCACGATCCA 0.521893436 0.520105265 EMC xgb Is known to be FVMDDGDAFDFDSIPGDDAQLPPLSSDELVICKLDMLGSDTLTAEMLDKLHPGGVDTESNDSDVWFSLGDLPDLGGWCSPCPMTF 1838.767678 1.185178814 0.420695944 2.817186215 0.004844643 0.030225608
CCTATTAATGGC 0.421465161 0.520105265 EMC cnn Is known to be PDDIYILVPSVSNSPEMIECPLFPDDHQNLLQSHIDPLLATGSMLDMSQEEGLRNDAMAMFMNYLDHADWGLMFEHAPDSQESQA 5447.480258 1.343014726 0.271644606 4.944013967 7.65E-07 9.14E-06
CAGAGATGAGGC 0.52229398 0.527985879 EMC cnn Is known to be PTPGDHFEGIYDDLFQGLAENEYVFDFPEPVERPERMNSNDDIIDHLSSADDFPLLWLSPHPGGGDDGGKYPGQGYAPAGHDWTP 2041.756366 1.388521168 0.505803384 2.74517967 0.006047778 0.036641548
CCAGGGCATTGT 0.725203123 0.532320097 EMC xgb Is known to be FDNAIPQEPHWWEGMEEGSLESFEWEGPNTFGGHGSFGATSTDPLLWLPQLNDVSSVLCETAEPHESFMNYLGEEWSAGRAAFEG 1131.014928 1.394588626 0.518693741 2.688655203 0.007174048 0.042836844
GGCTGTGACGTC 0.474397244 0.533925071 MH xgb Random arrangement TDLFGILADVEGGLDILPSQLWPTTPINDCPPLFMHDMTPNMTPINETLEQPLETWDVENCMQRWHHMPEREWLDYEFRMCGDMQ 2643.385961 1.309379549 0.3420076 3.828510098 0.000128921 0.001145043
CTTAAAACCTGC 0.571337367 0.54491886 EMC cnn Is known to be SIGDPPPGAHAPVWLHEPYDHDCNAPEDPWEEDVSQTTFDDFLELLLDSSPHMYFPPIYFMNEELVAEPGNGQPGDGACPPYGGL 2956.260764 3.350985019 0.524309362 6.391236289 1.65E-10 2.91E-09
CACTCCACGTTG 1.194002952 0.548467011 EMC xgb Random arrangement DPLEFSELAMGPKDADCYELYNDWIEAHCATDPPPSMQPEGQLWDVWGLDPPFTVDPLFAEQVQLEPWTLMHPGNVDPLCEEDWG 713.0290349 2.761047043 0.664263402 4.156554514 3.23E-05 0.000315197
AGGCTGGGTCAG 0.412448673 0.551297144 EMC Both of them Is known to be SPHGTAMDESIEGLTELPEGCNPCFCCQPLPPPSAFGPTFSTKLWQFASDPNLDAHMSGLPAQEEWYNDWILQPDFDLTLQGWDA 3959.763624 1.444716081 0.323521936 4.465589247 7.98E-06 8.33E-05
CTGTCTATCCTC 0.68414504 0.556104311 EMC xgb Is known to be DPTDLDMLFYDIFGPPPSNMFVGSCERSDSEALLSIPDMMDITPLGASDDALGFGWGLDEYADPQPSGAGLYKLEGETSPTSVPP 2064.634025 3.088389064 0.362121204 8.528605979 1.48E-17 4.80E-16
AAATTTCGCTAC 0.686578388 0.563600128 EMC xgb Is known to be EECELGTADALLDFDITMWDTEEVFDRVLDMIDQDDQSLWDLFDVEPDDFEPFDLDKHIGGGMTGLDLSMNEAHSELSMLDMLYS 1752.688428 2.303595644 0.445667046 5.168871391 2.36E-07 2.97E-06
GGTTTATTCCAG 0.454753969 0.563600128 EMC Both of them Random arrangement VPAVHPWVPPMLLHDGVIPNDAGPDFGSDILYPSTPEQDETMWLEDWNFWECLPLISELFAMPLEIEQAQSMPADAGTNVYGDLN 4350.915511 1.235546868 0.364659185 3.388223633 0.000703469 0.005405258
CTTCTTGGGCCG 0.588042659 0.568204343 EMC Both of them Random arrangement TWGAAPKPGMQELWCGLCEGEEGLGQLPDGFTTSEWPPATMEQQTCLDPSMTWGGDIFFDNNMLGPASEWQALLEEYVFNFECDQ 2364.663531 1.806509486 0.348731181 5.180235044 2.22E-07 2.81E-06
AACCATATCTGT 0.582508759 0.573131202 EMC xgb Is known to be GFPLNDIWASSYNPGEEFFPIVDEEALAHLLEANNFDADWDKLGEHLSDMPNDPFHDPGEMPEIGAFDLFNPESSGGDGEGLSGS 913.3142125 3.238256078 0.515946477 6.276341098 3.47E-10 5.94E-09
CCTTTACTAACG 0.641733335 0.578664818 EMC xgb Is known to be LSDDIPWSWINFDNFDVVPPWMMSSWDTDAMLDLAATALQGPGMSAGTDTPPPTPRPQWHNDLWGADPEAELVFLPDATEFGNQS 925.2193711 1.180834551 0.39989234 2.952881147 0.003148232 0.020720631
TGTTAACACTTA 0.518494674 0.586552639 EMC cnn Is known to be TPEPFGQPRWTDQMQDIQAIESITDYSVDNLYPPSFLLPTLAEEIYDLDDNAPPDNLVNPGMYLEPLNDAFCPYNDFDLSTFWLA 2098.168029 2.440310094 0.475518632 5.131891639 2.87E-07 3.55E-06
ACGGTTGCCAGC 0.358892172 0.587156677 EMC cnn Is known to be DAPDPPLCVSYNINAHGSTGQILSKRNEELMDDPVQDPEFDATLLLADEPDNQPFLTMLFMNDPIDFWFEEMCMDAWGCDGGLNH 6086.873006 0.880501445 0.331188106 2.658614333 0.007846272 0.046320626
ATTTGCTATCTA 0.537478905 0.587156677 MH xgb Random arrangement IATHDLWTRAPEQFDLWDYNQSDYDTEQAFLGYENPTLSYDILTPWMATQPAIHDGQCEGASPTMIDEEWTLAPPDHSHNDCRWR 1423.144353 1.25254004 0.321066297 3.901188169 9.57E-05 0.00086727
CTCCGCTGAATA -0.464349579 0.587156677 EMC xgb Is known to be TDLDDLMYSPSLSDENTDWFGANNMGKDWTQSEIEGLLDLPDDWLAPAMLPDLKVHMSPGPDWGTPDDGQKGPPSTDHSLMDTCG 1758.099737 1.689370267 0.556984441 3.033065455 0.002420831 0.016323317
AAAGACGCCTCC 0.724836603 0.594869291 EMC cnn Is known to be GDGQPAPAMAKPMPLDLGDTISAAEAGASHGDAPACPGFQATAELDFDVLGHLDDTLIYAKMFQGELLGEIGDIDVLALLDQGLQ 1068.378529 2.215224675 0.398030607 5.565463149 2.61E-08 3.71E-07
GCAGGGTTACGT 0.459355652 0.594869291 EMC Both of them Random arrangement LMDDPWDLFNNLDELFTDKEHPAVCELGLPPPDIGQTAWPMGTPRNPHDGVSTYYSMVGPGLEIEAFFDEIPNTSDAFLNDCWCF 3153.706186 1.143602058 0.337742734 3.386015285 0.000709154 0.005440509
CACCTGGGAAAT 0.519044771 0.597582565 EMC xgb Random arrangement AGDIAWQDATADESDYGGNCFDMLNVNPEFFQPDPWCVSDSEFWQFWDLPPWVNDDDIQDLMRNNMMILGEGLHDADGRSEMDVQ 1456.77916 1.987511352 0.509145995 3.903617763 9.48E-05 0.000860178
TTGTGCCCTCCG 0.513352359 0.602987999 MH xgb Random arrangement TQTGQASCTGTWNSVEDIWPEEEIEGPSTPHSSHYADNRFAEGLAGSNAMADTCIVSVGSGYGFDTIEELYWGLDLFSDQDYDAA 2464.705828 2.018713725 0.645995143 3.124967343 0.001778248 0.012360445
TCGTTTCTCCTA 0.416582288 0.605499363 EMC cnn Random arrangement EGANDCTCAPGCGQPSRYSPFAPEHAYQECELPSFPTLDDYDWMSIMSPPTHLDLQGCLMAMLEWDNFSDLPTQYQQWEWHDMIE 2290.179506 1.2532724 0.314851906 3.980513942 6.88E-05 0.000640614
TCTCCATTTTAT 0.404398049 0.609175322 EMC Both of them Random arrangement EVQDPHTPPDCFFGWPTVPNIDEYSWEWESNGENASLWHELMEHSDSPRPTLGMGTPHTFHIFPLDEFMYDTGIQDMQTLPPFDF 4117.194902 2.113339695 0.317496939 6.656252188 2.81E-11 5.33E-10
CTATGTCACCAG 0.439671912 0.609813774 EMC cnn Random arrangement VVDDIISTTNRPWWMDHEETEHNGEWNLPEGPNPPTPSPWLHSEENSSRPMEQPRVTWALDDLVMTNDMWAPCIGLTTHWDWGGI 1125.62551 1.423349259 0.482420339 2.950433768 0.003173281 0.020857797
ACTCTTAATCTT 0.727098987 0.611523779 EMC xgb Random arrangement EGPVEPTWSLTCSEHECSDDTTEQCDAQVWFEDPREFSASFNEMPLPQFAYESWWEEMNHNDDPIADIHLIWPWGDMWWEGLSDQ 751.4002355 2.675695402 0.752564648 3.555435948 0.000377353 0.003081081
TGTTGTTCTGCG 0.335052785 0.621645339 EMC Both of them Random arrangement WCNYSATGASGSGQNVECEYNEHLYEITAEQHAQPNPWPTGPTGQELPCTLWDDSGWFQDVMFDTFWDMTLDDLLQPVMPSWFSG 7473.753033 1.708689576 0.272519429 6.269973424 3.61E-10 6.17E-09
GATCCCCATCTT 0.422459121 0.627793198 EMC cnn Random arrangement KVAPIHQMFTYGSVTSSPVNDPYIETEFDSDWMEPMRTFILHNTDMNCLPNPEDTAAAMEQPEDWIMDTWFQNWLNGPGNGLPTN 2990.843163 1.548219668 0.252155543 6.139939049 8.26E-10 1.35E-08
GTCCCGACGCTG -0.922991339 0.629315335 EMC Both of them Is known to be ATLHSPAPPSMWVQGESEGHDPEHEFTLHADDLSRFDGGSILGSIIDELMDMDVFDGPAKSTPPNDWEWMAWGPSFTEWQSHLNF 1048.226493 2.182864294 0.486769808 4.48438719 7.31E-06 7.66E-05
GGACCGCGACAG 0.498091748 0.637237004 EMC xgb Is known to be GDLDDLPDENRWSEFQGLLETLDDFQFPQPSEPWHSTPAESPFVEDPYDLEGDTPYHDVGECHWNDLRIMWDICLFEELGHMTSD 1041.382859 1.36725963 0.428088375 3.193872366 0.001403782 0.010010276
CGGTGCACTTTT 0.569547115 0.642558936 EMC xgb Random arrangement WGESSHTTLIDLGYQDLPPEDEYQPWHEFPIVDFGEQDTPWDWQKWDCDIDTSNACSMGALGTSCWSDAWEFWEELDLTTPWPED 3153.085123 2.517532626 0.539279043 4.668330165 3.04E-06 3.32E-05
TCTACAAGCACC -0.430124568 0.642558936 EMC xgb Random arrangement APPIDGSSTMPAGLEDFISMPCSPGVLDPFFSYTANRTSGEVTGDESLEFELLPIAIEQIYCQEDWPDPFSQYAESDTPSQFWAF 3415.574966 1.289785309 0.366530683 3.518901335 0.000433338 0.003452769
GGTCGAAACCCC 0.947894869 0.65068702 EMC Both of them Random arrangement VGLMDTTTAPDLGAVFYDFNADDYPVQPMCHMVDNLAQADMAELKWWTFSTGAALDSIMDQPDWSQASLDPPETWAEFDFSFDLD 568.9482384 3.10932146 0.618749544 5.0251697 5.03E-07 6.14E-06
TCGTAACTGATT 0.413853462 0.65068702 EMC xgb Random arrangement GVPGGDSTQGQITMDADPQEYQHTNSHNYGGRLAPAQIDPSATTHDWPVIDDIWYEPALRLLPLDWFGEDPWVVGHDMQMDTLPL 2982.678442 1.47529963 0.34817153 4.237278185 2.26E-05 0.000223361
CCATACGTGCTA 0.389013072 0.65208549 EMC Both of them Random arrangement PYDDLSPCDGILPFDPMGMENDNTLGEPWPWSLSERPSEICLWVNDHVPGTLQDLVYAENIDNMSFQDLDWMDLHSPWTAILGDE 5527.878792 2.081086731 0.377276943 5.51607186 3.47E-08 4.83E-07
GCCTGTTCGGTC 0.438818965 0.65208549 EMC Both of them Random arrangement PYPIVGPGSPPGLEGVHTSAQPDHWLSPLQDMPAVPTHSAVDTTIWEPEWSLPAFNHYAFYPEDSWEDWVDDWNIINFTMLETNF 3418.410324 1.091649286 0.310365889 3.517297889 0.000435964 0.00346812
ACGGCCTCCACC 0.548369611 0.652721004 EMC xgb Is known to be MAYDSPTVPLNFDEQLMEEWFYDPQLDLGPDTAEPPTPNGEPSILDIEDLDVYMPKDCPMEIALLDDFYLWDARLLEEGGGTPPP 986.0036467 2.698865118 0.455479767 5.925323825 3.12E-09 4.80E-08
CGCTCTCTCTCT 0.478305757 0.659405851 EMC Both of them Is known to be SDSWLMEDPLSDMFDAFHDLLELDVPVSQLYGLGDLSPTLEHLESALSEAFTWDPSSLSDSGGGLGLWEGGMPEAPGELDCAGFH 2085.10064 1.794590585 0.553343567 3.243176014 0.001182052 0.00856469
CATATAAAGGAG 0.448710241 0.659877615 EMC xgb Is known to be NDTDGGASHYAGGDIFTPTDVLLHDSNMHPMQPDDYAFMEIPTWMEDLGIDLYAELDAPGSPEPPHPPAPPAPCTIREYFEDVGC 2145.281172 2.380842227 0.500665033 4.755359512 1.98E-06 2.25E-05
TCAACTGCACCA 0.392966639 0.66218856 EMC Both of them Random arrangement ADTCVAVMNNMHLCNPPQPELQMSDTHLYPMDEPMPNIFSMNDIHEPLYEQFELPDIDDIVEIVCNALEAGGLHSEMPWPDTWWP 2069.185349 1.246811868 0.327350309 3.808800033 0.000139643 0.001231441
AAAAAATCAATT 0.593978906 0.663077446 EMC cnn Is known to be ASGGTIFETSPMDIEFPMPAAHPWEHLGLYSAFEFWDGALPNPLDLDSLALGDHEATLSHGEDVGFCHLMKDDDIMTDPPNTVND 796.0457205 2.307707517 0.631057023 3.656892216 0.000255292 0.002162779
ACCCAGACTCAA 0.400971404 0.663077446 EMC xgb Random arrangement ENEESNEAVTYYESLSDIGDCLHPGRRLDNHEVYDAFPPHDPNYCYGVPCYEPSIQDLFENEMTLDCLMQFWTINPCDGDNIGDS 1504.539218 1.208554818 0.379340175 3.185939423 0.001442848 0.010244635
AGAGTGTTGGAT 0.327048734 0.663077446 EMC Both of them Random arrangement PPVQHQAPEDFFIFESQCDPQEVTEYFDFWFPCEVAETVPHIGPHLESMMSQSPYTEHIDDQGGQDNFDANWAECMLCMLDGFAL 4298.074635 1.103902027 0.388874774 2.838708245 0.004529655 0.028488542
ATTCGCGCAGGC 0.472993096 0.665313771 EMC Both of them Is known to be DMPIWCELDKFFDIDDCTPPVSQSAALNQMSQLSAPSPALESQGLAMTDMPLGCGELLDGEDFEAGHLDDFLDWVDLEMGAGNQF 1592.940386 1.759578798 0.366054311 4.806879048 1.53E-06 1.76E-05
TGCTCACCACTC 0.390773072 0.670693346 EMC Both of them Is known to be QEWASQSMPDTPEAQTPFLMEDLQYYFSADEIAADTVLYPSPMKDLDGTSLEPLPPADDGLESLIRLFTAGESMHLDEVTVQMWG 2281.777425 1.152092177 0.34601166 3.329633968 0.000869602 0.006520043
GGTACTTCAAAT 0.38525187 0.679806751 EMC Both of them Is known to be DMPGSSEFLEMIIDLPEEPWVGNACGWHDMDGPHQSDELPDWVESYLIDHQELATTGGCINPLPPWNMTTLDSAMWDLFTDDDFW 1958.664779 2.150966458 0.458089051 4.69552034 2.66E-06 2.94E-05
ATTCCAAGGAAC -0.399424211 0.683149493 EMC Both of them Is known to be EPATEGPSVPCLGPPPLFMDGLSSLGPMHLPPGSSTSGAESGPEILWESLKSDVSFESLLFQGPSMGGIFDLMSSITWSTWGEGE 1162.311466 1.33317693 0.473434877 2.815966871 0.004863068 0.030278097
GCAGTTTGGTTT 0.420274614 0.68838762 EMC Both of them Random arrangement VGPWMYGMGEIINNMPLDALQVDELFEGFTCDFDCQFPAAHTMLSSVLQAPETTAREEPTSWEENWPYGQASDPDPYVQATGQDS 1589.610187 1.372470978 0.370639854 3.702977336 0.000213084 0.00182391
TCCAATCGGGTT 0.424769617 0.690450185 EMC cnn Is known to be ETEPGPSFMTTIDTELEHTIPLGHSNGHTSAYGSTIAGLLDLPADNAIDELFMGMDLPGPEPPHHYHQWAELDGTDPPLMPWPHL 2110.173674 1.635115567 0.405491503 4.032428686 5.52E-05 0.000519441
TCGCTCATAACG 0.32290081 0.69534836 EMC cnn Is known to be PAGAPSRCVCYSDWEPIILCDDILGLDALNKPPTWEGGSDLLDPFDLDMFGHELAQHDFLGGQDYFTGYDGGSCSVNDPPDPAIV 3703.010277 0.990462896 0.375017062 2.641114219 0.008263386 0.048123783
CGCTGACGAAAT 0.382169923 0.695813157 EMC cnn Is known to be PGMDMEPPPDIMPLTPMPPPFEHGDWCVPTDMTQCKTMALNRDSLDLPLTPCMTGELDGFFNDMEHGELECDQFLMDVDFYILGP 1242.121419 1.253377749 0.390198768 3.212151993 0.001317447 0.009462703
TCGAAGTCTGTT 0.298853953 0.695813157 EMC Both of them Random arrangement SHDDMQCEATPLEHLHTNIQFHYAFNQDFIEPNMTDITNYWEDGLYLADLGAHLFGEPEPAAEPCSDEGLGPAINNMYIFGLDEP 5578.816716 1.3713657 0.381809012 3.591758329 0.000328454 0.002726666
AATGCGTATACC 0.332468525 0.699705219 EMC cnn Is known to be HDSQWCSTCAQRNDDPPTLPPPVPPWELSEEQSWCNENLTPPELWGLNDIEDAFVTYDLGGLGAMDIEEQTFLNPTWEHFLLGGQ 6662.949959 2.18921954 0.468408027 4.673744713 2.96E-06 3.25E-05
TGGCACAGGCCT 0.493161638 0.701459453 EMC Both of them Is known to be DKGGEVQESTPSGAPLLHDPGDFSTDVPLDDEWMFPMMQLVPDMTLPLDDYSDAEIMDLLSCCGVGWPIEADNWTPDGTEFGEPW 1166.760347 3.438410034 0.687895547 4.998447875 5.78E-07 6.99E-06
TAAATTATGTTT 0.498913449 0.703751751 EMC xgb Random arrangement CIPQSIGSSVMDMLPPLSRLHDGGPVYDVGTADHMPDNVPGPEELGFTDWSWESMDNDQDGIWNVNHFPEPTTWSSPCWDLCYLD 1506.529841 1.855957924 0.453668926 4.09099636 4.30E-05 0.000413344
GGACTAATCAAA 0.477057354 0.706227432 EMC cnn Is known to be APLRADCSADGMPPSFGLDDWDRSENNTLVTDCSSDGIIFGGTTLEEIDLDELTFWPAIGSMADDEAFNSMLYEWWANMMDSDDT 2241.074938 2.45704561 0.456012372 5.388111734 7.12E-08 9.51E-07
CTGAAGTCAAGC 0.319725929 0.706393297 EMC Both of them Random arrangement RDDALFQNENWLHEYPMDVILGDLDCPFEPPTQEECFWQLITHALWQCQEDEPAPPPEFFAWDLPMHDNELAWDPLSQDIEDHNM 2795.552072 1.412279139 0.436572755 3.234922755 0.001216756 0.008777649
TCTCTTATCAAG 0.377059005 0.709476436 EMC Both of them Is known to be QPIEDIMLGMWDEPFQGWDLPPIYPTCVEPPPSWTPAPWHGPIYDELDLESQALSAAQGPPMLHELVNTEDFYIEGGTAFDMGTG 1475.373882 1.575408526 0.513276238 3.069319035 0.002145473 0.014706731
GCCACCGAACCT 0.982067833 0.710431021 EMC xgb Random arrangement YYNYWERGGASGLEDAMDHTLQGECMPCPDHDIPFDDPGPFWDLEWTACSDIEQIFWPEPMPLQEDDNLWPSLGNSHDCAQWGTD 265.6467088 2.355150172 0.635251505 3.707429503 0.000209374 0.001798364
GGCGATCAGGCT 0.441657342 0.7135087 EMC Both of them Is known to be GGTDDFELWALMEPILGSLFEDPYQFVPDTDMTMGLDLAALGIGDSLSPGETTHDEALGENLDMPPEISTQMFLWTVESPHNDFC 3330.088461 2.144794202 0.383780606 5.588594543 2.29E-08 3.28E-07
AGGCTTATTCTT -0.401698122 0.716759526 EMC cnn Is known to be DDFDHDFEMLNAEDMATDIEIVDNFFDTVDTTDFLYLGSPAWFDFLLDIYDPNVANRLPPCDFVTEEDTQPSAGGQGGPPDLGVF 1301.97118 3.077457765 0.62429833 4.929466597 8.25E-07 9.75E-06
ATCGATGAATCG 0.429537848 0.716759526 EMC xgb Is known to be GGTDDIPDAIPLTHIFSNLCFGVDDFDYSPPCPDNADIAEDVYNFEDLMSVIIADGWDDEDQQGEDSSPGFLGLGSTPYVDHFGG 2119.766299 1.367832625 0.432979696 3.159114935 0.001582491 0.011124573
TGAGCTTACATC 0.530594857 0.716759526 EMC cnn Is known to be GTYADPSAEPSAPIRNNSPGISDANDHDLWEPSHECNPMGLPDQESIIMNCLFDYDQGGEDDIDSLPFEWFPPSVWATTDWWYVE 1587.374789 2.147039661 0.520299136 4.126548578 3.68E-05 0.000356453
GCACAAATGAAC 0.477986842 0.717327037 EMC Both of them Random arrangement VHYPAPDEFESAMIDLDPLDAVLTWNPYGCETHSIEEIHPPGDEFDEFMEALGWSNHQWLPYNLLQEPVQPNDLDILGHPQDHVC 1331.925667 1.923255144 0.499778189 3.848217439 0.00011898 0.001062463
GACGGAGGCCAT 0.839020997 0.720592542 EMC xgb Is known to be GMLPYHDGELWWDLPLPEPQEPTPWDNAEWMEYMDIFPNLTQELEWPSDHFDWCMHGLSALDMFGLQLPESQWFSELCGDGSPNP 269.6119337 4.25243518 1.399102946 3.039401205 0.002370489 0.016027483
AGGCGTCATCGT 0.29309153 0.723621932 MH xgb Random arrangement AMDAPDMSPVPQEWRPSHSTAASVNIDMIAPVDYNEQTTMSDRPWCDTLEDTFTFGLMDHWFTVGQVEPNLADWTQWTHWPDQLF 3546.284564 2.401063318 0.453368534 5.296051972 1.18E-07 1.54E-06
GATGTGTACGTC 0.544840216 0.734702518 EMC cnn Is known to be RIYEHGHFSADYNTGTGISTDEHAFWCPDDSTFPLPEPTFDLTLLGADPTFSGLLGNSWPIQNASQDEVEWMEFLLNGMFDLDEG 1347.691454 1.355186949 0.484866584 2.79496875 0.005190473 0.031995007
CGTTGTATGGCA 0.469411855 0.743626264 EMC xgb Random arrangement CECPYENHETYLYESLWEASAIEATIPTREPTPDMAPPYDVLWGMMESQISLNLSGMCPIIDDAMFNIPHDEPVQFSLFGPQDPQ 1359.462221 1.339604251 0.432639954 3.096348909 0.001959197 0.013580111
GGATGGTGCCTT 0.292993262 0.744883016 EMC xgb Random arrangement GPSTGLDYLLMITPSEVNPQICPDFFSPEIEGESSSNLPSMHMFNIASSMDESWAEFWINNLGEYPDDPCRQADDAYDENGAANE 4944.40711 2.164954711 0.397886797 5.441132316 5.29E-08 7.23E-07
TGGAACAATGCG 0.291541347 0.744883016 EMC xgb Is known to be WALDEGPLDHLGMYAMDAPHSWDLGTDALDDHVLDASNPPSWDDFVADIIGGSEGKCAPSSTPDSQNFLDWLIDYDSIFPRLGGH 6018.64377 2.620166927 0.277630674 9.437598825 3.81E-21 1.73E-19
CAAGCCTGCCTT 0.378488021 0.746016492 EMC xgb Random arrangement IPADDSTSKESYPDPYGQNNDDGDLLFGLLEPIDTCIFCEDNMLGLPPQMPVDPDLMGLIGASGNVPHDQYCDFGPIGPYFWECW 1948.073842 1.455405426 0.536690073 2.711817304 0.006691546 0.040197943
GGCATCTAAGTG 0.293092701 0.746054153 EMC xgb Is known to be AGGGGPTLSPASIWELLSFDEPGDFPKDHPMTELPAGALLDTDNESIIAEDLMDEGLLSPDMPVEFDMNFFPTEWLTCLQALDFP 4227.896641 1.584713323 0.358375659 4.421933472 9.78E-06 0.000100582
ACATCTCCCCTG 0.320032803 0.747255123 EMC cnn Is known to be TITDTVDDTPPTDNCWSEFMPCNYPCWDATRQNNSTNAVFWHEPLASDLASEDPFYLGPNDHPGMAQLQNEHEWEMGLWDYGNIY 4130.49265 2.259117298 0.366028244 6.171975353 6.74E-10 1.12E-08
CACACAGTCGTT 0.263208578 0.748482182 MH xgb Random arrangement EPDWNVCFVNDEPASESDLFRELMQAPWALSDLWHGMYTIQPSDDCVLFTTTEQPAGNPMEGVPSLYTTDPMQSFTDCQPVENPM 4259.538527 0.60192387 0.227750374 2.642910568 0.008219674 0.047925536
GAACTGGAGTGA 0.406367167 0.749437003 EMC Both of them Is known to be KDNGLSAMGPLQCAPDIPPTLPNLELDIWEHWGEGVTGGLTLPDMGFPDPAIGPTGLEGTLPDFDDCIQMWSYLYECAEWMETMY 1172.726569 1.98503626 0.345425469 5.746641276 9.10E-09 1.34E-07
ACTCTATGGGCA 0.348008148 0.751106683 EMC xgb Random arrangement DGLDNHHWWNCLMNNPCTEHPTEFGDAHRPQYPYEPNIMVGSLDPVLDWDLEYQPGGQDLDNWMEEMYNDNSIARTEEGCMDATM 2310.173976 2.041835736 0.465993253 4.381685195 1.18E-05 0.000120837
GCCGGCGACCAT 0.382954682 0.751106683 MH xgb Random arrangement AQVPYDFAWMADMYNTHPPGQLLQISFDTQESADHDSQPSTSHTSCEQFANMFFDLDWTAIPLPQCDVFTDTQMHEYTPQHNIDE 1738.458417 1.637351059 0.476649629 3.435125007 0.000592281 0.004615318
CGGCACATACGA 0.555538539 0.756608792 EMC cnn Is known to be GEAMAWLCTILNDLVGPLEDFSEPFMLDDLGSPSANPDTAFSGWAMLNDSVAIAGGETGDVWAEYWSLPEDDGGPSQSESSKGCK 380.6398133 2.264178702 0.85642623 2.643752169 0.008199267 0.047862856
CGGTGACATCCG 0.468075532 0.756608792 MH xgb Random arrangement LSPLDGCIPLYNDPRQCGPLFQPWNVWTDTTTVPRPIFEYDEQEEHEVWSLINPLAGDLNTCADNPFLNHDNFMSDFEETMRDLN 699.3632954 1.052309112 0.369268808 2.849710262 0.004375907 0.027661984
TCCCTATTCCAG 0.469136113 0.756608792 EMC cnn Random arrangement QWTDPPAQVEYAMQEIDLEACRFHPADMGHQFPNPYNAQFDSVLFRSEFANLPDWDLMGDNEPQPCNEYQYWVTHSDFHELQEEI 826.8578134 1.473720421 0.44697108 3.297127008 0.000976793 0.007239752
TTGTTCGAGGGC 0.280486478 0.758360493 EMC xgb Is known to be DGEPDTTPTAYPESITCDQDWLQGLNWMEGGVDSGQPPTLSQSPPLDHDDWLFAALDVPLSVYAGFFGHIEPGEGFFYGTYGGED 3481.935662 1.456111122 0.38163819 3.815422983 0.00013595 0.001201012
CAGTCCTCATAC 0.277317312 0.760284096 EMC xgb Random arrangement CPGCYHPMPDWINKAMEVPTTPHTQHRDIQDGFTAVASEWPEVSNDYMSQFLEPLMCQDDYSQEDEFMTEFLHFNEWHQEATDST 3944.201669 1.70994172 0.34400042 4.970754745 6.67E-07 8.02E-06
GCATTGGAATTA -0.412769667 0.760284096 EMC cnn Is known to be YNDDEGVLPPFTESVIDDIMGGQMGCLYMLTTPTPLYSPSPAPNDAPAEAGRCADTSLFLTQLQGDYWQEWMYPDEGPSGYALQP 1212.980595 1.417653235 0.503587882 2.815105934 0.004876116 0.030283245
TTCATACGTCTC 0.357117352 0.779000728 EMC cnn Is known to be AMCGTSMGIIPLMPTELAFMSEIEPWMQPVGLPGWEESPSPACSLPESPLVQPSPLLEFITQPIPGWESRQNPWDEFMDETGVDQ 1771.642796 1.605960654 0.493505435 3.254190413 0.00113716 0.008287892
AGGAGTGTGTAT 0.270144854 0.789129024 EMC cnn Is known to be HFLNMSITGVPLPDPMSEYLPAFEPSRDHVWDLEGCTLDFLSDVTQDLKDLPDIGELMCGWFQDMPPMYFFGESTPEAWSPPHSG 2638.026755 1.15430312 0.387087881 2.982018238 0.002863549 0.018947598
TCTGGGGAATGC 0.305854978 0.791847901 EMC cnn Is known to be NPAVRQRTHVIDWGANDFMHSAPSPCYDSDYFDCDDLSSPSLCDGHLPGDFPGLDDILENVLCQPIGEAPFEWDFMGVERPPFGW 2982.768087 1.359106292 0.387634451 3.506154547 0.000454631 0.003599284
ATGTCGGTTGTT 0.278168361 0.794288028 EMC xgb Random arrangement IDASPADFPIFVNDASPEPVDNCDPSQMEEWLWPIRQPCPERGEYMCFDVEDILHLPSDHDRMDIENLYYQFQDEHFDIMTYENL 2970.502412 1.209079988 0.390636034 3.095157342 0.001967085 0.013596756
AAAAGTTGACTC 0.466603514 0.797699108 EMC xgb Is known to be KAQEHPPFDIPLDEFFTLYESWDPQSYDEAQICLAIHNPPWHEPTLSDSPQLDNDILSWSDDRGGVLDFGMPLDELGCGKPGDFM 2523.591469 1.791839524 0.571180002 3.137083787 0.001706374 0.011910968
CCTGACGCTTAC 0.568565986 0.797699108 EMC xgb Is known to be ASPTLTMDCDFWDIPTHPTISGFQNFPYPFLGDEEFEIPPIPLDSDESLQVSQLYLCDDGGTIQPPAQHLEDFLSTLEEHQDMFG 2046.603158 2.34315663 0.490118207 4.780798995 1.75E-06 2.00E-05
GGCAAGGACATT -0.324384292 0.797699108 MH xgb Random arrangement LILELYEDAMATQDQFIPGEASIGSPQEVADIAETDTVGIFDSDWLQPCGPMNGLNYFLGVLSSFETQYQNSNVPEENWWWSPMQ 1476.206368 1.094247007 0.400426522 2.732703623 0.006281683 0.037919635
TCATGCCTTCAC 0.692595009 0.797699108 EMC Both of them Is known to be SIMSELSSFIWEDNWQGLPDLEGVFDCEANAFEPLVWGTLDMPEDVGMPLDDDFLTMDRMALLPDTRPGHGPGESGGGKDGGKNT 477.7717219 2.967843883 0.776969822 3.819767253 0.000133578 0.001182162
TGCGAAAGCTAC 0.28296715 0.798840129 EMC Both of them Random arrangement FVPTIMEMPSLGPSPSDQEWTHFEYQNMECIDTEPASWPGAGPSMSDEVPEFLLPPYHCDFFEWLEPAMRNWGFPVQMPSEHETE 2116.96181 1.934190615 0.4161217 4.648136868 3.35E-06 3.65E-05
AAGCCGTTGTCA 0.3521562 0.804423266 EMC xgb Is known to be AMSTPDPTETPHEMPFDWIEETVVCLGKSSTDSESLDDFDLTPLGDMDHAGQLDDFACSLCSMDIAGWGGGSPVSFPTFNSLDSK 2363.496662 1.4745547 0.407220696 3.621021019 0.000293443 0.002460748
AGAGTTTGACGT 0.306329656 0.804423266 EMC xgb Is known to be PDTECYTNSLDDLVPQPSTLEEYESLTPYIFIPENDEKNTDASGWEEPWQTPPGVELHIQGQGLEPTDDAIFGILDEPQFWDGPG 2052.268246 1.344815522 0.360459848 3.730833072 0.000190848 0.001647807
CTCATTGCCCAT -0.303599746 0.804423266 EMC cnn Is known to be LEDTPLPAYDLEQCMSIMDLWANEGGESCWTEPWNTGPLLSGNLLNEEPPPTLLVPYNTGHFHEEIMSRSCVEYVGPMEEGPSWM 1600.080888 1.190273108 0.417277662 2.852472623 0.004338055 0.027492839
GCGGCACGCGCC 0.480448459 0.804423266 EMC Both of them Random arrangement DSPMPEMDVPTGMGNMPAPLLEEGPTHPMDTQLPGLFPHNPQGWSMTEDWFQGFIDKIGWESSCDSHADFWDEAHSLEMWMELFE 349.7891471 2.193716932 0.655307287 3.347615655 0.0008151 0.006139261
CTACTCCTTTAA 0.243318181 0.806902035 EMC Both of them Is known to be HDDLPPYNHFEPALLGNPLFQGPSLADERTDWNNTGIPFWDCLFEQLETLGLLPYGPQPYSPELPYLSVQPEGHDQGGSLDAQLN 3314.624574 1.164333906 0.225819728 5.156032712 2.52E-07 3.15E-06
GTTCGTCGCTCT 1.07364918 0.809467951 EMC xgb Is known to be EVEGNTMPETPDQSPSGHITAGNVEDIPWPQEASDAVWDTPSDQVDDLGENMSTPEWAVDLIFPDLEVCPGEWWPASYTHQLWFG 450.5879696 12.6509297 3.413794776 3.705826075 0.000210703 0.001806649
GTTGAGTAAAAC 0.530644704 0.810691686 EMC cnn Is known to be ALDFENALLMAAWAPGPNELGLDCSELPEPGCLPCDFIWGTEGDYQAMCSTRDAAYMGEEFFPWINQQLSMEDTMLPEDGGQGGA 416.9812547 1.662667201 0.604089676 2.752351626 0.005916895 0.035980531
TGGGCTCAGCTC -0.457075981 0.813828664 EMC Both of them Random arrangement GMEPLPLPMSMVDLSDAFHLPPLPMTDMPDQWDDLPPALHTDHQLSNVDATMPPTLDHILQNMPGEYYWEDIAYAAGENEFDLMN 575.2856845 2.13673838 0.621721055 3.436811995 0.000588604 0.004593892
TTGGGTCGCAGT 0.268161259 0.816769684 EMC Both of them Random arrangement PEFLADFHIDSWTNEESHNWIEDWISDPFDRDWPIDPFVTSHLPTGGNSVQAANEANDNMAMLPLFLQDEDITTFEPELLMIYQI 1328.45573 1.461240929 0.526302494 2.776427899 0.005495982 0.033710506
ACGAAGATTCTT 0.246885121 0.820087193 EMC cnn Is known to be PEQTQDTMSGCVSDHDIDEAIEGAGGAAPELSFLMGHHWGVPAEEWHPFSTNDPAPQHWAFNIMEAPMDMNYLDLYNLLDEWWFG 4305.265355 2.100827455 0.379818748 5.531131542 3.18E-08 4.48E-07
GAGTTTATGTGT 0.251857684 0.820087193 MH xgb Random arrangement EGMEPTPKRTIAEHCNSMLHCEQTTYQEADPTNETFHTDVMTSWDIPNNGFNPVESWISWDTDATWALPEAFLREEWELCLFGGS 4844.629822 0.991903942 0.220000373 4.508646638 6.52E-06 6.88E-05
GTGTAAAGCATT 0.237612869 0.824613008 EMC Both of them Is known to be VEDLLDDMWSSYLNMNPDILMGLEPPCVPPDPSADMESMTCALNADGGSLNDFMEILDTFCMDDTWVEAFTPAWGNCEFMAALGS 2390.299161 1.590341769 0.329914118 4.820472 1.43E-06 1.65E-05
GAGAGCAACATG 0.21654955 0.825006034 EMC cnn Is known to be DNNNDMPDPYAGHPRCPSPILGDLAPCNDDGCTHTQYPGPAGFNPLAPPPEDVLSSDMLNLFDDIPWCDYHFNMMIDHMLFFPPN 4419.546288 1.379341398 0.236056805 5.843260486 5.12E-09 7.73E-08
GCACTGCATTTT -0.323164882 0.830793809 EMC cnn Is known to be DDFFMIDDSSLGNVLNLFADFECDAFYDDILPPPSMYHPGTDTNLFDKLFNLCEDPNWEAVFEVFEEGGGNAGEWTGDNIGAPVQ 1283.65156 1.255875981 0.445831759 2.816928034 0.004848539 0.030225608
TTCGAATGAGAA -0.386613578 0.835367503 EMC xgb Is known to be TPDDDGDDPLFSNPFRMQDWPPDISVSTSIDQDLVDSLMCFQDMALPDLGDLESHELWTPESQEPPAPNEDFLRLGGELVLPALG 1176.303244 1.559166463 0.572915299 2.721460685 0.006499412 0.039091121
ATTGTGTCTACC 0.238861542 0.850500738 EMC cnn Is known to be FDGLPFANEHTWPPAQQWEDLQSDDWNVPNEFYVSPIFPDMLGPSVQHCHDPSDPPGYPGMAEFPEESFAHFLLNDLDTVLEVLT 3586.511692 0.987959465 0.273550999 3.611609782 0.000304302 0.002543207
GCCACTTATACA 0.225989726 0.86213156 EMC Both of them Is known to be PNYIELEEFFERMIDPMLPEWIYDGQIPQAMDIPDPMMGAQPTYDGHGFMDLDPCMVFQPWDGFQDVLDKLFCDNPEGETGVQSG 2464.252545 2.331213568 0.380700885 6.12347819 9.16E-10 1.50E-08
ATAACCAAAAAT 0.281409276 0.870188081 EMC Both of them Is known to be AEYEMPPHGVVEMLTDDIFEELSKYDSFDKEPLPEFMLPSMLSLKDLMDFDLDKDLPDSNNDEIWTAANDAILLPEVQPSGQQAT 964.9009075 3.09685215 1.081058228 2.864648794 0.00417472 0.026615342
AGCGCCGACCCT 0.237757347 0.870818574 EMC Both of them Random arrangement PCEGVWQSDGDTKDLSNSDVCWLTPEAWDVTFTMQESLPWHFLQEIGMDNPWVDIPMDDWAAIFPPMDQEFPDLSAPWPDQSPSD 1902.306833 1.394454994 0.464140862 3.004378862 0.002661237 0.017775055
TCTCGGCCCGGA 0.253649991 0.8716837 EMC Both of them Random arrangement QEITTDWINQWPQLNQIPFDDEMESHEFPPNDDMLCHLHGPAIPVGWPLQDSVFGSPNAQSNTPTMEEWWECWLNMLIDPEVHCN 2740.564962 1.906317589 0.388861693 4.902302348 9.47E-07 1.11E-05
CATTCAAAGTGC 0.213290397 0.875250618 EMC Both of them Is known to be SEAMDVPDSVAHFDAAFPEVLQPLEDEEPDPFSTALDMFGCFSTLYDDQPKSLDDILAYQLLDMMDTEFYGLNGLGGAEIGAGGR 2144.385497 1.390015097 0.38599877 3.601086853 0.00031689 0.002639505
GAGACCATGTTA 0.185989083 0.875250618 EMC Both of them Is known to be PEGGSSDVDYDGEAQGSHGSNVSPQGYPWSWEFAPETPNTLLPEAIFDDTPPATPMIHTSQEFSSFISWLFDNLQDGSGQAYMDP 3073.691514 1.715947318 0.335593209 5.113176524 3.17E-07 3.89E-06
CCACTGCCGAAC 0.24475197 0.876739868 EMC Both of them Is known to be ASMEADPEQPLGGYEEDNAEFAPEPTHDTLSPAQYDGSKPCIMTADQPGSTSPLPFADDELEAILWYQPVEDELAVLLNWSDNGF 1518.148045 2.048249201 0.646974852 3.165886889 0.001546109 0.010899739
CCGGGGGGAGCT 0.282051781 0.876739868 EMC Both of them Is known to be RQADMLDSFTASLAPDAMYVEPIDEEDASELLDFFSDDMYMLPMFEVELLDPGSEDLGQLFPQEWPGGYWGGGGGETDGRAPQFL 766.8660383 1.62851445 0.558813577 2.914235653 0.003565606 0.023251507
CGCTGATGTGGA 0.402744832 0.876739868 EMC Both of them Is known to be QDLPDPTQLPMDDMNPVIPLLHGLDLGTQDEFLDHGDLDDLWKTLDCSLPVLDRVDHFFELEGQGQGGDMGGWGPQGGGREGGTG 666.4569665 6.793219132 2.270598874 2.991818243 0.002773213 0.018398989
AATAATCGCTAT 0.258738163 0.878989767 EMC xgb Is known to be AGGSEGADALASYGTLLSQLDPGDPPKLALWLDGLEIHDFDFELLSVDEPAQDDGCATTPMPQLWDSLFEPSNFVPCGTMEMDPS 980.4978716 1.839444343 0.661382806 2.781209802 0.005415673 0.033259074
GAGGTTATATGA 0.215755014 0.880008708 EMC cnn Is known to be HGFTTQQMGDGSTTYTHNFHEDIAGFWDDFLPSEPFEIPPLLGVLSPEEQEPPWLGVDLFNIWSDECFLTPDVCADGSWGGFPGY 2057.484646 2.043688746 0.373348516 5.473943674 4.40E-08 6.09E-07
CGAATTGGAGCT -0.190811858 0.880168696 EMC Both of them Random arrangement EPSHPQYLDEEAIDPLQDLGQDYPGPSCPLLYPGACPPMPPQPCINIEFNLPDINHEFSSAICEYFLDWMDNQYGWPGPDEHTPQ 2254.965605 1.686893388 0.31569823 5.343372974 9.12E-08 1.21E-06
CGGAAAATACTG 0.226030874 0.880326882 EMC xgb Is known to be AEYEVDDMGVQLVPTGDNLLQSILQYWDQDDLTEQCADPDLGPTWDLNWLEFEWLIPDQILDPVNDVMAPSEPKQWPLSMPPDGI 1464.812189 0.968039541 0.364141845 2.658413351 0.007850954 0.046320626
CGTCGGTCCTGG 0.18192314 0.880326882 EMC xgb Random arrangement GPWWHFYPNNDEEPWWPCCQECFLGQQNARADDPPPESVEHLIGHPFPLVDQLPPEDMPDMPIPEMGNIAYEDDIFADCAWYDPD 4076.251388 0.86849101 0.289365021 3.001368329 0.002687692 0.017927596
GCCTATATACCA 0.262357651 0.880326882 EMC cnn Is known to be TDDPGTDLNLGNLGDAQPGVGHELNEEHFVTYFECWTPVFSMGDQLEESLKADSAFESLWHPEEDHCYPPMMPDIEEIISPEAYL 1994.8818 2.037781199 0.503122525 4.050268271 5.12E-05 0.000485716
CCTGGGCACGAA 0.228988882 0.880374598 EMC Both of them Is known to be DPAMTDSMDAMMAGFPDDRPLIDFDSIMDEAKFDSLFSFAALETIFQSEVQLGDWLMLGGGQESMAHPPSFNPFDWDWGSDTDPV 916.3510406 1.492980185 0.454495232 3.284919355 0.001020116 0.007545811
ACAGCTACGCTA 0.169243042 0.881481343 EMC xgb Is known to be SDAVDLDTQEAEIQYLLSPFLPPRVPEASWTAPDFPRINTSEDFIDTSELDPGSASTTMNDLYMLWEQPMPLYWEYLFNELEDPM 5603.677521 0.968314532 0.340745235 2.841755167 0.004486593 0.028289511
ACTGCTCGCGTG 0.192761485 0.881481343 EMC cnn Is known to be DVLHSMIELPPTLNWTPFPLAGVPVVEPMALDQLYSALDNLDGLGMELPDAMIPELESNSEAQEGYDPVDTGFEEGWEGFMYQWP 2085.635296 1.493627199 0.434113303 3.440639091 0.000580342 0.004536554
GCATCTGTCTCA 0.245574288 0.881481343 EMC xgb Random arrangement PCWVCYQDPVWCPNAYESPSNHEDDGAELGIHEFMEPFTLMHSEPTEMEHNFYVLPEFDEPWEIASPEFNWMCEDIPGIADDVEH 1321.408705 2.678378149 0.619345257 4.324531621 1.53E-05 0.000154288
TTGCTGACCCGG 0.239698481 0.882068216 EMC xgb Is known to be PSTQPQLSVMTASHGSLADLYDLMVDYPVDSPTCKASMDHRKDAKPNTTEEATLDAAAYASLDMFDFELDGMFEQGSMWIMEDYA 1521.108993 1.798820545 0.508984884 3.53413353 0.000409114 0.003291511
GAGGTCTATCTG 0.168247496 0.884891413 EMC xgb Random arrangement PVRDEDEPPGNCFAESMYVPQGFEFVSEEMWGSLISGLCREWDAPVAEHSPETTDNAFPGDAEFDLNNPNSLMYSILYDCDTNDP 4034.866062 1.961639794 0.308125903 6.36635795 1.94E-10 3.41E-09
ACATCGGAAGTC -0.21237113 0.884971877 EMC xgb Random arrangement IAGLMEHAEAPSQHRAVEQYDQSQDSLVGFDVDHQSAATCNMYLDYWMATLRGSPMQLDDIYPLFPGWEDWNPESFCIDFATLDP 2612.754489 1.663863862 0.451887808 3.682028666 0.000231385 0.001973745
TAAGATAAGCTC 0.193305857 0.891106104 EMC xgb Is known to be GWSNDESGFTGMDLIGKLLEESLTCDVQVESLIFPELAAMALGSLDENELAMEPLGPEGLNPLLATYTMEASPAGSESFYMETMP 1288.097078 1.811585908 0.501081781 3.61534978 0.000299942 0.002511004
TGCTATCGGATT 0.211112472 0.895963871 EMC cnn Is known to be PTSTTLDSTNIGGIITLPEAPGHYNPEPPMVDIYSTALSDLWNCLWSDLYSDMDTQPDLNLSYTPADTWNPPLDGGNQGGAGDDD 3682.745789 2.733220886 0.478130758 5.716471574 1.09E-08 1.60E-07
GATAATTTAAAC 0.197665737 0.898964957 EMC Both of them Random arrangement KAAYQDNGPSPCTDFVNDWCISPELLADIHDFGWQWDHLPKFDPMTTDDLLNFNDPISTEHFKHPHMPSLEDPLPEETQAAPGSP 1489.878648 1.602015583 0.407196581 3.934255978 8.35E-05 0.00076593
AAAGGGGTCCAG 0.209157379 0.899665116 EMC xgb Is known to be ICEETPSMISPLSEAWDDFLGGGSMSLPGGGFGGGHNEWCLLDHLSDQGILLYDLGPCGSGDNTPLDNDGTDIIQLLSSIMGQEP 2618.867632 2.224899685 0.372824495 5.96768645 2.41E-09 3.75E-08
ACTTTATACCTG 0.157685909 0.899665116 EMC cnn Is known to be PGENNTLDACDNDINLAPTGACEFSPPLPCWGLPYTLSNEQHEDIICWEEGLGNYGNELEEELDLLGNAWQGMLMEQPAEPGISG 3647.202129 1.497994929 0.379603818 3.946206175 7.94E-05 0.000734147
ATTCGCGTCTAC 0.193873722 0.899665116 EMC Both of them Is known to be DACCDSWLMFDCGDMSPGCPPEHAAIMADTPLQMQSATDLELGLGLSTEDWMGSLNLDLLNMLLDSDGGEFTYFPDFLGGLGWAG 3637.292386 1.783041403 0.456478041 3.906083625 9.38E-05 0.000854583
GAGATTTCTTCC 0.19188809 0.899665116 EMC Both of them Is known to be ALTGEPMIDDMCGMITADELYYTPNPAPPVNLLDTMLSGGLEDDVESLILNLFFDSAMHDAWTDSMLGPLESGLAGAPSQSHEEQ 3626.349013 2.236305516 0.313030079 7.144059529 9.06E-13 1.97E-11
TATCCCTTATTG 0.193362075 0.899665116 EMC cnn Is known to be HHMGSSTETLVRPIASSPAGTEYVHDEPEPVYMTPYGPPDSWTPMDSQNFFDHGMQELLYEDLQAMVESWWKPPIMDIFGDEGGW 3087.714239 1.006069299 0.289251841 3.478177688 0.000504835 0.00397137
ACTGGTGTCCGT -0.199628979 0.904112763 EMC xgb Is known to be SDTDWTADNLECFPSLDLDSDQMAELLSLGHAGNNQQLCWFELDMLVSDSPDPGGAFPGDSDGSNDSPLDMLGYAGPLGNTLHLE 1775.984741 1.756068547 0.38284156 4.586932903 4.50E-06 4.83E-05
CTATTCGTACTT -0.154486757 0.904112763 MH xgb Random arrangement EPDHENGNCFKAWEAAHKWMYPFTGAWDGNYFWCQALPAQMEDLIMPNDPNVEDHGEVASEESYPSFDITQDCWDLFDNPLMTDD 2409.233382 1.319829815 0.335255716 3.936785421 8.26E-05 0.000760722
GGACGTCAACCC 0.170999331 0.906699741 MH xgb Random arrangement PGLSESSPSAWKGVPGYFTDGIGDPFRGPSMEQHANPHTQPGPTEGGDTPYDPPAGDHDFWMLLSGIDGPEIWPNVLSQIIEHDQ 2327.179157 2.075370304 0.417240233 4.974041665 6.56E-07 7.91E-06
TATTACTCGATG 0.247769604 0.906699741 EMC Both of them Random arrangement EFAWQLLQPVGVEPSWCTLSQLDLPDVDAHLEQATSNLFSQPYMHDFDPTQDDFSTINELPADAASQSDDPEPAWMMFDLDTTWP 1297.73368 1.590926177 0.488165975 3.258986204 0.001118111 0.008173095
AGTCTTTGAAGC 0.129464185 0.912372094 EMC cnn Is known to be LCGSPALQINDSFSTSGMFQEATWLDNDLAMCPYTPGPHLQDLLGESSDLPDCEVDLLISNELHEECLWEGLMPDNSMIMPGIGL 5852.614787 1.185466501 0.367393169 3.226697177 0.001252279 0.009020778
GGGGCCCTTGCG 0.126166386 0.920873484 EMC Both of them Random arrangement TAANVAQPDHTPHDTAECSPRFALTPAFDANFDMFDWINMNDSQFDSAIPEAFDYLTTLLDYIPMINMEVAPPDMEHIPFTWDAM 5227.127487 1.35536555 0.430990473 3.144769163 0.001662179 0.011651712
ACGGCGAGATCG -0.178170909 0.921341215 EMC Both of them Random arrangement CPDEMVPMAEDFFQPPDWLHELDYDWVCEPMEEFNEHLWMDWHIIPDDLLAPFSALPDSFMQGASGCSDDIQDESTQEDPSDEDP 1968.321565 1.140075476 0.335821793 3.394882348 0.000686581 0.005283688
ACGCCAGTGGAG 0.133248574 0.924095142 EMC Both of them Is known to be WETMPGPGAAYEEMIVPKWCVTPWDDDDIQELLCDDDPFGMPSLDDLYDLEVTFPEDIPHGKYVTDVFSDDEFHQHLAFPDPMVL 2184.237951 1.193030613 0.31051324 3.842124774 0.000121974 0.001087234
TGAATCCCGAGC 0.187535862 0.92633444 EMC xgb Is known to be GSSGGHLDICPELWEWHNTLGEAAPCFYPDEDDPVNGIPPQLDLDYDLWLKFEHLFEGMAPQDSVSHPDISFSWDSCPEGLPPIF 1316.32894 1.185531784 0.443796889 2.671338653 0.007554938 0.044894813
GTTGCGTAAACT 0.12383078 0.930310828 EMC Both of them Is known to be DPGGQAVQTADTFDPYLIDYMHGCDTPWQADEFTGREWDPMDMLIFLDDLQLDLGLLYDLDDTAMAAFMNQEAGEWLGTGGAGDP 2030.50667 1.914597985 0.569145248 3.363988353 0.000768248 0.005821768
CCCGGGACCGCG 0.162236798 0.93095438 EMC xgb Is known to be SSDTCLQFWEGLLSMNDFTQPLPFPSYDMLLPPAEEMSLPMTDTCLTPEVVPGVPEMDHKDALAMLHCVNLDDFLFGLAPPPGSK 849.3105549 1.737298066 0.608658667 2.854305968 0.004313097 0.027404755
TGCCGGAATTCG -0.168157585 0.931048159 EMC Both of them Random arrangement IVHGVGEDLMDFLEDPFVDELCDNDWPPQPWSWAISDAHYMTMPRDVACPWMEEENLDMCADYIYMMDSPHNYDLEFPLPDENQQ 1262.968659 2.278066698 0.643063353 3.542522969 0.000396319 0.003209406
AGACACTCGTTT 0.10463046 0.933969604 EMC cnn Is known to be EDLGVQGTDMILPNIWTVAPNHPQPARDSDLPPTTLESYAGDSHSLSQGLQGSNYILDDLLKELAQQFDEYMGFDLLPCNAWDLP 2765.572127 0.931587365 0.304680464 3.057588115 0.00223126 0.015231579
CCATGAAGTTGT 0.108991255 0.936763481 EMC cnn Random arrangement TGYPPVAGPNFSAYLGFDLGCCPHHPVNFEDVHTADAESSEIILGDSLLDHPMFEPVDWNSACYEDFFDICLQNEDLDTMIMTEQ 2816.031797 1.493584683 0.375893675 3.973423293 7.08E-05 0.000658757
TTCAGACACGCG 0.1110848 0.938739085 MH xgb Random arrangement DDGNSLFWCNVRTDLGPCVANDEIIPCEAATCVYNPEFSFDLDQAAEQIWRSEQLDLSPPTQTEPNDMNWEDMGMDFEHVEKCFC 2830.081118 1.376925805 0.35661558 3.861092677 0.000112881 0.001015315
CTACCGCTCTCA -0.134816961 0.939406874 EMC cnn Is known to be SASGSMLEGLGGTCQPYDYTVVPEYTLDADDAFFPPGPALESSLMEMNWPPDTWLWDMHGDFFDIGLNDDQLQGVLQDWIAFFHE 1927.094844 1.852085995 0.610069163 3.035862339 0.002398488 0.016194698
CATTCGTAAGTA -0.125270928 0.941656341 MH xgb Random arrangement TNPMYQELWDFEWFHVHNSDNITPEPHGVNDQPTISTGIFTPIEELAFEWLDHEPMAQDPCACPCDFTQPDHDQSLVPMWTALND 1348.898577 2.844780261 0.422888114 6.727028181 1.73E-11 3.37E-10
CCGTGTTTGGCT -0.093490099 0.941990281 EMC cnn Random arrangement PGGFRQILFENDHDECDPIVHEVSPVSHWEVYNPYPWPPLESYSDWAMQQYDVGGQPSNNPESNFDQIWAHLEEEYFSGMFTMQM 6297.667268 1.075198066 0.318149126 3.379541157 0.000726069 0.005553086
AACGCGTCCACA 0.127613256 0.943885889 MH xgb Random arrangement LDSTEPKDTGAPHPGTAGCEFHPLELLNENNPEEDQPLIETWLAGSPLRLEPPQWDAGAWEFKEWVPADADMDTMGMDQIYTMNC 1184.045615 1.823189255 0.645844723 2.822952931 0.004758356 0.029813414
ACCACGGGATTC 0.127307473 0.943885889 EMC cnn Is known to be AVAPLSDHTNLCAAMSPINIPFGYALGTLDEFVSPPTSVSHMGAIDDVFQSLFELLDGAGTESWDVFSPISPTDDGLWMFSTVGE 1136.245286 1.532950758 0.499737587 3.067511426 0.002158492 0.014775534
GAGACAGAGACG -0.120736088 0.943885889 EMC xgb Random arrangement NDPTITGPEPPNQEHGHGDMTNGGCEDADTCYKDPLQEDSMVPPIYTWSAEQAELHWEDFDGPFGIWGMNWDTILVDEIWGDDFR 1294.950658 1.396008033 0.429138975 3.253044152 0.001141758 0.00830512
GGAACATTAATC 0.165670927 0.943885889 EMC xgb Is known to be GFPAVLAPPTDLGNLDELTLLNTLASIGYSGELDDPPLDALGDGDSPDVDSLQASIQELFHPQEQPVETQEMSWFDDQYAPTTLK 782.2898385 1.896290897 0.534855452 3.545426882 0.000391978 0.003189888
GGCTCAGATTCA 0.123810939 0.943885889 EMC cnn Is known to be LSDFDFALSMDPWNPDTTSLEPEEGTQDTAEGISDAGSPPVEDILELISRDTSPHCTGSNDAIDMFFSTLTEWEMWDGILPVIPG 1714.062031 2.360451125 0.824067256 2.864391355 0.004178115 0.026615342
GGTAACGGACCT 0.112081539 0.944720487 EMC xgb Random arrangement APPFTRNMSDTVHDYSGGAPIYEWDHTQEQEAQEPIDLGEPHEWHDYEPSMLWYQYWPGMDFHDIVGPQEDDEWLGLDIFESTLL 1876.719179 1.226967288 0.406690711 3.016954299 0.002553283 0.017130867
ACGCCATCTTTG 0.108043609 0.945134648 EMC Both of them Is known to be TGGLATTTPDFTINLDGDWFPVEAFVSADRLDSPSLFSGALSPEAMPICPDLLDELMLEGADVWMQDCGQIDSFTLFDDKVSCFM 2451.496571 2.168653815 0.523747824 4.140645015 3.46E-05 0.000336552
GTTCCCTGGAGG 0.09778175 0.945134648 EMC xgb Is known to be YGEMVNGFGDFADACPCDDLMGPTYVTDHPLPPGWEQRVDLLPEDWGTDTPPATPSSEMPMPFSDNDPDMTLEQLWGIFSDIIMG 3516.107895 1.489778939 0.405926403 3.670071542 0.000242483 0.00205778
CTCAGCTCCTTG -0.091591189 0.945208758 EMC Both of them Is known to be VTPLGSSPWYGDGNHLEEDFLKTNVTQPVEDEWYVVDCDVGPDLLGLETQALEPAMDLWDMGTLLDNFMLGGPEPTADICIIDQV 5569.362324 1.670051908 0.276357855 6.043077397 1.51E-09 2.43E-08
ATGGGATACGAC 0.129209775 0.946615625 MH xgb Random arrangement EMSMEPMDASDRPLFSGLEITNMCTMSNEHLYWSDALESAHDANLINICDDLTITPNAAQDGEWDPWYDQVWQDGNIRMFDWMNV 1135.551552 1.79498866 0.543031671 3.305495342 0.000948087 0.007065742
CTGAATCTCCGA 0.129054318 0.946615625 EMC xgb Is known to be DSSHSGHGGWGGVDALGDFDEPPITSVDSILWWVNDTMQTIFPEAAICDTDIPDPGLLWPMPMPSPPPPDWEFEMDHLESMNISQ 737.5421913 1.676302664 0.525350059 3.190829877 0.001418648 0.010101751
CCCACGTGACCA 0.089386726 0.946630362 EMC Both of them Is known to be NLGYGGPPEDCMYEEIAGLEDDLWTNFTNWEPVTPVDLSDLNNLLWAPPYTEDALLMDFMSLTPGGMDGLANWLLGLDPPNSAVP 2982.969573 0.830268771 0.2647496 3.136052974 0.001712383 0.011936102
AGGCCAGTTCAA 0.109212935 0.948669104 EMC xgb Random arrangement DTETRDTSREWPSGIITQEFTAAQPEVPSVHSNCDNSPYEDMGMGDSEHTPFEFDNEWALDHHDWGDLDVNPYNFCDFFAPCTDP 901.0746968 2.621851192 0.740502866 3.540636116 0.000399164 0.003221961
CTCCGTGCGAGC -0.118647707 0.950608654 EMC Both of them Is known to be SEMVSDDSSATEEHGQEIPLEIDDLYEALGQPVIAPLDPWTMCGEELLWDLGDLAELLDPGECYQHKGGDGGGQDMGPVPWFMIA 962.3261616 2.116205299 0.559472155 3.782503347 0.000155259 0.001359476
TTATATTAAGAT 0.112562661 0.951029653 EMC Both of them Is known to be VTVTNPPGAEMMDSNPDIWDLEVPQIDDLSEILDEMLTDDLWDTIWSDIDTEDVCNGGWTLCQPLEYPNYAGYGLSQDTGDGWPG 1037.076639 1.815717774 0.643359211 2.82224571 0.004768862 0.029841518
GTAATCGATCTC 0.101363765 0.952730739 EMC xgb Is known to be PQDILSVLPGIDDLFQLLLETEYVFDSTHFEDANNEMDLAAASAPEGQAVMPSPEQEPVLSPSDLFAEHELPTYFQGTINWYGKG 2030.767961 1.507550283 0.414954433 3.633050191 0.000280091 0.002356755
TCACGGCAAATT 0.105826615 0.952730739 MH xgb Random arrangement APPKGVIDGEQSAEDAAWIDWFPLPFSLHPVTDHCFGEWDIDDWEEPLPMVQPEQDQLWKHAAAESWWENTNVDWALFQPGYTGN 1736.083832 1.322401456 0.428978455 3.082675696 0.002051486 0.014081945
GGCGTTCGAACC 0.079603449 0.953703029 EMC xgb Is known to be GEDGEAGITMEGCDLLSGPTLGDDNPYSERDFDLFWMTADWDMQDDSMLTPEIISMCLPIGPHLEMGQLEGTSDEPHIFDVPTGL 2720.642336 1.958706353 0.390523834 5.015587222 5.29E-07 6.44E-06
CGCGACCCTCGA -0.08789682 0.954528626 EMC Both of them Is known to be VHFMEELVPDWPTWQDNGPFFLIPHVLDTNLFPPSHNEPSAEDELLDLDNPAKLFPDVDFPQFVPQPFTYTWPQELEDLLTYDHI 1890.310077 1.03576394 0.338101419 3.063471139 0.002187853 0.014955862
GACTTCAAGCGG 0.174537037 0.954528626 EMC xgb Is known to be DSPCNLVGSPNMDDNPLIDLLQGYDLMQDDEFLDHGALDDLWATILSDVYPWDEPDRGLPGENMGGTGGLQTFATSLPTGASGAF 582.982711 1.991313977 0.68995804 2.886137798 0.003900014 0.025069348
ATACTCAGACTG -0.107063453 0.95628508 EMC Both of them Random arrangement CESPQIQPGDAQMGSPSVWHDCTVTEEWSTIPYVDAMIMQDITQSHMDVIAPPPPLAAHPSIWGEIADGAPDWMVAAWDEFTNYL 2091.014773 2.855949655 0.519816176 5.494153098 3.93E-08 5.45E-07
TCTCTTCCAGGA 0.088026273 0.95699198 EMC Both of them Is known to be MPEAKPHTSAILPSWPMCPCEVHSLVEDQRQEGPPMTPGVLQQIENLDSAPGLFNLDNWFTEFDLFRLDNEVYEFLGILPQEEMY 2024.635036 1.927146854 0.404658695 4.762400707 1.91E-06 2.18E-05
TCAGTGGCAACG 0.063084249 0.963959981 EMC cnn Is known to be PMPLEPGMPGDFEGPPHPVDFEPSPQRCSPNPPQYLYLDLPDDDTYLLPDLGNLFETDDIQLMDMTPTPMWLDYLDIDGMSSVWR 2792.897775 2.134777615 0.318554436 6.701453102 2.06E-11 3.95E-10
CGACGAAGGGCG 0.068050737 0.964130373 EMC Both of them Random arrangement PGYEHNDNQGGGPDPCASAYGEGEHVECQVVREALKPCAWSMAETTNIYQPPSNFDDLPMNWLAPLDYGTLHMWDDDQMDHWWDF 2433.745371 2.514062317 0.697606871 3.603838236 0.000313552 0.002616103
TTTGATGTCTTA -0.054335773 0.971246383 EMC xgb Random arrangement PATVDTSYPFGPTEDMAPALYEAITLSDEPGDSPTPTNDGSECAPPAWDQDNLPWTLDFFLNHKMYWDHMIFNQDDQEVWDCDLQ 7585.859736 1.973985264 0.25215172 7.828561578 4.93E-15 1.29E-13
ACAAGGTAATTG 0.055797001 0.9718663 EMC xgb Is known to be DDCGANSDIDSQYTAEEFFNYSEWEALNMLLERCNFDAFWDLGTENLSDMFSLLGTEALGPHGGLMDGGGEGMGLGGGGSGGGAL 425.6472243 2.869162577 0.877397587 3.270082595 0.001075161 0.007917529
AGCCGGAACAGT -0.049432068 0.974181918 EMC cnn Random arrangement GCETDVAELNANDGPKPYPPEDEYNCDTPIMYGLTTSFHDHTTLTQAQPPAQHDDPAAMEFMFDNFNDFMNSYIDTWTSSDIAIC 2799.604954 1.307542551 0.304053685 4.300367386 1.71E-05 0.000171067
CATATTGAAAAC -0.077641288 0.974181918 EMC cnn Is known to be GGPVQSQSADMWWDTLFPDVPMEDFEPEATEYTHGQVATMMHEPCTLDPYKDIMSPDPDAAFPIMLASFDETVGLNLQCMFPELP 801.8187173 1.939005928 0.533781732 3.63258203 0.000280599 0.002357034
CTCGGGCCGGGT 0.063369678 0.974805629 EMC xgb Is known to be GGGGMPAMGDHLSDVGMDFDLDMLLSNGAELPRTSDCEFGLHDLLLPLLRESSSVYPLDLGTVQDLMPTRGSVDGGGFDYGGSDA 1279.317008 2.989468704 0.570537879 5.239737475 1.61E-07 2.07E-06
AATCAAACGGCG 0.078095914 0.978086751 EMC Both of them Is known to be RAALDIEVPLGTWSGTTQLDMEECWDMIPDVDEFYTFEDFFQDFLDSLSPSPDANLCDSLAMFAGSENDEAMMDGESMQNEDRGL 1682.64982 2.991411322 0.54200408 5.519167531 3.41E-08 4.76E-07
AGTCAATTTGTA 0.039958999 0.978968472 EMC xgb Is known to be PPGEDCEVPASWTREQPVRASRGVGEHSSDVPSEVTNSISWDDIDELLNSWVSPWLSLSPTPYDFTNFNLEPEELDIDHFDWVYA 3730.356702 0.940431665 0.290184367 3.240807476 0.001191917 0.008610989
GGAGTCGCCGTT -0.038584031 0.979158812 EMC Both of them Random arrangement AMAFTEELMDNTLDDITNHVFPYPLTEDWLDMYVLGDMLNMIEMPDILAGDLQFIPHEFANPCGPQHDFQAPPMTHESARWQWDD 4345.718315 1.691360015 0.34685006 4.876343446 1.08E-06 1.26E-05
TTGGGTCAGGAA 0.056988207 0.983105362 EMC xgb Is known to be DESDLSLAAPIPNILDPPSDVHPYDEAVMEGDVDAGDLSLHNPPDLDDLYSDEIQEAMWSNETWDTGSPLYPFDAALLDAIFQGS 672.8901405 3.058028978 0.459806818 6.650682104 2.92E-11 5.50E-10
TGCGCGCTCTCA 0.027377458 0.985972545 EMC Both of them Random arrangement CAAPEGVEFDIDYPWDNVLPNVHAMEVHQSFPLPPMDPFDLQDPNEEQVPPSMSWEPYELIEFWDSIANWQHESPGIMLTFENSG 4886.088068 1.083004155 0.30420381 3.560126867 0.000370676 0.003046548
GCCTCCATCTCC 0.04618398 0.987195653 EMC Both of them Random arrangement DGVQAQAFPDAKWLDCLWPLLDDWDWCHDPTISYGSPFAPSDMTELDLCQDDWTLMEFLAGVPDMSPPEASNIDSSGDKENMDWC 680.1289921 2.416402417 0.849127618 2.845747053 0.004430737 0.027972911
TGCGAACGACCT 0.026701051 0.991080426 EMC Both of them Is known to be ADPDYSVDLLADQELFNEFNTMWGDFDDFWGWPCTPPGALAIPNSLPSALPDNDTHTMADFEADPYAISGLLSWCMGPGLQQAEY 2730.141688 2.635285296 0.6031528 4.369183558 1.25E-05 0.000126914
TTTCCCTGACGC 0.019259954 0.991080426 EMC cnn Is known to be PPTQQNLEADEDIHNNFILDPEPPSPGVSDNWPYMPLPHSVLPPQDSPLTPYLNYITDGQLNMEMYAEFGPFSWTDLDECCGGQC 7152.204529 2.626756823 0.388907857 6.754188114 1.44E-11 2.84E-10
ACCGCAACCTAT 0.01437591 0.994750304 EMC cnn Is known to be PMPLSGAADRDNDDAEEYFPISEEEALTQLLELNNWDAFLDLMTEMLSDMPMGGEEATTVQLHGGPMGPVPLLGADLFDHFGDDA 2188.778491 1.172261331 0.317784447 3.688856844 0.000225264 0.001924841
GAGATCCGTAAC 0.021377799 0.994750304 EMC xgb Is known to be LPGEQSAEDSHIWPRYMGPSPSLDSQPHPFDEDKSYTALFSHEPRIFDLPSLDTEWLADLGEMDCAAFMDELEDMGFTLGSGTLP 3355.34538 2.439445339 0.395441125 6.168921711 6.88E-10 1.14E-08
CACGGTTTGAGC -0.015136782 0.994842464 EMC cnn Is known to be ETPDLFDVDMPLTPTDDVLVRWILDNDGEHDAPMPLWTPSSFSELDTSDWGIPSPFVGEYDEWQNYMASMTNNITGGGGDEKGEY 1855.147846 3.019922449 0.557826394 5.413731731 6.17E-08 8.29E-07
GAGTTTCAGAAG 0.048836342 0.994842464 EMC xgb Is known to be MFDFEDWTPFDLDPGLPWSGATPTEVLPMVLHSEGQTPDLLSDWEQTLPDLGDPLALCNNTFDDFSDSTWLSIATGGWHMDPVLF 208.1416009 2.723348907 0.944743183 2.882634092 0.003943652 0.025284271
TCTTGTGGACAC 0.016470245 0.994842464 EMC Both of them Is known to be QYPLEPHSDDPTVDMSSGDTVPLDVEPLYEGTFWTPGPVLQDFLSWFETPNPAPNNAYQEQWGNWLEGFCENPPYSESFNHPQDG 2465.066019 1.764150488 0.355487799 4.962618952 6.95E-07 8.33E-06
TTATAAGGTAAG 0.017896687 0.994842464 EMC Both of them Random arrangement EPDQGNAPDIFMPSDHEENPEVHEYDIMALMQDFPTGVAAGPTISLPCGLHGACPCYLGWDDSAFNEEPLCMDLNLDDVMREAQY 960.6384366 3.021497096 0.645043364 4.684176696 2.81E-06 3.10E-05
GGGGGCTTTCTC -0.004857212 0.998274374 EMC xgb Is known to be TNHALYCDCVKEGTPYVQPQEHPRPPEAEDSWTDVFLLDDKGYADIYDLLDPGSEQFNSMDLPPSFELGEGTWDDEGGIGTTSHV 1938.330705 1.532771506 0.429876814 3.565606372 0.000363016 0.002993522
ATCCCGACACAC -0.002690239 0.999193785 EMC Both of them Random arrangement ADDHFDHIIAEMAHTFVCNCNPNCPPENPNIFLPPTQEHHDLLDMWHNDSFECLSGLEMNPFVFGPLETPDYEWASFMPMDDNPD 3392.326115 1.379735382 0.275205089 5.013480624 5.35E-07 6.49E-06

Claims (93)

1. An engineered gene effector, the effector comprising a polypeptide comprising:
A first peptide of 75-110, or 75-95 amino acids in length, wherein the first peptide comprises any one of SEQ ID NOs 3-100, or a sequence at least 85% identical thereto, and
A second peptide of 75-110, or 75-95 amino acids in length and heterologous to the first peptide, wherein the second peptide comprises any one of SEQ ID NOs 3-100, or a sequence at least 85% identical thereto, optionally wherein the first peptide is different from the second peptide.
2. The engineered gene effector of claim 1, wherein the first peptide and/or the second peptide is 85 or 108 amino acids in length, optionally wherein the first peptide and the second peptide are each 85 or 108 amino acids in length.
3. The engineered gene effector of claim 1 or 2, wherein the first peptide comprises any one of SEQ ID NOs 3-100 with 0-3 amino acid residue mutations and/or the second peptide comprises any one of SEQ ID NOs 3-100 with 0-3 amino acid residue mutations, optionally wherein any mutation thereof is a conservative substitution.
4. The engineered gene effector of claim 1 or 2, wherein the first peptide comprises any one of SEQ ID NOs 3-100 and/or the second peptide comprises any one of SEQ ID NOs 3-100.
5. The engineered gene effector of any one of the preceding claims, wherein the first peptide is located N-terminal to the second peptide, wherein any one of SEQ ID NOs 3-100 of the first peptide comprises SEQ ID NOs 3-100, and wherein any one of SEQ ID NOs 3-100 of the second peptide comprises SEQ ID NOs 3-100.
6. The engineered gene effector of claim 5, wherein the first peptide and the second peptide are arranged in pairs according to any one of the SEQ ID NOs of the first peptide and the second peptide listed in table 4, wherein the first peptide is located at the N-terminus of the second peptide.
7. The engineered gene effector of any one of the preceding claims, wherein the first peptide and the second peptide are linked by a linker, optionally wherein the linker comprises any one or more of SEQ ID NOs 2211-2221, optionally wherein the linker comprises SEQ ID NO 2211.
8. The engineered gene effector of any one of the preceding claims, wherein the polypeptide comprises any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence at least 85% identical thereto.
9. The engineered gene effector of claim 7, wherein the polypeptide comprises any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451 having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
10. The engineered gene effector of any one of the preceding claims, wherein the polypeptide comprises any one of SEQ ID NOs 1085, 122, 1084, 653, 1099 and 1107, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
11. The engineered gene effector of any one of the preceding claims, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, optionally wherein the target gene is endogenous to the cell.
12. The engineered gene effector of claim 11, wherein the target gene is a silenced gene, optionally wherein the silenced gene is a methylated gene.
13. The engineered gene effector of claim 11 or 12, wherein the engineered gene effector is capable of increasing, or increasing by at least or about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, 250%, 300%, 400%, 500% the expression level of the target gene, optionally wherein the engineered gene effector is capable of increasing the expression level of the target gene by a percentage within a range defined by any two of the foregoing values, e.g., 10% -100%, 100% -200%, 200% -400%, 250% -500%, 10% -50%, 50-100%, etc.
14. The engineered gene effector of any one of the preceding claims, wherein the polypeptide is coupled to a heterologous endonuclease, optionally wherein the heterologous endonuclease is a Cas protein.
15. The engineered gene effector of claim 14, wherein the heterologous endonuclease is, at most, or about 450, 460, 470, 480, 490, 500, 520, 540, 560, 580, 600, 620, 640, 660, 680, 700 amino acids in length, optionally wherein the heterologous endonuclease is within a range defined by any two of the foregoing values, e.g., 450-700 amino acids, 480-600 amino acids, 500-530 amino acids, 500-600 amino acids, etc.
16. The engineered gene effector of claim 14 or 15, wherein the heterologous endonuclease comprises the amino acid sequence of any one of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
17. The engineered gene effector of any one of claims 14-16, wherein the polypeptide is fused to the heterologous endonuclease.
18. The engineered gene effector of claim 17, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
19. An engineered gene effector, the effector comprising a polypeptide comprising:
A first peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator, and
A second peptide comprising an amino acid sequence of 75-110 amino acids in length and based on a human or viral transcriptional regulator, wherein the second peptide is heterologous to the first peptide, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene.
20. The engineered gene effector of any one of the preceding claims, wherein the first peptide and/or the second peptide has a beta factor of about 30 to about 65.
21. The engineered gene effector of any one of the preceding claims, wherein the first peptide and/or the second peptide is enriched in a negative electrostatic potential.
22. The engineered gene effector of any one of the preceding claims, wherein the first peptide and/or the second peptide has a negative net charge.
23. The engineered gene effector of any one of the preceding claims, wherein the engineered gene effector is capable of activating a target gene in a cell, wherein the target gene expression level activated by the engineered gene effector lasts for a period of time that is, about or at least 9 days, 10 days, 11 days, 12 days, 13 days, 14 days, 15 days, 16 days, 17 days, 18 days or longer, or optionally, the expression level lasts for a period of time that is within a range defined by any two of the foregoing values, e.g., 9-18 days, 9-14 days, 12-18 days, 14-16 days, etc.
24. The engineered gene effector of any one of the preceding claims, wherein the engineered gene effector is capable of activating a target gene in a cell, wherein the expression level of the target gene is increased by at least 0.1-fold, 0.2-fold, 0.3-fold, 0.4-fold, 0.5-fold, 0.6-fold, 0.7-fold, 0.8-fold, 0.9-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 5-fold, 10-fold, 20-fold, 30-fold, 40-fold, 50-fold or more, or optionally the expression level of the target gene is increased by a factor within a range defined by any two of the foregoing values, e.g., 0.1-50-fold, 0.5-10-fold, 1-40-fold, 2-30-fold, etc., as compared to a control.
25. A fusion protein, the protein comprising:
The engineered gene effector of any one of the preceding claims, and
A heterologous endonuclease coupled to the polypeptide, optionally, wherein the heterologous endonuclease is a Cas protein.
26. The fusion protein of claim 25, wherein the heterologous endonuclease comprises the amino acid sequence of any of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
27. The fusion protein of claim 25 or 26, wherein the polypeptide is fused to the heterologous endonuclease.
28. The fusion protein of claim 25, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
29. A polynucleotide comprising a nucleotide sequence encoding the engineered gene effector or fusion protein of any one of the preceding claims.
30. A vector comprising the polynucleotide of claim 29.
31. A cell comprising the polynucleotide of claim 29 or the vector of claim 30.
32. A system, the system comprising:
the engineered gene effector of any one of claims 1-24;
A heterologous endonuclease coupled to said polypeptide of said engineered gene effector, optionally wherein said heterologous endonuclease is a Cas protein, and
A guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell.
33. The system of claim 32, wherein the heterologous endonuclease comprises an amino acid sequence of any of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
34. The system of claim 32 or 33, wherein the polypeptide is fused to the heterologous endonuclease.
35. The system of claim 34, wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
36. A combination of polynucleotides encoding the system of any one of claims 32-35, wherein the combination of polynucleotides is configured to express the heterologous endonuclease coupled to the engineered gene effector and the guide nucleic acid in a cell.
37. A kit comprising the engineered gene effector, fusion protein, combination, system, polynucleotide, vector, and/or cell of any one of the preceding claims.
38. A method of controlling a target gene in a cell, the method comprising contacting the cell with the engineered gene effector of any one of claims 1-24, the fusion protein of any one of claims 25-28, the polynucleotide of claim 29, the vector of claim 30, the system of any one of claims 32-35, or the combination of polynucleotides of claim 36.
39. The method of claim 38, wherein the target gene is endogenous to the cell.
40. The method of claim 38 or 39, wherein the contacting is performed in vitro or ex vivo.
41. A computer-implemented method of generating a functional biological sequence, the method comprising:
(a) Providing an fitness function trained on a biological data set, the biological data set comprising functionally determined biological sequences having a fixed length;
(b) Providing in the computer a plurality of different sequences comprising a fixed length, each sequence being associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of a temperature gradient;
(c) By the computer, in parallel in the plurality of different sequences:
(1) Selecting one or more random positions for introducing substitutions in one or more of said plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and
For each of the one or more sequences, evaluating a first fitness change resulting from the introduction of the substitution at the one or more randomly selected locations and accepting or rejecting the substitution based on the evaluated first fitness change and optionally further based on the temperature associated with the sequence, and/or
(2) Selecting one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a successive temperature of the temperature gradient, optionally selecting up to 3 pairs of the plurality of different sequences, optionally selecting 1 pair of the plurality of different sequences, and
For each of the selected pairs:
Selecting one or more domains for exchange between the sequences of the selected pair, and
Evaluating the difference in fitness of the sequences of the selected pair due to the exchange of the one or more domains, and
Accepting or rejecting the one or more domain exchanges between the selected pairs based on the fitness differences and the temperatures associated with each sequence of the selected pairs, and
(D) Iteratively performing (c), wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domains of a previous iteration are swapped into the plurality of different sequences,
Thereby producing one or more functional sequences having fitness at or above a desired fitness threshold.
42. The method of claim 41, comprising accepting, at (c) (1), the substitution at the one or more randomly selected locations when the fitness of the sequence after the substitution is introduced is greater than the fitness of the sequence before the substitution is introduced.
43. The method of claim 41 or 42, comprising, at (c) (1), accepting or rejecting the substitution at the one or more randomly selected locations based on a probability weighted by a ratio of the fitness of the sequence after the substitution is introduced to the fitness of the sequence before the substitution is introduced.
44. The method of any one of claims 41-43, comprising, at (c) (1), accepting or rejecting the substitution at the one or more randomly selected locations based on Boltzmann Metropolis-hastins acceptance criterion r mh.
45. The method of any of claims 41-44, wherein the one or more randomly selected locations are selected uniformly across the fixed length.
46. The method of any one of claims 41-45, comprising, at (c) (2), accepting an exchange of the selected domain between the selected pair when the fitness of the sequence associated with a lower of the consecutive temperatures after exchange is greater than the fitness of the sequence associated with a higher of the consecutive temperatures after exchange, optionally wherein the selected domain comprises the complete sequence of the selected pair.
47. The method of any one of claims 41-46, comprising, at (c) (2), accepting an exchange of the selected domain between the selected pairs based on a probability inversely proportional to a difference between the temperatures associated with each sequence of the pairs, and a ratio of the fitness of the sequence pairs after exchange, optionally wherein the selected domain comprises the complete sequence of the selected pair.
48. The method of any one of claims 41-47, comprising, at (c) (2), accepting or rejecting an exchange of the selected domain between the pair based on parallel tempering criterion r re, optionally wherein the selected domain comprises the complete sequence of the selected pair.
49. The method of any one of claims 41-48, wherein (c) comprises:
(3) Selecting a crossover site between one or more pairs of said plurality of different sequences, and
For each of the one or more pairs in which the crossover site is selected, a second fitness change for each sequence of the selected pair resulting from crossover at the crossover site is evaluated, and crossover at the selected crossover site is accepted or rejected based on the second fitness change and the temperature associated with each sequence of the selected pair.
50. The method of claim 49, comprising accepting or rejecting crossover at the selected crossover site based on a probability weighted by a ratio of the second fitness change for each of the sequences of the selected pair.
51. The method of any of claims 41-50, wherein at least one of the generated one or more functional sequences has a fitness based on the fitness function that is greater than a fitness of each of the plurality of different sequences prior to any iteration of (c).
52. The method of any of claims 41-51, wherein the desired fitness threshold is based on fitness associated with a corresponding sequence of the plurality of different sequences in (b), optionally wherein the desired fitness threshold is based on a maximum fitness among the plurality of different sequences.
53. The method of any one of claims 41-52, wherein the plurality of different sequences in (b) comprises a plurality of different, naturally occurring sequences.
54. A computer-implemented method of generating a functional biological sequence, the method comprising:
(a) Evaluating, by a computer, sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functionally determined biological sequences having the fixed length;
(b) Replacing, by the computer, one or more random residues in the sequence to generate a mutated sequence;
(c) Evaluating, by the computer, the mutant sequence based on the fitness function, and
(D) The functional sequences accepted by the fitness function are collected by the computer.
55. The computer-implemented method of claim 54, further comprising randomly exchanging, by a computer, one or more subsequences from the mutant sequence with different sequences of the plurality of sequences.
56. The computer-implemented method of claim 54 or 55, wherein the fitness function comprises a threshold selected from the group consisting of a binary threshold, a numerical threshold, a multi-class threshold, a confidence threshold, a decision threshold, and any combination thereof.
57. The computer-implemented method of claim 56, wherein a functional sequence is accepted by the fitness function when a fitness score assigned to the functional sequence by the fitness function exceeds the threshold.
58. The computer-implemented method of any one of claims 54-57, wherein the plurality of different sequences in (a) comprises a plurality of different naturally occurring sequences or different random sequences.
59. The computer-implemented method of any one of claims 41-58, wherein the functionally defined biological sequence comprises an amino acid sequence or nucleotide sequence of a protein or peptide, or an amino acid sequence or nucleotide sequence encoding a protein or peptide, optionally wherein the functionally defined biological sequence comprises a transcriptional activator, further optionally wherein the functionally defined biological sequence comprises an engineered gene effector.
60. The computer-implemented method of claim 59, wherein said protein or said peptide is an epigenetic regulator, a transcription factor, an enzyme, a nuclease, an agonist, an antagonist, a regulatory factor, or an inhibitor.
61. The computer-implemented method of any one of claims 41-60, wherein the functionally determined biological sequence comprises an amino acid sequence or a nucleotide sequence.
62. The computer-implemented method of any one of claims 41-61, wherein the functionally defined biological sequence comprises an amino acid sequence, and further wherein the fixed length is at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 105, at least 110, at least 115, at least 120, at least 125, at least 130, at least 140, at least 150, or at least 200 amino acids, or at most 500, at most 300, at most 200, at most 150, at most 120, at most 100, at most 95, at most 90, at most 85, at most 75, or at most 70 amino acids, or optionally wherein the fixed length is within a range defined by any two of the foregoing values, e.g., 30-500 amino acids, 50-300 amino acids, at least 150-200 amino acids, 75-150 amino acids, or the like.
63. The computer-implemented method of any of claims 41-62, wherein the fitness function is based on one or more machine learning models, wherein the machine learning model is selected from the group consisting of a supervised machine learning model, an unsupervised machine learning model, a reinforcement learning model, a deep learning model, a transfer learning model, and any combination thereof.
64. The computer-implemented method of claim 63, wherein the one or more machine learning models are selected from the group consisting of a classification model, a regression model, a decision tree model, a Convolutional Neural Network (CNN), a Recurrent Neural Network (RNN), a limiting gradient boost (XGBoost), a long and short term memory network, a Generative Antagonism Network (GAN), an automatic encoder, a transformer network, an evolutionary Monte Carlo, and any combination thereof.
65. The computer-implemented method of claim 64, wherein the fitness function is based on an integrated model comprising a decision tree model and a convolutional neural network, optionally wherein the integrated model comprises CNNs and XGBoost.
66. The computer-implemented method of any of the preceding claims, the method comprising evaluating, by the computer, the biological data set to generate the fitness function, the evaluating comprising:
Generating a sequence embedding from a Large Protein Language Model (LPLM) based on the biological dataset, and
Training a machine learning model with the generated sequence embedding as input, optionally wherein the LPLM comprises an Evolutionary Scale Modeling (ESM) language model, optionally wherein the LPLM comprises ESM-2.
67. The computer-implemented method of claim 66, wherein the machine learning model comprises an integrated model of more than two different models.
68. The computer-implemented method of claim 67, wherein said integrated model comprises CNNs and XGBoost.
69. The computer-implemented method of any of the preceding claims, wherein the plurality of different sequences comprises a plurality of different random sequences.
70. The computer-implemented method of any of the preceding claims, wherein the biological dataset comprises up to about 10 5 biological sequences or biological sequences on the order of about 10 5 biological sequences.
71. The computer-implemented method of any one of the preceding claims, wherein up to 5% of the functionally determined biological sequences comprise functional sequences.
72. A computer-implemented system comprising a computing device comprising at least one processor and instructions executed by the at least one processor to provide an application, the application comprising:
(a) A software module configured to evaluate, by a computer, a sequence of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological data set, the biological data set comprising a functionally determined biological sequence having the fixed length;
(b) A software module configured to replace, by the computer, one or more random residues in the sequence to generate a mutated sequence;
(c) A software module configured to evaluate, by the computer, the mutant sequence based on the fitness function, and
(D) A software module configured to collect, by the computer, a sequence of functionalities accepted by the fitness function.
73. A non-transitory computer readable medium having stored thereon computer readable instructions that, when executed by a processor, cause the processor to perform a method comprising:
(a) Evaluating sequences of a plurality of different sequences comprising a fixed length based on an fitness function trained on a biological dataset comprising functionally determined biological sequences having the fixed length;
(b) Replacing one or more random residues in the sequence to generate a mutated sequence;
(c) Evaluating the mutant sequence based on the fitness function, and
(D) Functional sequences accepted by the fitness function are collected.
74. A computer-implemented system comprising a computing device comprising at least one processor and instructions executed by the at least one processor to provide an application comprising one or more software modules for performing the method of any of claims 41-71.
75. A non-transitory computer readable medium having stored thereon computer readable instructions which, when executed by a processor, cause the processor to perform the method of any of claims 41-71.
76. An engineered gene effector comprising one or more polypeptides produced by the method of any one of claims 41-71, or a sequence at least 85% identical thereto.
77. An engineered gene effector comprising a polypeptide of 85 amino acids in length comprising any one of SEQ ID NOs 1495, 1592, 1595, 1634, 1654, 1665, 1677, 1686, 1689, 1716, or a sequence at least 85% identical thereto.
78. The engineered gene effector of claim 76 or 77, wherein the engineered gene effector is capable of activating a target gene in a cell when the engineered gene effector is expressed in the cell and is effective to target a site of the target gene, optionally wherein the target gene is endogenous to the cell.
79. The engineered gene effector of any one of claims 76-78, wherein the polypeptide is coupled to a heterologous endonuclease, optionally wherein the heterologous endonuclease is a Cas protein.
80. The engineered gene effector of claim 79, wherein the heterologous endonuclease comprises the amino acid sequence of any of SEQ ID NOs 2222-2422, or a sequence at least 85% identical thereto.
81. The engineered gene effector of claim 79 or 80, wherein the polypeptide is fused to the heterologous endonuclease, optionally wherein the polypeptide is fused to the C-terminus of the heterologous endonuclease.
82. A fusion protein, the fusion protein comprising:
the engineered gene effector of any one of claims 76-81, and
The heterologous endonuclease coupled to the polypeptide, optionally, wherein the heterologous endonuclease is a Cas protein.
83. A polynucleotide comprising a nucleotide sequence encoding the engineered gene effector of any one of claims 76-81 or the fusion protein.
84. A vector comprising the polynucleotide of claim 83.
85. A cell comprising the polynucleotide of claim 83 or the vector of claim 84.
86. A system, the system comprising:
the engineered gene effector of any one of claims 76-81;
Optionally, a heterologous endonuclease coupled to the engineered gene effector, wherein the heterologous endonuclease is a Cas protein, and
A guide nucleic acid capable of forming a complex with the heterologous endonuclease, wherein the complex exhibits specific binding to a target gene in a cell when the system is expressed in the cell.
87. A combination of polynucleotides encoding the system of claim 86, wherein the combination of polynucleotides is configured to express the heterologous endonuclease coupled to the engineered gene effector and the guide nucleic acid in a cell.
88. A method of controlling a target gene in a cell, the method comprising contacting the cell with the system of claim 86 or the combination of polynucleotides of claim 87.
89. The method of claim 88, wherein the target gene is endogenous to the cell.
90. The method of claim 88 or 89, wherein the contacting is performed in vitro or ex vivo.
91. A computer-implemented system comprising a computing device comprising at least one processor and instructions executed by the at least one processor to provide an application, the application comprising:
(a) A software module configured to provide, by a computer, an fitness function trained on a biological data set, the biological data set comprising a functionally determined biological sequence having a fixed length;
(b) A software module configured to provide, by the computer, a plurality of different sequences comprising a fixed length, each sequence associated with a temperature and a fitness based on the fitness function, wherein each sequence is associated with a different temperature of a temperature gradient;
(c) Parallel in the plurality of different sequences:
(1) A software module configured to select, by the computer, one or more random positions for introducing substitutions in one or more of the plurality of different sequences, optionally 1-5 random positions, optionally 1 random position, and
For each of the one or more sequences, evaluating a first fitness change resulting from the introduction of the substitution at the one or more randomly selected locations and accepting or rejecting the substitution based on the evaluated first fitness change and optionally further based on the temperature associated with the sequence, and/or
(2) A software module configured to select, by the computer, one or more pairs of the plurality of different sequences, each selected pair comprising a sequence associated with a successive temperature of the temperature gradient, optionally at most 3 pairs of the plurality of different sequences, optionally 1 pair of the plurality of different sequences, and
For each of the selected pairs:
Selecting one or more domains for exchange between the sequences of the selected pair, and
Evaluating the difference in fitness of the sequences of the selected pair due to the exchange of the one or more domains, and
Accepting or rejecting the one or more domain exchanges between the selected pairs based on the fitness differences and the temperatures associated with each sequence of the selected pairs, and
(D) A software module configured to be executed iteratively by the computer (c), wherein in each subsequent iteration, accepted substitutions of a previous iteration and/or accepted domains of a previous iteration are swapped into the plurality of different sequences, thereby producing one or more functional sequences having fitness that meets or is above a desired fitness threshold.
92. An engineered gene effector comprising a polypeptide comprising any one of SEQ ID NO: 115-274、276-287、289-367、370-445、448-746、748-777、779-929、931-1007、1009-1156、1158-1194、1196-1288、1290-1350、1352-1451, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally wherein any mutation thereof is a conservative substitution.
93. An engineered gene effector comprising a polypeptide comprising any one of SEQ ID NOs 1085, 122, 1084, 653, 1099 and 1107, or a sequence at least 85% identical thereto, or a sequence having a mutation of 0-3 amino acid residues, optionally, wherein any mutation thereof is a conservative substitution.
CN202480047487.0A 2023-05-17 2024-05-15 Systems and methods for regulating target genes Pending CN121532411A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US63/502,891 2023-05-17
US63/504,661 2023-05-26
US63/504,663 2023-05-26
US63/504,660 2023-05-26
US63/520,251 2023-08-17

Publications (1)

Publication Number Publication Date
CN121532411A true CN121532411A (en) 2026-02-13

Family

ID=

Similar Documents

Publication Publication Date Title
US20220025369A1 (en) Rna encoding a therapeutic protein
US20240165239A1 (en) Covalent Binding Compounds for the Treatment of Disease
US12281301B2 (en) Sequencing-based proteomics
US20220401460A1 (en) Modulating resistance to bcl-2 inhibitors
US20200399714A1 (en) Cancer-related biological materials in microvesicles
US20230193205A1 (en) Gene modified fibroblasts for therapeutic applications
CN110499364A (en) A kind of probe groups and its kit and application for detecting the full exon of extended pattern hereditary disease
US12160984B2 (en) Radio wave absorber
CN117730164A (en) Method for managing cell quality and method for producing cell
US12054756B2 (en) Engineered nucleases, compositions, and methods of use thereof
JP7794719B2 (en) Cell quality control method and cell manufacturing method
TW202307215A (en) Cell quality management method and cell production method
CN117677707A (en) Quality control method for specific cells and method for producing specific cells
US20240233867A1 (en) Quality management method for specific cell and method of producing specific cell
WO2019008414A1 (en) Exosome based gene expression analysis for cancer management
WO2019008415A1 (en) Exosome and pbmc based gene expression analysis for cancer management
US20250388634A1 (en) Engineered gene effectors, compositions, and methods of use thereof
US20250305061A1 (en) Methods and systems for inferring gene expression using cell-free dna fragments
CN121532411A (en) Systems and methods for regulating target genes
Schwarzer et al. Supplementary Data from Targeting Aggressive B-cell Lymphomas through Pharmacological Activation of the Mitochondrial Protease OMA1 [Dataset]
HK40109321A (en) Method for managing quality of specific cells, and method for manufacturing specific cells
HK40109609A (en) Cell quality management method and cell production method

Legal Events

Date Code Title Description
PB01 Publication