AU2014236763B2 - Thioesterases and cells for production of tailored oils - Google Patents
Thioesterases and cells for production of tailored oils Download PDFInfo
- Publication number
- AU2014236763B2 AU2014236763B2 AU2014236763A AU2014236763A AU2014236763B2 AU 2014236763 B2 AU2014236763 B2 AU 2014236763B2 AU 2014236763 A AU2014236763 A AU 2014236763A AU 2014236763 A AU2014236763 A AU 2014236763A AU 2014236763 B2 AU2014236763 B2 AU 2014236763B2
- Authority
- AU
- Australia
- Prior art keywords
- oil
- seq
- amino acid
- cuphea
- nucleic acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8247—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/02—Thioester hydrolases (3.1.2)
- C12Y301/02014—Oleoyl-[acyl-carrier-protein] hydrolase (3.1.2.14), i.e. ACP-thioesterase
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E50/00—Technologies for the production of fuel of non-fossil origin
- Y02E50/10—Biofuels, e.g. bio-diesel
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Cell Biology (AREA)
- Nutrition Science (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Fats And Perfumes (AREA)
- Edible Oils And Fats (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The invention features plant acyl-ACP thioesterase genes of the FatB class and proteins encoded by these genes. The genes are useful for constructing recombinant host cells having altered fatty acid profiles. Oleaginous microalga host cells with the new genes or previously identified FatB genes are disclosed. The microalgae cells produce triglycerides with useful fatty acid profiles.
Description
THIOESTERASES AND CELLS FOR PRODUCTION OF TAILORED OILS Cross Reference to related Applications [0001] This application is a Continuation-in-part of United States Patent 5 Application No. 13/837,996, filed March 15, 2013, and claims the benefit of United States Provisional Patent Application Serial No. 61/791,861, filed March 15, 2013, and United States Provisional Patent Application Serial No. 61/917,217, filed December 17, 2013, each of which is hereby incorporated by reference herein in its entirety. 10 Background [0002] Certain organisms including plants and some microalgae use a type II fatty acid biosynthetic pathway, characterized by the use of discrete, monofunctional enzymes for fatty acid synthesis. In contrast, mammals and fungi use a single, large, multifunctional protein. 15 [0003] Type II fatty acid biosynthesis typically involves extension of a growing acyl-ACP (acyl-carrier protein) chain by two carbon units followed by cleavage by an acyl-ACP thioesterase. In plants, two main classes of acyl-ACP thioesterases have been identified: (i) those encoded by genes of the FatA class, which tend to hydrolyze oleoyl-ACP into oleate (an 18:1 fatty acid) and ACP, and (ii) 20 those encoded by genes of the FatB class, which liberate C8-C16 fatty acids from corresponding acyl-ACP molecules.
[0004] Different FatB genes from various plants have specificities for different acyl chain lengths. As a result, different gene products will produce different fatty acid profiles in plant seeds. See, US Patent Nos. 5,850,022; 5,723,761; 25 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482;5,298,421;5,667,997; and 5,344,771; 5,304,481. Recently, FatB genes have been cloned into oleaginous microalgae to produce triglycerides with altered fatty acid profiles. See, WO2010/063032, WO2011/150411, W02012/106560, and WO2013/158938.
Summary [0005] In various aspects, the invention(s) contemplated herein may include, but need not be limited to, any one or more of the following embodiments: [0006] Embodiment 1: A nucleic acid construct including a regulatory 5 element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 5 of Table la, the sequence having at least 94.6% sequence identity with each of SEQ ID NOs: 88, 82, 85, and 103, and optionally 10 wherein the fatty acid of the oil is enriched in C8 and CIO fatty acids.
[0007] Embodiment 2: A nucleic acid construct including a regulatory element and a FatB gene expressing an active acyl-ACP thioesterase operable to produce an altered fatty acid profile in an oil produced by a cell expressing the nucleic acid construct, wherein the FatB gene expresses a protein having an amino acid 15 sequence falling within one of clades 1-12 of Table la.
[0008] Embodiment 3: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 1 of Table la, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 19, 161, 22, and 160, and optionally wherein the fatty acid of the oil is 20 enriched in C14 and C16 fatty acids.
[0009] Embodiment 4: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 2 of Table la, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 134-136, 132, 133, 137, 124, 122, 123, 125, and optionally wherein the fatty 25 acid of the oil is enriched in C12 and C14 fatty acids.
[0010] Embodiment 5: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 3 of Table la, the sequence having at least 92.5% sequence identity with each of SEQ ID NOs: 126 and 127, and optionally wherein the fatty acid of the oil is enriched in 30 C12 and C14 fatty acids.
[0011] Embodiment 6: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 4 of Table la, the sequence having at least 83.8% sequence identity with SEQ ID NO: 79, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0012] Embodiment 7: The nucleic acid construct of embodiment 2, wherein 5 the FatB gene expresses a protein having an amino acid sequence falling within clade 6 of Table la, the sequence having at least 99.9% sequence identity with each of SEQ ID NOs: 111 and 110, and optionally wherein the fatty acid of the oil is enriched in CIO fatty acids.
[0013] Embodiment 8: The nucleic acid construct of embodiment 2, wherein 10 the FatB gene expresses a protein having an amino acid sequence falling within clade 7 of Table la, the sequence having at least 89.5% sequence identity with each of SEQ ID NOs: 73, 106, 185, 172, 171, 173, 174, and optionally wherein the fatty acid of the oil is enriched in CIO and C12 fatty acids.
[0014] Embodiment 9: The nucleic acid construct of embodiment 2, wherein 15 the FatB gene expresses a protein having an amino acid sequence falling within clade 8 of Table la, the sequence having at least 85.9% sequence identity with each of SEQ ID NOs: 112, 113, 142, 145, 143, 144, 139, 140, 138, 141, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0015] Embodiment 10: The nucleic acid construct of embodiment 2, 20 wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 9 of Table la, the sequence having at least 83.8% sequence identity with each of SEQ ID NOs: 187-189, and optionally wherein the fatty acid of the oil is enriched in C12 and C14 fatty acids.
[0016] Embodiment 11: The nucleic acid construct of embodiment 2, 25 wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 10 of Table la, the sequence having at least 95.9% sequence identity with each of SEQ ID NOs: 147, 149, 146, 150, 152, 151, 148, 154, 156, 155, 157, 108, 75, 190, 191, and 192, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids. 30 [0017] Embodiment 12: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 11 of Table la, the sequence having at least 88.7% sequence identity with SEQ ID NO: 121, and optionally wherein the fatty acid of the oil is enriched in C14 and C16 fatty acids.
[0018] Embodiment 13: The nucleic acid construct of embodiment 2, wherein the FatB gene expresses a protein having an amino acid sequence falling within clade 12 of Table 5 1 a, the sequence having at least 72.8% sequence identity with each of SEQ ID NOs: 129 and 186, and optionally wherein the fatty acid of the oil is enriched in C16 fatty acids.
[0019] Embodiment 14: An isolated nucleic acid or recombinant DNA construct including a nucleic acid, wherein the nucleic acid has at least 80% sequence identity to any of SEQ ID NOS: 2,3, 5, 6, 8, 9,11,12,14,15, 17,18,20,21,23, 24,26, 27, 29, 30,32, 33, 35, 10 36,38,39,41,42,44,45,47,48,50,51, 53, 54,56,57,59,60,62,63,65, 66,68,69, 71,72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99,101, 102,104,105, 107,109 or any equivalent sequences by virtue of the degeneracy of the genetic code.
[0020] Embodiment 15: An isolated nucleic acid sequence encoding a protein or a host cell expressing a protein having at least 80% sequence identity to any of SEQ ID NOS: 1, 15 4, 7,10,13,16,19,22,25,28,31, 34,37,40,43,46,49,52,55,58,61,64,67,70,73,75,77,
79, 82, 85, 88,91,94,97,100,103,106,108,110-192 or a fragment thereof having acyl-ACP thioesterase activity.
[0021] Embodiment 16: The isolated nucleic acid of embodiment 15, wherein, the protein has acyl-ACP thioesterase activity operable to alter the fatty acid profile of an oil 20 produced by a recombinant cell including that sequence.
[0021a] Embodiment 17: A recombinant nucleic acid encoding an acyl-ACP thioesterase, wherein the nucleic acid has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO 93, and wherein the nucleic acid encodes an acyl-ACP thioesterase having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179, or 25 a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof.
[0021b] Embodiment 18: A recombinant nucleic acid encoding an acyl-ACP thioesterase, wherein the nucleic acid has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to any of SEQ ID NO: 90, and wherein the nucleic acid encodes 30 an acyl-ACP thioesterase of any one of SEQ ID NOs: 88, 170 or 178, or a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof.
[0021c] Embodiment 19: A recombinant nucleic acid encoding an acyl-ACP thioesterase, wherein the nucleic acid has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 5 or 99% sequence identity to any of SEQ ID NO: 84, and wherein the nucleic acid encodes an acyl-ACP thioesterase having at least 95% sequence identity to SEQ ID NO:82, or a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof.
[0021d] Embodiment 20: A vector comprising the recombinant nucleic acid of any 10 one of embodiments 17 to 19.
[0022] Embodiment 21: A method of producing a recombinant cell that produces an altered fatty acid profile, the method including transforming the cell with a nucleic acid according to any of embodiments 1-3, or 17-19.
[0023] Embodiment 22: A host cell produced by the method of embodiment 21. 15 [0024] Embodiment 23: The host cell of embodiment 22, wherein the host cell is selected from a plant cell, a microbial cell, and a microalgal cell.
[0024a] Embodiment 24: A method of producing a recombinant acyl-ACP thioesterase, the method comprising transforming a Prototheca microalgal cell with a nucleic acid according to any one of embodiments 17-19.
20 [0024b] Embodiment 25: A method of producing a recombinant acyl-ACP thioesterase, the method comprising transforming a Prototheca microalgal cell with a nucleic acid encoding an acyl-ACP thioesterase: a) having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179; b) of any one of SEQ ID NOs:88, 170 or 178; or 25 c) having at least 95% sequence identity to SEQ ID NO:82; or d) a C-terminal and/or N-terminal truncated fragment of a)-c) comprising at least 90% of the full-length sequences thereof.
[0024c] Embodiment 26: A Prototheca microalgal host cell comprising a recombinant nucleic acid of any one of embodiments 17-19, wherein the recombinant 30 nucleic acid comprises an exogenous nucleic acid, and wherein the host cell has a fatty acid profile having increased C8 and/or CIO fatty acids. 10024(11 Embodiment 27: A Prototheca microalgal host cell comprising a recombinant nucleic acid encoding an acyl-ACP thioesterase: a) having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179; b) of any one of SEQ ID NOs:88, 170 or 178; or 5 c) having at least 95% sequence identity to SEQ ID NO:82; or d) a C-terminal and/or N-terminal truncated fragment of a)-c) comprising at least 90% of the full-length sequences thereof, and wherein the host cell has a fatty acid profile having increased C8 and/or CIO fatty acids.
[0025] Embodiment 28: A method for producing an oil or oil-derived product, the 10 method including cultivating a host cell of embodiment 21, 22 or 26, and extracting oil produced thereby, optionally wherein the cultivation is heterotrophic growth on sugar.
[0026] Embodiment 29: The method of embodiment 28, further including producing a fatty acid, fuel, chemical, or other oil-derived product from the oil.
[0027] Embodiment 30: An oil produced by the method of embodiment 28, 15 optionally having a fatty acid profile including at least about 8% C8 and/or at least about 9% CIO fatty acids.
[0028] Embodiment 31: An oil-derived product produced by the method of embodiment 29.
[0029] Embodiment 32: The oil-derived product of embodiment 31, wherein the 20 oil is produced by a microalgae and optionally, lacks C24-alpha sterols.
[0029a] Embodiment 33: The oil-derived product of embodiment 32, wherein the oil comprises sterols with C24f:S stereochemistry.
[0029b] Embodiment 34: The oil-derived product of embodiment 32 or 33, wherein the oil comprises one or more of ergosterol, brassicasterol, 22, 23-dihydrobrassicasterol, 25 clionasterol and poriferasterol.
[0029c] Embodiment 35: The oil-derived product of any one of embodiments 32-34, wherein the oil comprises one or more of ergosterol, brassicasterol, 22, 23-dihydrobrassicasterol, clionasterol and poriferasterol.
[0029d] The reference in this specification to any prior publication (or information 30 derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that that prior publication (or information derived from it) or known matter forms part of the common general knowledge in the field of endeavour to which this specification relates.
[0029e] Throughout this specification and the claims which follow, unless the context requires otherwise, the word “comprise”, and variations such as “comprises” and 5 “comprising”, will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
Description of Illustrative Embodiments of the Invention 10 Definitions [0030] As used with respect to nucleic acids, the term “isolated” refers to a nucleic acid that is free of at least one other component that is typically present with the naturally occurring nucleic acid. Thus, a naturally occurring nucleic acid is isolated if it has been purified away from at least one other component that occurs naturally with the nucleic 15 acid.
[0031] A “natural oil” or “natural fat” shall mean a predominantly triglyceride oil obtained from an organism, where the oil has not undergone blending with another natural or synthetic oil, or fractionation so as to substantially alter the fatty acid profile of the triglyceride. In connection with an oil comprising triglycerides of a particular 20 regiospecificity, the natural oil or natural fat has not been subjected to interesterification or other synthetic process to obtain that regiospecific triglyceride profile, rather the regiospecificity is produced naturally, by a cell or population of cells. In connection with a natural oil or natural fat, and as used generally throughout the present disclosure, the terms oil and fat are used interchangeably, except where otherwise noted. Thus, an “oil” 25 or a “fat” can be liquid, solid, or partially solid at room temperature, depending on the makeup of the substance and other conditions. Here, the term “fractionation” means removing material from the oil in a way that changes its fatty acid profile relative to the profile produced by the organism, however accomplished. The terms “natural oil” and “natural fat” encompass such oils obtained from an organism, where the oil has undergone minimal processing, including refining, bleaching and/or degumming, which does not substantially change its 5 triglyceride profile. A natural oil can also be a “noninteresterified natural oil”, which means that the natural oil has not undergone a process in which fatty acids have been redistributed in their acyl linkages to glycerol and remain essentially in the same configuration as when recovered from the organism.
[0032] “Exogenous gene” shall mean a nucleic acid that codes for the 10 expression of an RNA and/or protein that has been introduced into a cell (e.g. by transformation/transfection), and is also referred to as a “transgene”. A cell comprising an exogenous gene may be referred to as a recombinant cell, into which additional exogenous gene(s) may be introduced. The exogenous gene may be from a different species (and so heterologous), or from the same species (and so 15 homologous), relative to the cell being transformed. Thus, an exogenous gene can include a homologous gene that occupies a different location in the genome of the cell or is under different control, relative to the endogenous copy of the gene. An exogenous gene may be present in more than one copy in the cell. An exogenous gene may be maintained in a cell, for example, as an insertion into the genome 20 (nuclear or plastid) or as an episomal molecule.
[0033] “Fatty acids” shall mean free fatty acids, fatty acid salts, or fatty acyl moieties in a glycerolipid. It will be understood that fatty acyl groups of glycerolipids can be described in terms of the carboxylic acid or anion of a carboxylic acid that is produced when the triglyceride is hydrolyzed or saponified. 25 [0034] “Microalgae” are microbial organisms that contain a chloroplast or other plastid, and optionally that are capable of performing photosynthesis, or a prokaryotic microbial organism capable of performing photosynthesis. Microalgae include obligate photoautotrophs, which cannot metabolize a fixed carbon source as energy, as well as heterotrophs, which can live solely off of a fixed carbon source. 30 Microalgae include unicellular organisms that separate from sister cells shortly after cell division, such as Chlamydomonas, as well as microbes such as, for example, Volvox, which is a simple multicellular photosynthetic microbe of two distinct cell types. Microalgae include cells such as Chlorella, Dunaliella, and Prototheca.
Microalgae also include other microbial photosynthetic organisms that exhibit cellcell adhesion, such as Agmenellum, Anabaena, and Pyrobotrys. Microalgae also include obligate heterotrophic microorganisms that have lost the ability to perform photosynthesis, such as certain dinoflagellate algae species and species of the genus 5 Prototheca.
[0035] An “oleaginous” cell is a cell capable of producing at least 20% lipid by dry cell weight, naturally or through recombinant or classical strain improvement. An “oleaginous microbe” or “oleaginous microorganism” is a microbe, including a microalga that is oleaginous. 10 [0036] The term “percent sequence identity,” in the context of two or more amino acid or nucleic acid sequences, refers to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using a sequence comparison algorithm or by visual 15 inspection. For sequence comparison to determine percent nucleotide or amino acid identity, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence 20 comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters. Optimal alignment of sequences for comparison can be conducted using the NCBI BLAST software (ncbi.nlm.nih.gov/BLAST/) set to default parameters. For example, to compare two nucleic acid sequences, one may use blastn with the 25 "BLAST 2 Sequences" tool Version 2.0.12 (Apr. 21, 2000) set at the following default parameters: Matrix: BLOSUM62; Reward for match: 1; Penalty for mismatch: -2; Open Gap: 5 and Extension Gap: 2 penalties; Gap x drop-off: 50; Expect: 10; Word Size: 11; Filter: on. For a pairwise comparison of two amino acid sequences, one may use the “BLAST 2 Sequences” tool Version 2.0.12 (Apr. 21, 30 2000) with blastp set, for example, at the following default parameters: Matrix: BLOSUM62; Open Gap: 11 and Extension Gap: 1 penalties; Gap x drop-off 50; Expect: 10; Word Size: 3; Filter: on.
[0037] In connection with a natural oil, a “profile” is the distribution of particular species or triglycerides or fatty acyl groups within the oil. A “fatty acid profile” is the distribution of fatty acyl groups in the triglycerides of the oil without reference to attachment to a glycerol backbone. Fatty acid profiles are typically 5 determined by conversion to a fatty acid methyl ester (FAME), followed by gas chromatography (GC) analysis with flame ionization detection (FID). The fatty acid profile can be expressed as one or more percent of a fatty acid in the total fatty acid signal determined from the area under the curve for that fatty acid. FAME-GC-FID measurement approximate weight percentages of the fatty acids. 10 [0038] As used herein, an oil is said to be “enriched” in one or more particular fatty acids if there is at least a 10% increase in the mass of that fatty acid in the oil relative to the non-enriched oil. For example, in the case of a cell expressing a heterologous FatB gene described herein, the oil produced by the cell is said to be enriched in, e.g., C8 and C16 fatty acids if the mass of these fatty acids in the oil is at 15 least 10% greater than in oil produced by a cell of the same type that does not express the heterologous FatB gene (e.g., wild type oil).
[0039] ‘Recombinant” is a cell, nucleic acid, protein or vector that has been modified due to the introduction of an exogenous nucleic acid or the alteration of a native nucleic acid. Thus, e.g., recombinant (host) cells can express genes that are not 20 found within the native (non-recombinant) form of the cell or express native genes differently than those genes are expressed by a non-recombinant cell. Recombinant cells can, without limitation, include recombinant nucleic acids that encode a gene product or suppression elements such as mutations, knockouts, antisense, interfering RNA (RNAi) or dsRNA that reduce the levels of active gene product in a cell. A 25 “recombinant nucleic acid” is a nucleic acid originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using polymerases, ligases, exonucleases, and endonucleases, using chemical synthesis, or otherwise is in a form not normally found in nature. Recombinant nucleic acids may be produced, for example, to place two or more nucleic acids in operable linkage. Thus, an isolated nucleic acid or an 30 expression vector formed in vitro by nucleic by ligating DNA molecules that are not normally joined in nature, are both considered recombinant for the purposes of this invention. Recombinant nucleic acids can also be produced in other ways; e.g., using chemical DNA synthesis. Once a recombinant nucleic acid is made and introduced into a host cell or organism, it may replicate using the in vivo cellular machinery of the host cell; however, such nucleic acids, once produced recombinantly, although subsequently replicated intracellularly, are still considered recombinant for purposes of this invention. Similarly, a “recombinant protein” is a protein made using 5 recombinant techniques, i.e., through the expression of a recombinant nucleic acid.
[0040] Embodiments of the present invention relate to the use of FatB genes isolated from plants, which can be expressed in a host cell in order to alter the fatty acid profile of an oil produced by the recombinant cell. Although the microalga, Prototheca moriformis, was used to screen the genes for ability to the alter fatty acid 10 profile, the genes are useful in a wide variety of host cells. For example, the genes can be expressed in bacteria, other microalgae, or higher plants. The genes can be expressed in higher plants according to the methods of US Patent Nos. 5,850,022; 5,723,761; 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482;5,298,421;5,667,997; 5,344,771; and 5,304,481. The fatty acids can be 15 further converted to triglycerides, fatty aldehydes, fatty alcohols and other oleochemicals either synthetically or biosynthetically.
[0041] In specific embodiments, triglycerides are produced by a host cell expressing a novel FatB gene. A triglyceride-containing natural oil can be recovered from the host cell. The natural oil can be refined, degummed, bleached and/or 20 deodorized. The oil, in its natural or processed form, can be used for foods, chemicals, fuels, cosmetics, plastics, and other uses. In other embodiments, the FatB gene may not be novel, but the expression of the gene in a microalga is novel.
[0042] The genes can be used in a variety of genetic constructs including plasmids or other vectors for expression or recombination in a host cell. The genes 25 can be codon optimized for expression in a target host cell. The proteins produced by the genes can be used in vivo or in purified form.
[0043] For example, the gene can be prepared in an expression vector comprising an operably linked promoter and 5’UTR. Where a plastidic cell is used as the host, a suitably active plastid targeting peptide can be fused to the FATB gene, 30 as in the examples below. Generally, for the newly identified FATB genes, there are roughly 50 amino acids at the N-terminal that constitute a plastid transit peptide, which are responsible for transporting the enzyme to the chloroplast. In the examples below, this transit peptide is replaced with a 38 amino acid sequence that is effective in the Prototheca moriformis host cell for transporting the enzyme to the plastids of those cells. Thus, the invention contemplates deletions and fusion proteins in order to optimize enzyme activity in a given host cell. For example, a transit peptide from the host or related species may be used instead of that of the newly discovered plant 5 genes described here.
[0044] A selectable marker gene may be included in the vector to assist in isolating a transformed cell. Examples of selectable markers useful in microlagae include sucrose invertase and antibiotic resistance genes.
[0045] The gene sequences disclosed can also be used to prepare antisense, or 10 inhibitory RNA (e.g., RNAi or hairpin RNA) to inhibit complementary genes in a plant or other organism.
[0046] FatB genes found to be useful in producing desired fatty acid profiles in a cell are summarized below in Table 1. Nucleic acids or proteins having the sequence of SEQ ID NOS: 1-109 can be used to alter the fatty acid profile of a 15 recombinant cell. Variant nucleic acids can also be used; e.g., variants having at least 70, 80, 85, 90, 95, 96, 97, 98, or 99% sequence identity to SEQ ID NOS: 2, 3, 5, 6, 8, 9, 11, 12, 14, 15, 17, 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107 or 109. 20 Codon optimization of the genes for a variety of host organisms is contemplated, as is the use of gene fragments. Preferred codons for Prototheca strains and for Chlorella protothecoides are shown below in Tables 2 and 3, respectively. Codon usage for Cuphea wrightii is shown in Table 3a. Codon usage for Arabidopsis is shown in Table 3b; for example, the most preferred of codon for each amino acid can be 25 selected. Codon tables for other organisms including microalgae and higher plants are known in the art. In some embodiments, the first and/or second most preferred Prototheca codons are employed for codon optimization. In specific embodiments, the novel amino acid sequences contained in the sequence listings below are converted into nucleic acid sequences according to the most preferred codon usage in 30 Prototheca, Chlorella, Cuphea wrightii, or Arabidopsis as set forth in tables 2 through 3b or nucleic acid sequences having at least 70, 80, 85, 90, 95, 96, 97, 98, or 99% sequence identity to these derived nucleic acid sequences.
[0047] In embodiments of the invention, there is protein or a nucleic acid encoding a protein having any of SEQ ID NOS: 1,4, 7, 10, 13, 16, 19,22,25,28,31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, or 110-192. In an embodiment, there is protein or a nucleic acid 5 encoding a protein having at least 70, 75, 80, 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% sequence identity with any of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 103, 106, 108, or 110-192. In certain embodiments, the invention encompasses a fragment any of the above-described proteins or nucleic acids 10 (including fragments of protein or nucleic acid variants), wherein the protein fragment has acyl-ACP thioesterase activity or the nucleic acid fragment encodes such a protein fragment. In other embodiments, the fragment includes a domain of an acyl-ACP thioesterase that mediates a particular function, e.g., a specificity-determining domain. Illustrative fragments can be produced by C-terminal and/or N-terminal truncations 15 and include at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% of the full-length sequences disclosed herein.
[0048] In certain embodiments, percent sequence identity for variants of the nucleic acids or proteins discussed above can be calculated by using the full-length nucleic acid sequence (e.g., one of SEQ ID NOS: 2, 3, 5, 6, 8, 9, 11, 12, 14, 15, 17, 20 18, 20, 21, 23, 24, 26, 27, 29, 30, 32, 33, 35, 36, 38, 39, 41, 42, 44, 45, 47, 48, 50, 51, 53, 54, 56, 57, 59, 60, 62, 63, 65, 66, 68, 69, 71, 72, 74, 76, 78, 80, 81, 83, 84, 86, 87, 89, 90, 92, 93, 95, 96, 98, 99, 101, 102, 104, 105, 107 or 109) or full-length amino acid sequence (e.g., one of SEQ ID NOS: 1, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 34, 37, 40, 43, 46, 49, 52, 55, 58, 61, 64, 67, 70, 73, 75, 77, 79, 82, 85, 88, 91, 94, 97, 100, 25 103, 106, 108, or 110-192) as the reference sequence and comparing the full-length test sequence to this reference sequence. In some embodiments relating to fragments, percent sequence identity for variants of nucleic acid or protein fragments can be calculated over the entire length of the fragment.
[0049] The nucleic acids can be in isolated form, or part of a vector or other 30 construct, chromosome or host cell. It has been found that is many cases the full length gene (and protein) is not needed; for example, deletion of some or all of the N-terminal hydrophobic domain (typically an 18 amino acid domain starting with LPDW) yields a still-functional gene. In addition, fusions of the specificity determining regions of the genes in Table 1 with catalytic domains of other acyl-ACP thioesterases can yield functional genes. Thus, in certain embodiments, the invention encompasses functional fragments (e.g., specificity determining regions) of the disclosed nucleic acid or amino acids fused to heterologous acyl-ACP thioesterase 5 nucleic acid or amino acid sequences, respectively.
Table 1: FatB genes according to embodiments of the present invention
In certain embodiments, a host cell (e.g. plant or microalgal cell) is transformed to produce a recombinant FATB protein falling into one of clades 1-12 of Table la. These clades were determined by sequence alignment and observation of changes in 5 fatty acid profile when expressed in Prototheca. See Example 5. The FATB amino acid sequence can fall within x% amino acid sequence identity of each sequence in that clade listed in Table la, where x is a first second or third cutoff value, also listed in Table la.
Table la: Groupings of Novel FatB genes into clades.
Host Cells [0050] The host cell can be a single cell (e.g., microalga, bacteria, yeast) or part of a multicellular organism such as a plant or fungus. Methods for expressing Fatb genes in a plant are given in 5,850,022; 5,723,761; 5,639,790; 5,807,893; 5,455,167; 5,654,495; 5,512,482;5,298,421;5,667,997; and 5,344,771; 5,304,481, or can be accomplished using other techniques generally known in plant biotechnology. Engineering of oleaginous microbes including those of Chlorophyta is disclosed in WO2010/063032, WO2011,150411, and W02012/106560 and in the examples below.
[0051] Examples of oleaginous host cells include plant cells and microbial cells having a type II fatty acid biosynthetic pathway, including plastidic oleaginous cells such as those of oleaginous algae. Specific examples of microalgal cells include heterotrophic or obligate heterotrophic microalgae of the phylum Chlorophtya, the class Trebouxiophytae, the order Chlorellales, or the family Chlorellacae. Examples of oleaginous microalgae are provided in Published PCT Patent Applications W02008/151149, W02010/06032, WO2011/150410, and WO2011/150411, including species of Chlorella and Prototheca, a genus comprising obligate heterotrophs. The oleaginous cells can be, for example, capable of producing 25, 30, 40, 50, 60, 70, 80, 85, or about 90% oil by cell weight, ±5%. Optionally, the oils produced can be low in DHA or EPA fatty acids. For example, the oils can comprise less than 5%, 2 %, or 1% DHA and/or EPA. The above-mentioned publications also disclose methods for cultivating such cells and extracting oil, especially from microalgal cells; such methods are applicable to the cells disclosed herein and incorporated by reference for these teachings. When microalgal cells are used they can be cultivated autotrophically (unless an obligate heterotroph) or in the dark using a sugar (e.g., glucose, fructose and/or sucrose). In any of the embodiments described herein, the cells can be heterotrophic cells comprising an exogenous invertase gene so as to allow the cells to produce oil from a sucrose feedstock. Alternately, or in addition, the cells can metabolize xylose from cellulosic feedstocks. For example, the cells can be genetically engineered to express one or more xylose metabolism genes such as those encoding an active xylose transporter, a xylulose-5-phosphate transporter, a xylose isomerase, a xylulokinase, a xylitol dehydrogenase and a xylose reductase. See WO2012/154626, “GENETICALLY ENGINEERED MICROORGANISMS THAT METABOLIZE XYLOSE”, published Nov 15, 2012.
Oils and Related Products [0052] The oleaginous cells express one or more exogenous genes encoding fatty acid biosynthesis enzymes. As a result, some embodiments feature natural oils that were not obtainable from a non-plant or non-seed oil, or not obtainable at all.
[0053] The oleaginous cells produce a storage oil, which is primarily triacylglyceride and may be stored in storage bodies of the cell. A raw oil may be obtained from the cells by disrupting the cells and isolating the oil. W02008/151149, W02010/06032, WO2011/150410, and WO2011/1504 disclose heterotrophic cultivation and oil isolation techniques. For example, oil may be obtained by cultivating, drying and pressing the cells. The oils produced may be refined, bleached and deodorized (RBD) as known in the art or as described in W02010/120939. The raw or RBD oils may be used in a variety of food, chemical, and industrial products or processes. After recovery of the oil, a valuable residual biomass remains. Uses for the residual biomass include the production of paper, plastics, absorbents, adsorbents, as animal feed, for human nutrition, or for fertilizer.
[0054] Where a fatty acid profile of a triglyceride (also referred to as a “triacylglyceride” or “TAG”) cell oil is given here, it will be understood that this refers to a nonfractionated sample of the storage oil extracted from the cell analyzed under conditions in which phospholipids have been removed or with an analysis method that is substantially insensitive to the fatty acids of the phospholipids (e.g. using chromatography and mass spectrometry). The oil may be subjected to an RBD process to remove phospholipids, free fatty acids and odors yet have only minor or negligible changes to the fatty acid profile of the triglycerides in the oil. Because the cells are oleaginous, in some cases the storage oil will constitute the bulk of all the TAGs in the cell.
[0055] The stable carbon isotope value 813C is an expression of the ratio of 13C/12C relative to a standard (e.g. PDB, carbonite of fossil skeleton of Belemnite americana from Peedee formation of South Carolina). The stable carbon isotope value 613C (0/00) of the oils can be related to the 613C value of the feedstock used.
In some embodiments, the oils are derived from oleaginous organisms heterotrophically grown on sugar derived from a C4 plant such as com or sugarcane. In some embodiments the 513C (0/00) of the oil is from -10 to -17 0/00 or from -13 to -16 0/00.
[0056] The oils produced according to the above methods in some cases are made using a microalgal host cell. As described above, the microalga can be, without limitation, fall in the classification of Chlorophyta, Trebouxiophyceae , Chlorellales, Chlorellaceae, or Chlorophyceae. It has been found that microalgae of Trebouxiophyceae can be distinguished from vegetable oils based on their sterol profiles. Oil produced by Chlorella protothecoides was found to produce sterols that appeared to be brassicasterol, ergosterol, campesterol, stigmasterol, and β-sitosterol, when detected by GC-MS. However, it is believed that all sterols produced by Chlorella have 024β stereochemistry. Thus, it is believed that the molecules detected as campesterol, stigmasterol, and β-sitosterol, are actually 22,23-dihydrobrassicasterol, proferasterol and clionasterol, respectively. Thus, the oils produced by the microalgae described above can be distinguished from plant oils by the presence of sterols with C24ft stereochemistry and the absence of C24a stereochemistry in the sterols present. For example, the oils produced may contain 22, 23-dihydrobrassicasterol while lacking campesterol; contain clionasterol, while lacking in β-sitosterol, and/or contain poriferasterol while lacking stigmasterol. Alternately, or in addition, the oils may contain significant amounts of Δ7-poriferasterol.
[0057] In one embodiment, the oils provided herein are not vegetable oils. Vegetable oils are oils extracted from plants and plant seeds. Vegetable oils can be distinguished from the non-plant oils provided herein on the basis of their oil content. A variety of methods for analyzing the oil content can be employed to determine the source of the oil or whether adulteration of an oil provided herein with an oil of a different (e.g. plant) origin has occurred. The determination can be made on the basis of one or a combination of the analytical methods. These tests include but are not limited to analysis of one or more of free fatty acids, fatty acid profile, total triacylglycerol content, diacylglycerol content, peroxide values, spectroscopic properties (e.g. UV absorption), sterol profile, sterol degradation products, antioxidants (e.g. tocopherols), pigments (e.g. chlorophyll), dl3C values and sensory analysis (e.g. taste, odor, and mouth feel). Many such tests have been standardized for commercial oils such as the Codex Alimentarius standards for edible fats and oils.
[0058] Sterol profile analysis is a particularly well-known method for determining the biological source of organic matter. Campesterol, b-sitosterol, and stigamsterol are common plant sterols, with b-sitosterol being a principle plant sterol. For example, b-sitosterol was found to be in greatest abundance in an analysis of certain seed oils, approximately 64% in com, 29% in rapeseed, 64% in sunflower, 74% in cottonseed, 26% in soybean, and 79% in olive oil (Gul et al. J. Cell and Molecular Biology 5:71-79, 2006).
[0059] Oil isolated from Prototheca moriformis strain UTEX1435 were separately clarified (CL), refined and bleached (RB), or refined, bleached and deodorized (RBD) and were tested for sterol content according to the procedure described in JAOCS vol. 60, no.8, August 1983. Results of the analysis are shown below (units in mg/lOOg):
[0060] These results show three striking features. First, ergosterol was found to be the most abundant of all the sterols, accounting for about 50% or more of the total sterols. The amount of ergosterol is greater than that of campesterol, β-sitosterol, and stigmasterol combined. Ergosterol is steroid commonly found in fungus and not commonly found in plants, and its presence particularly in significant amounts serves as a useful marker for non-plant oils. Secondly, the oil was found to contain brassicasterol. With the exception of rapeseed oil, brassicasterol is not commonly found in plant based oils. Thirdly, less than 2% β-sitosterol was found to be present. β-sitosterol is a prominent plant sterol not commonly found in microalgae, and its presence particularly in significant amounts serves as a useful marker for oils of plant origin. In summary, Prototheca moriformis strain UTEX1435 has been found to contain both significant amounts of ergosterol and only trace amounts of β-sitosterol as a percentage of total sterol content. Accordingly, the ratio of ergosterol: β-sitosterol or in combination with the presence of brassicasterol can be used to distinguish this oil from plant oils.
[0061] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% β-sitosterol. In other embodiments the oil is free from β-sitosterol.
[0062] In some embodiments, the oil is free from one or more of β-sitosterol, campesterol, or stigmasterol. In some embodiments the oil is free from β-sitosterol, campesterol, and stigmasterol. In some embodiments the oil is free from campesterol. In some embodiments the oil is free from stigmasterol.
[0063] In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 24-ethylcholest-5-en-3-ol. In some embodiments, the 24-ethylcholest-5-en-3-ol is clionasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% clionasterol.
[0064] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 24-methylcholest-5-en-3-ol. In some embodiments, the 24-methylcholest-5-en-3-ol is 22, 23-dihydrobrassicasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% 22,23-dihydrobrassicasterol.
[0065] In some embodiments, the oil content of an oil provided herein contains, as a percentage of total sterols, less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% 5,22-cholestadien-24-ethyl-3-ol. In some embodiments, the 5, 22-cholestadien-24-ethyl-3-ol is poriferasterol. In some embodiments, the oil content of an oil provided herein comprises, as a percentage of total sterols, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, or 10% poriferasterol.
[0066] In some embodiments, the oil content of an oil provided herein contains ergosterol or brassicasterol or a combination of the two. In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% ergosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 25% ergosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 40% ergosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% of a combination of ergosterol and brassicasterol.
[0067] In some embodiments, the oil content contains, as a percentage of total sterols, at least 1%, 2%, 3%, 4% or 5% brassicasterol. In some embodiments, the oil content contains, as a percentage of total sterols less than 10%, 9%, 8%, 7%, 6%, or 5% brassicasterol.
[0068] In some embodiments the ratio of ergosterol to brassicasterol is at least 5:1, 10:1, 15:1,or 20:1.
[0069] In some embodiments, the oil content contains, as a percentage of total sterols, at least 5%, 10%, 20%, 25%, 35%, 40%, 45%, 50%, 55%, 60%, or 65% ergosterol and less than 20%, 15%, 10%, 5%, 4%, 3%, 2%, or 1% β-sitosterol. In some embodiments, the oil content contains, as a percentage of total sterols, at least 25% ergosterol and less than 5% β-sitosterol. In some embodiments, the oil content further comprises brassicasterol.
[0070] Sterols contain from 27 to 29 carbon atoms (C27 to C29) and are found in all eukaryotes. Animals exclusively make C27 sterols as they lack the ability to further modify the C27 sterols to produce C28 and C29 sterols. Plants however are able to synthesize C28 and C29 sterols, and C28/C29 plant sterols are often referred to as phytosterols. The sterol profile of a given plant is high in C29 sterols, and the primary sterols in plants are typically the C29 sterols b-sitosterol and stigmasterol. In contrast, the sterol profile of non-plant organisms contain greater percentages of C27 and C28 sterols. For example the sterols in fungi and in many microalgae are principally C28 sterols. The sterol profile and particularly the striking predominance of C29 sterols over C28 sterols in plants has been exploited for determining the proportion of plant and marine matter in soil samples (Huang, Wen-Yen, Meinschein W. G., “Sterols as ecological indicators”; Geochimica et Cosmochimia Acta. Vol 43. pp 739-745).
[0071] In some embodiments the primary sterols in the microalgal oils provided herein are sterols other than b-sitosterol and stigmasterol. In some embodiments of the microalgal oils, C29 sterols make up less than 50%, 40%, 30%, 20%, 10%, or 5% by weight of the total sterol content.
[0072] In some embodiments the microalgal oils provided herein contain C28 sterols in excess of C29 sterols. In some embodiments of the microalgal oils, C28 sterols make up greater than 50%, 60%, 70%, 80%, 90%, or 95% by weight of the total sterol content. In some embodiments the C28 sterol is ergosterol. In some embodiments the C28 sterol is brassicasterol.
[0073] In embodiments of the present invention, oleaginous cells expressing one or more of the genes of Table 1 can produce an oil with at least 20,40, 60 or 70% of C8, CIO, C12, C14 or C16 fatty acids. In a specific embodiment, the level of myristatc (C14:0) in the oil is greater than 30%.
[0074] Thus, in embodiments of the invention, there is a process for producing an oil, triglyceride, fatty acid, or derivative of any of these, comprising transforming a cell with any of the nucleic acids discussed herein. In another embodiment, the transformed cell is cultivated to produce an oil and, optionally, the oil is extracted.
Oil extracted in this way can be used to produce food, oleochemicals or other products.
[0075] The oils discussed above alone or in combination are useful in the production of foods, fuels and chemicals (including plastics, foams, films, etc). The oils, triglycerides, fatty acids from the oils may be subjected to C-H activation, hydroamino methylation, methoxy-carbonation, ozonolysis, enzymatic transformations, epoxidation, methylation, dimerization, thiolation, metathesis, hydroalkylation, lactonization, or other chemical processes.
[0076] After extracting the oil, a residual biomass may be left, which may have use as a fuel, as an animal feed, or as an ingredient in paper, plastic, or other product. For example, residual biomass from heterotrophic algae can be used in such products.
Example 1. Discovery of Novel FATB sequences [0077] Sequences of novel plant acyl-ACP thioesterases involved in seed-specific mid-chain (C8-C16) fatty acid biosynthesis in higher plants were isolated. Seed-specific lipid production genes were isolated through direct interrogation of RNA pools accumulating in oilseeds. Based on phylogenetic analysis, novel enzymes can be classified as members of FatB family of acyl-ACP thioesterases.
[0078] Seeds of oleaginous plants were obtained from local grocery stores or requested through USD A ARS National Plant Germplasm System (NPGS) from North Central Regional Plant Introduction Station (NCRIS) or USDA ARS North Central Soil Conservation Research Laboratory (Morris, MI). Dry seeds were homogenized in liquid nitrogen to powder, resuspended in cold extraction buffer containing 6-8M Urea and 3M LiCl and left on ice for a few hours to overnight at 4 °C. The seed homogenate was passed through NucleoSpin Filters (Macherey-Nagel) by centrifugation at 20,000g for 20 minutes in the refrigerated microcentrifuge (4 °C). The resulting RNA pellets were resuspended in the buffer containing 20 mM Tris HC1, pH7.5, 0.5% SDS, 100 mM NaCl, 25 mM EDTA, 2% PVPP) and RNA was subsequently extracted once with Phenol-Chloroform-Isoamyl Alcohol (25:24:1, v/v) and once with chloroform. RNA was finally precipitated with isopropyl alcohol (0.7 Vol.) in the presence of 150 mM of Na Acetate, pH5.2, washed with 80% ethanol by centrifugation, and dried. RNA samples were treated with Turbo DNAse (Lifetech) and purified further using RNeasy kits (Qiagen) following manufacturers’ protocols.
The resulting purified RNA samples were converted to pair-end cDNA libraries and subjected to next-generation sequencing (2xl00bp) using Illumina Hiseq 2000 platform. RNA sequence reads were assembled into corresponding seed transcriptomes using Trinity or Oases packages. Putative thioesterase-containg cDNA contigs were identified by mining transcriptomes for sequences with homology to known thioesterases. These in silico identified putative thioesterase cDNAs have been further verified by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting full-length thioesterase cDNAs. The resulting amplified products were cloned and sequenced de novo to confirm authenticity of identified thioesterase genes.
[0079] To interrogate evolutionary and functional relationship between novel acyl-ACP thioesterases and the members of two existing thioesterase classes (FatA and FatB), we performed a phylogenetic analysis using published full-length (Mayer and Shanklin, 2007) and truncated (THYME database) amino acid thioesterase sequences. Novel proteins appear to group with known acyl-ACP FatB thioesterases involved in biosynthesis of C8-C16 fatty acids. Moreover, novel thioesterases appear to cluster into 3 predominant out-groups suggesting distinct functional similarity and evolutionary relatedness among members of each cluster.
[0080] The amino acid sequences of the FatB genes follow are shown in Table 4.
Table 4: Amino acid sequences of FatB genes:
ChsFATB3j:
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQV KANASARPKANGSAVSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITT VFVAAEKQWTMLDRKSKRPDMLMDPFGVDRVVQDGAVFRQSFSIRSYEIGA DRT ASIETLMNIF QETSLNHCKSIGLLNDGF GRTPEMCKRDLIWVVTKMHIE V NRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIRATSVCAMMNQ KTRRF SKFP YE VRQEL APHF VD SAP VIED Y QKLHKLD VKT GD SICN GLTPRW NDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRQECGRDSVLESV T AMDP SKEGDRSL Y QHLLRLEDGTDIAKGRTKWRPKN AGKT SN GN SIS
Example 2. Cloning and fatty acid analysis of cells transformed with novel FATB genes [0081] In the example below, we detail the effect of expressing plant oilseed transcriptome-derived, heterologous thioesterases in the UTEX1435 (web.biosci.utexas.edu/utex/) strain, Strain A.
[0082] As in Example 1, RNA was extracted from dried plant seeds and submitted for paired-end sequencing using the Illumina Hiseq 2000 platform. RNA sequence reads were assembled into corresponding seed transcriptomes using Trinity or Oases packages and putative thioesterase-containing cDNA contigs were identified by mining transcriptomes for sequences with homology to known thioesterases.
These in silico identified putative thioesterase cDNAs were verified by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting full-length thioesterase cDNAs. The resulting amplified products were cloned and sequenced de novo to confirm authenticity of identified thioesterase genes and to identify sequence variants arising from expression of different gene alleles or diversity of sequences within a population of seeds. The resulting amino acid sequences were subjected to phylogenetic analysis using published full-length (Mayer and Shanklin, 2007) and truncated (THYME database) FatB sequences. The thioesterases that clustered with acyl-ACP FatB thioesterases, which are involved in biosynthesis of C8-C16 fatty acids, were pursued.
Construction of Transforming Vectors Expressing Acyl-ACP FatB Thioesterases [0083] 27 putative acyl-ACP FatB thioesterases from the species Cinnamomum camphora, Cuphea hyssopifolia, Cuphea PSR23, Cuphea wrightii, Cuphea heterophylla, and Cuphea viscosissima were synthesized in a codon-optimized form to reflect Prototheca moriformis (UTEX 1435) codon usage. Of the 27 genes synthesized, 24 were identified by our transcriptome sequencing efforts and the 3 genes from Cuphea viscosissima, were from published sequences in GenBank.
[0084] Transgenic strains were generated via transformation of the base strain Strain A (Prototheca moriformis, derived from UTEX 1435 by classical mutation and screening for high oil production) with a construct encoding 1 of the 27 FatB thioesterases. The construct pSZ2760 encoding Cinnamomum camphora (Cc) FATB lb is shown as an example, but identical methods were used to generate each of the remaining 26 constructs encoding the different respective thioesterases. Construct pSZ2760 can be written as 6S::CrTUB2:ScSUC2:CvNR::PmAMT3:CcFATBlb:CvNR::6S. The sequence of the transforming DNA is provided in Table 5 (pSZ2760). The relevant restriction sites in the construct from 5’-3’, BspQl, Kpnl, AscI, Mfel, EcoRI, Spel, Xhol, SacI, BspQl, respectively, are indicated in lowercase, bold, and underlined. BspQl sites delimit the 5 ’ and 3 ’ ends of the transforming DNA. Bold, lowercase sequences at the 5 ’ and 3 ’ end of the construct represent genomic DNA from UTEX 1435 that target integration to the 6S locus via homologous recombination. Proceeding in the 5 ’ to 3 ’ direction, the selection cassette has the C. reinhardtii β-tubulin promoter driving expression of the S. cerevisiae gene SUC2 (conferring the ability to grow on sucrose) and the Chlorella vulgaris Nitrate Reductase (NR) gene 3 ’ UTR. The promoter is indicated by lowercase, boxed text. The initiator ATG and terminator TGA for ScSUC2 are indicated by bold, uppercase italics, while the coding region is indicated with lowercase italics. The 3’ UTR is indicated by lowercase underlined text. The spacer region between the two cassettes is indicated by upper case text. The second cassette containing the codon optimized CcFATBlb gene (Table 5; pSZ2760) from Cinnamomum camphora is driven by the Prototheca moriformis endogenous AMT3 promoter, and has the Chlorella vulgaris Nitrate Reductase (NR) gene 3’ UTR. In this cassette, the AMT3 promoter is indicated by lowercase, boxed text. The initiator ATG and terminator TGA for the CcFATBlb gene are indicated in bold, uppercase italics, while the coding region is indicated by lowercase italics and the spacer region is indicated by upper case text. The 3’ UTR is indicated by lowercase underlined text. The final construct was sequenced to ensure correct reading frame and targeting sequences.
Table 5: pSZ2760 Transforming construct gctcttcgccgccgccactcctgctcgagcgcgcccgcgcgtgcgccgccagcgccttggccttttcgccgcgctcgtgc gcgtcgctgatgtccatcaccaggtccatgaggtctgccttgcgccggctgagccactgcttcgtccgggcggccaagag gagcatgagggaggactcctggtccagggtcctgacgtggtcgcggctctgggagcgggccagcatcatctggctctgc cgcaccgaggccgcctccaactggtcctccagcagccgcagtcgccgccgaccctggcagaggaagacaggtgaggg gggtatgaattgtacagaacaaccacgagccttgtctaggcagaatccctaccagtcatggctttacctggatgacggcctg cgaacagctgtccagcgaccctcgctgccgccgcttctcccgcacgcttctttccagcaccgtgatggcgcgagccagcg ccgcacgctggcgctgcgcttcgccgatctgaggacagtcggggaactctgatcagtctaaacccccttgcgcgttagtgtt gccatcctttgcagaccggtgagagccgacttgttgtgcgccaccccccacaccacctcctcccagaccaattctgtcacct ttttggcgaaggcatcggcctcggcctgcagagaggacagcagtgcccagccgctgggggttggcggatgcacgctca ggtacdctttcttgcgctatgacacttccagcaaaaggtagggcgggctgcgagacggcttcccggcgctgcatgcaaca ccgatgatgcttcgaccccccgaagctccttcggggctgcatgggcgctccgatgccgctccagggcgagcgctgtttaa atagccaggcccccgattgcaaagacattatagcgagctaccaaagccatattcaaacacctagatcactaccacttctaca caggccactcgagcttgtgatcgcactccgctaagggggcgcctcttcctcttcgtttcagtcacaacccgcaaaqggcgc gccATGctgctgcaggccttcctgttcctgctggccggcttcgccgccaagatcagcgcctccatgacgaacgagac gtccgaccgccccctggtgcacttcacccccaacaagggctggatgaacgaccccaacggcctgtggtacgacgag aaggacgccaagtggcacctgtacttccagtacaacccgaacgacaccgtctgggggacgcccttgttctggggcca cgccacgtccgacgacctgaccaactgggaggaccagcccatcgccatcgccccgaagcgcaacgactccggcgc cttctccggctccatggtggtggactacaacaacacctccggcttcttcaacgacaccatcgacccgcgccagcgctgc gtggccatctggacctacaacaccccggagtccgaggagcagtacatctcctacagcctggacggcggctacaccttc accgagtaccagaagaaccccgtgctggccgccaactccacccagttccgcgacccgaaggtcttctggtacgagcc ctcccagaagtggatcatgaccgcggccaagtcccaggactacaagatcgagatctactcctccgacgacctgaagt cctggaagctggagtccgcgttcgccaacgagggcttcctcggctaccagtacgagtgccccggcctgatcgaggtcc ccaccgagcaggaccccagcaagtcctactgggtgatgttcatctccatcaaccccggcgccccggccggcggctcct tcaaccagtacttcgtcggcagcttcaacggcacccacttcgaggccttcgacaaccagtcccgcgtggtggacttcgg caaggactactacgccctgcagaccttcttcaacaccgacccgacctacgggagcgccctgggcatcgcgtgggcctc caactgggagtactccgccttcgtgcccaccaacccctggcgctcctccatgtccctcgtgcgcaagttctccctcaaca ccgagtaccaggccaacccggagacggagctgatcaacctgaaggccgagccgatcctgaacatcagcaacgccg gcccctggagccggttcgccaccaacaccacgttgacgaaggccaacagctacaacgtcgacctgtccaacagcac cggcaccctggagttcgagctggtgtacgccgtcaacaccacccagacgatctccaagtccgtgttcgcggacctctcc ctctggttcaagggcctggaggaccccgaggagtacctccgcatgggcttcgaggtgtccgcgtcctccttcttcctgga ccgcgggaacagcaaggtgaagttcgtgaaggagaacccctacttcaccaaccgcatgagcgtgaacaaccagcc cttcaagagcgagaacgacctgtcctactacaaggtgtacggcttgctggaccagaacatcctggagctgtacttcaac gacggcgacgtcgtgtccaccaacacctacttcatgaccaccgggaacgccctgggctccgtgaacatgacgacggg ggtggacaacctgttctacatcgacaagttccaggtgcgcgaggtcaagTGAcaattzzcazcazcazctczzataz tatcgacacactctggacgctggtcgtgtgatggactgttgccgccacacttgctgccttgacctgtgaatatccctgccgctt ttatcaaacagcctcagtgtgtttgatcttgtgtgtacgcgcttttgcgagttgctagctgcttgtgctatttgcgaataccacccc cagcatccccttccctcgtttcatatcgcttgcatcccaaccgcaacttatctacgctgtcctgctatccctcagcgctgctcct gctcctgctcactgcccctcgcacagccttggtttgggctccgcctgtattctcctggtactgcaacctgtaaaccagcactg caatgctgatgcacgggaagtagtgggatgggaacacaaatggaAAGCT GT AT AGGG AT AAgaattcIggl ccgacaggacgcgcgtcaaaggtgctggtcgtgtatgccctggccggcaggtcgttgctgctgctggttagtgattccgca accctgattttggcgtcttattttggcgtggcaaacgctggcgcccgcgagccgggccggcggcgatgcggtgccccacg gctgccggaatccaagggaggcaagagcgcccgggtcagttgaagggctttacgcgcaaggtacagccgctcctgcaa ggctgcgtggtggaattggacgtgcaggtcctgctgaagttcctccaccgcctcaccagcggacaaagcaccggtgtatc aggtccgtgtcatccactctaaagaactcgactacgacctactgatggccctagattcttcatcaaaaacgcctgagacactt gcccaggattgaaactccctgaagggaccaccaggggccctgagttgttccttccccccgtggcgagctgccagccagg ctgtacctgtgatcgaggctggcgggaaaataggcttcgtgtgctcaggtcatgggaggtgcaggacagctcatgaaacg ccaacaatcgcacaattcatgtcaagctaatcagctatttcctcttcacgagctgtaattgtcccaaaattctggtctaccgggg gtgatccttcgtgtacgggcccttccctcaaccctaggtatgcgcgcatgcggtcgccgcgcaactcgcgcgagggccga gggtttgggacgggccgtcccgaaatgcagttgcacccggatgcgtggcaccttttttgcgataatttatgcaatggactgct ctgcaaaattctggctctgtcgccaaccctaggatcagcggcgtaggatttcgtaatcattcgtcctgatggggagctaccg actaccctaatatcagcccgactgcctgacgccagcgtccacttttgtgcacacattccattcgtgcccaagacatttcattgt ggtgcgaagcgtccccagttacgctcacctgtttcccgacctccttactgttctgtcgacagagcgggcccacaggccggt cgcagcmctaetATGgccaccacctccctggcctccgccttctgctccatgaaggccgtgatgctggcccgcgacg gccgcggcctgaagccccgctcctccgacctgcagctgcgcgccggcaacgcccagacctccctgaagatgatcaac ggcaccaagttctcctacaccgagtccctgaagaagctgcccgactggtccatgctgttcgccgtgatcaccaccatctt ctccgccgccgagaagcagtggaccaacctggagtggaagcccaagcccaaccccccccagctgctggacgacca cttcggcccccacggcctggtgttccgccgcaccttcgccatccgctcctacgaggtgggccccgaccgctccacctcc atcgtggccgtgatgaaccacctgcaggaggccgccctgaaccacgccaagtccgtgggcatcctgggcgacggctt cggcaccaccctggagatgtccaagcgcgacctgatctgggtggtgaagcgcacccacgtggccgtggagcgctacc ccgcctggggcgacaccgtggaggtggagtgctgggtgggcgcctccggcaacaacggccgccgccacgacttcct ggtgcgcgactgcaagaccggcgagatcctgacccgctgcacctccctgtccgtgatgatgaacacccgcacccgcc gcctgtccaagatccccgaggaggtgcgcggcgagatcggccccgccttcatcgacaacgtggccgtgaaggacga ggagatcaagaagccccagaagctgaacgactccaccgccgactacatccagggcggcctgaccccccgctggaa cgacctggacatcaaccagcacgtgaacaacatcaagtacgtggactggatcctggagaccgtgcccgactccatctt cgagtcccaccacatctcctccttcaccatcgagtaccgccgcgagtgcacccgcgactccgtgctgcagtccctgacc accgtgtccggcggctcctccgaggccggcctggtgtgcgagcacctgctgcagctggagggcggctccgaggtgct gcgcgccaagaccgagtggcgccccaagctgtccttccgcggcatctccgtgatccccgccgagtcctccgtgatgga ctacaaggaccacgacggcgactacaaggaccacgacatcgactacaaggacgacgacgacaagTGAclcga ggcagcagcagctcggatagtatcgacacactctggacgctggtcgtgtgatggactgttgccgccacacttgctgccttg acctgtgaatatccctgccgcttttatcaaacagcctcagtgtgtttgatcttgtgtgtacgcgcttttgcgagttgctagctgctt gtgctatttgcgaataccacccccagcatccccttccctcgtttcatatcgcttgcatcccaaccgcaacttatctacgctgtcc tgctatccctcagcgctgctcctgctcctgctcactgcccctcgcacagccttggtttgggctccgcctgtattctcctggtact
gcaacctgtaaaccagcactgcaatgctgatgcacgggaagtagtgggatgggaacacaaatggaAAGCTGTAT AGGGATAACAGGGTAATgagctcttgttttccagaaggagttgctccttgagcctttcattctcagcctcgata acctccaaagccgctctaattgtggagggggttcgaatttaaaagcttggaatgttggttcgtgcgtctggaacaagcccag acttgttgctcactgggaaaaggaccatcagctccaaaaaacttgccgctcaaaccgcgtacctctgctttcgcgcaatctgc cctgttgaaatcgccaccacattcatattgtgacgcttgagcagtctgtaattgcctcagaatgtggaatcatctgccccctgtg cgagcccatgccaggcatgtcgcgggcgaggacacccgccactcgtacagcagaccattatgctacctcacaatagttca taacagtgaccatatttctcgaagctccccaacgagcacctccatgctctgagtggccaccccccggccctggtgcttgcg gagggcaggtcaaccggcatggggctaccgaaatccccgaccggatcccaccacccccgcgatgggaagaatctctcc ccgggatgtgggcccaccaccagcacaacctgctggcccaggcgagcgtcaaaccataccacacaaatatccttggcat cggccctgaattccttctgccgctctgctacccggtgcttctgtccgaagcaggggttgctagggatcgctccgagtccgca aacccttgtcgcgtggcggggcttgttcgagcttgaagagc [0085] Constmcts encoding the identified heterologous FatB genes, such as
CcFATBlb from pSZ2760 in Table 6, were transformed into Strain A, and selected for the ability to grow on sucrose. Transformations, cell culture, lipid production and fatty acid analysis were all carried out as previously described. After cultivating on sucrose under low nitrogen conditions to accumulate oil, fatty acid profiles were determined by FAME-GC. The top performer from each transformation, as judged by the ability to produce the highest level of midchain fatty acids, is shown in Table 4.
Table 6: Alteration of Fatty Acid Profiles in S3150 upon Expression of Heterologous FatB Thioesterases
[0086] Many of the acyl-ACP FatB thioesterases were found to exhibit midchain activity when expressed in Prototheca moriformis. For example, expression of CcFATBlb causes an increase in myristate levels from 2% of total fatty acids in the parent, Strain A, to ~ 15% in the D1670-13 primary transformant. Other examples include CcFATB4, which exhibits an increase in laurate levels from 0% in Strain A to ~33%, and ChsFATB3, which exhibits an increase in myristate levels to ~34%. Although some of the acyl-ACP thioesterases did not exhibit dramatic effects on midchain levels in the current incarnation, efforts will likely develop to optimize some of these constructs.
[0087] Sequences of the Heterologous Acyl-ACP Thioesterases Identified and Transformed into P. moriformis (UTEX 1435) [0088] A complete listing of relevant sequences for the transforming constructs, such as the deduced amino acid sequence of the encoded acyl-ACP thioesterase, the native CDS coding sequence, the Prototheca moriformis codon-optimized coding sequence, and the nature of the sequence variants examined, is provided as SEQ ID NOS: 1-78.
Example 3. Discovery and Cloning of Additional FATB genes
Additional FATB genes were obtained from seeds as described above. The species and number of FatB genes identified were:
[0089] The thioesterases that clustered with acyl-ACP FatB thioesterases, which are involved in biosynthesis of C8-C16 fatty acids, were pursued. The native, putative plastid-targeting transit peptide sequence is indicated by underlining.
[0090] Construction of Transforming Vectors Expressing Acyl-ACP FatB Thioesterases. The nine putative Acyl-ACP FatB Thioesterases from the species Cuphea calcarata, Cuphea painter, Cuphea hookeriana, Cuphea avigera var. pulcherrima, Cuphea paucipetala, Cuphea procumbens, and Cuphea ignea were synthesized in a codon-optimized form to reflect UTEX 1435 codon usage. In contrast to the previous example, the new Acyl-ACP FatB thioesterases were synthesized with a modified transit peptide from Chlorella protothecoides (Cp) in place of the native transit peptide. The modified transit peptide derived from the CpSADl gene, “CpSADltp trimmed”, was synthesized as an in-frame, N-terminal fusion to the FatB acyl-ACP thioesterases in place of the native transit peptide; the resulting sequences are listed below. The novel FatB genes were cloned into Prototheca moriformis as described above. Constructs encoding heterologous FatB genes were transformed into strain S6165 (a descendant of S3150/Strain A) and selected for the ability to grow on sucrose. Transformations, cell culture, lipid production and fatty acid analysis were all carried out as previously described. The results for the nine novel FatB acyl-ACP thioesterases are displayed in the table immediately below.
ο ^ οο 3 ο so £ “ Ρ ο & <ϊΓ 5 C ^ SL ^ Ρ· 3-
4? I § ^ £ Ρ Η S’ CO p Η- l-t - ο ^ ο Ο ρ ^ ^· S > ΕΤ Η
Cd Γ"Κ vi
1/1 C so <, so Ef ® o o o O x O 5t p P S-·
O CO
v O "-J
0s NO
-. °X £ n
§ I l-h P p
3 P "C a- 00 so ox acid levels; CigneaFATBl, which exhibits 8% 00:0 and 1% 02:0 fatty acid levels; CcalcFATBl, which exhibits 18% 04:0 and 12% 02:0 levels; and CaFATBl, which exhibits 22% C8:0 and 9% 00:0 fatty acid levels.
[0092] CaFATBl, which exhibits high C8:0 and 00:0 levels, is of particular interest. CaFATBl arose from two separate contigs that were assembled from the Cupha avigera var. pulcherrima transcriptome, S17_Cavig_trinity_7406 and S17_Cavig_trinity_7407. Although the two partial contigs exhibit only 17 nucleotides of overlap, we were able to assemble a putative full length transcript encoding CaFATBl from the two contigs and then subsequently confirm the existence of the full-length transcript by direct reverse transcription PCR analysis using seed RNA and primer pairs targeting the full-length CaFATBl thioesterase cDNA. Tjellstrom et al. (2013) discloses the expression of a newly identified fatty acyl-ACP thioesterase from Cuphea pulcherrima that they named “CpuFATB3” (Genbank accession number KC675178). The coding sequence of CpuFATB3 is 100% identical to the CaFATBl gene we identified and contains one nucleotide difference in the RNA sequence outside the predicted coding region. Tjellstrom et al. (2013) showed that CpuFATB3 produces an average of 4.8% C8:0 when expressed in Arabidopsis, and further requires deletion of two acyl-ACP synthetases, AAE15/16, to produce an average of 9.2% C8:0 with a maximum level of-12% C8.0. The CaFATBl gene we identified was codon-optimized for expression in UTEX1435 and generated as a CpSADltp-trimmed transit peptide fusion before introduction into S6165. The CpSADltp_trimmed:CaFATBl gene produces an average C8:0 level of 14% and a maximum level of 22% C8:0 without requiring the deletion of endogenous acyl-ACP synthetases.
[0093] Table 7. Amino Acid Sequences of Additional Novel FatB Acyl-ACP Thioesterases. In the appended sequence listings, the native, putative plastid-targeting transit peptide sequence is underlined:
Example 4. FATB consensus sequences: Discovery, cloning and fatty acid profiles
[0094] In the course of testing several new putative midchain FatB thioesterases in UTEX1435, S3150 (Strain A above), we identified several thioesterases with increased C10:0 and C16:0 activity above the background midchain levels found in the strain. We reasoned that a consensus sequence could be obtained for an idealized C10:0 thioesterase and C16:0 thioesterase from aligning the bestperforming 00:0 and 06:0 thioesterases. A consensus 00:0 specific thioesterase sequence was generated using the C. palustris FatBl (CpFATBl), C. PSR23 FatB3 (CuPSR23FATB3), C. viscosissima FatBl (CvisFATBl), C. glossostoma FatBl (CgFATBl), and C. carthagenensis FatB2 (CcrFATB2) sequences as inputs resulting in a 00:0 specific consensus sequence termed JcFATBl/SzFATBl. A consensus 06:0 specific thioesterase sequence was generated using the C. heterophylla FatB3a (ChtFATB3a), C. carthagenensis FatBl (CcrFATBl), C. viscosissima FatB2 (CvisFATB2), C. hookeriana FatBl (ChFATBl; AAC48990), C. hyssopifolia FatB2 (ChsFATB2), C. calophylla FatB2 (CcalFATB2; ABB71581), C. hookeriana FatBl-1 (ChFATBl-1; AAC72882), C. lanceolata FatBl (C1FATB1; CAC19933), and C. wrightii FatB4a (CwFATB4a) sequences as inputs resulting in a 06:0 specific consensus sequence termed JcFATB2/SzFATB2. The resulting consensus sequences were synthesized, cloned into a vector identical to that used to test other FatB thioesterases, and introduced into S3150 as described above. The consensus amino acid sequences are given as SEQ ID NOs. 106 and 107; the nucleic acid sequences were based on these amino acid sequences using codon optimization for Prototheca moriformis. The transformants were selected, cultivated and the oil was extracted and analyzed by FAME-GC-FID. The fatty acid profiles obtained are given in the table below.
Example 5: Clade analysis
Various novel FATB thioesterases were clustered according to a neighbor joining algorithm. These were found to form twelve clades as listed in Table la. Putative function was assigned based on expression in Prototheca as described above.
[0095] The described embodiments of the invention are intended to be merely exemplary and numerous variations and modifications will be apparent to those skilled in the art. All such variations and modifications are intended to be within the scope of the present invention.
Sequence Listing SEQ ID NO 1:
Cinnamomum camphora (Cc) FATBlb variant M25L, M322R, AT367-D368 amino acid sequence
MATTSLASAFCSMKAVMLARDGRGLKPRSSDLQLRAGNAQTSLKMINGTKFSYTESLKKLPD
WSMLFAVITTIFSAAEKQWTNLEWKPKPNPPQLLDDHFGPHGLVFRRTFAIRSYEVGPDRSTSI
VAVMNHLQEAALNHAKS VGILGDGF GTTLEM SKRDLIWVVKRTHVAVERYPAW GDT VEVE
CWVGASGNNGRRHDFLVRDCKTGEILTRCTSLSVMMNTRTRRLSKIPEEVRGEIGPAFIDNVA
VKDEEIKKPQKLNDSTADYIQGGLTPRWNDLDINQHVNNIKYVDWILETVPDSIFESHHISSFTI
EYRRECTRDSVLQSLTTVSGGSSEAGLVCEHLLQLEGGSEVLRAKTEWRPKLSFRGISVIPAES sv* SEQ ID NO 2:
Cinnamomum camphora (Cc) FATBlb variant M25L, M322R, AT367-D368 coding DNA sequence
TTAGCTTCTGCTTTCTGCTCGATGAAAGCTGTAATGTTGGCTCGTGATGGCAGGGGCTTGA
AACCCAGGAGCAGTGATTTGCAGCTGAGGGCGGGAAATGCACAAACCTCTTTGAAGATGA
TCAATGGGACCAAGTTCAGTTACACAGAGAGCTTGAAAAAGTTGCCTGACTGGAGCATGC
TCTTTGCAGTGATCACGACCATCTTTTCGGCTGCTGAGAAGCAGTGGACCAATCTAGAGTG
GAAGCCGAAGCCGAATCCACCCCAGTTGCTTGATGACCATTTTGGGCCGCATGGGTTAGTT
TTCAGGCGCACCTTTGCCATCAGATCGTATGAGGTGGGACCTGACCGCTCCACATCTATAG
TGGCTGTTATGAATCACTTGCAGGAGGCTGCACTTAATCATGCGAAGAGTGTGGGAATTCT
AGGAGATGGATTCGGTACGACGCTAGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAA
ACGCACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGTGATACTGTTGAAGTAGAGTG
CTGGGTTGGTGCATCGGGAAATAATGGCAGGCGCCATGATTTCCTTGTCCGGGACTGCAA
AACAGGCGAAATTCTTACAAGATGTACCAGTCTTTCGGTGATGATGAATACAAGGACAAG
GAGGTTGTCCAAAATCCCTGAAGAAGTTAGAGGGGAGATAGGGCCTGCATTCATTGATAA
TGTGGCTGTCAAGGACGAGGAAATTAAGAAACCACAGAAGCTCAATGACAGCACTGCAG
ATTACATCCAAGGAGGATTGACTCCTCGATGGAATGATTTGGATATCAATCAGCACGTTA
ACAACATCAAATACGTTGACTGGATTCTTGAGACTGTCCCAGACTCAATCTTTGAGAGTCA
TCATATTTCCAGCTTCACTATTGAATACAGGAGAGAGTGCACGAGGGATAGCGTGCTGCA
GTCCCTGACCACTGTCTCCGGTGGCTCGTCGGAAGCTGGGTTAGTGTGCGAGCACTTGCTC
CAGCTTGAAGGTGGGTCTGAGGTATTGAGGGCAAAAACAGAGTGGAGGCCTAAGCTTAGT
TTCAGAGGGATTAGTGTGATACCCGCAGAATCGAGTGTCTAA SEQ ID NO 3:
Cinnamomum camphora (Cc) FATBlb variant M25L, M322R, ΔΤ367-0368 coding DNA sequence codon optimized for Prototheca moriformis
TTAGCTTCTGCTTTCTGCTCGATGAAAGCTGTAATGTTGGCTCGTGATGGCAGGGGCTTGA
AACCCAGGAGCAGTGATTTGCAGCTGAGGGCGGGAAATGCACAAACCTCTTTGAAGATGA
TCAATGGGACCAAGTTCAGTTACACAGAGAGCTTGAAAAAGTTGCCTGACTGGAGCATGC
TCTTTGCAGTGATCACGACCATCTTTTCGGCTGCTGAGAAGCAGTGGACCAATCTAGAGTG
GAAGCCGAAGCCGAATCCACCCCAGTTGCTTGATGACCATTTTGGGCCGCATGGGTTAGTT
TTCAGGCGCACCTTTGCCATCAGATCGTATGAGGTGGGACCTGACCGCTCCACATCTATAG
TGGCTGTTATGAATCACTTGCAGGAGGCTGCACTTAATCATGCGAAGAGTGTGGGAATTCT
AGGAGATGGATTCGGTACGACGCTAGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAA
ACGCACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGTGATACTGTTGAAGTAGAGTG
CTGGGTTGGTGCATCGGGAAATAATGGCAGGCGCCATGATTTCCTTGTCCGGGACTGCAA
AACAGGCGAAATTCTTACAAGATGTACCAGTCTTTCGGTGATGATGAATACAAGGACAAG
GAGGTTGTCCAAAATCCCTGAAGAAGTTAGAGGGGAGATAGGGCCTGCATTCATTGATAA
TGTGGCTGTCAAGGACGAGGAAATTAAGAAACCACAGAAGCTCAATGACAGCACTGCAG
ATTACATCCAAGGAGGATTGACTCCTCGATGGAATGATTTGGATATCAATCAGCACGTTA
ACAACATCAAATACGTTGACTGGATTCTTGAGACTGTCCCAGACTCAATCTTTGAGAGTCA
TCATATTTCCAGCTTCACTATTGAATACAGGAGAGAGTGCACGAGGGATAGCGTGCTGCA
GTCCCTGACCACTGTCTCCGGTGGCTCGTCGGAAGCTGGGTTAGTGTGCGAGCACTTGCTC
CAGCTTGAAGGTGGGTCTGAGGTATTGAGGGCAAAAACAGAGTGGAGGCCTAAGCTTAGT
TTCAGAGGGATTAGTGTGATACCCGCAGAATCGAGTGTCTAA SEQ ID N0:4
Cinnamomum camphor a (Cc) FATB4 amino acid sequence
MVTTSLASAYFSMKAVMLAPDGRGIKPRSSGLQVRAGNERNSCKVINGTKVKDTEGLKGCST LQGQSMLDDHF GLHGLVFRRTF AIRC YE VGPDRST SIMAVMNHLQEAARNHAE SLGLLGDGF GETLEMSKRDLIWWRRTHVAVERYPAWGDTVEVEAWVGASGNTGMRRDFLVRDCKTGHI LTRCTSVSVMMNMRTRRLSKIPQEVRAEIDPLFIEKVAVKEGEIKKLQKLNDSTADYIQGGWT PRWNDLDVNQHVNNIIYVGWIFKSVPDSISENHHLSSITLEYRRECTRGNKLQSLTTVCGGSSE AGIICEHLLQLEDGSEVLRARTEWRPKHTDSFQGISERFPQQEPHK SEQ ID NO: 5
Cinnamomum camphor a (Cc) FATB4 coding DNA sequence
ATGGTCACCACCTCTTTAGCTTCCGCTTACTTCTCGATGAAAGCTGTAATGTTGGCTCCTGA
CGGCAGGGGCATAAAGCCCAGGAGCAGTGGTTTGCAGGTGAGGGCGGGAAATGAACGAA
ACTCTTGCAAGGTGATCAATGGGACCAAGGTCAAAGACACGGAGGGCTTGAAAGGGTGC
AGC ACGTTGCAAGGCCAGAGC ATGCTT GAT GACCATTTTGGT CT GC ATGGGCT AGTTTTCA
GGCGCACCTTTGCAATCAGATGCTATGAGGTTGGACCTGACCGCTCCACATCCATAATGGC
TGTTATGAATCACTTGCAGGAAGCTGCACGTAATCATGCGGAGAGTCTGGGACTTCTAGG
AGATGGATTCGGTGAGACACTGGAGATGAGTAAGAGAGATCTGATATGGGTTGTGAGACG
CACGCATGTTGCTGTGGAACGGTACCCTGCTTGGGGCGATACTGTTGAAGTCGAGGCCTG
GGTGGGTGCATCAGGTAACACTGGCATGCGCCGCGATTTCCTTGTCCGCGACTGCAAAAC
TGGCCACATTCTTACAAGATGTACCAGTGTTTCAGTGATGATGAATATGAGGACAAGGAG
ATTGTCCAAAATTCCCCAAGAAGTTAGAGCGGAGATTGACCCTCTTTTCATTGAAAAGGTT
GCTGTCAAGGAAGGGGAAATTAAAAAATTACAGAAGTTGAATGATAGCACTGCAGATTAC
ATTC AAGGGGGTTGGACTCCTCGAT GGAAT GATTT GGAT GTC AATCAGC ACGT GAAC AAT
ATCATATACGTTGGCTGGATTTTTAAGAGCGTCCCAGACTCTATCTCTGAGAATCATCATC
TTTCTAGCATCACTCTCGAATACAGGAGAGAGTGCACAAGGGGCAACAAGCTGCAGTCCC
TGACCACTGTTTGTGGTGGCTCGTCGGAAGCTGGGATCATATGTGAGCACCTACTCCAGCT
TGAGGATGGGTCTGAGGTTTTGAGGGCAAGAACAGAGTGGAGGCCCAAGCACACCGATA
GTTTCC AAGGC ATTAGT GAGAGATTCCCGCAGCAAGAACCGCAT AAGT AA SEQ ID NO: 6
Cinnamomum camphora (Cc) FATB4 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGACCACCTCCCTGGCCTCCGCCTACTTCTCCATGAAGGCCGTGATGCTGGCCCCCG
ACGGCCGCGGCATCAAGCCCCGCTCCTCCGGCCTGCAGGTGCGCGCCGGCAACGAGCGCA
ACTCCTGCAAGGT GATCAACGGCACCAAGGTGAAGGAC ACCGAGGGCCT GAAGGGCT GC
TCCACCCTGCAGGGCCAGTCCATGCTGGACGACCACTTCGGCCTGCACGGCCTGGTGTTCC
GCCGCACCTTCGCCATCCGCTGCTACGAGGTGGGCCCCGACCGCTCCACCTCCATCATGGC
CGTGATGAACCACCTGCAGGAGGCCGCCCGCAACCACGCCGAGTCCCTGGGCCTGCTGGG
CGACGGCTTCGGCGAGACCCTGGAGATGTCCAAGCGCGACCTGATCTGGGTGGTGCGCCG
CACCCACGTGGCCGTGGAGCGCTACCCCGCCTGGGGCGACACCGTGGAGGTGGAGGCCTG
GGTGGGCGCCTCCGGCAACACCGGCATGCGCCGCGACTTCCTGGTGCGCGACTGCAAGAC
CGGCCACATCCTGACCCGCTGCACCTCCGTGTCCGTGATGATGAACATGCGCACCCGCCGC
CTGTCCAAGATCCCCCAGGAGGTGCGCGCCGAGATCGACCCCCTGTTCATCGAGAAGGTG
GCCGTGAAGGAGGGCGAGATCAAGAAGCTGCAGAAGCTGAACGACTCCACCGCCGACTA
CATCCAGGGCGGCTGGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAA
CATCATCTACGTGGGCTGGATCTTCAAGTCCGTGCCCGACTCCATCTCCGAGAACCACCAC
CTGTCCTCCATCACCCTGGAGTACCGCCGCGAGTGCACCCGCGGCAACAAGCTGCAGTCC
CTGACCACCGTGTGCGGCGGCTCCTCCGAGGCCGGCATCATCTGCGAGCACCTGCTGCAG
CTGGAGGACGGCTCCGAGGTGCTGCGCGCCCGCACCGAGTGGCGCCCCAAGCACACCGAC
TCCTTCCAGGGCATCTCCGAGCGCTTCCCCCAGCAGGAGCCCCACAAGTGA SEQ ID NO: 7
Cinnamomum camphora (Cc) FATB3 amino acid sequence
MVAT AAAS AFFPVGAPAT S S ATS AKASMMPDNLDARGIKPKPAS S SGLQVKANAHASPKING
SKVSTDTLKGEDTLTSSPAPRTFINQLPDWSMFLAAITTIFLAAEKQWTNLDWKPRRPDMLAD
PFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGFGATPEM
SRRDLIWVVTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEILTRATSV
WVMMNKRTRRLSKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLTPRWSDLD
VNQHVNNVKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSMTAVSGGGSAAGGSPE
S S VECDHLLQLE SGPEWRGRTE WRPKS ANN SRSILEMPAE SL SEQ ID NO: 8
Cinnamomum camphora (Cc) FATB3 coding DNA sequence
ATGGTTGCCACCGCTGCTGCTTCTGCTTTCTTCCCGGTCGGTGCTCCGGCTACGTCATCTGC
AACTTCAGCCAAAGCGTCGATGATGCCTGATAATTTGGATGCCAGAGGCATCAAACCGAA
GCCGGCTTCGTCCAGCGGCTTGCAGGTTAAGGCAAATGCCCATGCCTCTCCCAAGATTAAT
GGTTCCAAGGTGAGCACGGATACCTTGAAGGGGGAAGACACCTTAACTTCCTCGCCCGCC
CCACGGACCTTTATCAACCAATTGCCTGACTGGAGCATGTTCCTTGCTGCCATCACAACTA
TTTTCTTGGCTGCCGAGAAGCAGTGGACGAATCTCGACTGGAAGCCCAGAAGACCCGACA
TGCTTGCTGACCCGTTTGGCATCGGGAGGTTTATGCAGGATGGGCTGATTTTCAGGCAGCA
CTTTGCAATCAGATCTTATGAGATTGGGGCTGATAGAACGGCGTCTATAGAGACTTTAATG
AATCACTTGCAGGAGACTGCACTTAATCATGTGAGGAGTGCTGGACTCCTAGGTGATGGA
TTTGGTGCGACACCTGAGATGAGTAGAAGAGATCTGATATGGGTTGTAACACGTATGCAG
GTTCTTGTGGACCGCTACCCTGCTTGGGGTGATATTGTTGAAGTAGAGACCTGGGTTGGTG
CATCTGGAAAAAATGGTATGCGCCGTGATTGGCTTGTTCGGGACAGCCAAACTGGTGAAA
TTCTCACACGAGCTACCAGTGTTTGGGTGATGATGAATAAACGGACAAGGCGATTGTCCA
AACTTCCTGAAGAAGTTAGAGGGGAAATAGGGCCTTATTTTATAGAAGATGTTGCTATCA
TAGAGGAGGACAACAGGAAACTACAGAAGCTCAATGAAAACACTGCTGATAATGTTCGA
AGGGGTTTGACTCCTCGCTGGAGTGATCTGGATGTTAATCAGCATGTGAACAATGTCAAAT
ACATTGGTTGGATTCTTGAGAGTGCACCAGGATCCATCTTGGAGAGTCATGAGCTTTCCTG
CATGACCCTTGAATACAGGAGAGAATGTGGGAAGGACAGTGTGCTGCAGTCAATGACTGC
TGTCTCTGGTGGAGGCAGTGCAGCAGGTGGCTCACCAGAATCTAGCGTTGAGTGTGACCA
CTTGCTCCAGCTAGAGAGTGGGCCTGAAGTTGTGAGGGGAAGAACCGAGTGGAGGCCCA
AGAGTGCTAATAACTCGAGGAGCATCCTGGAGATGCCGGCCGAGAGC SEQ ID NO: 9
Cinnamomum camphora (Cc) FATB4 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCGCCTCCGCCTTCTTCCCCGTGGGCGCCCCCGCCACCTCCTCCG
CCACCTCCGCCAAGGCCTCCATGATGCCCGACAACCTGGACGCCCGCGGCATCAAGCCCA
AGCCCGCCTCCTCCTCCGGCCTGCAGGTGAAGGCCAACGCCCACGCCTCCCCCAAGATCA
ACGGCTCCAAGGTGTCCACCGACACCCTGAAGGGCGAGGACACCCTGACCTCCTCCCCCG
CCCCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGTTCCTGGCCGCCATCACCAC
CATCTTCCTGGCCGCCGAGAAGCAGTGGACCAACCTGGACTGGAAGCCCCGCCGCCCCGA
CATGCTGGCCGACCCCTTCGGCATCGGCCGCTTCATGCAGGACGGCCTGATCTTCCGCCAG
C ACTT CGCC AT CCGCT CCT ACGAGATCGGCGCCGACCGC ACCGCCTCC ATCGAGACCCT GA
TGAACCACCTGCAGGAGACCGCCCTGAACCACGTGCGCTCCGCCGGCCTGCTGGGCGACG
GCTTCGGCGCCACCCCCGAGATGTCCCGCCGCGACCTGATCTGGGTGGTGACCCGCATGC
AGGTGCTGGTGGACCGCTACCCCGCCTGGGGCGACATCGTGGAGGTGGAGACCTGGGTGG
GCGCCTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGGTGCGCGACTCCCAGACCGGCG
AGAT CCT GACCCGCGCC ACCT CCGTGT GGGTGAT GAT GAAC AAGCGC ACCCGCCGCCT GT
CCAAGCTGCCCGAGGAGGTGCGCGGCGAGATCGGCCCCTACTTCATCGAGGACGTGGCCA
TCATCGAGGAGGACAACCGCAAGCTGCAGAAGCTGAACGAGAACACCGCCGACAACGTG
CGCCGCGGCCTGACCCCCCGCTGGTCCGACCTGGACGTGAACCAGCACGTGAACAACGTG
AAGTACATCGGCTGGATCCTGGAGTCCGCCCCCGGCTCCATCCTGGAGTCCCACGAGCTGT
CCTGCATGACCCTGGAGTACCGCCGCGAGTGCGGCAAGGACTCCGTGCTGCAGTCCATGA
CCGCCGTGTCCGGCGGCGGCTCCGCCGCCGGCGGCTCCCCCGAGTCCTCCGTGGAGTGCG
ACCACCTGCTGCAGCTGGAGTCCGGCCCCGAGGTGGTGCGCGGCCGCACCGAGTGGCGCC
CCAAGTCCGCCAACAACTCCCGCTCCATCCTGGAGATGCCCGCCGAGTCCCTGTGA SEQ ID NO: 10
Cuphea hyssopifolia (Chs) FATB1 amino acid sequence
MVATNAAAFSAYTFFLTSPTHGYSSKRLADTQNGYPGTSLKSKSTPPPAAAAARNGALPLLAS
ICKCPKKADGSMQLDSSLVFGFQFYIRSYEVGADQTVSIQTVLNYLQEAAINHVQSAGYFGDS
FGATPEMTKRNLIWVITKMQVLVDRYPAWGDVVQVDTWTCSSGKNSMQRDWFVRDLKTGD
IITRASSVWVLMNRLTRKLSKIPEAVLEEAKLFVMNTAPTVDDNRKLPKLDGSSADYVLSGLT
PRWSDLDMNQHVNNVKYIAWILESVPQSIPETHKLSAITVEYRRECGKNSVLQSLTNVSGDGI TCGNSIIECHHLLQLETGPEILLARTEWISKEPGFRGAPIQAEKVYNNK* SEQ ID NO: 11
Cuphea hyssopifolia (Chs) FATB1 coding DNA sequence
ATGGTTGCCACTAATGCTGCTGCCTTTTCTGCTTATACTTTCTTCCTTACTTCACCAACTCA
TGGTTACTCTTCCAAACGTCTCGCCGATACTCAAAATGGTTATCCGGGTACCTCCTTGAAA
TCGAAATCCACTCCTCCACCAGCTGCTGCTGCTGCTCGTAACGGTGCATTGCCACTGCTGG
CCTCCATCTGCAAATGCCCCAAAAAGGCTGATGGGAGTATGCAACTAGACAGCTCCTTGG
TCTTCGGGTTTCAATTTTACATTAGATCATATGAAGTGGGTGCGGATCAAACCGTGTCAAT
ACAGACAGTACTCAATTACTTACAGGAGGCAGCCATCAATCATGTTCAGAGTGCTGGCTA
TTTTGGTGATAGTTTTGGCGCCACCCCGGAAATGACCAAGAGGAACCTCATCTGGGTTATC
ACTAAGATGCAGGTTTTGGTGGATCGCTATCCCGCTTGGGGCGATGTTGTTCAAGTTGATA
CATGGACCTGTAGTTCTGGTAAAAACAGCATGCAGCGTGATTGGTTCGTACGGGATCTCA
AAACTGGAGATATTATAACAAGAGCCTCGAGCGTGTGGGTGCTGATGAATAGACTCACCA
GAAAATTATCAAAAATTCCTGAAGCAGTTCTGGAAGAAGCAAAACTTTTTGTGATGAACA
CTGCCCCCACCGTAGATGACAACAGGAAGCTACCAAAGCTGGATGGCAGCAGTGCTGATT
ATGTCCTCTCTGGCTTAACTCCTAGATGGAGCGACTTAGATATGAACCAGCATGTCAACAA
TGTGAAGTACATAGCCTGGATCCTTGAGAGTGTCCCTCAGAGCATACCGGAGACACACAA
GCTGTCAGCGATAACCGTGGAGTACAGGAGAGAATGTGGCAAGAACAGCGTCCTCCAGTC
TCTGACCAACGTCTCCGGGGATGGAATCACATGTGGAAACAGTATTATCGAGTGCCACCA
TTTGCTTCAACTTGAGACTGGCCCAGAGATTCTACTAGCGCGGACGGAGTGGATATCCAA
GGAACCTGGGTTCAGGGGAGCTCCAATCCAGGCAGAGAAAGTCTACAACAACAAATAA SEQ ID NO: 12
Cuphea hyssopifolia (Chs) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
AT GGTGGCC ACC AACGCCGCCGCCTT CT CCGCCT AC ACCTT CTT CCT G ACCT CCCCC ACCC
ACGGCTACTCCTCCAAGCGCCTGGCCGACACCCAGAACGGCTACCCCGGCACCTCCCTGA
AGTCCAAGTCCACCCCCCCCCCCGCCGCCGCCGCCGCCCGCAACGGCGCCCTGCCCCTGCT
GGCCTCCATCTGCAAGTGCCCCAAGAAGGCCGACGGCTCCATGCAGCTGGACTCCTCCCT
GGTGTTCGGCTTCCAGTTCTACATCCGCTCCTACGAGGTGGGCGCCGACCAGACCGTGTCC
ATCCAGACCGTGCTGAACTACCTGCAGGAGGCCGCCATCAACCACGTGCAGTCCGCCGGC
TACTTCGGCGACTCCTTCGGCGCCACCCCCGAGATGACCAAGCGCAACCTGATCTGGGTG
ATCACCAAGATGCAGGTGCTGGTGGACCGCTACCCCGCCTGGGGCGACGTGGTGCAGGTG
GACACCTGGACCTGCTCCTCCGGCAAGAACTCCATGCAGCGCGACTGGTTCGTGCGCGAC
CTGAAGACCGGCGACATCATCACCCGCGCCTCCTCCGTGTGGGTGCTGATGAACCGCCTG
ACCCGCAAGCTGTCCAAGATCCCCGAGGCCGTGCTGGAGGAGGCCAAGCTGTTCGTGATG
AACACCGCCCCCACCGTGGACGACAACCGCAAGCTGCCCAAGCTGGACGGCTCCTCCGCC
GACTACGTGCTGTCCGGCCTGACCCCCCGCTGGTCCGACCTGGACATGAACCAGCACGTG
AACAACGTGAAGTACATCGCCTGGATCCTGGAGTCCGTGCCCCAGTCCATCCCCGAGACC
CACAAGCTGTCCGCCATCACCGTGGAGTACCGCCGCGAGTGCGGCAAGAACTCCGTGCTG
CAGTCCCTGACCAACGTGTCCGGCGACGGCATCACCTGCGGCAACTCCATCATCGAGTGC
CACCACCTGCTGCAGCTGGAGACCGGCCCCGAGATCCTGCTGGCCCGCACCGAGTGGATC
TCCAAGGAGCCCGGCTTCCGCGGCGCCCCCATCCAGGCCGAGAAGGTGTACAACAACAAG
TGA SEQ ID NO: 13
Cuphea hyssopifolia (Chs) FATB2 amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKINGSSVG
LKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDP
FGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTLEM
YKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASS
VWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDL
DVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGSGSQFQ HLLRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS* SEQ ID NO: 14
Cuphea hyssopifolia (Chs) FATB2 coding DNA sequence
ATGGTGGCTACCGCTGCAAGTTCAGCATTCTTCCCTGTGCCGTCCCCCGACGCCTCCTCTA
GACCTGGAAAGCTCGGCAATGGGTCATCGAGCTTGAGCCCCCTCAAGCCCAAATTGATGG
CCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCTT
CGGTCGGTCTAAAGTCCGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCGCCTCCTCC
CCGGACTTTTATTAACCAGCTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGTCT
TCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCCAAGAGGCCTGACATGC
TTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAAGATGGGCTTGTGTTCAGGCAGAATTT
TTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATGAA
CCACTTGCAGGAAACAGCTCTCAATCATGTTAAGAGTGCTGGGCTTCTTAATGACGGCTTT
GGTCGTACTCTTGAGATGTATAAAAGGGACCTTATTTGGGTTGTTGCAAAAATGCAGGTCA
TGGTTAACCGCTATCCTACTTGGGGCGACACGGTTGAAGTGAATACTTGGGTTGCCAAGTC
AGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTCT
TACTAGAGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAAAAT
TCCAGATGAGGTTCGACATGAGATAGAGCCTCATTTCGTGGACTCTGCTCCCGTCATTGAA
GATGATGACCGGAAACTTCCCAAGCTGGATGAGAAGACTGCTGACTCCATCCGCAAGGGT
CTAACTCCGAAGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACATT
GGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTA
CCCTGGAATATAGGCGGGAATGCGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGTGG
ACCCCTCTGGAAAGGGCTCTGGGTCTCAGTTCCAGCACCTTCTGCGGCTTGAGGATGGAG
GTGAGATTGTGAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGAATCAATGGGCCA
ATAGCATCCGGGGAGACCTCACCTGGAGACTCTTCTTAG SEQ ID NO: 15
Cuphea hyssopifolia (Chs) FATB2 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACGCCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGCTGATGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCTGGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTCCGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCCCCATCGCCTCCGGCGAGACCTCCCCCGGCGACTCCTCCTGA SEQ ID NO: 16
Cuphea hyssopifolia (Chs) FATB2b +a.a.248-259 variant amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKINGSSVG
LKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDP
FGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTLEM
YKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASS
KSQIMLPLHYCSVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADS
IRKGLTPKWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTA VDPSGKGSGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS* SEQ ID NO: 17
Cuphea hyssopifolia (Chs) FATB2b+a.a.248-259 variant coding DNA sequence
ATGGTGGCTACCGCTGCAAGTTCAGCATTCTTCCCTGTGCCGTCCCCCGACGCCTCCTCTA
GACCTGGAAAGCTCGGCAATGGGTCATCGAGCTTGAGCCCCCTCAAGCCCAAATTGATGG
CCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCTT
CGGTCGGTCTAAAGTCCGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCGCCTCCTCC
CCGGACTTTTATTAACCAGCTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGTCT
TCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCCAAGAGGCCTGACATGC
TTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAAGATGGGCTTGTGTTCAGGCAGAATTT
TTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATGAA
CCACTTGCAGGAAACAGCTCTCAATCATGTTAAGAGTGCTGGGCTTCTTAATGACGGCTTT
GGTCGTACTCTTGAGATGTATAAAAGGGACCTTATTTGGGTTGTTGCAAAAATGCAGGTCA
TGGTTAACCGCTATCCTACTTGGGGCGACACGGTTGAAGTGAATACTTGGGTTGCCAAGTC
AGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTCT
TACTAGAGCATCAAGTAAAAGCCAAATTATGTTACCCTTACATTATTGCAGTGTGTGGGTC
ATGATGAATCAAAAGACAAGAAGATTGTCAAAAATTCCAGATGAGGTTCGACATGAGATA
GAGCCTCATTTCGTGGACTCTGCTCCCGTCATTGAAGATGATGACCGGAAACTTCCCAAGC
TGGATGAGAAGACTGCTGACTCCATCCGCAAGGGTCTAACTCCGAAGTGGAATGACTTGG
ATGTCAATCAGCACGTCAACAACGTGAAGTACATTGGGTGGATTCTTGAGAGTACTCCAC
CAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTACCCTGGAATATAGGCGGGAATGCG
GAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGTGGACCCCTCTGGAAAGGGCTCTGGGT
CTCAGTTCCAGCACCTTCTGCGGCTTGAGGATGGAGGTGAGATTGTGAAGGGGAGAACTG
AGTGGCGACCCAAGACTGCAGGAATCAATGGGCCAATAGCATCCGGGGAGACCTCACCTG
GAGACTCTTCTTAG SEQ ID NO: 18
Cuphea hyssopifolia (Chs) FATB2b +a.a.248-259 variant coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACGCCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGCTGATGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCTGGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCAAGTCCCAGATCATGCTGCCCCTGCACTACTGCTCCGTGTGG
GTGATGATGAACCAGAAGACCCGCCGCCTGTCCAAGATCCCCGACGAGGTGCGCCACGAG
ATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACGACCGCAAGCTGCCC
AAGCTGGACGAGAAGACCGCCGACTCCATCCGCAAGGGCCTGACCCCCAAGTGGAACGA
CCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGGAGTCCAC
CCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCTGACCCTGGAGTACCGCCGCGA
GTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCGCCGTGGACCCCTCCGGCAAGGGCTC
CGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGGACGGCGGCGAGATCGTGAAGGGCCG
CACCGAGTGGCGCCCCAAGACCGCCGGCATCAACGGCCCCATCGCCTCCGGCGAGACCTC
CCCCGGCGACTCCTCCTGA SEQ ID NO: 19
Cuphea hyssopifolia (Chs) FATB3 amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQVKANASARPKANGSA
V SLKSGSLDTQEDTS S S S SPPRTFINQLPD W SMLLSAITTVFVAAEKQWTMLDRKSKRPDMLM
DPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEM
CKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIRATSMC
AMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRWNDLDV
NQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGDRSLYQH LLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS* SEQ ID NO: 20
Cuphea hyssopifolia (Chs) FATB3 coding DNA sequence
ATGGTGGCTGCCGAAGCAAGTTCTGCACTCTTCTCCGTTCGAACCCCGGGAACCTCCCCTA
AACCCGGGAAGTTCGGGAATTGGCCAACGAGCTTGAGCGTCCCCTTCAAGTCCAAATCAA
ACCACAATGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCCCGTCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGCAGCCTCGACACTCAGGAGGACACTTCATCGTCGTCCTC
TCCTCCTCGGACTTTCATTAACCAGTTGCCCGACTGGAGTATGCTGCTGTCCGCGATCACG
ACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGCTTGATCGGAAATCTAAGAGGCCC
GACATGCTCATGGACCCGTTTGGGGTTGACAGGGTTGTTCAGGATGGGGCTGTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATCGGTCTTCTCAATG
ACGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACAAAAAT
GCACGTCGAGGTTAATCGCTATCCTACTTGGGGTGATACTATCGAGGTCAATACTTGGGTC
TCCGAGTCGGGGAAAACCGGTATGGGTCGTGATTGGCTGATAAGTGATTGTCATACAGGA
GAAATTCTAATAAGAGCAACGAGCATGTGTGCTATGATGAATCAAAAGACGAGAAGATTC
TCAAAATTTCCATATGAGGTTCGACAGGAGTTGGCGCCTCATTTTGTGGACTCTGCTCCTG
TCATTGAAGACTATCAAAAATTGCACAAGCTTGATGTGAAGACGGGTGATTCCATTTGCA
ATGGCCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
ACATTGGGTGGATTCTCGAGAGTGTTCCAACGGAAGTTTTCGAGACCCAGGAGCTATGTG
GCCTCACCCTTGAGTATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCATCAAAAGAGGGAGACAGATCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGGCTGATATCGCGAAGGGCAGAACCAAGTGGCGGCCGAAGAATGCAGGAACCAAT
GGGGCAATATCAACAGGAAAGACTTCAAATGGAAACTCGATCTCTTAG SEQ ID NO: 21
Cuphea hyssopifolia (Chs) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
AT GGTGGCCGCCGAGGCCT CCTCCGCCCT GTT CT CCGTGCGC ACCCCCGGC ACCT CCCCC A
AGCCCGGCAAGTTCGGCAACTGGCCCACCTCCCTGTCCGTGCCCTTCAAGTCCAAGTCCAA
CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCGCCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGGACACCCAGGAGGACACCTCCTCCTCCTCCTCC
CCCCCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGATGGACCCCTTCGGCGTGGACCGCGTGGTGCAGGACGGCGCCGTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
GCACGTGGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGT
GTCCGAGTCCGGCAAGACCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCACACCGG
CGAGATCCTGATCCGCGCCACCTCCATGTGCGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGTTCCCCTACGAGGTGCGCCAGGAGCTGGCCCCCCACTTCGTGGACTCCGCCCCC
GTGATCGAGGACTACCAGAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCTGC
AACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCCAAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGA
GGACGGCGCCGACATCGCCAAGGGCCGCACCAAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCATCTCCTGA SEQ ID NO: 22
Cuphea hyssopifolia (Chs) FATB3b (V204I,C239F, E243D, M251V variant) amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQVKANASARPKANGSA
VSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLM
DPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEM
CKRDLIWWTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIRATSVC
AMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRWNDLDV
NQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGDRSLYQH LLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS* SEQ ID NO: 23
Cuphea hyssopifolia (Chs) FATB3b (V204I,C239F, E243D, M251V variant) coding DNA sequence
ATGGTGGCTGCCGAAGCAAGTTCTGCACTCTTCTCCGTTCGAACCCCGGGAACCTCCCCTA
AACCCGGGAAGTTCGGGAATTGGCCAACGAGCTTGAGCGTCCCCTTCAAGTCCAAATCAA
ACCACAATGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCCCGTCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGCAGCCTCGACACTCAGGAGGACACTTCATCGTCGTCCTC
TCCTCCTCGGACTTTCATTAACCAGTTGCCCGACTGGAGTATGCTGCTGTCCGCGATCACG
ACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGCTTGATCGGAAATCTAAGAGGCCC
GACATGCTCATGGACCCGTTTGGGGTTGACAGGGTTGTTCAGGATGGGGCTGTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATCGGTCTTCTCAATG
ACGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACAAAAAT
GCACATCGAGGTTAATCGCTATCCTACTTGGGGTGATACTATCGAGGTCAATACTTGGGTC
TCCGAGTCGGGGAAAACCGGTATGGGTCGTGATTGGCTGATAAGTGATTTTCATACAGGA
GACATTCTAATAAGAGCAACGAGCGTGTGTGCTATGATGAATCAAAAGACGAGAAGATTC
TCAAAATTTCCATATGAGGTTCGACAGGAGTTAGCGCCTCATTTTGTGGACTCTGCTCCAG
TCATTGAAGACTATCAAAAATTGCACAAGCTTGATGTGAAGACGGGTGATTCCATTTGCA
ATGGCCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
AC ATT GGGT GGATTCTCGAGAGT GTT CC AACGGAAGTTTTCGAGACCC AGGAGCTAT GTG
GCCTCACCCTTGAGTATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCCTCAAAAGAGGGAGACAGATCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGGCTGATATCGCGAAGGGCAGAACCAAGTGGCGGCCGAAGAATGCAGGAACCAAT
GGGGCAATATCAACAGGAAAGACTTCAAATGGAAACTCGATCTCTTAG SEQ ID NO: 24
Cuphea hyssopifolia (Chs) FATB3b (V204I,C239F, E243D, M251V variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGAGGCCTCCTCCGCCCTGTTCTCCGTGCGCACCCCCGGCACCTCCCCCA AGCCCGGCAAGTTCGGCAACTGGCCCACCTCCCTGTCCGTGCCCTTCAAGTCCAAGTCCAA CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCGCCCCAAGGCCAACGGCTC CGCCGTGTCCCTGAAGTCCGGCTCCCTGGACACCCAGGAGGACACCTCCTCCTCCTCCTCC CCCCCCCGC ACCTT CAT CAACCAGCTGCCCGACT GGTCCATGCTGCT GTCCGCC ATC ACC A CCGTGTTCGT GGCCGCCGAGAAGCAGTGGACC AT GCT GGACCGC AAGTCC AAGCGCCCCG
ACATGCTGATGGACCCCTTCGGCGTGGACCGCGTGGTGCAGGACGGCGCCGTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
GCACATCGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGT
GTCCGAGTCCGGCAAGACCGGCATGGGCCGCGACTGGCTGATCTCCGACTTCCACACCGG
CGACATCCTGATCCGCGCCACCTCCGTGTGCGCCATGATGAACCAGAAGACCCGCCGCTT
CTCC AAGTTCCCCT ACGAGGT GCGCC AGGAGCT GGCCCCCCACTTCGT GGACTCCGCCCCC
GTGATCGAGGACTACCAGAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCTGC
AACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCCAAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGA
GGACGGCGCCGACATCGCCAAGGGCCGCACCAAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCATCTCCTGA SEQ ID NO: 25
Cuphea PSR23 (Cu) FATB3 amino acid sequence
MWAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNAGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTS SSPPPRAFLNQLPDW SMLLTAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKCIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENGGRSQYK HLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSVS* SEQ ID NO: 26
Cuphea PSR23 (Cu) FATB3 coding DNA sequence
ATGGTGGTGGCTGCAGCAACTTCTGCATTCTTCCCCGTTCCAGCCCCGGGAACCTCCCCTA
AACCCGGGAAGTCCGGCAACTGGCCATCGAGCTTGAGCCCTACCTTCAAGCCCAAGTCAA
TCCCCAATGCCGGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTC
TGCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCT
CCCCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGACTGCAATCACGACCG
TCTTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATAGGAAATCTAAGAGGCCTGACA
TGCTCGTGGACTCGGTTGGGTTGAAGTGTATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTC , ATT AG ATCTT ATG A A AT A( i( iC( iCTG ATCG A AC AGCCTCT ATAG AGACGCTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACAAAAATGCAGAT
CATGGTGAATCGCTACCCAACTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCTCAG
TCGGGGAAAATCGGTATGGCTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCTATGATGAATCAAAAGACGAGAAGATTCTCAAGA
CTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATTG
AAGACAATGATCAGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCGCAAGG
GTCTAACTCCGAGGTGGAACGACTTGGATGTGAATCAGCACGTAAGCAACGTGAAGTACA
TT GGGT GGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGAC AC AGGAGCT AT GCTCTCT
CACCGTAGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCTGT
GGATCCCTCAGAAAATGGAGGCCGGTCTCAGTACAAGCACCTTCTGCGGCTTGAGGATGG
GACTGATATCGTGAAGAGCAGAACTGAGTGGCGACCGAAGAATGCAGGAACTAACGGGG
CGATATCAACATCAACAGCAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 27
Cuphea PSR23 (Cu) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
AT GGT GGT GGCCGCCGCC ACCT CCGCCTT CTT CCCCGTGCCCGCCCCCGGC ACCTCCCCC A AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCACCTTCAAGCCCAAGTCCAT CCCCAACGCCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTT CCT GAACCAGCT GCCCGACT GGT CC AT GCTGCTGACCGCC AT CACC ACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTGCATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACC ACCT GC AGGAGACCT CC AT CAACC ACT GCAAGTCCCTGGGCCTGCT GAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GAT CAT GGT GAACCGCT ACCCC ACCTGGGGCGAC ACCGTGGAGATC AAC ACCT GGTT CTC
CCAGTCCGGCAAGATCGGCATGGCCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCT GCCCT ACGAGGTGCGCC AGGAGCT GACCCCCC ACTT CGT GGACTCCCCCC ACGT G
AT CGAGGAC AACGACCAGAAGCT GC AC AAGTT CGACGTGAAGACCGGCGACTCC AT CCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCGTGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGC ACCGACATCGT GAAGT CCCGC ACCGAGT GGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCTCCACCGCCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 28
Cuphea wrightii (Cw) FATB3 amino acid sequence
MWAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPKANGSA
V SLKSGSLNTLEDPP S SPPPRTFLNQLPDW SRLRTAITT WVAAEKQFTRLDRKSKRPDMLVDW
FGSETIVQDGLVFRERFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTSEMC
TRDLIWVLTKMQIWNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVRATSAWA
MMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPGWNDLDV
NQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESWESVTSMNPSKVGDRSQYQH LLRLEDGADIMKGRTEWRPKNAGTNRAIST* SEQ ID NO: 29
Cuphea wrightii (Cw) FATB3 coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTAGAACCACGC
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCGCCCTTCAAGCCCAAGT
CAAACCCCAATGGTAGATTTCAGGTTAAGGCAAATGTCAGTCCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCTGGAGGACCCTCCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTCACTAGGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTAGTGGACTGGTTTGGGTCAGAGACTATTGTTCAGGATGGGCTCGTGTTCAGA
GAGAGATTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAGACG
CTGATGAACCACTTGCAGGACACATCTCTGAATCATTGTAAGAGTGTGGGTCTTCTCAATG
ACGGCTTTGGTCGTACCTCGGAGATGTGTACAAGAGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCGTGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTT
CTCCCAGTCGGGGAAAATCGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGG
AGAAATTCTTGTAAGAGCAACGAGCGCTTGGGCCATGATGAATCAAAAGACGAGAAGATT
CTCAAAACTTCCATGCGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCTCCTCCT
GTCATTGAAGACAATGATCGGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTT
GCAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGA
AGTACATTGGGTGGATTCTCGAGAGTATGCCTACAGAAGTTTTGGAGACCCAGGAGCTAT
GCTCTCTCACCCTTGAATATAGGCGGGAATGTGGAAGGGAAAGTGTGGTAGAGTCCGTGA
CCTCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAACACCTTCTGCGGCTTGA
GGATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGAGACCAAAGAATGCAGGAACCA
ACCGGGCGATATCAACATGA SEQ ID NO: 30
Cuphea wrightii (Cw) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCCGCACCACCC
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCCCCTTCAAGCCCAAGTC
CAACCCCAACGGCCGCTTCCAGGTGAAGGCCAACGTGTCCCCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCTGGAGGACCCCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTTCACCCGCCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCTCCGAGACCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGCGCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCTCCGAGATGTGCACCCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCGTGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGGTGCGCGCCACCTCCGCCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCTGCGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GT GCT CCCT GACCCT GGAGT ACCGCCGCGAGTGCGGCCGCGAGTCCGTGGTGGAGTCCGT
GACCTCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACCGCGCCATCTCCACCTGA SEQ ID NO: 31
Cuphea wrightii (Cw) FATB4a amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKINGSSVGL
KSGGFKTQEDSPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSNDGFGRTPEMYK
RDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGYASRFQH LLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGDFF* SEQ ID NO: 32
Cuphea wrightii (Cw) FATB4a coding DNA sequence
TTGGTGGCTACCGCTGCAAGTTCTGCATTTTTCCCCGTGCCATCCGCCGACACCTCCTCCTC
GAGACCCGGAAAGCTCGGCAGTGGACCATCGAGCTTGAGCCCCCTCAAGCCCAAATCGAT
CCCCAATGGCGGCTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTC
CTCGGTCGGTCTAAAGTCGGGCGGTTTCAAGACTCAGGAAGACTCTCCTTCGGCCCCTCCT
CCGCGGACTTTTATCAACCAGTTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTG
TCTTCTTGGCTGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCTAAGAGGCCTGACA
TGCTCGTGGACCCGTTCGGATTGGGAAGTATTGTTCAGGATGGGCTTGTGTTCAGGCAGAA
TTTTTCAATTAGGTCCTACGAAATAGGCGCCGATCGAACTGCGTCTATAGAGACGGTGATG
AACCATTTGCAGGAAACAGCTCTCAATCATGTCAAGATTGCTGGGCTTTCTAATGACGGCT
TTGGTCGTACTCCTGAGATGTATAAAAGAGACCTTATTTGGGTTGTTGCAAAAATGCAGGT
CATGGTTAACCGCTATCCTACTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAAG
TCAGGGAAAAATGGTATGCGTCGTGACTGGCTCATAAGTGATTGCAATACTGGAGAGATT
CTTACAAGAGCATCAAGCGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAAA
ATTCCAGATGAGGTTCGAAATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCGTTG
AAGATGATGATCGGAAACTTCCCAAGCTGGATGAGAACACTGCTGACTCCATCCGCAAGG
GTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACA
TCGGATGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGCTCCCT
GACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTGT
CGACCCGTCTGCAGAGGGCTATGCGTCCCGGTTTCAGCACCTTCTGCGGCTTGAGGATGGA
GGTGAGATCGTGAAGGCGAGAACTGAGTGGCGACCCAAGAATGCTGGAATCAATGGGGT
GGTACCATCCGAGGAGTCCTCACCTGGAGACTTCTTTTAG SEQ ID NO: 33
Cuphea wrightii (Cw) FATB4a coding DNA sequence codon optimized for Prototheca moriformis
AT GGT GGCC ACCGCCGCCT CCT CCGCCTT CTTCCCCGTGCCCTCCGCCGACACCTCCT CCT C
CCGCCCCGGC AAGCT GGGCTCCGGCCCCT CCT CCCT GTCCCCCCT GAAGCCC AAGT CC ATC
CCCAACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCC
TCCGTGGGCCTGAAGTCCGGCGGCTTCAAGACCCAGGAGGACTCCCCCTCCGCCCCCCCC
CCCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCG
TGTTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACA
TGCTGGTGGACCCCTTCGGCCTGGGCTCCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAA
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATG
AACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGATCGCCGGCCTGTCCAACGACGGC
TTCGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAG
GTGATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCC
AAGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAG
ATCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCC
AAGATCCCCGACGAGGTGCGCAACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTG
GTGGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAACACCGCCGACTCCATCCG
CAAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGA
AGTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGT
GCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGA
CCGCCGTGGACCCCTCCGCCGAGGGCTACGCCTCCCGCTTCCAGCACCTGCTGCGCCTGGA
GGACGGCGGCGAGATCGTGAAGGCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATCA
ACGGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGACTTCTTCTGA SEQ ID NO: 34
Cuphea wrightii (Cw) FATB4b amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGNGPSSLSPLKPKSIPNGGLQVKANASAPPKINGSSVGL
KSGSFKTQEDAPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSSDGFGRTPAMSK
RDLIWWAKMQVMVNRYPAWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPRWNDL
DVNQHVNNVKYIGWILESTPAEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGDGSKF QHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGGDFF* SEQ ID NO: 35
Cuphea wrightii (Cw) FATB4b coding DNA sequence
TTGGTGGCTACCGCTGCAAGTTCTGCATTTTTCCCCGTACCATCCGCCGACACCTCCTCATC
GAGACCCGGAAAGCTCGGCAATGGGCCATCGAGCTTGAGCCCCCTCAAGCCGAAATCGAT
CCCCAATGGCGGGTTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTC
CTCGGTCGGTCTGAAGTCGGGCAGTTTCAAGACTCAGGAAGACGCTCCTTCGGCCCCTCCT
CCTCGGACTTTTATCAACCAGTTGCCTGATTGGAGTATGCTTCTTGCTGCAATCACTACTGT
CTTCTTGGCTGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCTAAGAGGCCTGACAT
GCTTGTCGACCCGTT CGGATT GGGAAGT ATT GTT CAGGATGGGCTTGTTTT CAGGC AGAAT
TTCTCGATTAGGTCCTACGAAATAGGCGCTGATCGCACTGCGTCTATAGAGACGGTGATG
AACCATTTGCAGGAAACAGCTCTCAATCATGTTAAGATTGCTGGGCTTTCTAGTGATGGCT
TTGGTCGTACTCCTGCGATGTCTAAACGGGACCTCATTTGGGTTGTTGCGAAAATGCAGGT
CATGGTTAACCGCTACCCTGCTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAA
GTCAGGGAAAAATGGTATGCGTCGTGACTGGCTCATAAGTGATTGCAACACTGGAGAGAT
TCTTACAAGAGCATCAAGCGTGTGGGTCATGATGAATCAAAAGACAAGAAGATTGTCAAA
AATTCCAGATGAGGTTCGAAATGAGATAGAGCCTCATTTTGTGGACTCTGCGCCCGTCGTT
GAAGACGATGACCGGAAACTTCCCAAGCTGGATGAGAACACTGCTGACTCCATCCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATTGGGTGGATTCTTGAGAGTACTCCAGCAGAAGTTCTGGAGACCCAGGAATTATGTTCCC
TGACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TAGATCCGTCTGGAGAGGGCGATGGGTCCAAGTTCCAGCACCTTCTGCGGCTTGAGGATG
GAGGTGAGATCGTGAAGGCGAGAACTGAGTGGCGACCAAAGAATGCTGGAATCAATGGG
GTGGTACCATCCGAGGAGTCCTCACCTGGTGGAGACTTCTTTTAA SEQ ID NO: 36
Cuphea wrightii (Cw) FATB4b coding DNA sequence codon optimized for Prototheca moriformis
AT GGT GGCC ACCGCCGCCT CCT CCGCCTT CTTCCCCGTGCCCTCCGCCG AC ACCTCCT CCT C
CCGCCCCGGCAAGCTGGGCAACGGCCCCTCCTCCCTGTCCCCCCTGAAGCCCAAGTCCATC
CCCAACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCC
TCCGTGGGCCTGAAGTCCGGCTCCTTCAAGACCCAGGAGGACGCCCCCTCCGCCCCCCCCC
CCCGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGT
GTTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACAT
GCTGGTGGACCCCTTCGGCCTGGGCTCCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAA
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATG
AACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGATCGCCGGCCTGTCCTCCGACGGC
TTCGGCCGCACCCCCGCCATGTCCAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAG
GTGATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCC
AAGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAG
ATCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCC
AAGATCCCCGACGAGGTGCGCAACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTG
GTGGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAACACCGCCGACTCCATCCG
CAAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGA
AGTACATCGGCTGGATCCTGGAGTCCACCCCCGCCGAGGTGCTGGAGACCCAGGAGCTGT
GCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGA
CCGCCGTGGACCCCTCCGGCGAGGGCGACGGCTCCAAGTTCCAGCACCTGCTGCGCCTGG
AGGACGGCGGCGAGATCGTGAAGGCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATC
AACGGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGGCGACTTCTTCTGA SEQ ID NO: 37
Cuphea wrightii (Cw) FATB5 amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANGSAVNL
KSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFFRQSFSI
RSYEIGVDRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWWTKIQVEVNRYP
TWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFPYEVRQE
IAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWILKSVPIEV
FETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTEWRPK NAGANGAMSSGKTSNGNCLIEGRGWQPFRWRLIF* SEQ ID NO: 38
Cuphea wrightii (Cw) FATB5 coding DNA sequence
ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCTGTTCCAACCCCGGGAACGCCCCCTA
AACCCGGGAAGTTCGGTAACTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGACAATG
GTGGCTTTCATGTCAAGGCAAACGCCAGTGCCCATCCTAAGGCTAATGGTTCTGCGGTAA
ATCTAAAGTCTGGCAGCCTCGAGACTCCTCCTCGGAGTTTCATTAACCAGCTGCCGGACTT
GAGTGTGCTTCTGTCCAAAATCACGACTGTCTTCGGGGCGGCTGAGAAGCAGTGGAAGAG
GCCCGGCATGCTCGTGGAACCGTTTGGGGTTGACAGGATTTTTCAGGATGGTGTTTTTTTC
AGACAGAGTTTTTCTATCAGGTCTTACGAAATAGGCGTTGATCGAACAGCCTCGATAGAG
ACACTGATGAACATCTTCCAGGAAACATCTTTGAATCATTGCAAGAGTATCGGTCTTCTCA
ACGATGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAA
AATTCAGGTCGAGGTGAATCGCTATCCTACTTGGGGTGACACTATCGAAGTCAATACTTGG
GTCTCGGAGTCGGGGAAAAACGGTATGGGTCGGGATTGGCTGATAAGTGATTGCCGTACT
GGAGAGATTCTTATAAGAGCAACGAGCGTGTGGGCGATGATGAATCAAAACACGAGAAG
ATTGTCAAAATTTCCATATGAGGTTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCT
CCTGTCATTGAAGACGATCAAAAGTTGCAGAAGCTTGATGTGAAGACAGGTGATTCCATT
CGCGATGGTCTAACTCCGAGATGGAATGACTTGGATGTCAATCAACACGTTAACAATGTG
AAGTACATTGGATGGATTCTCAAGAGTGTTCCAATAGAAGTTTTCGAGACACAGGAGCTA
TGCGGCGTCACACTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCAGTG
ACCGCTATGGATCCAGCAAAAGAGGGAGACCGGTGTGTGTACCAGCACCTTCTTCGGCTT
GAGGATGGAGCTGATATCACTATAGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGC
CAATGGTGCAATGTCATCAGGAAAGACTTCAAATGGAAACTGTCTCATAGAAGGAAGGGG
TTGGCAACCTTTCCGAGTTGTGCGTTTAATTTTCTGA SEQ ID NO: 39
Cuphea wrightii (Cw) FATB5 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCCCCGGCACCCCCCCCA
AGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGACAACGG
CGGCTTCCACGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCCGTGAA
CCTGAAGTCCGGCTCCCTGGAGACCCCCCCCCGCTCCTTCATCAACCAGCTGCCCGACCTG
TCCGTGCTGCTGTCCAAGATCACCACCGTGTTCGGCGCCGCCGAGAAGCAGTGGAAGCGC
CCCGGCATGCTGGTGGAGCCCTTCGGCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCC
GCCAGTCCTTCTCCATCCGCTCCTACGAGATCGGCGTGGACCGCACCGCCTCCATCGAGAC
CCTGATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAAC
GACGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAG
ATCCAGGTGGAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGAACACCTGG
GTGTCCGAGTCCGGCAAGAACGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACC
GGCGAGATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAACACCCGCCGC
CTGTCCAAGTTCCCCTACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCC
CCGTGATCGAGGACGACCAGAAGCTGCAGAAGCTGGACGTGAAGACCGGCGACTCCATC
CGCGACGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTG
AAGTACATCGGCTGGATCCTGAAGTCCGTGCCCATCGAGGTGTTCGAGACCCAGGAGCTG
TGCGGCGTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTG
ACCGCCATGGACCCCGCCAAGGAGGGCGACCGCTGCGTGTACCAGCACCTGCTGCGCCTG
GAGGACGGCGCCGACATCACCATCGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGC
CAACGGCGCCATGTCCTCCGGCAAGACCTCCAACGGCAACTGCCTGATCGAGGGCCGCGG
CTGGCAGCCCTTCCGCGTGGTGCGCCTGATCTTCTGA SEQ ID NO: 40
Cuphea heterophylla (Cht) FATBla amino acid sequence
MVAAAASSAFFSVPTPGTSTKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPKANGSA
VNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEPFGVDRI
FQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWV
VTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNRKT
RRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQHVNNV
KYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHLLRLEDG ADITIGRTEWRPKNAGANGAISTGKTSNENSVS* SEQ ID NO: 41
Cuphea heterophylla (Cht) FATBla coding DNA sequence
ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCCGTTCCAACCCCGGGAACCTCCACTA
AACCCGGGAACTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGAATCAA
ACCACAATGGTGGCTTTCGGGTCAAGGCAAACGCCAGTGCTCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCGAGACTCAGGAGGACACTTCATCGTCGTCCC
CTCCTCCTCGGACTTTTATTAAGCAGTTGCCCGACTGGGGTATGCTTCTGTCCAAAATCAC
GACTGTCTTCGGGGCGGCTGAGAGGCAGTGGAAGAGGCCCGGCATGCTTGTGGAACCGTT
TACGAAATAGGCGCTGATCGAACAGCCTCAATAGAGACGCTGATGAACATCTTCCAGGAA
ACATCTCTGAATCATTGTAAGAGTATCGGTCTTCTCAATGACGGCTTTGGTCGTACTCCTG
AGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAATTCAGGTCGAGGTGAATCGCT
ATCCTACTTGGGGTGATACTATTGAGGTCAATACTTGGGTCTCAGAGTCGGGGAAAAACG
GTATGGGTCGTGATTGGCTGATAAGCGATTGCCGTACCGGAGAAATTCTTATAAGAGCAA
CGAGCGTGTGGGCTATGATGAATCGAAAGACGAGAAGATTGTCAAAATTTCCATATGAGG
TTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCTCCTGTCATTGAAGACGATAAAAA
ATTGCACAAGCTTGATGTTAAGACGGGTGATTCCATTCGCAAGGGTCTAACTCCAAGGTG
GAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGTACATTGGGTGGATTCTCAA
GAGTGTTCCAGCAGAAGTTTTCGAGACCCAGGAGCTATGCGGAGTCACCCTTGAGTACAG
GCGGGAATGTGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCTATGGATACCGCAAAAG
AGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGGATGGGGCTGATATCACCAT
AGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGCCAATGGGGCAATATCAACAGGAA
AGACTTCAAATGAAAACTCTGTCTCTTAG SEQ ID NO: 42
Cuphea heterophylla (Cht) FATBla coding DNA sequence codon optimized for Prototheca moriformis
AT GGT GGCCGCCGCCGCCT CCT CCGCCTT CTT CT CCGTGCCC ACCCCCGGC ACCT CC ACC A
AGCCCGGCAACTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGAGTCCAA
CCACAACGGCGGCTTCCGCGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCTCCTCCCCC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGGGCATGCTGCTGTCCAAGATCACCA
CCGTGTTCGGCGCCGCCGAGCGCCAGTGGAAGCGCCCCGGCATGCTGGTGGAGCCCTTCG
GCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCCGCCAGTCCTTCTCCATCCGCTCCTA
CGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAGGAGAC
CTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGACGGCTTCGGCCGCACCCCCGA
GATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATCCAGGTGGAGGTGAACCGCTA
CCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGTGTCCGAGTCCGGCAAGAACGG
CATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACCGGCGAGATCCTGATCCGCGCCAC
CTCCGTGTGGGCCATGATGAACCGCAAGACCCGCCGCCTGTCCAAGTTCCCCTACGAGGT
GCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACAAGAA
GCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCCGCAAGGGCCTGACCCCCCGCTG
GAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGAA
GTCCGTGCCCGCCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCGTGACCCTGGAGTACCG
CCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACACCGCCAAGGA
GGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGGACGGCGCCGACATCACCAT
CGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAA
GACCTCCAACGAGAACTCCGTGTCCTGA SEQ ID NO: 43
Cuphea heterophylla (Cht) FATBlb (P16S, T20P, G94S, G105W, S293F, L305F variant) amino acid sequence
MVAAAAS S AFF S VPTSGT SPKPGNF GNWP S SLS VPFKPES SHNGGF QVKANAS AHPKANGS AV
NLKSGSLETQEDT S S S SPPPRTFIKQLPD W SMLLSKITTVF WAAERQ WKRPGMLVEPF GVDRIF
QDGVFFRQ SF SIRS YEIGADRT ASIETLMNIF QETSLNHCKSIGLLNDGF GRTPEMCKRDLIWW
TKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNRKTRR
LSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDFIRKGLTPRWNDFDVNQHVNNVKYI
GWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHLLRLEDGADI TIGRTEWRPKNAGANGAISTGKTSNENSVS* SEQ ID NO: 44
Cuphea heterophylla (Cht) FATBlb(P16S, T20P, G94S, G105W, S293F, L305F variant) coding DNA sequence
ATGGTGGCTGCCGCAGCAAGTTCTGCATTCTTCTCCGTTCCAACCTCGGGAACCTCCCCTA
AACCCGGGAACTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCCTTCAAGCCCGAATCAA
GCCACAATGGTGGCTTTCAGGTCAAGGCAAACGCCAGTGCCCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCGAGACTCAGGAGGACACTTCATCGTCGTCCC
CTCCTCCTCGGACTTTTATTAAGCAGTTGCCCGACTGGAGTATGCTTCTGTCCAAAATCAC
GACTGTCTTCTGGGCGGCTGAGAGGCAGTGGAAGAGGCCCGGCATGCTTGTGGAACCGTT
TGGGGTTGACAGGATTTTTCAGGATGGGGTTTTTTTCAGACAGAGTTTTTCGATCAGGTCT
TACGAAATAGGCGCTGATCGAACAGCCTCAATAGAGACGCTGATGAACATCTTCCAGGAA
ACATCTCTGAATCATTGTAAGAGTATCGGTCTTCTCAATGACGGCTTTGGTCGTACTCCTG
AGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAATTCAGGTCGAGGTGAATCGCT
ATCCTACTTGGGGTGATACTATTGAGGTCAATACTTGGGTCTCAGAGTCGGGGAAAAACG
GTATGGGTCGTGATTGGCTGATAAGCGATTGCCGTACCGGAGAAATTCTTATAAGAGCAA
CGAGCGTGTGGGCTATGATGAATCGAAAGACGAGAAGATTGTCAAAATTTCCATATGAGG
TTCGACAGGAGATAGCGCCTCATTTTGTGGACTCTGCTCCTGTCATTGAAGACGATAAAAA
ATTGCACAAGCTTGATGTTAAGACGGGTGATTTCATTCGCAAGGGTCTAACTCCAAGGTG
GAATGACTTTGATGTCAATCAGCACGTTAACAATGTGAAGTACATTGGGTGGATTCTCAA
GAGTGTTCCAGCAGAAGTTTTCGAGACCCAGGAGCTATGCGGAGTCACCCTTGAGTATAG
GCGGGAATGTGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCTATGGATACCGCAAAAG
AGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGGATGGGGCTGATATCACCAT
AGGCAGAACCGAGTGGCGGCCGAAGAATGCAGGAGCCAATGGGGCAATATCAACAGGAA
AGACTTCAAATGAAAACTCTGTCTCTTAG SEQ ID NO: 45
Cuphea heterophylla (Cht) FATBlb (P16S, T20P, G94S, G105W, S293F, L305F variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCTCCGGCACCTCCCCCA
AGCCCGGCAACTTCGGCAACTGGCCCTCCTCCCTGTCCGTGCCCTTCAAGCCCGAGTCCTC
CCACAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCTCCTCCCCC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGTCCATGCTGCTGTCCAAGATCACCA
CCGTGTTCTGGGCCGCCGAGCGCCAGTGGAAGCGCCCCGGCATGCTGGTGGAGCCCTTCG
GCGTGGACCGCATCTTCCAGGACGGCGTGTTCTTCCGCCAGTCCTTCTCCATCCGCTCCTA
CGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAGGAGAC
CTCCCTGAACCACTGCAAGTCCATCGGCCTGCTGAACGACGGCTTCGGCCGCACCCCCGA
GATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATCCAGGTGGAGGTGAACCGCTA
CCCCACCTGGGGCGACACCATCGAGGTGAACACCTGGGTGTCCGAGTCCGGCAAGAACGG
CATGGGCCGCGACTGGCTGATCTCCGACTGCCGCACCGGCGAGATCCTGATCCGCGCCAC
CTCCGTGTGGGCCATGATGAACCGCAAGACCCGCCGCCTGTCCAAGTTCCCCTACGAGGT
GCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGCCCCCGTGATCGAGGACGACAAGAA
GCTGCACAAGCTGGACGTGAAGACCGGCGACTTCATCCGCAAGGGCCTGACCCCCCGCTG
GAACGACTTCGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGGCTGGATCCTGAA
GTCCGTGCCCGCCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCGTGACCCTGGAGTACCG
CCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACACCGCCAAGGA
GGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGGACGGCGCCGACATCACCAT
CGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAA
GACCTCCAACGAGAACTCCGTGTCCTGA SEQ ID NO: 46
Cuphea heterophylla (Cht) FATB2b amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQLTMLDRKSKKPDMHVD
WFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPEM
CKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 47
Cuphea heterophylla (Cht) FATB2b coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGGCACTTCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCCT
GACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTGT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTGC
AAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAG
TACATTGGGT GGATTCTCGAGAGTATGCC AAAAGAAGTTTT GGAC ACCC AGGAGCT AT GC
TCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGATAGTGTGCTGGAGTCTGTGACC
GCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCTATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 48
Cuphea heterophylla (Cht) FATB2b coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGGCACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 49
Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) amino acid sequence
MWAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHPKANGS
AVSLKSGSLNTKEDTPSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQLTMLDRKSKKPDMHV
DWFGLEIIVQDWLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPE
MCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIW
AMMNQKTRRFSKLPNEVRQEIAPHFVDAPPLIEDNDRKLHKFDVKTGDSICKGLTPEWNDLD
VNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQY QHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 50
Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTGGAACCACGT
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGTC
AAACCCCAATGGTGGATTTCAGGTTAAGGCAAATGCCAGCGCTCATCCTAAGGCTAACGG
GTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTAAGGAGGACACTCCGTCGTCCCC
TCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCGGACTGCAATCACG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCCT
GACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATTGGCTCGTGTTCAGAG
AGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCTCCTCATTTTGTGGACGCCCCTCCTCT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTGC
AAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAG
TACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATGC
TCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGACC
GCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 51
Cuphea heterophylla (Cht) FATB2a (S17P, P21S, T28N, L30P, S33L, G76D, S78P, G137W variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCACCT
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTC
CAACCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCAAGGAGGACACCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACTGGCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GAT GAACCACCT GCAGGACACCT CCCT GAACC ACTGC AAGT CCGT GGGCCTGCT GAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CCTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGC AAGGGCCT GACCCCCGAGT GGAACGACCT GGACGT GAACC AGCACGT GT CC AACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GT GCT CCCT GACCCT GGAGT ACCGCCGCGAGT GCGGCCGCGACT CCGT GCT GGAGT CCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 52
Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTKEDTPSSPPPRTFLNQLPDWNRLRTAITTVFVAAEKQLTMLDRKSKKPDMHVD
WFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPEM
CKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEWQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 53
Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTAAGGAGGACACTCCGTCGTCCC
CTCCTCCTCGGACTTTCCTTAACCAGTTGCCTGATTGGAATAGGCTTCGGACTGCAATCAC
GACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAGTCTAAGAAGCC
TGACATGCACGTGGACTGGTTTGGGTTGGAGATTATTGTTCAGGATGGGCTCGTGTTCAGA
GAGAGTTTTTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACG
TTGATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATG
ACGGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAA
TGCAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTT
CTCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGG
AGAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATT
CTCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCT
GTCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTT
GCAAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGA
AGTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTAT
GCTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGA
CCGCTATGGATCCCTCAAAAGTTGGGGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGA
AGATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCA
ACGGGGCT AT AT CAACAGGAAAGACTT CAAATGGAAACTCGGTCTCTT AG SEQ ID NO: 54
Cuphea heterophylla (Cht) FATB2c (G76D, S78P variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCAAGGAGGACACCCCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGAACCGCCTGCGCACCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGAAGCCCG
ACATGCACGTGGACTGGTTCGGCCTGGAGATCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 55
Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) amino acid sequence
MWAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHPKANGS
AVSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLV
DLFGLESIVQDGLVFRESYSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFGRTPE
MCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIW
AMMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWNDLD
VNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESVLESVTAMNPSKVGDRSQYQ HLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 56
Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCACCTGGAACCACGT
CTAAACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGTC
AAACCCCAATGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACGG
TTCTGCGGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCT
CCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCGA
CCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCTG
ACATGCTCGTGGACTTGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAGA
GAGTTATTCGATCAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATAGAAACGTT
GATGAACCATTTGCAGGACACATCTTTGAACCATTGTAAGAGTGTGGGTCTTCTCAATGAC
GGCTTTGGTCGTACCCCGGAGATGTGTAAAAGGGACCTCATTTGGGTGCTTACAAAAATG
CAGATCATGGTGAATCGCTATCCAACTTGGGGCGATACTGTCGAGATCAATAGCTGGTTCT
CCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGAG
AAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAATACGAGAAGATTCT
CAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTTGACGCTCCTCCTGT
CATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCG
CAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGAAAGTGTGCTGGAGTCCGTGAC
CGCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAGCACCTTCTACGGCTTGAG
GATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 57
Cuphea heterophylla (Cht) FATB2d (S21P, T28N, L30P, S33L, G76D, R97L, H124L, W127L, I132S, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence codon optimized for Prototheca moriformis
AT GGTGGT GGCCGCCGCCGCCT CCTCCGCCTTCTT CCCCGT GCCCGCCCCCGGC ACC ACCT
CCAAGCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTC
CAACCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACCTGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACCACCTGCAGGACACCTCCCTGAACCACTGCAAGTCCGTGGGCCTGCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAACACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGT GCGCC AGGAGATCGCCCCCC ACTTCGT GGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CCGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCGT
GACCGCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 58
Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEDTSSSPPPQTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFGRTPEM
CKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWNDLDV
NQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRDSVLESVTAMNPSKVGDRSQYQ HLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 59
Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCC
TCCTCCTCAGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACAGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAAAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAGCCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAATACGAGAAGATTC
TCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTTGACGCTCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCG
CAAGGGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGAC
CGCTATGAATCCCTCAAAAGTTGGAGACCGGTCTCAGTACCAGCACCTTCTACGGCTTGAG
GATGGGGCTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 60
Cuphea heterophylla (Cht) FATB2e (G76D, R97L, H124L, I132S, G152S, H165L, T211N, K258N, C303R, E309G, K334T, T386A variant) coding DNA sequence codon optimized for Prototheca moriformis
AT GGTGGT GGCCGCCGCCGCCT CCTCCGCCTTCTT CCCCGT GCCCGCCTCCGGC ACCT CCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCAGACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCGCCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAACACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CCGCAAGGGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGAACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCGCCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 61
Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T211N variant) amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFGRTPEM
CKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWA
MMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDV
NQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQ HLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 62
Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T21 IN variant) coding DNA sequence
ATGGTGGTGGCTGCTGCAGCAAGCTCTGCATTCTTCCCTGTTCCGGCATCTGGAACCTCCC
CTAAACCCGGGAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAAGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCCGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGGCACTTCGTCGTCCCC
TCCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGAGTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAGCCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTC
TCAAAACTTCCAAATGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTG
CAAGGGTCTAACACCGGAGTGGAACGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGAC
CGCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 63
Cuphea heterophylla (Cht) FATB2f (R97L, H124L, I132S, G152S, H165L, T211N variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCTCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGGCACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGTCCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCGCCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 64
Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T21 IN, G414A variant) amino acid sequence
MWAATASSAFFPVPVPGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPKANGSA
VSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPDMLVD
WFGLESrVQDGLVFREIYSIRSYEISADRTTSIETVMNLLQETSLNHCKSMGILNDGFGRTPEMC
KRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSIWAM
MNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDLDVN
QHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQYQH LLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNANSVS* SEQ ID NO: 65
Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T21 IN, G414A variant) coding DNA sequence
ATGGTGGTGGCTGCTACAGCAAGTTCTGCATTCTTCCCTGTTCCTGTACCTGGAACCTCCC
CTAAACCCGGAAAGTTCGGGACTTGGCTATCGAGCTCGAGCCCTTCCTACAAGCCCAAGT
CAAACCCCAGTGGTGGATTTCAGGTTAAGGCAAATGCCAGTGCTCATCCTAAGGCTAACG
GTTCTGCAGTAAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCC
TCCTCCTCGGACATTCCTTAACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCTCG
ACCGTCTTCGTGGCGGCAGAGAAGCAGTTGACTATGCTCGATCGAAAATCTAAGAGGCCT
GACATGCTCGTGGACTGGTTTGGGTTGGAGAGTATTGTTCAGGATGGGCTCGTGTTCAGAG
AGATTTATTCGATCAGGTCTTACGAAATAAGCGCTGATCGAACAACCTCTATAGAGACGG
TGATGAACCTCTTGCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTATTCTCAATGA
CGGCTTTGGTCGTACCCCGGAGATGTGCAAAAGGGACCTCATTTGGGTGCTTACAAAAAT
GCAGATCTTGGTGAATCGCTATCCAAATTGGGGTGATACTGTCGAGATCAATAGCTGGTTC
TCCCAGTCCGGGAAAATCGGTATGGGTCGCAATTGGCTAATAAGTGATTGCAACACAGGA
GAAATTCTTATAAGAGCAACGAGCATTTGGGCCATGATGAATCAAAAGACGAGAAGATTC
TCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCATTTTGTGGACGCCCCTCCTG
TCATTGAAGACAATGATCGAAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTTG
CAAGGGTCTAACACCGGAGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAA
GTACATTGGGTGGATTCTCGAGAGTATGCCAAAAGAAGTTTTGGACACCCAGGAGCTATG
CTCTCTCACCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCTGTGAC
CGCTATGGATCCCTCAAAAGTTGGAGACCGATCTCAGTACCAGCACCTTCTGCGGCTTGAA
GATGGGACTGATATCATGAAGGGCAGAACTGAGTGGCGACCAAAGAATGCAGGAACCAA
CGGGGCGATATCAACAGGAAAGACTTCAAATGCAAACTCGGTCTCTTAG SEQ ID NO: 66
Cuphea heterophylla (Cht) FATB2g (A6T, A16V, S17P, G76D, R97L, H124L, I132S, S143I, G152S, A157T, H165L, T21 IN, G414A variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGTGGCCGCCACCGCCTCCTCCGCCTTCTTCCCCGTGCCCGTGCCCGGCACCTCCC
CCAAGCCCGGCAAGTTCGGCACCTGGCTGTCCTCCTCCTCCCCCTCCTACAAGCCCAAGTC
CAACCCCTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGG
CTCCGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCC
CCCCCCCGCACCTTCCTGAACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCTCCA
CCGTGTTCGTGGCCGCCGAGAAGCAGCTGACCATGCTGGACCGCAAGTCCAAGCGCCCCG
ACATGCTGGTGGACTGGTTCGGCCTGGAGTCCATCGTGCAGGACGGCCTGGTGTTCCGCG
AGATCTACTCCATCCGCTCCTACGAGATCTCCGCCGACCGCACCACCTCCATCGAGACCGT
GATGAACCTGCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCATCCTGAACGA
CGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGCTGACCAAGAT
GCAGATCCTGGTGAACCGCTACCCCAACTGGGGCGACACCGTGGAGATCAACTCCTGGTT
CTCCCAGTCCGGCAAGATCGGCATGGGCCGCAACTGGCTGATCTCCGACTGCAACACCGG
CGAGATCCTGATCCGCGCCACCTCCATCTGGGCCATGATGAACCAGAAGACCCGCCGCTT
CTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCACTTCGTGGACGCCCCCCC
CGTGATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCAT
CTGCAAGGGCCTGACCCCCGAGTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGT
GAAGTACATCGGCTGGATCCTGGAGTCCATGCCCAAGGAGGTGCTGGACACCCAGGAGCT
GTGCTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGT
GACCGCCATGGACCCCTCCAAGGTGGGCGACCGCTCCCAGTACCAGCACCTGCTGCGCCT
GGAGGACGGCACCGACATCATGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
CCAACGGCGCCATCTCCACCGGCAAGACCTCCAACGCCAACTCCGTGTCCTGA SEQ ID NO: 67
Cuphea heterophylla (Cht) FATB3aamino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKINGSSVSL
KSCSLKTHEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFGRTPEMY
KRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTPKWNDL
DVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGFGPQFQ HLLRLEDGGEIVKGRTEWRPKTAGINGTLASGETSPGNS* SEQ ID NO: 68
Cuphea heterophylla (Cht) FATB3a coding DNA sequence
ATGGTGGCCACCGCTGCAAGTTCTGCATTCTTCCCGGTGCCGTCCCCGGACACCTCCTCTA
GACCGGGAAAGCTCGGAAATGGGTCATCAAGCTTGAGGCCCCTCAAGCCCAAATTTGTTG
CCAATGCTGGGCTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCCT
CGGTCAGTCTAAAGTCTTGCAGTCTCAAGACTCATGAAGACACTCCTTCAGCTCCTCCTCC
GCGGACTTTTATCAACCAGTTGCCTGATTGGAGCATGCTTCTTGCTGCAATCACTACTGTC
TTCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCAAAGAGGCCTGACATG
CTTGTGGACCCGTTCGGATTGGGAAGGATTGTTCAGGATGGGCTTGTGTTCAGGCAGAATT
TTTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCATCCATAGAGACGGTGATGA
ACCACTTGCAGGAAACGGCTCTCAATCATGTTAAGAGTGCGGGGCTTCTTAATGAAGGCT
TTGGTCGTACTCCTGAGATGTATAAAAGGGACCTTATTTGGGTTGTCGCGAAAATGCAGGT
CATGGTTAACCGCTATCCTACTTGGGGTGACACGGTTGAAGTGAATACTTGGGTTGCCAAG
TCAGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATT
CTTACAAGGGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAAATTGTCAAAG
ATTCCAGATGAGGTTCGGCATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCATTG
AAGACGATGACTGGAAACTTCCCAAGCTGGATGAGAAAACTGCTGACTCCATCCGCAAGG
GT CTAACT CCGAAGT GGAATGACTTGGATGTCAATC AGCACGTC AAC AACGT GAAGT AC A
TTGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCT
TACCCTGGAATACAGGCGGGAATGCGGAAGGGAGAGTGTGCTGGAGTCCCTCACTGCTGT
GGACCCCTCTGGAAAGGGCTTTGGGCCCCAGTTTCAGCACCTTCTGAGGCTTGAGGATGG
AGGTGAGATCGTAAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGA
CGATTGCATCTGGGGAGACCTCACCTGGAAACTCTTAG SEQ ID NO: 69
Cuphea heterophylla (Cht) FATB3a coding DNA sequence codon optimized for Pro to theca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGCGCCCCCTGAAGCCCAAGTTCGTGGCC
AACGCCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGTCCCTGAAGTCCTGCTCCCTGAAGACCCACGAGGACACCCCCTCCGCCCCCCCCCCCC
GCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGTT
CCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGCT
GGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGAGGGCTT
CGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACTGGAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTTCGGCCCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCACCATCGCCTCCGGCGAGACCTCCCCCGGCAACTCCTGA SEQ ID NO: 70
Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) amino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKINGSSVSL
KSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GFGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLIEGFGRTPEMYK
RDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGFGPQFQH LLRLEDGGEIVKGRTE WRPKTAGINGTIASGET SPGN S * SEQ ID NO: 71
Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) coding DNA sequence
ATGGTGGCCACCGCTGCAAGTTCTGCATTCTTCCCGGTGCCATCCCCGGACACCTCCTCTA
GACCGGGAAAGCTCGGAAATGGGTCATCAAGCTTGAGGCCCCTCAAGCCCAAATTTGTTG
CCAATGCTGGGCTGCAGGTTAAGGCAAACGCCAGTGCCCCTCCTAAGATCAATGGTTCCT
CGGTCAGTCTAAAGTCTGGCAGTCTCAAGACTCAGGAAGACACTCCTTCGGCTCCTCCTCC
GCGGACTTTTATCAACCAGTTGCCTGATTGGAGCATGCTTCTTGCTGCAATCACTACTGTC
TTCTTGGCAGCAGAGAAGCAGTGGATGATGCTTGATTGGAAACCAAAGAGGCCTGACATG
CTTGTGGACCCGTTCGGATTTGGAAGGATTGTTCAGGATGGGCTTGTGTTCAGGCAGAATT
TTTCGATTAGGTCCTATGAAATAGGCGCTGATCGCACTGCATCTATAGAGACGGTGATGA
ACCACTTGCAGGAAACGGCTCTCAATCATGTTAAGAGTGCGGGGCTTCTTATTGAAGGCTT
TGGTCGTACTCCTGAGATGTATAAAAGGGACCTTATTTGGGTTGTCGCGAAAATGCAGGTC
AT GGTT AACCGCT ATCCTACTTGGGGT GAC ACGGTT GAAGT GAAT ACTTGGGTTGCC AAGT
CAGGGAAAAATGGTATGCGTCGTGATTGGCTCATAAGTGATTGCAATACAGGAGAAATTC
TTACTAGAGCATCAAGTGTGTGGGTCATGATGAATCAAAAGACAAGAAAATTGTCAAAGA
TTCCAGATGAGGTTCGGCATGAGATAGAGCCTCATTTTGTGGACTCTGCTCCCGTCATTGA
AGACGATGACTGGAAACTTCCCAAGCTGGATGAGAAAACTGCTGACTCCATCCGCAAGGG
TCTAACTCCGAAGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACAT
TGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTT
ACCCTGGAATACAGGCGGGAATGCGGAAGGGAGAGTGTGCTGGAGTCCCTCACTGCTGTG
GACCCCTCTGGAAAGGGCTTTGGGCCCCAGTTTCAGCACCTTCTGAGGCTTGAGGATGGA
GGTGAGATCGTAAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGAC
GATTGCATCTGGGGAGACCTCACCTGGAAACTCTTAG SEQ ID NO: 72
Cuphea heterophylla (Cht) FATB3b (C67G, H72Q, L128F, N179I variant) coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGCGCCCCCTGAAGCCCAAGTTCGTGGCC
AACGCCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GT GTCCCT GAAGT CCGGCT CCCTGAAGACCCAGGAGG AC ACCCCCT CCGCCCCCCCCCCCC
GCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGTT
CCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGCT
GGTGGACCCCTTCGGCTTCGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTTC
TCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAAC
CACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGATCGAGGGCTTC
GGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGTG
ATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAAG
TCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATC
CTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCAAG
ATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGATC
GAGGACGACGACTGGAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCAA
GGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGT
ACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCG
CCGTGGACCCCTCCGGCAAGGGCTTCGGCCCCCAGTTCCAGCACCTGCTGCGCCTGGAGG
ACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAAC
GGCACCATCGCCTCCGGCGAGACCTCCCCCGGCAACTCCTGA SEQ ID NO: 73
Cuphea viscosissima (Cvis) FATB1 amino acid sequence
MVAAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTS SSPPPRAFLNQLPDW SMLLTAITTVFVAAEKQ WTMLDRKSKRPDMLVD
SVGLKSrVRDGLVSRHSFSIRSYEIGADRTASIETLMNHLQETTINHCKSLGLHNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLRKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENGGRSQYK
HLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSVS SEQ ID NO: 74
Cuphea viscosissima (Cvis) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCACCTTCAAGCCCAAGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTCCATCGTGCGCGACGGCCTGGTGTCCCGCCACTC
CTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGATG
AACCACCTGCAGGAGACCACCATCAACCACTGCAAGTCCCTGGGCCTGCACAACGACGGC
TTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCAG
ATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTCC
C AGTCCGGC AAGAT CGGCAT GGCCTCCGACTGGCTGAT CT CCGACT GC AAC ACCGGCGAG
ATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCCC
GCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTGAT
CGAGGACAACGACCAGAAGCTGCGCAAGTTCGACGTGAAGACCGGCGACTCCATCCGCA
AGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGT
ACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACCG
CCGTGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAGG
ACGGCACCGACATCGTGAAGTCCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAAC
GGCGCCATCTCCACCTCCACCGCCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 75
Cuphea viscosissima (Cvis) FATB2 amino acid sequence
MVATAASSAFFPVPSADTSSRPGKLGNGPSSFSPLKPKSIPNGGLQVKASASAPPKINGSSVGLK
SGGLKTHDDAPSAPPPRTFINQLPDWSMLLAAITTAFLAAEKQWMMLDRKPKRLDMLEDPFG
LGRWQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKTAGLSNDGFGRTPEMYK
RDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSVW
VMMNQKTRKLSKIPDEVRREIEPHFVDSAPVIEDDDRKLPKLDEKSADSIRKGLTPRWNDLDV
NQHVNNAKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGYGSQFQHL
LRLEDGGEIVKGRTE WRPKNAGINGWPSEE S SPGDY S SEQ ID NO: 76
Cuphea viscosissima (Cvis) FATB2 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCGCCGACACCTCCTCCC
GCCCCGGCAAGCTGGGCAACGGCCCCTCCTCCTTCTCCCCCCTGAAGCCCAAGTCCATCCC
CAACGGCGGCCTGCAGGTGAAGGCCTCCGCCTCCGCCCCCCCCAAGATCAACGGCTCCTC
CGTGGGCCTGAAGTCCGGCGGCCTGAAGACCCACGACGACGCCCCCTCCGCCCCCCCCCC
CCGCACCTTCATCAACCAGCT GCCCGACT GGTCC ATGCT GCT GGCCGCC ATC ACC ACCGCC
TTCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACCGCAAGCCCAAGCGCCTGGACATG
CTGGAGGACCCCTTCGGCCTGGGCCGCGTGGTGCAGGACGGCCTGGTGTTCCGCCAGAAC
TTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGA
ACCACCTGCAGGAGACCGCCCTGAACCACGTGAAGACCGCCGGCCTGTCCAACGACGGCT
TCGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGG
TGATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCA
AGTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGA
TCCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCAAGCTGTCCA
AGATCCCCGACGAGGTGCGCCGCGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGA
TCGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGTCCGCCGACTCCATCCGCA
AGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGCCAAGT
ACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCT
CCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCG
CCGTGGACCCCTCCGGCGAGGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGG
ACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCATCAAC
GGCGTGGTGCCCTCCGAGGAGTCCTCCCCCGGCGACTACTCCTGA SEQ ID NO: 77
Cuphea viscosissima (Cvis) FATB3 amino acid sequence
MVAAAASSAFFSFPTPGTSPKPGKFGNWPSSLSIPFNPKSNHNGGIQVKANASAHPKANGSAVS
LKAGSLETQEDTSSPSPPPRTFISQLPDWSMLVSAITTVFVAAEKQWTMLDRKSKRPDVLVEPF
VQDGVSFRQSFSIRSYEIGVDRTASIETLMNIFQETSLNHCKSLGLLNDGFGRTPEMCKRDLIW
WTKMQIEVNRYPTWGDTIEVTTWVSESGKNGMSRDWLISDCHSGEILIRATSVWAMMNQK
TRRLSKIPDEVRQEIVPYFVDSAPVIEDDRKLHKLDVKTGDSIRNGLTPRWNDFDVNQHVNNV
KYIAWLLKSVPTEVFETQELCGLTLEYRRECRRDSVLESVTAMDPSKEGDRSLYQHLLRLENG ADIALGRTE WRPKNAGAT GAV STGKTSNGN S VS SEQ ID NO: 78
Cuphea viscosissima (Cvis) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCTCCTTCCCCACCCCCGGCACCTCCCCCAA
GCCCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCATCCCCTTCAACCCCAAGTCCAAC
CACAACGGCGGCATCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCC
GCCGTGTCCCTGAAGGCCGGCTCCCTGGAGACCCAGGAGGACACCTCCTCCCCCTCCCCCC
CCCCCCGCACCTTCATCTCCCAGCTGCCCGACTGGTCCATGCTGGTGTCCGCCATCACCAC
CGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGA
CGTGCTGGTGGAGCCCTTCGTGCAGGACGGCGTGTCCTTCCGCCAGTCCTTCTCCATCCGC
TCCTACGAGATCGGCGTGGACCGCACCGCCTCCATCGAGACCCTGATGAACATCTTCCAG
GAGACCTCCCTGAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGGCTTCGGCCGCACC
CCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATGCAGATCGAGGTGAAC
CGCTACCCCACCTGGGGCGACACCATCGAGGTGACCACCTGGGTGTCCGAGTCCGGCAAG
AACGGCATGTCCCGCGACTGGCTGATCTCCGACTGCCACTCCGGCGAGATCCTGATCCGC
GCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGATCCCCGAC
GAGGTGCGCCAGGAGATCGTGCCCTACTTCGTGGACTCCGCCCCCGTGATCGAGGACGAC
CGCAAGCTGCACAAGCTGGACGTGAAGACCGGCGACTCCATCCGCAACGGCCTGACCCCC
CGCTGGAACGACTTCGACGTGAACCAGCACGTGAACAACGTGAAGTACATCGCCTGGCTG
CTGAAGTCCGTGCCCACCGAGGTGTTCGAGACCCAGGAGCTGTGCGGCCTGACCCTGGAG
TACCGCCGCGAGTGCCGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCATGGACCCCTCC
AAGGAGGGCGACCGCTCCCTGTACCAGCACCTGCTGCGCCTGGAGAACGGCGCCGACATC
GCCCTGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCGCCACCGGCGCCGTGTCCACC
GGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 79
Cuphea calcarata (Ccalc) FATB1 amino acid sequence
MVAASASSAFFSVPTPGTSPKPGKFGNWPSSLSVPFKPRSNNSGGFQVKANASAHPKANGSAV
SLKSGSLETQEDNSSSSRPPRTFIKQLPDWSMLLSAITTVFVAAEKQWTMFDRKSKRSDMLVD
PFWDRTVQDGVLFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSMGLLYEGFGRTPEMC
KRDLIWWTKIHIKVNRYPTWGDTIEVTTWVSESGKNGMGRDWLISDCHTGEILIRATSVWA
MMNQTTRRLSKFPYELRQEIAPHFVDSDPVIEDNRRLLNFDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILESVPTEVFDTRELCGLTLEYRQECGRGSVLESVTAMDPSKEGDRSLYQHLL
RLEDGTDIVKGRTE WRPKNAGTNGPV STRKTTNGS S VS SEQ ID NO: 80
Cuphea calcarata (Ccalc) FATB1 coding DNA sequence
ATGGTGGCTGCTTCAGCAAGTTCTGCATTCTTCTCCGTCCCAACCCCGGGAACCTCTCCTA
AACCCGGGAAGTTCGGCAATTGGCCATCGAGCTTGAGCGTCCCATTCAAGCCCAGATCAA
ACAACAGTGGCGGCTTTCAGGTTAAGGCAAACGCCAGTGCTCATCCTAAGGCTAACGGTT
CTGCAGTAAGTCTAAAGTCTGGGAGCCTCGAGACTCAGGAGGACAATTCGTCGTCGTCTC
GTCCTCCTCGGACTTTCATTAAACAGTTGCCGGACTGGAGTATGCTTCTTTCCGCGATCAC
AACCGTCTTCGTGGCGGCTGAGAAGCAGTGGACGATGTTTGATCGGAAATCTAAGAGGTC
TGACATGCTCGTGGACCCGTTTGTGGTTGACAGGATTGTTCAGGATGGGGTTCTGTTCAGA
CAGAGTTTTTCGATTAGGTCTTACGAAATAGGCGCTGATCGAACAGCCTCTATTGAGACGC
TGATGAACATCTTCCAGGAAACATCTCTCAATCATTGTAAGAGTATGGGTCTTCTCTATGA
AGGCTTTGGTCGTACTCCTGAGATGTGTAAGAGGGACCTCATTTGGGTGGTTACGAAAAT
ACATATCAAGGTGAATCGCTATCCGACTTGGGGTGATACTATCGAGGTCACTACTTGGGTC
TCCGAGTCGGGCAAAAACGGTATGGGTCGCGATTGGCTGATAAGTGATTGCCATACAGGA
GAAATTCTTATAAGAGCAACGAGTGTGTGGGCTATGATGAATCAAACGACGAGAAGATTG
TCGAAATTTCCATATGAGCTTCGACAGGAGATAGCGCCACATTTTGTGGACTCGGATCCTG
TCATTGAAGACAATCGAAGATTGCTCAACTTTGATGTGAAGACGGGTGATTCCATTCGCA
AGGGTCTAACTCCAAGGTGGAATGACTTGGATGTCAATCAGCACGTTAACAATGTGAAGT
ACATTGGGTGGATTCTCGAGAGTGTTCCAACAGAAGTTTTCGATACCCGGGAGCTATGCG
GCCTCACCCTTGAGTATAGGCAGGAATGCGGAAGAGGAAGTGTGCTGGAGTCCGTGACCG
CTATGGATCCCTCAAAAGAGGGAGACCGGTCTCTGTACCAGCACCTTCTTCGGCTTGAGG
ATGGGACTGATATCGTGAAGGGCAGAACCGAGTGGCGGCCAAAGAATGCAGGAACCAAT
GGGCCAGTATCAACAAGAAAGACTACAAATGGAAGCTCAGTCTCTTAG SEQ ID NO: 81
Cuphea calcarata (Ccalc) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCTCCGCCTCCTCCGCCTTCTTCTCCGTGCCCACCCCCGGCACCTCCCCCA
AGCCCGGCAAGTTCGGCAACT GGCCCT CCT CCCTGT CCGT GCCCTT C AAGCCCCGCTCC AA
CAACTCCGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGGAGACCCAGGAGGACAACTCCTCCTCCTCCCGC
CCCCCCCGCACCTTCATCAAGCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCA
CCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGTTCGACCGCAAGTCCAAGCGCTCCG
ACATGCTGGTGGACCCCTTCGTGGTGGACCGCATCGTGCAGGACGGCGTGCTGTTCCGCC
AGTCCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCT
GATGAACATCTTCCAGGAGACCTCCCTGAACCACTGCAAGTCCATGGGCCTGCTGTACGA
GGGCTTCGGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGAT
CCACATCAAGGTGAACCGCTACCCCACCTGGGGCGACACCATCGAGGTGACCACCTGGGT
GTCCGAGTCCGGCAAGAACGGCATGGGCCGCGACTGGCTGATCTCCGACTGCCACACCGG
CGAGATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGACCACCCGCCGCCT
GTCCAAGTTCCCCTACGAGCTGCGCCAGGAGATCGCCCCCCACTTCGTGGACTCCGACCCC
GTGATCGAGGACAACCGCCGCCTGCTGAACTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCGTGCCCACCGAGGTGTTCGACACCCGCGAGCTGTG
CGGCCTGACCCTGGAGTACCGCCAGGAGTGCGGCCGCGGCTCCGTGCTGGAGTCCGTGAC
CGCCATGGACCCCTCC AAGGAGGGCGACCGCTCCCT GT ACC AGCACCTGCT GCGCCT GGA
GGACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCA
ACGGCCCCGTGTCCACCCGCAAGACCACCAACGGCTCCTCCGTGTCCTGA SEQ ID NO: 82
Cuphea painteri (Cpai) FATB1 amino acid sequence
MVAAAATSAFFPVPAPGTSPNPRKFGSWPSSLSPSLPKSIPNGGFQVKANASAHPKANGSAVSL
KSGSLNTQENTSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLVDLFGLE
SSVQDALVFRQSFSIRSYEIGTDRTASIETLMNHLQETSLNHCKSTGILLDGFGRTLEMCKRELI
WWIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK
TRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIQKGLSPGWNDLDVNQHVSN
VKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPSKVGGRSQYQHLLRLED
GTAIVNGITEWRPKNAGANGAISTGKTSNGNSVS SEQ ID NO: 83
Cuphea painteri (Cpai) FATB1 coding DNA sequence
ATGGTGGCTGCTGCAGCAACTTCTGCATTCTTCCCTGTTCCAGCCCCGGGAACCTCCCCAA
ATCCCAGGAAATTCGGAAGTTGGCCATCGAGCTTGAGCCCTTCCTTGCCCAAGTCAATCCC
CAATGGCGGATTTCAGGTAAAGGCAAATGCCAGTGCCCATCCGAAGGCTAACGGTTCTGC
AGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGAACACTTCGTCGTCCCCTCCTCCT
CGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACCGTGT
TCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATGCTGG
TGGACTTGTTTGGGTTGGAAAGTAGTGTTCAGGATGCGCTCGTGTTCAGACAGAGTTTTTC
GATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATAGAGACGCTGATGAACCA
CTTGCAGGAAACATCTCTCAATCATTGTAAAAGTACCGGTATTCTCCTTGACGGCTTCGGT
CGTACTCTTGAGATGTGTAAAAGGGAACTCATTTGGGTGGTAATAAAAATGCAAATTCAG
GTGAATCGCTATCCAGCATGGGGCGATACTGTCGAGATCAATACCCGGTTCTCCCGGTTGG
GGAAAATTGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATTCTAA
TAAGAGCAACGAGCGAGTATGCCATGATGAATCAAAAGACGAGAAGACTCTCAAAACTT
CCATACGAGGTTCACCAGGAGATAGCGCCTCTTTTTGTCGACTCTCCTCCTGTGATTGAAG
ACAATGATCTGAAAGTGCATAAATTTGAAGTGAAGACTGGTGATTCCATTCAAAAGGGTC
TATCCCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTACATTG
GGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATGCTCTCTCG
CCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTGGAGTCCGTGACCGCAATGG
ATCCCTCAAAAGTTGGAGGCCGTTCTCAGTACCAGCACCTTCTGCGGCTTGAGGATGGGA
CTGCTATCGTGAACGGCATAACTGAGTGGCGGCCGAAGAATGCAGGAGCTAATGGGGCG
ATATCAACGGGAAAGACTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 84
Cuphea painteri (Cpai) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
ACCCCCGCAAGTTCGGCTCCTGGCCCTCCTCCCTGTCCCCCTCCCTGCCCAAGTCCATCCCC
AACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCC
GTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGAACACCTCCTCCTCCCCCCCCCCCC
GCACCTTCCTGCACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCACCACCGTGTT
CGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGCTGGT
GGACCTGTTCGGCCTGGAGTCCTCCGTGCAGGACGCCCTGGTGTTCCGCCAGTCCTTCTCC
ATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATCGAGACCCTGATGAACCAC
CTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTCGGC
CGCACCCTGGAGATGTGCAAGCGCGAGCTGATCTGGGTGGTGATCAAGATGCAGATCCAG
GTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCCGCTTCTCCCGCCTG
GGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATCCTG
ATCCGCGCCACCTCCGAGTACGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGCTG
CCCTACGAGGTGCACCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCGAG
GACAACGACCTGAAGGTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCAGAAGGG
CCTGTCCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGTACAT
CGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCT
GGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCCAT
GGACCCCTCCAAGGTGGGCGGCCGCTCCCAGTACCAGCACCTGCTGCGCCTGGAGGACGG
CACCGCCATCGTGAACGGCATCACCGAGTGGCGCCCCAAGAACGCCGGCGCCAACGGCGC
CATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 85
Cuphea hookeriana (Chook) FATB4 amino acid sequence
MVAAAATSAFFPVPAPGTSPNPRKFGSWPSSLSPSLPNSIPNGGFQVKANASAHPKANGSAVSL KSGSLNTQENT S S SPPPRTFLHQLPDW SRLLT AITTVFVKSKRPDMHDRKSKRPDMLVDLFGLE SSVQDALVFRQRFSIRSYEIGTDRTASMETLMNHLQETSLNHCKSTGILLDGFGRTLEMCKREL IWWIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK TRRLSKLPYEVRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIHKGLTPGWNDLDVNQHVNN VKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAMDPSGGGYGSQFQHLLRLEDG GEIVKGRTEWRPKNGVINGWPTGESSPGDYS SEQ ID NO: 86
Cuphea hookeriana (Chook) FATB4 coding DNA sequence
ATGGTGGCTGCTGCAGCAACTTCTGCATTCTTCCCTGTTCCAGCCCCGGGAACCTCCCCTA
ATCCCAGGAAATTCGGAAGTTGGCCATCGAGCTTGAGCCCTTCCTTGCCCAACTCAATCCC
CAATGGCGGATTTCAGGTAAAGGCAAATGCCAGTGCCCATCCGAAGGCTAACGGTTCTGC
AGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGAACACTTCGTCGTCCCCTCCTCCT
CGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACCGTGT
TCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATGCTGG
TGGACTTGTTTGGGTTGGAGAGTAGTGTTCAGGATGCGCTCGTGTTCAGACAGAGATTTTC
GATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATGGAGACGCTGATGAACCA
CTTGCAGGAAACATCTCTCAATCATTGTAAAAGTACCGGTATTCTCCTTGACGGCTTCGGT
CGTACTCTTGAGATGTGTAAAAGGGAACTCATTTGGGTGGTAATAAAAATGCAGATTCAG
GTGAATCGCTATCCAGCATGGGGCGATACTGTCGAGATCAATACCCGGTTCTCCCGGTTGG
GGAAAATTGGTATGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATTCTTA
TAAGAGCAACGAGCGAGTATGCCATGATGAATCAAAAGACGAGAAGACTCTCAAAACTT
CCATACGAGGTTCGCCAGGAGATAGCGCCTCTTTTTGTCGACTCTCCTCCTGTGATTGAAG
ACAATGATCTGAAAGTGCATAAATTTGAAGTGAAGACTGGTGATTCCATTCACAAGGGTC
TAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTACATCG
GGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCCTTAC
TCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTATGGA
TCCCTCTGGAGGGGGTTATGGGTCCCAGTTTCAGCACCTTCTGCGGCTTGAGGATGGAGGT
GAGATCGTGAAGGGGAGAACCGAGTGGCGACCCAAGAATGGTGTAATCAATGGGGTGGT
ACCAACCGGGGAGTCCTCACCTGGAGACTACTCTTAG SEQ ID NO: 87
Cuphea hookeriana (Chook) FATB4 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCACCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
ACCCCCGCAAGTTCGGCTCCTGGCCCTCCTCCCTGTCCCCCTCCCTGCCCAACTCCATCCCC
AACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTCCGCC
GTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGAACACCTCCTCCTCCCCCCCCCCCC
GCACCTTCCTGCACCAGCTGCCCGACTGGTCCCGCCTGCTGACCGCCATCACCACCGTGTT
CGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGCTGGT
GGACCTGTTCGGCCTGGAGTCCTCCGTGCAGGACGCCCTGGTGTTCCGCCAGCGCTTCTCC
ATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATGGAGACCCTGATGAACCAC
CTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTCGGC
CGCACCCTGGAGATGTGCAAGCGCGAGCTGATCTGGGTGGTGATCAAGATGCAGATCCAG
GTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCCGCTTCTCCCGCCTG
GGCAAGATCGGCATGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATCCTG
ATCCGCGCCACCTCCGAGTACGCCATGATGAACCAGAAGACCCGCCGCCTGTCCAAGCTG
CCCTACGAGGTGCGCCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCGAG
GACAACGACCTGAAGGTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCACAAGGG
CCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAGTACAT
CGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGCTCCCT
GACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACCGCCAT
GGACCCCTCCGGCGGCGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAGGACGG
CGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGGCGTGATCAACGGCG
TGGTGCCCACCGGCGAGTCCTCCCCCGGCGACTACTCCTGA SEQ ID NO: 88
Cuphea avigera var. pulcherrima (Ca) FATB1 amino acid sequence
MVAAAASSAFFSVPVPGTSPKPGKFRIWPSSLSPSFKPKPIPNGGLQVKANSRAHPKANGSAVS
LKSGSLNTQEDTSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLMDSFG
LESIVQEGLEFRQSFSIRSYEIGTDRTASIETLMNYLQETSLNHCKSTGILLDGFGRTPEMCKRDL
IWWTKMKIKVNRYPAWGDTVEINTWFSRLGKIGKGRDWLISDCNTGEILIRATSAYATMNQ
KTRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVS
NVKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPTKVGGRSQYQHLLRLE
DGTDIVKCRTEWRPKNPGANGAISTGKTSNGNSVS SEQ ID NO: 89
Cuphea avigera var. pulcherrima (Ca) FATB1 coding DNA sequence
ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCTCTGTTCCAGTCCCGGGAACCTCTCCTA
AACCCGGGAAGTTCAGAATTTGGCCATCGAGCTTGAGCCCTTCCTTCAAGCCCAAGCCGA
TCCCCAATGGTGGATTGCAGGTTAAGGCAAATTCCAGGGCACATCCGAAGGCTAACGGTT
CTGCAGTTAGTCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCC
TCCTCGGACTTTCCTTCACCAGTTGCCTGATTGGAGTAGGCTTCTGACTGCAATCACGACC
GTGTTCGTGAAATCTAAGAGGCCTGACATGCATGATCGGAAATCTAAGAGGCCTGACATG
CTGATGGACTCGTTTGGGTTGGAGAGTATTGTTCAAGAAGGGCTCGAGTTCAGACAGAGT
TTTTCGATTAGGTCTTATGAAATAGGCACTGATCGAACAGCCTCTATAGAGACGCTGATGA
ACTACTTGCAGGAAACATCTCTCAATCATTGTAAGAGTACCGGTATTCTCCTTGACGGCTT
TGGTCGTACTCCTGAGATGTGTAAAAGGGACCTCATTTGGGTGGTAACAAAAATGAAGAT
CAAGGTGAATCGCTATCCAGCTTGGGGCGATACTGTCGAGATCAATACCTGGTTCTCCCGG
TTGGGGAAAATCGGAAAGGGTCGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGCGTATGCCACGATGAATCAAAAGACGAGAAGACTCTCAAA
ACTTCCATACGAGGTTCACCAGGAGATAGCGCCTCTCTTTGTCGACTCTCCTCCTGTCATT
GAAGACAATGATCTGAAATTGCATAAGTTTGAAGTGAAGACTGGTGATTCCATTCACAAG
GGTCTAACTCCGGGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAGACCCAGGAGCTATGCTCT
CTCGCCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTGCTAGAGTCCGTGACAGCT
ATGGATCCCACAAAAGTTGGAGGCCGGTCTCAGTACCAGCACCTTCTGCGACTTGAGGAT
GGGACTGATATCGTGAAGTGCAGAACTGAGTGGCGGCCGAAGAATCCAGGAGCTAATGG
GGC AAT ATCAACGGGAAAGACTTC AAATGGAAACTCGGTCTCTT AG SEQ ID NO: 90
Cuphea avigera var. pulcherrima (Ca) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
AT GGT GGCCGCCGCCGCCTCCT CCGCCTT CTT CT CCGT GCCCGT GCCCGGC ACCTCCCCC A
AGCCCGGCAAGTTCCGCATCTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGCCCAT
CCCCAACGGCGGCCTGCAGGTGAAGGCCAACTCCCGCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGTCCCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGC ACCTT CCT GC ACC AGCT GCCCGACT GGTCCCGCCT GCT G ACCGCC ATC ACC ACCG
TGTTCGTGAAGTCCAAGCGCCCCGACATGCACGACCGCAAGTCCAAGCGCCCCGACATGC
TGATGGACTCCTTCGGCCTGGAGTCCATCGTGCAGGAGGGCCTGGAGTTCCGCCAGTCCTT
CTCCATCCGCTCCTACGAGATCGGCACCGACCGCACCGCCTCCATCGAGACCCTGATGAA
CTACCTGCAGGAGACCTCCCTGAACCACTGCAAGTCCACCGGCATCCTGCTGGACGGCTTC
GGCCGCACCCCCGAGATGTGCAAGCGCGACCTGATCTGGGTGGTGACCAAGATGAAGATC
AAGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTCCCGC
CTGGGCAAGATCGGCAAGGGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGATC
CTGATCCGCGCCACCTCCGCCTACGCCACCATGAACCAGAAGACCCGCCGCCTGTCCAAG
CTGCCCTACGAGGTGCACCAGGAGATCGCCCCCCTGTTCGTGGACTCCCCCCCCGTGATCG
AGGACAACGACCTGAAGCTGCACAAGTTCGAGGTGAAGACCGGCGACTCCATCCACAAG
GGCCTGACCCCCGGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAGTAC
ATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTGGAGACCCAGGAGCTGTGCTCC
CTGGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCCGTGCTGGAGTCCGTGACCGCC
AT GGACCCC ACC AAGGT GGGCGGCCGCT CCC AGT ACCAGCACCTGCT GCGCCTGGAGGAC
GGCACCGACATCGTGAAGTGCCGCACCGAGTGGCGCCCCAAGAACCCCGGCGCCAACGG
CGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCC SEQ ID NO: 91
Cuphea paucipetala (Cpau) FATB1 amino acid sequence
MVAAAASSAFFPVPAPGTSPKPGKSGNWPSSLSPSIKPMSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMRDRKSKRPDMLVD
SVGLKSWLDGLVSRQIFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMC
KNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSHSGKIGMASDWLITDCNTGEILIRATSVWA
MMNQKTRRFSRLPYEVRQELTPHYVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSEDEGRSQYKH
LLRLEDGTDIVKGRTEWRPKNAGTNGAISTAKPSNGNSVS SEQ ID NO: 92
Cuphea paucipetala (Cpau) FATB1 coding DNA sequence
ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGTTCCAGCCCCCGGAACCTCCCCTA
AACCCGGGAAGTCCGGCAACTGGCCATCAAGCTTGAGCCCTTCCATCAAGCCCATGTCAA
TCCCCAATGGCGGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTT
CTGCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCC
TCCTCGGGCTTTCCTTAACC AGTTGCCT GATTGGAGTAT GCTT CT GACTGC AAT CACGACC
GTCTTCGTGGCGGCAGAGAAGCAGTGGACTATGCGTGATCGGAAATCTAAGAGGCCTGAC
ATGCTCGTGGACTCGGTTGGGTTGAAGAGTGTTGTTCTGGATGGGCTCGTGTCCAGACAGA
TTTTTTCGATTAGGTCTTATGAAATAGGCGCTGATCGAACTGCCTCTATAGAGACGCTGAT
GAACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGC
TTTGGTCGTACTCCTGGGATGTGTAAAAATGACCTCATTTGGGTGCTTACAAAAATGCAGA
TCATGGTGAATCGCTACCCAACTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCA
TTCGGGGAAAATTGGTATGGCTAGCGATTGGCTAATAACTGATTGCAACACAGGAGAAAT
TCTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAGACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTATGTGGACTCTCCTCATGTCATT
GAAGATAATGATCGGAAATTGCATAAGTTTGATGTGAAGACTGGTGATTCCATTCGTAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTAAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGACCCAGGAGCTATGCTCT
CTCACCGTTGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCT
ATGGATCCCTCAGAAGATGAAGGCCGGTCTCAGTACAAGCACCTTCTGCGGCTTGAGGAT
GGGACTGACATCGTGAAGGGCAGAACTGAGTGGCGACCGAAGAATGCAGGAACTAACGG
GGCGATATCAACAGCAAAGCCTTCAAATGGAAACTCGGTCTCTTAG SEQ ID NO: 93
Cuphea paucipetala (Cpau) FATB1 coding DNA sequence codon optimized for Pro to theca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCGCCCCCGGCACCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCATCAAGCCCATGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCGCGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGTCCGTGGTGCTGGACGGCCTGGTGTCCCGCCAGA
TCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACC ACCT GC AGGAGACCT CC AT CAACC ACT GCAAGTCCCTGGGCCTGCT GAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GAT CAT GGT GAACCGCT ACCCC ACCTGGGGCGAC ACCGTGG AGATC AAC ACCT GGTTCTC
CCACTCCGGCAAGATCGGCATGGCCTCCGACTGGCTGATCACCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTACGTGGACTCCCCCCACGTG
AT CGAGGAC AACGACCGC AAGCT GCACAAGTT CGACGT GAAGACCGGCGACTCC ATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCATGGACCCCTCCGAGGACGAGGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCGCCAAGCCCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 94
Cuphea procumbens (Cproc) FATB1 amino acid sequence
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLEAQELCSLTVEYRRECGMDSVLESVTAVDPSEDGGRSQYN
HLLRLEDGTDWKGRTEWRPKNAETNGAISPGNTSNGNSIS SEQ ID NO: 95
Cuphea procumbens (Cproc) FATB1 coding DNA sequence
ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GC AGT AAAT CT AAAGT CT GGC AGCCTC AAC ACT C AGGAGGAC ACTT CGTCGT CCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACT CGGTT GGGTT GAAGAAT ATT GTT CGGGAT GGGCT CGT GT CC AGAC AGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCTTCTATAGAGACACTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTGAGCAACGTGAAGTAC
ATTGGGTGGATTCTCGAGAGTATGCCAATAGAAGTTTTGGAGGCCCAGGAACTATGCTCT
CTCACCGTTGAATATAGGCGGGAATGCGGAATGGACAGTGTGCTGGAGTCCGTGACTGCT
GTAGATCCCTCAGAAGATGGAGGCCGGTCTCAGTACAATCACCTTCTGCGGCTTGAGGAT
GGGACTGATGTCGTGAAGGGCAGAACTGAGTGGCGACCGAAGAATGCAGAAACTAACGG
GGCGATATCACCAGGAAACACTTCAAATGGAAACTCGATCTCCTAG SEQ ID NO: 96
Cuphea procumbens (Cproc) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGTCCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGAACATCGTGCGCGACGGCCTGGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGGCCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCGTGGACCCCTCCGAGGACGGCGGCCGCTCCCAGTACAACCACCTGCTGCGCCTGGAG
GACGGCACCGACGTGGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGAGACCAA
CGGCGCCATCTCCCCCGGCAACACCTCCAACGGCAACTCCATCTCCTGA SEQ ID NO: 97
Cuphea procumbens (Cproc) FATB2 amino acid sequence
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRQECGRESVLESLTAVDPSGKGFGSQFQH
LLRLEDGGEIVKGRTEWRPKTAGINGAIASGETSPGDF SEQ ID NO: 98
Cuphea procumbens (Cproc) FATB2 coding DNA sequence
ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACTCGGTTGGGTTGAAGAATATTGTTCGGGATGGGCTCGTGTCCAGACAGAG
TTTTTTGATTAGATCTTATGAAATAGGCGCTGATCGAACAGCTTCTATAGAGACACTGATG
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATCGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCC
TTACCCTGGAATACAGGCAGGAATGCGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TGGACCCCTCTGGAAAGGGCTTTGGGTCCCAGTTCCAACACCTTCTGAGGCTTGAGGATGG
AGGTGAGATCGTGAAGGGGAGAACTGAGTGGCGACCCAAGACTGCAGGTATCAATGGGG
CGATAGCATCCGGGGAGACCTCACCTGGAGACTTTTAG SEQ ID NO: 99
Cuphea procumbens (Cproc) FATB2 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAA.CCTGAA.GTCCGGCTCCCTGAACA.CCCA.GGA.GGA.CA.CCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGA.CTGGTCCA.TGCTGCTGTCCGCCATCA.CCA.CCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGAACA.TCGTGCGCGACGGCCTGGTGTCCCGCCA.GT
CCTTCCT GAT CCGCT CCT ACGAGATCGGCGCCGACCGC ACCGCCTCC ATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GAT CAT GGT GAACCGCT ACCCCGCCTGGGGCGAC ACCGTGG AGATC AAC ACCT GGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCT GCCCT ACGAGGTGCGCC AGGAGCT GACCCCCC ACTTCGT GGACTCCCCCC ACGT G
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTG
CTCCCTGACCCTGGAGTACCGCCAGGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGAC
CGCCGTGGACCCCTCCGGCAAGGGCTTCGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGA
GGACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCA
ACGGCGCCATCGCCTCCGGCGAGACCTCCCCCGGCGACTTCTGA SEQ ID NO: 100
Cuphea procumbens (Cproc) FATB3 amino acid sequence
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGGYGSQFQ HLLRLEDGGEIVKGRTEWRPKNAGINGVLPTGE* SEQ ID NO: 101
Cuphea procumbens (Cproc) FATB3 coding DNA sequence
ATGGTGGCTGCTGCAGCAAGTTCTGCATTCTTCCCTGCTCCAGCCCCGGGATCCTCACCTA
AACCCGGGAAGTCCGGTAATTGGCCATCGAGCTTGAGCCCTTCCTTCAAGTCCAAGTCAAT
CCCCTATGGCCGATTTCAGGTTAAGGCAAATGCCAGTGCCCATCCTAAGGCTAACGGTTCT
GCAGTAAATCTAAAGTCTGGCAGCCTCAACACTCAGGAGGACACTTCGTCGTCCCCTCCTC
CTCGGGCTTTCCTTAACCAGTTGCCTGATTGGAGTATGCTTCTGTCTGCAATCACGACTGT
ATTCGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAAATCTAAGAGGCCTGACAT
GCTTGTGGACTCGGTTGGGTTGAAGAATATTGTTCGGGATGGGCTCGTGTCCAGACAGAG TTTTTT(jATTA(jATCTTAT(jAAATAG(jCGCT(jATC(jAACAGCTTCTATA(jA(jACACT(jAT(j
AACCACTTGCAGGAAACATCTATCAATCATTGTAAGAGTTTGGGTCTTCTCAATGACGGCT
TTGGTCGTACTCCTGGGATGTGTAAAAACGACCTCATTTGGGTGCTTACTAAAATGCAGAT
CATGGTGAATCGCTACCCAGCTTGGGGCGATACTGTTGAGATCAATACCTGGTTCTCCCAG
TCGGGGAAAATCGGTATGGGTAGCGATTGGCTAATAAGTGATTGCAACACAGGAGAAATT
CTTATAAGAGCAACGAGCGTGTGGGCCATGATGAATCAAAAAACGAGAAGATTCTCAAG
ACTTCCATACGAGGTTCGCCAGGAGTTAACGCCTCATTTTGTGGACTCTCCTCATGTCATT
GAAGACAATGATCGGAAATTGCATAAGTTCGATGTGAAGACTGGTGATTCTATTCGCAAG
GGTCTAACTCCGAGGTGGAATGACTTGGATGTCAATCAGCACGTCAACAACGTGAAGTAC
ATCGGGTGGATTCTTGAGAGTACTCCACCAGAAGTTCTGGAGACCCAGGAGTTATGTTCCC
TTACCCTGGAATACAGGCGGGAATGTGGAAGGGAGAGCGTGCTGGAGTCCCTCACTGCTG
TGGACCCCTCTGGAGAGGGGGGCTATGGATCCCAGTTTCAGCACCTTCTGCGGCTTGAGG
ATGGAGGTGAGATCGTGAAGGGGAGAACTGAGTGGCGACCCAAGAATGCTGGAATCAAT
GGGGTGTTACCAACCGGGGAGTAG SEQ ID NO: 102
Cuphea procumbens (Cproc) FATB3 coding DNA sequence codon optimized for Prototheca moriformis
ATGGTGGCCGCCGCCGCCTCCTCCGCCTTCTTCCCCGCCCCCGCCCCCGGCTCCTCCCCCA
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGTCCAAGTCCAT
CCCCTACGGCCGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTT CCT GAACCAGCT GCCCGACT GGT CC AT GCT GCTGTCCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCT GGT GGACTCCGTGGGCCT GAAGAACAT CGT GCGCGACGGCCT GGTGTCCCGCCAGT
CCTTCCTGATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCT GCCCT ACGAGGTGCGCC AGGAGCT GACCCCCC ACTT CGT GGACTCCCCCCACGTG
ATCGAGGAC AACGACCGC AAGCT GCACAAGTT CGACGT GAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAA
GTACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTG
CTCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGAC
CGCCGTGGACCCCTCCGGCGAGGGCGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCT
GGAGGACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCA
TCAACGGCGT GCT GCCCACCGGCGAGT GA SEQ ID NO: 103
Cuphea ignea (Cignea) FATB1 amino acid sequence
PGT SRKTGKF GNWPS SLSPSFKPKSIPNGGF QVKANARAHPKANGS A V SLKS V SLNTQEDTSLS
PPPRAFLNQLPDWRMLRTALTTVFVAAEKQWTMLDRKSKRPDMLVDSFGLESIVQEGLVFRQ
SFSIRSYEIGIDRTASIETLMNHLQETSLNQCKSAGILHDGFGRTLEMCKRDLIWWTKMQIKV
NRYPAWGDTVEISTRFSRLGKIGMGRDWLICDCNTGEILIRATSAYAMMNQKTRRLSKLPNEV
RQEIAPLF VD SDPVIEENDMKLHKFEVKTGDSICKGLTPRW SDLDVNQHV SNVKYIGWILESM
PTEVLETQELCSLALEYRRECGRDSVLESVTSMDPSKVGGWSQYQHLLRLEDGADIVKGRTE
WRPKNAGANGAISTGKT SEQ ID NO: 104
Cuphea ignea (Cignea) FATB1 coding DNA sequence
CCGGGAACCTCACGTAAAACCGGGAAGTTCGGCAATTGGCCATCAAGCTTGAGCCCTTCC
TTCAAGCCCAAGTCAATCCCCAATGGCGGATTTCAGGTTAAGGCTAATGCCAGAGCCCAT
CCTAAGGCTAACGGTTCTGCAGTAAGTCTAAAGTCTGTCAGCCTCAACACTCAGGAGGAC
ACTTCGTTGTCCCCTCCTCCTCGTGCTTTCCTTAACCAGTTGCCTGATTGGAGGATGCTTCG
GACTGCACTCACGACCGTCTTTGTGGCGGCAGAGAAGCAGTGGACTATGCTTGATCGGAA
ATCTAAGAGGCCTGACATGCTCGTGGACTCGTTTGGGTTGGAGAGTATTGTTCAAGAAGG
GCTCGTGTTCAGACAGAGCTTTTCGATTAGGTCTTATGAAATAGGCATTGATCGAACAGCC
TCTATAGAGACGCTGATGAACCACTTGCAGGAAACATCTCTCAATCAATGTAAGAGTGCT
GGTATTCTCCATGACGGCTTCGGTCGTACTCTTGAGATGTGTAAAAGGGACCTCATTTGGG
TTGTTACGAAAATGCAGATCAAGGTGAATCGCTATCCAGCTTGGGGCGATACTGTCGAGA
TCAGTACCCGGTTCTCCCGGTTGGGGAAAATCGGTATGGGTCGCGATTGGCTAATATGTGA
TTGCAACACAGGAGAAATTCTTATAAGAGCAACGAGCGCGTATGCCATGATGAATCAAAA
GACGAGAAGACTCTCAAAACTTCCAAACGAGGTTCGCCAGGAGATAGCGCCTCTTTTTGT
GGACTCTGATCCTGTCATTGAAGAAAATGATATGAAATTGCATAAGTTTGAAGTGAAGAC
TGGTGATTCCATTTGCAAGGGTCTAACTCCGAGGTGGAGTGACTTGGATGTCAATCAGCAC
GTAAGCAACGTGAAGTACATAGGGTGGATTCTCGAGAGTATGCCAACAGAAGTTTTGGAG
ACACAGGAGCTATGCTCTCTCGCCCTTGAATATAGGCGGGAATGCGGAAGGGACAGTGTG
CTGGAGTCTGTGACCTCTATGGATCCCTCAAAAGTTGGAGGCTGGTCTCAGTACCAGCACC
TTCTGCGACTTGAGGATGGGGCGGATATCGTGAAGGGCAGAACTGAGTGGCGGCCGAAG
AAT GCAGGAGCTAACGGGGCGATATC AACAGGAAAG ACTT GA SEQ ID NO: 105
Cuphea ignea (Cignea) FATB1 coding DNA sequence codon optimized for Prototheca moriformis
CCCGGCACCTCCCGCAAGACCGGCAAGTTCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCT
TCAAGCCCAAGTCCATCCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCCGCGCCCACC
CCAAGGCCAACGGCTCCGCCGTGTCCCTGAAGTCCGTGTCCCTGAACACCCAGGAGGACA
CCTCCCT GTCCCCCCCCCCCCGCGCCTTCCT G AACC AGCT GCCCG ACT GGCGC AT GCT GCG
CACCGCCCTGACCACCGTGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAA
GTCCAAGCGCCCCGACATGCTGGTGGACTCCTTCGGCCTGGAGTCCATCGTGCAGGAGGG
CCTGGTGTTCCGCCAGTCCTTCTCCATCCGCTCCTACGAGATCGGCATCGACCGCACCGCC
TCCATCGAGACCCTGATGAACCACCTGCAGGAGACCTCCCTGAACCAGTGCAAGTCCGCC
GGCATCCTGCACGACGGCTTCGGCCGCACCCTGGAGATGTGCAAGCGCGACCTGATCTGG
GTGGTGACCAAGATGCAGATCAAGGTGAACCGCTACCCCGCCTGGGGCGACACCGTGGAG
ATCTCCACCCGCTTCTCCCGCCTGGGCAAGATCGGCATGGGCCGCGACTGGCTGATCTGCG
ACTGCAACACCGGCGAGATCCTGATCCGCGCCACCTCCGCCTACGCCATGATGAACCAGA
AGACCCGCCGCCTGTCCAAGCTGCCCAACGAGGTGCGCCAGGAGATCGCCCCCCTGTTCG
TGGACTCCGACCCCGTGATCGAGGAGAACGACATGAAGCTGCACAAGTTCGAGGTGAAG
ACCGGCGACTCCATCTGCAAGGGCCTGACCCCCCGCTGGTCCGACCTGGACGTGAACCAG
CACGTGTCCAACGTGAAGTACATCGGCTGGATCCTGGAGTCCATGCCCACCGAGGTGCTG
GAGACCCAGGAGCTGTGCTCCCTGGCCCTGGAGTACCGCCGCGAGTGCGGCCGCGACTCC
GTGCTGGAGTCCGTGACCTCCATGGACCCCTCCAAGGTGGGCGGCTGGTCCCAGTACCAG
CACCTGCTGCGCCTGGAGGACGGCGCCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCC
AAGAACGCCGGCGCCAACGGCGCCATCTCCACCGGCAAGACCTGA SEQ ID NO: 106
JcFatBl consensus amino acid sequence
MVAAAASSAFFPVPAPGTSPKPGKSGNWPSSLSPSFKPKSIPNGGFQVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKRIVQDGLVSRQSFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSENGGRSQYK HLLRLEDGTDIVKGRTEWRPKNAGTNGAISTGKTSNGNSVS* SEQ ID NO: 107
JcFatBl consensus DNA sequence codon optimized for Prototheca
AT GGT GGCCGCCGCCGCCT CCT CCGCCTT CTTCCCCGTGCCCGCCCCCGGC ACCT CCCCC A
AGCCCGGCAAGTCCGGCAACTGGCCCTCCTCCCTGTCCCCCTCCTTCAAGCCCAAGTCCAT
CCCCAACGGCGGCTTCCAGGTGAAGGCCAACGCCTCCGCCCACCCCAAGGCCAACGGCTC
CGCCGTGAACCTGAAGTCCGGCTCCCTGAACACCCAGGAGGACACCTCCTCCTCCCCCCCC
CCCCGCGCCTTCCTGAACCAGCTGCCCGACTGGTCCATGCTGCTGACCGCCATCACCACCG
TGTTCGTGGCCGCCGAGAAGCAGTGGACCATGCTGGACCGCAAGTCCAAGCGCCCCGACA
TGCTGGTGGACTCCGTGGGCCTGAAGCGCATCGTGCAGGACGGCCTGGTGTCCCGCCAGT
CCTTCTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCCTGAT
GAACCACCTGCAGGAGACCTCCATCAACCACTGCAAGTCCCTGGGCCTGCTGAACGACGG
CTTCGGCCGCACCCCCGGCATGTGCAAGAACGACCTGATCTGGGTGCTGACCAAGATGCA
GATCATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGATCAACACCTGGTTCTC
CCAGTCCGGCAAGATCGGCATGGGCTCCGACTGGCTGATCTCCGACTGCAACACCGGCGA
GATCCTGATCCGCGCCACCTCCGTGTGGGCCATGATGAACCAGAAGACCCGCCGCTTCTCC
CGCCTGCCCTACGAGGTGCGCCAGGAGCTGACCCCCCACTTCGTGGACTCCCCCCACGTG
ATCGAGGACAACGACCGCAAGCTGCACAAGTTCGACGTGAAGACCGGCGACTCCATCCGC
AAGGGCCTGACCCCCCGCTGGAACGACCTGGACGTGAACCAGCACGTGTCCAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCATGCCCATCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCGTGGAGTACCGCCGCGAGTGCGGCATGGACTCCGTGCTGGAGTCCGTGACC
GCCATGGACCCCTCCGAGAACGGCGGCCGCTCCCAGTACAAGCACCTGCTGCGCCTGGAG
GACGGCACCGACATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGAACGCCGGCACCAA
CGGCGCCATCTCCACCGGCAAGACCTCCAACGGCAACTCCGTGTCCTGA SEQ ID NO: 108
JcFatB2 consensus amino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLSPLKPKSVANGGLQVKANASAPPKINGSSVGL
KSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLVDPF
GLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGYGSQFQ HLLRLEDGGEIVKGRTE WRPKT AGINGAIASGETSPGDS S * SEQ ID NO: 109
JcFatB2 consensus DNA sequence codon optimized for Prototheca
ATGGTGGCCACCGCCGCCTCCTCCGCCTTCTTCCCCGTGCCCTCCCCCGACACCTCCTCCCG
CCCCGGCAAGCTGGGCAACGGCTCCTCCTCCCTGTCCCCCCTGAAGCCCAAGTCCGTGGCC
AACGGCGGCCTGCAGGTGAAGGCCAACGCCTCCGCCCCCCCCAAGATCAACGGCTCCTCC
GTGGGCCTGAAGTCCGGCTCCCTGAAGACCCAGGAGGACACCCCCTCCGCCCCCCCCCCC
CGCACCTTCATCAACCAGCTGCCCGACTGGTCCATGCTGCTGGCCGCCATCACCACCGTGT
TCCTGGCCGCCGAGAAGCAGTGGATGATGCTGGACTGGAAGCCCAAGCGCCCCGACATGC
TGGTGGACCCCTTCGGCCTGGGCCGCATCGTGCAGGACGGCCTGGTGTTCCGCCAGAACTT
CTCCATCCGCTCCTACGAGATCGGCGCCGACCGCACCGCCTCCATCGAGACCGTGATGAA
CCACCTGCAGGAGACCGCCCTGAACCACGTGAAGTCCGCCGGCCTGCTGAACGACGGCTT
CGGCCGCACCCCCGAGATGTACAAGCGCGACCTGATCTGGGTGGTGGCCAAGATGCAGGT
GATGGTGAACCGCTACCCCACCTGGGGCGACACCGTGGAGGTGAACACCTGGGTGGCCAA
GTCCGGCAAGAACGGCATGCGCCGCGACTGGCTGATCTCCGACTGCAACACCGGCGAGAT
CCTGACCCGCGCCTCCTCCGTGTGGGTGATGATGAACCAGAAGACCCGCCGCCTGTCCAA
GATCCCCGACGAGGTGCGCCACGAGATCGAGCCCCACTTCGTGGACTCCGCCCCCGTGAT
CGAGGACGACGACCGCAAGCTGCCCAAGCTGGACGAGAAGACCGCCGACTCCATCCGCA
AGGGCCTGACCCCCAAGTGGAACGACCTGGACGTGAACCAGCACGTGAACAACGTGAAG
TACATCGGCTGGATCCTGGAGTCCACCCCCCCCGAGGTGCTGGAGACCCAGGAGCTGTGC
TCCCTGACCCTGGAGTACCGCCGCGAGTGCGGCCGCGAGTCCGTGCTGGAGTCCCTGACC
GCCGTGGACCCCTCCGGCAAGGGCTACGGCTCCCAGTTCCAGCACCTGCTGCGCCTGGAG
GACGGCGGCGAGATCGTGAAGGGCCGCACCGAGTGGCGCCCCAAGACCGCCGGCATCAA
CGGCGCC AT CGCCT CCGGCGAGACCT CCCCCGGCGACTCCTCCT GA SEQ ID NO: 110
CuPSR23 FATB3 amino acid sequence
MWAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNAGFQVKANASAHPKA
NGSAVNLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRP
DMLVDSVGLKCIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFG
RTPGMCKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIR
ATSVWAMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGDSIRKGLTPR
WNDLDVNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENG
GRSQYKHLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSVS SEQ ID NO: 111
CuPSR23 FATB3b amino acid sequence
MWAAATSAFFPVPAPGTSPKPGKSGNWPSSLSPTFKPKSIPNAGFQVKANASAHPKA
NGSAVNLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRP
DMLVDSVGLKSIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFG
RTPGMCKNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSQSGKIGMASDWLISDCNTGEILIR
ATSVWAMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDQKLHKFDVKTGDSIRKGLTPR
WNDLDVNQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAVDPSENG
GRSQYKHLLRLEDGTDIVKSRTEWRPKNAGTNGAISTSTAKTSNGNSAS SEP ID NO: 112
CwFATB3 amino acid sequence:
MWAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPK ANGSAVSLKSGSLNTLEDPPSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQFTRLDRKSKRPD MLVD WF GSETIVQDGLVFRERF SIRS YEIGADRT ASIETLMNHLQDT SLNHCKS V GLLNDGFG RTSEMCTRDLIWVLTKMQIWNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVR ATSAWAMMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPG WNDLDVNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESWESVTSMNPSKVG DRSQYQHLLRLEDGADIMKGRTEWRPKNAGTNRAIST SEP ID NO: 113
CwFATB3a amino acid sequence:
MWAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPK ANGSAVSLKSGSLNTLEDPPSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQFTRLDRKSKRPD MLVD WF GSETIVQDGLVFRERF SIRS YEIGADRT ASIETLMNHLQDT SLNHCKS V GLLNDGF G RTSEMCTRDLIWVLTKMQIWNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVR ATSAWAMMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPG WNDLDVNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESWESVTSMNPSKVG DRSQYQHLLRLEDGADIMKGRTEWRPKNAGTNRAIST SEP ID NO: 114 CwFATB3b amino acid sequence
MWAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPK ANGS A V SLKSGSLNTLEDLPS SPPPRTFLNQLPD W SRLRT AITTVFVAAEKQFTRLDRKSKRPD MLVDWFGSETIVQDGLVFRERFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFG RTSEMCTRDLIWVLTKMQIWNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVR ATSAWAMMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPG WNDLDVNQH V SNVKYIGWILEKF WRPRS Y ALSPLNIGGNVEGKVW SEP ID NO: 115
CwFATB3c amino acid sequence
MWAAAASSAFFPVPAPRTTPKPGKFGNWPSSLSPPFKPKSNPNGRFQVKANVSPHPK ANGSAVSLKSGSLNTLEDLPSSPPPRTFLNQLPDWSRLRTAITTVFVATEKQFTRLDRKSKRPD MLVDWFGSETIVQDGLVFRERFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFG RTSEMCTRDLIWVLTKMQIWNRYPTWGDTVEINSWFSQSGKIGMGRDWLISDCNTGEILVR ATSAWAMMNQKTRRFSKLPCEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPG WNDLDVNQH V SNVKYIGWILEKF WRPRS Y ALSPLNIGGNVEGKVW SEP ID NO: 116
CwFATB4a amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGGFKTQEDSPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSNDGFGR
TPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGY
ASRFQHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGDFF SEP ID NO: 117
CwFATB4a.l amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGGFKTQEDSPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKLAGLSNDGFGR
TPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGY
ASRFQHLLRLEDGGEIVKARTEWRPKNAGINWWPSEESSPGDFF SEP ID NO: 118
CwFATB4a,2 amino acid sequence:
MVATAASSAFFPVPSADTSSSRPGKLGNGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGSFKTQEDAPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKLAGLSNDGFGR
TPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGY
ASRFQHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGDFF SEP ID NO: 119
CwFATB4a,3 amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGGFKTQEDSPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKLAGLSNDGFGR
TPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSAEGY
VSRFQHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGDFF SEP ID NO: 120
CwFATB4b amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGNGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGSFKTQEDAPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSSDGFGR
TPAMSKRDLIWWAKMQVMVNRYPAWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPAEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGD
GSKFQHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGGDFF SEP ID NO: 121
CwFATB4b.l amino acid sequence
MVATAASSAFFPVPSADTSSSRPGKLGSGPSSLSPLKPKSIPNGGLQVKANASAPPKIN
GSSVGLKSGSFKTQEDAPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGSIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKIAGLSSDGFGR
TPAMSKRDLIWWAKMQVMVNRYPAWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRRLSKIPDEVRNEIEPHFVDSAPWEDDDRKLPKLDENTADSIRKGLTPR
WNDLDVNQHVNNVKYIGWILESTPAEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGD
GSKFQHLLRLEDGGEIVKARTEWRPKNAGINGWPSEESSPGGDFF SEP ID NO: 122
CwFATB5 amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANG SAVNLKSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF RQSFSIRSYEIGVDRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWWTKIQVE VNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFP YEVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWIL KSVPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGR TE WRPKNAGANGAMS SGKT SNGNCLIEGRGW QPFRWRLIF SEP ID NO: 123
CwFATB5a amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGFFFR
QSFSIRSYEIGVDRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWWTKIQVEV
NRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFPYE
VRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWILKS
VPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTE
WRPKNAGANGAMSSGKTSNGNCLIEGRGWQPFRWRLIF SEP ID NO: 124
CwFATB5b amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANG SAVNLKSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF RQSF SIRS YEIGVDRTASIETLMNIF QET SLNHCKSIGLLNDGF GRTPEMCKRDLIWWTKIQVE VNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFP YEVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWIL KSVPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLWLEDGADITIGR TE WRPKNAGANGAMS SGKT SNGNCLIEGRGW QPFRWRLIF SEP ID NO: 125
CwFATB5c amino acid sequence
MVAAAASSAFFSVPTPGTPPKPGKFGNWPSSLSVPFKPDNGGFHVKANASAHPKANG
SAVNLKSGSLETPPRSFINQLPDLSVLLSKITTVFGAAEKQWKRPGMLVEPFGVDRIFQDGVFF
RQSFSIRSYEIGVDRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLIWWTKIQVE
VNRYPIWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQNTRRLSKFPY
EVRQEIAPHFVDSAPVIEDDQKLQKLDVKTGDSIRDGLTPRWNDLDVNQHVNNVKYIGWILK
SVPIEVFETQELCGVTLEYRRECGRDSVLESVTAMDPAKEGDRCVYQHLLRLEDGADITIGRTE
WRPKNAGANGAMSSGKTSNGNCLIEGMGWQPFRWRLIF SEP ID NO: 126
CwFATB5.1 amino acid sequence
MVAAAASSAFFSVPTPGTSPKPGKFRNWPSSLSVPFKPETNHNGGFHIKANASAHPKA
NGSALNLKSGSLETQEDTSLSSPPRTFIKQLPDWSMLLSKITTVFGAAEKQLKRPGMLVEPFGV
DRIFQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLI
WWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQ
NTRRLSKFPYEVRQEIAPHFVDSAPVIEDDRKLYKLNVKTGDSIRDGLTPRWNDLDVNQHVNN
VKFIGWILKSVPTKVFETQELCGVTLEYRRECGKDSVLESVTAMDPAKEGDRSVYQHLLRLED
GADITIGRTEWRPKNAGANEAISSGKTSNGNSAS SEP ID NO: 127
CwFATB5.1a amino acid sequence
MVAAAASSAFFSVPTPGTSPKPGKFRNWPLSLSVPFKPETNHNGGFHIKANASAHPKA
NGSALNLKSGSLETQEDTSLSSPPRTFIKQLPDWSMLLSKITTVFGAAEKQLKRPGMLVEPFGV
DRIFQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCKRDLI
WWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMMNQ
NTRRLSKFPYEVRQEIAPHFVDSAPVIEDDRKLYKLNVKTGDSIRDGLTPRWNDLDVNQHVNN
VKFIGWILKSVPTKVFETQELCGVTLEYRRECGKDSVLESVTAMDPAKEGDRSVYQHLLRLED
GADITIGRTEWRPKNAGANEAISSGKTSNGNSAS SEP ID NO: 128
CcFATB2b amino acid sequence
MVTTSLASAYFSMKAVMLAPDGRGIKPRSSGLQVRAGNERNSCKVINGTKVKDTEG
LKGCSTLQGQSMLDDHFGLHGLVFRRTFAIRCYEVGPDRSTSIMAVMNHLQEAARNHAESLG
LLGDGFGETLEMSKRDLIWWRRTHVAVERYPAWGDTVEVEAWVGASGNTGMRRDFLVRD
CKTGHILTRCTSVSVMMNMRTRRLSKIPQEVRAEIDPLFIEKVAVKEGEIKKLQKLNDSTADYI
QGGWTPRWNDLDVNQHVNNIIYVGWIFKSVPDSISENHHLSSITLEYRRECIRGNKLQSLTTVC
GGSSEAGIICEHLLQLEDGSEVLRARTEWRPKHTDSFQGISERFPQQEPHK SEP ID NO: 129
CcFATB3 amino acid sequence
MVATAAASAFFPVGAPATSSATSAKASMMPDNLDARGIKPKPASSSGLQVKANAHA
SPKINGSKVSTDTLKGEDTLTSSPAPRTFINQLPDWSMFLAAITTIFLAAEKQWTNLDWKPRRP
DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF
GATPEMSRRDLIWWTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEI
LTRATSVWVMMNKRTRRLSKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLT
PRWSDLDVNQHVNNVKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSMTAVSGGG
SAAGGSPESSVECDHLLQLESGPEWRGRTEWRPKSANNSRSILEMPAESL SEP ID NO: 130
CcFATB3b amino acid sequence
MVATAAASAFFPVGAPATSSATSAKASMMPDNLDARGIKPKLASSSGLQVKANAHA
SPKINGSKVSTDTLKGEDTLTSSPAPRTFINQLPDWSMFLAAITTIFLAAEKQWTNLDWKPRRP
DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF
GATPEMSRRDLIWWTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEI
LTRATSVWVMMNKRTRRLSKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLT
PRWSDLDVNQHVNNVKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSMTAVSGGG
SAAGGSPESSVECDHLLQLESGPEWRGRTEWRPKSANNSRSILEMPAESL SEP ID NO: 131 CcFATB3c amino acid sequence
MVATAAASAFFPVGAPATSSATSAKASMMPDNLDARGIKPKPASSSGLQVKANAHA SPKINGSKV STDTLKGEDTLTSSPAPRTFINQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRP DMLADPFGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGF GATPEMSRRDLIWWTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEI LTRATSVWVMMNKRTRRLSKLPEEVRGEIGPYFIEDVAIIEEDNRKLQKLNENTADNVRRGLT PRWSDLDVNQHVNNAKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSMTAVSGGG SAAGGSPESSVECDHLLQLESGPEWRGRTEWRPKSANNSRSILEMPAESL SEP ID NO: 132
ChtFATBla amino acid sequence
MVAAAASSAFFSVPTPGTSTKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEP
FGVDRIFQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCK
RDLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAM
MNRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGAISTGKTSNENSVS SEP ID NO: 133
ChtFATBla.l amino acid sequence
MVAAAASSAFFSVPTPGTSPKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPK
ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEP
FGVDRIFQDGVFFRHSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCK
RDLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLIGDCRTGEILIRATSVWAM
MNRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDLDVNQ
HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHL
LRLEDGADITIGRTEWRPKNAGANGALSTGKTSNGNSVS SEP ID NO: 134
ChtFATBla.2 amino acid sequence
MVAAAAS S AFF S VPTPGT SPKPGNFGNWPSNLSVPFKPESNHNGGFRVKANASAHPK ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEP FGVDRIFQDGVFFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFGRTPEMCK RDLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAM MNRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQ HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHL LRLEDGADITIGRTEWRPKNAGANGAISTGKTSNENSVS SEP ID NO: 135
ChtFATBla,3 amino acid sequence
MVAAAASSAFFSVPTPGTSPKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPK ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWGMLLSKITTVFGAAERQWKRPGMLVEP F GVDRIF QDGVFFRQSF SIRS YEIGADRTASIETLMNIF QETSLNHCKSIGLLNDGF GRTPEMCK RDLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAM MNRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQ HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHL LRLEDGADITIGRTEWRPKNAGVNGAISTGKTSNENSVS SEP ID NO: 136
ChtFATBla.4 amino acid sequence
MVAAAASSAFFSVPTPGTSPKPGNFGNWPSSLSVPFKPESNHNGGFRVKANASAHPK ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWSMLLSKITTVFGAAERQWKRPGMLVEPF GVDRIF QDGVFFRQSF SIRS YEIGADRT ASIETLMNIF QET SLNHCKSIGLLNDGF GRTPEMCKR DLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAMM
NRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDSIRKGLTPRWNDFDVNQHV NNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHLLR LEDGADITIGRTE WRPKNAGANGAISTGKT SNEN S V S SEP ID NO: 137
ChtFATBlb amino acid sequence
MVAAAAS S AFF S VPTSGT SPKPGNF GNWP S SLS VPFKPES SHNGGF QVKANAS AHPK ANGSAVNLKSGSLETQEDTSSSSPPPRTFIKQLPDWSMLLSKITTVFWAAERQWKRPGMLVEP F GVDRIF QDGVFFRQSF SIRS YEIGADRTASIETLMNIF QETSLNHCKSIGLLNDGF GRTPEMCK RDLIWWTKIQVEVNRYPTWGDTIEVNTWVSESGKNGMGRDWLISDCRTGEILIRATSVWAM MNRKTRRLSKFPYEVRQEIAPHFVDSAPVIEDDKKLHKLDVKTGDFIRKGLTPRWNDFDVNQ HVNNVKYIGWILKSVPAEVFETQELCGVTLEYRRECGRDSVLESVTAMDTAKEGDRSLYQHL LRLEDGADITIGRTEWRPKNAGANGAISTGKTSNENSVS SEP ID NO: 138
ChtFATB2b amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHP KANGSAVSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLRTAITTVFVAAEKQLTMLDRKSKK PDMHVDWFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGF GRTPEMCKRDLIWVLTKMQIMVNRYPT W GDTVEIN SWF SQ SGKIGMGRNWLISDCNTGEILIR ATSIWAMMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEW NDLDVNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGD RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 139
ChtFATB2a amino acid sequence
MWAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHP KANGS AV SLKSGSLNTKEDTPSSPPPRTFLNQLPD W SRLRT AITTVF V AAEKQLTMLDRKSKK PDMHVDWFGLEIIVQDWLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGF GRTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIR ATSIWAMMNQKTRRFSKLPNEVRQELAPHFVDAPPLIEDNDRKLHKFDVKTGDSICKGLTPEW NDLDVNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGD RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 140
ChtFATB2c amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHP
KANGSAVSLKSGSLNTKEDTPSSPPPRTFLNQLPDWNRLRTAITTVFVAAEKQLTMLDRKSKK
PDMHVDWFGLEIIVQDGLVFRESFSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGF
GRTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIR
ATSIWAMMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEW
NDLDVNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGD
RSQYQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 141
ChtFATB2d amino acid sequence
MWAAAASSAFFPVPAPGTTSKPGKFGNWPSSLSPSFKPKSNPNGGFQVKANASAHP
KANGSAVSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRP
DMLVDLFGLESIVQDGLVFRESYSIRSYEIGADRTASIETLMNHLQDTSLNHCKSVGLLNDGFG
RTPEMCKRDLIWVLTKMQIMVNRYPTWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRA
TSIWAMMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWN
DLDVNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRESVLESVTAMNPSKVGDRS
QYQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 142 ChtFATB2e amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHP
KANGSAVSLKSGSLNTQEDTSSSPPPQTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRP
DMLVDWFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFG
RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRA
TSIWAMMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWN
DLDVNQHVSNVKYIGWILESMPTEVLETQELCSLTLEYRRECGRDSVLESVTAMNPSKVGDRS
QYQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 143
ChtFATB2f amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHP KANGSAVSLKSGSLNTQEGTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRP DMLVDWFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFG RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRA TSIWAMMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWN DLDVNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDR SQY QHLLRLEDGTDIMKGRTE WRPKNAGTNGAIST GKTSNGN S V S SEP ID NO: 144
ChtFATB2g amino acid sequence
MWAATASSAFFPVPVPGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHPK
ANGSAVSLKSGSLNTQEDTSSSPPPRTFLNQLPDWSRLLTAISTVFVAAEKQLTMLDRKSKRPD
MLVDWFGLESIVQDGLVFREIYSIRSYEISADRTTSIETVMNLLQETSLNHCKSMGILNDGFGRT
PEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRATSI
WAMMNQKTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSICKGLTPEWNDL
DVNQHVSNVKYIGWILESMPKEVLDTQELCSLTLEYRRECGRDSVLESVTAMDPSKVGDRSQ
YQHLLRLEDGTDIMKGRTEWRPKNAGTNGAISTGKTSNANSVS SEP ID NO: 145
ChtFATB2h amino acid sequence
MWAAAASSAFFPVPASGTSPKPGKFGTWLSSSSPSYKPKSNPSGGFQVKANASAHP KANGSAV SLKSGSLNTQEGTSSSPPPRTFLNQLPDW SRLLTAISTVFVAAEKQLTMLDRKSKRP DMLVDWFGLESIVQDGLVFRESYSIRSYEISADRTASIETVMNLLQETSLNHCKSMGILNDGFG RTPEMCKRDLIWVLTKMQILVNRYPNWGDTVEINSWFSQSGKIGMGRNWLISDCNTGEILIRA TSIWAMMNQNTRRFSKLPNEVRQEIAPHFVDAPPVIEDNDRKLHKFDVKTGDSIRKGLTPGWN DLDVNQHVSNVKYIGWILESIPTEVLETQELCSLTLEYRRECGRESVLESVTAMNPSKVGDRSQ YQHLLRLEDGADIMKGRTEWRPKNAGTNGAISTGKTSNGNSVS SEP ID NO: 146
ChtFATB3a amino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKIN GSSV SLKSCSLKTHEDTPSAPPPRTFINQLPDW SMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTIASGETSPGNS SEP ID NO: 147
ChtFATB3b amino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKIN
GSSVSLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGFGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLIEGFGR
TPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILT
RASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTPK WNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKGF GPQF QHLLRLEDGGEIVKGRTE WRPKT AGINGTIASGET SPGN S SEP ID NO: 148
ChtFATB3c amino acid sequence
MVAT AAS S AFFP VPSPDT S SRPGKLGNGS S SLRPLKPKF VANAGLQVKANAS APPKIN GSSVSLKSCSLKTHEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPF GLGRIVQDGLVFRQNF SIRS YEIGADRT ASIET VMNHLQETALNHVKS AGLLNEGF G RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSEKG FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGAIAFGETSPGDS SEP ID NO: 149
ChtFATB3d amino acid sequence
MVAT AAS S AFFP VPSPDT S SRPGKLGNGS S SLRPLKPKF VANAGLQVKANAS APPKIN GSSVSLKSCSLKTHEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPF GLGRIVQDGLVFRQNF SIRS YEIGADRT ASIKTVMNHLQET ALNHVKSAGLLNEGF G RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG F GPQF QHLLRLEDGGEIVKGRTE WRPKT AGINGTIASGETSPGNS SEP ID NO: 150
ChtFATB3e amino acid sequence
MVATAASSAFFPVPSPDTSSRPGKLGNGSSSLRPLKPKFVANAGLQVKANAS APPKIN GSSVSLKSGSLKTHEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTLASGETSPGNS SEP ID NO: 151
ChtFATB3f amino acid sequence
MVATAASSAFFPVPSPDTSSRLGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKIN
GSSVSLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MPVDPFGLGRTVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSEKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTLASGETSPGNS SEP ID NO: 152
ChtFATB3g amino acid sequence
MVATAASSAFFPVPSPDTSSRAGKLGNGSSSLRPLKPKFVANAGLQVKANASAPPKIN
GSSVSLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRTVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNEGFG
RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL
TRASSVWVMMNQKTRKLSKIPDEVRHEIEPHFVDSAPVIEDDDWKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG
FGPQFQHLLRLEDGGEIVKGRTEWRPKTAGINGTLASGETSPGNS SEP ID NO: 153 ChsFATBl amino acid sequence
MVATNAAAFSAYTFFLTSPTHGYSSKRLADTQNGYPGTSLKSKSTPPPAAAAARNGA
LPLLASICKCPKKADGSMQLDSSLVFGFQFYIRSYEVGADQTVSIQTVLNYLQEAAINHVQSAG
YFGDSFGATPEMTKRNLIWVITKMQVLVDRYPAWGDWQVDTWTCSSGKNSMQRDWFVRD
LKTGDIITRASSVWVLMNRLTRKLSKIPEAVLEEAKLFVMNTAPTVDDNRKLPKLDGSSADYV
LSGLTPRWSDLDMNQHVNNVKYIAWILESVPQSIPETHKLSAITVEYRRECGKNSVLQSLTNV
SGDGITCGNSIIECHHLLQLETGPEILLARTEWISKEPGFRGAPIQAEKVYNNK SEQ ID NO: 154
ChsFATB2 amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKIN GSSVGLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG RTLEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI LTRASSVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG SGSQF QHLLRLEDGGEIVKGRTE WRPKT AGINGPLASGETSPGDS S SEP ID NO: 155
ChsFatB2b amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKIN GSSVGLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG RTLEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI LTRAS SKSQIMLPLHY C S VWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLD EKTADSIRKGLTPKWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESV LESLTAVDPSGKGSGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPLASGETSPGDSS SEP ID NO: 156
ChsFatB2c amino acid sequence
MVATAASSAFFPVPSPDASSRPGKLGNGSSSLSPLKPKLMANGGLQVKANASAPPKIN
GSSVGLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD
MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG
RTLEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEI
LTRASSVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP
KWNDLDVNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGKG
SGSQFQHLMRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS SEP ID NO: 157
ChsFatB2d amino acid sequence
MVAT AAS S AFFPVPSPDAS SRPGKLGNGS S SLSPLKPKLM ANGGLQVKANAS APPKIN GSSVGLKSGSLKTQEDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPD MLVDPFGLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFG RTPEMYKRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEIL TRASSVWVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTP KWNDLDVNQHVNNVKYIGWILESTPPEVLET QELC SLTLE YRRECGRE S VLESLT AVDPSGKG SGSQFQHLLRLEDGGEIVKGRTEWRPKTAGINGPIASGETSPGDSS SEP ID NO: 158
Chs FATB3 amino acid sequence
MVAAEAS S ALF S VRTPGTSPKPGKF GNWPT SLS VPFKSKSNHNGGF QVKANAS ARPK ANGS AV SLKSGSLDTQEDT S S S S SPPRTFINQLPD W SMLLS AITT VF VAAEKQ WTMLDRKSKRP DMLMDPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFG RTPEMCKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIR ATSMCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS SEP ID NO: 159
ChsFatb3b amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFG
RTPEMCKRDLIWWTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIR
ATSVCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRRECGRDSVLESVTAMDPSKEGD
RSLYQHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS SEP ID NO: 160
ChsFatB3c amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFPVKANASARPK
ANGSAVSLKSGSLDTPEDTSSSSSPPRTFINPLPDWSMLLSAITTVFVAAEKPWTMLDRKSKRP
DMLMDPFGVDRWPDGAVFRPSFSIRSYEIGADRTASIETLMNIFPETSLNHCKSIGLLNDGFG
RTPEMCKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIR
ATSMCAMMNPKTRRFSKFPYEVRPELAPHFVDSAPVIEDYPKLHKLDVKTGDSICNGLTPRW
NDLDVNPHVNNVKYIGWILESVPTEVFETPELCGLTLEYRPECGRDSVLESVTAMDPSKEGD
RSLYQHLLRLEDGTDIAKGRTKWRPKNAGKTSNGNSIS SEP ID NO: 161
ChsFATB3d amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPSSLSVPFKSKSNHNGGFPVKANASARPK
ANGSAVSLKSGSLDTOEDASSSSSPPRTFINPLPDWSMLLSAITTVFVAAEKPWTMLDRKSKR
SDMLMDPFGVDRWpDGAVFRpSFSIRSYEIGADRTASIETLMNIFpETSLNHCKSIGLLNDGF
GRTPEMCKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILI
RATSMCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR
WNDLDVNQHVNNVKYIGWILESVPTEVFETpELCGLTLEYRRECGRDSVLESVTAMDPSKEG
DRSLYpHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS SEP ID NO: 162
ChsFATB3e amino acid sequence
MVAAE AS S ALF S VRTPGTSPKPGKF GNWPS SLS VPFKSKSNHNGGF QVKANAS ARPK ANGSAVSLKSGSLDTQEDASSSSSPPRTFINpLPDWSMLLSAITTVFVAAEKQWTMLDRKSKR SDMLMDPFGVDRWQDGWFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGF GRTPEMCKRDLIWWTKMHVE VNRYPT W GDTIE VNT WV SE SGKTGMGRD WLISDCHTGEILI RATSMCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR WNDLDVNQHVNNVKYIGWILESVPTEVFETOELCGLTLEYRRECGRDSVLESVTAMDPSKEG DRSLYOHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS SEP ID NO: 163
ChsFATB3f amino acid sequence
MVAAE AS S ALF S VRTPGTSPKPGKF GNWPS SLS VPFKSKSNHNGGF QVKANAS ARPK ANGSAVSLKSGSLDTQEDTSSSSSPPRTFINOLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRP DMLMDPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFG RTPEMCKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIR ATSMCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW NDLDVNOHVNNVKYIGWILESVPTEVFETOELCGLTLEYRRECGRDSVLESVTAMDPSKEGD RSLY PHLLRLEDGADIAKGRTKWRPKNAGTNGAIST GKT SNGN SIS SEP ID NO: 164
ChsFATB3g amino acid sequence
MVAAEAS S ALF S VRTPGTSPKPGKF GNWPT SLS VPFKSKSNHNGGF QVKANAS ARPK
ANGSAVSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRP
DMLMDPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFG
RTPEMCKRDLIWWTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIR
ATSVCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW
NDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRQECGRDSVLESVTAMDPSKEGD
RSLYQHLLRLEDGTDIAKGRTKWRPKNAGKTSNGNSIS SEP ID NO: 165
ChsFATB3h amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPSSLSVPFKSKSNHNGGFQVKANASARPK
ANGSAVSLKSGSLDTQEDASSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKR
SDMLMDPFGVDRWQDGWFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGF
GRTPEMCKRDLIWWTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILI
RATSVCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPR
WNDLDVNQHVNNVKYIGWILESVPTEVFETQELCGLTLEYRQECGRDSVLESVTAMDPSKEG
DRSLYQHLLRLEDGTDIAKGRTKWRPKNAGKTSNGNSIS SEP ID NO: 166
ChsFATB3i amino acid sequence
MVAAEASSALFSVRTPGTSPKPGKFGNWPTSLSVPFKSKSNHNGGFPVKANASARPK
ANGSAVSLKSGSLDTPEDTSSSSSPPRTFINPLPDWSMLLSAITTVFVAAEKPWTMLDRKSKRP
DMLMDPFGVDRWPDGAVFRPSFSIRSYEIGADRTASIETLMNIFPETSLNHCKSIGLLNDGFG
RTPEMCKRDLIWWTKMHVEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDCHTGEILIR
ATSMCAMMNPKTRRFSKFPYEVRPELAPHFVDSAPVIEDYPKLHKLDVKTGDSICNGLTPRW
NDLDVNOHVNNVKYIGWILESVPTEVFETPELCGLTLEYRRECGGDSVLESVTAMDPSKEGD
RSLYOHLLRLEDGADIAKGRTKWRPKNAGTNGAISTGKTSNGNSIS SEP ID NO: 167
ChsFATB3i amino acid sequence
MVAAEAS S ALF S VRTPGTSPKPGKF GNWPT SLS VPFKSKSNHNGGFQVKANAS ARPK ANGSAVSLKSGSLDTQEDTSSSSSPPRTFINQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRP DMLMDPFGVDRWQDGAVFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSIGLLNDGFG RTPEMCKRDLIWWTKMHIEVNRYPTWGDTIEVNTWVSESGKTGMGRDWLISDFHTGDILIR ATSVCAMMNQKTRRFSKFPYEVRQELAPHFVDSAPVIEDYQKLHKLDVKTGDSICNGLTPRW NDLDVNQHVNNVKYIGWILESVPTEVFETpELCGLTLEYRQECGRDSVLESVTAMDPSKEGD RSLY QHLLRLEDGTDIAKGRTKWRPKNAGKT SNGN SIS SEP ID NO: 168
CcalcFATBl (Cuphea calcarata FATB1)
MVAAAATS AFFPVPAPGT SPNPRKF GS WPS SLSP SLPKSIPNGGF OVKANAS AHPKANGS AVSL
KSGSLNTQENT S S SPPPRTFLHQLPD W SRLLT AITT VF VKSKRPDMHDRKSKRPDMLVDLF GLE
SSVQDALVFRQSFSIRSYEIGTDRTASIETLMNHLOETSLNHCKSTGILLDGFGRTLEMCKRELI
WWIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK
TRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIQKGLSPGWNDLDVNQHVSN
VKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPSKVGGRSQYQHLLRLED
GT AIVNGITE WRPKNAGANGAISTGKT SNGN S V S SEP ID NO: 169
ChookFATB4 (Cuphea hookeriana FATB4)
MVAAAATS AFFPVPAPGT SPNPRKF GS WPS SLSP SLPNSIPNGGF OVKANAS AHPKANGS AVSL KSGSLNTQENT S S SPPPRTFLHQLPD W SRLLT AITT VF VKSKRPDMHDRKSKRPDMLVDLF GLE S S V QDALVFRQRF SIRS YEIGTDRTASMETLMNHLQET SLNHCKSTGILLDGF GRTLEMCKREL IWWIKMQIQVNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQK TRRLSKLPYEVRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIHKGLTPGWNDLDVNQHVNN
VKYIGWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAMDPSGGGYGSQFQHLLRLEDG
GEIVKGRTEWRPKNGVINGWPTGESSPGDYS SEP ID NO: 170
CaFATBl (Cuphea avigera var. pulcherrima FATB1)
MVAAAASSAFFSVPVPGTSPKPGKFRIWPSSLSPSFKPKPIPNGGLOVKANSRAHPKANGSAVS
LKSGSLNTQEDTS S SPPPRTFLHQLPD W SRLLT AITTVF VKSKRPDMHDRKSKRPDMLMDSF G
LESIVQEGLEFRQSFSIRSYEIGTDRTASIETLMNYLQETSLNHCKSTGILLDGFGRTPEMCKRDL
IWWTKMKIKVNRYPAWGDTVEINTWFSRLGKIGKGRDWLISDCNTGEILIRATSAYATMNQ
KTRRLSKLPYEVHQEIAPLFVDSPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVS
NVKYIGWILESMPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPTKVGGRSQYQHLLRLE
DGTDIVKCRTEWRPKNPGANGAISTGKTSNGNSVS SEP ID NO: 171
CpauFATBl (Cupheapaucipetala FATB1)
MVAAAASSAFFPVPAPGTSPKPGKSGNWPSSLSPSIKPMSIPNGGFOVKANASAHPKANGSAV
NLKSGSLNTQEDTS SSPPPRAFLNQLPDW SMLLTAITTVFVAAEKQWTMRDRKSKRPDMLVD
SVGLKSWLDGLVSRQIFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMC
KNDLIWVLTKMQIMVNRYPTWGDTVEINTWFSHSGKIGMASDWLITDCNTGEILIRATSVWA
MMNQKTRRFSRLPYEVRQELTPHYVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVSNVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSEDEGRSQYKH
LLRLEDGTDIVKGRTEWRPKNAGTNGAISTAKPSNGNSVS SEP ID NO: 172
CprocFATBl (Cuphea procumbens FATB1)
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFOVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVSNVKYIGWILESMPIEVLEAQELCSLTVEYRRECGMDSVLESVTAVDPSEDGGRSQYN
HLLRLEDGTDWKGRTEWRPKNAETNGAISPGNTSNGNSIS SEP ID NO: 173
CprocFATB2 (Cuphea procumbens FATB2)
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFOVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRQECGRESVLESLTAVDPSGKGFGSQFQH
LLRLEDGGEIVKGRTEWRPKTAGINGAIASGETSPGDF SEP ID NO: 174
CprocFATB3 (Cuphea procumbens FATB3)
MVAAAASSAFFPAPAPGSSPKPGKSGNWPSSLSPSFKSKSIPYGRFOVKANASAHPKANGSAV
NLKSGSLNTQEDTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVD
SVGLKNIVRDGLVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGM
CKNDLIWVLTKMQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVW
AMMNQKTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLD
VNQHVNNVKYIGWILESTPPE VLETQELC SLTLEYRRECGRES VLE SLT AVDPSGEGGY GSQFQ
HLLRLEDGGEIVKGRTEWRPKNAGINGVLPTGE SEP ID NO: 175
CigneaFATBl (Cuphea ignea FATB1)
PGT SRKTGKF GNWPS SLSPSFKPKSIPNGGF OVKANARAHPKANGS AY SLKS V SLNTOEDT SLS
PPPRAFLNQLPDWRMLRTALTTVFVAAEKQWTMLDRKSKRPDMLVDSFGLESIVQEGLVFRQ
SFSIRSYEIGIDRTASIETLMNHLQETSLNQCKSAGILHDGFGRTLEMCKRDLIWWTKMQIKV
NRYPAWGDTVEISTRFSRLGKIGMGRDWLICDCNTGEILIRATSAYAMMNQKTRRLSKLPNEV
RQEIAPLF VD SDPVIEENDMKLHKFEVKT GDSICKGLTPRW SDLDVNQHV SNVKYIGWILESM
PTEVLETQELCSLALEYRRECGRDSVLESVTSMDPSKVGGWSQYQHLLRLEDGADIVKGRTE
WRPKNAGANGAISTGKT SEP ID NO: 176
CcalcFATBl (Cuphea calcarata FATB1)
MAT ASTF S AFNARCGDLRRS AGSGPRRPARPLPVRAAI|NASAHPKANGSAV SLKSGSLETQED NS S S SRPPRTFIKQLPD W SMLLSAITTVFVAAEKQWTMFDRKSKRSDMLVDPFWDRIVQDGV LFRQSFSIRSYEIGADRTASIETLMNIFQETSLNHCKSMGLLYEGFGRTPEMCKRDLIWWTKIH IKVNRYPTWGDTIEVTTWVSESGKNGMGRDWLISDCHTGEILIRATSVWAMMNQTTRRLSKF PYELRQEIAPHFVDSDPVIEDNRRLLNFDVKTGDSIRKGLTPRWNDLDVNQHVNNVKYIGWIL ESVPTEVFDTRELCGLTLEYRQECGRGSVLESVTAMDPSKEGDRSLYQHLLRLEDGTDIVKGR TEWRPKNAGTNGPV STRKTTNGSSV S SEP ID NO: 177
ChookFATB4 (Cuphea hookeriana FATB4)
MAT ASTF s afnarcgdlrrsagsgprrparplpvraai|nasahpkangsav SLKSGSLNTQEN
TSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLVDLFGLESSVQDALVFR
QRFSIRSYEIGTDRTASMETLMNHLQETSLNHCKSTGILLDGFGRTLEMCKRELIWWIKMQIQ
VNRYPAWGDTVEINTRFSRLGKIGMGRDWLISDCNTGEILIRATSEYAMMNQKTRRLSKLPYE
VRQEIAPLFVDSPPVIEDNDLKVHKFEVKTGDSIHKGLTPGWNDLDVNQHVNNVKYIGWILES
TPPEVLETQELCSLTLEYRRECGRESVLESLTAMDPSGGGYGSQFQHLLRLEDGGEIVKGRTE
WRPKNGVINGWPTGESSPGDYS SEP ID NO: 178
CaFATBl (Cuphea avigera var. pulcherrima FATB1) matastfsafnarcgdlrrsagsgprrparplpvraaiNsrahpkangsavslksgslntqed
TSSSPPPRTFLHQLPDWSRLLTAITTVFVKSKRPDMHDRKSKRPDMLMDSFGLESIVQEGLEFR
QSFSIRSYEIGTDRTASIETLMNYLQETSLNHCKSTGILLDGFGRTPEMCKRDLIWWTKMKIK
VNRYPAWGDTVEINTWFSRLGKIGKGRDWLISDCNTGEILIRATSAYATMNQKTRRLSKLPYE
VHQEIAPLFVDSPPVIEDNDLKLHKFEVKTGDSIHKGLTPGWNDLDVNQHVSNVKYIGWILES
MPTEVLETQELCSLALEYRRECGRDSVLESVTAMDPTKVGGRSQYQHLLRLEDGTDIVKCRTE
WRPKNPGANGAISTGKTSNGNSVS SEP ID NO: 179
CpauFATBl (Cupheapaucipetala FATB1)
MATASTFSAFNARCGDLRRSAGSGPRRPARPLPVRAAltNASAHPKANGSAVNLKSGSLNTQE
DTSSSPPPRAFLNQLPDWSMLLTAITTVFVAAEKQWTMRDRKSKRPDMLVDSVGLKSWLDG
LVSRQIFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPTWGDTVEINTWFSHSGKIGMASDWLITDCNTGEILIRATSVWAMMNQKTRRFS
RLPYEVRQELTPHYVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVSNVKYI
GWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSEDEGRSQYKHLLRLEDGTDI
VKGRTE WRPKNAGTNGAIST AKPSNGN S V S SEP ID NO: 180
CprocFATBl (Cuphea procumbens FATB1)
MATASTFSAFNARCGDLRRSAGSGPRRPARPLPVRAAltNASAHPKANGSAVNLKSGSLNTQE
DTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVWAMMNQKTRRFS
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVSNVKYI
GWILESMPIEVLEAQELCSLTVEYRRECGMDSVLESVTAVDPSEDGGRSQYNHLLRLEDGTDV
VKGRTEWRPKNAETNGAISPGNTSNGNSIS SEP ID NO: 181
CprocFATB2 (Cuphea procumbens FATB2)
MATASTFSAFNARCGDLRRSAGSGPRRPARPLPVRAAltNASAHPKANGSAVNLKSGSLNTQE
DTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVWAMMNQKTRRFS
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVNNVKYI
GWILESTPPEVLETQELCSLTLEYRQECGRESVLESLTAVDPSGKGFGSQFQHLLRLEDGGEIV
KGRTEWRPKTAGINGAIASGETSPGDF SEP ID NO: 182
CprocFATB3 (Cuphea procumbens FATB3)
MATASTFSAFNARCGDLRRSAGSGPRRPARPLPVRAAllNASAHPKANGSAVNLKSGSLNTQE
DTSSSPPPRAFLNQLPDWSMLLSAITTVFVAAEKQWTMLDRKSKRPDMLVDSVGLKNIVRDG
LVSRQSFLIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLIWVLTK
MQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVWAMMNQKTRRFS
RLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVNNVKYI
GWILESTPPEVLETQELCSLTLEYRRECGRESVLESLTAVDPSGEGGYGSQFQHLLRLEDGGEI
VKGRTEWRPKNAGINGVLPTGE SEP ID NO: 183
CigneaFATBl (Cuphea ignea FATB1)_ matastfsafnarcgdlrrsagsgprrparplpvraai|narahpkangsavslksvslntqed
TSLSPPPRAFLNQLPDWRMLRTALTTVFVAAEKQWTMLDRKSKRPDMLVDSFGLESIVQEGL
VFRQSFSIRSYEIGIDRTASIETLMNHLQETSLNQCKSAGILHDGFGRTLEMCKRDLIWWTKM
QIKVNRYPAWGDTVEISTRFSRLGKIGMGRDWLICDCNTGEILIRATSAYAMMNQKTRRLSKL
PNEVRQELAPLFVDSDPVIEENDMKLHKFEVKTGDSICKGLTPRW SDLDVNQHV SNVKYIGWI
LESMPTEVLETQELCSLALEYRRECGRDSVLESVTSMDPSKVGGWSQYQHLLRLEDGADIVK
GRTEWRPKNAGANGAISTGKT SEP ID NO: 184
CgFATBl (Cuphea glossostoma FATB1)_ MVAAAASSAFFPSPAPGSSPKPGNRPSSLSPSFKPKSIPNGAFQVKANASAHPKANGSAVNLKSl
GSLNTQEDSSSSPSPRAFLNQLPDWSVLLTAITTVFVAAEKQWTMLDRKSKRPDVLVDSVGLK SIVQDGLVSRQSFSIRSYEIGADRTASIETLMNHLQETSINHCKSLGLLNDGFGRTPGMCKNDLl|
wvltkmqimvnrypawgdtveintwfsqsgkigmgsdwlisdcntgeiliratsvwammnqI
KTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVS
NVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVTAMDPSEDGGRSQYNHLLRLEI DGTDWKGRTEWRPKNAGTNGAISTTKTSNGNSVSl SEP ID NO: 185
CgFATBlb (Cuphea glossostoma FATB1 C170F,M198T,T374S variant)
MVAAAASSAFFPSPAPGSSPKPGNRPSSLSPSFKPKSIPNGAFQVKANASAHPKANGSAVNLKS
GSLNTQEDSSSSPSPRAFLNQLPDWSVLLTAITTVFVAAEKQWTMLDRKSKRPDVLVDSVGLK
SIVQDGLVSRQSFSIRSYEIGADRTASIETLMNHLQETSINHFKSLGLLNDGFGRTPGMCKNDLI
WVLTKTQIMVNRYPAWGDTVEINTWFSQSGKIGMGSDWLISDCNTGEILIRATSVWAMMNQ
KTRRFSRLPYEVRQELTPHFVDSPHVIEDNDRKLHKFDVKTGDSIRKGLTPRWNDLDVNQHVS
NVKYIGWILESMPIEVLETQELCSLTVEYRRECGMDSVLESVSAMDPSEDGGRSQYNHLLRLE
DGTDWKGRTEWRPKNAGTNGAISTTKTSNGNSVS SEP ID NO: 186
Umbellularia califomica UcFATB3 amino acid sequence
MVATAAASAFFPVGSPATSSATSAKASMMPDNLDARGIKPKPASSSGLQVKANAHASPKINGS
KV STDTLKGEDTLTSSPAPRTFINQLPDW SMFLAAITTIFLAAEKQWTNLDWKPRRPDMLADP
FGIGRFMQDGLIFRQHFAIRSYEIGADRTASIETLMNHLQETALNHVRSAGLLGDGFGATPEMS
RRDLIWWTRMQVLVDRYPAWGDIVEVETWVGASGKNGMRRDWLVRDSQTGEILTRATSV
WVMMNKRTRRLSKIPEEVRGEIGPYFMENVAIIEEDSRKLQKLNENIIEEDSRKLQKLNENTAD
NVRRGLTPRWSDLDVNQHVNNVKYIGWILESAPGSILESHELSCMTLEYRRECGKDSVLQSM TVVSGGGSAAGGSPESSVECDHLLQLESGPEWKARTEWRPKSANNPRSILEMPAESS* SEP ID NO: 187
Cuphea carthagenensis CCrFATB2c (V138L variant of FATB2)
MVAAAASSAFFPVTTPGTSRKPGKFGNWLSSLSPPFRPKSIPSGGFQVKANASAHPKANGSAV
SLKSGSLNTQEDTSSSPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLLFRQSFSIRSYEIGADRRASIETLMNHLQETSLNHCKSIRLLNEGFGRTPEMCK
RDLIWWTRMHIMVNRYPTWGDTVEINTWVSQSGKNGMGRDWLISDCNTGEILIRATSAWA
MMNQKTRRLSKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCFLTLEYRRECGRDSVLESVTAMDPSNEGGRSHYQ HLLRLEDGTDIVKGRTEWRPKNARNIGAISTGKTSNGNPAS* SEP ID NO: 188
Cuphea carthagenensis CCrFATB2
MVAAAASSAFFPVTTPGTSRKPGKFGNWLSSLSPPFRPKSIPSGGFQVKANASAHPKANGSAV
SLKSGSLNTQEDTSSSPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLVFRQSFSIRSYEIGADRRASIETLMNHLQETSLNHCKSIRLLNEGFGRTPEMCK
RDLIWWTRMHIMVNRYPTWGDTVEINTWVSQSGKNGMGRDWLISDCNTGEILIRATSAWA
MMNQKTRRLSKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCFLTLEYRRECGRDSVLESVTAMDPSNEGGRSHYQ HLLRLEDGTDIVKGRTEWRPKNARNIGAISTGKTSNGNPAS* SEP ID NO: 189
CcrFATB2b
MVAAAASSAFFPVTTPGTSRKPGKFGNWLSSLSPPFRPKSIPSGGFQVKANASAHPKANGSAV
SLKSGSLNTQEDTSSSPPPRAFINQLPDWSMLLTAITTVFVAAEKQWTMLDRKSKRSDMLVDS
FGMERIVQDGLVFRQSFSIRSYEIGADRRASIETLMNHLQETSLNHCKSIRLLNEGFGRTPEMCK
RDLIWVFTRMHIMVNRYPTWGDTVEINTWVSQSGKNGMGRDWLISDCNTGEILIRATSAWA
MMNQKTRRLSKLPYEVSQEIAPHFVDSPPVIEDGDRKLHKFDVKTGDSIRKGLTPRWNDLDV
NQHVNNVKYIGWILESMPTEVLETHELCFLTLEYRRECGRDSVLESVTAMDPSNEGGRSHYQ HLLRLEDGTDIVKGRTEWRPKNARNIGAIPTGKTSNGNPAS* SEP ID NO: 190 CcrFATBl
MVATAASSAFFPVPSPDSSSRPGKLGNGPSSLSPLKPKSTPNGGLQVKANASAPPKINGSSVGL
KSSSLKTQDDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGKESVLESLTAVDPSGKGWGSHFQ HLLRLEDGGEIVKGRTEWRPKNAGINGAVAFEETSPGDS* SEP ID NO: 191
CcrFATBlb
MVATAASSAFFPVPSPDSSSRPGKLGNGPSSLSPLKPKSTPNGGLQVKANASAPPKINGSSVGL
KSSSLKTQDDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIAQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGKESVLESLTAVDPSGKGWGSHFQ HLLRLEDGGErVKGRTEWRPKNAGINGAVAFEETSPGDS* SEP ID NO: 192 CCrFATBlc
MVATAASSAFFPVPSPDSSSRPGKLGNGPSSLSPLKPKSTPNGGLQVKANASAPPKINGSSVGL
KSSSLKTQDDTPSAPPPRTFINQLPDWSMLLAAITTVFLAAEKQWMMLDWKPKRPDMLTDPF
GLGRIVQDGLVFRQNFSIRSYEIGADRTASIETVMNHLQETALNHVKSAGLLNDGFGRTPEMY
KRDLIWWAKMQVMVNRYPTWGDTVEVNTWVAKSGKNGMRRDWLISDCNTGEILTRASSV
WVMMNQKTRRLSKIPDEVRHEIEPHFVDSAPVIEDDDRKLPKLDEKTADSIRKGLTPKWNDLD
VNQHVNNVKYIGWILESTPPEVLETQELCSLTLEYRRECGKESVLESLTAVDPSGKGWGSHFQ HLLRLEDGGEIVKGRTEWRPKNA*
Claims (16)
- THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS:1. A recombinant nucleic acid encoding an acyl-ACP thioesterase, wherein the nucleic acid has at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 93, and wherein the nucleic acid encodes an acyl-ACP thioesterase having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179, or a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof.
- 2. A vector comprising the recombinant nucleic acid of claim 1.
- 3. A method of producing a recombinant acyl-ACP thioesterase, the method comprising transforming a Prototheca microalgal cell with a nucleic acid according to claim 1.
- 4. A method of producing a recombinant acyl-ACP thioesterase, the method comprising transforming a Prototheca microalgal cell with a nucleic acid encoding an acyl-ACP thioesterase having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179 or a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof.
- 5. A Prototheca microalgal host cell comprising a recombinant nucleic acid of claim 1, wherein the recombinant nucleic acid comprises an exogenous nucleic acid, and wherein the host cell has a fatty acid profile having increased C8 and/or CIO fatty acids.
- 6. A Prototheca microalgal host cell comprising a recombinant nucleic acid encoding an acyl-ACP thioesterase having at least 96% sequence identity to SEQ ID NOs: 91, 171 or 179 or a C-terminal and/or N-terminal truncated fragment comprising at least 90% of the full-length sequences thereof, and wherein the host cell has a fatty acid profile having increased C8 and/or CIO fatty acids.
- 7. A method for producing an oil or oil-derived product, the method comprising cultivating a host cell of claim 5 or claim 6, and extracting oil produced thereby.
- 8. The method of claim 7, wherein the cultivation is heterotrophic growth on sugar.
- 9. The method of claim 7 or claim 8, further comprising producing a fatty acid, fuel, chemical, or other oil-derived product from the oil.
- 10. An oil produced by the method of claim 7 or claim 8.
- 11. The oil of claim 10 having a fatty acid profile comprising at least about 8% C8 and/or at least about 9% CIO fatty acids.
- 12. An oil-derived product produced by the method of claim 9.
- 13. The oil-derived product of claim 12, wherein the oil is produced by a microalga and lacks C24-alpha sterols.
- 14. The oil-derived product of claim 13, wherein the oil comprises sterols with C24P stereochemistry.
- 15. The oil-derived product of claim 13 or 14, wherein the oil comprises one or more of ergosterol, brassicasterol, 22, 23-dihydrobrassicasterol, clionasterol and poriferasterol.
- 16. The oil-derived product of any one of claims 14 to 15, wherein the oil is free from one or more of β-sitosterol, campesterol, or stigmasterol.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2018267601A AU2018267601A1 (en) | 2013-03-15 | 2018-11-21 | Thioesterases and cells for production of tailored oils |
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361791861P | 2013-03-15 | 2013-03-15 | |
| US13/837,996 | 2013-03-15 | ||
| US61/791,861 | 2013-03-15 | ||
| US13/837,996 US9290749B2 (en) | 2013-03-15 | 2013-03-15 | Thioesterases and cells for production of tailored oils |
| US201361917217P | 2013-12-17 | 2013-12-17 | |
| US61/917,217 | 2013-12-17 | ||
| PCT/US2014/026644 WO2014151904A1 (en) | 2013-03-15 | 2014-03-13 | Thioesterases and cells for production of tailored oils |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2018267601A Division AU2018267601A1 (en) | 2013-03-15 | 2018-11-21 | Thioesterases and cells for production of tailored oils |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU2014236763A1 AU2014236763A1 (en) | 2015-10-01 |
| AU2014236763B2 true AU2014236763B2 (en) | 2018-08-23 |
Family
ID=51581068
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2014236763A Ceased AU2014236763B2 (en) | 2013-03-15 | 2014-03-13 | Thioesterases and cells for production of tailored oils |
| AU2018267601A Abandoned AU2018267601A1 (en) | 2013-03-15 | 2018-11-21 | Thioesterases and cells for production of tailored oils |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2018267601A Abandoned AU2018267601A1 (en) | 2013-03-15 | 2018-11-21 | Thioesterases and cells for production of tailored oils |
Country Status (9)
| Country | Link |
|---|---|
| EP (1) | EP2971024A4 (en) |
| JP (1) | JP2016518112A (en) |
| KR (1) | KR20150128770A (en) |
| CN (1) | CN105143458A (en) |
| AU (2) | AU2014236763B2 (en) |
| BR (1) | BR112015023192A8 (en) |
| CA (1) | CA2904395A1 (en) |
| MX (1) | MX2015011507A (en) |
| WO (1) | WO2014151904A1 (en) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9816079B2 (en) | 2013-01-29 | 2017-11-14 | Terravia Holdings, Inc. | Variant thioesterases and methods of use |
| US9567615B2 (en) | 2013-01-29 | 2017-02-14 | Terravia Holdings, Inc. | Variant thioesterases and methods of use |
| US9783836B2 (en) | 2013-03-15 | 2017-10-10 | Terravia Holdings, Inc. | Thioesterases and cells for production of tailored oils |
| US9290749B2 (en) | 2013-03-15 | 2016-03-22 | Solazyme, Inc. | Thioesterases and cells for production of tailored oils |
| US9765368B2 (en) | 2014-07-24 | 2017-09-19 | Terravia Holdings, Inc. | Variant thioesterases and methods of use |
| BR112017005370A2 (en) * | 2014-09-18 | 2017-12-12 | Terravia Holdings Inc | acyl acp thioesterases and mutants thereof |
| US20180142218A1 (en) | 2016-10-05 | 2018-05-24 | Terravia Holdings, Inc. | Novel acyltransferases, variant thioesterases, and uses thereof |
| BR112019020784A2 (en) | 2017-04-03 | 2020-04-28 | Genomatica Inc | thioesterase variants with improved activity for the production of medium chain fatty acid derivatives |
| EP4090735A1 (en) | 2020-01-16 | 2022-11-23 | Corbion Biotech, Inc. | Beta-ketoacyl-acp synthase iv variants |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1995013390A2 (en) * | 1993-11-10 | 1995-05-18 | Calgene, Inc. | Plant acyl acp thioesterase sequences |
| US5667997A (en) * | 1992-10-30 | 1997-09-16 | Calgene, Inc. | C8 and C10 medium-chain thioesterases in plants |
| US5910631A (en) * | 1993-09-03 | 1999-06-08 | Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. | Middle chain-specific thioesterase genes from Cuphea lanceolata |
| US20130029387A1 (en) * | 2011-07-27 | 2013-01-31 | Iowa State University Research Foundation, Inc. | Materials and methods for using an acyl-acyl carrier protein thioesterase and mutants and chimeras thereof in fatty acid synthesis |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5298421A (en) | 1990-04-26 | 1994-03-29 | Calgene, Inc. | Plant medium-chain-preferring acyl-ACP thioesterases and related methods |
| US5512482A (en) * | 1990-04-26 | 1996-04-30 | Calgene, Inc. | Plant thioesterases |
| US5344771A (en) | 1990-04-26 | 1994-09-06 | Calgene, Inc. | Plant thiosterases |
| US5639790A (en) | 1991-05-21 | 1997-06-17 | Calgene, Inc. | Plant medium-chain thioesterases |
| US5654495A (en) | 1992-10-30 | 1997-08-05 | Calgene, Inc. | Production of myristate in plant cells |
| US5850022A (en) | 1992-10-30 | 1998-12-15 | Calgene, Inc. | Production of myristate in plant cells |
| US5807893A (en) | 1993-11-18 | 1998-09-15 | Voelker; Toni Alois | Plant thioesterases and use for modification of fatty acid composition in plant seed oils |
| EP2351845A1 (en) | 2007-06-01 | 2011-08-03 | Solazyme, Inc. | Renewable chemicals and fuels from oleaginous yeast |
| PL2167667T3 (en) * | 2007-07-09 | 2015-10-30 | Bayer Cropscience Nv | Brassica plant comprising mutant fatty acyl-acp thioesterase alleles |
| US7982035B2 (en) | 2007-08-27 | 2011-07-19 | Duquesne University Of The Holy Spirit | Tricyclic compounds having antimitotic and/or antitumor activity and methods of use thereof |
| WO2009153208A1 (en) * | 2008-06-20 | 2009-12-23 | Basf Plant Science Gmbh | Plants having enhanced yield-related traits and a method for making the same |
| ES2742527T3 (en) | 2008-11-28 | 2020-02-14 | Corbion Biotech Inc | Manufacture of customized oils in recombinant heterotrophic microorganisms |
| CA2758301A1 (en) * | 2009-04-10 | 2010-10-14 | Ls9, Inc. | Production of fatty acid derivatives |
| EP2576800B1 (en) | 2010-05-28 | 2019-01-30 | Corbion Biotech, Inc. | Method for producing oils from Prototheca |
| CN110066836A (en) | 2011-02-02 | 2019-07-30 | 柯碧恩生物技术公司 | Originate from the customization oil of recombination oleaginous microorganism |
| CN102586350A (en) * | 2012-01-09 | 2012-07-18 | 北京化工大学 | Production method for C8:0/C10:0/C12:0/C14:0 medium-chain fatty acid and ethyl ester thereof |
| EP2839018B1 (en) | 2012-04-18 | 2019-06-05 | Corbion Biotech, Inc. | Tailored oils |
-
2014
- 2014-03-13 AU AU2014236763A patent/AU2014236763B2/en not_active Ceased
- 2014-03-13 BR BR112015023192A patent/BR112015023192A8/en not_active Application Discontinuation
- 2014-03-13 JP JP2016502205A patent/JP2016518112A/en active Pending
- 2014-03-13 KR KR1020157027058A patent/KR20150128770A/en not_active Withdrawn
- 2014-03-13 CA CA2904395A patent/CA2904395A1/en not_active Abandoned
- 2014-03-13 WO PCT/US2014/026644 patent/WO2014151904A1/en not_active Ceased
- 2014-03-13 EP EP14769502.7A patent/EP2971024A4/en not_active Withdrawn
- 2014-03-13 CN CN201480020002.5A patent/CN105143458A/en active Pending
- 2014-03-13 MX MX2015011507A patent/MX2015011507A/en unknown
-
2018
- 2018-11-21 AU AU2018267601A patent/AU2018267601A1/en not_active Abandoned
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5667997A (en) * | 1992-10-30 | 1997-09-16 | Calgene, Inc. | C8 and C10 medium-chain thioesterases in plants |
| US5910631A (en) * | 1993-09-03 | 1999-06-08 | Max-Planck-Gesellschaft Zur Forderung Der Wissenschaften E.V. | Middle chain-specific thioesterase genes from Cuphea lanceolata |
| WO1995013390A2 (en) * | 1993-11-10 | 1995-05-18 | Calgene, Inc. | Plant acyl acp thioesterase sequences |
| US20130029387A1 (en) * | 2011-07-27 | 2013-01-31 | Iowa State University Research Foundation, Inc. | Materials and methods for using an acyl-acyl carrier protein thioesterase and mutants and chimeras thereof in fatty acid synthesis |
Non-Patent Citations (2)
| Title |
|---|
| JING, F., ET AL., BMC Biochemistry, 2011, 12.1, 44. & Genebank Accession No. JF338906 27 Aug 2011. * |
| MITTENDORF, V., ET AL., "Polyhydroxyalkanoate synthesis in transgenic plants as a new tool to study carbon flow through ß-oxidation.", The Plant Journal, 1999, 20.1, 45-55. & Genebank Accession No. AJ131740 14 Nov 2006. * |
Also Published As
| Publication number | Publication date |
|---|---|
| MX2015011507A (en) | 2016-04-07 |
| WO2014151904A1 (en) | 2014-09-25 |
| AU2018267601A1 (en) | 2018-12-06 |
| EP2971024A1 (en) | 2016-01-20 |
| BR112015023192A8 (en) | 2018-01-02 |
| BR112015023192A2 (en) | 2017-11-21 |
| CN105143458A (en) | 2015-12-09 |
| AU2014236763A1 (en) | 2015-10-01 |
| EP2971024A4 (en) | 2016-11-16 |
| JP2016518112A (en) | 2016-06-23 |
| KR20150128770A (en) | 2015-11-18 |
| CA2904395A1 (en) | 2014-09-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10557114B2 (en) | Thioesterases and cells for production of tailored oils | |
| AU2014236763B2 (en) | Thioesterases and cells for production of tailored oils | |
| US10125382B2 (en) | Acyl-ACP thioesterases and mutants thereof | |
| US10316299B2 (en) | Ketoacyl ACP synthase genes and uses thereof | |
| US10053715B2 (en) | Tailored oils | |
| US20200392470A1 (en) | Novel acyltransferases, variant thioesterases, and uses thereof | |
| CA2870364A1 (en) | Recombinant microbes with modified fatty acid synthetic pathway enzymes and uses thereof | |
| US9290749B2 (en) | Thioesterases and cells for production of tailored oils | |
| EP4514987A2 (en) | Regiospecific incorporation of fatty acids in triglyceride oil | |
| CA3060515A1 (en) | Novel acyltransferases, variant thioesterases, and uses thereof | |
| WO2025096830A1 (en) | Regiospecific triglyceride oils and blends and related food and nutrition applications | |
| JP2016034258A (en) | Method for producing lipid using diacylglycerol acyltransferase |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PC1 | Assignment before grant (sect. 113) |
Owner name: CORBION BIOTECH, INC. Free format text: FORMER APPLICANT(S): TERRAVIA HOLDINGS, INC. |
|
| FGA | Letters patent sealed or granted (standard patent) | ||
| MK14 | Patent ceased section 143(a) (annual fees not paid) or expired |