NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F081503

Metagenome Family F081503

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F081503
Family Type Metagenome
Number of Sequences 114
Average Sequence Length 299 residues
Representative Sequence VTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLT
Number of Associated Samples 87
Number of Associated Scaffolds 114

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 65.48 %
% of genes near scaffold ends (potentially truncated) 46.49 %
% of genes from short scaffolds (< 2000 bps) 48.25 %
Associated GOLD sequencing projects 77
AlphaFold2 3D model prediction Yes
3D model pTM-score0.44

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (61.404 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil
(27.193 % of family members)
Environment Ontology (ENVO) Unclassified
(40.351 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified
(47.368 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 21.49%    β-sheet: 36.12%    Coil/Unstructured: 42.39%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.44
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 114 Family Scaffolds
PF00196GerE 8.77
PF01750HycI 3.51
PF01541GIY-YIG 2.63
PF00027cNMP_binding 2.63
PF07715Plug 2.63
PF08459UvrC_RNaseH_dom 2.63
PF13432TPR_16 1.75
PF01425Amidase 0.88
PF13492GAF_3 0.88
PF00474SSF 0.88
PF12006DUF3500 0.88
PF02518HATPase_c 0.88
PF04955HupE_UreJ 0.88
PF02151UVR 0.88
PF13460NAD_binding_10 0.88
PF01850PIN 0.88
PF03729DUF308 0.88
PF00696AA_kinase 0.88
PF08220HTH_DeoR 0.88
PF01979Amidohydro_1 0.88
PF01364Peptidase_C25 0.88
PF14520HHH_5 0.88

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 114 Family Scaffolds
COG0680Ni,Fe-hydrogenase maturation factorEnergy production and conversion [C] 3.51
COG0322Excinuclease UvrABC, nuclease subunitReplication, recombination and repair [L] 2.63
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 0.88
COG2370Hydrogenase/urease accessory protein HupEPosttranslational modification, protein turnover, chaperones [O] 0.88
COG3247Acid resistance membrane protein HdeD, DUF308 familyGeneral function prediction only [R] 0.88


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms61.40 %
UnclassifiedrootN/A38.60 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300003994|Ga0055435_10006042All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae2092Open in IMG/M
3300004156|Ga0062589_100704341Not Available897Open in IMG/M
3300004479|Ga0062595_100472288Not Available930Open in IMG/M
3300005356|Ga0070674_100748564Not Available839Open in IMG/M
3300005844|Ga0068862_100749442Not Available950Open in IMG/M
3300006224|Ga0079037_100519490All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1146Open in IMG/M
3300006845|Ga0075421_100152448All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2887Open in IMG/M
3300006847|Ga0075431_100756452Not Available947Open in IMG/M
3300009075|Ga0105090_10096506All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1850Open in IMG/M
3300009082|Ga0105099_10252751All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1022Open in IMG/M
3300009091|Ga0102851_10023857All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium4555Open in IMG/M
3300009100|Ga0075418_10261023Not Available1845Open in IMG/M
3300009100|Ga0075418_10452690Not Available1375Open in IMG/M
3300009111|Ga0115026_10302924All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1119Open in IMG/M
3300009111|Ga0115026_10352412All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1050Open in IMG/M
3300009147|Ga0114129_10064888All Organisms → cellular organisms → Bacteria5096Open in IMG/M
3300009157|Ga0105092_10099945All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1591Open in IMG/M
3300009166|Ga0105100_10059018All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium2227Open in IMG/M
3300009167|Ga0113563_10780512All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1080Open in IMG/M
3300009167|Ga0113563_11296268All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium851Open in IMG/M
3300009609|Ga0105347_1247159All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium732Open in IMG/M
3300012964|Ga0153916_10555430All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1222Open in IMG/M
3300014324|Ga0075352_1059438All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium922Open in IMG/M
3300014864|Ga0180068_1030914All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium843Open in IMG/M
3300015257|Ga0180067_1010917All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1601Open in IMG/M
3300018077|Ga0184633_10054731All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium2033Open in IMG/M
3300021090|Ga0210377_10026187All Organisms → cellular organisms → Bacteria4214Open in IMG/M
3300021090|Ga0210377_10083642All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium2152Open in IMG/M
3300024056|Ga0124853_1325890All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2097Open in IMG/M
3300025165|Ga0209108_10067360All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1964Open in IMG/M
3300025324|Ga0209640_10019337All Organisms → cellular organisms → Bacteria5944Open in IMG/M
3300025551|Ga0210131_1017848Not Available1018Open in IMG/M
3300025917|Ga0207660_10227019Not Available1467Open in IMG/M
3300025937|Ga0207669_10736405Not Available813Open in IMG/M
3300027871|Ga0209397_10155631All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1019Open in IMG/M
3300027890|Ga0209496_10119048All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1162Open in IMG/M
3300027902|Ga0209048_10057714All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3133Open in IMG/M
3300027909|Ga0209382_10345534All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1668Open in IMG/M
(restricted) 3300028043|Ga0233417_10123956All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1102Open in IMG/M
3300028380|Ga0268265_10664003Not Available1003Open in IMG/M
3300030606|Ga0299906_10014584All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae6295Open in IMG/M
3300030613|Ga0299915_10045384All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3272Open in IMG/M
3300030619|Ga0268386_10046515All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium3419Open in IMG/M
3300030620|Ga0302046_10382718All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1156Open in IMG/M
3300031576|Ga0247727_10007107All Organisms → cellular organisms → Bacteria21562Open in IMG/M
3300031576|Ga0247727_10013123All Organisms → cellular organisms → Bacteria14044Open in IMG/M
3300031682|Ga0318560_10305798All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium857Open in IMG/M
3300031854|Ga0310904_10161477Not Available1313Open in IMG/M
3300031949|Ga0214473_10029970All Organisms → cellular organisms → Bacteria6375Open in IMG/M
3300031949|Ga0214473_10118978All Organisms → cellular organisms → Bacteria3084Open in IMG/M
3300031949|Ga0214473_10290140All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1871Open in IMG/M
3300031965|Ga0326597_10036770All Organisms → cellular organisms → Bacteria6195Open in IMG/M
3300031997|Ga0315278_10194083All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium2088Open in IMG/M
3300032143|Ga0315292_10062509All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium2802Open in IMG/M
3300032164|Ga0315283_10483609All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1345Open in IMG/M
3300032397|Ga0315287_10157825All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2626Open in IMG/M
3300032421|Ga0310812_10256700Not Available770Open in IMG/M
3300032516|Ga0315273_10107779All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3807Open in IMG/M
3300032955|Ga0335076_10010657All Organisms → cellular organisms → Bacteria → Acidobacteria9263Open in IMG/M
3300033004|Ga0335084_10017396All Organisms → cellular organisms → Bacteria7273Open in IMG/M
3300033004|Ga0335084_10818276All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium945Open in IMG/M
3300033406|Ga0316604_10177285All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1146Open in IMG/M
3300033408|Ga0316605_10345951All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1323Open in IMG/M
3300033412|Ga0310810_10469050Not Available1267Open in IMG/M
3300033414|Ga0316619_10084493All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1967Open in IMG/M
3300033416|Ga0316622_100006003All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium9152Open in IMG/M
3300033416|Ga0316622_100384061All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1573Open in IMG/M
3300033416|Ga0316622_100672454All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1196Open in IMG/M
3300033433|Ga0326726_10032138All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia → Pirellulales → Thermoguttaceae → Thermogutta → Thermogutta terrifontis4573Open in IMG/M
3300033481|Ga0316600_10208856All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1278Open in IMG/M
3300033482|Ga0316627_100102116All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1969Open in IMG/M
3300033482|Ga0316627_100643222All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium974Open in IMG/M
3300033483|Ga0316629_10124079All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1513Open in IMG/M
3300033486|Ga0316624_10320454All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1265Open in IMG/M
3300033486|Ga0316624_10405042All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1142Open in IMG/M
3300033486|Ga0316624_10865373All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium807Open in IMG/M
3300033488|Ga0316621_10276597All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1092Open in IMG/M
3300033513|Ga0316628_100068325All Organisms → cellular organisms → Bacteria3809Open in IMG/M
3300033513|Ga0316628_102422843All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium694Open in IMG/M
3300033521|Ga0316616_100003647All Organisms → cellular organisms → Bacteria7428Open in IMG/M
3300033521|Ga0316616_100659028All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1243Open in IMG/M
3300033521|Ga0316616_101118064All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium994Open in IMG/M
3300033557|Ga0316617_100347991All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1281Open in IMG/M
3300034147|Ga0364925_0072997All Organisms → cellular organisms → Bacteria → Calditrichaeota → Calditrichia → Calditrichales → Calditrichaceae → unclassified Calditrichaceae → Calditrichaceae bacterium1190Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil27.19%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil7.89%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment7.02%
WetlandEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland5.26%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands5.26%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere5.26%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment4.39%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.51%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil3.51%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil2.63%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.75%
Freshwater WetlandsEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands1.75%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.75%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm1.75%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.75%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.75%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.75%
Freshwater Lake SedimentEnvironmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment0.88%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater0.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.88%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.88%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland0.88%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.88%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.88%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere0.88%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.88%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.88%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000881Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soilEnvironmentalOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004020Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005356Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaGHost-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006224Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 4 metaGEnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300009075Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 1-3cm March2015EnvironmentalOpen in IMG/M
3300009082Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 1-3cm May2015EnvironmentalOpen in IMG/M
3300009091Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009111Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1EnvironmentalOpen in IMG/M
3300009131Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Open_0915_D1EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009153Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009157Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009166Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm May2015EnvironmentalOpen in IMG/M
3300009167Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG - Illumina Assembly (version 2)EnvironmentalOpen in IMG/M
3300009179Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Plant_0915_D1EnvironmentalOpen in IMG/M
3300009455Groundwater microbial communities from Big Spring, Nevada to study Microbial Dark Matter (Phase II) - Ash Meadows Crystal SpringEnvironmentalOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300011432Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT718_2EnvironmentalOpen in IMG/M
3300012931Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaGEnvironmentalOpen in IMG/M
3300012964Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaGEnvironmentalOpen in IMG/M
3300014324Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1EnvironmentalOpen in IMG/M
3300014864Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231A'_16_10DEnvironmentalOpen in IMG/M
3300015257Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231_16_10DEnvironmentalOpen in IMG/M
3300017944Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MGEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300024056Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (PacBio error correction)EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025538Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025551Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025937Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300027871Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Open_0915_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300027890Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Plant_0915_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300027902Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - CRP12 CR (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300030606Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125EnvironmentalOpen in IMG/M
3300030613Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT92D227EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031640Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f23EnvironmentalOpen in IMG/M
3300031682Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22EnvironmentalOpen in IMG/M
3300031854Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300031997Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G06_0EnvironmentalOpen in IMG/M
3300032143Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_0EnvironmentalOpen in IMG/M
3300032164Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G09_0EnvironmentalOpen in IMG/M
3300032397Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_0EnvironmentalOpen in IMG/M
3300032421Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NN3EnvironmentalOpen in IMG/M
3300032516Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_0EnvironmentalOpen in IMG/M
3300032955Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033406Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day20_CTEnvironmentalOpen in IMG/M
3300033408Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day20_noCTEnvironmentalOpen in IMG/M
3300033412Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NCEnvironmentalOpen in IMG/M
3300033414Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D4_BEnvironmentalOpen in IMG/M
3300033416Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_OW2_C1_D5_CEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033481Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_soil_day5_CTEnvironmentalOpen in IMG/M
3300033482Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D1_CEnvironmentalOpen in IMG/M
3300033483Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_May_M1_C1_D1_AEnvironmentalOpen in IMG/M
3300033485Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D5_AEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M
3300033488Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_OW2_C1_D1_CEnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300033521Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D1_BEnvironmentalOpen in IMG/M
3300033557Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D2_BEnvironmentalOpen in IMG/M
3300034147Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10515524213300000364SoilTTLARDLIERVVSGFVDHVTLELKNLKVNKTGSVKKVVTIGQYELHVKINKVSGKLKTGDPKVTFGGNKVALSMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGMLVLTATAKEILAEPKFPLIKINLKVDPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMQVQGRQVDLGIKLGELAITNDMIWLGA
JGI10215J12807_109011413300000881SoilSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAASPAPSPSGSPSPSASPADPAKSGAAPK
Ga0055435_1000604213300003994Natural And Restored WetlandsTELREKVNQLMEGNPLLEGMPTQPVRVGVPTELARALITKVVTGFVDQVTLELKNLKARKAGTVKKVVTIGEYELDVRIHRVKGRLKTGTPTLGFGGNRVKLGLPVTIASGSGEATIHFKWNGKNVSGAICGDMEISQDVSGGVKPESYPLSGAIELTATAREILASPRFPLIKVRLKVRPSDESWAAVQKILDDKEGLCGYAVDKVNVRGIVERLVDKGFNVRLPTEKLKPMAVPVGIQPTMQVRGQPVALAIKLGHLAITSRMIWLGADVRIELPEAPQPAPSASAPVSSAGS*
Ga0055440_1004598123300004020Natural And Restored WetlandsTELARALITKVVTGFVDQVTLELKNLKARKAGTVKKVVTIGEYELDVRIHRVKGRLKTGTPTLGFGGNRVKLGLPVTIASGSGEATIHFKWNGKNVSGAICGDMEISQDVSGGVKPESYPLSGAIELTATAREILASPRFPLIKVRLKVRPSDESWAAVQKILDDKEGLCGYAVDKVNVRGIVERLVDKGFNVRLPTEKLKPMAVPVGIQPTMQVRGQPVALAIKLGHLAITSRMIWLGADVRIELPEAPQPAPSASAPVSSAGS*
Ga0062589_10070434113300004156SoilKLDALISKDPRIAGMPKAPVRVGVPTSLARDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAASPAPSASPSPSPSPTANAAA
Ga0062595_10047228813300004479SoilGRLALLGLAGALAAPACSQRKDRQSPDELRAEIAALEKERDLLRPKLDQLISKDPRISGMPKTPVRVGVPTTLARDLIERVTSGFVDHVTLELKNLKVNKTGTVKKVVTLGQYELHVLIKRVTGRLKTGKPSVTFGGNQVSLSMPVTVASGSGNANIEFNWDGKGMSDAVCGDLKVNQDVTGGVKPASYPVSGTLVLTATAEQILAQPKFPLIKINLKVDPSDESWAAVQKILDDQEGICGYVVDKVNILKIVQGLIDKGFNVRLPTEKIKPMAIPVGIEPSMQVQGRQVDLGIKLGELAITNDMIWLG
Ga0070674_10074856413300005356Miscanthus RhizosphereEIAALEKERDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGA
Ga0070705_10076459313300005440Corn, Switchgrass And Miscanthus RhizosphereDMLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAI
Ga0066903_10529599713300005764Tropical Forest SoilKKGTVKKVITLGDYELHVVIKRVSGRLKTGKPNVTFGGNRIQLALPVTVASGSGNANIDFKWDGKGMSDAVCGDLQVNRDVNGGVKPANYAVSGALVLTATAQEILAEPKFPLTKINLKIDPSDQSWAAVQKILDDQEGICGYVVDKVNVLKIVQGLIDKGFNIRLPTEKIKPMAVPLGIEPTMEVQGRQVALGIKIGELAITSDMIWLGANVSVGVSPSPSPSAS
Ga0068862_10074944213300005844Switchgrass RhizosphereERADRESPEQIRAEIAALEKERDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAAASSSPSPSPTA
Ga0079037_10051949023300006224Freshwater WetlandsVTVRAPALALATLAVVPVLAVSLSCGRKDRATPEQLRSEIAAFEKERETLRGRFNELILNDPRIEGMPTTPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVTIHRVAGKLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGKGVADATCGDLEVTQEVSGSVRPDTYPVSGGLVLTATAQEILAEPRFPVIKVRLKVNPSAESWGAVQKILDEKEGLCGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVRGQPVALAIRLGQLAITEHVIWLGADVSVSTGAEAAAKMAAREAP*
Ga0075421_10015244823300006845Populus RhizosphereVTVRAGLVFVPAVAALALACGRADRTTSSELRAQIDVLEKERDALDAKLHTLMLSDPLLEGIPTQPVLVGVPTALARELITKVMTGFVDQVTLDLKNLSVKRNGTVRRVVDIGHYEVQVRIHRVTGKLKTGTPTLRFGGNQVRLAMPVSVASGSGRATIAFKWDGIGVSGALCGDMDITRDVTGGVKPANYSVAGGIVFLATPRAILASPRFPLIKVNLKVEPSEASWAAVAKILEEKEGLCGYVLEKVNLLKMVQGLIERGFNVRLPTEKIKPMAVPVGVEPTMQVRGQPVALAIKVGELAITEQMIWLGADVHVNLKDLQLKPSP*
Ga0075431_10075645223300006847Populus RhizosphereVTRRALATRLALLGLAGALAAPACSERKDRQSPDALRAEIAALEKERDLLRPKLDQLVSKDPRIAGMPKTPVRVGVPTTLARDLVERVTSGFVDHVTLELKNLKVNKKGTVKKVVTLGEFELHVLIKRVSGRLKTGKPRVTFGGNRVELSMPVSVASGTGNATINFKWDGKGMSDAVCGDMEITRDVSGGVKPASYPVSGALVLTATAKEILAEPKFPLIKINLKVDPSDESWASVQKILDDQEGICGYVVDKVNILKIVQGLIDKGFNVRLPTEKIKPMAVPVGIEPSM
Ga0105090_1009650613300009075Freshwater SedimentMRAPSAFAALAVSTALAASLACGREDRQTPDKLRAEIQALEKERQSLRQRMDELMVNDPRLKGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVRKVVTIGEYDLDVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVKGGLVLTATAQDILAQPRFPQTRVNLKVVPSPQSWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDNFFNVRLPTEKIKPMAVPVGIEPTMM
Ga0105099_1025275113300009082Freshwater SedimentMRAPSAVAALAVSTALAASLACGRGDRQTPDQLRAEIQALEKERQSLRQRMDELMVNDPRVESMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVRKVVTIGEYDLDVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVRGGLVLTATAKDILAQPRFPQTRVNLKVVPSPQSWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVRGEPVALSIQLGDLAITEDVIWLGARVSVAVGEEAEK
Ga0102851_1002385723300009091Freshwater WetlandsVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGHATINFKWDGKGVADATCGDMEVTQDVSGGVRPDTYPVKGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLTITESVIWLGARVSVAVGEEVAARKAPEKKAPEKKALPPKAPAS*
Ga0075418_1026102323300009100Populus RhizosphereVTRRALATRLALLGLAGALAAPACSERKDRQSPDALRAEIAALEKERDLLRPKLDQLVSKDPRISGMPKTPVRVGVPTTLARDLVERVTSGFVDHVTLELKNLKVNKKGSVKKVVTIGEFELHVLIKRVSGRLKTGKPRVTFGGNRVELSMPVSVASGTGNATINFKWDGKGMSDAVCGDMEITRDVSGGVKPASYPVSGALVLTATAKEILAEPKFPLIKINLKVDPSDESWASVQKILDDQEGICGYVVDKVNILKIVQGLIDKGFNVRLPTEKIKPMAVPVGIEPSMKVKGRQVDLGIKVGELAITGDMIWLGAHVSVDVGEAGSAASPSPAASPAASGKPKAA
Ga0075418_1045269023300009100Populus RhizosphereVTVRAGLVFVPAVAALALACGRADRTTSSELRAQIDVLEKERDALDAKLHTLMLSDPLLEGIPTQPVLVGVPTALARELITKVMTGFVDQVTLDLKNLSVKRNGTVRRVVDIGHYEVQVRIHRVTGKLKTGTPTLRFGGNQVRLAMPVSVASGSGRATIAFKWDGIGVSGALCGDMDITRDVTGGVKPANYSVAGGIVFLATPRAILASPRFPLIKVNLKVEPSEASWAAMAKILEEKEGLCGYVLEKVNLLKMVQGLIERGFNVRLPTEKIKPMAVPVGVEPTMQVRGQPVALAIKVGELAITEQMIWLGADVHVNLKDLQLKPSP*
Ga0115026_1030292413300009111WetlandVTVRSPALVLASVAVAPVLALSLSCGRKDRATPDTLRSDITALEKERETLRGRFNELILNDPRIEGMPTAPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVTIHRVAGKLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGKGVADATCGDLEVTQEVSGSVRPDTYPVSGGLELTATAQEILAEPRFPVIKVRLKVNPSAESWAAVQKILDEKEGICGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVRGQPVALAIRLGQLAITEHVIWLGADVSVSTGAEAAAKPRGSGAPPSASRPAATTASSPSG
Ga0115026_1035241213300009111WetlandMRARTAVLATLVAASVLAVSAACGRKDRETPDGLRADIAALEKERDTLRGRMNELMVRDPRLKGMPATPVKVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTIGQYDLNVTINRVIGKLKTGKPDVTFGGNKVKVALPVTVASGTGNATIHFKWDGKNVAGATCGDLEVTQEVSGSVRPDTYPVSGGLVLTATAKEILAEPRFPLIKIRLKVNPSAESWAAVQKVLDEKEGLCGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPV
Ga0115027_1058055213300009131WetlandAGFVDQVTLELKNLKVKKSGRVRKVVTIGEYDLDVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVQGGLVLTATAQDILAQPRFPQTKVNLKVVPSQESWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVRGQPVALSIKLGDLAITEDVIWLGARVSVAVGEEAAKQIEKKQAEEEKKAEGRKAPAGKAPAARPATGKL
Ga0114129_1006488813300009147Populus RhizosphereQIDVLEKERDALDAKLHTLMLSDPLLEGIPTQPVLVGVPTALARELITKVMTGFVDQVTLDLKNLSVKRNGTVRRVVDIGHYEVQVRIHRVTGKLKTGTPTLRFGGNQVRLAMPVSVASGSGRATIAFKWDGIGVSGALCGDMDITRDVTGGVKPANYSVAGGIVFLATPRAILASPRFPLIKVNLKVEPSEASWAAVAKILEEKEGLCGYVLEKVNLLKMVQGLIERGFNVRLPTEKIKPMAVPVGVEPTMQVRGQPVALAIKVGELAITEQMIWLGADVHVNLKDLQLKPSP*
Ga0105094_1007957623300009153Freshwater SedimentVKKVVTLGHYDLHVTIHRVAGKLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGRGVAGATCGDLEVTQEVSGSVRPDTYPVAGGLVLTATAEEILAEPRFPKIKVNLKVVPSAESWGAVQRILDEKEGLCGYVVEKVNVLGIVQRLVDKGFNVRLPTEKIKSMAVPVGIEPTMEVHGQAVALGIKVGDLAITEHVIWLGAHVSVATGEEALAKKVDGRTSK*
Ga0105092_1009994513300009157Freshwater SedimentVSARLVAASLAVAALAVGACRKDRVRPDQLKAEIAALEQEREALRAKVGELMIKDPRVKGMPDAPVRVGVPTSLTRVLIEKVTAGFVDQVTLELKNLRVRKSGTVKKVVTLGEYALDVHIEKVQGRLKTGKPDVRFGGNKVSIALPVTVASGSGKATIRFKWDGKNVSGAVCGDLDVEQAVTGKVKPDSYPVAGALLLTATSQQILAAPKFPLIKVNLKIDPSPESWAAVQKIIDDKEGACGFVLDKVDVLKIVRGLIDRGFNVRLPTEKIKP
Ga0105100_1005901823300009166Freshwater SedimentMPATSVAAITSASLLLSSLACGRKDRQGPDQIRAQIQALENERNSLRERMNELMVKDPRLPGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLRVVINRVSGRLKTGKPEVTFGGNKVTLAMPVTVASGSGNATIHFKWDGKNIAGATCGDIEVTREVSGGVRPDTYPVSGGLVLTATAKEILAEPRFPLIKINLKVNPSAESWDAVQKILDEKEGLCGYVVEKVDVRGVIQRLVDKGFNVRLPTEKLKPMAVPVGIEPTMEVRGQPVALGIKLGELAITEQMIWLGADVSVATGAEAEKEIERREAEKKGAR*
Ga0113563_1078051223300009167Freshwater WetlandsVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGRLKTGKPDVTFGGNKVAIALPVTVASGTGRATIHFKWNGKNVAGATCGDMEVTQEVSGGVRPDTYPVSGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKI
Ga0113563_1129626813300009167Freshwater WetlandsLATLAAASVLAVSAACGRKDRETPDGLRADIAALEKERDTLRGRMNELMVRDPRLKGMPATPVKVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTIGQYDLNVTINRVIGKLKTGKPDVTFGGNKVKVALPVTVASGTGNATIHFKWDGKNVAGATCGDLEVTQEVSGSVRPDTYPVAGGLVLTATAEDILASPRFPVIKVNLKVNPSAESWGAVQKILDEKEGLCDYVVEKVNIPGILQRLLHFFFNDPAPTEKIKPMAVPVGIEPTMEV
Ga0113563_1164164613300009167Freshwater WetlandsTPVRVGVPTALARDLIQRVVAGFVDQVTLDLKNLKAKKSGKVKKVVTLGAYDLKVDIHRVTGRLKTGKPEVTFGGDKVALALPVTIASGSGRATIHFKWDGKNVAGATCGDLDVTQEGTGGVRPDTYPVSGALVLTATAEDILAQPRFPQIKVRLKVAPSAESWAAVDAILGQKDGLCGYVVEKVDVRGIVQRLVEKGFNVRLPTEKIKPMAVPVGIEPTMVVHGQPVALAIQVADLAITPDIIWLGARVSVA
Ga0115028_1090013313300009179WetlandKVKKSGRVKKVVTLGEYDLHVTIHRVKGRLKTGKPDVTFGGNQVAIALPVTVASGTGRATIHFKWNGKNVAGATCGDMEVTQEVSGGVRPDTYPVSGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKLGDLAITEEMIWLGARVSVAVGEEAEKEIEEKKA
Ga0114939_1024374313300009455GroundwaterVGELMANDSRLKGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLDLKNLKAKKSGTVKKVVTLGQYDLKVDIHRVSGRLKTGKPEVTFGGDQVALALPVTIASGTGRATIHFKWDGKGAAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVRLKVVPSQESWAAVDKILGEKTGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVALGIKLADLA
Ga0105347_124715913300009609SoilDALEKERDSLRTKLHDLMQGDPLLQGMPTTPVRVGVPTSLASELITKVVSGFVDQVTLELKNLKVKKSGNVKKVVTIGQYDLRVLIHKVTGKLKTGKPNVKFGGNEVSLALPVRVVSGSGRATINFKWDGKNVSGAVCGDMEITQDVSGGVKPESYPVAGRLLLTATAREILASPRFPVIKIRLKIEPSQESWSAVEKILADKEGVCGYVVDKVNVLGIVQGLIDKGFNVRLPTEKIKPMAIP
Ga0137428_102063833300011432SoilVSGFVDQVTLELKNLKVKKSGNVKKVVTIGQYDLRVLIHKVTGKLKTGKPNVKFGGNEVSLELPVRVVSGSGHATIDFKWDGKNVSGAVCGDMEITQDVSGGVKPESYPVAGSLVLTATAREILASPRFPVIKIRLKIEPSQESWSAVEKILADKEGVCGYVVDKVNVLGIVQGLIEKGFNVRLPTEKIKPMAIPVGIEPSMQVRGKQVELGIKVGELAITEKMIWLGADIRVEVEGPLLPKPKAPASSGGS*
Ga0153915_1063425013300012931Freshwater WetlandsDQVTLELKNIKVKKSGTVRKVVTIGQYDLHVTINRVTGKLKTGKPDVTFGGNMVSLALRVTVASGSGNATIQFKWDGRNVAGATCGDLEVTQEVSGSVRPHTYPVSGGLVLTATAKEILAEPRFPVIKIRLRVNPSAQSWGAVQKILDEKEGVCGFVVDKVDVLGLVKKIVDRGFNVRLPTEKIKPMAIPVGIEPTMEVHGKPVALGIKVGALAITKHVIWLGAKVSVAVGEEAQKEIEERKIEAQKKGKAR*
Ga0153916_1055543013300012964Freshwater WetlandsVTVRAPALALATIAVVPVLAVALSCGRKDRASPEQLRSEIAALEKERETLRGRFNELILNDPRIEGMPTTPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVIIHRVAGKLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGKGVAGATCGDLEVTREVSGRVRPDTYPVSGGLVLTATAQEILAEPRFPVIKVRLKVNPSAESWGAVQKILDEKEGLCGYVVEKVNVLGIVQRLIDKGFNVRLPTEKIKPMAIPVGIEPTMEVRGQPVALGIRIGQLAITEHVIWLGAHVSVAVGDEAMAKKAVQKAAEKKAP*
Ga0075352_105943813300014324Natural And Restored WetlandsLRAQIDALENERTALRGKLDVLMQGDPILEGMPTRPVRVGVPTALARELITKVVTGFVDHVTLELKNLNVKKSGNVKKVVSIGHYDLQVRIHRVTGKLKTGTPELTFGGNQVKLALPVTVASGSGRATINFKWDGKNVSGAVCGDMEVTQEVSGGVKPANYPVSGGIVLTATTREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKTGMCGYVVDKVNVLGIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGHLAITSHMIWLGAEVSIALPGAPQTASSASAPVSSAGS
Ga0180068_103091413300014864SoilQRMNELIVRDPRVKSMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLDLKNLKVKKSGTVKKIVTIGQYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVVSGTGQATIHFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQTKVNLKVVPSEKSWAAVDSILGGKDGLCGYAVEKVDVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVALAIQVGQLAITEDVIWLGAKVSVAVGEEAAKQIE
Ga0180067_101091723300015257SoilVTMRARNAAAALAVTPVLAAFLACGRQDRETPDQLRAEIQALEKERQSLRQRMNELIVRDPRVKSMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLDLKNLKVKKSGTVKKIVTIGQYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVVSGTGQATIHFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQTKVILKVVPSEKSWAAVDSILGGKDGLCGYAVEKVDVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVALAIQVGDLAITEDVIWLGARVSVAVGEEAAKQIE
Ga0187786_1029356813300017944Tropical PeatlandQRVVSGFVDQVTLELNNLKVEKHGQVKKVVTLGEYDLEVLITRVSGHLKTGKPDVSFGGNKVTLALPVTIASGSGRATIHFTWDSKGVADATCGDLDVTREVWGGVRPDTYPLSGGLVLTATAKEILAEPRFPLIKVNLKVNPSKESWDSVQKILDEKEGVCGYVLDKVDVRAILERLVSRGFSVRLPTEKIKPMAVPVGIEPTMEVRGQPVTLGIKLGRL
Ga0184615_1018996413300018059Groundwater SedimentTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVSIGQYELQVRIHRVSGKLKTGTPELTFGGNQVKLALPVTVASGSGRATIHFKWDGKNVSGAVCGDMEVTQEVSGGVKPANYPVSGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKTGMCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGELAVTEKMIWLGADVQVAMPAPVPPPP
Ga0184633_1005473123300018077Groundwater SedimentMSARAAVVLLPAVAAFALACGRGDRTTPAELRAQIDGLEKERDALREKLHALMQGDPLLEGMPTQPVRVGVPTALARELITKVVTGFVDRVTLELKNLKVKKTGTVKKIVSIGQYQLQVHIHRVTGKLKTGTPTLKFGGNQVSLAMPVTVASGSGRATINFKWDGKNVSGAVCGDMEITQEVSGGVKPASYPVAGGIVLTATAREILASPRFPLIKVNLKVQPSQESWAAVEKILEEKAGLCGYVLEKVNVLKIVQGLIERGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVELAIKVGELAITERMIWLGADVRVEMPDSLKPSP
Ga0210377_1002618733300021090Groundwater SedimentMSARHVRALALAAATLTLACGRGDRTTPAELRAQIDALENEREALRGKLDVLMQGDPLLEGMPTRPVRVGVPTALARELITKVVTGFVDHVTLELKNLTVKKSGTVKKVVSIGQYELQVRIHRVSGKLKTGTPELTFGGNQVKLALPVTVASGSGRATIHFKWDGKNVSGAVCGDMEVTQEVSGGVKPANYPVSGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKTGMCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGELAVTEKMIWLGADVQVAMPAPVPPPP
Ga0210377_1008364213300021090Groundwater SedimentMRARTAAAALAVASVLVTSLACGRQDRQTPDQLRAEIQALEKERQSLRERLNELIVSDPRVKTMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLNVTIKRVMGKLKTGKPEVTFGGDKVALALPVTIVSGTGQATIHFKWDGKNVAGATCGDLEVTQEVSGGVRPDTYPVTGGLVLTATAKDILAEPRFPLIKVNLKVVPSRESWAAVQKILDEKTGLCGYVVEKVNVLGIVQKLIDKGFNVRLPTEKIKPMAVPVGIEPT
Ga0124853_132589023300024056Freshwater WetlandsMPTTPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVIIHRVSGKLKTGKPEVTFGANRVTIALPVTVASGSGNATIDFKWDGKGLADATCGDMEVTQEVSGSVRPETYPVSGGLVLSATAEQILAEPRFPLIKVRLKVNPSAESWAAVQKILDEKEGLCGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVRGQPVALAIRLGQLAITEHVIWLGADVSVSTGAEAAAKLAAREAP
Ga0209108_1006736023300025165SoilMSARAAFVLLPAVAALALACGRGDRTTPAELRAQIDGLEQERDALREKLHALMQGDPLLEGMPTQPVRVGVPTALARELITRVVTGFVDQVTLELKNLKVKKAGTVKKIVSIGQYELQVHIHRVTGKLKTGTPVLKFGGNQVSLAMPVTVASGSGRATINFKWDGKNVSGAVCGDMEVTQEVSGGVKPANYPVSGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKQGVCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGELAITEKMIWLGADVRLAMPDAVPAPP
Ga0209640_1001933723300025324SoilMSARAAFVLLPAVAALALACGRGDRTTPAELRAQIDGLEKERDALREKLHALMQGDPLLEGMPTQPVRVGVPTALARELITRVVTGFVDQVTLELKNLKVKKAGTVKKIVSIGQYELQVHIHRVTGKLKTGTPVLKFGGNQVRLAMPVTVASGSGRATINFKWDGRNVSGAVCGDMEVTQEVSGGVKPANYPVAGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKQGVCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVELAIKVGELAITERMIWLGADVRVEMRDPTQPSP
Ga0210132_102456623300025538Natural And Restored WetlandsTVKKVVTIGEYELDVRIHRVKGRLKTGTPTLGFGGNRVKLGLPVTIASGSGEATIHFKWNGKNVSGAICGDMEISQDVSGGVKPESYPLSGAIELTATAREILASPRFPLIKVRLKVRPSDESWAAVQKILDDKEGLCGYAVDKVNVRGIVERLVDKGFNVRLPTEKLKPMAVPVGIQPTMQVRGQPVALAIKLGHLAITSRMIWLGADVRIELPEAPQPAPSASAPVSSAGS
Ga0210131_101784823300025551Natural And Restored WetlandsTELREKVNQLMEGNPLLEGMPTQPVRVGVPTELARALITKVVTGFVDQVTLELKNLKARKAGTVKKVVTIGEYELDVRIHRVKGRLKTGTPTLGFGGNRVKLGLPVTIASGSGEATIHFKWNGKNVSGAICGDMEISQDVSGGVKPESYPLSGAIELTATAREILASPRFPLIKVRLKVRPSDESWAAVQKILDDKEGLCGYAVDKVNVRGIVERLVDKGFNVRLPTEKLKPMAVPVGIQPTMQVRGQPVALAIKLGHLAITSRMIWLGADVRIELPEAPQPAPSASAPVSSAGS
Ga0207660_1022701913300025917Corn RhizosphereVSARSLAAGFALLGLAGALALAACRERADRESPEQIRAEIAALEKERDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQ
Ga0207669_1073640513300025937Miscanthus RhizosphereRDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGA
Ga0256867_1017490213300026535SoilIQRVVAGFVDQVTLELKNLRVRKSGTVTKVVTLGRYDLKVDINRVSGRLKTGNPQVTFGGDQVSLALPVTVASGQGRATIHFKWDGRGVAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQIKINLKVLPSDESWAAVDKILEDKEGICGYVVEKVNVRGIVQRLVDRGFNVRLPTERIKPMAVPVGIEPTMEVRGQPVALGIELGELAITEHMIWLGARVSVAVGEQAEKVIEEKTDKAKAEEKKAPAS
Ga0209397_1015563113300027871WetlandGAVTMRALFAVAALAASSVLASLACGREDRQSPDQIRAEILALEKERDSLRERVNELIVKDPRMKSMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGQYDLRVDIHRVSGKLKTGKPEVTFGGDKVSIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVQGGLVLTATAQDILAQPRFPQTKVNLKVVPSKESWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVRGQPVALSIKLGDLAITEDVIWLGARVSVAVGEEAAKQIEKKQAEEE
Ga0209496_1011904813300027890WetlandSTVAALAASAVLAATLACGREDRQTPDQIRAGIQALETERDALRQRVNELIVNDPRVKTMPRTPVRVGVPTTLARDLIQKVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGKLAITESVIWLGAEVSVAVGKEAGAKKAPETPEKTPEKTPAS
Ga0209048_1005771433300027902Freshwater Lake SedimentMRARTTVLATLVAVSALTTLATCGRKDRETPDSLRAEIAALEKERDTLRGRMNELMVKDPRLEGMPKAPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTIGQYDLNVTINRVIGKLKTGKPDVTFGGNKVQVALPVTVASGSGNATIHFKWDGKNVAGATCGDLEVTQDVSGSVRPDRYPVAGGLVLTATAEDILASPRFPVIKVNLKVKPSEKSWAAVQKILGEKTGLCGYVVEKVNVLGIVQKLIDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALAIKLGELAITEHMIWLGANVAVAVGDDVVEKKAPAAKAPAAKAPMPKGAERKPPAT
Ga0209382_1034553423300027909Populus RhizosphereVTVRAGLVFVPAVAALALACGRADRTTSSELRAQIDVLEKERDALDAKLHTLMLSDPLLEGIPTQPVLVGVPTALARELITKVMTGFVDQVTLDLKNLSVKRNGTVRRVVDIGHYEVQVRIHRVTGKLKTGTPTLRFGGNQVRLAMPVSVASGSGRATIAFKWDGIGVSGALCGDMDITRDVTGGVKPANYSVAGGIVFLATPRAILASPRFPLIKVNLKVEPSEASWAAVAKILEEKEGLCGYVLEKVNLLKMVQGLIERGFNVRLPTEKIKPMAVPVGVEPTMQVRGQPVALAIKVGELAITEQMIWLGADVHVNLKDLQLKPSP
(restricted) Ga0233417_1012395613300028043SedimentLLAYLACGRKDRMAPDEIRAQIQALEKERLSLRERMDELMVTDPRLPGMPGTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKKGRVRKVVTLGEYDLRVEIHRVRGRLKTGKPEVTFGGNEVTLAMPVTVASGSGNATLHFKWDGKNVAGATCGDLDVTQVVSGGVKPDTYPVSGGLELTATAREILAEPRFPLMKIRLKVDPSAESWAAVDRILGEKDGLCGYVVEKVNVRAILQRLVDKGFNVRLPTEKIKPMTVPVGIEPTMEVRGKPVALGIKLGELAITEQMIWLGANVSVATGAEAAREIEKRRPEKKGATPGRTPTPGAPAPKPTGGQPPAG
Ga0268265_1066400313300028380Switchgrass RhizosphereLAGALALAACRERADRESPEQIRAEIAALEKERDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAAASSSPSPSPTANAAAPA
Ga0299906_1001458433300030606SoilMSARAAFVLLPAVAALALACGRGDRTTPAELRAQIDGLEKERDALREKLHALMQGDPLLEGMPTQPVRVGVPTALARELITRVVTGFVDQVTLELKNLKVKKAGTVKKIVSIGQYELQVHIHRVTGKLKTGTPVLKFGGNQVSLAMPVTVASGSGRATINFKWDGRNVSGAVCGDMEVTQEVSGGVKPANYPVAGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKQGVCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVELAIKVGELAITERMIWLGADVRVEMKDPKQPSP
Ga0299915_1004538413300030613SoilRRAGGQGQGEVTMRALSAVAALAASPLLVASLACSRKDRVAPDHIHAQIQALEIERQSLRERMNELMVKDPRLPGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVRKVVTLGEYDLRVVIHRVSGRLKTGKPDVTFGGNQVTLAMPVTVASGSGNATIHFKWDGKNIAGATCGDLDVTQEVSGGVRPDTYPVSGGLELTATAKEILAEPRFPLMKIRLKVNPSAESWAAVDKILGEKEGLCGYVVEKVDVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKLGDLAITEHVIWLGARVSVAVGEEALKEIGQKKVEKGGPAGKAPAAKAPATKPAGSQPPAS
Ga0268386_1004651523300030619SoilMRARDAVATLVSSVLVVSVACGRSDRQSPDELRADIQALEKERQSLRERMNDLMAGEKRVKGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLRVRKSGTVTKVVTLGRYDLKVDINRVSGRLKTGKPQVTFGGDQVSLALPVTVASGQGRATIHFKWDGRGVAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQIKINLKVLPSDESWAAVDKILEDKEGICGYVVEKVNVRGIVQRLVDRGFNVRLPTERIKPMAVPVGIEPTMEVRGQPVALGIELGELAITEHMIWLGARVSVAVGEQAEKVIEEKTDKAKAEEKKAPAS
Ga0302046_1038271813300030620SoilMRARDAVATLVSSVLVVSVACGRSDRQSPDELRADIQALEKERQSLRERMNDLMAGEKRVKGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLRVRKSGTVTKVVTLGRYDLKVDINRVSGRLKTGKPQVTFGGDQVSLALPVTVASGQGRATIHFKWDGRGVAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQIKINLKVLPSDESWAAVDKILEDKEGICGYVVEKVNVRGIVQRLVDRGFNVRLPTERIKPMAVPVGIEPTMEVRGQPVALGIELGELAITEHMIWLGARVS
Ga0310887_1053600313300031547SoilTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAASPAPSASPSPSPSPT
Ga0247727_10007107183300031576BiofilmVSARAEVTLGLVVACGAALACGRADRTSSKELRSQIAALEQERATLREKLHALMQNDPLLEGMPTQPVLVGVPTVLARELITRVVAGFVDQVTLELKNLKVKKTGTVKKVISIGQYELHVLIHRVTGKLKTGTPDVRFGGNKVSLRLPVTVASGQGRATIHFNWDGKNVSGAVCGDMNISQEVSGGVKPASYPVSGGLVLTATAKEILASPRFPLIKVNLKVQPSAESWAAVQKILDEKQGLCGYVLEKVNVLKIVQGLIERGFGVRLPTEKLKPMAVPVGIEPRMQVKGQEVELAIKLGQLAITEQMIWLGADVRIELPAPAPPAASPRAGLSSTGS
Ga0247727_1001312353300031576BiofilmVTARAGVTLGLVVACGAALACGRADRTTSKELRSQIVALEQERAALRHKLDALMQNDPLLVGMPTQPVRVGVPTVLARELITKVVAGFVDQVTLELKNLKVRKTGTVKKIVSIGQYELHVQIHRVTGKLKTGTPDVRFGGNKVSLGLPVSVASGHGRATINFNWDGKNVSGAVCGDMNISQEVSGGVKPASYPVSGGLVLSATANEILASPRFPLIKLNLKVQPSAESWAAVKKILEEKQGLCGYVLEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAVPVGIEPRMQVRGQEVELAIEVGELAITEQMIWLGAEVRLAMQAPVKSAP
Ga0318555_1026484613300031640SoilDPRIEGMPKAPVRVGVPTSLAKDLIERVTAGFVDHVTLELKNLKVDKTGTVKKVITLGEYELHVRIHRVSGKLKTGKPEVSFGGNKVALAMPVTIASGGGNATIHFKWDGKGVSDAVCGDLDVTEDVRGGVKPASYPVSGALVLTATAQEILASPKFPVIKVNLKVDPSEASWAAVQKILDDQHGVCGYVVDKVDVLKIVKGLIDRGFNVRLPTEKIKPMAVPVGIAPTMEVHGRQVELGIKLGELAITQDMIWLGAQVSVDIAPRS
Ga0318560_1030579813300031682SoilAPALLGALALLLERGDLRAKLFGRLAVLAAAAGLERDELRPKLDERIVNDPRIEGMPKAPVRVGVPTSLAKDLIERVTAGFVDHVTLELKNLKVDKTGTVKKVITLGEYELHVRIHRVSGKLKTGKPEVSFGGNKVALAMPVTIASGGGNATIHFKWDGKGVSDAVCGDLDVTEDVRGGVKPASYPVSGALVLTATAQEILASPKFPVIKVNLKVDPSEASWAAVQKILDDQHGVCGYVVDKVDVLKIVKGLIDRGFNVRLPTEKIKPMAVPVGIAPTMEVHGRQ
Ga0310904_1016147723300031854SoilVTRRALATGLAFVALAGALAAPACSERKDRQSPDELRAEIAALEKERDLLRPKLDQLISKDPRIAGMPKTPVRVGVPTTLAGDLIERVVSGFVDHVTLELKNLKVNKTGTVKKVVTIGQYELHVKINKVSGKLKTGKPSVTFGGNKVALAMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGSLILTATAKEILAQPKFPLIKINLKVAPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAVPVGIEPSMKVQGRQVDLAIKLGELAITNDMIWLGANVHVDVGEAGSAASPAPSASPSPSPSPTANAA
Ga0214473_1002997033300031949SoilMSVRTAVVLLPAVAALALACGRGDRTTPAELRAQLDALEKQRDALREKVNALMQGDPLLEGMPTQPVRVGVPTALARELITKVVTGFVDQVTLELKNLKVKKAGTVKKVVSIGQYELQVRIHRVTGKLKTGPPTLKFGGNQVSLAMPVKVASGSGRATINFKWHGKNVSGAVCGDMDITQEVSGGVKPANYPVAGGIVLTATAREILASPRFPLIKVNLKVQPSQASWAAVEKILEDKEGLCGYVVEKVNVLKIVQGLVERGFNVRLPTEKLKPMAVPVGIEPTMQVRGQPVELAIKVGKLAITERMIWLGADVRVEMKDLQLKPSR
Ga0214473_1011897823300031949SoilMSARHVAVVALAAGSLALACGRGDRTTPAELRAQIDALEKEREALREKLDVLMQGDPLLEGMPTRPVRVGVPTALARDLITKVVTGFVDQVTLELKNLTVKKSGSVRKVVSIGQYELQVRIHRVSGKLKTGTPELTFGGNQVKLALPVTVASGSGRATIHFKWDGKNVSGAVCGDMEVTQEVSGGVKPASYPVSGGIVLTATAREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKTGVCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGELAVTEKMIWLGADVSVAMPELAPKAP
Ga0214473_1029014013300031949SoilMSARHAALLLAAMASMLACGRGDRTTPAELRQQIDALEQERATLREKIDLLMQGDPLLEGMPTRPVRVGVPTELARELITKVVTGFVDQVTLELKNLTVKKSGTVRKVVSIGQYQLQVRIHRVSGKLKTGTPEITFGGNQVKLALPITVASGSGRATINFKWDGKNVSGAVCGDMQVTQEVSGGVKPANYPVSGGIVLTATTREILASPKFPLIKVNLKVQPSEASWAAVQKILDEQQGLCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAIPVGIE
Ga0326597_1003677043300031965SoilMSVRTAVVLLPAVAALALACGRGDRTTPAELRAQLDALEKQRDALREKVNALMQGDPLLEGMPTQPVRVGVPTALARELITKVVTGFVDQVTLELKNLKVKKAGTVKKVVSIGQYELQVRIHRVTGKLKTGPPTLKFGGNQVSLAMPVTVASGSGRATINFKWHGKNVSGAVCGDMDITQEVSGGVKPANYPVAGGIVLTATAREILASPRFPLIKVNLKVQPSQASWAAVEKILEDKEGLCGYVVEKVNVLKIVQGLVERGFNVRLPTEKLKPMAVPVGIEPTMQVRGQPVELAIKVGKLAITERMIWLGADVRVEMKDLQLKPSR
Ga0326597_1110760613300031965SoilEGMPTQPVRVGVPTALARELITRVVTGFVDQVTLELKNLKVKKAGTVKKIVSIGQYDLQVHIHRVTGKLKTGTPVLKFGGNQVSLAMPVTVASGSGRATINFKWDGRNVSGAVCGDMEVTQEVSGGVKPANYPVAGGIVLTATSREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKQGVCGYVVEKVNVLKIVQGLIDRGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVELAIKVGELAITERMIWLGADVRVEMPGST
Ga0315278_1019408323300031997SedimentMRAVSAVAALAASSVLAASLACGREDRQSPDQIRAEIQGLEKERDSLRERVNELIVKDPRVASMPSTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGHYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGNATIHFKWDGRGAAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVNLKVLPSDESWAAVDRILGEKEGLCGYVVDKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALEIKLGELAITEDMIWLGARVSVAVGEEAAKQIAEKKAQEKKAPARNGPPTKPAAANPPAS
Ga0315292_1006250923300032143SedimentMRAVSAVAALAASSVLAASLACGREDRQSPDQIRAEIQELEKERDSLRERVNELIVKDPRVASMPSTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGHYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGNATIHFKWDGRGAAGATCGDLEVTQGVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVNLKVLPSDESWAAVDRILGEKEGLCGYVVDKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALEIKLGELAITEDMIWLGARVSVAVGEEAAKQIAEKKAQEKKAPARNGPPTKPAAANPPAS
Ga0315283_1048360923300032164SedimentMRARTAVLATLAAVSVLSSLAACGRENRQSPKLLRAEIEALEKERDTLRARMNELMVKDPRLEGMPGAPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTIGQYDLNVTINRVIGKLKTGKPDVTFGGNKVAIALPVTVASGTGNATINFKWDGKNVAGATCGDLEVTQEVSGSVRPDSYPVAGGLVLTATAKDILASPRFPVIKVNLKVKPSEASWGAVQKLLDEKTGVCGYVVEKVNVLGIVQKLIDKGFNVRLPTEKIKPMAIPVGIEPTMQVRGQTVELGIKLGELAITEHMIWLGANVSVTVGEDAAENKAPADKKAPAQK
Ga0315283_1062344723300032164SedimentLLEGMPTKPVRVGVPTALVRELITKVVTGFVDQVTLELKNLNVKKSGNVKKVVSIGHYDLQVRIHRVTGKLKTGAPVLTFGGNLVKLALPVSVASGSGRATINFKWDGKNVSGAVCGDMEVTQEVSGGVKPASYPVVGGLVLTATAREILASPKFPLIKVNLKVQPSAESWAAVQKILDEKTGVCGYVVEKVNVLGIVQGLIDRGFNVRLPTEKIKPMAIPVGIEPTMQVRGQPVELAIKVGELAITEKMIWLGADVQIAMPAPVAPPP
Ga0315283_1087808213300032164SedimentLKNLKVKKSGKVKKVVTLGHYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGNATIHFKWDGRGAAGATCGDLEVTQGVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVNLKVLPSDESWAAVDRILGEKEGLCGYVVDKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALEIKLGELAITEDMIWLGARVSVAVGEEAAKQIAEKKAQEKKAQEKKAPARNGPTTKPAAAKPPAS
Ga0315287_1015782513300032397SedimentREDRQSPDQIRAEIQELEKERDSLRERVNELIVKDPRVASMPSTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGHYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGNATIHFKWDGRGAAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVNLKVLPSDESWAAVDRILGEKEGLCGYVVDKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALEIKLGELAITEDMIWLGARVSVAVGEEAAKQIAEKKAQEKKAPARNGPPTKPAAANPPAS
Ga0310812_1025670013300032421SoilETPEQIRAEIASLEKERELLRPKLDALISKDPRIAGMPKAPVRVGVPTSLARDLIERVVSGFVDHVTLELKNLKVNKTGSVKKVVTIGQYELHVKINKVSGKLKTGDPKVTFGGNKVALSMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGMLVLTATAKEILAEPKFPLIKINLKVDPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAIPVGI
Ga0315273_1010777923300032516SedimentMRAVSAVAALATSSVLAASLACGREDRQSPDQIRAEIQELEKERDSLRERVNELIVKDPRVASMPGTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGHYELNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGNATIHFKWDGRGAAGATCGDLEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPLIKVNLKVLPSDESWAAVDRILGEKEGLCGYVVDKVNVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALEIKLGELAITEDMIWLGARVSVAVGEEAAKQIAEKKAQEKKAPARNGPPTKPAAANPPAS
Ga0335076_1001065763300032955SoilMTKPALLAALAPALLGALVLEACAGRKDRETPEELRAEIAALAKERDELRPKLDALIVKDPRIQGMPKAPVRVGVPTTLAKDLIERVTAGFVDHVTLELKNLKVDKTGTVKKVITLGDYELHVRIHRVSGRLKTGKPEVSFGGNKVSLAMPVTIASGGGNATINFKWDGKGVSDAVCGDLDVTQDVGGGVKPASYPVAGALVLTATAQEILASPKFPVIKVNLKVDPSEASWAAVQKILDEKHGVCGYVIDKVDVLKIVKGLIDKGFNVRLPTEKIKPMAVPVGIAPTMLVQGRQVELGIKLGELAITEDMIWLGAQVSVDIKEAE
Ga0335084_1001739623300033004SoilVTKQVLIAALGTPLLGALVLPACGGRQDRETPAALRAEIAALEKERDALRPKLDELIVKDPRIEGMPKAPVRVGVPTTLAKDLIQRVTAGFVDHVTLELKNLKVDKTGTVKKVITLGQYELHVAIHRVSGRLKTGKPEVTFGGNKVALAMPVAIASGAGNATIHFKWDGKGVSDAVCGDLDVTQDVGGGVKPASYPVAGALVLTATAKEILASPKFPLIKVNLKVDPSDESWAAVQKILEEKHGVCGYVLDKVDVLKIVRGLIDKGFDVRLPTEKIKPMAIPVGIEPTMQVQGRQVELGIKLGELAITEDMIWLGAHVSVDIGPAGTTPSPAPRP
Ga0335084_1022230013300033004SoilMPKTPVRVGVPTTLARDLIERVTSGFVDHVTLELKNLKVNKTGTVKKVVTLGEYELNVKVRRVTGKLKTGKPVVTFGGNKVALALPVTVASGSGNANITFKWDGKGVSDAICGDLNVNQDVSGGVKPASYSVSGALVLTATAREILASPRFPLTKVNLKVDPSDESWAAVQKILDDQKGMCGYVVDKVNVRKIVQGLIDRGFNVRLPTEKIKPMAVPVGIEPSMDVRGRRVELGIKLGELAITKDMIWLGAQVSVDVRPARQPQPGP
Ga0335084_1081827613300033004SoilMLAALSLWGGCGRKDRATPEQLRAQIAALEAERQVLRGRFNDLVANDPRIQGMPDTPVRVGVPTTLVRGLVQRVLAGVLDQVTLDLRNIKVRKSGTVKKVVTIGQYDLNVVIDHVTGKLKTGAPRVTFGGNKVSLALPVSVASGTGSATIDFKWDGRNVSGAVCGDMAVTRVVTGSVRPDRYPVSGALVLKATAEEILAEPRFPVLRVNLKIDPSAESWGAVQRILDEKDGVCGYVVHKVDILGVVRRLIEKGFSVRLPTEKLKPMAVPISVQPVMQVRGQPVGLAINVGQLAITEHVIWLGARVCVASGEPVA
Ga0316604_1017728523300033406SoilMRARAAAATLAVLSALVASLACGREGRQAPDQIRADIQALEKERQSLRERVDELMVSDPRLKGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGHATINFKWDGKGVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIE
Ga0316605_1018045113300033408SoilLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLTITESVIWLGARVSVAVGEEVAARKAPEKKAPEKKAPEKKALPPKAPAS
Ga0316605_1034595113300033408SoilRARADELMVNDPRVKTMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTIGEYDLDVTIHRVIGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVQGGLVLTATAQDILAQPRFPQTKVNLKVVPSKESWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVRGEPVALSIKLGELAITEDVIWLGARVSVAVGEEAERQVEARKAEQSKAREKKPPAGKAPAARPADARPPAS
Ga0316605_1086066213300033408SoilTTPVRVGIPTTLARDLIERVVEGFVDQVTLELKNLKVKKRGTVKKVVTLGQYDLQVIIHRVAGKLKTGKPDVAFGGNKVSIALPVTVASGSGNATIAFKWDGKGVADATCGDLEVTQEVSGSVRPDTYPVSGGLVLTATAEQILAEPRFPVIKVRLKVNPSAESWAAVQKILDEKEGICGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVHGQPVALGIKIGELAITEHVIWLGAHVSVAVGDEAVAKKAAATKAREKKAP
Ga0310810_1046905023300033412SoilVSRRAPATSLVLLAAGALSLAGCRERADRETPEQIRAEIAALEKERDLLRPKLDALISKDPRIAGMPKAPVRVGVPTSLARDLIERVVSGFVDHVTLELKNLKVNKTGSVKKVVTIGQYELHVKINKVSGKLKTGQPSVTFGGNKVALSMPVTVASGSGNATINFKWDGKNVSGAVCGDMEITQDVSGGVKPASYPVSGMLVLTATAKEILAEPKFPLIKINLKVDPSDESWGAVQKILDDKEGVCGYVVDKVNILKIVEGLINKGFNVRLPTEKIKPMAIPVGIEPSMQVQGRQVDLGIKLGELAITNDMIWLGAN
Ga0316619_1008449323300033414SoilVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLTITESVIWLGARVSVAVGEEVAARKAPEKKAPEKKALPPKAPAS
Ga0316622_10000600333300033416SoilVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLTITESVIWLGARVSVAVGEEVAARKAPEKKAPEKKAPEKKALPPKAPAS
Ga0316622_10038406123300033416SoilMRALSAVAALAASSVLAASLACGREDRQSPDQIHAEILALEKERDSLRERVNELMVNDPRVASMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVRKVVTIGQYDLRVTIHRVTGKLKTGKPEVTFGGDKIAIALPVTVVSGTGQATIHFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAQPRFPQTRINLKVVPSEKSWAAVDGILGAKDGLCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALAIQVGDLAITEDVIWLGARVSVAVGEEAKRQVEEKKAQQKKGKGEARPGKAPATKPEAAEPPAG
Ga0316622_10067245423300033416SoilVTVRAPALALATLAVVPVLAVSLSCGRKDRATPEQLRSEIAALEKEREILRGRFNELILNDPRIEGMPTTPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVIIHRVSGKLKTGKPEVTFGGNRVTIAMPVTVASGSGNATIEFKWDGKGLADATCGDMQVTQDVSGSVRPDTYPVSGGLVLTATAKEILAEPRFPLMKIRLKVNPSAESWAAVDKILGEKEGVCGYVVDKVNVRGILQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKLGQLAITEQMIWLGADVTVATGAAAAKEIKKKKAEEGGAR
Ga0316622_10170605813300033416SoilILNDPRIEGMPTAPVRVGIPTTLARDLIQRVVEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVIIHRVAGKLKTGKPDVTFGGNKVSIALPVTVASGSGNATIDFKWDGRGVADATCGDLEVTQEVSGSVRPDTYPVSGGLVLTATAEQILAEPRFPVIKVRLKVNPSAESWAAVQKILDEKEGICGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVHGQAVALGI
Ga0316622_10188293913300033416SoilGKVKKVVTLGEYDLKVDIHRVTGKLKTGKPEVTFGGNKVAVALPVTVASGSGRATIHFRWDGKNVAGATCGDLEVTQEVTGGVKPETYPVSGSLVLTATAKDILAEPRFPTIKARLKVVPSAESWGAVDRILGEKGGVCGYVVEKVNVRGIVQKLVDKGFNVRLPTEKLKPMAIPVGIEPTMDVRGQPVALAIQVGDLAITEDVIWLGARVSVAVGEEAKKQVEKKEAEHK
Ga0326726_1003213853300033433Peat SoilVTVRARSAVPATIALASLLTLPLSCGWKDRAASDKLRGGIEALEKERDGLRGRLDALMAADPQLEGMSETPVRVGVPTTLVRDLIERLVAGFVDQVTLELRNLKVEKSGTVKKVVTIGQYDLHVTVNRVTGRLKTGKPVVKFGGNRVSVALPVTVASGTGNATIRFRWNGKGIGGALCGDLDITREVEGGVRPDRYNVSGSLVLTATAKEILAEPHFPVVKINLKVVPSPESWGAAQKVLDDKEGVCGFVLDKVDVLGFVRKIVDKGFDVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIEMGGLAITEHAIWLGAHVSVAIGAEAVARSR
Ga0316600_1020885613300033481SoilMRALFAVAALAASSVLASLACGREDRQSPDQIRAEILALEKERDSLRERVNELIVKDPRMKSMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGKVKKVVTLGQYDLNVIIKRVTGKLKTGKPEVTFGGDKVALALPVTVVSGTGEATIQFKWDGKGAAGATCGDLEVTQEVSGGVRPDTYPVAGSLVLTATARDILAEPRFPLIKINLKVLPSEDSWAAVDRVLGEREGLCGYVVDKVDVRGIVQKLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGHPVALAIQVGDLAITEDVIWLGARVSVAVGEEAAKQVEDKKAQEKKAPARRTPAAKPAAGKPSPS
Ga0316627_10010211613300033482SoilRGERRQAGGQGKREVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLTITESVIWLGARVSVAVGEEVAARKAPEKKAPEKKAPEKKALPPKAPAS
Ga0316627_10064322213300033482SoilMRVPSAVVALAVSSALAASLSCGRGDRQTPDQLRAELRALESERDALRARADALMANDPRVKTMPDTPVRVGVPTQLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVRKVVTIGEYDLDVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVQGGLVLTATAQDILAQPRFPQTKVNLKVVPSKESWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVKGQPVALSIKLGDLAITE
Ga0316629_1012407923300033483SoilVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVR
Ga0316626_1069228613300033485SoilEGFVDQVTLELKNLKVKKSGTVKKVVTLGQYDLHVTIHRVAGRLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGKGVAGATCGDLEVTREVSGRVRPDTYPVSGGLVLTATAQEILAEPRFPVIKVRLKVNPSAESWGAVQKILDEKEGLCGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVGIEPTMEVHGQAVALGIKVGDLAITEHVIWLGAHVSVATGEEALAKKVDGRTSK
Ga0316624_1032045423300033486SoilVTVRARTDVLATIALAPVFVLSLSCARKDRATPEQLRSEIAALEKERQILRGRFNELILNDPRIEGMPETPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNIKVKKSGTVRKVVTIGQYDLHVTINRVTGKLKTGKPDVTFGGNMVSLALRVTVASGSGNATIQFKWDGRNVAGATCGDLEVTQEVSGSVRPDTYPVSGGLVLTATAKEILAEPRFPVIKIRLRVNPSAQSWGAVQKILDEKEGVCGFVVDKVDVLGLVKKIVDRGFNVRLPTEKIKPMAIPVGIEPTMEVHGKPVALGIKVGALAITKHVIWLGAKVSVAVGEEAQKEIEERELDAQKKGKTR
Ga0316624_1033605123300033486SoilVAGFVDQVTLELKNLKVKKHGRVKKVVTIGEYDLDVLINRVAGRLKTGRPDVTFGGNKVTLALPVTVASGSGKATIHFKWDGKNVAGATCGDLEVTREVSGGVKPDTYPVSGGLVLTATAKEIMAQPRFPLIKINLKVNPSAQSWEAVDKILDEKEGLCGYVVDKVNVRGILQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMDVRGQPVALGIKLGQLAITEQMIWLGARVSVAMGEGAHKGIEERKPEAPKQGKAR
Ga0316624_1040504223300033486SoilMRARTAVLATLAAVSVLAVSAACGRKDRETPDNLRADIAALEKERDTLRGRMNELMVRDPRLKGMPATPVKVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGTVKKVVTIGQYDLNVTINRVTGKLKTGKPDVTFGGNKVSLALPVTVASGSGNATIHFKWDGKGVAGATCGDLEVTQEVSGRVRPDTYPVSGGLVLTATAQEILAEPRFPVIKVRLKVNPSAESWAAVQKILDEKEGICGYVVDKVNVLGIVQRLIDKGFNVRLPTEKIKSMAVPVG
Ga0316624_1086537313300033486SoilQVPDQIRAQIEALEKERAALRERMNELMVKDPRIPGMPDSPVRVGVPTTLARDLIQRVMSGFVDQVTLELKNLKVKKHGQVKKVVTLGEYDLNVLITRVSGRLKTGKPDVTFGGNKVTLALPVTIASGSGRAAIHFKWEGKSVAGATCGDLEVTREVWGGVRPDTYPLSGGLVLTATAKEILAEPRFPLIKVNLKVNPSQESWDSVQKILDEKEGVCGYVLDKVNVRGILERLVNKGFNVRLPTEKIKPMAVPVGIEPTMQVRGKPVA
Ga0316621_1027659713300033488SoilLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGHATINFKWDGKGVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIRVGQLAITESVIWLGAQVSVAVGDEAAAKKASVKKAPEKKAPEKKALPPKAPAS
Ga0316628_10006832563300033513SoilMRALPAVAVLAVSPLLVASLACGRKDRQSPDQIRAQIQGLQKERDSLRERMNELMVKDPRLPGMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKHGRVKKVVTIGEYDLDVLINRVAGRLKTGRPDVTFGGNKVTLALPVTVASGSGKATIHFKWDGKNVAGATCGDLEVTREVSGGVKPDTYPVSGGLVLTATAKEIMAQPRFPLIKINLKVNPSAQSWEAVDKILDEKEGLCGYVVDKVNVRGILQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMDVRGQPVALGIKLGQLAITEQMIWLGARVSVAMGEGAHKGIEERKPEAPKQGKAR
Ga0316628_10242284313300033513SoilLRERMNELMVKDPRIPGMPDSPVRVGVPTTLARDLIQRVMSGFVDQVTLELKNLKVKKHGQVKKVVTLGEYDLNVLITRVSGRLKTGKPDVTFGGNKVTLALPVTIASGSGRATIHFKWDGKSVAGATCGDLEVTREVWGGVRPDTYPLSGGLVLTATAKEILAEPRFPLIKVNLKVNPSPESWDSVQKILDEKEGVCGYVLDKVNVRGILERLVNKGFNVRLPTEKIKPM
Ga0316616_10000364763300033521SoilVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAVLAQKEGLCGYVVDKVNVRGIVQRLVDRGFNVRLPTEKIKPMAVPVGIEPTME
Ga0316616_10065902813300033521SoilAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELIVKDPRVKDMPKTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKRGRVKKVVTLGEYDLHVTIHRVKGRLKTGKPDVTFGGNKVAIALPVTVASGTGRATIHFKWNGKNVAGATCGDMEVTQEVSGGVRPDTYPVSGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQTVALGIRVGQLAITESVIWLGAQVSVAVGDEAAAKKAPETEVPGKKALPPKAPAS
Ga0316616_10111806423300033521SoilERMNELMVNDPRLPGMPNTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTIGHYDLNVVIHRVSGKLKTGKPDVAFGGDKVSLALPVTVASGSGHATIHFKWDGKNVAGATCGDLEVTQEVSGGVRPETYPVAGGLELTATAKEILARPRFPVIKVRLKVNPSAESWGAVDRILGEKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPLAVPVGIEPTMEVRGQPVALGIKLADLAITEHMIWLGAKVSVAVGDDAVRLIERKKAGEKKPR
Ga0316616_10273666013300033521SoilLKVKKSGRVRKVVTIGEYDLDVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVASGSGRATIRFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVQGGLVLTATAQDILAQPRFPQTKVNLKVVPSKESWAAVDGILGGKDGVCGYAVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMMVRGQPVALSIKLGDLAITEEVIWLGARVSVAV
Ga0316617_10034799113300033557SoilVTVRARNAALALLLVAPALVAALACGREDRQTPDQLRAEIEALEKERLSLRERVNELILKDPRVKDMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLELKNLKVKKSGRVKKVVTLGEYDLHVTIHRVKGKLKTGKPDVTFGGNKVAIALPVTVASGTGQATINFKWDGKNVADATCGDMEVTQDVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPLIKVNLKVVPSDESWAAVDAILAQKEGLCGYVVEKVNVRGIVQRLVDKGFNVRLPTEKIKPMAVPVGIEPTMEVRGQPVALGIKVGQLT
Ga0364925_0072997_3_8333300034147SedimentMRARNAAAALAVTPVLAAFLACGRQDRETPDQLRAEIQALEKERQSLRQRMNELIVRDPRVKSMPDTPVRVGVPTTLARDLIQRVVAGFVDQVTLDLKNLKVKKSGTVKKIVTIGQYDLNVTIHRVTGKLKTGKPEVTFGGDKVAIALPVTVVSGTGQATIHFKWDGKNVAGATCGDMEVTQEVSGGVRPDTYPVAGGLVLTATAKDILAEPRFPQTKVNLKVVPSEKSWAAVDSILGGKDGLCGYAVEKVDVRGIVQRLVDKGFNVRLPTEKIKPM
Ga0364934_0210834_7_7353300034178SedimentMPTQPVRVGVPTALARELITKVVTGFVDQVTLELKNLKVKKAGTVKKVVSIGQYELQVHIHRVTGKLKTGTPTLKFGGNQVSLAMPVTVASGSGRATINFKWDGKNVSGAVCGDMEITQEVSGGVKPANYPVAGGIVLTATAREILASPKFPLIKVNLKVQPSRESWAAVEKILEEKEGLCGYVLEKVNVLKIVQGLIERGFNVRLPTEKIKPMAVPVGIEPTMQVRGQPVELAIKLGQLAIT


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.