NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F057963

Overview

Basic Information
Family ID: F057963
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 135
Average Sequence Length: 82 residues
Representative Sequence: MRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPGPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV
Number of Associated Samples: 117
Number of Associated Scaffolds: 135
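
As a quick sanity check on the record above, the following minimal Python sketch confirms the length of the representative sequence and estimates how hydrophobic it is. The hydrophobic residue set (`AVLIMFWC`) and the helper name `hydrophobic_fraction` are assumptions made for illustration; they are not part of NMPFamsDB, and other hydrophobicity scales would give different values.

```python
# Representative sequence copied from the Basic Information table.
REPRESENTATIVE = (
    "MRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPV"
    "VAPGPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV"
)

# Assumed hydrophobic alphabet, chosen for illustration only.
HYDROPHOBIC = set("AVLIMFWC")


def hydrophobic_fraction(seq: str) -> float:
    """Return the fraction of residues that fall in HYDROPHOBIC."""
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(residue in HYDROPHOBIC for residue in seq) / len(seq)


if __name__ == "__main__":
    print("length:", len(REPRESENTATIVE))
    print(f"hydrophobic fraction: {hydrophobic_fraction(REPRESENTATIVE):.2f}")
```

The length matches the 82 residues reported as the family average. A high hydrophobic fraction would be consistent with, but does not prove, the transmembrane classification reported under Structure & Topology.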

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 20.00 %
% of genes near scaffold ends (potentially truncated): 25.19 %
% of genes from short scaffolds (< 2000 bps): 70.37 %
Associated GOLD sequencing projects: 109
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.37

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group: Bacteria (77.037 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (15.556 % of family members)
Environment Ontology (ENVO): Unclassified (40.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Subsurface (non-saline) (35.556 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 59.09%, β-sheet: 0.00%, Coil/Unstructured: 40.91%

Predicted 3D Structure

pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
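
The rule stated in this note can be written as a one-line check. The sketch below is only an illustration of the quoted cutoff (pTM < 0.7); the function and constant names are hypothetical and do not come from the NMPFamsDB code base.

```python
# Cutoff quoted in the note above; models below it are treated as low confidence.
PTM_THRESHOLD = 0.7


def is_low_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when a pTM score falls strictly below the threshold."""
    if not 0.0 <= ptm <= 1.0:
        raise ValueError("pTM scores are expected to lie in [0, 1]")
    return ptm < threshold


if __name__ == "__main__":
    # pTM-score reported for family F057963.
    print(is_low_confidence(0.37))
```

With the reported score of 0.37 the check returns `True`, which matches the page's statement that this model was not screened against SCOPe or PDB.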



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 135 Family Scaffolds
PF04519  Bactofilin       44.44
PF00072  Response_reg     13.33
PF13432  TPR_16           5.19
PF13304  AAA_21           3.70
PF02826  2-Hacid_dh_C     3.70
PF00263  Secretin         2.22
PF13476  AAA_23           2.22
PF14559  TPR_19           1.48
PF13565  HTH_32           0.74
PF03309  Pan_kinase       0.74
PF07045  DUF1330          0.74
PF13578  Methyltransf_24  0.74
PF05138  PaaA_PaaC        0.74
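
The frequencies above are percentages of the 135 family scaffolds, so each one can be converted back to an approximate scaffold count. The sketch below assumes the percentages were rounded to two decimals from whole-number counts; the helper name `percent_to_count` is hypothetical.

```python
FAMILY_SCAFFOLDS = 135  # denominator stated in the table header


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Recover the nearest whole-number count behind a rounded percentage."""
    if not 0.0 <= percent <= 100.0:
        raise ValueError("percent must lie in [0, 100]")
    return round(percent * total / 100)


if __name__ == "__main__":
    # A few rows taken from the table above.
    for name, pct in [("Bactofilin", 44.44), ("Response_reg", 13.33), ("HTH_32", 0.74)]:
        print(name, percent_to_count(pct))
```

Under that assumption, Bactofilin (44.44 %) corresponds to 60 scaffolds, Response_reg (13.33 %) to 18, and each 0.74 % entry to a single scaffold.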

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 135 Family Scaffolds
COG1664  Cytoskeletal protein CcmA, bactofilin family  Cytoskeleton [Z]  44.44
COG1521  Pantothenate kinase type III  Coenzyme transport and metabolism [H]  0.74
COG3396  1,2-phenylacetyl-CoA epoxidase, catalytic subunit  Secondary metabolites biosynthesis, transport and catabolism [Q]  0.74
COG5470  Uncharacterized conserved protein, DUF1330 family  Function unknown [S]  0.74



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
All Organisms  root  All Organisms  77.04 %
Unclassified   root  N/A            22.96 %


Associated Scaffolds


Scaffold  Taxonomy  Length
3300002122|C687J26623_10033600  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1373
3300002886|JGI25612J43240_1048098  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  626
3300003995|Ga0055438_10015574  All Organisms → cellular organisms → Bacteria  1643
3300004009|Ga0055437_10001049  All Organisms → cellular organisms → Bacteria → Proteobacteria  4281
3300004052|Ga0055490_10250135  All Organisms → cellular organisms → Bacteria  546
3300004156|Ga0062589_100194640  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1455
3300004156|Ga0062589_101895298  Not Available  601
3300004463|Ga0063356_100778418  All Organisms → cellular organisms → Bacteria  1331
3300005183|Ga0068993_10016971  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1775
3300005334|Ga0068869_100093162  All Organisms → cellular organisms → Bacteria  2269
3300005336|Ga0070680_100747851  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  842
3300005444|Ga0070694_101140118  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  652
3300005458|Ga0070681_11405885  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  621
3300005459|Ga0068867_100166385  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1743
3300005468|Ga0070707_101078308  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  769
3300005468|Ga0070707_101127930  Not Available  750
3300005617|Ga0068859_101478133  Not Available  750
3300005719|Ga0068861_100495712  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Xanthomonadales → Xanthomonadaceae  1103
3300006845|Ga0075421_100403502  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1641
3300006845|Ga0075421_102311020  All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia  565
3300009038|Ga0099829_10000377  All Organisms → cellular organisms → Bacteria  23174
3300009053|Ga0105095_10785561  Not Available  533
3300009078|Ga0105106_10873145  Not Available  640
3300009078|Ga0105106_10987365  Not Available  598
3300009087|Ga0105107_10887425  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  621
3300009089|Ga0099828_10001567  All Organisms → cellular organisms → Bacteria  15422
3300009093|Ga0105240_12026577  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  598
3300009147|Ga0114129_10021789  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium  9096
3300009157|Ga0105092_10507070  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  692
3300010391|Ga0136847_13506729  Not Available  906
3300010399|Ga0134127_10084947  All Organisms → cellular organisms → Bacteria  2730
3300011410|Ga0137440_1072627  Not Available  683
3300011427|Ga0137448_1152997  Not Available  640
3300011429|Ga0137455_1043881  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1255
3300011438|Ga0137451_1033797  Not Available  1486
3300012164|Ga0137352_1085625  Not Available  628
3300012225|Ga0137434_1007086  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1192
3300012226|Ga0137447_1001811  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  2373
3300012355|Ga0137369_10569235  Not Available  792
3300012685|Ga0137397_10107864  All Organisms → cellular organisms → Bacteria  2045
3300012685|Ga0137397_10446784  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  963
3300012918|Ga0137396_10795682  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  695
3300012922|Ga0137394_10001758  All Organisms → cellular organisms → Bacteria → Proteobacteria  15916
3300012929|Ga0137404_10422733  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1178
3300012929|Ga0137404_11315766  Not Available  666
3300014308|Ga0075354_1163351  Not Available  516
3300014321|Ga0075353_1034288  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  992
3300014877|Ga0180074_1044720  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  930
3300014877|Ga0180074_1160071  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  503
3300014885|Ga0180063_1033500  All Organisms → cellular organisms → Bacteria  1440
3300014885|Ga0180063_1242659  Not Available  576
3300015245|Ga0137409_10062120  All Organisms → cellular organisms → Bacteria  3511
3300015264|Ga0137403_10007450  All Organisms → cellular organisms → Bacteria  12355
3300018028|Ga0184608_10050115  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1641
3300018052|Ga0184638_1027550  All Organisms → cellular organisms → Bacteria  2037
3300018053|Ga0184626_10032065  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  2171
3300018056|Ga0184623_10014164  All Organisms → cellular organisms → Bacteria  3476
3300018059|Ga0184615_10514235  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  642
3300018063|Ga0184637_10025538  All Organisms → cellular organisms → Bacteria  3561
3300018074|Ga0184640_10009658  All Organisms → cellular organisms → Bacteria → Proteobacteria  3417
3300018075|Ga0184632_10025166  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  2529
3300018075|Ga0184632_10195380  Not Available  891
3300018079|Ga0184627_10004834  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  5928
3300018084|Ga0184629_10022287  All Organisms → cellular organisms → Bacteria  2642
3300018422|Ga0190265_10052049  All Organisms → cellular organisms → Bacteria → Proteobacteria  3577
3300018422|Ga0190265_10095891  All Organisms → cellular organisms → Bacteria  2764
3300018422|Ga0190265_10456508  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium  1385
3300018429|Ga0190272_10158817  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1570
3300019255|Ga0184643_1235697  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  636
3300019259|Ga0184646_1508240  Not Available  645
3300019360|Ga0187894_10087569  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1695
3300019458|Ga0187892_10064081  All Organisms → cellular organisms → Bacteria  2400
3300019487|Ga0187893_10785371  Not Available  579
3300019789|Ga0137408_1363071  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium  3010
3300019869|Ga0193705_1095219  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  551
3300019879|Ga0193723_1020831  All Organisms → cellular organisms → Bacteria  2011
3300019882|Ga0193713_1014722  All Organisms → cellular organisms → Bacteria  2346
3300019882|Ga0193713_1058190  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1104
3300019883|Ga0193725_1021649  All Organisms → cellular organisms → Bacteria  1754
3300019889|Ga0193743_1085341  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1227
3300019998|Ga0193710_1002675  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1787
3300020004|Ga0193755_1018276  All Organisms → cellular organisms → Bacteria  2311
3300020022|Ga0193733_1200929  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  512
3300021051|Ga0206224_1004419  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1454
3300021073|Ga0210378_10200930  Not Available  761
3300021081|Ga0210379_10053255  All Organisms → cellular organisms → Bacteria  1616
3300021081|Ga0210379_10496144  Not Available  542
3300021090|Ga0210377_10043210  All Organisms → cellular organisms → Bacteria → Proteobacteria  3153
3300022195|Ga0222625_1666989  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  666
3300022534|Ga0224452_1035502  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium  1451
3300022694|Ga0222623_10020795  All Organisms → cellular organisms → Bacteria → Proteobacteria  2453
3300025165|Ga0209108_10150156  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1229
3300025324|Ga0209640_10009726  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  8379
3300025580|Ga0210138_1006823  All Organisms → cellular organisms → Bacteria  2144
3300025885|Ga0207653_10342389  Not Available  583
3300025917|Ga0207660_10023666  All Organisms → cellular organisms → Bacteria → Proteobacteria  4151
3300025923|Ga0207681_11654542  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  535
3300025933|Ga0207706_11532276  Not Available  543
3300025935|Ga0207709_10096541  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1944
3300025942|Ga0207689_10023019  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales  5232
3300025965|Ga0210090_1016578  All Organisms → cellular organisms → Bacteria  1006
3300025973|Ga0210145_1006116  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1217
3300026118|Ga0207675_100847258  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  929
3300026285|Ga0209438_1000687  All Organisms → cellular organisms → Bacteria  10615
3300026354|Ga0257180_1035340  Not Available  688
3300026371|Ga0257179_1008998  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1015
3300026499|Ga0257181_1009416  Not Available  1275
3300026535|Ga0256867_10147436  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  883
3300027388|Ga0208995_1076974  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  584
3300027669|Ga0208981_1095568  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  758
(restricted) 3300027799|Ga0233416_10126117  Not Available  875
3300027815|Ga0209726_10021513  All Organisms → cellular organisms → Bacteria  5462
3300027909|Ga0209382_10125526  All Organisms → cellular organisms → Bacteria → Proteobacteria  2991
3300027979|Ga0209705_10599869  Not Available  526
3300028792|Ga0307504_10032650  All Organisms → cellular organisms → Bacteria → Proteobacteria  1389
3300028792|Ga0307504_10303949  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  601
3300028803|Ga0307281_10081453  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1065
3300028828|Ga0307312_10022835  All Organisms → cellular organisms → Bacteria  3579
3300028828|Ga0307312_10143978  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1508
3300030006|Ga0299907_10567573  Not Available  887
(restricted) 3300031150|Ga0255311_1009725  All Organisms → cellular organisms → Bacteria → Proteobacteria  1932
(restricted) 3300031150|Ga0255311_1102338  Not Available  621
3300031152|Ga0307501_10134502  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  658
(restricted) 3300031197|Ga0255310_10006068  All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria  3126
3300031229|Ga0299913_10318550  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1542
(restricted) 3300031248|Ga0255312_1163399  Not Available  556
3300031720|Ga0307469_10014945  All Organisms → cellular organisms → Bacteria  3996
3300031720|Ga0307469_11322094  Not Available  685
3300031740|Ga0307468_100009402  All Organisms → cellular organisms → Bacteria  3723
3300031949|Ga0214473_12034258  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  559
3300032174|Ga0307470_10002584  All Organisms → cellular organisms → Bacteria  6913
3300032174|Ga0307470_11957326  Not Available  500
3300032205|Ga0307472_102178875  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  559
3300033233|Ga0334722_10137694  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  1834
3300033407|Ga0214472_11677232  All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria  538
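
Each scaffold identifier in the table above appears to combine a numeric IMG/M taxon OID with a scaffold name, joined by `|`. That reading is an assumption, inferred from the fact that the same numeric prefixes appear in the Taxon OID column of the Associated Samples table. The following minimal Python sketch splits an identifier under that assumption and records the `(restricted)` flag; the type and function names are hypothetical.

```python
from typing import NamedTuple

RESTRICTED_PREFIX = "(restricted)"


class ScaffoldId(NamedTuple):
    taxon_oid: str      # numeric prefix, assumed to be the IMG/M taxon OID
    scaffold_name: str  # remainder after the "|" separator
    restricted: bool    # True when the row carries the "(restricted)" marker


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split '<taxon_oid>|<scaffold_name>', tolerating a '(restricted)' prefix."""
    text = raw.strip()
    restricted = text.startswith(RESTRICTED_PREFIX)
    if restricted:
        text = text[len(RESTRICTED_PREFIX):].strip()
    taxon_oid, sep, name = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, name, restricted)


if __name__ == "__main__":
    print(parse_scaffold_id("3300002122|C687J26623_10033600"))
    print(parse_scaffold_id("(restricted) 3300027799|Ga0233416_10126117"))
```

Grouping parsed identifiers by `taxon_oid` is one way to relate scaffolds to the samples they came from, provided the assumption about the prefix holds.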

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  15.56%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  8.89%
Soil  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil  8.15%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  8.15%
Groundwater Sediment  Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment  5.19%
Natural And Restored Wetlands  Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands  5.19%
Freshwater Sediment  Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment  4.44%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  4.44%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  2.22%
Soil  Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil  2.22%
Corn Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere  2.22%
Switchgrass Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere  2.22%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere  2.22%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  2.96%
Sandy Soil  Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil  2.96%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  2.96%
Sediment  Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment  1.48%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  1.48%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  1.48%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  1.48%
Natural And Restored Wetlands  Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands  1.48%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  1.48%
Soil  Environmental → Terrestrial → Soil → Loam → Unclassified → Soil  1.48%
Microbial Mat On Rocks  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks  1.48%
Freshwater Sediment  Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment  0.74%
Groundwater  Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater  0.74%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  0.74%
Terrestrial Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil  0.74%
Deep Subsurface Sediment  Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment  0.74%
Bio-Ooze  Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze  0.74%
Arabidopsis Thaliana Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere  0.74%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere  0.74%
Corn Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere  0.74%
Switchgrass Rhizosphere  Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere  0.74%
Corn Rhizosphere  Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere  0.74%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID  Sample Name  Habitat Type
3300002122  Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2  Environmental
3300002886  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm  Environmental
3300003995  Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2  Environmental
3300004009  Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2  Environmental
3300004052  Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2  Environmental
3300004156  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1  Environmental
3300004463  Combined assembly of Arabidopsis thaliana microbial communities  Host-Associated
3300005183  Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1  Environmental
3300005334  Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2  Host-Associated
3300005336  Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG  Environmental
3300005444  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG  Environmental
3300005458  Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG  Environmental
3300005459  Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2  Host-Associated
3300005468  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG  Environmental
3300005617  Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2  Host-Associated
3300005719  Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2  Host-Associated
3300006845  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5  Host-Associated
3300009038  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG  Environmental
3300009053  Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015  Environmental
3300009078  Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015  Environmental
3300009087  Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm September2015  Environmental
3300009089  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG  Environmental
3300009093  Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG  Host-Associated
3300009147  Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)  Host-Associated
3300009157  Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015  Environmental
3300010391  Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406  Environmental
3300010399  Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3  Environmental
3300011410  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT222_2  Environmental
3300011427  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT418_2  Environmental
3300011429  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2  Environmental
3300011438  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2  Environmental
3300012164  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT730_2  Environmental
3300012225  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2  Environmental
3300012226  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2  Environmental
3300012355  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG  Environmental
3300012685  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG  Environmental
3300012918  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG  Environmental
3300012922  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG  Environmental
3300012929  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)  Environmental
3300014308  Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1  Environmental
3300014321  Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleB_D1  Environmental
3300014877  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT366_16_10D  Environmental
3300014885  Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D  Environmental
3300015245  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)  Environmental
3300015264  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)  Environmental
3300018028  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex  Environmental
3300018052  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2  Environmental
3300018053  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1  Environmental
3300018056  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1  Environmental
3300018059  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex  Environmental
3300018063  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2  Environmental
3300018074  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2  Environmental
3300018075  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1  Environmental
3300018079  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1  Environmental
3300018084  Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1  Environmental
3300018422  Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T  Environmental
3300018429  Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T  Environmental
3300019255  Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60 (Metagenome Metatranscriptome)  Environmental
3300019259  Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)  Environmental
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020022Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U1s2EnvironmentalOpen in IMG/M
3300021051Subsurface sediment microbial communities from Mancos shale, Colorado, United States - Mancos A1EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025935Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025965Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025973Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300027388Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027979Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires obtaining a license from the corresponding author(s) of the datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
C687J26623_100336002 | 3300002122 | Soil | MRWARQAAKRRASAVTGLLAGVTLVALPVEITIGLFEQPVIAPPPVASLLLLGSCLLLIGSGLGLRRQEAARSAARPPAPGA*
JGI25612J43240_10480982 | 3300002886 | Grasslands Soil | RFPAALKWRASAVTGFIAGLTLVSLPIEVTIGLSEHPLIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0055438_100155744 | 3300003995 | Natural And Restored Wetlands | MRRLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGFGLRRLEAARVGARRPTPGV*
Ga0055437_100010496 | 3300004009 | Natural And Restored Wetlands | MSSAYGLFRLLSGISVATLRAMRRLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGFGLRRLEAARVGARRPTPGV*
Ga0055490_102501352 | 3300004052 | Natural And Restored Wetlands | VATLRAMRRIHRLLRWRASAVTGLIAVLALVSVPVEVTIGLAEHPVVPPGRLASLLLIGSCLLLVGSGFGLRREEAARAGARRPTPSA*
Ga0062589_1001946402 | 3300004156 | Soil | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0062589_1018952982 | 3300004156 | Soil | MLKGRAHTVTGVLAGLALISLPVEITIGLAEHPVVAPAPAVSLLLISSCLLLVGSGLGLRRLEAMRVGAP
Ga0063356_1007784183 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MRHLSRMLKGRAHTVTGVLAGLALISLPVEITIGLAEHPVVAPAPAVSLLLISSCLLLVGSGLGLRRLEAMRVGAPRPTPGL*
Ga0068993_100169714 | 3300005183 | Natural And Restored Wetlands | MRRLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGFSLRRLEAARVGARRPTPGV*
Ga0068869_1000931625 | 3300005334 | Miscanthus Rhizosphere | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGFSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0070680_1007478511 | 3300005336 | Corn Rhizosphere | ATLRAMRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0070694_1011401182 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | GISVATLRAMRRVQGMLKWRGSAITGLIAGLALVSLPLEVTIGLAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEVSRAGARRPTPRA*
Ga0070681_114058851 | 3300005458 | Corn Rhizosphere | IPLATLRAMRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0068867_1001663854 | 3300005459 | Miscanthus Rhizosphere | SAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0070707_1010783081 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MRQVRGVLKGRASAATGLFAVLALVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV*
Ga0070707_1011279302 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MRRVQGMLKWRGSAITGLIAGLALVSLPLEVTIGLAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGARRPTPSA*
Ga0068859_1014781332 | 3300005617 | Switchgrass Rhizosphere | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLVGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0068861_1004957121 | 3300005719 | Switchgrass Rhizosphere | STGRARFRLLSGIPLATLRAMRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0075421_1004035024 | 3300006845 | Populus Rhizosphere | MRQGQGLLRGWASAGAGLVAVLSLVSLPVEVTIQLVEHPVVAPTPVVSLLLISSCLLLVGSGLGLRRLEAARTGARRPTPGV*
Ga0075421_1023110201 | 3300006845 | Populus Rhizosphere | MRGLSWALRWRASAVTALVAALGLVSLPFEVTVGLAEHPVVAPAPLVSLLLIGSCLLLVGGGLGLRRLEAARLAAQPPSPDA*
Ga0099829_1000037718 | 3300009038 | Vadose Zone Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV*
Ga0105095_107855612 | 3300009053 | Freshwater Sediment | MRRVKRMLKWRASAVTGLIAVLALVGLPVEVTIGLVEQPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA*
Ga0105106_108731451 | 3300009078 | Freshwater Sediment | MRRVKRMLKWRASAVTGLIAVLALVGLPVEVTIGLVEQPVVPPGRLASLLLIGSCLLLVGSGLGLRREEAARAGARRSTPSA*
Ga0105106_109873651 | 3300009078 | Freshwater Sediment | MLRWRASVVTGLIAVLALVSLPVEVTIGLVDHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA*
Ga0105107_108874251 | 3300009087 | Freshwater Sediment | MTFSLDSDSGGSLFWLLSGIPVATLRAMRWVQRMLRWRASVVTGLIAVLALVSLPVEVTIGLVDHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA*
Ga0099828_100015678 | 3300009089 | Vadose Zone Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV*
Ga0105240_120265771 | 3300009093 | Corn Rhizosphere | VSAVTGFIAGLALVSLPIEVTIGFSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0114129_1002178910 | 3300009147 | Populus Rhizosphere | MLKWRASAITGLIAGLALVSLPLEVTIELAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGARRPTPSA*
Ga0105092_105070702 | 3300009157 | Freshwater Sediment | KWRPSAIAGLIAVLALVSLPVELTIGLTEQPVVPPGRLASLLLIGSCLLLVGSGLGLRREEAARAGARRPTPSG*
Ga0136847_135067293 | 3300010391 | Freshwater Sediment | MLKWRASAVTGLIAVLALVSLPVEVTVGLVDHPVVPPGRLASLLLIGSCLLLVGSGLGLRREEAARVGARRPTPSA*
Ga0134127_100849474 | 3300010399 | Terrestrial Soil | MLKGRAHTVTGVLAGLALISLPVEITIGLAEHPVVAPAPAVSLLLISSCLLLVGSGLGLRRLEAMRVGAPRPTPGL*
Ga0137440_10726272 | 3300011410 | Soil | MRQLTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV*
Ga0137448_11529972 | 3300011427 | Soil | MRQLTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRLTPGV*
Ga0137455_10438813 | 3300011429 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV*
Ga0137451_10337974 | 3300011438 | Soil | MRQLQRALKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV*
Ga0137352_10856252 | 3300012164 | Soil | VPKGRAGLVTGLVAVLTLVSLPVEVTIQLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGARRPTPGV*
Ga0137434_10070861 | 3300012225 | Soil | MRQLQRVLKERASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRPETARAGAQRPTPGV*
Ga0137447_10018115 | 3300012226 | Soil | TRMLKWRASAVTGLLAVLALVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV*
Ga0137369_105692352 | 3300012355 | Vadose Zone Soil | MLKWRASAVTGVVAGLALVSLPVEVRIGLVEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRRVEASRADARRPTPSA*
Ga0137397_101078645 | 3300012685 | Vadose Zone Soil | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPLIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0137397_104467842 | 3300012685 | Vadose Zone Soil | MLKWRASALTGLLAGLGLVSLPVEVTIGLAEHPVIAPTPEVSLLLIGSCFLLVGSGVGLRRDEASRAGARRPTPSA*
Ga0137396_107956822 | 3300012918 | Vadose Zone Soil | MLKWRASALTGLLAGLGLVSLPVEVTIGLAEHPVIAPTPEVSLLLIGSCLLLVGSGVGLRRDEASRAGARRPTPSA*
Ga0137394_1000175820 | 3300012922 | Vadose Zone Soil | MRRFPAALKWRASAVTGFIAGLTLVSLPIEVTIGLSEHPLIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA*
Ga0137404_104227333 | 3300012929 | Vadose Zone Soil | MLKWRGSAITGLIAGLALVSLPLEVTIGLAEHPVIAPTPEVSLLLIGSCLLLVGSGVGLRREEVSRAGARRPTPSA*
Ga0137404_113157663 | 3300012929 | Vadose Zone Soil | MRRFPAALKWRASAVTGFIAGLTLVSLPIEVTIGLSEHPLIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAG
Ga0075354_11633512 | 3300014308 | Natural And Restored Wetlands | MRRLQGVLRGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVAPAPVASLLLIGSCLLLVGSGLGLRRLEA
Ga0075353_10342883 | 3300014321 | Natural And Restored Wetlands | MRRLQGVLRGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVAPAPVASLLLIGSCLLLVGSGLGLRRLEAARVSARRPTPGV*
Ga0180074_10447202 | 3300014877 | Soil | MRRVTRILKWRASAVTGLLAVLALVSLPVEVTIGLVEHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA*
Ga0180074_11600712 | 3300014877 | Soil | WLLSGISLAALRAMRRVQRMLKWRASAVTGLIAVLALVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRQETVRAGAQRPTPGV*
Ga0180063_10335001 | 3300014885 | Soil | MRRVTRILKWRASAVTGLLAVLALVSLPVEVTIGLVEHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPS
Ga0180063_12426592 | 3300014885 | Soil | MRQLQRVLRGRASAVIGLVAGLALVSLPVEVTIGLVDHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA*
Ga0137409_100621204 | 3300015245 | Vadose Zone Soil | MLKWRASALTGLLAGLGLISLPVEVTIGLAEHPVIAPTPEASLLLIGSCLLLVGSGVGLRRDEASRAGARRPTPSA*
Ga0137403_1000745015 | 3300015264 | Vadose Zone Soil | MLKWRASALTGLLAGLGLVSLPVEVTIGLAEHPVIAPTPEVSLLLIGSCLLLVGSGVGLRREEVSRAGARRPTPSA*
Ga0184608_100501152 | 3300018028 | Groundwater Sediment | MRRVQGMLKWRGSAITGLIAGLALVSLPLEVTIGLAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0184638_10275502 | 3300018052 | Groundwater Sediment | MRQLPRVLKGRASAVTGLIAGLALVSLPVEITLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0184626_100320653 | 3300018053 | Groundwater Sediment | MRQLQRALKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPSPGV
Ga0184623_100141646 | 3300018056 | Groundwater Sediment | MRQLQRALKGRASAVTGLVAGLALVSLPVEVTLGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0184615_105142351 | 3300018059 | Groundwater Sediment | RARSASSACGPFRLLSRISVATLRAMRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPGPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV
Ga0184637_100255386 | 3300018063 | Groundwater Sediment | MRQLQRALKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0184640_100096586 | 3300018074 | Groundwater Sediment | MRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRRTPGV
Ga0184632_100251663 | 3300018075 | Groundwater Sediment | MRQLQRVLKGRASAVTGLVAVLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0184632_101953801 | 3300018075 | Groundwater Sediment | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPSPGV
Ga0184627_100048341 | 3300018079 | Groundwater Sediment | ATLRAMRQLQRVLKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0184629_100222875 | 3300018084 | Groundwater Sediment | MRQLQRLLKGRASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0190265_100520492 | 3300018422 | Soil | MRCFLGWLKGRASAVTGFVAVLALVSLPVELRVGFVEHPVIAPAPVASMLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGL
Ga0190265_100958912 | 3300018422 | Soil | MRRAWSAARRRASAVTGLLAVVALVGLPVEVTIRLFEHPVIAPPRLASLLLLGSCLLLVGSGLGLRRLEARSGARPPASGA
Ga0190265_104565083 | 3300018422 | Soil | MQRAAMRRVQGMLKWRVSAVTGLIAGLGLVTLPLELTIGLVEHPVSAPTPVVSLLLISSCLLLVGTGLSLRRVDAARVGARRHTPGA
Ga0190272_101588172 | 3300018429 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAPRPTPGV
Ga0184643_12356971 | 3300019255 | Groundwater Sediment | GMLKWRGSAITGLIAGLALVSLPVEVTIALAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0184646_15082402 | 3300019259 | Groundwater Sediment | MRQLQRVLKGRASAVTGLVAVLALVSLPVEVTIRFVEYPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0187894_100875694 | 3300019360 | Microbial Mat On Rocks | MRQSQGLPRGWASAGAGFVAVLSLVSLPVEVTIRLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARTGARRPTQGV
Ga0187892_100640814 | 3300019458 | Bio-Ooze | MRPGQGLLRGWASAGAGLVAGLSLVSLPVEVTVRLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARTGARRPTPGI
Ga0187893_107853711 | 3300019487 | Microbial Mat On Rocks | MRTFLLDSDSAGGPLRLLSRIPVATLRGMRQGQGLLRGWASAGAGLVAVLSLVSLPVEVTIRLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARTGARRPTPGV
Ga0137408_13630715 | 3300019789 | Vadose Zone Soil | MLKWRASALTGLLAGLGLVSLPVEVTIGLAEHPVIAPTPEVSLLLIGSCLLLVGSGVGLRRDEASRAGARRPTPSA
Ga0193705_10952192 | 3300019869 | Soil | PSSGRALFWLLSGIPVATLRAMRRVQGMLKWRASAITGLIAGLALVSLPVEVTIALTEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0193723_10208313 | 3300019879 | Soil | MRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIPPGRLVSLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0193713_10147223 | 3300019882 | Soil | MRRVQGMLKWRASAITGLIAGLALVSLPVEVTIALAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0193713_10581903 | 3300019882 | Soil | MRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIPPGHLVSLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0193725_10216495 | 3300019883 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0193743_10853413 | 3300019889 | Soil | LKGRASAVTGLVAVLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGARRPTPGV
Ga0193710_10026753 | 3300019998 | Soil | MRRLPGALKWRASAVIGFIAGLALVSLPIEVTIGLSDHPVIPPGRLVSLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0193755_10182765 | 3300020004 | Soil | MRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIPPGHLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0193733_12009291 | 3300020022 | Soil | LFWLLSGIPVATLRAMRRVQGMLKWRASAITGLIAGLALVSLPVEVTIALTEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0206224_10044193 | 3300021051 | Deep Subsurface Sediment | MRQLQRVLKERASAVTGLVAGLALVSLPVEVTLRFVEHPVIAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0210378_102009302 | 3300021073 | Groundwater Sediment | MRQLPRVLKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0210379_100532553 | 3300021081 | Groundwater Sediment | MRQLQRVLKGRASAVTGLVAVLALVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0210379_104961442 | 3300021081 | Groundwater Sediment | MRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRLTPGV
Ga0210377_100432106 | 3300021090 | Groundwater Sediment | MRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPGPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV
Ga0222625_16669892 | 3300022195 | Groundwater Sediment | MRQLPRVLKGRASAVTGLIAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGARRPTPGV
Ga0224452_10355023 | 3300022534 | Groundwater Sediment | MRQLQRVLKGRASAVTGLVAVLALVSLPVEVTIRFVEYPVVAPAPVVSLLLISSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0222623_100207953 | 3300022694 | Groundwater Sediment | MRQLPRVLKGRASAVTGLIAGLALVSLPVEITLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGARRPTPGV
Ga0209108_101501562 | 3300025165 | Soil | MRWARQAAKRRASAVTGLLAVVTLVALPVEITIGLFEQPVIAPPPVASLLLLGSCLLLIGSGLGLRRQEAARSAARPPAPGA
Ga0209640_100097265 | 3300025324 | Soil | MRWARQAAKRRASAVTGLLAGVTLVALPVEITIGLFEQPVIAPPPVASLLLLGSCLLLVGSGLGLRRQEAARSAARPPAPGA
Ga0210138_10068234 | 3300025580 | Natural And Restored Wetlands | MRRLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGFSLRRLEAARVGARRPTPGV
Ga0207653_103423892 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0207660_100236664 | 3300025917 | Corn Rhizosphere | MLKGRAHTVTGVLAGLALISLPVEITIGLAEHPVVAPAPAVSLLLISSCLLLVGSGLGLRRLEAMRVGAPRPTPGL
Ga0207681_116545421 | 3300025923 | Switchgrass Rhizosphere | PLATLRAMRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0207706_115322762 | 3300025933 | Corn Rhizosphere | MLKGRAHTVTGVLAGLALISLPVEITIGLAEHPVVAPAPAVSVLLISSCLLLVGSGLGLRRLEAMRVGAPRPTPGL
Ga0207709_100965412 | 3300025935 | Miscanthus Rhizosphere | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0207689_100230192 | 3300025942 | Miscanthus Rhizosphere | MRRFPAALKWRASAVTGFIAGLALVSLPIEVTIGFSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0210090_10165782 | 3300025965 | Natural And Restored Wetlands | MRRLQGVLRGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVAPAPVASLLLIGSCLLLVGSGLGLRRLEAARLSARRPTPGV
Ga0210145_10061162 | 3300025973 | Natural And Restored Wetlands | MRRLQGVLRGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVAPAPVASLLLIGSCLLLVGSGLGLRRLEAARVSARRPTPGV
Ga0207675_1008472583 | 3300026118 | Switchgrass Rhizosphere | KWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGWLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0209438_100068712 | 3300026285 | Grasslands Soil | MRRFPAALKWRASAVTGFIAGLTLVSLPIEVTIGLSEHPLIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0257180_10353402 | 3300026354 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0257179_10089981 | 3300026371 | Soil | MRQLQRVLKGRVSAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0257181_10094161 | 3300026499 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTIGFVEHPVVTPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGAQRPTPGV
Ga0256867_101474362 | 3300026535 | Soil | MRRVRRALKGRASAITGLVAVVALVSLPVEVRIGLVKHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARADARRPTPGV
Ga0208995_10769741 | 3300027388 | Forest Soil | VNDSGRALFWLLSGISVATLRAMRRVQGMLKWRASAITGLLAGLALVSLPVEVTIGLTEHPVIAPTPEESLLLISSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0208981_10955681 | 3300027669 | Forest Soil | AAPKYPHDAARAKALLAGLGLISLPVEVTIGLAEHPVIAPTPEVSLLLIGSCLLLVGSGVGLRRDEASRAGARRPTPSA
(restricted) Ga0233416_101261172 | 3300027799 | Sediment | MQRPRQALKRRASAVTGLAAVVALVGLPVEVTIGLAEQPVLPPPPVASLLLIASCLLLVGSGLGLRRLAAARLAVPPPVPGL
Ga0209726_100215135 | 3300027815 | Groundwater | MRQLQGALKGRASAVTGLVTVLTLVSLPVEVTIGFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGARRPTPGV
Ga0209382_101255265 | 3300027909 | Populus Rhizosphere | MRQGQGLLRGWASAGAGLVAVLSLVSLPVEVTIQLVEHPVVAPTPVVSLLLISSCLLLVGSGLGLRRLEAARTGARRPTPGV
Ga0209705_105998691 | 3300027979 | Freshwater Sediment | MRWVQRMLRWRASVVTGLIAVLALVSLPVEVTIGLVDHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA
Ga0307504_100326503 | 3300028792 | Soil | LLSGIPLATLRAMRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIAPGRPASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0307504_103039492 | 3300028792 | Soil | MRRLPGTLKWRASAVTGFIAGLALVSLPMEVTIGLSEHPVIPPGRVASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPSPSA
Ga0307281_100814531 | 3300028803 | Soil | GPFRLLSRISVATLRAMRQLQGVLKGRASAVTGLVAVLALVSLPVEVTIRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEATRAGARRPTPGV
Ga0307312_100228357 | 3300028828 | Soil | AMRRVQGMLKWRASAITGLIAGLALVSLPVEVTIALAEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
Ga0307312_101439784 | 3300028828 | Soil | MRQLQRVLKGRASAVTGLVAGLALVSLPVEVTLRFVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETVRAGAQRPTPGV
Ga0299907_105675732 | 3300030006 | Soil | MQRVRRALKGRASAITGLVAVVALVSLPVEVRIGLVKHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARADARRPTPGV
(restricted) Ga0255311_10097255 | 3300031150 | Sandy Soil | MRRVKRMLKWRASAVTGLIAVLALVSLPVEVTIGLIEHPVVPPGRLASLLLIGSCLLLVGSGLGLRREEAARAGARRSTPSA
(restricted) Ga0255311_11023381 | 3300031150 | Sandy Soil | MRWVQRMLRWRASVVTGLIAVLALVSLPVEVTIGLVDHPVVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPT
Ga0307501_101345021 | 3300031152 | Soil | LRAMRRVQGMLKWRASAITGLIAGLALVSLPVEVTIALTEHPVIAPTPGVSLLLIGSCLLLVGSGVGLRREEASRAGTRRPTPSA
(restricted) Ga0255310_100060682 | 3300031197 | Sandy Soil | MRRLQGALKWRASAVTGLIAGLALVSLPVEVTIGLSEHPVIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0299913_103185503 | 3300031229 | Soil | MRRVRRALKGRASAITGLVAVVALVSLPVEVRIGLVKHPVVAPPPVVSLLLIGSCLLLVGSGLGLRRLEAARADARRPTPGV
(restricted) Ga0255312_11633992 | 3300031248 | Sandy Soil | MRRLQGALKWRASAVTGLIAGLALVSLPVEVTIGLSEHPVIPPGRLASFLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0307469_100149455 | 3300031720 | Hardwood Forest Soil | MRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSEHPVIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0307469_113220941 | 3300031720 | Hardwood Forest Soil | MRQLAGVLKGRAHAVTGFVAGLALVSVPVEVTIGFGEYPVVAPAPVVSLLLISSCLLLVGSGLGLRRLEAMRA
Ga0307468_1000094021 | 3300031740 | Hardwood Forest Soil | AMRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIPPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0214473_120342581 | 3300031949 | Soil | ASAVTGLVAVLTLVSLPVEVTIRLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLEAARAGVRRPTPGV
Ga0307470_100025848 | 3300032174 | Hardwood Forest Soil | MLKWRPSVITGLIAVLALVSVPVEITIGLVEHTPLVPPGRLASLLLIGSCLLLVGGGLGLRREEAARAGARRPTPSA
Ga0307470_119573261 | 3300032174 | Hardwood Forest Soil | MRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIAPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0307472_1021788751 | 3300032205 | Hardwood Forest Soil | RARFRLLSGIPLATLRAMRRLPGALKWRASAVTGFIAGLALVSLPIEVTIGLSDHPVIAPGRLASLLLIGSCLLLVGSGLGLRRVEAARAGAGRPTPSA
Ga0334722_101376943 | 3300033233 | Sediment | MRRVQRMLKWRASAVTGLIAVLALVSLPVEITIGLVEHPVVPPGRLASLLLIGSCLLLVGSGLGLRREEAARAGARRPTPSA
Ga0214472_116772321 | 3300033407 | Soil | ATLRAMRRLPGVLKGRASAITGLVAVLTLVSLPVEVTIRLVEHPVVAPAPVVSLLLIGSCLLLVGSGLGLRRLETARAGARRPTPGV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice