NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F041325

Metagenome Family F041325

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F041325
Family Type Metagenome
Number of Sequences 160
Average Sequence Length 143 residues
Representative Sequence MRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVAEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Number of Associated Samples 137
Number of Associated Scaffolds 160

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 62.50 %
% of genes near scaffold ends (potentially truncated) 38.12 %
% of genes from short scaffolds (< 2000 bps) 72.50 %
Associated GOLD sequencing projects 133
AlphaFold2 3D model prediction Yes
3D model pTM-score0.72

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (80.625 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(20.000 % of family members)
Environment Ontology (ENVO) Unclassified
(41.875 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(30.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 35.43%    β-sheet: 24.57%    Coil/Unstructured: 40.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.72
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
d.17.4.1: Scytalone dehydratased1idpa_1idp0.72621
d.17.4.27: ECA1476-liked3d9ra13d9r0.71744
d.17.4.17: SAV4671-liked3cnxa13cnx0.7085
d.17.4.18: BxeB1374-liked2owpa12owp0.69738
d.17.4.28: BaiE/LinA-liked2rfra12rfr0.68995


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 160 Family Scaffolds
PF03602Cons_hypoth95 39.38
PF00271Helicase_C 21.25
PF01467CTP_transf_like 9.38
PF06240COXG 2.50
PF00155Aminotran_1_2 2.50
PF07690MFS_1 1.25
PF17191RecG_wedge 1.25
PF09952AbiEi_2 1.25
PF00830Ribosomal_L28 1.25
PF030614HBT 1.25
PF031712OG-FeII_Oxy 0.62
PF13183Fer4_8 0.62
PF05494MlaC 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 160 Family Scaffolds
COG074216S rRNA G966 N2-methylase RsmDTranslation, ribosomal structure and biogenesis [J] 39.38
COG109223S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmITranslation, ribosomal structure and biogenesis [J] 39.38
COG2242Precorrin-6B methylase 2Coenzyme transport and metabolism [H] 39.38
COG2265tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD familyTranslation, ribosomal structure and biogenesis [J] 39.38
COG2890Methylase of polypeptide chain release factorsTranslation, ribosomal structure and biogenesis [J] 39.38
COG3427Carbon monoxide dehydrogenase subunit CoxGEnergy production and conversion [C] 2.50
COG0227Ribosomal protein L28Translation, ribosomal structure and biogenesis [J] 1.25
COG2854Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter MlaCell wall/membrane/envelope biogenesis [M] 0.62


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms80.62 %
UnclassifiedrootN/A19.38 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300000443|F12B_12388398All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria690Open in IMG/M
3300000550|F24TB_10694497All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria516Open in IMG/M
3300000559|F14TC_101032544All Organisms → cellular organisms → Bacteria879Open in IMG/M
3300002122|C687J26623_10168535Not Available605Open in IMG/M
3300002886|JGI25612J43240_1036771All Organisms → cellular organisms → Bacteria705Open in IMG/M
3300002907|JGI25613J43889_10074397All Organisms → cellular organisms → Bacteria → Proteobacteria915Open in IMG/M
3300003349|JGI26129J50193_1002149All Organisms → cellular organisms → Bacteria1379Open in IMG/M
3300003911|JGI25405J52794_10076125All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria736Open in IMG/M
3300004009|Ga0055437_10034309All Organisms → cellular organisms → Bacteria1282Open in IMG/M
3300004025|Ga0055433_10092842All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria676Open in IMG/M
3300004047|Ga0055499_10049505All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria670Open in IMG/M
3300004114|Ga0062593_100104386All Organisms → cellular organisms → Bacteria2011Open in IMG/M
3300004156|Ga0062589_100020724All Organisms → cellular organisms → Bacteria3098Open in IMG/M
3300005093|Ga0062594_103068850All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria522Open in IMG/M
3300005183|Ga0068993_10023472All Organisms → cellular organisms → Bacteria1593Open in IMG/M
3300005341|Ga0070691_10010200All Organisms → cellular organisms → Bacteria4284Open in IMG/M
3300005458|Ga0070681_10309531All Organisms → cellular organisms → Bacteria1489Open in IMG/M
3300005471|Ga0070698_101831302Not Available560Open in IMG/M
3300005546|Ga0070696_100511808All Organisms → cellular organisms → Bacteria956Open in IMG/M
3300005713|Ga0066905_100005142All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria5494Open in IMG/M
3300005937|Ga0081455_10000208All Organisms → cellular organisms → Bacteria74660Open in IMG/M
3300006176|Ga0070765_100207885All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1779Open in IMG/M
3300006845|Ga0075421_100692265All Organisms → cellular organisms → Bacteria1186Open in IMG/M
3300006847|Ga0075431_100926937All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria840Open in IMG/M
3300006847|Ga0075431_101451819All Organisms → cellular organisms → Bacteria645Open in IMG/M
3300009038|Ga0099829_10000366All Organisms → cellular organisms → Bacteria23568Open in IMG/M
3300009089|Ga0099828_10001160All Organisms → cellular organisms → Bacteria17232Open in IMG/M
3300009098|Ga0105245_10147739All Organisms → cellular organisms → Bacteria2219Open in IMG/M
3300009147|Ga0114129_10097604All Organisms → cellular organisms → Bacteria4067Open in IMG/M
3300009153|Ga0105094_10325721All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria886Open in IMG/M
3300009821|Ga0105064_1140981Not Available515Open in IMG/M
3300010362|Ga0126377_11389112Not Available775Open in IMG/M
3300010371|Ga0134125_11625535All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria703Open in IMG/M
3300010400|Ga0134122_10482210All Organisms → cellular organisms → Bacteria1117Open in IMG/M
3300010401|Ga0134121_10374927All Organisms → cellular organisms → Bacteria1279Open in IMG/M
3300011419|Ga0137446_1015438All Organisms → cellular organisms → Bacteria1509Open in IMG/M
3300011428|Ga0137456_1027395All Organisms → cellular organisms → Bacteria1260Open in IMG/M
3300011429|Ga0137455_1113117All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria800Open in IMG/M
3300012034|Ga0137453_1027526All Organisms → cellular organisms → Bacteria941Open in IMG/M
3300012133|Ga0137329_1008468All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1081Open in IMG/M
3300012174|Ga0137338_1040206All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria971Open in IMG/M
3300012208|Ga0137376_11551133All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria553Open in IMG/M
3300012225|Ga0137434_1050091All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria635Open in IMG/M
3300012355|Ga0137369_10600417Not Available766Open in IMG/M
3300012685|Ga0137397_10032975All Organisms → cellular organisms → Bacteria3694Open in IMG/M
3300012929|Ga0137404_10122551All Organisms → cellular organisms → Bacteria2127Open in IMG/M
3300012929|Ga0137404_11848705Not Available562Open in IMG/M
3300012930|Ga0137407_10226799All Organisms → cellular organisms → Bacteria1686Open in IMG/M
3300014262|Ga0075301_1038397All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria884Open in IMG/M
3300014299|Ga0075303_1038340All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria777Open in IMG/M
3300014318|Ga0075351_1119119All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria592Open in IMG/M
3300014326|Ga0157380_10643571All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1056Open in IMG/M
3300014873|Ga0180066_1000550All Organisms → cellular organisms → Bacteria4016Open in IMG/M
3300014882|Ga0180069_1050456All Organisms → cellular organisms → Bacteria935Open in IMG/M
3300014884|Ga0180104_1025501All Organisms → cellular organisms → Bacteria1491Open in IMG/M
3300014885|Ga0180063_1024181All Organisms → cellular organisms → Bacteria1671Open in IMG/M
3300014885|Ga0180063_1024309All Organisms → cellular organisms → Bacteria1667Open in IMG/M
3300014968|Ga0157379_10282847All Organisms → cellular organisms → Bacteria1510Open in IMG/M
3300015170|Ga0120098_1023306Not Available775Open in IMG/M
3300015251|Ga0180070_1018596Not Available809Open in IMG/M
3300015259|Ga0180085_1014777All Organisms → cellular organisms → Bacteria2141Open in IMG/M
3300015371|Ga0132258_10016370All Organisms → cellular organisms → Bacteria15923Open in IMG/M
3300017939|Ga0187775_10158149All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium812Open in IMG/M
3300017961|Ga0187778_10033533All Organisms → cellular organisms → Bacteria3141Open in IMG/M
3300017973|Ga0187780_11086422Not Available584Open in IMG/M
3300017997|Ga0184610_1022947All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1696Open in IMG/M
3300018053|Ga0184626_10036958All Organisms → cellular organisms → Bacteria2022Open in IMG/M
3300018059|Ga0184615_10173128All Organisms → cellular organisms → Bacteria1217Open in IMG/M
3300018063|Ga0184637_10108547Not Available1709Open in IMG/M
3300018071|Ga0184618_10092885All Organisms → cellular organisms → Bacteria1172Open in IMG/M
3300018074|Ga0184640_10075100All Organisms → cellular organisms → Bacteria1437Open in IMG/M
3300018084|Ga0184629_10062232All Organisms → cellular organisms → Bacteria1738Open in IMG/M
3300018422|Ga0190265_10056882All Organisms → cellular organisms → Bacteria → Terrabacteria group3450Open in IMG/M
3300018422|Ga0190265_10154650All Organisms → cellular organisms → Bacteria2249Open in IMG/M
3300018422|Ga0190265_10345005Not Available1574Open in IMG/M
3300018422|Ga0190265_10905037All Organisms → cellular organisms → Bacteria1005Open in IMG/M
3300018422|Ga0190265_12064197All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria675Open in IMG/M
3300018429|Ga0190272_10239293All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1351Open in IMG/M
3300018429|Ga0190272_12345840Not Available577Open in IMG/M
3300019360|Ga0187894_10254704All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria829Open in IMG/M
3300019458|Ga0187892_10019846All Organisms → cellular organisms → Bacteria6566Open in IMG/M
3300019458|Ga0187892_10139731All Organisms → cellular organisms → Bacteria1374Open in IMG/M
3300019458|Ga0187892_10288918All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria823Open in IMG/M
3300019869|Ga0193705_1104897Not Available511Open in IMG/M
3300019879|Ga0193723_1064234All Organisms → cellular organisms → Bacteria1065Open in IMG/M
3300019881|Ga0193707_1006563All Organisms → cellular organisms → Bacteria4043Open in IMG/M
3300019882|Ga0193713_1010329All Organisms → cellular organisms → Bacteria2833Open in IMG/M
3300019886|Ga0193727_1024724All Organisms → cellular organisms → Bacteria2109Open in IMG/M
3300019886|Ga0193727_1109479All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria804Open in IMG/M
3300019886|Ga0193727_1176898Not Available552Open in IMG/M
3300019889|Ga0193743_1034287All Organisms → cellular organisms → Bacteria2389Open in IMG/M
3300020003|Ga0193739_1004632All Organisms → cellular organisms → Bacteria3708Open in IMG/M
3300020004|Ga0193755_1020578All Organisms → cellular organisms → Bacteria2180Open in IMG/M
3300020060|Ga0193717_1057671All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1345Open in IMG/M
3300020060|Ga0193717_1099823All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300020579|Ga0210407_10739389Not Available761Open in IMG/M
3300020580|Ga0210403_10461747All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1034Open in IMG/M
3300020583|Ga0210401_11245271Not Available602Open in IMG/M
3300021080|Ga0210382_10074912All Organisms → cellular organisms → Bacteria1376Open in IMG/M
3300021090|Ga0210377_10032686All Organisms → cellular organisms → Bacteria3703Open in IMG/M
3300021170|Ga0210400_11359552Not Available567Open in IMG/M
3300021344|Ga0193719_10075141Not Available1467Open in IMG/M
3300021344|Ga0193719_10102280All Organisms → cellular organisms → Bacteria1246Open in IMG/M
3300021420|Ga0210394_10961565Not Available741Open in IMG/M
3300021559|Ga0210409_11064116All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria684Open in IMG/M
3300022756|Ga0222622_10166634All Organisms → cellular organisms → Bacteria1435Open in IMG/M
3300023073|Ga0247744_1037493Not Available759Open in IMG/M
3300025160|Ga0209109_10212415All Organisms → cellular organisms → Bacteria951Open in IMG/M
3300025324|Ga0209640_10005004All Organisms → cellular organisms → Bacteria11593Open in IMG/M
3300025324|Ga0209640_10104040All Organisms → cellular organisms → Bacteria2447Open in IMG/M
3300025558|Ga0210139_1022975All Organisms → cellular organisms → Bacteria1208Open in IMG/M
3300025907|Ga0207645_10034642All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3244Open in IMG/M
3300025907|Ga0207645_10483356All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria838Open in IMG/M
3300025912|Ga0207707_10429068All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes1132Open in IMG/M
3300025933|Ga0207706_10062069All Organisms → cellular organisms → Bacteria3292Open in IMG/M
3300025957|Ga0210089_1000978All Organisms → cellular organisms → Bacteria2306Open in IMG/M
3300026041|Ga0207639_11233137Not Available702Open in IMG/M
3300026285|Ga0209438_1002064All Organisms → cellular organisms → Bacteria6702Open in IMG/M
3300026360|Ga0257173_1023848All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria782Open in IMG/M
3300026469|Ga0257169_1006625All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Nostocales → Tolypothrichaceae → Tolypothrix → Tolypothrix campylonemoides → Tolypothrix campylonemoides VB5112881361Open in IMG/M
3300027378|Ga0209981_1003923All Organisms → cellular organisms → Bacteria1938Open in IMG/M
3300027526|Ga0209968_1020138All Organisms → cellular organisms → Bacteria1077Open in IMG/M
3300027552|Ga0209982_1090021Not Available508Open in IMG/M
(restricted) 3300027799|Ga0233416_10005738All Organisms → cellular organisms → Bacteria3998Open in IMG/M
3300027815|Ga0209726_10048886All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2972Open in IMG/M
3300027846|Ga0209180_10002390All Organisms → cellular organisms → Bacteria9402Open in IMG/M
3300027903|Ga0209488_10886771Not Available626Open in IMG/M
3300028536|Ga0137415_10332693All Organisms → cellular organisms → Bacteria1322Open in IMG/M
3300028715|Ga0307313_10041209All Organisms → cellular organisms → Bacteria1334Open in IMG/M
3300028791|Ga0307290_10022878All Organisms → cellular organisms → Bacteria2189Open in IMG/M
3300028819|Ga0307296_10135919All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1325Open in IMG/M
3300028828|Ga0307312_10936185Not Available574Open in IMG/M
3300028828|Ga0307312_11121825Not Available520Open in IMG/M
3300028878|Ga0307278_10047501All Organisms → cellular organisms → Bacteria1946Open in IMG/M
3300030006|Ga0299907_10091465All Organisms → cellular organisms → Bacteria2486Open in IMG/M
3300030620|Ga0302046_10153284All Organisms → cellular organisms → Bacteria1890Open in IMG/M
3300030620|Ga0302046_10749692Not Available788Open in IMG/M
(restricted) 3300031150|Ga0255311_1072424All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria734Open in IMG/M
3300031152|Ga0307501_10132197Not Available662Open in IMG/M
(restricted) 3300031197|Ga0255310_10050257All Organisms → cellular organisms → Bacteria1089Open in IMG/M
(restricted) 3300031197|Ga0255310_10228590All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium524Open in IMG/M
3300031229|Ga0299913_10254075All Organisms → cellular organisms → Bacteria1741Open in IMG/M
(restricted) 3300031237|Ga0255334_1019191All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria824Open in IMG/M
(restricted) 3300031248|Ga0255312_1005854All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes2929Open in IMG/M
(restricted) 3300031248|Ga0255312_1044826All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1058Open in IMG/M
3300031455|Ga0307505_10382638All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria668Open in IMG/M
3300031720|Ga0307469_10010651All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes4484Open in IMG/M
3300031720|Ga0307469_11348457All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria679Open in IMG/M
3300031720|Ga0307469_12167944Not Available541Open in IMG/M
3300031949|Ga0214473_10014316All Organisms → cellular organisms → Bacteria9291Open in IMG/M
3300031949|Ga0214473_10783803All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1030Open in IMG/M
3300031962|Ga0307479_10022479All Organisms → cellular organisms → Bacteria5967Open in IMG/M
3300032174|Ga0307470_10055613All Organisms → cellular organisms → Bacteria2053Open in IMG/M
3300032770|Ga0335085_10002916All Organisms → cellular organisms → Bacteria30769Open in IMG/M
3300032829|Ga0335070_10129485All Organisms → cellular organisms → Bacteria2597Open in IMG/M
3300033233|Ga0334722_10774844Not Available680Open in IMG/M
3300033480|Ga0316620_10451113All Organisms → cellular organisms → Bacteria1177Open in IMG/M
3300033813|Ga0364928_0010958Not Available1687Open in IMG/M
3300034164|Ga0364940_0036868All Organisms → cellular organisms → Bacteria1288Open in IMG/M
3300034257|Ga0370495_0152932Not Available732Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil20.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil8.75%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil6.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil5.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment4.38%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.12%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands3.75%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil3.75%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.50%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.50%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.88%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.88%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.88%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.88%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands1.88%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland1.88%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.88%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.88%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil1.88%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze1.88%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment1.25%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.25%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere1.25%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.25%
Tabebuia Heterophylla RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere1.25%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.25%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.25%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment0.62%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.62%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.62%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil0.62%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.62%
Untreated Peat SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil0.62%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.62%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil0.62%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand0.62%
FossillEnvironmental → Terrestrial → Soil → Fossil → Unclassified → Fossill0.62%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.62%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.62%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.62%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.62%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.62%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000443Amended soil microbial communities from Kansas Great Prairies, USA - BrdU F1.2B clc assemlyEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000559Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemlyEnvironmentalOpen in IMG/M
3300002122Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2EnvironmentalOpen in IMG/M
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300003349Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PMHost-AssociatedOpen in IMG/M
3300003911Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004025Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004047Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005183Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005937Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1Host-AssociatedOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009153Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 10-12cm March2015EnvironmentalOpen in IMG/M
3300009821Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010371Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011428Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT615_2EnvironmentalOpen in IMG/M
3300011429Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2EnvironmentalOpen in IMG/M
3300012034Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT526_2EnvironmentalOpen in IMG/M
3300012133Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT121_2EnvironmentalOpen in IMG/M
3300012174Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2EnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014262Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1EnvironmentalOpen in IMG/M
3300014299Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D1EnvironmentalOpen in IMG/M
3300014318Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D1_rdEnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014882Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231B'_16_10DEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300014968Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S2-5 metaGHost-AssociatedOpen in IMG/M
3300015170Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48EnvironmentalOpen in IMG/M
3300015251Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT293_16_10DEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017973Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019869Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3m2EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300019889Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c2EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2c2EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300023073Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, United States - UWRJ-S154-409C-5EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025558Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026041Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026360Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-BEnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300027378Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027526Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027552Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 S AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028715Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031237 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_35cm_T3_129EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032829Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.3EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033813Sediment microbial communities from East River floodplain, Colorado, United States - 30_j17EnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M
3300034257Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
F12B_1238839813300000443SoilQNSVEVANARQILEHYPECERPRRFGPCWPLLSKRVQADWARKGRPTAAEYADARAAGEPRYSDFRVMQIRRSPSRVVFVTEATRAAAKDALPDRVEYAILREGDQWKIDGRRVGPSETTP*
F24TB_1069449713300000550SoilVAAIGALALGVGAAPSRVAAQNSVEVAKARQILEHYTECERTRRFGPCWPLLSKRVQADWARKGRPTAAEYADARGAGEPRYSDFRVMQIRRSPSRVVFVTEATRAAAKDALPDRVEYAILREGDQWKIDGRRVGPSETTP*
F14TC_10103254423300000559SoilMARGGLLGLCCALALGVGAAPSRVAAQNSVEVAKARQILEHYTECERTRRFGPCWPLLSKRVQADWARKGRPTAAEYADARGAGEPRYSDFRVMQIRRSPSRVVFVTEATRAAAKDALPDRVEYAILREGDQWKIDGRRVGPSETTP*
C687J26623_1016853513300002122SoilMGARGRWALVVALVLGSVGAXVPAAAQXSVELARARQALEDYFACERTRRFAPCWPRLSKRVHAEWARQGRGSVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFVVEATRASGRNGRPDRVEYAVLREGGQWKVDGRRVGPSETTP*
JGI25612J43240_103677113300002886Grasslands SoilWXLLAGVLALGAAGAPAPAAGQGSVEVARARQALEDYFACERTLRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGAKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
JGI25613J43889_1007439723300002907Grasslands SoilMRGRARWTLLAGVLALGAAGAPAPAAGQGSVEVARARQALEDYFACERTLRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGAKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
JGI26129J50193_100214923300003349Arabidopsis Thaliana RhizosphereMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAVEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP*
JGI25405J52794_1007612513300003911Tabebuia Heterophylla RhizosphereGRGRLMPGRVLVRLCAIALGALAAPPLTVAQNSVEVARARQTLEHYAMCERTRRFAPCWPLLSRRIQAEWARQGRPSAAEYADARGAGEPGYADFRVLQIRRSPSRVVFVTEATRAGGTDAVADRVEYAVLRXGDQWKIDGRRVGASETTP*
Ga0055437_1003430923300004009Natural And Restored WetlandsMASRGRWALVGALVLGAVGAPAPVAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHRLPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0055433_1009284213300004025Natural And Restored WetlandsVAGQSSVEVARARQVLEDYFACECTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHRLPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0055499_1004950513300004047Natural And Restored WetlandsMRGRARWALVGALVLGALGVPAPAPGQSSVELARARQVLEDYFTCERTRRFAPCWPRLSKRAQAEWTRQGRGTVAEYAESRAASEPRYADFRVQQIRRSPSRVVFLVEATRAADKDGVPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0062593_10010438623300004114SoilMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP*
Ga0062589_10002072443300004156SoilMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0062594_10306885023300005093SoilGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0068993_1002347223300005183Natural And Restored WetlandsMASRGRWALVGALVLGAVGAPAPVAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHRLPDRVEDAVLREGEQWRIDGRRVGQSETTP*
Ga0070691_1001020013300005341Corn, Switchgrass And Miscanthus RhizosphereMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKV
Ga0070681_1030953113300005458Corn RhizosphereMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSLAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0070698_10183130213300005471Corn, Switchgrass And Miscanthus RhizosphereMRGLARWALVVGALSLGAGGAPPPAPAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGGQWTIDGRRVGQSETTP*
Ga0070696_10051180823300005546Corn, Switchgrass And Miscanthus RhizosphereMRGRARWALAAGALALGALGAPAPTAGQNSAEVARARQVLEDYFACERTGRFVPCWPRLSKRVQAEWTRQGRGTAAEYAESRAATEPRYADFRVQQIRRSPSRVVFVVEATRAAARDGLPDRVEFAVLREGEQWRIDGRRVGASETTP*
Ga0066905_10000514253300005713Tropical Forest SoilMARGGLLGLCCALALGVGAAPSRVAAQNSVEVAKARQILEHYTECERTRRFGPCWPLLSKRVRADWARRGRPTAAEYADARGAGEPRYSDFRVMQIRRSPSRVVFVTEATRAADQDALPDRVEYAILREGDQWKIDGRRVGPSETTP*
Ga0081455_10000208473300005937Tabebuia Heterophylla RhizosphereMPGRVLVRLCAIALGALAAPPLTVAQNSVEVARARQTLEHYAMCERTRRFAPCWPLLSRRIQAEWARQGRPSAAEYADARGAGEPGYADFRVLQIRRSPSRVVFVTEATRAGGTDAVADRVEYAVLREGDQWKIDGRRVGASETTP*
Ga0070765_10020788523300006176SoilMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAADRDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP*
Ga0075421_10069226513300006845Populus RhizosphereMRAPVRWALVGALVLGAVGPPAPAAGQSSAEVARARQALEDYFVCERTRRFAPCWPRLSKRAQAAWTRQGRGTVGEYAESRGAAEPHYSDFRVLQIRRSPARVVFQVEATRDAARDGLPDRIEFAVLREG
Ga0075431_10092693713300006847Populus RhizospherePALAGQGRPGRGGLMRAPVRWALVGALVLGAVGPPAPAAGQSSAEVARARQALEDYFVCERTRRFAPCWPRLSKRAQAAWTRQGRGTVGEYAESRGAAEPHYSDFRVLQIRRSPARVVFQVEATRDAARDGLPDRIEFAVLREGDQWKIDGRRVGQSETTP*
Ga0075431_10145181913300006847Populus RhizosphereVVGALSLGAGGAPAPAAAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSDYAESRAAAEPRYRDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0099829_10000366213300009038Vadose Zone SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPARVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0099828_10001160133300009089Vadose Zone SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0105245_1014773943300009098Miscanthus RhizosphereLMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0114129_1009760443300009147Populus RhizosphereMRGLARWALVVGALSLGAGGAPAPAAAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSDYAESRAAAEPRYRDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0105094_1032572123300009153Freshwater SedimentGLMRGRARWALVGALVLGAVGAPAPAASQSSVEVARARQALEDYFTCERTRRFAPCWPRLSKRVQAAWTRQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPARVVFQVEATRAADKDGLPDRVESAVLREGDRWKIDGRRVRPSETTP*
Ga0105064_114098113300009821Groundwater SandSVEVARARQALEDYFACERTRRFAPCWPRLSKRVQADWVRQGRGTVSEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATRDVDRHGLPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0126377_1138911213300010362Tropical Forest SoilMARAGLLGLCCALALGVGVTPSRVAAQNSVEVAKARQTFEYYTECERTRRFGPCWPLLSKRVQADWARQGRPTAAEYAEARGAGEPRYADFRVTQIRRSPSRVVFVTEATRAPENAARRDRVEYAILREGDQWKI
Ga0134125_1162553523300010371Terrestrial SoilALGALGAPAPTAAQSSAEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0134122_1048221023300010400Terrestrial SoilMWGRARWALAAGALALGALGAPAPTAGQNSAEVARARQVLEDYFACERTGRFVPCWPRLSKRVQAEWTRQGRGTAAEYAESRAATEPRYADFRVQQIRRSPSRVVFVVEATRAAARDGLPDRVEFAVLREGEQWRIDGRRVGASETTP*
Ga0134121_1037492723300010401Terrestrial SoilMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYGESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137446_101543823300011419SoilMRGRAGWALVGALVLGAVGTPAPGAGQSGVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYGDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137456_102739513300011428SoilMRGRARWALVGALVLGAVGVPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYTDFRVQQIRRSPSRVVFLVEATRAAERDGVPDRVE
Ga0137455_111311723300011429SoilVGALVLGAVGAPAPGAGQNSVEVARARQALEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137453_102752613300012034SoilMRGRARWALVGALVMGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYA
Ga0137329_100846823300012133SoilMRGRARWALVGALVLGALGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYGDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137338_104020623300012174SoilMGAPARWALVGALVLGAVGAPAPAAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137376_1155113313300012208Vadose Zone SoilSSVEVARARQALEDYFACERTRRFVPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0137434_105009123300012225SoilLMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVREDYVACERTRHFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137369_1060041723300012355Vadose Zone SoilMRGPARWAVVGALVLAVGGAPAPAASQNSVELARARQALEDYFTCERTQRFASCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYSDFRVAQIRRSPSRVVFQVEATRDADRRGLPDRVEYAVLREGEQWKIDGRRVGASETTP*
Ga0137397_1003297563300012685Vadose Zone SoilMRGLARWALVVGALSLGAGGAPAPTAAQSSAEVARARQALEDYFVCERTRRFAPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGASETTP*
Ga0137404_1012255123300012929Vadose Zone SoilMRGLARWALVVGALSLGAGGAPPPAPAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGNFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0137404_1184870513300012929Vadose Zone SoilMRGRARWTLLAGVLALGAAGAPAPAAGQGSVEVARARQALEDYFACERTLRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGAKDGLPDRVEYAVLREGEQWKIDGRRVGQSETAP*
Ga0137407_1022679923300012930Vadose Zone SoilMRGLARWALVVGALSLGAGGAPPPAPAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0075301_103839723300014262Natural And Restored WetlandsMASRGRWALVGALVLGAVGAPAPVAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHWLPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0075303_103834023300014299Natural And Restored WetlandsGGLMASRGRWALVGALVLGAVGAPAPVAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHRLPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0075351_111911923300014318Natural And Restored WetlandsGEPAPAPGQSSVELARARQVLEDYFTCERTRRFAPCWPRLSKRAQAEWTRQGRGTVAEYAESRAASEPRYADFRVQQIRRSPSRVVFLVEATRAADKDGVPDRVEYAVLREGEQWRIDGRRVGQSETTP*
Ga0157380_1064357113300014326Switchgrass RhizosphereRGGLMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSLAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0180066_100055043300014873SoilMRGRARWAVVGALVLGTVGVPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYGDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0180069_105045613300014882SoilMRGRARWALVGALVLGALGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRGAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAV
Ga0180104_102550133300014884SoilVGAPAPAAGQSSVEVARARHVLEDYFACERTRRFAPCWPRLSKRVRADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0180063_102418123300014885SoilMGAPARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFALCWPRLSKRVQADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0180063_102430933300014885SoilGLMRGVRWAVVGALVLGALAAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRAAAEPRYADFRVQRIRRSPSRVVFQVEATRAAEKDGVPDRVEYAVLREGEQWRIDGRRVGPSETTP*
Ga0157379_1028284723300014968Switchgrass RhizosphereMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPHYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0120098_102330613300015170FossillMRGPARWALVMGGLVLGAVGAPGPAPSQSSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRGAAEPRYADFRVQQIRRSPSRVVFRVEATRAAERDGVPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0180070_101859623300015251SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSE
Ga0180085_101477723300015259SoilMGAPARWALVGALVLGAVGAPAPAAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVRADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGEQWKIDGRRVGQSETTP*
Ga0132258_10016370143300015371Arabidopsis RhizosphereMVRGGLLGLCCALALGVGVTPSRLAAQSSVEVAKARQTLEYYTECERTRRFGPCWPLLSKRVQADWARQGRPSAAEYAEARGAGEPRYADFRVMQIRRSPSRVVFVTEATRAAAIDAPRDRVEYAILREGDRWKIDGRRVGASETTP*
Ga0187775_1015814923300017939Tropical PeatlandMGRGALVGLGGALALGIIGAPSAVTAQNSVEVARARQVVEDYTACERTRRFVACWALLSKRAQAEWARQGRGTVADYADARSAAEPRYVDFRVMQIRPSPSRIVFVMEATRSGAPDGPRDRVEYAILREGDRWRMDGRRVGQSETTP
Ga0187778_1003353323300017961Tropical PeatlandMRAWPGRGVAGCAGVALVLGLAGAPARGQDSAELARARQVLEAYQACELAGRFAPCWALLSERVQRQWAKQGRGTADDYAFSRGGEPSGFSSFRVMQVRRSPSRVVFVVEAIREADRDGPGIRVEYAVRRERGQWKIDGLRRGQSETTP
Ga0187780_1108642223300017973Tropical PeatlandMRAWAGRGVAGCATVALVLGLAGAPARGQDSAELARARQVLEAYQACELAGRFAPCWALLSERVQRQWAKQGRGTADDYAFSRGGEPSGFSSFRVMQVRRSPSRVVFVVEAIREADRDGPGIRVEYAVRRERGQWKI
Ga0184610_102294723300017997Groundwater SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0184626_1003695823300018053Groundwater SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVMEDYWACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0184615_1017312823300018059Groundwater SedimentMRGRAGWVLVGALVLGAVGTPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0184637_1010854723300018063Groundwater SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYWACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0184618_1009288523300018071Groundwater SedimentMRGRGRWALVGALVLGAVGAPAPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYGDFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0184640_1007510023300018074Groundwater SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAGEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0184629_1006223223300018084Groundwater SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0190265_1005688233300018422SoilMRGRARWALVLGALVLGGAGAPAPAAGQSSVEVARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGSVSEYAESRAAAEPHYADFRVQQIRRSPSRVVFRVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0190265_1015465023300018422SoilMRGPARWALVMGGLVLGAVGAPGPAPSQSSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRGAAEPRYADFRVQQIRRSPSRVVFRVEATRAAERDGVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0190265_1034500513300018422SoilMSGRAGWALAGALVLASLGAPGPGAAQGSAELARARQALEDYFACERTLRFAPCWPRLSTRIHAEWARQGRGSVAEYAESRGAAEPRYADFRVIQIRRSPSRVVFVVEATRAAARDGLPDRVEYAVLREGEQWKVDGRRV
Ga0190265_1090503723300018422SoilMGGRARWVLVGSLVLGAVGAPAPAGGQSSVEVARARQALEDYFTCERTQRFAPCWPRLSRRVHAEWARQGRGSVSEYAESRAAAVPRYTDFRVQQIRRSPSRVVFVVEATRAAEKNGVPDRVEYAVLREAEEWKIDGRRVGASETTP
Ga0190265_1206419723300018422SoilMRVHARWALVGALALGALGAPAPAAGQNSVELARARQTLEDYFVCERTRRFAPCWPRLSKRVQAEWTRQGRGTDAEYAESRAASEPRYADFRVQQIRRSPSRVVFLVEATRAAERDGVPDRVEYAVLREGEQWRVDGRRVGASETTP
Ga0190272_1023929323300018429SoilMRGRARWALVGALVLGAVGAPVPAASQSSVEVARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTAAAYAESRAAAEPRYADFRVQQIRRSPSRVVFQVEATRAAEKDGVPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0190272_1234584023300018429SoilMRGRGRWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQERGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATREAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0187894_1025470423300019360Microbial Mat On RocksMRAPARWALVGALVLGAVGAPAPAAGQSSAEVARARQALEDYFVCERTRRFAPCWPRLSKRAQAEWTRQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPARVVFQVEATRDAARDGLPDRIEFAVLREGDQWKIDGRRVGQSETTP
Ga0187892_1001984663300019458Bio-OozeMRAPARWALVGAFVLGAVGAPAPAAGQSSAEVARARQVLEDYFACERTGRFAPCWPRLSKRVQAKWTREGRGAVGEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAAARDGLPDRVEFAVLREGEQWRIDGRRVGASETTP
Ga0187892_1013973113300019458Bio-OozeMGRPVRRVLVGVLVLGALGAPAPGAAQSSVDLARARQVLEDYFACERTGRFAPCWPRLSKRVQADWARQGRGSVADYVEAKGAAEPRYADFRVLQIRRSPARVVFLVEATRAAERDERPDRIEYAVLREGELWKIDGRRVGQSETTP
Ga0187892_1028891823300019458Bio-OozeGAVGAPAPAAGQSSAEVARARQALEDYFVCERTRRFAPCWPRLSKRAQAEWTRQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPARVVFQVEATRDAARDGLPDRIEFAVLREGDQWKIDGRRVGQSETTP
Ga0193705_110489713300019869SoilMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0193723_106423423300019879SoilMRVRARWALLAAALALGAVGAPAPGAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQVDWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRAGAKDEVPDRVEYAVLREGEQWKID
Ga0193707_100656323300019881SoilMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSDYAESRAAAEPRYSDFRVAQIRRSPSRVVFQVEATRDADRSGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0193713_101032943300019882SoilMRGRARWALLAAALALGAVGAPAPGAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRTAAEPRYADFRVQQIRRSPSRVVFLVEATRAGAKDEVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193727_102472433300019886SoilMRGRARWALLTAALALGAVGAPAPGAGQGSVEVARARQALEDYFVCERTQRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRAAGEPRYADFRVQQIRRSPSRVVFLVEATRAGAKDEVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193727_110947923300019886SoilMRGRGRWALVGALVLGAVGAPAPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSDYAESRAAAEPRYSDFRVAQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193727_117689813300019886SoilALVGALVVGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193743_103428723300019889SoilMRGRARWALVVALVLGAVGAPAPGAGQNSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193739_100463263300020003SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTAGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193755_102057823300020004SoilMRGRARWALLTAALALGAVGAPAPGAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRAAGEPRYADFRVQQIRRSPSRVVFLVEATRAGAKDEVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0193717_105767123300020060SoilMRGRARWALVGALVLGAVGAPGPAASQSSVEVARARQALEDYFTCERTQRFAPCWPRLSRRVQAEWTRQGRGSVSEYAESRAAAEPRYADFRVVQIRRSPSRVVFRVEATRDADRHGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0193717_109982323300020060SoilMRGRARWALVGALVLGAVGAPPPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAERDGVPDRVEYAVLREGDQWKIDGRRVGQSETTP
Ga0210407_1073938913300020579SoilMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAAERDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP
Ga0210403_1046174723300020580SoilAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAAERDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP
Ga0210401_1124527113300020583SoilMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVESIRAAERDRPGSRVEYAVLREGGQWKI
Ga0210382_1007491223300021080Groundwater SedimentMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYGDFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0210377_1003268613300021090Groundwater SedimentQSSVEVARARQVLEDYFACERTRRFAACWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETT
Ga0210400_1135955213300021170SoilRLMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAAERDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP
Ga0193719_1007514123300021344SoilMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAEKDGLPD
Ga0193719_1010228033300021344SoilRVRARWALLAAALALGAVGAPAPGAGQGSVEVARARQALEDYFVCERTQRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAGAKDEVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0210394_1096156523300021420SoilMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAADRDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP
Ga0210409_1106411613300021559SoilRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAAERDRPGSRVEYAVLREGGQWKIDGLRRGQSETTP
Ga0222622_1016663423300022756Groundwater SedimentMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYGDFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0247744_103749323300023073SoilMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAVEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP
Ga0209109_1021241523300025160SoilMGARGRWALVVALVLGSVGAPVPAAAQGSVELARARQALEDYFACERTRRFAPCWPRLSKRVHAEWARQGRGSVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFVVEATRASDRNALPDRVEYAVLREGEQWKVDGRRVGPSETTP
Ga0209640_10005004123300025324SoilMGARGRWALVVALVLGSVGAPVPAAAQGSVELARARQALEDYFACERTRRFAPCWPRLSKRVHAEWARQGRGSVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFVVEATRASGRNGRPDRVEYAVLREGGQWKVDGRRVGPSETTP
Ga0209640_1010404023300025324SoilMGAPARWALVGALVLGAVGAPAPAAGQSSVEVARARQALEDYFACERTRRFAPCWPRLSKRVQADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGAQWKIDGRRVGQSETTP
Ga0210139_102297523300025558Natural And Restored WetlandsMASRGRWALVGALVLGAVGAPAPVAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWERQGRGTVAEYAESRGAAEPRYADFRVLQIRRSPSRVVFLVEATREADRHRLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0207645_1003464233300025907Miscanthus RhizosphereMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0207645_1048335623300025907Miscanthus RhizosphereVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP
Ga0207707_1042906823300025912Corn RhizosphereMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSLAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0207706_1006206923300025933Corn RhizosphereMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP
Ga0210089_100097843300025957Natural And Restored WetlandsMRGRARWALVGALVLGALGEPAPAPGQSSVELARARQVLEDYFTCERTRRFAPCWPRLSKRAQAEWTRQGRGTVAEYAESRAASEPRYADFRVQQIRRSPSRVVFLVEATRAADKDGVPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0207639_1123313723300026041Corn RhizosphereMRGRARWTLLACVLALGAAVAPAPAAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGEKDGLPDRVE
Ga0209438_100206433300026285Grasslands SoilMRGRARWTLLAGVLALGAAGAPAPAAGQGSVEVARARQALEDYFACERTLRFAPCWPRLSRRVHAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFVVEATRAGAKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0257173_102384823300026360SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWVRQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPARVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0257169_100662523300026469SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPARVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0209981_100392313300027378Arabidopsis Thaliana RhizosphereSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAVEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP
Ga0209968_102013823300027526Arabidopsis Thaliana RhizosphereMRNTFSKKQTCLALGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGASETTP
Ga0209982_109002113300027552Arabidopsis Thaliana RhizosphereMVRGGLLGLGCALALGVGGPSAVTAQNSVEVARARQILEHYTECERTRRFGPCWPLLSKRVQADWARQGRPGAAEYADARGAGEPRYVDFRVMQIRRSPSRVVFVTEATRASERDGLPDRVEYAILREGDQWKVDGRRVGA
(restricted) Ga0233416_1000573813300027799SedimentMGPGARRMLAGLLALGAVGGPLPAAAQSSADVARARQTLEEYWTCERTRRFAPCWPLLSRRTRETWARQGRGSVTEYAEARGAAERAYSDFRVLHIRRSPSRVVFVVEATRAAAGNGVPDRVEYAVLREGDQWKVDGRRVGQSETTP
Ga0209726_1004888633300027815GroundwaterMRGRARWALVGALVLGAVGTPVSGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRAQADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPARVVFLVEATRDADRRGLPDRVEYAVLREGEQWRIDGRRVGQSETAP
Ga0209180_10002390123300027846Vadose Zone SoilRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPARVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0209488_1088677113300027903Vadose Zone SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPARVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSET
Ga0137415_1033269333300028536Vadose Zone SoilTAAQSSAEVARARQALEDYFACERTQRFAPCWARLSKRVHAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0307313_1004120923300028715SoilMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVL
Ga0307290_1002287823300028791SoilMRGRGRWALVGALVLGAVGTPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0307296_1013591933300028819SoilMRGRGRWALVGALVLGAVGAPVPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0307312_1093618513300028828SoilMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARVRQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVGEYAESRAAAEPRYSDFRVLQIRRSPSRVVFLVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0307312_1112182513300028828SoilMRGRGRWALVGALVLGAVGAPAPAASQNSVELARARQALEDYFTCERTQRFAPCWARLSKRVQAEWARQGRGTVSDYAESRAAAEPRYSDFRVAQIRRSPSRVVFQVEATRDADRRGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0307278_1004750123300028878SoilMRGLARWALVVGALSLGAGGAPAPAAAQSSVEVARARQALEDYFACERTRRFAPCWARLSKRVHAEWARQGRGTVSDYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0299907_1009146543300030006SoilMRARWALVGALVLGAVGAPAPTAGQSSVEVARARQALEDYFVCERTRRFAPCWPRLSTRVRAEWARQGRGTVSEYAESRAATEPHYADFRVVRIRRSPARMVFVVEATRAAARDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0302046_1015328423300030620SoilMRARWALVGALVLGAVGAPAPTAGQSSVEVARARQALEDYFVCERTRRFAPCWPRLSTRVRAEWARQGRGTVSEYAESRAATEPHYGDFRVVRIRRSPARMVFVVEATRAAARDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0302046_1074969223300030620SoilMGAPARWALVGALVLGAVGAPAPAAGQSSVEVARARQALEDYFACERTRRFAPCWPRLSKRVQADWARQGRGSVAEYAESRAVAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGAQWKIDGRRVGQSETTP
(restricted) Ga0255311_107242423300031150Sandy SoilMRGTRWAVVGVLALSAIGAPPPAASQNSVELARARQALEDYFTCERRQRFAPCWPRLSKRVQAEWARQGRGTVTEYAESRAAAEPRYADFRVMQIRRSPARVVFVVEATRGSARDGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0307501_1013219723300031152SoilMRGRGRWALVGALVLGAVGAPAPAASQNSVELARARQALEDYFTCERTQRFALCWPRLSKRVQAEWARQGRGTVSEYAESRAAAEPRYGDFRVQQIRRSPSRVVFLVEATRAAEKDGLPDRVEYAVLRE
(restricted) Ga0255310_1005025713300031197Sandy SoilMPLRARWVLVGALALSAIGAPVPAVSQNSVELARARQALEDYFTCERTQRFAACWPRLSKRVQAEWTRQGRGTVSEYAESRAAAEPRYADFRVMQIRRSPSRVVFQVEAARAAEKGGLPDRVEYAVLREGEQWKID
(restricted) Ga0255310_1022859013300031197Sandy SoilWAVVGVLALSAIGAPPPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFQVEATRAAEKAGVPDRVEYAVLREGGQWKIDGRRVGPSETTP
Ga0299913_1025407523300031229SoilMRARWALVGALVLGAVGAPAPTAGQSSVEVARARQALEDYFVCERTRRFAPCWPRLSTRVRAEWARQGRGTVSEYSESRAATEPHYGDFRVVRIRRSPARIVFVVEATRAAARDGLPDRVEYAVLREGEQWRIDGRRVGQSETTP
(restricted) Ga0255334_101919113300031237Sandy SoilWGGLMRGARWAVVGVLALSAIGAPPPAASQNSVELARARQALEDYFTCERRQRFAPCWPRLSKRVQAEWARQGRGTVTEYAESRAAAEPRYADFRVMQIRRSPARVVFVVEATRGSARDGLPDRVEYAVLREGEQWKIDGRRVGASETTP
(restricted) Ga0255312_100585423300031248Sandy SoilMRGTRWAVVGVLALSAIGAPPPAASQNSVELARARQALEDYFTCERRQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFQVEATRAAEKAGVPDRVEYAVLREGGQWKIDGRRVGPSETTP
(restricted) Ga0255312_104482623300031248Sandy SoilPGGGGLMPLRARWVLVGALALSAIGAPGPAVSQNSVELARARQALEDYFTCERTQRFAACWPRLSKRVQAEWTRQGRGTVSEYAESRAAAEPRYADFRVMQIRRSPSRVVFQVEAARAAEKGGLPDRVEYAVLREGEQWKIDGRRVGASETTP
Ga0307505_1038263813300031455SoilGRGGLRRGRALGALRAGALALGAVGAPAPGAGQGSVEVARARQALEDYFACERTQRFAPCWPRLSKRVQAEWTRQGRGTVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFRVEATRAGAKDEVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0307469_1001065143300031720Hardwood Forest SoilMRGRARWALLAAALALGAVGAPAPGAGQGSVEVARARQALEDYFACERTRRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRAASEPRYADFRVQQIRRSPSRVVFLVEATRSGAKDGVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0307469_1134845713300031720Hardwood Forest SoilLMRPRGRRRGAIGWAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAADRDRPGSRVEYAVLREGGRWKIDGLRRGQSETTP
Ga0307469_1216794423300031720Hardwood Forest SoilMRGHARWALVGALALGALGAPAPAVGQSSVEVARARQTLEDYFACERARRFAPCWSRLSKRVQAEWARQGRGTVAEYAESRAAGEPHYSDFRVQQIRRSPSRVVFLVEATRAAEKDGVPDRVEYAVLREGEQWRIDGRRVG
Ga0214473_1001431693300031949SoilMSRGPWWALVGALVLGTSGAPGPATAQSSAELARARQVLEDYFTCERTRRFAPCWPRLSKRVQADWAQQGRGTVSEYAEARGADEPRYVDHRVVRIRRSPSRVVFVVEATRSAEKEGLPDRVEYAILREGDQWRIDGRRVGPSDTTP
Ga0214473_1078380313300031949SoilLAGQARPGRGGLMGTPARWALVGALVLGAVGAPAPAAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVRADWARQGRGSVAEYAESRAAAEPRYGDFRVLQIRRSPSRVVFLVEATREADRQGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0307479_1002247933300031962Hardwood Forest SoilMRPRGRRRGAIGCAGVALVLGMSGAPARAQDSAEVARARQVLEAYQSCELTRRFAPCWPLLSDRVQRDWARQGRGTVAEYALAKGAEESGVSSFRVMQIRRSPSRVVFVVEAIRAADRDRPGSRVEYAVLREGGRWKIDGLRRGQSETTP
Ga0307470_1005561333300032174Hardwood Forest SoilMRGHARWALVGALALGALGAPAPAVGQSSVEVARARQTLEDYFACERARRFAPCWSRLSKRVQAEWARQGRGTVAEYAESRAAGEPHYSDFRVQQIRRSPSRVVFLVEATRAAEKDGVPDRVEYAVLREGEQWRIDGRRVGQSETTP
Ga0335085_10002916223300032770SoilLTSRRRLLALCGALLLGATAGPRSAPAQNSVEVAKARQVLEFYTTCERTRRFGPCWALLSKRVQAEWARQGRATATEYAESRGAGEPRYSDFRVKQIRRSPSRVVFVTEATRAAEKGGLPDRVEYAILREGDQWRIDGRRVGPSETTP
Ga0335070_1012948543300032829SoilTSRRRLLALWGALLLGATAGPRSAPAQNSVEVAKARQVLEFYTTCERTRRFGPCWALLSKRVQAEWARQGRATATEYAESRGAGEPRYSDFRVKQIRRSPSRVVFVTEATRAAEKGGLPDRVEYAILREGDQWRIDGRRVGPSETTP
Ga0334722_1077484413300033233SedimentMPIRAQWVLVGALALSAIGAPVPAVSQNSVELARARQALEDYFTCERTQRFAACWPRLSKRVQAEWTRQGRGTVSEYAESRAAAEPRYADFRVMQIRRSPSRVVFLVEATRAAEKDGVPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0316620_1045111333300033480SoilTRWAVVGVLALSAIGAPPPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGTVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFQVEATRAAEKAGVPDRVEYAVLREGGQWKIDGRRVGPSETTP
Ga0364928_0010958_36_4793300033813SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVAEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPDRVEYAVLREGEQWKIDGRRVGQSETTP
Ga0364940_0036868_1_3603300034164SedimentMRGRARWALVGALVLGAVGAPAPGAGQSSVEVARARQVLEDYFACERTRRFAPCWPRLSKRVQADWARQGRGTVAEYAESRAAAEPRYSDFRVLQIRRSPSRVVFRVEATRDAEKDGLPD
Ga0370495_0152932_26_4663300034257Untreated Peat SoilMRGARWALVGALVLGAMGAPPPAASQNSVELARARQALEDYFTCERTQRFAPCWPRLSKRVQAEWARQGRGSVAEYAESRAAAEPRYADFRVQQIRRSPSRVVFLVEATRAAEKDGVPDRVEYAVLREGEQWRIDGRRVGQSETTP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.