NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F079827

Overview

Basic Information
Family ID F079827
Family Type Metagenome
Number of Sequences 115
Average Sequence Length 101 residues
Representative Sequence VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRH
Number of Associated Samples 108
Number of Associated Scaffolds 115

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 73.04 %
% of genes near scaffold ends (potentially truncated) 98.26 %
% of genes from short scaffolds (< 2000 bps) 94.78 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.38
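
The percentages in this block can be turned back into gene counts. The sketch below assumes each percentage is taken over the 115 sequences of the family; the page does not state the denominators, so that is an assumption, and `percent_to_count` is a hypothetical helper name.

```python
def percent_to_count(percent: float, total: int) -> int:
    """Recover the whole-number count that a rounded percentage most likely represents."""
    return round(percent / 100 * total)


# "Number of Sequences" reported for family F079827.
FAMILY_SIZE = 115

# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motifs": 73.04,
    "near scaffold ends": 98.26,
    "short scaffolds (< 2000 bps)": 94.78,
}

# Assumption: every percentage uses FAMILY_SIZE as its denominator.
counts = {label: percent_to_count(p, FAMILY_SIZE) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption, nearly every gene in the family lies near a scaffold end, which is worth keeping in mind when reading the reported average sequence length.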


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (19.130 % of family members)
Environment Ontology (ENVO) Unclassified (26.957 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (46.087 % of family members)



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 42.98%, β-sheet 0.00%, Coil/Unstructured 57.02%

Predicted 3D Structure

pTM-score: 0.38

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 115 Family Scaffolds
PF00550 | PP-binding | 89.57
PF00355 | Rieske | 5.22
PF08028 | Acyl-CoA_dh_2 | 0.87
PF02780 | Transketolase_C | 0.87

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 115 Family Scaffolds
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.87



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000364|INPhiseqgaiiFebDRAFT_105881840 | All Organisms → cellular organisms → Bacteria | 1147
3300000597|AF_2010_repII_A1DRAFT_10143972 | All Organisms → cellular organisms → Bacteria | 575
3300001661|JGI12053J15887_10332454 | All Organisms → cellular organisms → Bacteria | 738
3300002562|JGI25382J37095_10216213 | All Organisms → cellular organisms → Bacteria | 578
3300002912|JGI25386J43895_10181689 | All Organisms → cellular organisms → Bacteria | 528
3300004268|Ga0066398_10052893 | All Organisms → cellular organisms → Bacteria | 827
3300005166|Ga0066674_10059405 | All Organisms → cellular organisms → Bacteria | 1739
3300005177|Ga0066690_10146771 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1548
3300005177|Ga0066690_10637187 | All Organisms → cellular organisms → Bacteria | 710
3300005186|Ga0066676_10983430 | All Organisms → cellular organisms → Bacteria | 562
3300005332|Ga0066388_106672596 | All Organisms → cellular organisms → Bacteria | 581
3300005336|Ga0070680_101102404 | All Organisms → cellular organisms → Bacteria | 686
3300005446|Ga0066686_10128669 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1658
3300005560|Ga0066670_10375812 | All Organisms → cellular organisms → Bacteria | 868
3300005574|Ga0066694_10383294 | All Organisms → cellular organisms → Bacteria | 663
3300005713|Ga0066905_101137455 | All Organisms → cellular organisms → Bacteria | 695
3300005937|Ga0081455_10505978 | All Organisms → cellular organisms → Bacteria | 810
3300006032|Ga0066696_10239839 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1169
3300006034|Ga0066656_10909980 | All Organisms → cellular organisms → Bacteria | 563
3300006800|Ga0066660_10020931 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3828
3300006847|Ga0075431_100150567 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2396
3300006847|Ga0075431_100790242 | All Organisms → cellular organisms → Bacteria | 923
3300006854|Ga0075425_100872516 | All Organisms → cellular organisms → Bacteria | 1029
3300006904|Ga0075424_101844603 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 639
3300006914|Ga0075436_100650020 | All Organisms → cellular organisms → Bacteria | 779
3300006914|Ga0075436_100853142 | All Organisms → cellular organisms → Bacteria | 680
3300007255|Ga0099791_10108139 | All Organisms → cellular organisms → Bacteria | 1283
3300009100|Ga0075418_11447048 | All Organisms → cellular organisms → Bacteria | 746
3300009792|Ga0126374_10796562 | All Organisms → cellular organisms → Bacteria | 722
3300009811|Ga0105084_1012221 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1340
3300009812|Ga0105067_1103337 | All Organisms → cellular organisms → Bacteria | 520
3300009813|Ga0105057_1017216 | All Organisms → cellular organisms → Bacteria | 1032
3300009817|Ga0105062_1104946 | All Organisms → cellular organisms → Bacteria | 562
3300009819|Ga0105087_1045039 | All Organisms → cellular organisms → Bacteria | 708
3300009822|Ga0105066_1028104 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1133
3300010303|Ga0134082_10114063 | All Organisms → cellular organisms → Bacteria | 1076
3300010336|Ga0134071_10140324 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1171
3300010358|Ga0126370_11632978 | All Organisms → cellular organisms → Bacteria | 618
3300010362|Ga0126377_12934266 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 550
3300010398|Ga0126383_10407410 | All Organisms → cellular organisms → Bacteria | 1398
3300012199|Ga0137383_10807104 | All Organisms → cellular organisms → Bacteria | 685
3300012206|Ga0137380_11112344 | All Organisms → cellular organisms → Bacteria | 673
3300012917|Ga0137395_10309626 | All Organisms → cellular organisms → Bacteria | 1120
3300012918|Ga0137396_11122594 | All Organisms → cellular organisms → Bacteria | 560
3300012922|Ga0137394_10698870 | All Organisms → cellular organisms → Bacteria | 854
3300012925|Ga0137419_11222483 | All Organisms → cellular organisms → Bacteria | 630
3300012929|Ga0137404_11776361 | All Organisms → cellular organisms → Bacteria | 573
3300012930|Ga0137407_10918972 | All Organisms → cellular organisms → Bacteria | 828
3300012930|Ga0137407_11447217 | All Organisms → cellular organisms → Bacteria | 653
3300012971|Ga0126369_10182192 | All Organisms → cellular organisms → Bacteria | 2008
3300012986|Ga0164304_10200001 | All Organisms → cellular organisms → Bacteria | 1304
3300013297|Ga0157378_10475816 | All Organisms → cellular organisms → Bacteria | 1244
3300015170|Ga0120098_1004356 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1331
3300015241|Ga0137418_10864699 | All Organisms → cellular organisms → Bacteria | 669
3300015373|Ga0132257_100885946 | All Organisms → cellular organisms → Bacteria | 1118
3300016341|Ga0182035_10871870 | All Organisms → cellular organisms → Bacteria | 793
3300016371|Ga0182034_11400889 | All Organisms → cellular organisms → Bacteria | 611
3300016387|Ga0182040_10780779 | All Organisms → cellular organisms → Bacteria | 786
3300016445|Ga0182038_10425225 | All Organisms → cellular organisms → Bacteria | 1119
3300017654|Ga0134069_1339693 | All Organisms → cellular organisms → Bacteria | 539
3300017939|Ga0187775_10106702 | All Organisms → cellular organisms → Bacteria | 948
3300017947|Ga0187785_10200320 | All Organisms → cellular organisms → Bacteria | 869
3300018082|Ga0184639_10563495 | All Organisms → cellular organisms → Bacteria | 564
3300018089|Ga0187774_10042773 | All Organisms → cellular organisms → Bacteria | 1959
3300018431|Ga0066655_10011649 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3837
3300018468|Ga0066662_12881955 | All Organisms → cellular organisms → Bacteria | 511
3300024241|Ga0233392_1037307 | All Organisms → cellular organisms → Bacteria | 535
3300025899|Ga0207642_10279349 | All Organisms → cellular organisms → Bacteria | 960
3300025910|Ga0207684_10654069 | All Organisms → cellular organisms → Bacteria | 895
3300025923|Ga0207681_10599514 | All Organisms → cellular organisms → Bacteria | 910
3300025972|Ga0207668_12101738 | All Organisms → cellular organisms → Bacteria | 508
3300026118|Ga0207675_102611386 | All Organisms → cellular organisms → Bacteria | 515
3300026277|Ga0209350_1083836 | All Organisms → cellular organisms → Bacteria | 844
3300026310|Ga0209239_1357946 | All Organisms → cellular organisms → Bacteria | 511
3300026313|Ga0209761_1298834 | All Organisms → cellular organisms → Bacteria | 562
3300026317|Ga0209154_1294669 | All Organisms → cellular organisms → Bacteria | 536
3300026324|Ga0209470_1115227 | All Organisms → cellular organisms → Bacteria | 1196
3300026324|Ga0209470_1345575 | All Organisms → cellular organisms → Bacteria | 535
3300026333|Ga0209158_1059239 | All Organisms → cellular organisms → Bacteria | 1539
3300026334|Ga0209377_1328710 | All Organisms → cellular organisms → Bacteria | 513
3300026335|Ga0209804_1071330 | All Organisms → cellular organisms → Bacteria | 1645
3300026529|Ga0209806_1110779 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1143
3300026529|Ga0209806_1147731 | All Organisms → cellular organisms → Bacteria | 901
3300026536|Ga0209058_1347252 | All Organisms → cellular organisms → Bacteria | 513
3300026540|Ga0209376_1020565 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4400
3300026550|Ga0209474_10124377 | All Organisms → cellular organisms → Bacteria | 1706
3300027032|Ga0209877_1022553 | All Organisms → cellular organisms → Bacteria | 609
3300027654|Ga0209799_1014755 | All Organisms → cellular organisms → Bacteria | 1693
3300027949|Ga0209860_1043669 | All Organisms → cellular organisms → Bacteria | 595
3300027961|Ga0209853_1170552 | All Organisms → cellular organisms → Bacteria | 517
3300028792|Ga0307504_10048491 | All Organisms → cellular organisms → Bacteria | 1204
3300031199|Ga0307495_10084802 | All Organisms → cellular organisms → Bacteria | 723
(restricted) 3300031248|Ga0255312_1057813 | All Organisms → cellular organisms → Bacteria | 930
3300031562|Ga0310886_10330692 | All Organisms → cellular organisms → Bacteria | 880
3300031681|Ga0318572_10149225 | All Organisms → cellular organisms → Bacteria | 1349
3300031682|Ga0318560_10067591 | All Organisms → cellular organisms → Bacteria | 1797
3300031740|Ga0307468_100489361 | All Organisms → cellular organisms → Bacteria | 971
3300031740|Ga0307468_101618361 | All Organisms → cellular organisms → Bacteria | 606
3300031748|Ga0318492_10285205 | All Organisms → cellular organisms → Bacteria | 858
3300031782|Ga0318552_10038035 | All Organisms → cellular organisms → Bacteria | 2239
3300031798|Ga0318523_10166328 | All Organisms → cellular organisms → Bacteria | 1099
3300031819|Ga0318568_10187050 | All Organisms → cellular organisms → Bacteria | 1275
3300031831|Ga0318564_10150837 | All Organisms → cellular organisms → Bacteria | 1036
3300031835|Ga0318517_10334053 | All Organisms → cellular organisms → Bacteria | 685
3300031940|Ga0310901_10499029 | All Organisms → cellular organisms → Bacteria | 545
3300032010|Ga0318569_10109788 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1252
3300032052|Ga0318506_10464666 | All Organisms → cellular organisms → Bacteria | 561
3300032064|Ga0318510_10497180 | All Organisms → cellular organisms → Bacteria | 528
3300032065|Ga0318513_10685097 | All Organisms → cellular organisms → Bacteria | 502
3300032068|Ga0318553_10202642 | All Organisms → cellular organisms → Bacteria | 1034
3300032075|Ga0310890_11823542 | All Organisms → cellular organisms → Bacteria | 506
3300032122|Ga0310895_10328210 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 731
3300032180|Ga0307471_100411024 | All Organisms → cellular organisms → Bacteria | 1483
3300032205|Ga0307472_102216469 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 554
3300034817|Ga0373948_0130924 | All Organisms → cellular organisms → Bacteria | 613
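
Each scaffold identifier in the table above appears to join two parts with a `|` character: a leading number that matches a Taxon OID in the Associated Samples table, and a scaffold name. That reading is inferred from this page rather than taken from a documented format. Under that assumption, the following sketch splits a few identifiers copied from the table and counts scaffolds per sample; `split_scaffold_id` is a hypothetical helper name.

```python
from collections import Counter


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split 'TAXON_OID|SCAFFOLD_NAME' at the first '|' (assumed layout)."""
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# A handful of identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300005177|Ga0066690_10146771",
    "3300005177|Ga0066690_10637187",
    "3300006847|Ga0075431_100150567",
    "3300006847|Ga0075431_100790242",
    "3300006854|Ga0075425_100872516",
]

# Count how many family scaffolds come from each sample (Taxon OID).
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in scaffolds_per_sample.items():
    print(taxon_oid, n)
```

Samples that contribute more than one scaffold would explain why the family lists 115 scaffolds but only 108 associated samples.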

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 19.13%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 11.30%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 9.57%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 7.83%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.09%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 6.09%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.35%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.48%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.48%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 3.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.61%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.61%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.61%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.74%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.87%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 0.87%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.87%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.87%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.87%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.87%
Fossill | Environmental → Terrestrial → Soil → Fossil → Unclassified → Fossill | 0.87%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 0.87%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.87%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.87%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.87%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.87%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.87%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.87%
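
The habitat paths above are listed at their most specific level. A coarser view can be obtained by summing the distribution values over a shared path prefix. The sketch below does this for a small subset of rows copied from the table, so the totals cover only those rows and not the whole family; `aggregate_by_level` is a hypothetical helper, and treating " → " as the level separator is an assumption based on how the paths are printed here.

```python
from collections import defaultdict

SEPARATOR = " → "  # assumed separator between levels of a habitat path


def aggregate_by_level(rows, level):
    """Sum percentages over the first `level` components of each habitat path."""
    totals = defaultdict(float)
    for path, percent in rows:
        prefix = SEPARATOR.join(path.split(SEPARATOR)[:level])
        totals[prefix] += percent
    return dict(totals)


# Subset of (taxonomy path, distribution %) pairs copied from the table.
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 19.13),
    ("Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand", 7.83),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment", 0.87),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 6.09),
    ("Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere", 0.87),
]

by_ecosystem = aggregate_by_level(rows, 1)  # e.g. "Environmental"
by_category = aggregate_by_level(rows, 2)   # e.g. "Environmental → Terrestrial"

for name, total in by_ecosystem.items():
    print(f"{name}: {total:.2f}%")
```

Because the published values are rounded to two decimals, sums over the full table can differ from 100% by a few hundredths.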




Associated Samples


Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000597 | Forest soil microbial communities from Amazon forest - 2010 replicate II A1 | Environmental
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300004268 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119 | Environmental
3300005574 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009811 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 | Environmental
3300009812 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 | Environmental
3300009813 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_10_20 | Environmental
3300009817 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 | Environmental
3300009819 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 | Environmental
3300009822 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 | Environmental
3300010303 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015 | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012986 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300015170 | Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48 | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017939 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MG | Environmental
3300017947 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_4_20_MG | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300018089 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300024241 | Subsurface microbial communities from Mancos shale, Colorado, United States - Mancos A_50_July_PB | Environmental
3300025899 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M2-2 (SPAdes) | Host-Associated
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025923 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes) | Host-Associated
3300025972 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes) | Host-Associated
3300026118 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes) | Host-Associated
3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental
3300026310 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026324 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026536 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300026550 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes) | Environmental
3300027032 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_0_30 (SPAdes) | Environmental
3300027654 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio (SPAdes) | Environmental
3300027949 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_40_50 (SPAdes) | Environmental
3300027961 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes) | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300031199 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031562 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3 | Environmental
3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental
3300031682 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031748 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f22 | Environmental
3300031782 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f20 | Environmental
3300031798 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f19 | Environmental
3300031819 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f21 | Environmental
3300031831 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f20 | Environmental
3300031835 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f21 | Environmental
3300031940 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D2 | Environmental
3300032010 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22 | Environmental
3300032052 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19 | Environmental
3300032064 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f17 | Environmental
3300032065 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f20 | Environmental
3300032068 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f21 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300032122 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D4 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300034817 | Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1 | Host-Associated


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1058818401 | 3300000364 | Soil | MQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWAAREREALVRPPVRHGRISSPEALHAELETELNTAIDPRRESPLRFA
AF_2010_repII_A1DRAFT_101439722 | 3300000597 | Forest Soil | MQALWRREGVSNNTLMVVQCDGPLDPARISRSLDRFLDYCPWPSARLRRPFPWGQLHWAAGARDTLTAPPVRHQRLGSPEALHGEL
JGI12053J15887_103324542 | 3300001661 | Forest Soil | MQALWKREGVSNNTLMVVQCDGPLDPARTARALERFLDFCPWPAARLNRPFPWGQLHWAARARSALAAPPVRHQRLSAPEALHAELEAELNRAIDP
JGI25382J37095_102162132 | 3300002562 | Grasslands Soil | MQALWKHAGVSNNTLMVVQCDGPLDPQRIRRSLDRFLDVCPWPAARLRRPFPWGKLHWAARSRAALVGPAVRHRRLRAPEALQAELEAELNAAID
JGI25386J43895_101816891 | 3300002912 | Grasslands Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPP
Ga0066398_100528931 | 3300004268 | Tropical Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWRHAGVSNNTLMVVQCDGLLEPERIRRALDRFLDWCPWLSARLRRRRPWGGLYWVAGARAAL
Ga0066674_100594051 | 3300005166 | Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARIRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRRAV
Ga0066690_101467714 | 3300005177 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARAR
Ga0066690_106371871 | 3300005177 | Soil | MRAGWPFRRSGTTPARRLPLRGADLALIAMQSLWTRASVSNNTLLVVQCDAPIAPERIRRALDRFLGSCPWPAARLRRPFPWGKLHWAAGPP
Ga0066676_109834301 | 3300005186 | Soil | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWAAHEREALVRPPVRHGRLSSPE
Ga0066388_1066725962 | 3300005332 | Tropical Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWRHAGVSNNTLMVVQCDGLLEPERIRRALDRFLDWCPWLSARLRRRRPWGGLYWVAGARAALAAPPLRHQRLDSPAALHAELEAELNRAIDARREP
Ga0070680_1011024042 | 3300005336 | Corn Rhizosphere | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAVERFLDVCPWPAARLSRPFPWGALHWAAHEREALVRPPVRHRRISSPEALHAELETELNTAIDP
Ga0066686_101286694 | 3300005446 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPLVRHRRLSAPEALHVELEAELNTAIDP
Ga0066670_103758123 | 3300005560 | Soil | MQALWKSEGISNNTLMVVHCDGPLDHARIGRALDRFLDFCPWPSARLRRRLPWGPLHRAAEARAALAAPPVRHKRLSSPGALHAELEAELNAAIDPRREPPLRF
Ga0066694_103832941 | 3300005574 | Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVACDGPLDPARVRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRHVV
Ga0066905_1011374551 | 3300005713 | Tropical Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWKHVGVSNNTLMAVQCDGPLDASRIARALERFLDFCPWPAARLRRPFPWGALHWAARSRAALAVPP
Ga0081455_105059781 | 3300005937 | Tabebuia Heterophylla Rhizosphere | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARLRRAFDRFLLVCPWPVARLRRPFPWGKLSWAAGRDAAPESLPLLHLRVASAER
Ga0066696_102398393 | 3300006032 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDPARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARGPLAAPLVRHRHLSAPEALHVELEA
Ga0066656_109099802 | 3300006034 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRHRSLSAPE
Ga0066660_100209311 | 3300006800 | Soil | MQALWKREGVSNNTLMIVQCDGPLDPARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARGPLAAPLVRHRHLSAPEALHVELEAELNTAIDPRREPPLHFSILDSVSKATGPQSAL
Ga0075431_1001505671 | 3300006847 | Populus Rhizosphere | VSTTRSRRLPLRGADLVLVAMQTLWKHARVSNNTLMVVQCDGPLDPARIARALERFLDVCPWPAARLRRPFPWGALHWAAHSRDALIAPPVRHRRLPAPEALHAELEAELNLAIDPR
Ga0075431_1007902421 | 3300006847 | Populus Rhizosphere | VSTPRVRRLPLRGADLVLVAMQALWKRAGVSNNTVMVVQCDGPLDSSRVARALGRFLHFCPWPAARLRRPFPWGALHWAARSRATLTVPPVRHRCLGGPEALHAALEA
Ga0075425_1008725163 | 3300006854 | Populus Rhizosphere | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRATERFLDVCPWPAARLSRPFPWGALHWAAHEREALASPPVRHRQ
Ga0075424_1018446032 | 3300006904 | Populus Rhizosphere | VSTRRIRRLPLRGADLVLVAMQALWRREGVSNNTLMVVHCDGPLEPERIRRALDRFIDFCPWPAARLRRRRPWGGLHWVARARAALTAPPVRHQRLASPVALHAELEAELNRAI
Ga0075436_1006500203 | 3300006914 | Populus Rhizosphere | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRATERFLDVCPWPAARLSRPFPWGALHWAAHEREALASSPVRHRQLSSPEALHAELETELN
Ga0075436_1008531421 | 3300006914 | Populus Rhizosphere | VSTTPVRRLPLRGADLVLVAMQALWNREGVSNNTLMIVQCDGPLEPARVARALERFLGVCPWPAARLRRPFPWGSLHWAARARAPLAASLVRHRHLSAPEALHVELEAELNTAIDPRREP
Ga0099791_101081394 | 3300007255 | Vadose Zone Soil | VSTPRVRRLPLRGADLVLVAMQALWKREGVSNNTLMAVQCDGLLDAERIRRALHRLLDYCPWPAARLRRRLPWGQLHWAAGA
Ga0075418_114470481 | 3300009100 | Populus Rhizosphere | VSTTRSCRLPLRGADLVLVAMQTLWKHARVSNNTLMVVQCDGPLDPARIARALERFLDVCPWPAARLRRPFPWGALHWAAHSRDALIAPPV
Ga0126374_107965622 | 3300009792 | Tropical Forest Soil | MQALWRREGVSNNTLMVVQCDGPLDPARISRSLDRFLDYCPWPSARLRRPFPWGQLHWAAGARDTLTAPPVRHQRLGSPEALHGELESEL
Ga0105084_10122214 | 3300009811 | Groundwater Sand | MRRLPLRGADLVLVAMQSLWKRFGVSNNTVMVVQCDGPIDPARLARALDRFLDFCPWPAARLGRPFPWGKL
Ga0105067_11033371 | 3300009812 | Groundwater Sand | VSTSRPRRLPLRGADLVLVAMQALWKRAGVSNNTIMVVQCDGPIDPGRIARALDRFLDFCPWPAARLRRPFPWGKLAWAARARAALTAPPLRQRRVDSHDELHVELEAELNRVIDPRREPPL
Ga0105057_10172163 | 3300009813 | Groundwater Sand | MRRLPLRGADLVLVAMQSLWKRFEVSNNTVMVVQCDGLIDPARLARALDRFLDFCPWPAARLGRPFPWGKLGWAAGPRAALTALPVRRRRIGSPAELHVELEAELNRA
Ga0105062_11049461 | 3300009817 | Groundwater Sand | MRRLPLRGADLVLVAMQSLWKRFGVSNNTVMVVQCDGPIDPARLARALDRFLDFCPWPAARLRRPFPWGKLGWAAGPRAALTA
Ga0105087_10450392 | 3300009819 | Groundwater Sand | MQALWKRAGVSNNTIMVVQCDGPIDPGRIARALDRFLDFCPWPAARLRRPFPWGKLAWAARPRAALTTPPLRQRRVDSHDELHVELEAELNRAIDPRREPPLRFAVLDHATEAQSFLVATWFHPLM
Ga0105066_10281041 | 3300009822 | Groundwater Sand | MQALWKRAGVSNNTIMVVQCDGPIDPGRIARALDRFLDFCPWPAARLRRPFPWGKLAWAARPRAALTTPPLRQRRVDSHDELHVELEAELNRAIDPR
Ga0134082_101140633 | 3300010303 | Grasslands Soil | MQALWKREGISNNTLMVVHCDGPLDHARIGRALDRFLDFCPWPSARLRRRLPWGPLHWAAGARAALAAPPVRHKRLSSPGALHAELEAELNAAIDPRRVSFTRARDGVRARISGPGSFSP
Ga0134071_101403243 | 3300010336 | Grasslands Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARIRRALDGFLEICPWPVARLRRPFPWGKLHW
Ga0126370_116329782 | 3300010358 | Tropical Forest Soil | MQALWKREGVSNNTLMVVECDGSIEPERIERALARFLDVCPWPAARLRRSFPWGRLHWAAGRRERLIPPPIRRATAHSHEALHAELEAELNAAIDARRDPLVRFAIIDHPGADGG
Ga0126377_129342662 | 3300010362 | Tropical Forest Soil | VSTTRVHRLPLRGADLVLVAMQALWKRERVSNNTLMVVHCDGSLDPARIARALERFVELCPWPASRLRRPFPWGQLHWSAPANAAAAAPAVRHRRLMTPDGLHALLEAELNRAIDPRVEPPLRVAICDTIDDAAGPQS
Ga0126383_104074101 | 3300010398 | Tropical Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWRHAGVSNNTLMVVQCDGLLEPERIRRALDRFLDWSPWLSARLRRRRPWGGLYWVAGARAALAAPPVRHQRLDSPAALHAELEAELNRAIDARREPPLRFAI
Ga0137383_108071042 | 3300012199 | Vadose Zone Soil | VSTARIRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWAAHEREALVRPPVRHGRLSSPEALLAELETELNTAIDPR
Ga0137380_111123442 | 3300012206 | Vadose Zone Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARIGRALDGFLEICPWPLARLRRPFPWGKLHWAAGRRAVPGSLPLLHQRVDSPKLLHTMLEAELNAAIDPR
Ga0137395_103096261 | 3300012917 | Vadose Zone Soil | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDICPWPAARLSRPFPWGALHWAAHEREALVRPPVRHGRLSSPEALHAELETELNTAI
Ga0137396_111225942 | 3300012918 | Vadose Zone Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECEGPLDPARVRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRRAMPGSLPLLHQRVDSPKLLHTMLEA
Ga0137394_106988702 | 3300012922 | Vadose Zone Soil | MQALWKREGVSNNTLMVVQCDGPLDPARTARALERFLDFCPWPAARLNRPFPWGQLHWAARARSALAAPPVRHQRLSAPEALHA
Ga0137419_112224832 | 3300012925 | Vadose Zone Soil | VRTPGLRRLPLRGADLVLVAMQALWKRAGVSNNTLMVVQCDGLLDPERIRRALDRFLDFCPWPAARLSRPFPWGRLHWAARRRAEMGAPPLRHRYLSSPDALH
Ga0137404_117763611 | 3300012929 | Vadose Zone Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARIRRALDGFLEICPWPVARLRRPFPWGK
Ga0137407_109189723 | 3300012930 | Vadose Zone Soil | VSTQRVRRLPLRGADLVLVAMQALWKREGVSNNTLMAVQCDGPLDPARIRRALHCLLDYCPWPAARLRRRLPWGQLHWAAGPRA
Ga0137407_114472171 | 3300012930 | Vadose Zone Soil | VSTPRVRRLPLRGADLVLVAMQALWKHVGVSNNTLMAVQCDGPLDASRIARALERFLDFCPWPAARLRRPFPWGALHWAARSRAALAVPPVRHERLLATEALHSALEAELNAAIDP
Ga0126369_101821924 | 3300012971 | Tropical Forest Soil | MARLPLRGADLVLLAMPALHKRARISNNALLAVDCDGPVDAGRLRRALDRFLDVCPWPAARLRRSFPWGRLHWAAGRRERLIPPPIRRATAHSHEALHAELEAELNAAIDARRDPLVRFAIIDHPGADGGVAGVFAMT
Ga0164304_102000011 | 3300012986 | Soil | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWAAHEREALVT
Ga0157378_104758161 | 3300013297 | Miscanthus Rhizosphere | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAVERFLDVCPWPAARLSRPFPWGALHWAAHERKALVRPPVRHRRISSPEALHAEL
Ga0120098_10043563 | 3300015170 | Fossill | VSPSGVRRLPLRGADLVLVAMQSLWKRFGVSNNTVMVVQCDGAIDPARLARALDRFLDFCPWPAARLRRPFPWGKLGWAARPRAALTAPPVRRRRIGSPAELQVELEAELNRAIDPRRESPIRFSI
Ga0137418_108646991 | 3300015241 | Vadose Zone Soil | VRTPGLRRLPLRGADLVLVAMQALWKRAGVSNNTLMVVQCDGLLDPERIRRALDRFLDFCPWPAARLSRPFPWGRLHWAARRRAEMGAPPLRHRYLSSPDALHAVLEAELNAAID
Ga0132257_1008859462 | 3300015373 | Arabidopsis Rhizosphere | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVHCDGPLDPARIGRALERFVELCPWPASRLRRPFPWGQLHWAAPAHAAPEAPAVRHQRGARCWRRGISPASRAFSSRLPG*
Ga0182035_108718701 | 3300016341 | Soil | MQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARAMRQTPPVRRQRLGAPEA
Ga0182034_114008892 | 3300016371 | Soil | MQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARATLRTPPVRRQRLGAPEALHAELEAELNAAIDPRREPPLRFAIFEGLGDPSGTQSALVVTWFHPL
Ga0182040_107807792 | 3300016387 | Soil | MQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARAMRQTPPVRRQRLDTPEALHAELEAELNAPIDPRR
Ga0182038_104252253 | 3300016445 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARTTLRTPPVRRQQLDGPAALHAELEAELNAPIDPRGEPPLRFAIF
Ga0134069_13396932 | 3300017654 | Grasslands Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCDGPLDPARISRSLDRFLDFCPWPSARLRRPFPWGQLHWAAGARAALAIPPVRHQRLGSPEALHAELEAELNHAI
Ga0187775_101067021 | 3300017939 | Tropical Peatland | VTISSVRRLPLRGADLVLVAMQALWKREGISNNTLMVVECEGGLEPERLRRALERFLDVCPWPAARLRRPFPWGALHWAVGERAALRTPPVRRRRLD
Ga0187785_102003201 | 3300017947 | Tropical Peatland | MQALWKREGVSNNTLMVVECEGGLEPERLRRALERFLDVCPWPAARLRRPFPWGALHWAVGERAALRTPPVRRRRLDSSEAIQAELEAELNAPIDPRRESP
Ga0184639_105634951 | 3300018082 | Groundwater Sediment | VPTRRLPLRGADLILMAMQALWRAGEVSNNALLVVECDGPLPVARVGQALDRFLDDCPWPAARLRRPFPWGKLHWAAGPRADLACP
Ga0187774_100427735 | 3300018089 | Tropical Peatland | VTISSVRRLPLRGADLVLVAMQALWKREGISNNTLMVVECEGGLEPERLRRALERFLDVCPWPAARLRRPFPWGALHWAVGERAALRTPPVRRRRLDSSEAIQAELEAELNAPIDPRRESPVRFA
Ga0066655_100116491 | 3300018431 | Grasslands Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPARIRRALDGFLEICPWPVARLRRPFPWGKLH
Ga0066662_128819552 | 3300018468 | Grasslands Soil | MQALWKREGISNNTLMVVQCDGPLDPERIRRALDRFLDVCPWPSARLRRRRPWGQLHWAAGARAALAIPPVRHQRLGQPGALPAELE
Ga0233392_10373072 | 3300024241 | Deep Subsurface Sediment | VRTSRLRRLPLRGADLVLVAMQSLWKRAGVSNNTVMVVQCDGPIDPDRIKRALDRFLDFCPWPAARLRRPFPWGKLHWAARPRAAM
Ga0207642_102793493 | 3300025899 | Miscanthus Rhizosphere | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVLCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWAAHERKALVRPPVRHRRISSPEALHAELETELNTAID
Ga0207684_106540693 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VRTPRLRRLPLRGADLVLVAMQSLWKHASVSNNTLMVVQCDGPIDPERITRTLERFLDFCPWPAARLRRPFPWGKLHWAARSRAALVPPPVRHQRLRAP
Ga0207681_105995141 | 3300025923 | Switchgrass Rhizosphere | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAVERFLDVCPWPAARLSRPFPWGALHWAAHEREALVRPPVRHRRISS
Ga0207668_121017381 | 3300025972 | Switchgrass Rhizosphere | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVLCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHWA
Ga0207675_1026113862 | 3300026118 | Switchgrass Rhizosphere | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVLCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALH
Ga0209350_10838361 | 3300026277 | Grasslands Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRHRSLS
Ga0209239_13579462 | 3300026310 | Grasslands Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRHRSLSAPEALHVELEAELNTAIDPRREPPLHFSILDSVSEATGPQ
Ga0209761_12988342 | 3300026313 | Grasslands Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRHRSLSAPEALHVE
Ga0209154_12946692 | 3300026317 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPPVRH
Ga0209470_11152271 | 3300026324 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLDVCPWPAARLRRPFPWGPLHWAARARAPLAAPLVCHR
Ga0209470_13455752 | 3300026324 | Soil | VSTARVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALHW
Ga0209158_10592391 | 3300026333 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDPARIARALERFLGVCPWPAARLRRPFPWGPLHWAPRARGPLAAPLVRHRHLSAPEALH
Ga0209377_13287101 | 3300026334 | Soil | VSTPRVRRLPLRGADLVLVAMQALWKHVGVSNNTLMAVQCDGPLDASRIARALERFLDFCPWPAARLRRPFPWGALHWAARSRAALAVPPVRHERLLATEALHSALEAELNAAIDPRHEPPLRL
Ga0209804_10713301 | 3300026335 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAP
Ga0209806_11107793 | 3300026529 | Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPGRIRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRRAVPGSLPLLHQRVDSPKLLHSMLEAELN
Ga0209806_11477311 | 3300026529 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARA
Ga0209058_13472521 | 3300026536 | Soil | VRTPRLRRLPLRGADLVLVAMQALWKHAGVSNNTLMVVQCDRPLDPERIRRSLDRFLDVCPWPAARLRRPFPWGKLHWAARSRAALIGPAVRHQRLRAPEALQA
Ga0209376_10205651 | 3300026540 | Soil | MQALWKREGVSNNTLMIVQCDGPLDPARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARGPLAAPLVRHRHLSAPEALHVELEAELNTAIDPRREPPLHFSILDSVSEATGP
Ga0209474_101243771 | 3300026550 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDPARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARGPLAAPLVRHRHLSAPE
Ga0209877_10225531 | 3300027032 | Groundwater Sand | MRRLPLRGADLVLVAMQSLWKRFGVSNNTVMVVQCDGPIDPARLARALDRFLDFCPWPAARLRRPFPWGKLGWAAGPRAALTALS
Ga0209799_10147551 | 3300027654 | Tropical Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWRHAGVSNNTLMVVQCDGLLEPERIRRALDRFLDWSPWLSARLRRRRPWGGLYWVAGARAALAAPPVRHQRLDSPAAL
Ga0209860_10436691 | 3300027949 | Groundwater Sand | VSTSRPRRLPLRGADLVLVAMQALWKRAGVSNNTIMVVQCDGPIDPGRIARALDRFLDFCPWPAARLRRPFPWGKLAWAARPRAALTTPPLRQRRVDSHDKLHVELEAE
Ga0209853_11705522 | 3300027961 | Groundwater Sand | VSTSRPRRLPLRGADLVLVAMQALWKRAGVSNNTIMVVQCDGPIDPGRIARALDRFLDFCPWPAARLRRPFPWGKLAWAARPRAALTTPPLRQRRVDSHDELHVELEAELNRAIDPRREPPLRFTVLDHATEARSSLVV
Ga0307504_100484913 | 3300028792 | Soil | VSAHARRLPLRGADLVLVAMQALWRREKISNNTLMVVECDGPLDPTRIRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRHVVPSALSLRQRRIDSPEHLQRELEAELNAVIDPRRES
Ga0307495_100848021 | 3300031199 | Soil | VSGDARRLPLRGADLVLVAMQALWRREKISNNTLMVVECEGPLDPARVRRALDGFLEICPWPVARLRRPFPWGKLHWAAGRDAV
(restricted) Ga0255312_10578133 | 3300031248 | Sandy Soil | VSIQRLRRLPLRGADLVLVAMQALWKHAGVSNNTLMVIQCDGPLDPARIARALEGFLDFCPWPAARLRRSIPWGQLHWAARSRAALVAPPVRHRRLSAPE
Ga0310886_103306921 | 3300031562 | Soil | MQALWKREGVSNNTLMVVHCDGPLAPARIARALERFVELCPWPASRLRRPFPWGQLHWAAPAKAAPEAPTVRHQRLTTPDSLHGLLEAELNRAIDPRVEPPLRVAIFDTMIDAAGPQSALVL
Ga0318572_101492254 | 3300031681 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARAMRQTP
Ga0318560_100675911 | 3300031682 | Soil | MQALWRREGVSNNTLMVVQCDGPLDPARISRSLDRFLDYCPWPSARLRRPFPWGQLHWAAGARDTLTAPPVRHQRLGSPEALHAE
Ga0307468_1004893611 | 3300031740 | Hardwood Forest Soil | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIRRAIERFLDICPWPAARLSRPFPWGALHWAAHE
Ga0307468_1016183611 | 3300031740 | Hardwood Forest Soil | VSTPRVRRLPLRGADLVLIAMQALWKRAGVSNNTVMVVQCDGPLDSSRVARALGRFLDFCPWPAARLRRPFPWGALHWAARSRATLTVPPV
Ga0318492_102852053 | 3300031748 | Soil | VSTPTVRRLPLRGADLVLVAMQALWRREGVSNNTLMVVQCDGPLDPARISRSLDRFLDYCPWPSARLRRPFPWGQLHWAAGARDTLTAPPVRHQRLGSPEALHAE
Ga0318552_100380351 | 3300031782 | Soil | MQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARATLRTPPVRRQRLGAPEALHAELEAELNAAIDPRREPPLRFAIFEG
Ga0318523_101663283 | 3300031798 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGAR
Ga0318568_101870501 | 3300031819 | Soil | MQALWRREGVSNNTLMVVQCDGPLDPARISRSLDRFLDYCPWPSARLRRPFPWGQLHWAAGARDTLTAPPVRHQRLGSPEALHAELESELN
Ga0318564_101508373 | 3300031831 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGAG
Ga0318517_103340533 | 3300031835 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSLEPERIERALARFLDVCPWPAARLRRPF
Ga0310901_104990292 | 3300031940 | Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNHSLMVVHCDGPLAPARIARALERFVELCPWPASRLRRPFPWGQLHWAAPANAAPEAPTVRHQRLTTPDSLHGLLEA
Ga0318569_101097881 | 3300032010 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPF
Ga0318506_104646661 | 3300032052 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARATLRTPPVRRQRLGAPEA
Ga0318510_104971801 | 3300032064 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARATLRTPPVRRQRLG
Ga0318513_106850971 | 3300032065 | Soil | VSPTPISRLPLRGADLVLVAMQALWKREGVSNNTLMVIEYEGSLEPERIERALARFLDVCPWPAARLRRPFPWGALH
Ga0318553_102026421 | 3300032068 | Soil | MQALWKREGVSNNTLMVIEYEGSIEPERIERALARFLDVCPWPAARLRRPFPWGALHWAAGARAMRQTPPVRRQRLDTPEALHAELEAELNAPIDPRGEPPLRFAIFDGLGDPVGTQSALVVT
Ga0310890_118235422 | 3300032075 | Soil | VSIARLRRLPLRGADLVLVAMQALWKREGVSNNTLMVVQCEGPLDPERIQRAIERFLDVCPWPAARLSRPFPWGALH
Ga0310895_103282102 | 3300032122 | Soil | MQALWKREGVSNNTLMVVHCDGPLAPARIARALERFVELCPWPASRLRRPFPWGQLHWAAPANAAPEAPAVRHQRLTTPDSLHGLLEAELNRAIDPRVEPPLR
Ga0307471_1004110241 | 3300032180 | Hardwood Forest Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMIVQCDGPLDTARIARALERFLGVCPWPAARLRRPFPWGPLHWAARARAPLAAPLV
Ga0307472_1022164692 | 3300032205 | Hardwood Forest Soil | VSTPRVRRLPLRGADLVLVAMQALWRREGVSNNTLMVVHCDGPLEPERIRRALDRFMDFCPWPAARLRRRRPWGGLHWVARARAALTAPPVRHQRLASPVALHAELEAELNRAIDPRREPPLRFAILDGAFDGAGPH
Ga0373948_0130924_2_436 | 3300034817 | Rhizosphere Soil | VSTTPVRRLPLRGADLVLVAMQALWKREGVSNNTLMVVHCDGPLAPARIARALERFVELCPWPASRLRRPFPWGQLHWAAPANAAPEAPAVRHQRLTTPDSLHGLLEAELNRAIDPRVEPPLRVAIFDTMIDAAGPQSALVLTWF


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice