NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F073322


Overview

Basic Information
Family ID: F073322
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 120
Average Sequence Length: 61 residues
Representative Sequence: VLNEPHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYEPFEGGCTV
Number of Associated Samples: 114
Number of Associated Scaffolds: 120
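As a quick sanity check on the figures above, the representative sequence can be measured directly. The sketch below counts its residues and tallies the amino acid composition; the names `REPRESENTATIVE` and `residue_counts` are illustrative and not part of NMPFamsDB. It shows that the representative is 62 residues long, one more than the family average of 61.

```python
from collections import Counter

# Representative sequence copied from the Basic Information entry above.
REPRESENTATIVE = "VLNEPHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYEPFEGGCTV"


def residue_counts(sequence: str) -> Counter:
    """Count how often each one-letter amino acid code occurs."""
    return Counter(sequence)


length = len(REPRESENTATIVE)
counts = residue_counts(REPRESENTATIVE)

print(f"length: {length} residues")
print("most common residues:", counts.most_common(3))
```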

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 98.33 %
% of genes near scaffold ends (potentially truncated): 95.83 %
% of genes from short scaffolds (< 2000 bps): 93.33 %
Associated GOLD sequencing projects: 112
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.15
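The percentages above are fractions of the 120 family members, so they can be converted back to approximate gene counts. This is a minimal sketch, assuming each percentage was computed as count / 120 and rounded to two decimals; the helper name `percent_to_count` is hypothetical.

```python
FAMILY_SIZE = 120  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Recover the nearest whole-number count behind a rounded percentage."""
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment entries above.
quality = {
    "valid RBS motif": 98.33,
    "near scaffold end (potentially truncated)": 95.83,
    "short scaffold (< 2000 bps)": 93.33,
}

for label, pct in quality.items():
    print(f"{label}: {percent_to_count(pct)} of {FAMILY_SIZE} genes")
```

Under that assumption, 118 genes have a valid RBS motif, 115 lie near a scaffold end, and 112 come from scaffolds shorter than 2000 bps.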


Most Common Taxonomy
Group: Unclassified (66.667 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil (18.333 % of family members)
Environment Ontology (ENVO): Unclassified (30.833 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (46.667 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 7.78%, β-sheet 0.00%, Coil/Unstructured 92.22%

Predicted 3D Structure

pTM-score: 0.15

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
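The rule stated above is a simple threshold on the pTM-score. This is a minimal sketch of that check, assuming only the 0.7 cut-off quoted in the note; the function name is illustrative and is not an NMPFamsDB API.

```python
LOW_CONFIDENCE_PTM = 0.7  # cut-off quoted in the note above


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cut-off."""
    return ptm_score < cutoff


# pTM-score reported for family F073322.
print(is_low_confidence(0.15))
```

With a pTM-score of 0.15, this family falls well below the cut-off, which is consistent with it not having been screened against SCOPe or PDB.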



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 120 Family Scaffolds
PF14437 | MafB19-deam | 4.17
PF13177 | DNA_pol3_delta2 | 4.17
PF00589 | Phage_integrase | 4.17
PF00581 | Rhodanese | 0.83
PF12728 | HTH_17 | 0.83




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 66.67 %
All Organisms | root | All Organisms | 33.33 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000550|F24TB_10361730 | All Organisms → cellular organisms → Bacteria | 520
3300002122|C687J26623_10070481 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 949
3300002124|C687J26631_10202879 | Not Available | 659
3300002916|JGI25389J43894_1103376 | Not Available | 514
3300004798|Ga0058859_11778761 | Not Available | 617
3300005171|Ga0066677_10778501 | Not Available | 530
3300005178|Ga0066688_10945690 | Not Available | 528
3300005445|Ga0070708_102063581 | Not Available | 527
3300005467|Ga0070706_102131957 | Not Available | 506
3300005471|Ga0070698_100055785 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 4004
3300005568|Ga0066703_10471695 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_73_26 | 749
3300005568|Ga0066703_10732413 | Not Available | 567
3300005576|Ga0066708_10589196 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_73_26 | 713
3300005586|Ga0066691_10897293 | All Organisms → cellular organisms → Bacteria | 521
3300006034|Ga0066656_10919523 | Not Available | 560
3300006796|Ga0066665_10367670 | Not Available | 1178
3300006797|Ga0066659_10380036 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1105
3300006845|Ga0075421_102732374 | Not Available | 509
3300006852|Ga0075433_11895592 | Not Available | 511
3300006894|Ga0079215_11599942 | Not Available | 518
3300006904|Ga0075424_100368683 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1530
3300006904|Ga0075424_101722137 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales | 664
3300009012|Ga0066710_100630111 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1629
3300009038|Ga0099829_11213850 | Not Available | 624
3300009089|Ga0099828_11690511 | Not Available | 557
3300009093|Ga0105240_12733764 | Not Available | 508
3300009137|Ga0066709_103837635 | Not Available | 546
3300009809|Ga0105089_1041801 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 687
3300009811|Ga0105084_1020434 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 1092
3300009821|Ga0105064_1121030 | Not Available | 548
3300010043|Ga0126380_10354441 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1070
3300010043|Ga0126380_10442612 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 979
3300010046|Ga0126384_10037553 | All Organisms → cellular organisms → Bacteria | 3263
3300010046|Ga0126384_11415649 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 649
3300010046|Ga0126384_12439910 | Not Available | 507
3300010065|Ga0127435_133067 | Not Available | 529
3300010067|Ga0127432_120982 | Not Available | 572
3300010080|Ga0127448_179323 | Not Available | 504
3300010085|Ga0127445_1013408 | Not Available | 717
3300010097|Ga0127501_1032961 | Not Available | 506
3300010108|Ga0127474_1096166 | Not Available | 565
3300010114|Ga0127460_1027299 | Not Available | 861
3300010118|Ga0127465_1073992 | Not Available | 506
3300010127|Ga0127489_1151556 | Not Available | 552
3300010142|Ga0127483_1053081 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 644
3300010325|Ga0134064_10380650 | Not Available | 559
3300010336|Ga0134071_10174203 | Not Available | 1054
3300010399|Ga0134127_11963398 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 663
3300011000|Ga0138513_100078130 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 512
3300012096|Ga0137389_11598090 | Not Available | 548
3300012164|Ga0137352_1079312 | Not Available | 655
3300012198|Ga0137364_10088334 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2159
3300012209|Ga0137379_10814515 | Not Available | 838
3300012349|Ga0137387_10518450 | Not Available | 865
3300012362|Ga0137361_11749744 | Not Available | 540
3300012371|Ga0134022_1076704 | Not Available | 509
3300012374|Ga0134039_1202316 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 518
3300012376|Ga0134032_1028412 | Not Available | 573
3300012379|Ga0134058_1052191 | Not Available | 511
3300012396|Ga0134057_1271465 | Not Available | 600
3300012398|Ga0134051_1362588 | Not Available | 501
3300012401|Ga0134055_1230907 | Not Available | 528
3300012401|Ga0134055_1308434 | Not Available | 622
3300012407|Ga0134050_1018336 | Not Available | 596
3300012410|Ga0134060_1030438 | Not Available | 508
3300012929|Ga0137404_10057383 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3008
3300014865|Ga0180078_1047054 | Not Available | 716
3300015054|Ga0137420_1284674 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 580
3300016319|Ga0182033_10193358 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1612
3300016404|Ga0182037_11796893 | Not Available | 548
3300018076|Ga0184609_10457868 | Not Available | 586
3300018082|Ga0184639_10041399 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2367
3300019212|Ga0180106_1163092 | Not Available | 534
3300019228|Ga0180119_1190866 | Not Available | 548
3300019254|Ga0184641_1008039 | Not Available | 572
3300019887|Ga0193729_1186669 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 719
3300020065|Ga0180113_1412531 | Not Available | 540
3300020610|Ga0154015_1039425 | Not Available | 508
3300021151|Ga0179584_1100659 | Not Available | 577
3300021307|Ga0179585_1180216 | Not Available | 537
3300025149|Ga0209827_10391542 | Not Available | 699
3300025313|Ga0209431_10665381 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 773
3300025538|Ga0210132_1066574 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 540
3300025560|Ga0210108_1060793 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → unclassified Bradyrhizobium → Bradyrhizobium sp. STM 3809 | 730
3300025910|Ga0207684_10003754 | All Organisms → cellular organisms → Bacteria | 14680
3300025922|Ga0207646_11842145 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → unclassified Clostridia → Clostridia bacterium 62_21 | 517
3300025941|Ga0207711_12095964 | Not Available | 508
3300026312|Ga0209153_1028251 | Not Available | 1898
3300026497|Ga0257164_1091375 | Not Available | 521
3300026514|Ga0257168_1099491 | Not Available | 647
3300026529|Ga0209806_1314202 | Not Available | 526
3300026532|Ga0209160_1034371 | All Organisms → cellular organisms → Bacteria | 3144
3300027266|Ga0209215_1012151 | Not Available | 1063
3300027480|Ga0208993_1092305 | Not Available | 554
3300027561|Ga0209887_1072493 | Not Available | 716
3300027617|Ga0210002_1098498 | Not Available | 528
3300027655|Ga0209388_1042887 | Not Available | 1304
3300027669|Ga0208981_1082902 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 818
3300027748|Ga0209689_1418959 | Not Available | 513
3300027775|Ga0209177_10150893 | Not Available | 788
3300028380|Ga0268265_11521854 | Not Available | 673
3300028589|Ga0247818_11369917 | Not Available | 509
3300028791|Ga0307290_10355050 | Not Available | 536
3300030683|Ga0247621_1194732 | Not Available | 518
3300030904|Ga0308198_1085848 | Not Available | 535
3300031094|Ga0308199_1193537 | Not Available | 509
3300031421|Ga0308194_10365748 | Not Available | 517
3300031720|Ga0307469_10453979 | All Organisms → cellular organisms → Bacteria | 1110
3300031768|Ga0318509_10011366 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3864
3300031797|Ga0318550_10447246 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 625
3300031880|Ga0318544_10250271 | Not Available | 686
3300031911|Ga0307412_11140585 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_73_26 | 695
3300031941|Ga0310912_10839144 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 709
3300032002|Ga0307416_102777868 | Not Available | 585
3300032005|Ga0307411_11735307 | Not Available | 578
3300032064|Ga0318510_10550374 | Not Available | 504
3300032180|Ga0307471_100897952 | Not Available | 1053
3300032397|Ga0315287_11867723 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 666
3300033475|Ga0310811_10958701 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium RIFCSPHIGHO2_02_FULL_73_26 | 758
3300033485|Ga0316626_11766027 | Not Available | 559
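Each scaffold identifier listed above appears to follow the pattern `<taxon OID>|<scaffold name>`, where the part before the vertical bar matches a Taxon OID in the Associated Samples table. The sketch below splits an identifier on that assumption. The `ScaffoldId` type and `parse_scaffold_id` function are hypothetical helpers, not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str      # e.g. "3300000550", also listed under Associated Samples
    scaffold_name: str  # e.g. "F24TB_10361730"


def parse_scaffold_id(identifier: str) -> ScaffoldId:
    """Split '<taxon OID>|<scaffold name>' at the first vertical bar."""
    taxon_oid, sep, scaffold_name = identifier.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldId(taxon_oid, scaffold_name)


parsed = parse_scaffold_id("3300000550|F24TB_10361730")
print(parsed.taxon_oid, parsed.scaffold_name)
```

Grouping scaffolds by `taxon_oid` in this way would show, for example, that some samples (such as 3300010046) contribute more than one scaffold to the family.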




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 18.33%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 6.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 5.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.17%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.17%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 3.33%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 3.33%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.33%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.50%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.50%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 2.50%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 2.50%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.67%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.67%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.67%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.67%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.67%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.67%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 0.83%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 0.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.83%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.83%
Soil | Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil | 0.83%
Host-Associated | Host-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.83%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.83%
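The habitat rows above can be rolled up by the first level of their taxonomy path. As an illustration, the sketch below sums the rows whose path starts with Host-Associated and converts the total into an approximate member count. It assumes the percentages are rounded fractions of the 120 family members, and the variable names are illustrative only.

```python
FAMILY_SIZE = 120  # number of family members (one scaffold each)

# Host-Associated rows copied from the habitat list above:
# (habitat, second and third taxonomy levels, percent of family members).
host_associated = [
    ("Populus Rhizosphere", "Plants → Rhizosphere", 3.33),
    ("Rhizosphere", "Plants → Rhizosphere", 2.50),
    ("Host-Associated", "Human → Digestive System", 0.83),
    ("Switchgrass Rhizosphere", "Plants → Rhizoplane", 0.83),
    ("Arabidopsis Thaliana Rhizosphere", "Plants → Rhizoplane", 0.83),
    ("Switchgrass Rhizosphere", "Plants → Rhizosphere", 0.83),
    ("Corn Rhizosphere", "Plants → Rhizosphere", 0.83),
]

# Sum the shares, then convert back to a whole number of family members.
total_percent = round(sum(pct for _, _, pct in host_associated), 2)
approx_members = round(total_percent / 100 * FAMILY_SIZE)

print(f"Host-Associated share: {total_percent}% (about {approx_members} members)")
```

On these figures, roughly a tenth of the family comes from host-associated habitats, with the remainder from environmental (mostly soil) samples.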




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300002122 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2 | Environmental
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental
3300004798 | Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-2 (Metagenome Metatranscriptome) | Host-Associated
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005178 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006894 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Control | Environmental
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009809 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_30_40 | Environmental
3300009811 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 | Environmental
3300009821 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010065 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010067 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_20_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010080 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010085 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010097 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010108 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_5_8_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010114 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010118 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_20_2_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010127 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010142 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_1 metaT (Metagenome Metatranscriptome) | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300011000 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t6i015 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012164 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT730_2 | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012371 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_2_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012374 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_8_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012376 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_20cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012379 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012396 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012398 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_24_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012401 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012407 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012410 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300014865 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT499_16_10D | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300019212 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT25_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019228 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300019254 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome) | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300020065 | Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome) | Environmental
3300020610 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5pm-1 (Metagenome Metatranscriptome) | Environmental
3300021151 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300021307 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_06_16RNAfungal (Metagenome Metatranscriptome) | Environmental
3300025149 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 (SPAdes) | Environmental
3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental
3300025538 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D2 (SPAdes) | Environmental
3300025560 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025941 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes) | Host-Associated
3300026312 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120 (SPAdes) | Environmental
3300026497 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300027266 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M2 (SPAdes) | Environmental
3300027480 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M3 (SPAdes) | Environmental
3300027561 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40 (SPAdes) | Environmental
3300027617 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 S AM (SPAdes) | Host-Associated
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027669Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028589Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day1EnvironmentalOpen in IMG/M
3300028791Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_144EnvironmentalOpen in IMG/M
3300030683Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Anb10 (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030904Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_202 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031094Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031768Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22EnvironmentalOpen in IMG/M
3300031797Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23EnvironmentalOpen in IMG/M
3300031880Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f25EnvironmentalOpen in IMG/M
3300031911Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-1Host-AssociatedOpen in IMG/M
3300031941Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080EnvironmentalOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032005Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-O-1Host-AssociatedOpen in IMG/M
3300032064Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f17EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032397Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_0EnvironmentalOpen in IMG/M
3300033475Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YCEnvironmentalOpen in IMG/M
3300033485Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_T1_C1_D5_AEnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
F24TB_103617302 | 3300000550 | Soil | KVGPHATRIVEPRQARKGATVRRPSRVPWEYLAGVGSAGRSQYGSIDGGCTVTLLTYQVE
C687J26623_100704814 | 3300002122 | Soil | VSNEPRGTVLKVGPHATSRVESRQTRKGATVRRSARVPWEYLTGAGSAWRSRDGRFEGGCTVL
C687J26631_102028793 | 3300002124 | Soil | VSNEPRGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGSAWRSRDGRFEGGCTVLI
JGI25389J43894_11033761 | 3300002916 | Grasslands Soil | VLSEPHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYGPFEGGCTVIVL
Ga0058859_117787611 | 3300004798 | Host-Associated | VSNEPRGTVMKVGPHATRNVEPRQARKGATVRRLSRVPWEYLAGAGSMWRSRYESFEGGCTVI
Ga0066677_107785011 | 3300005171 | Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPFRVPWEYLAGAGSGERSQYGPFEGGCTVIVLERE
Ga0066688_109456903 | 3300005178 | Soil | VLSELHGAALKAGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIVL
Ga0070708_1020635811 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGCTV
Ga0070706_1021319571 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGRSQYESFEGGCTVT
Ga0070698_10005578510 | 3300005471 | Corn, Switchgrass And Miscanthus Rhizosphere | VLSELHGAALKAGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIV
Ga0066703_104716954 | 3300005568 | Soil | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYGSFEGGCTVTL
Ga0066703_107324131 | 3300005568 | Soil | VSNELRGTVLKVGPHATSRAKPRQTRKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGCTVIQ
Ga0066708_105891964 | 3300005576 | Soil | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYG
Ga0066691_108972931 | 3300005586 | Soil | HGAALKAGPHATEIVELRQARKGATVRRPFRVPWEYLAGAGSVGRSQGGSLRSGCTAT*
Ga0066656_109195233 | 3300006034 | Soil | VLSEPHGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLTGAGVVGRSQDCSFEGGCTV
Ga0066665_103676701 | 3300006796 | Soil | LKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYGSFEGGCTVTL
Ga0066659_103800361 | 3300006797 | Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQ
Ga0075421_1027323743 | 3300006845 | Populus Rhizosphere | VLNEPRRTALKVGPYATKIVEPRQARKGATVRRPFRVPWEYLTGAGSAGRSQYGSIDGGCTVT
Ga0075433_118955921 | 3300006852 | Populus Rhizosphere | VLSELHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPF
Ga0079215_115999421 | 3300006894 | Agricultural Soil | VLNEPYGAALKAWPYATRIVEPRQARKGATVRRHSRVPWEYLAGAGPVRSSQYGSIEGGCTVTLKV
Ga0075424_1003686831 | 3300006904 | Populus Rhizosphere | VLNELHGVALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDGTFEGRCTVI
Ga0075424_1017221372 | 3300006904 | Populus Rhizosphere | VSSEPHEAALKVGPHATRSVEPRQARKGATVRRPSRVPWEYLAGAGSVGSSRYEMFEGGCTVTEYMEGDEATPPSPPR*
Ga0066710_1006301115 | 3300009012 | Grasslands Soil | VLNELHGVALKVGPHATRFVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDG
Ga0099829_112138501 | 3300009038 | Vadose Zone Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDG
Ga0099828_116905111 | 3300009089 | Vadose Zone Soil | VLNEPHAVALKVGPYATRIVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSQDG
Ga0105240_127337641 | 3300009093 | Corn Rhizosphere | VLNEPHGTVLKVGPHATRIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYEPF
Ga0066709_1038376351 | 3300009137 | Grasslands Soil | VLNELHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYEPFEGGCTVIVLE
Ga0105089_10418011 | 3300009809 | Groundwater Sand | LKVGPHAMRIVEPRQTRKGATVRRPSRVPWEYLAGAGSGERSQYEPFEGRCTVIVTEREI
Ga0105084_10204344 | 3300009811 | Groundwater Sand | VSNEPYGTALRVGPHATGCVEPRQARKGATVRRPSRVPWEYLTGAGSAGRSQDGPFEGGCTA
Ga0105064_11210301 | 3300009821 | Groundwater Sand | VLNEPRGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGVGSVGRSRYGP
Ga0126380_103544415 | 3300010043 | Tropical Forest Soil | VSNEPCGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGFARRSRD
Ga0126380_104426121 | 3300010043 | Tropical Forest Soil | VSNEPCGTALKVGPYATSRVESRQARKGATVRRSARVPWEYLTGAGFARRSRDGR
Ga0126384_1003755310 | 3300010046 | Tropical Forest Soil | VSNEPCGTALKVGPYATSRVESRQARKGATVRRSARVPWEYLTGAGFARRSRDGRFEGGCTVIQ
Ga0126384_114156491 | 3300010046 | Tropical Forest Soil | VSNEPYGTALRAGPHATGCVEPRQARKGATVRRPSRVPWEYLAGAVSTRRSQDGSIEGGCTVTLYRS
Ga0126384_124399103 | 3300010046 | Tropical Forest Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIVNGA
Ga0127435_1330672 | 3300010065 | Grasslands Soil | VLSEPHGAALKVGPHATGIAESRQARKGATVRRPSRVPWKYLTGAGSMGRSQYRSFGGGC
Ga0127432_1209821 | 3300010067 | Grasslands Soil | VLNEPQGAALKVGPHATRITEPRQTRKGATVRRPSRVPWEYLTGAGFLGSSQYGSFEG
Ga0127448_1793231 | 3300010080 | Grasslands Soil | VLNEPQGAALKLGPHATRITEPRQTRKGATVRRPSSVPWEYLTGAGSLRRSQYESFEGGC
Ga0127445_10134081 | 3300010085 | Grasslands Soil | VLNEPHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYEPFEGGCTV
Ga0127501_10329611 | 3300010097 | Grasslands Soil | VLSEPHGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLTGAGFVGRSQDCSFEGGCTV
Ga0127474_10961661 | 3300010108 | Grasslands Soil | VLNEPQGAALKVGPHATRITEPRQTRKGATVRRPSRVPWEYLTGAGSLRRSQYESFEGGCTVT
Ga0127460_10272992 | 3300010114 | Grasslands Soil | VLNELHGAALKVGPHAMRIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDGPFEGGCTV
Ga0127465_10739921 | 3300010118 | Grasslands Soil | VLSEPHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSQDRSFEGGCTV
Ga0127489_11515561 | 3300010127 | Grasslands Soil | VLSEPRGAALKVGPHATRITEPRQTRKGATVRRPSRVPWEYLTGAGSLRRSQYESFEGGCTVT
Ga0127483_10530811 | 3300010142 | Grasslands Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGCTVIGRARR
Ga0134064_103806504 | 3300010325 | Grasslands Soil | VLNEPQGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLAGAGFVRRSRDGRFD
Ga0134071_101742032 | 3300010336 | Grasslands Soil | VLSELHGAALKAGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIVLEREIGVDEGGE*
Ga0134127_119633981 | 3300010399 | Terrestrial Soil | VSNELLGTVLKVGPHATRSVEPRQARKGATVRRHFRVPWEYLTGAGFTERSRYGRFEGG
Ga0138513_1000781301 | 3300011000 | Soil | VLNEPRRTVLKVGPHATKIVEPRQARKGATVRRPFRVPWEYLTGVGSAGRSQYGSIDGGCTA
Ga0137389_115980901 | 3300012096 | Vadose Zone Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGCTVLTGS
Ga0137352_10793124 | 3300012164 | Soil | VLNEPHVAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLIGADPVGRSQYGPFGGGCTA
Ga0137364_100883341 | 3300012198 | Vadose Zone Soil | LKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSQDRSFEGGCTVI
Ga0137379_108145154 | 3300012209 | Vadose Zone Soil | VLSEPHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSTRRSRDGRFDGGCTVIFVPGG
Ga0137387_105184504 | 3300012349 | Vadose Zone Soil | VLNELHGVALKVGPHATRFVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQ
Ga0137361_117497441 | 3300012362 | Vadose Zone Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSTWRSRDGR
Ga0134022_10767041 | 3300012371 | Grasslands Soil | VLSEPHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSQDRSFEGGCTVI
Ga0134039_12023161 | 3300012374 | Grasslands Soil | VLSELHGAALKAGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTV
Ga0134032_10284122 | 3300012376 | Grasslands Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGCTA
Ga0134058_10521911 | 3300012379 | Grasslands Soil | VLSEPHGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLTGAGFVGRSQDCSFEGGCTVIG
Ga0134057_12714651 | 3300012396 | Grasslands Soil | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFMGRSQY
Ga0134051_13625881 | 3300012398 | Grasslands Soil | VLNEPHGAALKVGPHATENVEPRQARKGATVRRLFRVPWEYLTGAGSVRSSQDGPFGGGC
Ga0134055_12309071 | 3300012401 | Grasslands Soil | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFMGRSQYGSFEG
Ga0134055_13084341 | 3300012401 | Grasslands Soil | VLNDLHGAALKVGTHAMRIVEPRQARKGATVRRPSRVPWDYLTGAGSGERSQDGPFEGGC
Ga0134050_10183363 | 3300012407 | Grasslands Soil | VLSEPRRAALKAGPHATRIVEPRQARKGATVRRPSRVPWEYLAGAVSTGRSQDGSIEGGC
Ga0134060_10304381 | 3300012410 | Grasslands Soil | VLSEPHGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLTGAGFVGRSQDCSFEGGCTVI
Ga0137404_100573831 | 3300012929 | Vadose Zone Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVI
Ga0180078_10470542 | 3300014865 | Soil | VSNEPRGTVLKVGPHATSRVESRQTRKGATVRRFARVPWEYLTGAGSAWRSRDGRFDGGCTVLIRFGGE
Ga0137420_12846741 | 3300015054 | Vadose Zone Soil | LNEPHGAALKAGPHATEIVELRQARKGATVRRPFRVPWEYLAGAGSVGRSQGRVA
Ga0182033_101933581 | 3300016319 | Soil | VSNEPCGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGFARRSRDGRFEGGCTVM
Ga0182037_117968933 | 3300016404 | Soil | VLNELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQYGPFEGGCTVIVL
Ga0184609_104578681 | 3300018076 | Groundwater Sediment | VLSELHGAASKVGPHAMGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDGTFEGGCTVIVSERESGL
Ga0184639_100413991 | 3300018082 | Groundwater Sediment | VLSELHGAASKVGPHATGIVEPRQTRKGATVRRPSRVPWEYLTGAGSGERSQDGTFEGGCTV
Ga0180106_11630923 | 3300019212 | Groundwater Sediment | VLNEPHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLIGADPVGRSQYGLVGGGCTA
Ga0180119_11908663 | 3300019228 | Groundwater Sediment | VLNEPYGAATKAWPHATRIVEPRQARKGATVRRPSRVPWEYLTRAGFAGRSRCGVFEGGCTVTYGLPGSRAD
Ga0184641_10080391 | 3300019254 | Groundwater Sediment | VSNEPQGAALKVGPHATRIVEPRQARKGATVRRPFRVPWEYLTGAGSAGRSQYGSID
Ga0193729_11866691 | 3300019887 | Soil | VSNEPYGTALRVGPHATGCVEPRQARKGATVRRSSRVPWEYLTGAGFVRRSRDGR
Ga0180113_14125311 | 3300020065 | Groundwater Sediment | VLNEPHGAVLKVGPHATGIVEPRQARKGATVRRPSRVPWEYLIGADPVGRSQYGLVGGGC
Ga0154015_10394253 | 3300020610 | Corn, Switchgrass And Miscanthus Rhizosphere | VLNEPHRTALKVGPHATKIVEPRQARKGATVRRPFRVPWEYLTGVGSAGRSQYGSIDGGCTV
Ga0179584_11006591 | 3300021151 | Vadose Zone Soil | VLSELHGAALKVGPHAMGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGC
Ga0179585_11802161 | 3300021307 | Vadose Zone Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDG
Ga0209827_103915421 | 3300025149 | Thermal Springs | VSSEPRGTVLKVGPYATRIVEPRQARKGATVRRPSRVPWEYLTGAGSTGRSRYEPFGRGCTVIS
Ga0209431_106653814 | 3300025313 | Soil | VSNEPRGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGSAWRSRDGRFEGGCTVLF
Ga0210132_10665741 | 3300025538 | Natural And Restored Wetlands | VSNEPRGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGSTWRSRDGR
Ga0210108_10607931 | 3300025560 | Natural And Restored Wetlands | VSNEPYGTALKVGPHATGCVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSRYGRFD
Ga0207684_100037542 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGRSQYESFEGGCTVTYRSAR
Ga0207646_118421451 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | VLNEPHAVVLKVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGFVRRSQDGPFGGRCTTTLFTGGAP
Ga0207711_120959641 | 3300025941 | Switchgrass Rhizosphere | VLNEPHGTVLKVGPHATRIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEG
Ga0209153_10282512 | 3300026312 | Soil | VLNEPQGAALKVGPHATRITEPRQTRKGATVRRPSRVPWEYLTGAGFRGSSQYGSFEGGCTVTFDLPSERLRRAPALDGRAIIT
Ga0257164_10913753 | 3300026497 | Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGC
Ga0257168_10994913 | 3300026514 | Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRD
Ga0209806_13142021 | 3300026529 | Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIVL
Ga0209160_10343712 | 3300026532 | Soil | LKAGPHATEIVELRQARKGATVRRPFRVPWEYLAGAGSVGRSQGGSLRSGCTAT
Ga0209215_10121514 | 3300027266 | Forest Soil | VLNELHGAALKVGPHAMRFVEPRQARKGATVRRPSRVPWEYLTGAGSVERSQYEPF
Ga0208993_10923051 | 3300027480 | Forest Soil | VLSELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSGERSQYGPFEGGCTVIVSEREIG
Ga0209887_10724934 | 3300027561 | Groundwater Sand | VSNELRGTVLKVGPHAMSRVKSRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFDGGC
Ga0210002_10984982 | 3300027617 | Arabidopsis Thaliana Rhizosphere | VSNEPQGAALKVGPHAMRIVEPRQARKGATVRRPSRVPWEYLTGAGFLGRSRYGSFDWGCTVTVSER
Ga0209388_10428871 | 3300027655 | Vadose Zone Soil | VSNELRGTVLKVGPHAMSRVKPRQARKGATVRRLARVPWEYLTGAGSALRSRDGRFEGGC
Ga0208981_10829024 | 3300027669 | Forest Soil | VLSEPHGAASKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYGSFEGGCTVTFDLPSERLRRA
Ga0209689_14189593 | 3300027748 | Soil | VLSEPHGAALKVGPHATRIAEPRQARKGATVRRPSRVPWEYLTGAGVVGRSQDCSFEGGCTVIV
Ga0209177_101508931 | 3300027775 | Agricultural Soil | VLNESRGTALKVGPYATKIVEPRQARKGATVRRPFRVPWEYLAGVGSAGRSQYGLIDGGCTV
Ga0268265_115218541 | 3300028380 | Switchgrass Rhizosphere | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGRSQYASFEGGCTV
Ga0247818_113699171 | 3300028589 | Soil | VLNEPRRTVLKVGPHATRIVEPRQARKGATVRRPFRVPWEYLTGAGSVGRSQYGSIDGGCTVTVY
Ga0307290_103550503 | 3300028791 | Soil | VLNEPRRTALKVGPYATKIVEPRQARKGATVRRPFRVPWEYLTGVGSAGRSQYG
Ga0247621_11947321 | 3300030683 | Soil | VLSELHGAASKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSVESSRFGSFG
Ga0308198_10858481 | 3300030904 | Soil | VLSELHGAASKVGPHAMGIVEPRQARKGATVRRPSRVPWEYLTGAGSGERSQDGTFEGGC
Ga0308199_11935371 | 3300031094 | Soil | VLNEPRRTALKVGPYATKIVEPRQARKGATVRRPFRVPWEYLTGVGSAGRSQYGLIDGGCTV
Ga0308194_103657481 | 3300031421 | Soil | VSNEPRGTVLKVGPHATSHVESRQARKGATVRHPSRVPWEYLAGAGFIRRSRYGRFD
Ga0307469_104539795 | 3300031720 | Hardwood Forest Soil | VLNEPHGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGSVGRSQDGPFEGG
Ga0318509_100113667 | 3300031768 | Soil | VLNELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSVERSQYGPFEGGC
Ga0318550_104472461 | 3300031797 | Soil | VSNEPCGTVLKVGPHATSRVESRQARKGATVRRSARVPWEYLTGAGFARRSRDGRFEGGCTVMSVKCG
Ga0318544_102502711 | 3300031880 | Soil | VLNELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSVERSQYGPFEGGCTVM
Ga0307412_111405851 | 3300031911 | Rhizosphere | VSNEPQGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGFLGRSRYGSLDGGCTVTL
Ga0310912_108391442 | 3300031941 | Soil | VSNEPCGTVLKVGPHATSRVESRQARKGATVRRSVRVPWEYLTGAGFARRSRDGRFEGGC
Ga0307416_1027778683 | 3300032002 | Rhizosphere | VLNEPQGAALKVGPHAMRIVEPRQARKGATVRRPSRVPWEYLTGAGFLGRSRYGSFDWGCTVTVTEREIRLK
Ga0307411_117353071 | 3300032005 | Rhizosphere | VLNEPRRTALKVGPHATKIVESRQARKGATVRRPFRVPWEYLTGAGSAGRSQYRS
Ga0318510_105503743 | 3300032064 | Soil | VLNELHGAALKVGPHATGIVEPRQARKGATVRRPSRVPWEYLAGAGSVERSQYG
Ga0307471_1008979525 | 3300032180 | Hardwood Forest Soil | VLNEPQGAALKVGPHATRITEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYGSFEGGCTVT
Ga0315287_118677232 | 3300032397 | Sediment | VSNEPRGTALKVGPHATSRVESRQTRKGATVRRSARVPWEYLTGAGSAWRSRDGLFDGGCTVLIDSGGER
Ga0310811_109587011 | 3300033475 | Soil | VLNEPQGAALKVGPHATRIVEPRQARKGATVRRPSRVPWEYLTGAGFLGSSQYRSFEGGCTVTV
Ga0316626_117660273 | 3300033485 | Soil | VSNEPRGAALMVGPHATGIVEPRQARKGATVRRPSRVPWEYLTGAGSVRRSRDESIGG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice