NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F071706

Metagenome Family F071706

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F071706
Family Type Metagenome
Number of Sequences 122
Average Sequence Length 96 residues
Representative Sequence MNEVELLALGDALARTSDVLRPGEHVSFRAYLARVLQSEEQIAALFGETAFHVSVDGRPVDGPASTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN
Number of Associated Samples 90
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 84.43 %
% of genes near scaffold ends (potentially truncated) 27.87 %
% of genes from short scaffolds (< 2000 bps) 86.07 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.83

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (54.918 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(24.590 % of family members)
Environment Ontology (ENVO) Unclassified
(31.148 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(47.541 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 25.98%    β-sheet: 25.20%    Coil/Unstructured: 48.82%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.83
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
e.7.1.2: GlpX-like bacterial fructose-1,6-bisphosphatased3biga_3big0.52767
e.7.1.1: Inositol monophosphatase/fructose-1,6-bisphosphatase-liked1lbva_1lbv0.52529
e.7.1.1: Inositol monophosphatase/fructose-1,6-bisphosphatase-liked2bjia_2bji0.52212
e.7.1.0: automated matchesd5zhha_5zhh0.51226
e.7.1.0: automated matchesd5eq7a_5eq70.51175


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 122 Family Scaffolds
PF04552Sigma54_DBD 14.75
PF11154DUF2934 4.10
PF13365Trypsin_2 3.28
PF04963Sigma54_CBD 3.28
PF15919HicB_lk_antitox 1.64
PF00664ABC_membrane 1.64
PF13473Cupredoxin_1 0.82
PF02653BPD_transp_2 0.82
PF04185Phosphoesterase 0.82
PF07883Cupin_2 0.82
PF04909Amidohydro_2 0.82
PF08487VIT 0.82
PF01850PIN 0.82
PF13189Cytidylate_kin2 0.82
PF07750GcrA 0.82
PF01243Putative_PNPOx 0.82
PF01478Peptidase_A24 0.82
PF00583Acetyltransf_1 0.82
PF00581Rhodanese 0.82
PF14833NAD_binding_11 0.82
PF04392ABC_sub_bind 0.82

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 122 Family Scaffolds
COG1508DNA-directed RNA polymerase specialized sigma subunit, sigma54 homologTranscription [K] 18.03
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.82
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.82
COG5352Uncharacterized conserved proteinFunction unknown [S] 0.82


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A54.92 %
All OrganismsrootAll Organisms45.08 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002914|JGI25617J43924_10243078Not Available607Open in IMG/M
3300003994|Ga0055435_10036161All Organisms → cellular organisms → Bacteria1137Open in IMG/M
3300004024|Ga0055436_10248486Not Available567Open in IMG/M
3300005445|Ga0070708_100197094Not Available1884Open in IMG/M
3300005445|Ga0070708_100670802All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium976Open in IMG/M
3300005445|Ga0070708_100694770Not Available958Open in IMG/M
3300005445|Ga0070708_101758717Not Available576Open in IMG/M
3300005467|Ga0070706_100199021All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1871Open in IMG/M
3300005467|Ga0070706_100386807All Organisms → cellular organisms → Bacteria1302Open in IMG/M
3300005467|Ga0070706_100437329Not Available1218Open in IMG/M
3300005467|Ga0070706_100973671Not Available783Open in IMG/M
3300005467|Ga0070706_101116478All Organisms → cellular organisms → Bacteria → Terrabacteria group726Open in IMG/M
3300005467|Ga0070706_101138233Not Available718Open in IMG/M
3300005468|Ga0070707_101355097Not Available678Open in IMG/M
3300005471|Ga0070698_101791570All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales → Pseudomonadaceae → Pseudomonas → unclassified Pseudomonas → Pseudomonas sp. G5(2012)567Open in IMG/M
3300005518|Ga0070699_100136083Not Available2167Open in IMG/M
3300005518|Ga0070699_101230823Not Available687Open in IMG/M
3300006047|Ga0075024_100130210Not Available1130Open in IMG/M
3300006163|Ga0070715_10189243All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1038Open in IMG/M
3300006173|Ga0070716_101664349Not Available526Open in IMG/M
3300006173|Ga0070716_101728675Not Available516Open in IMG/M
3300006852|Ga0075433_10717115Not Available875Open in IMG/M
3300006903|Ga0075426_10043778All Organisms → cellular organisms → Bacteria3198Open in IMG/M
3300006904|Ga0075424_100014727All Organisms → cellular organisms → Bacteria7881Open in IMG/M
3300007255|Ga0099791_10489578All Organisms → cellular organisms → Bacteria → Acidobacteria597Open in IMG/M
3300007265|Ga0099794_10293206Not Available842Open in IMG/M
3300007265|Ga0099794_10809901Not Available501Open in IMG/M
3300009038|Ga0099829_10047697All Organisms → cellular organisms → Bacteria3157Open in IMG/M
3300009088|Ga0099830_10228447All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1466Open in IMG/M
3300009089|Ga0099828_11207301Not Available670Open in IMG/M
3300009143|Ga0099792_10651538Not Available676Open in IMG/M
3300010391|Ga0136847_11019005All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Alicyclobacillaceae → Alicyclobacillus536Open in IMG/M
3300010400|Ga0134122_10665060Not Available973Open in IMG/M
3300011270|Ga0137391_11088867All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium647Open in IMG/M
3300012040|Ga0137461_1106711All Organisms → cellular organisms → Bacteria804Open in IMG/M
3300012096|Ga0137389_10700637All Organisms → cellular organisms → Bacteria870Open in IMG/M
3300012189|Ga0137388_10927050Not Available805Open in IMG/M
3300012202|Ga0137363_10330501Not Available1257Open in IMG/M
3300012205|Ga0137362_10016581All Organisms → cellular organisms → Bacteria → Proteobacteria5580Open in IMG/M
3300012226|Ga0137447_1049698Not Available755Open in IMG/M
3300012358|Ga0137368_10849571Not Available561Open in IMG/M
3300012363|Ga0137390_10172875All Organisms → cellular organisms → Bacteria2147Open in IMG/M
3300012363|Ga0137390_10455352All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1255Open in IMG/M
3300012363|Ga0137390_11945739Not Available515Open in IMG/M
3300012582|Ga0137358_10046327All Organisms → cellular organisms → Bacteria2889Open in IMG/M
3300012917|Ga0137395_10555698Not Available828Open in IMG/M
3300012918|Ga0137396_10384384Not Available1040Open in IMG/M
3300012922|Ga0137394_10513395All Organisms → cellular organisms → Bacteria1017Open in IMG/M
3300012925|Ga0137419_10845618Not Available751Open in IMG/M
3300012927|Ga0137416_10254408All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1436Open in IMG/M
3300012930|Ga0137407_10103798All Organisms → cellular organisms → Bacteria2442Open in IMG/M
3300012958|Ga0164299_10573976All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium766Open in IMG/M
3300014308|Ga0075354_1087035Not Available635Open in IMG/M
3300015264|Ga0137403_10008077All Organisms → cellular organisms → Bacteria11788Open in IMG/M
3300017997|Ga0184610_1053579All Organisms → cellular organisms → Bacteria1199Open in IMG/M
3300018056|Ga0184623_10521446Not Available505Open in IMG/M
3300018074|Ga0184640_10182541Not Available944Open in IMG/M
3300018076|Ga0184609_10066287All Organisms → cellular organisms → Bacteria1572Open in IMG/M
3300018076|Ga0184609_10289133All Organisms → cellular organisms → Bacteria767Open in IMG/M
3300018078|Ga0184612_10107043All Organisms → cellular organisms → Bacteria1464Open in IMG/M
3300018084|Ga0184629_10006121All Organisms → cellular organisms → Bacteria4506Open in IMG/M
3300018084|Ga0184629_10526358Not Available612Open in IMG/M
3300019458|Ga0187892_10158928Not Available1255Open in IMG/M
3300019458|Ga0187892_10251783All Organisms → cellular organisms → Bacteria907Open in IMG/M
3300019487|Ga0187893_10151370All Organisms → cellular organisms → Bacteria1887Open in IMG/M
3300019487|Ga0187893_10271478All Organisms → cellular organisms → Bacteria1230Open in IMG/M
3300019881|Ga0193707_1096981All Organisms → cellular organisms → Bacteria885Open in IMG/M
3300019883|Ga0193725_1053852All Organisms → cellular organisms → Bacteria1022Open in IMG/M
3300019883|Ga0193725_1147435Not Available510Open in IMG/M
3300019886|Ga0193727_1029575All Organisms → cellular organisms → Bacteria1887Open in IMG/M
3300020004|Ga0193755_1014026All Organisms → cellular organisms → Bacteria2637Open in IMG/M
3300021081|Ga0210379_10165436Not Available945Open in IMG/M
3300021086|Ga0179596_10722735Not Available504Open in IMG/M
3300021344|Ga0193719_10134430Not Available1072Open in IMG/M
3300021432|Ga0210384_10299260All Organisms → cellular organisms → Bacteria1448Open in IMG/M
3300025535|Ga0207423_1002496All Organisms → cellular organisms → Bacteria2615Open in IMG/M
3300025549|Ga0210094_1029973All Organisms → cellular organisms → Bacteria908Open in IMG/M
3300025551|Ga0210131_1013275All Organisms → cellular organisms → Bacteria1144Open in IMG/M
3300025580|Ga0210138_1007647All Organisms → cellular organisms → Bacteria2052Open in IMG/M
3300025910|Ga0207684_10303599Not Available1376Open in IMG/M
3300025910|Ga0207684_10334808All Organisms → cellular organisms → Bacteria1304Open in IMG/M
3300025910|Ga0207684_10469475Not Available1080Open in IMG/M
3300025910|Ga0207684_10978948All Organisms → cellular organisms → Bacteria708Open in IMG/M
3300025910|Ga0207684_11465081Not Available557Open in IMG/M
3300025916|Ga0207663_10886561Not Available713Open in IMG/M
3300025922|Ga0207646_10637231Not Available955Open in IMG/M
3300025939|Ga0207665_10414450Not Available1028Open in IMG/M
3300026285|Ga0209438_1078644Not Available1062Open in IMG/M
3300026340|Ga0257162_1029072Not Available678Open in IMG/M
3300026351|Ga0257170_1061814Not Available526Open in IMG/M
3300026358|Ga0257166_1020547All Organisms → cellular organisms → Bacteria → Proteobacteria870Open in IMG/M
3300026371|Ga0257179_1004998All Organisms → cellular organisms → Bacteria → Proteobacteria1221Open in IMG/M
3300026371|Ga0257179_1045929Not Available562Open in IMG/M
3300026374|Ga0257146_1030870Not Available870Open in IMG/M
3300026376|Ga0257167_1012641Not Available1146Open in IMG/M
3300026377|Ga0257171_1101259Not Available512Open in IMG/M
3300026480|Ga0257177_1014413Not Available1078Open in IMG/M
3300026480|Ga0257177_1078212Not Available534Open in IMG/M
3300026494|Ga0257159_1049036Not Available717Open in IMG/M
3300026514|Ga0257168_1010517All Organisms → cellular organisms → Bacteria1754Open in IMG/M
3300026514|Ga0257168_1144195Not Available530Open in IMG/M
3300026551|Ga0209648_10036273All Organisms → cellular organisms → Bacteria4282Open in IMG/M
3300027671|Ga0209588_1104682All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium910Open in IMG/M
3300027815|Ga0209726_10093038Not Available1853Open in IMG/M
3300027815|Ga0209726_10096151All Organisms → cellular organisms → Bacteria1808Open in IMG/M
3300027815|Ga0209726_10097357Not Available1792Open in IMG/M
3300027862|Ga0209701_10172228Not Available1308Open in IMG/M
3300027875|Ga0209283_10346806Not Available975Open in IMG/M
3300027894|Ga0209068_10005702All Organisms → cellular organisms → Bacteria → Proteobacteria5874Open in IMG/M
3300028536|Ga0137415_10016707All Organisms → cellular organisms → Bacteria7274Open in IMG/M
3300028536|Ga0137415_11451962Not Available510Open in IMG/M
(restricted) 3300031197|Ga0255310_10201504All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium557Open in IMG/M
3300031720|Ga0307469_11065582Not Available758Open in IMG/M
3300031740|Ga0307468_100442438Not Available1009Open in IMG/M
3300031740|Ga0307468_100667112Not Available863Open in IMG/M
3300031962|Ga0307479_10159178All Organisms → cellular organisms → Bacteria2217Open in IMG/M
3300032180|Ga0307471_102631599Not Available638Open in IMG/M
3300032180|Ga0307471_103984447Not Available522Open in IMG/M
3300033004|Ga0335084_10702691Not Available1031Open in IMG/M
3300033004|Ga0335084_12150771Not Available541Open in IMG/M
3300033433|Ga0326726_10170406All Organisms → cellular organisms → Bacteria1995Open in IMG/M
3300033433|Ga0326726_11123539Not Available764Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil24.59%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere20.49%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil11.48%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment6.56%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil5.74%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands4.92%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.92%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater2.46%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.46%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.46%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.64%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.64%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.64%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil1.64%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.64%
Bio-OozeEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze1.64%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.82%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment0.82%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil0.82%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands0.82%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.82%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300003994Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004024Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006163Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaGEnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006852Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010391Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012040Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT746_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012226Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2EnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300014308Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coexEnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300025535Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025549Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025551Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026374Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-AEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026494Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033004Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI25617J43924_1024307813300002914Grasslands SoilMNEVDMLALGDALARASEVLRPGECFAFRDYLERVLPSEAQVAAVFGGDEAFVVSVDGRPVDGPESTVPVTPDSRVVLTRRQGPALDVLTRGVVRRICLN*
Ga0055435_1003616123300003994Natural And Restored WetlandsMNEVELLALGDALARTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0055436_1024848613300004024Natural And Restored WetlandsMNEVELLALGDALARTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPGSTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070708_10019709423300005445Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLVLGDALARASDVLRPDECLPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070708_10067080223300005445Corn, Switchgrass And Miscanthus RhizosphereMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN*
Ga0070708_10069477013300005445Corn, Switchgrass And Miscanthus RhizosphereLARTSDVLRPGEHVSFRAYLARVLQSEEQIAALFGETAFHVSVDGRPVDGPASTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070708_10175871713300005445Corn, Switchgrass And Miscanthus RhizosphereLALGDTLARTSDVLRPGEHVSFRDYLARALRSEAQIAALFGERAIQVSVDGRPVDGPESTVPVTTGSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070706_10019902123300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSAAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN*
Ga0070706_10038680733300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLVLGDALARACDVLRPDECLPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGSALDVLTRGVVRRICLN*
Ga0070706_10043732913300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVDMLALGDALARTSEVVRAGECIPFRDYLERVVRSEAQIAVVFGSDEFFTVSVDGRPVDGPGSTVPVTAASRVVLYRRQGPAL
Ga0070706_10097367123300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVELLALGDALARTSDVLRPGEHVSFRAYLARALRSEAQIAALFGETTFQVSVDGQPVDGPESTLPVTAASRVVLYRRRGPALDVLTRGV
Ga0070706_10111647823300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDALARTSDVLRPGEHVSFRDYLARVLQNEEQIAALFGETAFQVSVDGQPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070706_10113823323300005467Corn, Switchgrass And Miscanthus RhizosphereMNEVNMLALGDALARTSDVLRPGECLAFCNYLERVLPSEAQIAAVFGDNKAFMVSVDGRPVDGPGSTVCVTADSRVVLYRRQ
Ga0070707_10135509723300005468Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLVLGDALARACDVLRPDECLPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGSALDVLTRGVVRRIC
Ga0070698_10179157013300005471Corn, Switchgrass And Miscanthus RhizosphereMNEVELLVLGDALARASDALRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN*
Ga0070699_10013608323300005518Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLVLGDALARACDVLRPDECLPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070699_10123082313300005518Corn, Switchgrass And Miscanthus RhizosphereMNEVDMLALGDALARTSEVVRAGECIPFRDYLERVVRSEAQIAVVFGSDEFFTVSVDGRPVDGPGSTVPVTAASRVVLYRRQGPALDVFTRGVVRRICLN*
Ga0075024_10013021013300006047WatershedsMNDVELLALGDTLARTSKVLRPGEHVAFGDYLARVLRSEAQIAALFGETAVQVSVDGRPVEGPESAVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0070715_1018924313300006163Corn, Switchgrass And Miscanthus RhizosphereEHVSFRDYLARVLRTEKQITALFGETTLLVSVDGRPVEASHVVLYRRQGPALDVLTRSMVRRICLN*
Ga0070716_10166434913300006173Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDALARTSEVLRPGEHVSFRDYLARALRTEEQIAALFGERAIRVSVDGRPVDGPESTVPVTTGSRVVLYRRHGPALDVLTRGVVRRICLN*
Ga0070716_10172867513300006173Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDALARTSDVLGPGEHVSFRDYLARVLRTEEQITALFGETTLLVSVDGRPVEASHVVLYRRQGPALDVLTRGVVRRICLN*
Ga0075433_1071711523300006852Populus RhizosphereMNEVELLALGDALARTSEVLCPGECLAFRNYLERVLRSEAQLAAGFGGDETFVVSVDGRPGQRTRVRGVTADSRVVLYRRQGPALDVLTRVVVRRICLN*
Ga0075426_1004377843300006903Populus RhizosphereMNEVELLALGDALARTSEVLCPGECLAFRNYLERVLRSEAQLAAGFGGDETFVVSVDGRPGHSRVVLYRRQGPALDVLTRVVVRRICLN*
Ga0075424_10001472743300006904Populus RhizosphereMNEVELLALGDALARTSEVLCPGECLAFRNYLERVLRSEAQLAAGFGGDETFVVSVDGRPGHSRVVLYRRQGPALDVLTRVVVRRVCLN*
Ga0099791_1048957823300007255Vadose Zone SoilMNEVELLALGDALARTSDTLRPGECLPFRDYLQRVLRSEAQLAAVFHGNEACVVSVDGRPVQGPESTVPVTADSRVLY
Ga0099794_1029320613300007265Vadose Zone SoilMNEVELLALGDALARTSDTLRPGECLPFRDYLQRVLRSEAQLAAVFHGNEACVVSVDGRPVQGPESTVPVTADSRVLYRRQGPALDVLTRGVVRRICLN*
Ga0099794_1080990113300007265Vadose Zone SoilSDVLRPDERIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN*
Ga0099829_1004769743300009038Vadose Zone SoilMNEVELLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0099830_1022844713300009088Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAKIAAVFGSDESFLVSVDGRPVGGPEWTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN*
Ga0099828_1120730113300009089Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPES
Ga0099792_1065153813300009143Vadose Zone SoilMNEVELLALGDALARTSDTLRPGECLPFRDYLQRVLRSEAQLAAVFHGNEACVVSVDGRPVEGPESTEPVTADSRVVLYRRQGPTLDVVTRGVVPRICLN*
Ga0136847_1101900513300010391Freshwater SedimentMNEVELLALGDVLARTSDVLRPGEHVSFRDYLARVLRDDAQIAALFGETAFQVSVDGRPVDGPDSTLPVTAKSRVVLYRRQGPALDVLTRGVVRRIWLN*
Ga0134122_1066506013300010400Terrestrial SoilMNEVELLALGDALARTSAVLRPGEHVSFRAYLARALRSEAQIAALFGEKAFRVLVDGRPVDGPESTLPVTAESRVVLYRRQGPALDVFTRGVVRRICLN*
Ga0137391_1108886723300011270Vadose Zone SoilMNEVELLALGDVLARTSDALRPRECLPFRDYLQRVLRSEPQLAAVFYGNEACVVSVDGWPVEGPESTVPVTPDSRVVLTRRQGPA
Ga0137461_110671123300012040SoilMNEVELLALGDALARTSDVLHPGEHVSFRDYLARALPNEEQIAALFGEAAFQVSVDGRPIEGPESTVPVTADSRVVLYRRRGPSLDVLTRGVVRRIWLN*
Ga0137389_1070063723300012096Vadose Zone SoilMNETEMLAVGGALARTADVLRPDEQVPFRDYVRRVREAGEGQLAALFGDGAFVVSVDGRPVDGCDSALPVTAGSRVVLYRRQGPMLDVLNRGVGRRICLH*
Ga0137388_1092705013300012189Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVWSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN*
Ga0137363_1033050123300012202Vadose Zone SoilMNEVEMLALGDALARTSDVLGPGEHVSFRDYLARVLRTEEQIAALFGEKAIQVSVDGRPVDGPESTVPVTTGSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0137362_1001658143300012205Vadose Zone SoilMNEVEMLALGDALARTSDVLGPGEHVSFRDYLARVLRTEEQIAALFGERAIQVSVDRRPVDGPESTVPVTTGSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0137447_104969813300012226SoilDVLHPGEHVSFRDYLARALPNEEQIAALFGEAAFQVSVDGRPIEGPESTLSVTAETRVVLHRRQGPALDVLTRGVVRRICLN*
Ga0137368_1084957123300012358Vadose Zone SoilLRPGEQVPIRHYIDLALDTGAGQLAALLGNGTFVVSVDGRPVDGCDSAIPLTTGSRILLYRRHGPMLDVLNRGVVRRICLN*
Ga0137390_1017287523300012363Vadose Zone SoilMNETEMLAVGEALARTADVLRPDEQVPFRDYVRRVLEAGEAQLAALFGDGAFVVSVDGRPVDGCDSAIPVTAGSRVVLYRRQGPMLDVLNRGVVRRICLN*
Ga0137390_1045535233300012363Vadose Zone SoilMNEVELLALGDVLARTSDALRPRECLPFRDYLQRVLRSEPQLAAVFYGNEACVVSVDGWPVEGPESTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0137390_1194573913300012363Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPEWTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN*
Ga0137358_1004632753300012582Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERALSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN*
Ga0137395_1055569813300012917Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN*
Ga0137396_1038438423300012918Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRQGPALDVFTRGVVRRICLN*
Ga0137394_1051339523300012922Vadose Zone SoilMNETEMLAFGEALARTVDVLRLDEQVPFRDYVRLALDTGAGQLAALLGDGTFVVSVDGRPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN*
Ga0137419_1084561813300012925Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFQPYLERVLSNEAQIAAIFKSDESFLVSVDGRPVGGPEWTLPVTVDSRVVLTRRLGSALDVLTRGVVRRICLN*
Ga0137416_1025440813300012927Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRPYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVV
Ga0137407_1010379823300012930Vadose Zone SoilMNETEMLAFGEALARTVDVLRPDEQVPFRDYVRLALDTGAGQLAALLGDGTFVVSVDARPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN*
Ga0164299_1057397613300012958SoilMHSRYLATATVTVRSRCQGALARTSDVLRPGEHVSFRDYLARVLRTEEQIAALFGERAIQVSVDGRPVDGPESTVPVTTGSRVVLYRRQGPALDVLTRGV
Ga0075354_108703523300014308Natural And Restored WetlandsMNDVELLALGDALARTSEVLHPGEHVSFRDYLARVLRSEAQIAALFGEPACQVSVDGRPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN*
Ga0137403_1000807743300015264Vadose Zone SoilMNETEMLAFGEALARTVDVLRLDERVPFRDYVRLALDTGAGQLAALLGDGTFVVSVDARPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN*
Ga0184610_105357923300017997Groundwater SedimentMNETEMLALGEALARTADVLRPDEQVPFRDYVRRVLEAGEAQLAALFGDGAFVVSVDGRPVDGCDSAIPVTAGSRVVLYRRQGPMLDVLNRGVVRRICLN
Ga0184623_1052144613300018056Groundwater SedimentMNETEMLALGEALARTADVLRPGEQVPFRDYVRRVLDTGEGQIEALFGAGTFVVSVDGRPVDGGDSAVPVTAGSRVVLYRRQGPMLDVLNRGVVRRICLN
Ga0184640_1018254133300018074Groundwater SedimentTEMLALGEALARTTEVLHPGERIPFCDYLRRVLGDSEDQIRSLFGSPASTVSVDGQPVDGPDSALSVTARSRVVLYRWQGPTLDVLTRGVVRRICLN
Ga0184609_1006628723300018076Groundwater SedimentMNETEMLAVGEALARTADVLRADEQVPFRDYVRRVLEAGEAQLAALFGDGAFVVSVDGRPVDGCDSAIPVTAGSRVVLYRRQGPMLDVLNRGVVRRICLN
Ga0184609_1028913323300018076Groundwater SedimentMNAVELLALGDALARTSDVLRPGEHVSFRDYLGRVLRSEAQIAALFGETAFQVSVDGRPVDGPESTLSVTAESRVVLYRRQGPTLDVLTRGVVRRIWLN
Ga0184612_1010704323300018078Groundwater SedimentMNETEMLAVGEALARTADVLRPDEQVPFRDYVRRVLEAGEGQLAALFGDGAFVVSVDGRPVDGCDSAIPVTAGSRVVLYRRQGPMLDVLNRGVVRRICLN
Ga0184629_1000612133300018084Groundwater SedimentMNEVELLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPAPTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0184629_1052635823300018084Groundwater SedimentMNEVEMLALGDALARTSDVLRPGEHVSFRDYLARVLPNEEQIAALFGEAAFPVSVDGRPVEGPESTLPVTAASRVVLYRRQGPTLDVLTRGAVRRIWLN
Ga0187892_1015892813300019458Bio-OozeMNEGEMLALGEALARTSDVLHPGECISFCDYLERVSQRWMRPATVFGENKGFTVSVDGRLVEGPASTLSVTAKSRVVLYRRQGAALDVLTRGVVRRIWLN
Ga0187892_1025178313300019458Bio-OozeMNEVEMLALGEALARTSDVLHPGECIPFCDYLERVSQRGVRPATVFGGNKAFMVSVDGQPVEGPDATLSVTAKSRVVLYRREGAALDVLTRGVVRRIWLN
Ga0187893_1015137033300019487Microbial Mat On RocksMKETEMLALGEMLARTADVVRPGEQVPFRDYLRRVLDGGEGQIAALFGAGTFIVSVDGRPVDGGDSAVPVTAASQVVLYRQQGPLLDVLTRGVVRRICMN
Ga0187893_1027147823300019487Microbial Mat On RocksMNEVEMLALGEALARTSDVLHPGECIPFCDYLERVSQRGVRPATVFGGNKAFMVSVDGQPVEGPDATLSVTAKSRVVLYRREGAALDVLTRGVVRRIWLNCLSAPA
Ga0193707_109698123300019881SoilMNETEMLAFGEALARTVDILRPDEQVPFRDYVHLALDTGAGQLAALLGDGTFVVSVDGRPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN
Ga0193725_105385223300019883SoilLAFGEALARTVDVLRPDEQVPFRDYVRLALDTGAGQVAALLGDGTFVVSVDGRPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN
Ga0193725_114743513300019883SoilMNEVELLALGDALARTSDILHPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVV
Ga0193727_102957533300019886SoilMNETEMLAFGEALARTVDVLRPDEQVPFRDYVRLALDTGAGQVAALLGDGTFVVSVDGRPVDGCDSAIPVTTGSRIVLYRRHGPMLDVLNRGVVRRICLN
Ga0193755_101402633300020004SoilMNEVEMLALAHALARTLDVLPPGEHVSFRDYLARALRSEEQIAALFGATAFHVSVDGRPVDGPESTVPVTAESRVVLCVRQGPVLDVLTRGVVRRICLN
Ga0210379_1016543623300021081Groundwater SedimentMNEVEMLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPAPTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0179596_1072273513300021086Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPEWTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN
Ga0193719_1013443023300021344SoilMNEVEMLALAHALARTLDVLPPGEHVCFRDYLARALRSEEQIAALFGATAFHVSVDGRPVDGPESTVPVTAESRVVLCVRQGPVLDVLTRGVVRRICLN
Ga0210384_1029926013300021432SoilMNEVEMLALGDTLARTSDVLRPGEHVSFRDYLARALRSEAQITALFGETTLLVSVDGRAVDGPESTVPVTTGSRVVLYRRRGPALDVLTRGVVRRICLN
Ga0207423_100249613300025535Natural And Restored WetlandsMNEVELLALGDALARTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPGSTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0210094_102997323300025549Natural And Restored WetlandsRTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0210131_101327523300025551Natural And Restored WetlandsMNEVELLALGDALARTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0210138_100764713300025580Natural And Restored WetlandsLALGDALARTSEVLHPGEHVSFRDYLARVLRSEEQIAALFGEPACQVSVDGRPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0207684_1030359923300025910Corn, Switchgrass And Miscanthus RhizosphereMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN
Ga0207684_1033480823300025910Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDALARTSDVLRPGEHVSFRDYLARVLQNEEQIAALFGETAFQVSVDGQPVDGPASTVPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0207684_1046947533300025910Corn, Switchgrass And Miscanthus RhizosphereMNEVDMLALGDALARTSEVVRAGECIPFRDYLERVVRSEAQIAVVFGSDEFFTVSVDGRPVDGPGSTVPVTAASRVVLYRRQGPALDVFTRGVVRRICLN
Ga0207684_1097894823300025910Corn, Switchgrass And Miscanthus RhizosphereMNEVELLALGDALARTSDVLRPGEHVSFRAYLARALRSEAQIAALFGETTFQVSVDGQPVDGPESTLPVTAASRVVLYRRRGPALDVLTRGVARRICLN
Ga0207684_1146508113300025910Corn, Switchgrass And Miscanthus RhizosphereMNEVELLALGDALARTSDVLRPGEHVSFRAYLARVLQSEEQIAALFGETAFHVSVDGRPVDGPASTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN
Ga0207663_1088656113300025916Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDTLARTSEVLRPDEHVSFRDYLARVLRTEEQITALFGETTLLVSVDGRPVEASHVVLYRRQGPALDVLTRGVVRRICLN
Ga0207646_1063723113300025922Corn, Switchgrass And Miscanthus RhizosphereMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSAAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN
Ga0207665_1041445023300025939Corn, Switchgrass And Miscanthus RhizosphereMNEVEMLALGDTLARTSEVLRPDEHVSFRDYLARVLRTEEQITALFGERAIRVSVDGRPVDGPESTVPVTTGSRVVLYRRHGPALDVLTRGVVRRICLN
Ga0209438_107864413300026285Grasslands SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERALSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN
Ga0257162_102907213300026340SoilRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPEWTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN
Ga0257170_106181413300026351SoilASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN
Ga0257166_102054723300026358SoilMNEVELLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0257179_100499823300026371SoilGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0257179_104592913300026371SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN
Ga0257146_103087013300026374SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVMLTRRLGSALDVFTRGVVRRICLN
Ga0257167_101264113300026376SoilMNEVELLILGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTRGVVRRICLN
Ga0257171_110125923300026377SoilLGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0257177_101441313300026480SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPEWTLPVTVDSRVVLTRRLGSALDVLTRGVVRRICLN
Ga0257177_107821223300026480SoilMNEVELLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASR
Ga0257159_104903613300026494SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRLGSALDVFTWGVVRRICLN
Ga0257168_101051713300026514SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVV
Ga0257168_114419513300026514SoilMNEVELLALGDALARTSDTLRPGECLPFRDYLQRVLRSEAQLAAVFHGNEACVVSVDGRPVQGPESTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN
Ga0209648_1003627363300026551Grasslands SoilMNEVDMLALGDALARASEVLRPGECFAFRDYLERVLPSEAQVAAVFGGDEAFVVSVDGRPVDGPESTVPVTPDSRVVLTRRQGPALDVLTRGVVRRICLN
Ga0209588_110468223300027671Vadose Zone SoilMNEVELLALGDALARTSDTLRPGECLPFRDYLQRVLRSEAQLAAVFHGNEACVVSVDGRPVQGPESTVPVTADSRVLYRRQGPALDVLTRGVVRRICLN
Ga0209726_1009303823300027815GroundwaterMNETEMLALGEALARTADVLRPGEQVPFRAYVRRVLDAGDSAVPVTAGSRVVLYRRQGPMLDVLNRGVVRQICLN
Ga0209726_1009615133300027815GroundwaterMNEVELLALGDALARTSDVLRPGEHVSFRDYLGRVLRSEAQIAALFGETAFQVSVDGRAVDGPESTVSVTAESRVVLYRRQGPTLDVLTRGVVRRICLN
Ga0209726_1009735713300027815GroundwaterMNETEMLALGEALARTADVLRPGEQVPFRAYGRRVLDGGEGQIEALFGAGTFAVSVDGRPVDGGDSAVPVIAGSRVVLYRRQGPMLDVLNRGGVRRICLN
Ga0209701_1017222813300027862Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAKIAAVFGSDESFLVSVDGRPVGGPEWTLPVTADSRVVLTRRLGSALDVLTRGVVRRICLN
Ga0209283_1034680613300027875Vadose Zone SoilMNEVELLALGDALARTSDVLRPGEHVAFRDYLARVLQNEEQIAALFGETAFQVSVDGRPVDGPASTVPVTSASRVVLYRRQGPALDVLTRGVVRRIC
Ga0209068_10005702113300027894WatershedsMNDVELLALGDTLARTSKVLRPGEHVAFGDYLARVLRSEAQIAALFGETAVQVSVDGRPVEGPESAVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN
Ga0137415_1001670773300028536Vadose Zone SoilMNEVELLVLGDALARASDVLRPDECIPFRPYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRIVLCRRQGPALDVLTRGVVRQICMN
Ga0137415_1145196223300028536Vadose Zone SoilARTADVLRAGEQVPFREYIRRVVHAGEGQIVALFADGGFLVAVDGRPVDGGDSSVPVTPESRVVLYRRQGPMLDVLNRGVVRRICLN
(restricted) Ga0255310_1020150413300031197Sandy SoilMHEIEMLALGDALARTSEVLRPGECLAFHDYLERVVRSGAQLAAVFGGEEAFVVSVDGRAIEGPGSTVPVTADSRVVLYRRQ
Ga0307469_1106558213300031720Hardwood Forest SoilALARTSAVLRPGEHVSFRDYLARALRSEAQIAALFGEEAFRVLVDGRPVDGPESTLPVTAESRVVLYRRQGPALDVFTRGVVRRICLN
Ga0307468_10044243813300031740Hardwood Forest SoilMNEVELLALGDALARTSAVLRPGEHVSFRDYLARALRSEAQLAALFGEEAFRVLVDGRPVEGPESTLPVTAESRVVLYRRQGPALDVFTRGVVRRICLN
Ga0307468_10066711213300031740Hardwood Forest SoilMNEVELLVLGDALARASDVLRPDECIPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVGGPESTLPVTADSRVVLTRRQGSALDVFTRGVVRRICLN
Ga0307479_1015917813300031962Hardwood Forest SoilRNVTFPVADAGHPQVHRGSPMNEVDMLALGDALARTSVVVRPEECIPFRDYLERVLRSEAQGAAVFGGDEAFVVSVDGRPVEGPESTLPVTADSRVVLCRRQGPALEVLTRGVLRRICLN
Ga0307471_10263159913300032180Hardwood Forest SoilMNEVEMLALGDTLARTSEVLRPGEHVSFRDYLARALRSEAQITALFGETTLLVSVDGRAVDGPESTVPVTAASHVVLYRRQVPALDVLTLGVVRRICLN
Ga0307471_10398444723300032180Hardwood Forest SoilMNEVEMLVLGDALARASDVLRPDECLPFRAYLERVLSSEAQIAAVFGSDESFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGPALDVLTRGVVRRICLN
Ga0335084_1070269113300033004SoilMNEVELLALGDALARTSDALRPGEHVAFRDYLARVLRSEEQIAAVFGETAFRVSVDGRLVDGPASTLPVTAASRVVLYRRQGPALDVLTRGVVRRICLN
Ga0335084_1215077113300033004SoilRTSEVLRPGECLAFRDYVGRVLRSEAQLAAVFGGDEAFVISVDGRAVEGPESMTSVTAESRVVLYRRQGPALDVLTRGVVRRICLN
Ga0326726_1017040623300033433Peat SoilMDEIEMLALGDALARTSDVLRPDEHLSFRDYLARILRTEEQIAALFGETAIQVSVDGRPVDGPESTVPVTTGSRVVLYRRQGPALDVLTRGVVRRICLN
Ga0326726_1112353913300033433Peat SoilMNEVEMLALGDALARTSEMVRPEERIPFRDYLERLQRREAQIAAVFGSSEAFLVSVDGRPVEGPESTVPVTADSRVVLYRRQGPALDVLTRAVVRRICLN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.