NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F066482



Overview

Basic Information
Family ID: F066482
Family Type: Metagenome
Number of Sequences: 126
Average Sequence Length: 90 residues
Representative Sequence: MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRQPP
Number of Associated Samples: 95
Number of Associated Scaffolds: 126

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 73.02 %
% of genes near scaffold ends (potentially truncated): 43.65 %
% of genes from short scaffolds (< 2000 bps): 73.81 %
Associated GOLD sequencing projects: 88
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.69
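
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch that assumes each percentage is computed over all 126 family members; the page does not state the denominator, and `pct_to_count` is a hypothetical helper name.

```python
# Assumption: every percentage is a fraction of the 126 family sequences.
FAMILY_SIZE = 126  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage into the nearest whole number of genes."""
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motifs": 73.02,
    "near scaffold ends": 43.65,
    "short scaffolds (< 2000 bps)": 73.81,
}

counts = {label: pct_to_count(pct) for label, pct in quality.items()}
for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly 92 genes carry a valid RBS motif, 55 lie near a scaffold end, and 93 come from short scaffolds.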

Hidden Markov Model
[Figure: HMM logo for the family, rendered with Skylign]

Most Common Taxonomy
Group: Bacteria (53.968 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (44.444 % of family members)
Environment Ontology (ENVO): Unclassified (50.794 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (55.556 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 25.00%, β-sheet: 15.18%, Coil/Unstructured: 59.82%

Predicted 3D Structure

[Structure viewer: AlphaFold2 model colored by per-residue confidence (pLDDT), binned as 0-50, 51-70, 71-90, 91-100; rendered with PDBe Molstar]
pTM-score: 0.69

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
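
The cutoff quoted above can be expressed as a small check. This is a sketch only: the 0.7 threshold and the strict `<` comparison come from the note on this page, while the function name and the label for models at or above the cutoff are hypothetical.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the "Low Quality Model" note


def model_quality(ptm: float, threshold: float = PTM_THRESHOLD) -> str:
    """Label a model by its pTM score, using the strict '<' from the note."""
    # Assumption: a score exactly equal to the threshold is not "low confidence".
    return "low confidence" if ptm < threshold else "at or above threshold"


label = model_quality(0.69)  # pTM-score reported for family F066482
print(label)
```

With the reported pTM-score of 0.69, the family falls just under the cutoff, which matches the note that it was not screened against SCOPe or PDB.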



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 126 Family Scaffolds
PF00072 | Response_reg | 10.32
PF04392 | ABC_sub_bind | 3.17
PF00211 | Guanylate_cyc | 3.17
PF00248 | Aldo_ket_red | 1.59
PF01068 | DNA_ligase_A_M | 1.59
PF16078 | 2-oxogl_dehyd_N | 0.79
PF00300 | His_Phos_1 | 0.79
PF08334 | T2SSG | 0.79
PF00226 | DnaJ | 0.79
PF00083 | Sugar_tr | 0.79
PF13384 | HTH_23 | 0.79
PF01590 | GAF | 0.79
PF06568 | DUF1127 | 0.79
PF03446 | NAD_binding_2 | 0.79
PF00589 | Phage_integrase | 0.79
PF13177 | DNA_pol3_delta2 | 0.79
PF13545 | HTH_Crp_2 | 0.79
PF13439 | Glyco_transf_4 | 0.79
PF00571 | CBS | 0.79
PF13408 | Zn_ribbon_recom | 0.79

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 126 Family Scaffolds
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 3.17
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 3.17
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 1.59
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 1.59
COG5457 | Uncharacterized conserved protein YjiS, DUF1127 family | Function unknown [S] | 0.79



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 56.35 %
Unclassified | root | N/A | 43.65 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300000891|JGI10214J12806_10505085 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1745
3300001661|JGI12053J15887_10101258 | Not Available | 1561
3300002245|JGIcombinedJ26739_101220583 | Not Available | 641
3300005458|Ga0070681_11999238 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 508
3300005549|Ga0070704_100058851 | All Organisms → cellular organisms → Bacteria | 2739
3300005843|Ga0068860_100961843 | Not Available | 871
3300005843|Ga0068860_101353344 | Not Available | 733
3300006047|Ga0075024_100011796 | All Organisms → cellular organisms → Bacteria | 3504
3300007255|Ga0099791_10010609 | All Organisms → cellular organisms → Bacteria | 3850
3300007255|Ga0099791_10018700 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2964
3300007265|Ga0099794_10518358 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 628
3300007788|Ga0099795_10551248 | Not Available | 543
3300009038|Ga0099829_10035198 | Not Available | 3607
3300009088|Ga0099830_10035223 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3422
3300009143|Ga0099792_10315062 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 933
3300009147|Ga0114129_10048758 | All Organisms → cellular organisms → Bacteria | 5952
3300009157|Ga0105092_10935228 | Not Available | 513
3300009168|Ga0105104_10568690 | Not Available | 643
3300010159|Ga0099796_10098336 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1097
3300010159|Ga0099796_10336193 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 648
3300010400|Ga0134122_10094646 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2356
3300010400|Ga0134122_10224530 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1571
3300011269|Ga0137392_10273666 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1392
3300011270|Ga0137391_10333859 | All Organisms → Viruses → Predicted Viral | 1304
3300011270|Ga0137391_10334556 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1302
3300011270|Ga0137391_10407097 | Not Available | 1162
3300011271|Ga0137393_10119148 | All Organisms → cellular organisms → Bacteria | 2175
3300011271|Ga0137393_10601601 | Not Available | 943
3300011271|Ga0137393_11550906 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 552
3300012096|Ga0137389_11553137 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 558
3300012189|Ga0137388_10204310 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1782
3300012199|Ga0137383_10131518 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1827
3300012199|Ga0137383_10400193 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1006
3300012202|Ga0137363_10126051 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1982
3300012203|Ga0137399_10403553 | Not Available | 1140
3300012203|Ga0137399_10514301 | Not Available | 1005
3300012204|Ga0137374_11115904 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 560
3300012205|Ga0137362_10713719 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 861
3300012353|Ga0137367_10854670 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 629
3300012357|Ga0137384_11111461 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 633
3300012360|Ga0137375_10062689 | All Organisms → cellular organisms → Bacteria | 3977
3300012360|Ga0137375_10155831 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2223
3300012361|Ga0137360_10362345 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1216
3300012362|Ga0137361_10693154 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 931
3300012363|Ga0137390_11380311 | Not Available | 649
3300012582|Ga0137358_10870860 | Not Available | 593
3300012918|Ga0137396_10381464 | All Organisms → Viruses → Predicted Viral | 1044
3300012918|Ga0137396_11169773 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 544
3300012923|Ga0137359_11026766 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 707
3300012925|Ga0137419_10185749 | Not Available | 1529
3300012925|Ga0137419_10332088 | Not Available | 1171
3300012925|Ga0137419_10503596 | Not Available | 962
3300012929|Ga0137404_10040579 | All Organisms → cellular organisms → Bacteria | 3510
3300012929|Ga0137404_12108241 | Not Available | 527
3300012930|Ga0137407_10057415 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3188
3300012930|Ga0137407_10059817 | All Organisms → cellular organisms → Bacteria | 3129
3300012930|Ga0137407_12267948 | Not Available | 519
3300014881|Ga0180094_1147438 | Not Available | 551
3300015241|Ga0137418_10926089 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 638
3300017997|Ga0184610_1037824 | Not Available | 1382
3300018000|Ga0184604_10249209 | Not Available | 621
3300018027|Ga0184605_10205639 | Not Available | 894
3300018028|Ga0184608_10435994 | Not Available | 565
3300018028|Ga0184608_10506356 | Not Available | 516
3300018051|Ga0184620_10260330 | Not Available | 582
3300018052|Ga0184638_1004764 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4373
3300018052|Ga0184638_1137891 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 885
3300018054|Ga0184621_10070717 | Not Available | 1200
3300018071|Ga0184618_10427541 | Not Available | 559
3300018076|Ga0184609_10028623 | Not Available | 2273
3300018422|Ga0190265_10052680 | All Organisms → cellular organisms → Bacteria | 3559
3300018422|Ga0190265_10072505 | Not Available | 3116
3300018422|Ga0190265_10350081 | Not Available | 1563
3300018422|Ga0190265_10694748 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1139
3300018422|Ga0190265_13268382 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 541
3300018429|Ga0190272_10518483 | Not Available | 1018
3300019487|Ga0187893_10065221 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 3507
3300019866|Ga0193756_1031903 | Not Available | 744
3300019878|Ga0193715_1011924 | All Organisms → cellular organisms → Bacteria | 1865
3300019879|Ga0193723_1193464 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 513
3300019889|Ga0193743_1177032 | Not Available | 703
3300020002|Ga0193730_1011603 | All Organisms → cellular organisms → Bacteria | 2500
3300020004|Ga0193755_1009639 | Not Available | 3152
3300020004|Ga0193755_1224684 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 520
3300020061|Ga0193716_1028547 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2744
3300020061|Ga0193716_1134252 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1016
3300020170|Ga0179594_10082890 | Not Available | 1133
3300021344|Ga0193719_10008082 | All Organisms → cellular organisms → Bacteria | 4375
3300022534|Ga0224452_1094325 | Not Available | 912
3300022694|Ga0222623_10196470 | Not Available | 783
3300025910|Ga0207684_10026198 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 4970
3300025922|Ga0207646_10180768 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1905
3300026304|Ga0209240_1215621 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 581
3300026354|Ga0257180_1062282 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 538
3300026360|Ga0257173_1022860 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 794
3300026376|Ga0257167_1029600 | Not Available | 811
3300026469|Ga0257169_1050919 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 651
3300026494|Ga0257159_1002858 | Not Available | 2415
3300026497|Ga0257164_1020358 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 938
3300026514|Ga0257168_1152924 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 514
3300026515|Ga0257158_1024962 | All Organisms → Viruses → Predicted Viral | 1025
3300026535|Ga0256867_10140027 | Not Available | 912
3300026555|Ga0179593_1055592 | Not Available | 3124
3300026557|Ga0179587_10419385 | Not Available | 874
3300027616|Ga0209106_1117443 | Not Available | 597
3300027671|Ga0209588_1067399 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1156
3300027815|Ga0209726_10023263 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 5152
3300027846|Ga0209180_10005237 | All Organisms → cellular organisms → Bacteria | 6550
3300027862|Ga0209701_10276663 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 972
3300027875|Ga0209283_10029072 | All Organisms → cellular organisms → Bacteria | 3437
3300027903|Ga0209488_10081801 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2404
3300027903|Ga0209488_10110324 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2066
3300028047|Ga0209526_10286779 | All Organisms → cellular organisms → Bacteria | 1117
3300028381|Ga0268264_10663784 | Not Available | 1033
3300028536|Ga0137415_10054686 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 3860
3300028536|Ga0137415_10168292 | Not Available | 2022
3300028787|Ga0307323_10174678 | Not Available | 776
3300028807|Ga0307305_10496543 | Not Available | 547
3300028814|Ga0307302_10438486 | Not Available | 647
3300028828|Ga0307312_10083723 | All Organisms → cellular organisms → Bacteria | 1955
3300028828|Ga0307312_10456322 | Not Available | 842
3300028828|Ga0307312_10721123 | Not Available | 660
3300028828|Ga0307312_10790853 | Not Available | 629
3300028884|Ga0307308_10598750 | Not Available | 529
3300031740|Ga0307468_100305815 | Not Available | 1157
3300031740|Ga0307468_100821274 | Not Available | 795
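
Each scaffold identifier above has the form `<number>|<scaffold name>`, and the leading number appears to match a Taxon OID in the Associated Samples table (for example, 3300000891 occurs in both). The sketch below splits identifiers on that assumption and counts scaffolds per sample; `split_scaffold_id` is a hypothetical helper, not part of any NMPFamsDB or IMG/M tooling.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# Three identifiers copied from the Associated Scaffolds table.
examples = [
    "3300000891|JGI10214J12806_10505085",
    "3300005843|Ga0068860_100961843",
    "3300005843|Ga0068860_101353344",
]

# Count how many family scaffolds come from each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in examples)
for taxon_oid, n in per_sample.items():
    print(taxon_oid, n)
```

Grouping this way helps explain why the family lists 126 scaffolds but only 95 samples: several samples contribute more than one scaffold.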




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 44.44%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 19.84%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 8.73%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 6.35%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.17%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.38%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.38%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.59%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.59%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.59%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.59%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.79%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.79%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.79%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.79%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.79%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.79%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.79%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009168 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 | Environmental
3300010159 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300014881 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1Da | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental
3300019866 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1m1 | Environmental
3300019878 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2m2 | Environmental
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental
3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020061 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300022534 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1 | Environmental
3300022694 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026354 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-B | Environmental
3300026360 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-B | Environmental
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental
3300026469 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-B | Environmental
3300026494 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-A | Environmental
3300026497 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-B | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027616 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M3 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028381Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028787Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M

Geographical Distribution




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10214J12806_105050856 | 3300000891 | Soil | MIPGLPDANLRQDVRARLSKRELFAATGISSIRRSTGRRCIVCGLPIVPPTLEREVEGPGIVALAHPDCYAIWREESLLLKRRTI*
JGI12053J15887_101012581 | 3300001661 | Forest Soil | VISDLFGDDSLRREVRGRLVKRELFAAMGISSIRRGTGRPCVVCARSIVSPTLEREVEAPGVVAFAHPDCYKIWREESAILKRRPAV*
JGIcombinedJ26739_1012205831 | 3300002245 | Forest Soil | QDSTSSRSHPLLRGVGVRARQGPAMIPVLPDAALRRDIRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIESPTLEREVEGPGVFGLAHPDCYTLWREESAALRQPPPRRWGTTGLPRAW*
Ga0070681_119992381 | 3300005458 | Corn Rhizosphere | NLRQDARARLSKRELFAATGISSIRRSTGRRCIVCGLPIVPPTLEREVEGPGIVALAHPDCYAIWREESLLLKRRTI*
Ga0070704_1000588515 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MIPDLPDEPLRRQVRAQLAARHLFPADGVSSVHRGTGRRCTVCGRPIDSPTLEREVEGPGVVGRAHPACYVIWREESVARQRKTG*
Ga0068860_1009618431 | 3300005843 | Switchgrass Rhizosphere | VIPDLPDEPLRRQVRAQLAARQLFLADGVSSVRRGTGRPCTVCGHPIDSPTLEREVEGPGIVARAHPACYVIWREESM
Ga0068860_1013533442 | 3300005843 | Switchgrass Rhizosphere | MIPDLPDEPLRRQVRAQLAARHLFLADGVSSVHRGTGRRCTVCGRPIDSPTLEREVEGPGVVGRAHPACYVIWREESVARQRKTG*
Ga0075024_1000117962 | 3300006047 | Watersheds | MIPVLPDAALRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIQSPTLEREVEGPGVFGLAHPDCYALWREESAALRQPPPRRWGTIESPRAW*
Ga0099791_100106097 | 3300007255 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRLCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSSRAW*
Ga0099791_100187002 | 3300007255 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYALWREESAALRQPPPRRWSTSGSAPPHAW*
Ga0099794_105183581 | 3300007265 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRLCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRR
Ga0099795_105512481 | 3300007788 | Vadose Zone Soil | MIPVLPDAALRRDIRARLSKRELFAASGMSSIRRGTGRRCLVCGLRIESPTLEREVEGPGVLGLAHPDCYTLWREESAALRQPPPKRWGTTGSPPAW*
Ga0099829_100351984 | 3300009038 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCTVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAVLREPPPRRWSTTAPPRDW*
Ga0099830_100352236 | 3300009088 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPRIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGSAPPRAW*
Ga0099792_103150622 | 3300009143 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGVFGLAHPDCYALWREESAALRQPPPRRWSTSGSAPPRAW*
Ga0114129_100487585 | 3300009147 | Populus Rhizosphere | MVPDLPDEPLRRQVRTQLAARHLFPADGVSSVCRGTGRRCTVCGRPIDAPTLEREVEGPGVVGRAHPACYVIWREESMARQRKTG*
Ga0105092_109352281 | 3300009157 | Freshwater Sediment | MIPDLPDEPLRRQVRAQLASRQLFSADGVSSVRRGTGRLCTVCRRPIDSPALEREVEGPGVVGRAHPDCYVIWREESMARQRKAG*
Ga0105104_105686902 | 3300009168 | Freshwater Sediment | MIALALREELRTLVRAQLSTHALFLATGISSIRRGTGRPCHVCNQPIVSPTLEREVEGPGVFGLAHPDCYTLWREESAALRQPPPRRW
Ga0099796_100983361 | 3300010159 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAA
Ga0099796_103361931 | 3300010159 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSP
Ga0134122_100946463 | 3300010400 | Terrestrial Soil | MIRLPERDELRTLVRTQLSTRALFPATGISSIHRGTGRPCRVCDHPIDSPTLAREVEGSGVVVVAHPACYAIWREESAALRQPSPRRWGSMGLPRAW*
Ga0134122_102245303 | 3300010400 | Terrestrial Soil | MIPELFDDELRRRLRARLSKRDLFPAMGISSVRRGAGRPCIVCAHTIDSPTLEREVEGPGVVGVAHADCYKVWREESALFRHRPAV*
Ga0137392_102736662 | 3300011269 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGVRPAPRLVAPV*
Ga0137391_103338594 | 3300011270 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRQPPPRRWNT
Ga0137391_103345562 | 3300011270 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPRIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGVRPAPRLVAPV*
Ga0137391_104070971 | 3300011270 | Vadose Zone Soil | MPDAMILGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPINSPTLEREVEGPGIVALAHPDCYAIWREESAALRQPPPRRWSTTGAPRAW*
Ga0137393_101191486 | 3300011271 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPSVVGRAHPACYVIWREESAALRQPPPRRWNTEGSASPRAW*
Ga0137393_106016013 | 3300011271 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHPINSPTLEREVEGPGAFGLAHPDCYALWREESAALRQPPPRRWSTSGSAPPRAW*
Ga0137393_115509062 | 3300011271 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIW
Ga0137389_115531372 | 3300012096 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPR
Ga0137388_102043103 | 3300012189 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGSAPPRAW*
Ga0137383_101315181 | 3300012199 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSPRAW*
Ga0137383_104001933 | 3300012199 | Vadose Zone Soil | MIPGPPDEALRRQIRVRLSKHELFAAMGISSIRRGTGRPCIVCEHSIDSPTLEREVEGTGVFGLAHLDCYTFWREESAALRQ
Ga0137363_101260511 | 3300012202 | Vadose Zone Soil | MIPGPPDEALRRQVRVRLSKHELFAATGISSIRRGTGRPCIVCAHSIDSPTLEREVEGTGVFGLAHPDCYALWREESAARRQPPPRRWSTSGSAPPRA
Ga0137399_104035531 | 3300012203 | Vadose Zone Soil | MVSDLIGDDSLRWQVRARLAKRELFGATGISSVRRGTGRTCIVCERPIDSPTLEREVEDPGVVAIAHPDCYKIWREESAILKHRPAV*
Ga0137399_105143013 | 3300012203 | Vadose Zone Soil | MKQVPRPDPLFRGVGVRAEQATAMIPEPPDAGLRQQVRARLSKRDLFAATGVSSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGCAHPACYVIWREESAALRQPPPRRWSTRASPRAW
Ga0137374_111159041 | 3300012204 | Vadose Zone Soil | MLLESPDDALRRKVRARLAAHQLFPANGISSVRRGTGRPYIVCERPIDSPTLEREVEGPGVVGLAHPACYVI
Ga0137362_107137191 | 3300012205 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGMSSIHRGTGRPCVVCDLPIDSPTLEREVDGPGEVGRAHPACYVIWREESAALKQPPPRRWRTTDSPRAW
Ga0137367_108546702 | 3300012353 | Vadose Zone Soil | MIADLFDDGSVRRQVRARLVKRELFAAMGISSVRRGTRRPFVVCERPIESPTLEREVEGPGVVAVAHPECYKLWREESALLKHRPAI*
Ga0137384_111114612 | 3300012357 | Vadose Zone Soil | MIPGPPDEALRRQIRVRLSKHELFAAMGISSIRRGTGRPCIVCEHSIDSPTLEREVEGTGVFGLAHLDCYTFWREESAALRQPPPSPVSLRSVDK
Ga0137375_100626895 | 3300012360 | Vadose Zone Soil | VIPDLPDEPLRRQVRAQLAARHLFPADGVSSVRRGTGRRCTVCGRPIDSPTLEREVEGPPGVVGRAHPVCYVIWREESMARQRKTG*
Ga0137375_101558311 | 3300012360 | Vadose Zone Soil | MIADLFDDGSVRRQVRARLVKRELFAAMGISSVRRGTGRPFVVCERPIESPTLEREVEGPGVVAVAHPECYKLWREESALLKHQPAV*
Ga0137360_103623454 | 3300012361 | Vadose Zone Soil | VIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAALK
Ga0137361_106931541 | 3300012362 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYALWREESAAMRQPPPRRWSTSGSAPPRAW*
Ga0137390_113803112 | 3300012363 | Vadose Zone Soil | PMIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPRIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGVRPAPRLVAPV*
Ga0137358_108708601 | 3300012582 | Vadose Zone Soil | MIPVLPHAALRRDIRARLSKRELFAATGMSSIRRGTGRRCLVCGLRIESPTLEREVEGPGVLGLAHPDCYTLWREESAALR
Ga0137396_103814642 | 3300012918 | Vadose Zone Soil | MIPGPPDEALRRQVRVRLSKRELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYALWREESAALRQPPPRRWSTSGSAPPRAW*
Ga0137396_111697731 | 3300012918 | Vadose Zone Soil | MIPEPSDAGLRQQVRARLSKRDLFAATGVSSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGCAHPACYVIWREESAA
Ga0137359_110267661 | 3300012923 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALK
Ga0137419_101857491 | 3300012925 | Vadose Zone Soil | MVSDLFGDDSLRWQVRARLAKRERFGATGISSVRRGTGRTCIVCERPIDSPTLEREVEDPGVVAIAHPDCYKIWREESAILKHRPAV*
Ga0137419_103320881 | 3300012925 | Vadose Zone Soil | MKQVPRPDPLFRGVGVRAEQTTAMIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSPRAW
Ga0137419_105035961 | 3300012925 | Vadose Zone Soil | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPINSPTLEREVEGPGIVALAHPDCYAIWREESAALRQPPPRRWSTTGAPRAW*
Ga0137404_100405799 | 3300012929 | Vadose Zone Soil | MISDLFGGELRQDVRTRLSKRELFSATGISSIRRGTGRRCLVCGLPINFTTLEREVEGPGIVGLAHPDCYKLWREESALLRQPPPIRWSTTGAPRAW*
Ga0137404_121082411 | 3300012929 | Vadose Zone Soil | PDDALRRKVRARLAARELFPATGIASIRRGTGRPCIVCELPIDSPTLEREVDGPGIIALAHPACYGIWREESIARQRKAN*
Ga0137407_100574154 | 3300012930 | Vadose Zone Soil | VIPDLPDEPLRRQVRAQLAARHLFPAAGVSSVRRGTGRRCTVCGRPIDSPTLEREVEGPPGVVGRAHPACYVIWREESMARQRKAG*
Ga0137407_100598176 | 3300012930 | Vadose Zone Soil | MISDLFGGELRQDVRTRLSKRELFSATGISSIRRGTGRRCLVCGLPINFPTLEREVEGPGIVGLAHPDCYKLWREESALLRQPPPIRWSTTGAPRAW*
Ga0137407_122679481 | 3300012930 | Vadose Zone Soil | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPINSPTLEREVEGPGIVGLAHPDCYALWREESALLRQPPP
Ga0180094_11474381 | 3300014881 | Soil | MIPEPPNAGLRQHVRARLSKRDLFAATGISSIHRGTGRPCTVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRE
Ga0137418_109260891 | 3300015241 | Vadose Zone Soil | MMLPERREELRTLLRTQLSTRALFPATGISSIRRGTGRPCHVCNHPIDSPTLEREVEGPGVVVAAHPAC
Ga0184610_10378244 | 3300017997 | Groundwater Sediment | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRQPP
Ga0184604_102492091 | 3300018000 | Groundwater Sediment | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAALRQPPPRRWSTT
Ga0184605_102056392 | 3300018027 | Groundwater Sediment | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGMPIDSPTLEREVEGPGIIAHATMAHTIT
Ga0184608_104359941 | 3300018028 | Groundwater Sediment | MIPEPSNSEMRQHVRTRLSKRQLFAATGISSIRRGTGRPCIVCVRSIESPTLEREVEGPGVLGLAHPGCYK
Ga0184608_105063561 | 3300018028 | Groundwater Sediment | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDC
Ga0184620_102603302 | 3300018051 | Groundwater Sediment | MPDAKIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAA
Ga0184638_10047649 | 3300018052 | Groundwater Sediment | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPMLEREVDGPGVVGRAHPACYVIWREESAALRQPPPRRWNTE
Ga0184638_11378912 | 3300018052 | Groundwater Sediment | MIPEWPDAGLRQQVRTRLSKRQLFAAAGISSVRRGTGRPCIVCERSIESSTLEREVEGSGQLGVAHPDCYKIWREESALLKHRPAI
Ga0184621_100707173 | 3300018054 | Groundwater Sediment | VIPDLPGDALRRKVRARLAARQLFLADGISSIRRGTGRPCIVCELPIYPPTLEREVDGPGVVGLAHPACYVIWREESMARQRKAG
Ga0184618_104275411 | 3300018071 | Groundwater Sediment | MISDLFGDDSVRRQVRARLVKRELFAAMGISSVRRGTGRPCTVCERPIESPTLEREVEGPGVSSVAHPDCYKLWREESAILKHRPAV
Ga0184609_100286235 | 3300018076 | Groundwater Sediment | MTALIADLFGDDSLRRQVRARLSKRQLFAATGISSVRRGTSRPCIVCERPIDSPTLEREIEGPGVVAFAHPDCYKLWREESAILKHRPAV
Ga0190265_100526806 | 3300018422 | Soil | MMFLESPEDVLRRKVRARLAARHLFPADGISSIRRGTGRPCVICELPIDSPMLEREVEGPGVVALAHPACYTIWREESIARKRKPN
Ga0190265_100725053 | 3300018422 | Soil | MIPGLPHGELRQDVRARLLKRELFAATGISSIRRGTGRRCLVCGLPVQSPTLEREVEGPGIVGLAHPDCYAIWREESAALRQPPPRRWSTTGAPRAW
Ga0190265_103500812 | 3300018422 | Soil | MREHEEERQQVRARLSNRDLFAATGISSIHRGTGRPCIVCNRPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRQPSPRRWSTTGSASPHDW
Ga0190265_106947483 | 3300018422 | Soil | MIPEPPDEELRRQVRAQLANRRLFPATGISSVRRGTGRPCNVCGRAIESPTLEREVEGPGVIGLAHPSCYAIWREESIARHVRQAG
Ga0190265_132683821 | 3300018422 | Soil | MIALALREELRTLVRTQLSTRALFPATGISSIRRGTGRPCHVCNQPIDSPTLEREVEGPGVVALAHPDCYTLWREESAAQRQPPPRRWTTTGSPRAW
Ga0190272_105184831 | 3300018429 | Soil | MIMISDLFGDDSLRRQVRARLARRELFAAKGISSVRRGTGRPCIVCDRPIESPTLEREVEGPGVVALAHPACYVIWREESLALKRKAN
Ga0187893_100652215 | 3300019487 | Microbial Mat On Rocks | MIPGLLDGDLRQDVRARLSKRELFAATGIYSIRRGTGRRCLVCGLPIQSPTLEREVEGPGVFGLAHPDCYTLWREESAALRQPTSRRWSTTGAPRSW
Ga0193756_10319032 | 3300019866 | Soil | MPDLPNGELRQDVRSRLSKRELFAATGISSIRRGTGRRCLVCGLPIQSPTLEREVEGPGIFGLAHPDCYTLWREESAALRQPPPRRWSTTGSASPR
Ga0193715_10119241 | 3300019878 | Soil | EQAAPVLMISELPDNALRLQLRARLSKRDLFPATGISSIHRGTGRPCMVCDLPIDSPTLEREVAGPGVVGRAHPACYVIWREESAALRQPPPRRWSTTGSPRAR
Ga0193723_11934642 | 3300019879 | Soil | MIPVLPHAALRRDIRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPDIFGLAH
Ga0193743_11770321 | 3300019889 | Soil | RTTMIPEPPDDELRRQVRAQLANRRLLPATGISSVRRGTGRPCNVCGSGIDSPTLEREVEGPGVTGVAHPSCYAIWREESIARHARKAN
Ga0193730_10116034 | 3300020002 | Soil | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAALRQPPPRRWSTTGTPRAW
Ga0193755_10096393 | 3300020004 | Soil | MPDEARQREQARAVVQNGKLPAMIPSLDADLRQDVRARLSKRELFAATGISSIRRGTGRRCLVCGLPINFPTLEREVEGPGIVGLAHPDCYALWREESALLRQPPPRRWSTTGSARAW
Ga0193755_12246841 | 3300020004 | Soil | MIALALREEIRTLIRTQLSTRALFPATGMASIRRGTGRPCHVCDHPIDSPTLEREVEGPGVVVVAHPACYAIWREESAALRQPPQK
Ga0193716_10285476 | 3300020061 | Soil | MLLESPDDALRRQVRKQLAARELFPATGISSIRRGTGRPCQVCKLPIDSPTLEREVEGPGVVALAHPACYAIWREESIARTRKTN
Ga0193716_11342524 | 3300020061 | Soil | MIPDPPDSGLRQQVRARLSARQLCPADGISTVRRGTGRPCTVCDMPIDSPTMEREVEGPGVVALAHPACYVIWREESI
Ga0179594_100828902 | 3300020170 | Vadose Zone Soil | VISDLFGDDSLRREVRGRLVKRELFAAMGISSIRRGTGRPCVVCARSIVSPTLEREVEAPGVVAFAHPDCYKIWREESAILKRRPAV
Ga0193719_100080825 | 3300021344 | Soil | MIPELPDNALRLQVRARLSKRDLFPATGISSIHRGTGRPCMVCDLPIDSPTLEREVAGPGGVGRAHPACYVIWREESAALRQPPPRRWSTTGSPRAW
Ga0224452_10943252 | 3300022534 | Groundwater Sediment | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAALRQPPPRRWSTTGAPRAW
Ga0222623_101964701 | 3300022694 | Groundwater Sediment | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPINSPTLEREVEGPGIVALAHPDCYAIWREESAALRQPPPRRWSTAGSAPPRAW
Ga0207684_100261981 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MIPERPDAGLRQQVRARLSKRDLFAATGISSIRRGAGRPCTVCDLPIDSPALEREVDGPGVVGRAH
Ga0207646_101807682 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSPRAW
Ga0209240_12156211 | 3300026304 | Grasslands Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPRIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGVRPAPRLVAPV
Ga0257180_10622822 | 3300026354 | Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRLCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVI
Ga0257173_10228601 | 3300026360 | Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWR
Ga0257167_10296001 | 3300026376 | Soil | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPINSPTLEREVEGPGIVALAHPDCYAIWREESAALRQPPPRRWSTTGAPRAW
Ga0257169_10509192 | 3300026469 | Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTD
Ga0257159_10028585 | 3300026494 | Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRLCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSSRAW
Ga0257164_10203581 | 3300026497 | Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWRE
Ga0257168_11529242 | 3300026514 | Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGAFGLAHPDCYAIWREESAAMRQPPPRRWSTSGSAP
Ga0257158_10249621 | 3300026515 | Soil | MIPILPDAALRQDIRARLSKRELFAATGMSSIRRGTGRRCLVCGRPIESPTLEREVEGPGVSGLAHPDCYTLWREESAALRQPPPRRWGTTESPRAW
Ga0256867_101400271 | 3300026535 | Soil | MIPESSDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCALPIDSPTLEREVDGPGVVGRAHPACYVIWRDESAA
Ga0179593_10555923 | 3300026555 | Vadose Zone Soil | VIPDLPEEPLRRPVRAQLAANQLFSADGVSSVRRGTGRLCTVCRRPIDSPTLEREVEGPGVVGRAHPACYVIWREESAA
Ga0179587_104193851 | 3300026557 | Vadose Zone Soil | MIPVLPHAALRRDIRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIESPTLEREVEGPGVVVGLAHPDCYTI
Ga0209106_11174432 | 3300027616 | Forest Soil | MISDLFGDDSLRREVRGRLVKRELFAAMGISSIRRGTGRPCVVCARSIVSPTLEREVEAPGVVAFAHPDCYKIWREESAILKHRPAV
Ga0209588_10673991 | 3300027671 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTTDSPR
Ga0209726_100232634 | 3300027815 | Groundwater | MIPEPPDAGLRQHVRARLSKRDLFAATGISSIHRGTGRPCTVCDLPIHSPTLEREVDGPGVVGRAHPACYVIWREESAALREPPPRRWSTTAPPRDW
Ga0209180_100052377 | 3300027846 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPSVVGRAHPACYVIWREESAALRQPPPRRWNTEGSASPRAW
Ga0209701_102766631 | 3300027862 | Vadose Zone Soil | MIPEPSDAGLRQQVRARLSKRDLFAATGISSIHRGAGRPCIVCDLPIDSPTLEREVDGPGVVGRAHP
Ga0209283_100290727 | 3300027875 | Vadose Zone Soil | MIPEPPDAGLRQQVRARLSKRDLFAATGISSIHRGTGRPCIVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALRQPPPRRWNTEGSASPRAW
Ga0209488_100818011 | 3300027903 | Vadose Zone Soil | MIPAPPDAGLRQQVRARLSKRDLFAATGISSIRRGTGRPCVVCDLPIDSPTLEREVDGPGVVGRAHPACYVIWREESAALKQPPPRRWRTSDAPRAW
Ga0209488_101103241 | 3300027903 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGPGVFGLAHPDCYALWREESAALRQPPPRRWSTSGSAPPR
Ga0209526_102867793 | 3300028047 | Forest Soil | MIPVLPDAALRRDIRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIESPTLEREVEGPGVFGLAHPDCYTLWREESAALRQPPPRRWGTTGLPRAW
Ga0268264_106637841 | 3300028381 | Switchgrass Rhizosphere | VIPDLPDEPLRRQVRAQLAARQLFLADGVSSVRRGTGRPCTVCGHPIDSPTLEREVEGPGIVARAHPACYVIWREESMARQRKAG
Ga0137415_100546866 | 3300028536 | Vadose Zone Soil | MIPGPPDEALKRQIRVRLSKHELFAATGISSIRRGTGRPCIVCEHSIDSPTLEREVEGTGVFGLAHPDCYALWREESAARRQPPPRRWSTSGSAPPRAW
Ga0137415_101682922 | 3300028536 | Vadose Zone Soil | MVSDLIGDDSLRWQVRARLAKRELFGATGISSVRRGTGRTCIVCERPIDSPTLEREVEDPGVVAIAHPDCYKIWREESAILKHRPAV
Ga0307323_101746782 | 3300028787 | Soil | VRARLAAHQLFPANGISSVRRGTGRPCIVCERPIDSPTLEREVEGPGVTGLAHPTCYVVWREESIARQRKTG
Ga0307305_104965432 | 3300028807 | Soil | RLAAHQLFPANGISSVRRGTGRPCIVCERPIDSPTLEREVEGPGVTGLAHPTCYVVWREESIARQRKTG
Ga0307302_104384861 | 3300028814 | Soil | MIPEPSNSEMRQHVRTRLSKRQLFAATGISSIRRGTGRPCIVCVRSIESPTLEREVEGPGVLGLAHPGCYKIWREESALLRNPVAGNVFFSISRSKVQ
Ga0307312_100837234 | 3300028828 | Soil | MSKVSLLGFLESPEDALRRKVRARLAAHQLFPANGISSVRRGTGRPCIVCERPIDSPTLEREVEGPGVTGLAHPTCYVVWREESIARQRKTG
Ga0307312_104563222 | 3300028828 | Soil | VIPDLLEEPLRRQVRAQLAANQLFSADGVSSVRRGTGRLCTVCRRPIDAPTLEREVEGPGVVGRAHPACYVIWREESMACQRKAG
Ga0307312_107211232 | 3300028828 | Soil | MPDAMIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAALRQPPPRRWSTTGTP
Ga0307312_107908531 | 3300028828 | Soil | MVADLFGDDPLRGQVRTRLAKRELFAATGISSVRRGTGRPCIVCERSIDSPTLEREVEGPGVVARAHPACYVIWREESMARQR
Ga0307308_105987501 | 3300028884 | Soil | MPDAKIPGLPDGDLRQDVRARLSKRELFAATGMSSIRRGTGRRCLVCGLPIDSPTLEREVEGPGIIALAHPDCYAIWREESAALRQPPPRRWSTTGTPRA
Ga0307468_1003058152 | 3300031740 | Hardwood Forest Soil | MMRLPERDELRTLVRTQLSTRALFPATGISSIHRGTGRPCRVCDHPIDSPTLAREVEGSGVVVVAHPACYAIWREESAALRQPSPRRWGSMGLPRAW
Ga0307468_1008212741 | 3300031740 | Hardwood Forest Soil | MIPEPRDRDLRRDVRLRLARRQLFAAMGISSVRRGTGRPCIVCDAAIESPTLEREVEGPGVLGVAHPECYKIWREESALLKHRPAV




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice