NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F091479


Overview

Basic Information
Family ID: F091479
Family Type: Metagenome
Number of Sequences: 107
Average Sequence Length: 44 residues
Representative Sequence: CVAQLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGAL
Number of Associated Samples: 99
Number of Associated Scaffolds: 107

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 0.93 %
% of genes near scaffold ends (potentially truncated): 98.13 %
% of genes from short scaffolds (< 2000 bps): 90.65 %
Associated GOLD sequencing projects: 94
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.66
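
The percentages in the Quality Assessment block can be turned back into gene counts. The sketch below assumes each percentage is a fraction of the 107 family members (an assumption; the page does not state the denominator) and rounds to the nearest whole gene. The names `FAMILY_SIZE`, `reported`, and `to_count` are illustrative only.

```python
FAMILY_SIZE = 107  # "Number of Sequences" reported for family F091479

# Percentages as printed in the Quality Assessment block.
reported = {
    "valid RBS motif": 0.93,
    "near scaffold end": 98.13,
    "short scaffold (< 2000 bps)": 90.65,
}


def to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Recover the whole-number gene count behind a rounded percentage.

    Assumes the percentage was computed over `total` genes.
    """
    return round(percent / 100 * total)


counts = {label: to_count(p) for label, p in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Under that assumption the three figures correspond to 1, 105, and 97 of the 107 genes.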

Hidden Markov Model (sequence logo, rendered with Skylign)

Most Common Taxonomy
Group: Bacteria (91.589 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (12.149 % of family members)
Environment Ontology (ENVO): Unclassified (32.710 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (38.318 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 45.59%; β-sheet: 0.00%; Coil/Unstructured: 54.41%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.66

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 107 Family Scaffolds
PF00296 | Bac_luciferase | 28.04
PF07690 | MFS_1 | 7.48
PF05977 | MFS_3 | 3.74
PF01557 | FAA_hydrolase | 2.80
PF13436 | Gly-zipper_OmpA | 2.80
PF04909 | Amidohydro_2 | 2.80
PF01425 | Amidase | 2.80
PF07355 | GRDB | 1.87
PF02894 | GFO_IDH_MocA_C | 1.87
PF08241 | Methyltransf_11 | 1.87
PF01408 | GFO_IDH_MocA | 1.87
PF13458 | Peripla_BP_6 | 1.87
PF01168 | Ala_racemase_N | 0.93
PF02775 | TPP_enzyme_C | 0.93
PF13532 | 2OG-FeII_Oxy_2 | 0.93
PF01180 | DHO_dh | 0.93
PF01799 | Fer2_2 | 0.93
PF01988 | VIT1 | 0.93
PF13442 | Cytochrome_CBB3 | 0.93
PF02639 | DUF188 | 0.93
PF02515 | CoA_transf_3 | 0.93

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 107 Family Scaffolds
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 28.04
COG2814 | Predicted arabinose efflux permease AraJ, MFS family | Carbohydrate transport and metabolism [G] | 3.74
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 2.80
COG0673 | Predicted dehydrogenase | General function prediction only [R] | 1.87
COG0042 | tRNA-dihydrouridine synthase | Translation, ribosomal structure and biogenesis [J] | 0.93
COG0069 | Glutamate synthase domain 2 | Amino acid transport and metabolism [E] | 0.93
COG0167 | Dihydroorotate dehydrogenase | Nucleotide transport and metabolism [F] | 0.93
COG1304 | FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomerase | Energy production and conversion [C] | 0.93
COG1633 | Rubrerythrin, includes spore coat protein YhjR | Inorganic ion transport and metabolism [P] | 0.93
COG1671 | Uncharacterized conserved protein YaiI, UPF0178 family | Function unknown [S] | 0.93
COG1804 | Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferases | Lipid transport and metabolism [I] | 0.93
COG1814 | Predicted Fe2+/Mn2+ transporter, VIT1/CCC1 family | Inorganic ion transport and metabolism [P] | 0.93
COG2070 | NAD(P)H-dependent flavin oxidoreductase YrpB, nitropropane dioxygenase family | General function prediction only [R] | 0.93



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 91.59 %
Unclassified | root | N/A | 8.41 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000955|JGI1027J12803_101373246 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 514
3300000956|JGI10216J12902_100097070 | All Organisms → cellular organisms → Bacteria | 631
3300005289|Ga0065704_10616593 | All Organisms → cellular organisms → Bacteria | 599
3300005331|Ga0070670_100714055 | All Organisms → cellular organisms → Bacteria | 902
3300005406|Ga0070703_10531335 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 534
3300005446|Ga0066686_10839321 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 608
3300005468|Ga0070707_100425467 | Not Available | 1288
3300005546|Ga0070696_100070130 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2465
3300005552|Ga0066701_10500328 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 752
3300005557|Ga0066704_10001888 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 9421
3300005713|Ga0066905_102159527 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 519
3300005719|Ga0068861_100272830 | All Organisms → cellular organisms → Bacteria | 1453
3300005843|Ga0068860_100321372 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 1519
3300005880|Ga0075298_1036656 | All Organisms → cellular organisms → Bacteria | 515
3300005886|Ga0075286_1067342 | All Organisms → cellular organisms → Bacteria | 521
3300006880|Ga0075429_100666209 | All Organisms → cellular organisms → Bacteria | 912
3300009038|Ga0099829_10494712 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1015
3300009078|Ga0105106_10405220 | All Organisms → cellular organisms → Bacteria | 983
3300009088|Ga0099830_11770616 | All Organisms → cellular organisms → Bacteria | 516
3300009093|Ga0105240_11161974 | All Organisms → cellular organisms → Bacteria | 819
3300009100|Ga0075418_10209230 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2077
3300009101|Ga0105247_11216151 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 601
3300009147|Ga0114129_13094856 | All Organisms → cellular organisms → Bacteria | 544
3300009171|Ga0105101_10581375 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 555
3300009777|Ga0105164_10545488 | Not Available | 575
3300010047|Ga0126382_10232499 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1337
3300010047|Ga0126382_11168330 | All Organisms → cellular organisms → Bacteria | 687
3300010329|Ga0134111_10119387 | All Organisms → cellular organisms → Bacteria | 1024
3300010358|Ga0126370_10561113 | All Organisms → cellular organisms → Bacteria | 978
3300010359|Ga0126376_10520047 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1106
3300010360|Ga0126372_11997316 | All Organisms → cellular organisms → Bacteria | 626
3300010360|Ga0126372_12745422 | All Organisms → cellular organisms → Bacteria | 544
3300010360|Ga0126372_13045754 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 520
3300010361|Ga0126378_10991424 | All Organisms → cellular organisms → Bacteria | 944
3300010371|Ga0134125_11054691 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 890
3300010397|Ga0134124_10002841 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 13983
3300010397|Ga0134124_10516881 | All Organisms → cellular organisms → Bacteria | 1157
3300010400|Ga0134122_10801470 | All Organisms → cellular organisms → Bacteria | 899
3300010403|Ga0134123_11650046 | All Organisms → cellular organisms → Bacteria | 690
3300010403|Ga0134123_13091651 | All Organisms → cellular organisms → Bacteria | 534
3300010868|Ga0124844_1252385 | Not Available | 618
3300011271|Ga0137393_10426730 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1136
3300011414|Ga0137442_1104256 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 619
3300011441|Ga0137452_1176721 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 722
3300012096|Ga0137389_10400975 | All Organisms → cellular organisms → Bacteria | 1171
3300012174|Ga0137338_1091491 | Not Available | 667
3300012189|Ga0137388_10038920 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3772
3300012203|Ga0137399_10030157 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 3719
3300012353|Ga0137367_10957128 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 587
3300012361|Ga0137360_11385154 | All Organisms → cellular organisms → Bacteria | 606
3300012361|Ga0137360_11642259 | Not Available | 548
3300012922|Ga0137394_10663144 | All Organisms → cellular organisms → Bacteria | 880
3300012929|Ga0137404_10739688 | Not Available | 891
3300012931|Ga0153915_12222919 | Not Available | 642
3300012951|Ga0164300_10621688 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 641
3300012971|Ga0126369_11813029 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 699
3300012972|Ga0134077_10060010 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1417
3300014299|Ga0075303_1050701 | Not Available | 705
3300014302|Ga0075310_1171022 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → Planctomycetia | 505
3300015264|Ga0137403_10001549 | All Organisms → cellular organisms → Bacteria | 28855
3300015264|Ga0137403_10494953 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1094
3300015359|Ga0134085_10186756 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 889
3300016294|Ga0182041_10969682 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 767
3300017997|Ga0184610_1263455 | All Organisms → cellular organisms → Bacteria | 571
3300018031|Ga0184634_10424039 | All Organisms → cellular organisms → Bacteria | 603
3300018058|Ga0187766_10713093 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 694
3300018059|Ga0184615_10261452 | All Organisms → cellular organisms → Bacteria | 969
3300018076|Ga0184609_10462750 | All Organisms → cellular organisms → Bacteria | 582
3300018468|Ga0066662_10110051 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1979
3300018482|Ga0066669_10274831 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1349
3300019997|Ga0193711_1012968 | All Organisms → cellular organisms → Bacteria | 1049
3300021073|Ga0210378_10342914 | All Organisms → cellular organisms → Bacteria | 558
3300025549|Ga0210094_1035273 | All Organisms → cellular organisms → Bacteria | 851
3300025900|Ga0207710_10586497 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 582
3300025912|Ga0207707_11174071 | All Organisms → cellular organisms → Bacteria | 622
3300025914|Ga0207671_11498646 | All Organisms → cellular organisms → Bacteria | 521
3300025915|Ga0207693_10006110 | All Organisms → cellular organisms → Bacteria | 9981
3300025917|Ga0207660_10451312 | All Organisms → cellular organisms → Bacteria | 1040
3300025922|Ga0207646_10527543 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1063
3300025972|Ga0207668_10663076 | All Organisms → cellular organisms → Bacteria | 914
3300026035|Ga0207703_10320748 | All Organisms → cellular organisms → Bacteria | 1419
3300026075|Ga0207708_10093214 | All Organisms → cellular organisms → Bacteria | 2324
3300026088|Ga0207641_11962550 | All Organisms → cellular organisms → Bacteria | 587
3300026118|Ga0207675_100347257 | All Organisms → cellular organisms → Bacteria | 1453
3300026307|Ga0209469_1118139 | All Organisms → cellular organisms → Bacteria | 666
3300026377|Ga0257171_1106160 | Not Available | 500
3300026499|Ga0257181_1006267 | All Organisms → cellular organisms → Bacteria | 1445
3300026507|Ga0257165_1012919 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1334
3300026524|Ga0209690_1193585 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 655
3300026552|Ga0209577_10289884 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1223
3300027252|Ga0209973_1053316 | All Organisms → cellular organisms → Bacteria | 613
3300027646|Ga0209466_1085688 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 636
3300027725|Ga0209178_1283932 | All Organisms → cellular organisms → Bacteria | 606
3300028592|Ga0247822_10338992 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Modestobacter → unclassified Modestobacter → Modestobacter sp. DSM 44400 | 1158
3300028824|Ga0307310_10578404 | All Organisms → cellular organisms → Bacteria | 571
3300028884|Ga0307308_10515377 | All Organisms → cellular organisms → Bacteria | 574
(restricted) 3300031197|Ga0255310_10133118 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 678
3300031544|Ga0318534_10130158 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Hyphomicrobiales bacterium | 1448
3300031716|Ga0310813_10006626 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 7259
3300031763|Ga0318537_10285382 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 611
3300031892|Ga0310893_10295685 | All Organisms → cellular organisms → Bacteria | 684
3300031946|Ga0310910_10230921 | All Organisms → cellular organisms → Bacteria | 1443
3300032174|Ga0307470_11748675 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 525
3300032180|Ga0307471_100871384 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Modestobacter → unclassified Modestobacter → Modestobacter sp. | 1068
3300032180|Ga0307471_103952385 | All Organisms → cellular organisms → Bacteria | 524
3300034155|Ga0370498_171235 | All Organisms → cellular organisms → Bacteria | 530
3300034820|Ga0373959_0080826 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 748
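
Each scaffold identifier in the table has the form `<number>|<scaffold name>`, and the numeric prefix matches a Taxon OID listed under Associated Samples (for example, 3300000955). A minimal sketch, assuming that reading of the identifier, splits the two parts and counts family scaffolds per sample. `split_scaffold_id` and `per_sample` are hypothetical names, and only a handful of rows are copied in for illustration.

```python
from collections import Counter

# A few scaffold identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300000955|JGI1027J12803_101373246",
    "3300010360|Ga0126372_11997316",
    "3300010360|Ga0126372_12745422",
    "3300010360|Ga0126372_13045754",
    "3300015264|Ga0137403_10001549",
    "3300015264|Ga0137403_10494953",
]


def split_scaffold_id(identifier: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first '|'.

    Treating the prefix as the sample's Taxon OID is an assumption
    based on the matching numbers in the Associated Samples table.
    """
    taxon_oid, scaffold_name = identifier.split("|", 1)
    return taxon_oid, scaffold_name


# Number of family scaffolds contributed by each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, n in sorted(per_sample.items()):
    print(taxon_oid, n)
```

Counting this way over the full table would show which samples contribute more than one scaffold to the family, such as 3300010360 with three.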

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.15%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 8.41%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 7.48%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 5.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.61%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.67%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 4.67%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.74%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.80%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.80%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.80%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 2.80%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 2.80%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.87%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 1.87%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.87%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 1.87%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 1.87%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.87%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.87%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.87%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 1.87%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.93%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.93%
Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 0.93%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.93%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.93%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.93%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.93%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.93%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.93%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.93%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.93%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000955 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300005289 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005843 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 | Host-Associated
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300005886 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205 | Environmental
3300006880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 | Host-Associated
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009101 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG | Host-Associated
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009777 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300010868 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (PacBio error correction) | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011414 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT266_2 | Environmental
3300011441 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012174 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012951 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_226_MG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014299 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleC_D1 | Environmental
3300014302 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2 | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental
3300018058 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MG | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019997 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2 | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300025549 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 (SPAdes) | Environmental
3300025900 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-4 metaG (SPAdes) | Host-Associated
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025914 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C2-4 metaG (SPAdes) | Host-Associated
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026088Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027252Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant Co S PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027646Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 30 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031544Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f26EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031763Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f29EnvironmentalOpen in IMG/M
3300031892Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C0D2EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034155Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_05D_17EnvironmentalOpen in IMG/M
3300034820Populus rhizosphere microbial communities from soil in West Virginia, United States - WV94_WV_N_2Host-AssociatedOpen in IMG/M

Geographical Distribution




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI1027J12803_1013732461 | 3300000955 | Soil | RAMGAGARQFIITGFVPDPRAFMRRWSREVIPALG*
JGI10216J12902_1000970702 | 3300000956 | Soil | DDCAAQIRRAMGAGARQFITTGFVPDPRGFMRRFMGDVAKLLS*
Ga0065704_106165932 | 3300005289 | Switchgrass Rhizosphere | AGTPADCAAQLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL*
Ga0070670_1007140552 | 3300005331 | Switchgrass Rhizosphere | TPDDCVAQIRRAMSAGARQFITTGFVPDPRGFMRRFMGEVANLLRGD*
Ga0070703_105313351 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | DDCAAQIRRAMGAGARQFITTGFVPDPRGFMRRFMGEVAHLLG*
Ga0066686_108393211 | 3300005446 | Soil | DRFAVAGTPADCIAQLSRAMAAGARQFIITGFVPDPRAFMRRWSREVAPALA*
Ga0070707_1004254671 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | TPADCVAQVERAMDAGARQFLITGFVPDPRAFMRRWAREVADRVTV*
Ga0070696_1000701303 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | GTPADCVAQVVRAMGAGARQFLITGFVPDPRAFMRRWAKEVADRITV*
Ga0066701_105003281 | 3300005552 | Soil | AQIRRAMAAGARQFVITSFVPDRRAFVRRWTREVAAAVV*
Ga0066704_100018881 | 3300005557 | Soil | RAMAAGARQFIITAFVPDVRALMRRFMREVAAAVATPA*
Ga0066905_1021595272 | 3300005713 | Tropical Forest Soil | IQRAMATGAHQFVITSFVPDPRAFMRRWSREIAAAVSG*
Ga0068861_1002728301 | 3300005719 | Switchgrass Rhizosphere | QLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL*
Ga0068860_1003213723 | 3300005843 | Switchgrass Rhizosphere | AVAGTPADCVAQLSRAMGAGARQFIITGFVPDPRAFMRRWSREVIPALG*
Ga0075298_10366561 | 3300005880 | Rice Paddy Soil | ADCVAQVERAMAAGARQFLITGFVPDPRAFMRRWAKEVADRITV*
Ga0075286_10673421 | 3300005886 | Rice Paddy Soil | DCVAQIARAMDAGARQFVITGFVPDPRTFMRRWAKEVAGAL*
Ga0075429_1006662091 | 3300006880 | Populus Rhizosphere | AMAAGARQFITTSFVPDPRAFIRRWAQQVAMAVA*
Ga0099829_104947121 | 3300009038 | Vadose Zone Soil | PDDCAAQIRRAMAAGARQFVITSFVPDPRAFMRRWAREVAAALG*
Ga0105106_104052201 | 3300009078 | Freshwater Sediment | CVAQLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGAL*
Ga0099830_117706161 | 3300009088 | Vadose Zone Soil | RFAIAGTPSECVAQVRRAMAAGARQFIITGFVPDPPAFMRRWTREVADALP*
Ga0105240_111619742 | 3300009093 | Corn Rhizosphere | GTPDDCVAQIRRAMDAGARQFMITGFVPDPGAFMRRWARDVAGRVAG*
Ga0075418_102092303 | 3300009100 | Populus Rhizosphere | ADLRRAMAAGAHQFITTSFVPDPRVFMRRWRTEVAGVLTT*
Ga0105247_112161512 | 3300009101 | Switchgrass Rhizosphere | RAMGAGARQFITTGFVPDPRGFMRRFMGEVAQSLS*
Ga0114129_130948562 | 3300009147 | Populus Rhizosphere | AIAGTPADCVAQIRRAMAAGARQFVITGFVPDPRAFMKRWTHEVAAVL*
Ga0105101_105813752 | 3300009171 | Freshwater Sediment | AELRRAMGAGSRQFMITGFVPDPRAFARRWMKEVAGAL*
Ga0105164_105454882 | 3300009777 | Wastewater | AGTPEDCVAQISRAMAAGARQFMITGFVPDPRAFMRRWAREVVAAL*
Ga0126382_102324991 | 3300010047 | Tropical Forest Soil | LVQIQRAMAAGAHQFVITGFVPDPSAFMRRWEREVAARL*
Ga0126382_111683302 | 3300010047 | Tropical Forest Soil | IVGTPDECVAQIRRAMDAGARQFVITSFVPDPRAFVRRWSREVAAAVS*
Ga0134111_101193871 | 3300010329 | Grasslands Soil | IRRAVAAGARQFVITSFVPDRRAFVRRWMREVAAAVR*
Ga0126370_105611131 | 3300010358 | Tropical Forest Soil | RAMAAGAHQFVITGFVPDPSAFMRRWEREVAARL*
Ga0126376_105200473 | 3300010359 | Tropical Forest Soil | AQIRRAMAAGAHQFVITGFVPDPSAFMRRWEREVAARL*
Ga0126372_119973162 | 3300010360 | Tropical Forest Soil | VDRFAIAGTPGDCAAQIRRAMSAGARQFVITGIVPDPASFMRRFMREAAGAVRSRP*
Ga0126372_127454221 | 3300010360 | Tropical Forest Soil | FGIAGTPEDCRAQISRAMAAGARQFMITGFVPDPRAFVRRWAREVATVRP*
Ga0126372_130457541 | 3300010360 | Tropical Forest Soil | RRAMAAGAHQFVITSFVPDPRAFMRRWFREITAALGEPDLPPSR*
Ga0126378_109914241 | 3300010361 | Tropical Forest Soil | FAIAGTPDNCVAQIRRAMDAGARQFVITSFVPDPHVFMRRWMRDVVANLS*
Ga0134125_110546912 | 3300010371 | Terrestrial Soil | AMGAGARQFITTGFVPDPRGFMRRFMGEVARLLR*
Ga0134124_100028411 | 3300010397 | Terrestrial Soil | FAIAGTPADCVAQVERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTA*
Ga0134124_105168811 | 3300010397 | Terrestrial Soil | AMGAGARQFITTGFVPDPRGFMRRFMGEVAHLLG*
Ga0134122_108014702 | 3300010400 | Terrestrial Soil | FAIAGTPADCVTQIRRAMAAGAQQFVITVFVPDPRAFMRRWSREVAAL*
Ga0134123_116500461 | 3300010403 | Terrestrial Soil | RAMAAGARQFVITGFVPDGPAFMRRWAREVAGVV*
Ga0134123_130916512 | 3300010403 | Terrestrial Soil | AIAGTPDECIAQIRRAMAAGARQFITTSFVPDPRAFMRRWAGEVAKAVT*
Ga0124844_12523852 | 3300010868 | Tropical Forest Soil | CIAQISRAMAAGARQFILAGFVPDPRAFMRRWSREVAARI*
Ga0137393_104267301 | 3300011271 | Vadose Zone Soil | AAQIRRAMAAGARQFVITSFVPDPRAFMRRWAREVAAALG*
Ga0137442_11042561 | 3300011414 | Soil | IAGTPDDCAAQIRRAMGAGARQFITTGFVPDPRGFMRRFMGEVARLLS*
Ga0137452_11767211 | 3300011441 | Soil | DDCAAQIRRAVGAGARQFITTGFVPDPRGFMRRFMGEVATSLS*
Ga0137389_104009752 | 3300012096 | Vadose Zone Soil | LRDRFGIAGTSDDCVAQIRRAMAAGARQFIIAAFVPDVSGFMRRFMSEVAAAVGGPP*
Ga0137338_10914911 | 3300012174 | Soil | AGTPDDCVVQISRAMAAGARQFMITGFVPDPRAFVRRWAREVVAALR*
Ga0137388_100389206 | 3300012189 | Vadose Zone Soil | MAAGARQFIITAFVPDVSGFMRRFMSEVAAAVGGPR*
Ga0137399_100301575 | 3300012203 | Vadose Zone Soil | RRAMAAGAHQFVITSFVPNPRAFMRRWMRDVVAAAR*
Ga0137367_109571281 | 3300012353 | Vadose Zone Soil | DDCVAQVRRAMAAGARQFIVTSFVPDPRAFMRRFMGEVAAAVGDAR*
Ga0137360_113851541 | 3300012361 | Vadose Zone Soil | TPAECAAQIRRAMAAGAQQFITTSFVPDPRAFMRRWAKEVAAGLA*
Ga0137360_116422591 | 3300012361 | Vadose Zone Soil | DCVAQVERAMDAGARQFLITGFVPDPRAFMRRWAREVADRVTV*
Ga0137394_106631442 | 3300012922 | Vadose Zone Soil | DRFAIAGTPADCVAQIRRAMAAGARQFVITGFVPDPRAFMRRWAHEVAGVV*
Ga0137404_107396881 | 3300012929 | Vadose Zone Soil | PPDCVAQLSRAMAAGARQFIITAFVPDPRAFMRRWAKEVVPALG*
Ga0153915_122229192 | 3300012931 | Freshwater Wetlands | CVAQISRAMAAGARQFMITGFVPDPRAFMRRWAREVVAALR*
Ga0164300_106216881 | 3300012951 | Soil | IAGTPADCVAQVERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTV*
Ga0126369_118130291 | 3300012971 | Tropical Forest Soil | QRAMAAGARQFIITGFVPDPSAFMRRWAREVAGRL*
Ga0134077_100600103 | 3300012972 | Grasslands Soil | FAVAGTPADCIAQLSRAMAAGARQFIITGFVPDPRAFMRRWSREVAPALA*
Ga0075303_10507012 | 3300014299 | Natural And Restored Wetlands | RDCVAQLSRAMAAGARQFIITGFVPDPRAFMRQWAREVAPALV*
Ga0075310_11710222 | 3300014302 | Natural And Restored Wetlands | FLVDRFAIAGTPGECVAQIRRAMEAGARQFITTSFVPEPGLFMRRWAGEVAGPLG*
Ga0137403_100015491 | 3300015264 | Vadose Zone Soil | DCVAQLSRAMAAGARQFIITAFVPDPRAFMRRWAKEVVPALG*
Ga0137403_104949531 | 3300015264 | Vadose Zone Soil | DCVAQLSRAMAAGARQFIITAFVPDPRAFMRRWSREVVSALG*
Ga0134085_101867562 | 3300015359 | Grasslands Soil | SDDCVAQVRRAMAAGARQFIITAFVPDVRALMRRFMREVAAAVGTPA*
Ga0182041_109696821 | 3300016294 | Soil | IAQIQRAMAAGARQFIITGFVPDPSAFMRRWAREVAARL
Ga0184610_12634552 | 3300017997 | Groundwater Sediment | IAGTPADCVAQLRRAIAAGARQFITTSFVPDPRAFMRRWAREVAQPLA
Ga0184634_104240392 | 3300018031 | Groundwater Sediment | AGTPADCVAQLRRAMGAGARQFMITGFVPDPPAFMRRWMKEVAGVL
Ga0187766_107130932 | 3300018058 | Tropical Peatland | RFAVAGTASDCVAQLSRAMAAGARQFIITSFVPDPRLFVRRWAREVVPAL
Ga0184615_102614521 | 3300018059 | Groundwater Sediment | VAQIRRAIAAGARQFITTSFVPDPRAFMRRWAREVADTLG
Ga0184609_104627502 | 3300018076 | Groundwater Sediment | ADCVAQLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGVL
Ga0066662_101100511 | 3300018468 | Grasslands Soil | CVAQVRRAMAAGARQFIITAFVPDVRALMRRFMREVAAAVATPA
Ga0066669_102748313 | 3300018482 | Grasslands Soil | AQIQRAVAAGARQFVITSFVPDRRAFVRRWMREVAAAVR
Ga0193711_10129682 | 3300019997 | Soil | VAQLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL
Ga0210378_103429141 | 3300021073 | Groundwater Sediment | HRQRFAIAGTPADCVAQLRRAIAAGARQFITTSFVPDPRAFMRRWAREVAQPLA
Ga0210094_10352732 | 3300025549 | Natural And Restored Wetlands | QLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGAL
Ga0207710_105864972 | 3300025900 | Switchgrass Rhizosphere | RAMGAGARQFITTGFVPDPRGFMRRFMGEVAQSLS
Ga0207707_111740711 | 3300025912 | Corn Rhizosphere | RRAMAAGARQFITTSFVPDPRAFMRRWAGEVAKAVT
Ga0207671_114986462 | 3300025914 | Corn Rhizosphere | CVAQVERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTA
Ga0207693_100061101 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | AVAGTPADCVAQLSRAMGAGARQFIITGFVPDPRAFMRRWSREVIPALG
Ga0207660_104513121 | 3300025917 | Corn Rhizosphere | RFAVAGTPADCAAQLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL
Ga0207646_105275431 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | LRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGAL
Ga0207668_106630761 | 3300025972 | Switchgrass Rhizosphere | RRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL
Ga0207703_103207483 | 3300026035 | Switchgrass Rhizosphere | GTPDDCAAQIRRAMGAGARQFITTGFVPDPRGFMRRFMGEVAQSLS
Ga0207708_100932144 | 3300026075 | Corn, Switchgrass And Miscanthus Rhizosphere | QIRRAMSAGARQFITTGFVPDPRGFMRRFMGEVANLLRGD
Ga0207641_119625501 | 3300026088 | Switchgrass Rhizosphere | AIAGTPADCVEQIRRAMAAGARQFVITGFVPEGPAFMRRWAREVAGVV
Ga0207675_1003472573 | 3300026118 | Switchgrass Rhizosphere | QLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL
Ga0209469_11181392 | 3300026307 | Soil | VAQIRRAVAAGARQFVITSFVPDRRAFVRRWMREVAAAVR
Ga0257171_11061601 | 3300026377 | Soil | RAMAAGARQFIITAFVPDPRAFMRRWAKEVVPALG
Ga0257181_10062671 | 3300026499 | Soil | LVDRFAVAGTPADCVAQLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGVL
Ga0257165_10129191 | 3300026507 | Soil | DCVAQVERAMDAGARQFLITGFVPDPRAFMRRWAREVADRVTV
Ga0209690_11935851 | 3300026524 | Soil | AQIRRAMAAGARQFVITSFVPDRRAFVRRWTREVAAAVV
Ga0209577_102898841 | 3300026552 | Soil | DCVAQVRRAMAAGARQFIITAFVPDVRALMRRFMREVAAAVGTPA
Ga0209973_10533161 | 3300027252 | Arabidopsis Thaliana Rhizosphere | QIRRAMDAGARQFMITGFVPDPGAFMRRWARDVAGRVAG
Ga0209466_10856881 | 3300027646 | Tropical Forest Soil | DRFAIGGTPDECVAQVRRAVAAGAHQFVIAGFVADSRAFMRRFMREVAAPCGSGA
Ga0209178_12839322 | 3300027725 | Agricultural Soil | AIAGTPADCVAQVERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTV
Ga0247822_103389923 | 3300028592 | Soil | VGGRAGDCVAQIRRAMAAGARQFVITGFVPDPRAFMKRWTHEVAAVL
Ga0307310_105784042 | 3300028824 | Soil | DRFAVAGTPADCVAQLRRAMGAGARQFMITGFVPDPGAFMRRWMKEVAGAL
Ga0307308_105153771 | 3300028884 | Soil | ADRFAVAGTPADCVAQLRRAVGAGARQFMITGFVPDPGAFMRRWMKEVAGAL
(restricted) Ga0255310_101331181 | 3300031197 | Sandy Soil | RAMAAGARQFIITAFVPDVRALMRRFMREVAAAVGSPR
Ga0318534_101301582 | 3300031544 | Soil | FLVDRFAVAGTASDCVAQLSRAMAAGARQFIITSFVPDPRLFMRRWAREVVPAL
Ga0310813_100066267 | 3300031716 | Soil | DCVAQVERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTA
Ga0318537_102853821 | 3300031763 | Soil | SDCVAQLSRAMAAGARQFIITSFVPDPRLFMRRWAREVVPAL
Ga0310893_102956851 | 3300031892 | Soil | IAGTPGECVTQIRRAMTAGARQFITTSFVPDPRVFMRRWAGGVADALT
Ga0310910_102309211 | 3300031946 | Soil | GDCLAQIRRAMAAGAHQFVITGFVPDPSAFMRRWEREVAARL
Ga0307470_117486752 | 3300032174 | Hardwood Forest Soil | MGAGARQFITTGFVPDPRGFMRRFMGEVAKSSSSTES
Ga0307471_1008713843 | 3300032180 | Hardwood Forest Soil | GTPADCVAQIRRAMAAGARQFIITGFVPDPSAFMRQWAGEVAGVV
Ga0307471_1039523851 | 3300032180 | Hardwood Forest Soil | PADCVTQIRRAMAAGARQFVITGFVPDPRVFMRRWASEVAAVV
Ga0370498_171235_1_150 | 3300034155 | Untreated Peat Soil | FAVAGTPADCVAQLRRAMGAGARQFMITGFVPDPRAFVRRWMKEVAGAL
Ga0373959_0080826_630_746 | 3300034820 | Rhizosphere Soil | VERAVEAGARQFLITGFVPDPRAFVRRWMREVAGRVTV
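
Each row of the Family Sequences table carries a protein ID, a sample taxon ID, a habitat and a sequence, so the rows map naturally onto FASTA records. The following is a minimal Python sketch of that conversion, not part of NMPFamsDB itself. The `FamilyMember` type and the `clean_sequence` and `to_fasta` helpers are hypothetical names introduced here, the three records are copied from the table above, and the trailing `*` on some sequences is assumed to be a stop marker that should not appear in the FASTA output.

```python
from typing import Iterable, NamedTuple


class FamilyMember(NamedTuple):
    """One row of the Family Sequences table (hypothetical container)."""
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Three rows copied from the table above, split into their four columns.
ROWS = [
    FamilyMember("JGI1027J12803_1013732461", "3300000955", "Soil",
                 "RAMGAGARQFIITGFVPDPRAFMRRWSREVIPALG*"),
    FamilyMember("Ga0105106_104052201", "3300009078", "Freshwater Sediment",
                 "CVAQLRRAMGAGARQFMITGFVPDPRAFMRRWMKEVAGAL*"),
    FamilyMember("Ga0182041_109696821", "3300016294", "Soil",
                 "IAQIQRAMAAGARQFIITGFVPDPSAFMRRWAREVAARL"),
]


def clean_sequence(seq: str) -> str:
    """Drop a trailing '*' (assumed here to be a stop marker)."""
    return seq.rstrip("*")


def to_fasta(members: Iterable[FamilyMember]) -> str:
    """Render the rows as FASTA text, one header line and one sequence line each."""
    records = []
    for m in members:
        header = f">{m.protein_id} taxon={m.taxon_id} habitat={m.habitat}"
        records.append(f"{header}\n{clean_sequence(m.sequence)}")
    return "\n".join(records) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

Keeping the taxon ID and habitat in the header line preserves the link back to the associated sample. Rows marked as restricted remain subject to the JGI data usage policy noted above.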




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice