NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F104647

Metagenome / Metatranscriptome Family F104647

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104647
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 114 residues
Representative Sequence MLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Number of Associated Samples 87
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 74.75 %
% of genes near scaffold ends (potentially truncated) 18.00 %
% of genes from short scaffolds (< 2000 bps) 69.00 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (79.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(17.000 % of family members)
Environment Ontology (ENVO) Unclassified
(45.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(39.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 12.93%    β-sheet: 43.10%    Coil/Unstructured: 43.97%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF02518HATPase_c 15.00
PF00561Abhydrolase_1 14.00
PF13185GAF_2 8.00
PF04214DUF411 6.00
PF05193Peptidase_M16_C 5.00
PF02515CoA_transf_3 5.00
PF14534DUF4440 4.00
PF01979Amidohydro_1 3.00
PF00166Cpn10 3.00
PF13426PAS_9 2.00
PF13545HTH_Crp_2 2.00
PF13147Obsolete Pfam Family 1.00
PF00999Na_H_Exchanger 1.00
PF01022HTH_5 1.00
PF08402TOBE_2 1.00
PF03328HpcH_HpaI 1.00
PF00392GntR 1.00
PF07992Pyr_redox_2 1.00
PF00118Cpn60_TCP1 1.00
PF12680SnoaL_2 1.00
PF00775Dioxygenase_C 1.00
PF00072Response_reg 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG3019Uncharacterized metal-binding protein, DUF411 familyFunction unknown [S] 6.00
COG1804Crotonobetainyl-CoA:carnitine CoA-transferase CaiB and related acyl-CoA transferasesLipid transport and metabolism [I] 5.00
COG0234Co-chaperonin GroES (HSP10)Posttranslational modification, protein turnover, chaperones [O] 3.00
COG0025NhaP-type Na+/H+ or K+/H+ antiporterInorganic ion transport and metabolism [P] 1.00
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 1.00
COG0469Pyruvate kinaseCarbohydrate transport and metabolism [G] 1.00
COG0475Kef-type K+ transport system, membrane component KefBInorganic ion transport and metabolism [P] 1.00
COG2301Citrate lyase beta subunitCarbohydrate transport and metabolism [G] 1.00
COG3004Na+/H+ antiporter NhaAEnergy production and conversion [C] 1.00
COG3263NhaP-type Na+/H+ and K+/H+ antiporter with C-terminal TrkAC and CorC domainsEnergy production and conversion [C] 1.00
COG3485Protocatechuate 3,4-dioxygenase beta subunitSecondary metabolites biosynthesis, transport and catabolism [Q] 1.00
COG38362-keto-3-deoxy-L-rhamnonate aldolase RhmACarbohydrate transport and metabolism [G] 1.00
COG4651Predicted Kef-type K+ transport protein, K+/H+ antiporter domainInorganic ion transport and metabolism [P] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms80.00 %
UnclassifiedrootN/A20.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004019|Ga0055439_10005244All Organisms → cellular organisms → Bacteria2674Open in IMG/M
3300004024|Ga0055436_10019783All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1614Open in IMG/M
3300004025|Ga0055433_10069470All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium751Open in IMG/M
3300004062|Ga0055500_10000987All Organisms → cellular organisms → Bacteria3385Open in IMG/M
3300004463|Ga0063356_100032845All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales5035Open in IMG/M
3300005294|Ga0065705_10258123All Organisms → cellular organisms → Bacteria1170Open in IMG/M
3300005345|Ga0070692_10577891All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium740Open in IMG/M
3300005444|Ga0070694_100999115Not Available694Open in IMG/M
3300005458|Ga0070681_10987036Not Available762Open in IMG/M
3300005546|Ga0070696_101053303Not Available682Open in IMG/M
3300005549|Ga0070704_100009347All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria5917Open in IMG/M
3300006845|Ga0075421_100807497All Organisms → cellular organisms → Bacteria1079Open in IMG/M
3300009038|Ga0099829_10001106All Organisms → cellular organisms → Bacteria → Proteobacteria14644Open in IMG/M
3300009053|Ga0105095_10265912All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii941Open in IMG/M
3300009088|Ga0099830_10010883All Organisms → cellular organisms → Bacteria → Proteobacteria5553Open in IMG/M
3300010400|Ga0134122_11202692Not Available759Open in IMG/M
3300011271|Ga0137393_10003614All Organisms → cellular organisms → Bacteria10061Open in IMG/M
3300011395|Ga0137315_1008952All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1153Open in IMG/M
3300011429|Ga0137455_1044708All Organisms → cellular organisms → Bacteria1244Open in IMG/M
3300012225|Ga0137434_1012678All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1002Open in IMG/M
3300012685|Ga0137397_10052501All Organisms → cellular organisms → Bacteria2928Open in IMG/M
3300012685|Ga0137397_10069313All Organisms → cellular organisms → Bacteria2550Open in IMG/M
3300012918|Ga0137396_10491831Not Available909Open in IMG/M
3300012922|Ga0137394_10196455All Organisms → cellular organisms → Bacteria1722Open in IMG/M
3300012922|Ga0137394_10286255All Organisms → cellular organisms → Bacteria1409Open in IMG/M
3300012925|Ga0137419_10015718All Organisms → cellular organisms → Bacteria → Proteobacteria4267Open in IMG/M
3300012929|Ga0137404_10191244Not Available1729Open in IMG/M
3300012929|Ga0137404_10216894All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1628Open in IMG/M
3300012929|Ga0137404_11603908All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria603Open in IMG/M
3300012930|Ga0137407_10168354All Organisms → cellular organisms → Bacteria1948Open in IMG/M
3300012930|Ga0137407_10382265All Organisms → cellular organisms → Bacteria1301Open in IMG/M
3300012944|Ga0137410_10004129All Organisms → cellular organisms → Bacteria → Proteobacteria9846Open in IMG/M
3300014308|Ga0075354_1003748All Organisms → cellular organisms → Bacteria1914Open in IMG/M
3300014320|Ga0075342_1254954Not Available512Open in IMG/M
3300014873|Ga0180066_1043555All Organisms → cellular organisms → Bacteria876Open in IMG/M
3300014877|Ga0180074_1030998All Organisms → cellular organisms → Bacteria1083Open in IMG/M
3300014877|Ga0180074_1108416All Organisms → cellular organisms → Bacteria623Open in IMG/M
3300014885|Ga0180063_1003730All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria4146Open in IMG/M
3300015245|Ga0137409_10195295All Organisms → cellular organisms → Bacteria1825Open in IMG/M
3300015254|Ga0180089_1052360All Organisms → cellular organisms → Bacteria807Open in IMG/M
3300017997|Ga0184610_1006311All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2836Open in IMG/M
3300018031|Ga0184634_10079642All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1406Open in IMG/M
3300018053|Ga0184626_10068584All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1493Open in IMG/M
3300018056|Ga0184623_10521912Not Available504Open in IMG/M
3300018063|Ga0184637_10016199All Organisms → cellular organisms → Bacteria → Proteobacteria4477Open in IMG/M
3300018071|Ga0184618_10468341Not Available529Open in IMG/M
3300018074|Ga0184640_10074071All Organisms → cellular organisms → Bacteria1447Open in IMG/M
3300018075|Ga0184632_10327019All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla660Open in IMG/M
3300018076|Ga0184609_10398697All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria640Open in IMG/M
3300018079|Ga0184627_10060165All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1975Open in IMG/M
3300018084|Ga0184629_10184128All Organisms → cellular organisms → Bacteria1074Open in IMG/M
3300018429|Ga0190272_10104462All Organisms → cellular organisms → Bacteria1831Open in IMG/M
3300018429|Ga0190272_10298543All Organisms → cellular organisms → Bacteria1246Open in IMG/M
3300019259|Ga0184646_1230496All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1720Open in IMG/M
3300019360|Ga0187894_10053137All Organisms → cellular organisms → Bacteria2366Open in IMG/M
3300019879|Ga0193723_1018210All Organisms → cellular organisms → Bacteria2167Open in IMG/M
3300019881|Ga0193707_1028764All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1820Open in IMG/M
3300019883|Ga0193725_1014051All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Coleoptera → Polyphaga → Elateriformia → Elateroidea → Lampyridae → Luciolinae → Abscondita → Abscondita terminalis2239Open in IMG/M
3300019883|Ga0193725_1117308Not Available611Open in IMG/M
3300019997|Ga0193711_1000546All Organisms → cellular organisms → Bacteria → Proteobacteria5075Open in IMG/M
3300019998|Ga0193710_1006051All Organisms → cellular organisms → Bacteria1214Open in IMG/M
3300020003|Ga0193739_1007452All Organisms → cellular organisms → Bacteria2915Open in IMG/M
3300020004|Ga0193755_1004496All Organisms → cellular organisms → Bacteria4461Open in IMG/M
3300020004|Ga0193755_1019628All Organisms → cellular organisms → Bacteria2232Open in IMG/M
3300020060|Ga0193717_1097881Not Available932Open in IMG/M
3300020063|Ga0180118_1287240All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium840Open in IMG/M
3300020067|Ga0180109_1474561All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla575Open in IMG/M
3300020068|Ga0184649_1468079Not Available746Open in IMG/M
3300021073|Ga0210378_10061376All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1477Open in IMG/M
3300021088|Ga0210404_10017602All Organisms → cellular organisms → Bacteria3024Open in IMG/M
3300022534|Ga0224452_1017899All Organisms → cellular organisms → Bacteria1943Open in IMG/M
3300025324|Ga0209640_10024146All Organisms → cellular organisms → Bacteria5308Open in IMG/M
3300025535|Ga0207423_1003586All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2240Open in IMG/M
3300025910|Ga0207684_10023596All Organisms → cellular organisms → Bacteria5252Open in IMG/M
3300025917|Ga0207660_10039953All Organisms → cellular organisms → Bacteria → Proteobacteria3282Open in IMG/M
3300026102|Ga0208914_1009386All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1043Open in IMG/M
3300026285|Ga0209438_1034918All Organisms → cellular organisms → Bacteria1668Open in IMG/M
3300026320|Ga0209131_1320497Not Available576Open in IMG/M
3300026469|Ga0257169_1008177All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1275Open in IMG/M
3300026507|Ga0257165_1000516All Organisms → cellular organisms → Bacteria3875Open in IMG/M
3300026507|Ga0257165_1062716Not Available675Open in IMG/M
3300027731|Ga0209592_1332185Not Available516Open in IMG/M
3300027815|Ga0209726_10049282All Organisms → cellular organisms → Bacteria2953Open in IMG/M
3300027875|Ga0209283_10014012All Organisms → cellular organisms → Bacteria → Proteobacteria4798Open in IMG/M
3300027909|Ga0209382_10950957All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium900Open in IMG/M
3300028711|Ga0307293_10065285All Organisms → cellular organisms → Bacteria1141Open in IMG/M
3300028792|Ga0307504_10020218All Organisms → cellular organisms → Bacteria1645Open in IMG/M
(restricted) 3300031150|Ga0255311_1038454All Organisms → cellular organisms → Bacteria1002Open in IMG/M
3300031720|Ga0307469_10202994All Organisms → cellular organisms → Bacteria1550Open in IMG/M
3300031720|Ga0307469_10529201All Organisms → cellular organisms → Bacteria1039Open in IMG/M
3300031740|Ga0307468_100264133All Organisms → cellular organisms → Bacteria1221Open in IMG/M
3300031740|Ga0307468_102337972Not Available520Open in IMG/M
3300032180|Ga0307471_100686322All Organisms → cellular organisms → Bacteria1188Open in IMG/M
3300032180|Ga0307471_101876692Not Available749Open in IMG/M
3300033233|Ga0334722_10258297Not Available1278Open in IMG/M
3300033417|Ga0214471_10066510All Organisms → cellular organisms → Bacteria → Proteobacteria2951Open in IMG/M
3300033513|Ga0316628_102079867All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii754Open in IMG/M
3300034164|Ga0364940_0036265All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1297Open in IMG/M
3300034176|Ga0364931_0217876Not Available624Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil15.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment11.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil8.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil6.00%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment5.00%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands5.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.00%
Natural And Restored WetlandsEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands3.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment2.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil2.00%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere2.00%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment2.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.00%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.00%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater1.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil1.00%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.00%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004019Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2EnvironmentalOpen in IMG/M
3300004024Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300004025Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004062Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009053Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011395Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT200_2EnvironmentalOpen in IMG/M
3300011429Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2EnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014308Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1EnvironmentalOpen in IMG/M
3300014320Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014877Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT366_16_10DEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015254Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10DEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2c2EnvironmentalOpen in IMG/M
3300020063Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020067Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLIBT47_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020068Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025535Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026102Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026469Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-17-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300027731Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034164Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17EnvironmentalOpen in IMG/M
3300034176Sediment microbial communities from East River floodplain, Colorado, United States - 21_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
Ga0055439_1000524443300004019Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGLSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGKDMLLFVRADGRGSAKLERVK*
Ga0055436_1001978323300004024Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGLSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGRDMLLFVRADGRGSAKLERVKEPSRRPGSGSRLLTVVTAAGR*
Ga0055433_1006947013300004025Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGLSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGRDMLLFVRADGRGSAKLERVK*
Ga0055500_1000098713300004062Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGVSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGKDMLLFVRADGRGSAKLERVK*
Ga0063356_10003284553300004463Arabidopsis Thaliana RhizosphereMMHVRLRVCLIVLSLATVAFAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGTASFQSANSAGSVTWSQEGGKDVLLFVRADGRASAKLERVK*
Ga0065705_1025812333300005294Switchgrass RhizosphereAFAGPDRPITSLKSLTGEWRALGGASAAAIRIKPDGSYEGTAANGIRTVGKITIADGKASFQSASSAGSVTWIQEGGKDVLLFARGDGRGSAKLERVNTVERLKTK*
Ga0070692_1057789113300005345Corn, Switchgrass And Miscanthus RhizosphereDRPITSVKSLAGEWRAVGGASAAAIRIKPDGAYEGTAANGARTVGRITIADGKASFQSASSAGGVTWTQEGGKDVLLFVRGDGRGSAKLERVNTAERVKAR*
Ga0070694_10099911523300005444Corn, Switchgrass And Miscanthus RhizosphereMMHVRLRVCLIVLSLATVAFAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGTASFQSANSAGSVTWSQEGGKDVLLFVRADGRAS
Ga0070681_1098703613300005458Corn RhizosphereVAFAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGTASFQSANSAGSVTWSQEGGKDVLLFVRADGRASAKLERVK*
Ga0070696_10105330323300005546Corn, Switchgrass And Miscanthus RhizosphereMMHVRLRVCLIVLSLATVAFAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGKASFQSASSAGSVTWSQEGGKDVLHFVRADGRGSAKLERVK*
Ga0070704_10000934723300005549Corn, Switchgrass And Miscanthus RhizosphereMRFRIGVCLVVLSLAAVAFAGPDKPIANVKSLSGDWRAPGGASAAAIRIQPDGSYEGLSANGTKTVGKITTVGGKASFQSAKSAGSVTWSQEAGKDVLMFVAGDGRGSARLERVK*
Ga0075421_10080749733300006845Populus RhizosphereMRLGLGVCLLVCSLVAVAFAGPDKPIANVKSLTGDWRALGGASAAAIRIKADGSYEGIAANGTKTVGKITAAGGKASFQSANSAGTVTWSQEAGKEMLLFVRADGRGSAKLERVK*
Ga0099829_1000110693300009038Vadose Zone SoilMLGFRVGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK*
Ga0105095_1026591223300009053Freshwater SedimentMRFRLGVCLLVLSLAAVAFAGPDKPITNVKSLTGDWRAMGGASAAAIRIKPDGSYEGISANGAKTVGKITTVGGKASFQSANSAGSVTWSQEGGKDVLFFVRGDGRGSAKLERVK*
Ga0099830_1001088383300009088Vadose Zone SoilMLGFRVGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVT*
Ga0134122_1120269213300010400Terrestrial SoilMMRVRLGVCLIVLSLAAVALAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGKASFQSASSAGSVTWSQEGGKDVLHFVRADGRGSAKLERVK*
Ga0137393_1000361493300011271Vadose Zone SoilMLGFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVT*
Ga0137315_100895223300011395SoilMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK*
Ga0137455_104470813300011429SoilMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRMK*
Ga0137434_101267813300012225SoilMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAIRGASAAAIHIKPDGSYEGIAANGVKTAGKITTGSGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK*
Ga0137397_1005250133300012685Vadose Zone SoilVCLVVLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGAKTAGKITTAGGKASFQSAKSVGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137397_1006931333300012685Vadose Zone SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTLGRITIVDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERVKAK*
Ga0137396_1049183113300012918Vadose Zone SoilVCLVVLSLATVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGARTVGKITTAGGKASFQSAKSVGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137394_1019645523300012922Vadose Zone SoilMRFRLGVCLVVLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGAKTAGKITTAGGKASFQSAKSVGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137394_1028625513300012922Vadose Zone SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTVGRITIVDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERVKAK*
Ga0137419_1001571823300012925Vadose Zone SoilLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGARTVGKITTAGGKASFQSAKSVGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137404_1019124433300012929Vadose Zone SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTVGRITIIDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERVKAK*
Ga0137404_1021689423300012929Vadose Zone SoilPRHQGLGLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGAKTVGKITTAEGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137404_1160390813300012929Vadose Zone SoilAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIQPDGSYEGFSANGTKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137407_1016835433300012930Vadose Zone SoilVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTVGRITIIDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERVKAK*
Ga0137407_1038226523300012930Vadose Zone SoilMRFRLGVCLVVLSLAAVAFAGPDKPIANVKSLSGDWRAPGGASAAAIRIQPDGSYEGFSANGTKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0137410_1000412923300012944Vadose Zone SoilMRFRLGVCLVVLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGARTVGKITTAGGKASFQSAKSVGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0075354_100374813300014308Natural And Restored WetlandsVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGVSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGKDMLLFVRADGRGSAKLERVK*
Ga0075342_125495413300014320Natural And Restored WetlandsMRFRLGVCLLVLSLAAVAFAGPDKPITNVKSLTGDWRAMGGASAAAIRIKPDGSYEGISANGAKTVGKITTAGGKASFQSANSAGSVTWSQEGGKDVLFFVRGDGRGSAKLERVK*
Ga0180066_104355523300014873SoilMLGFRLGVCLVVLSLAAVTFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK*
Ga0180074_103099823300014877SoilMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGSVAWSQEDGKDMLLFVRADGRGAAKLQRVK*
Ga0180074_110841613300014877SoilMLRFRLGVCLVVCSLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGIAANGTKTVGKITTAGGKASFQSANSAGTVTWSREDGKDMLLFVRADGRGSAKLQRVK*
Ga0180063_100373033300014885SoilMRLRLAVGLLVLSLATVAFAGPDKPIASVKSLTGDWRAIGGTSAAAIRIKPDGSYEGISANGTKTVGKITTAGGRASFQSASSAGSVTWSQEGGKDVLFFVRGDGRGSAKLERVK*
Ga0137409_1019529533300015245Vadose Zone SoilMRFRLGVCLVVLSLAAVAYAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSSNGAKTVGKITTAEGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGSARLERVK*
Ga0180089_105236023300015254SoilLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGIAANGTKTVGKITTAGGKASFQSANSAGTVTWSREDGKDMLLFVRADGRGSAKLQRVK*
Ga0184610_100631123300017997Groundwater SedimentVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWYQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184634_1007964223300018031Groundwater SedimentMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184626_1006858423300018053Groundwater SedimentLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184623_1052191223300018056Groundwater SedimentAMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184637_1001619963300018063Groundwater SedimentMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIHIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184618_1046834123300018071Groundwater SedimentMRFRLGVCLVVLSIAAVAFAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSANGARTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGAARLERVK
Ga0184640_1007407113300018074Groundwater SedimentMLRFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGTAANGVKTAGKITAGGGKASFQSTTSAGTVTWSQEDGKDMLVFVRADGRGSAKLQRVK
Ga0184632_1032701913300018075Groundwater SedimentMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIHIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184609_1039869723300018076Groundwater SedimentMLGFRLGVCLVVFSLTAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTIAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184627_1006016533300018079Groundwater SedimentMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0184629_1018412813300018084Groundwater SedimentMLGFRFGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGAKTVGKITTGGGKASFQSANSAGSVAWSQEDGKD
Ga0190272_1010446233300018429SoilMMRLRLAVGLLVLSLATVAFAGPDRPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGFSANGAKTVGKITATGGKASFQSASSAGSVTWSQEGGKDMLFFVRADGRGSAKLERVK
Ga0190272_1029854333300018429SoilMLGFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDVLLFVRGDGRGSAKLERVK
Ga0184646_123049623300019259Groundwater SedimentMLGFRLGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITAGGGKALFQSATSAGTVAWSQEDGKDMLLFVRADGRASAKLQRVK
Ga0187894_1005313733300019360Microbial Mat On RocksMMRLGLGVCLLVCSLAAVAFAGPDKPIANVKSLTGDWRALGGASAAAIRIKADGSYEGIAANGTKTVGKITTAGGKASFQSANSAGTVTWSQEAGKDVLLFVRADGRGSAKLERVK
Ga0193723_101821023300019879SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWGALGGASAAAIRIKPDGAYEGTAANGVRTVGRITIADGKASFQSANSAGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTAERVKAK
Ga0193707_102876423300019881SoilMRFRLGVCLVVLSLAAVAFAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSANGAKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRSSARLERVK
Ga0193725_101405143300019883SoilMLGFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0193725_111730813300019883SoilMRFRLGVCLIILSLAAVSFAGSDRPITSVKSLAGEWGALGGASAAAIRIKPDGAYEGTAANGVRTVGRITIADGKASFQSANSAGHVTWIQEGGKDVLLFVRGDGRGSAKLARVNTVERVKAK
Ga0193711_100054643300019997SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRINPDGAYEGTAANGVRTVGRITLVDGKASFQSANSAGRVTWIQEGGKDVLLFARGDGRGSAKLERVNTAERVKAK
Ga0193710_100605113300019998SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGVRTVGRITIADGKASFQSANSAGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTAERVKAK
Ga0193739_100745233300020003SoilMLGFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIHIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0193755_100449633300020004SoilMRFRLGVCLVVLSLAAVAFAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSANGAKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRGSARLERVK
Ga0193755_101962833300020004SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWGALGGASAAAIRIKPDGAYEGTAANGVRTVGRITIVDGKASFQSANSAGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTAERVKAK
Ga0193717_109788123300020060SoilMTRLRLAVGLLVLSLATVAFAGPDRPIASVKSLTGDWRAIGGASAAAIRIKPDGSYEGFSANGAKTVGKITATGGKASFQSASSAGSVTWSQEGGKDMLFFVRADGRGSAKLERVK
Ga0180118_128724013300020063Groundwater SedimentAMLRFRLGVCLVVCSLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGIAANGTKTVGKITTAGGKASFQSANSAGTVTWSREDGKDMLLFVRADGRGSAKLQRVK
Ga0180109_147456113300020067Groundwater SedimentMLRFRLGVCLVVCSLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGIAANGTKTVGKITTAGGKASFQSANSAGTVTWSREDGKDMLLFVRADGRGSAKLQRVK
Ga0184649_146807923300020068Groundwater SedimentMGFRLGVCLVVLSLAAVAFAGPDKPIANLKSLTGDWRAMGGASAAAIRIKSDGSYEGIAANGVKTAGKITTAGGKTSFQSANSAGTVTWSQEGGKDVLLFVRGDGRGSAKLERVK
Ga0210378_1006137633300021073Groundwater SedimentMLGFRLGVCLVVFSLTAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0210404_1001760233300021088SoilMLRRTLMVSLIVCGLAGAAVAGPEKPITNVKSLAGDWRAVGGATAAAIRIKTDGSYEGTSANGAKTAGKITATGGKGSFQSTSAVGSVAWSQEGGNDVLTFMRADGRGTAKLQRVK
Ga0193719_1005723733300021344SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWGALGGASAAAIRIKPDGAYEGTAANGVRTVGRITIADGKASFQSANSAGRVTWIQEGG
Ga0224452_101789933300022534Groundwater SedimentMLGFRLGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGAAKLQRVK
Ga0209640_1002414623300025324SoilMLGFRLGVCLVVWSLAAVAFAGPDRPIANVKSLIGDWRAMDGASAAAIRIKPDGSYEGVAANGAKTVGKITIAGGKASFQSANSAGTVTWSREDGKDMLLFVRADGRGSAKLRRVK
Ga0207423_100358623300025535Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGLSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGKDMLLFVRADGRGSAKLERVK
Ga0207684_1002359683300025910Corn, Switchgrass And Miscanthus RhizosphereMRSMMVRMAVVVALAVGGLAAVVSAGPDKPIPSVKTLAGEWRAAGDASKASIRIKGDGTYEGTSANGKKTTGQVTAIGGKASFKSATAAGSVTLSQEGGKDMLTFVSADGRSSAKLQRVK
Ga0207660_1003995333300025917Corn RhizosphereMMHVRLRVCLIVLSLATVAFAGPDKPIRDVKSLTGDWRATGGGPAAIRIKPDGSYEGISANGAKTVGKITATGGTASFQSANSAGSVTWSQEGGKDVLLFVRADGRASAKLERVK
Ga0208914_100938623300026102Natural And Restored WetlandsMRFRLGVCLVVLSLAAVAYAGADRPIANVKSLTGDWRAMGGVSSAAIRIKPDGSYEGIAANGAKTVGKITAAGGKASFQSANSAGTVTWSQEGGKDMLLFVRADGRGSAKLERVK
Ga0209438_103491823300026285Grasslands SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTVGRITIVDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERVKAK
Ga0209131_132049713300026320Grasslands SoilMRFRLGVCLIILSLAAVAFAGSDRPITSVKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGARTLGRITIVDGKASFQSANSAGRVTWIREGGNDVLLFVRGDGRGSAKLERVNTAERV
Ga0257169_100817723300026469SoilMLGFRVGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGPVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0257165_100051633300026507SoilMLGFRVGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0257165_106271613300026507SoilMRFRLGVCLVVLSLAAVAFAGPDKPIASVKSLSGDWRALGGASAAAIRIKPDGSYEGLSANGTKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRG
Ga0209592_133218513300027731Freshwater SedimentMRFRLGVCLLVLSLAAVAFAGPDKPITNVKSLTGDWRAMGGASAAAIRIKPDGSYEGISANGAKTVGKITTVGGKASFQSANSAGSVTWSQEGGKDVLFFVRGDGRGSAKLERVK
Ga0209726_1004928243300027815GroundwaterMLGFRLGVCLVVVGLAAVAFAGPDKPIANVKSLTGDWRAMSGASAAAIRIKPDGSYEGTAANGTKTAGKITTAGGKASFQSANSAGTVTWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0209283_1001401263300027875Vadose Zone SoilMLGFRVGVCLVVFSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGTAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVT
Ga0209382_1095095723300027909Populus RhizosphereMRLGLGVCLLVCSLVAVAFAGPDKPIANVKSLTGDWRALGGASAAAIRIKADGSYEGIAANGTKTVGKITAAGGKASFQSANSAGTVTWSQEAGKEMLLFVRADGRGSAKLERVK
Ga0307293_1006528533300028711SoilMRLRLGVCLIVLSLAAIAYAGPDKPIANVKSLSGDWRAMGGASAAAIRINPDGSYEGLSANGAKTAGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVARDGRGSA
Ga0307504_1002021823300028792SoilMRFRIGVCLIILSLAAVAFAGSDRPITSLKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGVRTVGRITLVDGQASFQSASSAGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTVERVKAK
(restricted) Ga0255311_103845423300031150Sandy SoilMRFRLGVCLVVLSLAAVAFAGPDRPITSLKSLTGEWRALGGASAAAIRIKPDGSYEGTAANGIRTVGKIRLADGKASFQSASSAGSVTWIQEGGKDVLLFARGDGRGSAKLERVNTVERAKTK
Ga0307469_1020299433300031720Hardwood Forest SoilMMRLRLAVCLVVLSLAGVAFAGPDKPILDVKSLTGDWRAVGGGAAAIRIKPDGSYEGISANGAKTAGKITTAGGKASFQSANSAGGVTWSQEGGKDVLLFVRGDGRGSAKLERVK
Ga0307469_1052920113300031720Hardwood Forest SoilMRFRLGVCLIILSLAAVAFAGSDRPITSAKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGVRTIGRITIADGKASFQSANSTGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTAERAKAK
Ga0307468_10026413323300031740Hardwood Forest SoilMRFRLGVCLIILSLAAVAFAGSDRPITSAKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGVRTIGRITIADGKASFQSANSTGRVTWIQEGGK
Ga0307468_10233797213300031740Hardwood Forest SoilMRFRLGVCLVVLSLAAGAFAGPDKPIANVKSLSGDWRAMGGASAAAIRIKPDGSYEGLSANGAKTVGKITTAGGKASFQSAKSAGSVTWSQEAGKDVLLFVAGDGRSSARLERVK
Ga0307471_10068632223300032180Hardwood Forest SoilMRFRLGVCLIILSLAAVAFAGSDRPITSAKSLAGEWRALGGASAAAIRIKPDGAYEGTAANGVRTIGRITIADGKASFQSANSAGRVTWIQEGGKDVLLFVRGDGRGSAKLERVNTAERAKAK
Ga0307471_10187669213300032180Hardwood Forest SoilMMRLRLAVCLVVLCLAGVAFAGPDKPILDVKSLTGDWRAVGGGAAAIQIKPDGSYEGISANGAKTAGKITTASGKASFQSANSAGGVTWSQEGGKDVLLFVRGDGRGSAKLERVK
Ga0334722_1025829723300033233SedimentMRSRLAVVLLVLSLAGVAFAGPDKPIASVKSLTGDWRAMGGASAAAIRIKPDGSYEGISANGTKTVGKITLAGGKASFQSANSAGSVTWSQEGGKDVLFFVRADGRGSAKLERVK
Ga0214471_1006651013300033417SoilMLGFRLGVCLVVWSLAAVAFAGPDRPIANVKSLTGDWRAMGGASAAAIRIKPDGSYEGTAANGAKTVGKITSAGGKASFQSANSAGTVTWSREDGKDMLLF
Ga0316628_10207986723300033513SoilLAAVAFAGPDKPITNVKSLTGDWRAMGGASAAAIRIKPDGSYEGISANGAKTVGKITTVGGKASFQSANSAGSVTWSQEGGKDVLFFVRGDGRGSAKLERVK
Ga0364940_0036265_387_7013300034164SedimentLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIHIKPDGSYEGIAANGVKTAGKITTGGGKASFQSATSAGTVAWSQEDGKDMLLFVRADGRGSAKLQRVK
Ga0364931_0217876_2_2893300034176SedimentMLGFRFGVCLVVLSLAAVAFAGPDKPIANVKSLTGDWRAISGASAAAIRIKPDGSYEGIAANGVKTAGKITTGGGKASFQSANSAGTVAWSQEDGK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.