NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F086756

Metagenome / Metatranscriptome Family F086756

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F086756
Family Type Metagenome / Metatranscriptome
Number of Sequences 110
Average Sequence Length 66 residues
Representative Sequence MRQNEGLQRAAAVLLQATLAMGFLTGCGVAPRPPDGTLDLPHFASQAPSKEMEPSPRLSVSAETIWAQ
Number of Associated Samples 95
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 88.68 %
% of genes near scaffold ends (potentially truncated) 27.27 %
% of genes from short scaffolds (< 2000 bps) 80.91 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score0.35

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (50.909 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(15.454 % of family members)
Environment Ontology (ENVO) Unclassified
(37.273 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(44.545 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 26.04%    β-sheet: 0.00%    Coil/Unstructured: 73.96%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.35
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 110 Family Scaffolds
PF13545HTH_Crp_2 6.36
PF00072Response_reg 5.45
PF00166Cpn10 5.45
PF07238PilZ 3.64
PF00118Cpn60_TCP1 2.73
PF01925TauE 1.82
PF01070FMN_dh 1.82
PF00132Hexapep 0.91
PF08530PepX_C 0.91
PF09360zf-CDGSH 0.91
PF09924LPG_synthase_C 0.91
PF04239DUF421 0.91
PF00572Ribosomal_L13 0.91
PF13683rve_3 0.91
PF08352oligo_HPY 0.91
PF13371TPR_9 0.91
PF01471PG_binding_1 0.91
PF12770CHAT 0.91
PF00440TetR_N 0.91
PF00106adh_short 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 110 Family Scaffolds
COG0234Co-chaperonin GroES (HSP10)Posttranslational modification, protein turnover, chaperones [O] 5.45
COG0459Chaperonin GroEL (HSP60 family)Posttranslational modification, protein turnover, chaperones [O] 2.73
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 1.82
COG0730Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 familyInorganic ion transport and metabolism [P] 1.82
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 1.82
COG0102Ribosomal protein L13Translation, ribosomal structure and biogenesis [J] 0.91
COG2323Uncharacterized membrane protein YcaP, DUF421 familyFunction unknown [S] 0.91
COG2936Predicted acyl esteraseGeneral function prediction only [R] 0.91


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A50.91 %
All OrganismsrootAll Organisms49.09 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001661|JGI12053J15887_10337926Not Available730Open in IMG/M
3300005206|Ga0068995_10013073All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1186Open in IMG/M
3300005218|Ga0068996_10059523Not Available771Open in IMG/M
3300005289|Ga0065704_10465431All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla693Open in IMG/M
3300005294|Ga0065705_10553906Not Available730Open in IMG/M
3300005295|Ga0065707_10395924All Organisms → cellular organisms → Bacteria → Acidobacteria841Open in IMG/M
3300005440|Ga0070705_100366135Not Available1056Open in IMG/M
3300005444|Ga0070694_101376286Not Available595Open in IMG/M
3300005467|Ga0070706_100043007All Organisms → cellular organisms → Bacteria → Proteobacteria4174Open in IMG/M
3300005467|Ga0070706_100966696Not Available786Open in IMG/M
3300005468|Ga0070707_101876417All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Salinispora → Salinispora pacifica566Open in IMG/M
3300005542|Ga0070732_10820091All Organisms → cellular organisms → Bacteria568Open in IMG/M
3300005549|Ga0070704_100085608Not Available2333Open in IMG/M
3300006047|Ga0075024_100005942All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria4917Open in IMG/M
3300006047|Ga0075024_100074194All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1455Open in IMG/M
3300006845|Ga0075421_100590089All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria1307Open in IMG/M
3300006847|Ga0075431_101041225All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium784Open in IMG/M
3300007265|Ga0099794_10000412All Organisms → cellular organisms → Bacteria14391Open in IMG/M
3300009038|Ga0099829_10000437All Organisms → cellular organisms → Bacteria22000Open in IMG/M
3300009038|Ga0099829_10261251Not Available1414Open in IMG/M
3300009090|Ga0099827_10160527Not Available1846Open in IMG/M
3300009803|Ga0105065_1013354All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium892Open in IMG/M
3300009804|Ga0105063_1050824Not Available593Open in IMG/M
3300009806|Ga0105081_1017025All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium863Open in IMG/M
3300009808|Ga0105071_1020398All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium945Open in IMG/M
3300009812|Ga0105067_1039900All Organisms → cellular organisms → Bacteria716Open in IMG/M
3300009814|Ga0105082_1003609All Organisms → cellular organisms → Bacteria → Proteobacteria1962Open in IMG/M
3300009816|Ga0105076_1132024Not Available503Open in IMG/M
3300011270|Ga0137391_10126675All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2218Open in IMG/M
3300011271|Ga0137393_10190301All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1728Open in IMG/M
3300011443|Ga0137457_1314347Not Available534Open in IMG/M
3300012035|Ga0137445_1111504Not Available556Open in IMG/M
3300012096|Ga0137389_10359385All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1240Open in IMG/M
3300012174|Ga0137338_1016256All Organisms → cellular organisms → Bacteria → Proteobacteria1427Open in IMG/M
3300012203|Ga0137399_10054498All Organisms → cellular organisms → Bacteria2923Open in IMG/M
3300012203|Ga0137399_10377463All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1180Open in IMG/M
3300012205|Ga0137362_10964523All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium727Open in IMG/M
3300012361|Ga0137360_11777832Not Available522Open in IMG/M
3300012362|Ga0137361_10506163Not Available1109Open in IMG/M
3300012918|Ga0137396_10171347All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1587Open in IMG/M
3300012918|Ga0137396_10512117Not Available889Open in IMG/M
3300012929|Ga0137404_11849186Not Available562Open in IMG/M
3300014870|Ga0180080_1104357Not Available505Open in IMG/M
3300014884|Ga0180104_1168687Not Available652Open in IMG/M
3300017997|Ga0184610_1079482All Organisms → cellular organisms → Bacteria1014Open in IMG/M
3300018031|Ga0184634_10200861All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria907Open in IMG/M
3300018052|Ga0184638_1300524All Organisms → cellular organisms → Bacteria543Open in IMG/M
3300018052|Ga0184638_1333458Not Available510Open in IMG/M
3300018056|Ga0184623_10301993Not Available723Open in IMG/M
3300018056|Ga0184623_10428653Not Available576Open in IMG/M
3300018075|Ga0184632_10147935Not Available1036Open in IMG/M
3300018075|Ga0184632_10172601All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla954Open in IMG/M
3300018076|Ga0184609_10059178All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1652Open in IMG/M
3300018076|Ga0184609_10303837Not Available746Open in IMG/M
3300018076|Ga0184609_10363272Not Available675Open in IMG/M
3300018084|Ga0184629_10236088Not Available953Open in IMG/M
3300018429|Ga0190272_10369482Not Available1153Open in IMG/M
3300019249|Ga0184648_1300650All Organisms → cellular organisms → Bacteria569Open in IMG/M
3300019259|Ga0184646_1380311Not Available534Open in IMG/M
3300019487|Ga0187893_10200983All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria1533Open in IMG/M
3300019877|Ga0193722_1115284Not Available626Open in IMG/M
3300019879|Ga0193723_1053186All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1188Open in IMG/M
3300019879|Ga0193723_1191615Not Available517Open in IMG/M
3300019882|Ga0193713_1014807All Organisms → cellular organisms → Bacteria2339Open in IMG/M
3300019883|Ga0193725_1011422All Organisms → cellular organisms → Bacteria2513Open in IMG/M
3300019886|Ga0193727_1193320Not Available514Open in IMG/M
3300019998|Ga0193710_1008409All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1037Open in IMG/M
3300020003|Ga0193739_1023201All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1617Open in IMG/M
3300020004|Ga0193755_1104294All Organisms → cellular organisms → Bacteria895Open in IMG/M
3300020004|Ga0193755_1180005Not Available620Open in IMG/M
3300020063|Ga0180118_1059224Not Available1091Open in IMG/M
3300021073|Ga0210378_10122784All Organisms → cellular organisms → Bacteria1007Open in IMG/M
3300021081|Ga0210379_10386809Not Available617Open in IMG/M
3300021090|Ga0210377_10025218All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria4307Open in IMG/M
3300021972|Ga0193737_1024473Not Available854Open in IMG/M
3300022694|Ga0222623_10145512Not Available923Open in IMG/M
3300022756|Ga0222622_10505679Not Available864Open in IMG/M
3300025324|Ga0209640_10126636All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2193Open in IMG/M
3300025910|Ga0207684_10074629All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla2881Open in IMG/M
3300025922|Ga0207646_11632800Not Available555Open in IMG/M
3300026285|Ga0209438_1057502All Organisms → cellular organisms → Bacteria1282Open in IMG/M
3300026340|Ga0257162_1000105All Organisms → cellular organisms → Bacteria → Proteobacteria6220Open in IMG/M
3300026354|Ga0257180_1039515Not Available656Open in IMG/M
3300026489|Ga0257160_1051177Not Available714Open in IMG/M
3300026496|Ga0257157_1087367Not Available541Open in IMG/M
3300026514|Ga0257168_1115331Not Available598Open in IMG/M
3300026551|Ga0209648_10792191Not Available518Open in IMG/M
3300027013|Ga0209884_1030993Not Available563Open in IMG/M
3300027645|Ga0209117_1027424All Organisms → cellular organisms → Bacteria1799Open in IMG/M
3300027681|Ga0208991_1067073All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1082Open in IMG/M
3300027815|Ga0209726_10035671All Organisms → cellular organisms → Bacteria → Proteobacteria3741Open in IMG/M
3300027915|Ga0209069_10048222All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium2001Open in IMG/M
3300027947|Ga0209868_1004218Not Available1327Open in IMG/M
3300027952|Ga0209889_1037474All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Sphingomonadales → Sphingomonadaceae → Sphingomonas → unclassified Sphingomonas → Sphingomonas sp. AX61041Open in IMG/M
3300028536|Ga0137415_10249393All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1584Open in IMG/M
3300028536|Ga0137415_10275478Not Available1488Open in IMG/M
3300028792|Ga0307504_10220168Not Available682Open in IMG/M
3300028828|Ga0307312_10334878Not Available989Open in IMG/M
(restricted) 3300031150|Ga0255311_1039932All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla984Open in IMG/M
(restricted) 3300031150|Ga0255311_1055202Not Available839Open in IMG/M
3300031720|Ga0307469_11932695Not Available572Open in IMG/M
3300032180|Ga0307471_100738489Not Available1150Open in IMG/M
3300033233|Ga0334722_10052271All Organisms → cellular organisms → Bacteria3220Open in IMG/M
3300033433|Ga0326726_10022304All Organisms → cellular organisms → Bacteria5504Open in IMG/M
3300034643|Ga0370545_049589Not Available812Open in IMG/M
3300034773|Ga0364936_022282All Organisms → cellular organisms → Bacteria1067Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil15.45%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil13.64%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment10.91%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand9.09%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere7.27%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment5.45%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil4.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.55%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds2.73%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere2.73%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.73%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands1.82%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.82%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.82%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil1.82%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.82%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.91%
GroundwaterEnvironmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater0.91%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.91%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.91%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.91%
Microbial Mat On RocksEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks0.91%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.91%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere0.91%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.91%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.91%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.91%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005206Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D2EnvironmentalOpen in IMG/M
3300005218Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005542Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1EnvironmentalOpen in IMG/M
3300005549Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaGEnvironmentalOpen in IMG/M
3300006047Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013EnvironmentalOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300009806Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60EnvironmentalOpen in IMG/M
3300009808Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50EnvironmentalOpen in IMG/M
3300009812Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60EnvironmentalOpen in IMG/M
3300009814Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_50_60EnvironmentalOpen in IMG/M
3300009816Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011443Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012174Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2EnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013100Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C6-5 metaGHost-AssociatedOpen in IMG/M
3300014870Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT560_16_10DEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018056Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019249Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019259Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019877Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m1EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2m2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3a2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2a2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H3m1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1a2EnvironmentalOpen in IMG/M
3300020063Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021972Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2m2EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025927Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026489Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-AEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027013Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027681Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027915Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)EnvironmentalOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300034643Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_120 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034773Sediment microbial communities from East River floodplain, Colorado, United States - 4_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
JGI12053J15887_1033792623300001661Forest SoilMRQHECLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVSAEAIWAQ*
Ga0063356_10043566443300004463Arabidopsis Thaliana RhizosphereVNSHVAAVLLVTALAMAILVGCGVTPRPADGIRDLPHFANQAPSRETDPSPR*
Ga0068995_1001307313300005206Natural And Restored WetlandsMRPNGRLQHAAAVLLQATLAMGILVACGVAPRPADGTLDLPHFANQAPSKEMEPSPRLSH
Ga0068996_1005952333300005218Natural And Restored WetlandsMRPNGRLQHAAAVLLQATLAMGILVACGVAPRPADGTLDLPHFANQAPSKEMEPSPRLS
Ga0065704_1046543123300005289Switchgrass RhizosphereAAAVLLQTTLAMGILVGCVVPPRPADGTPDLPHFANQAPSREMEPSPHLSVERETGFKNGW*
Ga0065705_1055390613300005294Switchgrass RhizosphereSEPDDETEGSLQRAAVALLQATLAMGILIGCRVATRPSDGTLDLPHFAYQAPALGMDPSPRLSVATEAIWAQ*
Ga0065707_1025316443300005295Switchgrass RhizosphereMRRGGGLQRAAAVLLQTTLAMGILVGCVVPPRPADGTPDLPHFAN
Ga0065707_1039592433300005295Switchgrass RhizosphereMRRDGVLQRAAVVLLQVTLAMGILIGCGAATRPADGTLDLPHFAYQAPPMGMEPSPRLSIATEAIWAQ*
Ga0070705_10036613523300005440Corn, Switchgrass And Miscanthus RhizosphereMRQHEGLRRAAAVLLQATLVVGLLSGCGVAARPADDGALDLPHFANQAPSKEMGPSPRLSVS
Ga0070694_10137628623300005444Corn, Switchgrass And Miscanthus RhizosphereMRRDGGLQRAAVVLLQATLAMGILIGCGAATRPADGTLDLPHFAYQAPSVGMEPSPRLSIATEAIWAQ*
Ga0070706_10004300723300005467Corn, Switchgrass And Miscanthus RhizosphereMRPNEGIRRAAVVFLQATLVVGFLSGCMVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWIP*
Ga0070706_10096669623300005467Corn, Switchgrass And Miscanthus RhizosphereMRPNEGIRRAAVVFLQATVVVGFLSGCVVTSRPADGALDLPHFASQAPKEKGSSPGLSVSAEAIW
Ga0070707_10187641713300005468Corn, Switchgrass And Miscanthus RhizosphereMRQHEGIQRAAASLLRATLVVGLLTGCGVAARPDDGARDLPHFASQAPSKEMGPSPRLSVSAEAGPVISGAPRSRGWDWSAGDG
Ga0070732_1082009123300005542Surface SoilLLQATLTVGFLAGCGVATRPADGTLDLPHFASQAPSKQIEPSPSLTVSAEAIWVQ*
Ga0070704_10008560813300005549Corn, Switchgrass And Miscanthus RhizosphereMRRDGGLQRAAVVLLQATLAMGILIGCGAATRPADGTLDLPHFAYQAPSVGMEPSPRSSIATEAIWAQ*
Ga0075024_10000594213300006047WatershedsMRKGRGFQGAATVLLQAALAVGCLAGCGVATRPPAGILDLPHFANQAPSKEIKGSPAVTVSAETIWVQ*
Ga0075024_10007419413300006047WatershedsMRQHEGLQRAAAVLLQATLAMGFLIGCGVATRPADDGALDLPHFASQAPSKEMGPSPRLSSRASRGLSSSIG
Ga0075421_10059008913300006845Populus RhizosphereRDEGLQRAAAVLLQATLAMGLLIGCGAATHPPDGTLDLPHFASQAGSEAMNPSHRSSGSVEAIPVR*
Ga0075431_10104122513300006847Populus RhizosphereMRRDEGLQRAAAVLLQATLAMGLLIGCGAATHPPDGTLDLPHFASQAGSEAMNPSHRSSGSVEAIPVR*
Ga0099794_1000041243300007265Vadose Zone SoilMRPNEGIRRAAVVFLQATLVVGSLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWVP*
Ga0099829_1000043783300009038Vadose Zone SoilMRQNEGLQRAAAVLVQATLAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ*
Ga0099829_1026125113300009038Vadose Zone SoilMRQHEGIQRAAAVLLQATLVVGFLSGCGVAPRPADDGALDLPHFASQVPSKEMGPS
Ga0099827_1016052733300009090Vadose Zone SoilMIAGLKQRQRSESAMRQDGGLQRAAVGLLQATLAIGILIGCGVATRPPDGTLDLPHFANQAPSTGMEPSPRLSVATEAIRAQ*
Ga0105065_101335423300009803Groundwater SandMRQNEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGHGVSRTWVQ*
Ga0105063_105082413300009804Groundwater SandRQNEGLQRAVAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMELPPRLSVFAAELARDVGHGVSRTWVQ*
Ga0105081_101702523300009806Groundwater SandMRQNEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMELPPRLSVFAAELASDVGHGVSRTWVQ*
Ga0105071_102039813300009808Groundwater SandMRQHEGIQRAAAVLLQATLVMGFLSGCGVATRQDDGARDLPHFASQAPSKEVDPSPRLTVSPETIWIQ*
Ga0105067_103990023300009812Groundwater SandEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGYGVSRTWVQ*
Ga0105082_100360943300009814Groundwater SandMRQNEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGQRRL*
Ga0105076_113202413300009816Groundwater SandMRQNEGLQRAVAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGYGVSRT
Ga0137391_1012667523300011270Vadose Zone SoilMRQHEGIQRAAAVLLQATLVVGFLSGCGVAPRPADDGALDLPHFASQVPSKEMGPSPRLSVSAEAIWAQ*
Ga0137393_1019030123300011271Vadose Zone SoilMRPNEGIRRAAVVFLQATLVVGFLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWVP*
Ga0137457_131434723300011443SoilMRQNEGLQRAAAVLVQAALAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ*
Ga0137445_111150413300012035SoilMRQNEGLQRAAAVLLPATLALGLLTGCGVAPRPPDGTLDLPHFASQAPPKEMDPSTRPSVSTEAIWAQ*
Ga0137389_1035938523300012096Vadose Zone SoilMRPNEGIRRAAVVFLQATLVVGFLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWIP*
Ga0137338_101625623300012174SoilMRQNEGLQRATAVLLPATLALGLLIGCGAATRPPDGTLDLPHFASRAPSKAMQPSPRSSVSAEAIWAQ*
Ga0137399_1005449813300012203Vadose Zone SoilMRQDGGLQRAAVVLLQATLAMGILIGCGVAARPVDGTLDLPHFANQAPAKGMEPSPRLSVATEAIWAQ*
Ga0137399_1037746313300012203Vadose Zone SoilEGLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVAAEAIWAQ*
Ga0137362_1096452323300012205Vadose Zone SoilMRQHEGLRRAAAVLLQATLVVGLLSGCGVAARPDDGARDLPHFASQAPSKEMGPSPRL
Ga0137360_1177783213300012361Vadose Zone SoilMRPHEGLRRAAAVLLQATLVVGLLSGCGVAARPDDGARDLPHFASQAPSKEMGPSP
Ga0137361_1050616313300012362Vadose Zone SoilMRQPEGLRCAAAVLLQATLAVGLLSGCGVAARPDDGARDLPHFASQAPSKEMGPSPRLSVSAEAGPVISGAPRSRGWDWSAG
Ga0137396_1017134733300012918Vadose Zone SoilMRQHEGLRRAAAVLLQATLVVGWLSGCGVAARPDDGARDLPHFASQAPSKEMGPSPRLSVSAEAGPVISGAPRSRG*
Ga0137396_1051211723300012918Vadose Zone SoilMRQHEGLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVAAEAIWAQ*
Ga0137404_1184918613300012929Vadose Zone SoilMRQDGGLQRAAVVLLQATLAMGILIGCGVAARPVDGTLDLPHFANQAPAKGLEPSPRLSVATEAIWAQ*
Ga0157373_1116076613300013100Corn RhizosphereLVVNSHVAAVLLVTALAMAILVGCGVTPRPADGIRDLPHFANQAPSRETDPSPR*
Ga0180080_110435713300014870SoilMRQNEGLQRAAAVLLPATLALGLLTGCGVAPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ*
Ga0180104_116868723300014884SoilMRQNEGLQRAAAVLLQAALAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKALDPSPRLSVSAEAIWAQ*
Ga0184610_107948223300017997Groundwater SedimentMRQNEGLQRAAAVLLQAALAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0184634_1020086123300018031Groundwater SedimentMRQNEGLQRAAAVLLQATLAMGFLTGCGVAPRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAP
Ga0184638_130052413300018052Groundwater SedimentMRQNEGLQRAAAVLLQATLAIALLIGCGVATRPPDGTLDLPHFASQAPSKAMDPSPRLNVSAEAIWAQ
Ga0184638_133345813300018052Groundwater SedimentMRQNEGLQRAAAVLLQATLAMGFLTGCGVAPRPPDGTLDLPHFASQAPSKTISAEAIWAQ
Ga0184623_1030199313300018056Groundwater SedimentMRQNEGLQRAAGVLLQAALAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0184623_1042865313300018056Groundwater SedimentMRQNEGLQRAAAVLVQAALAMGFLTGCGVVPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0184632_1014793533300018075Groundwater SedimentMRQNEGLQRAAAVLLQAALAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0184632_1017260123300018075Groundwater SedimentGLQRASAVLLQATLAMGFLVGCGVATRPADGTLDLPHFARQAPSKEVESSPRLSVSAEAIWAQ
Ga0184609_1005917823300018076Groundwater SedimentMRQHEGLQRASAVLLQATLAMGFLVGCGVATRPADGTLDLPHFARQAPSKDVESSPRLSVSGEAIWSQ
Ga0184609_1030383713300018076Groundwater SedimentMRQNEGLQRAAAVLVQATLAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0184609_1036327223300018076Groundwater SedimentMRRNEGLQRTAAGLVQAALVTGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSRRLSVSAEAIWAQ
Ga0184629_1023608813300018084Groundwater SedimentMRQNEGLQRAAAVLLQATLAMGFLTGCGVAPRPPDGTLDLPHFASQAPSKAMDPPPRLSVSAEAIWAQ
Ga0190272_1036948243300018429SoilMRQNEGLQRAAAVLLQAALAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDTSPRSSVSAEAIWAQ
Ga0184648_130065023300019249Groundwater SedimentMRQNEGLQRAAAVLVQATLAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPPPRLSVSAEAIWAQ
Ga0184646_138031113300019259Groundwater SedimentMRQNEGLQRAAAVLVQAALAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0187893_1020098323300019487Microbial Mat On RocksMRRNDGLRRAAAVVLQATLAIGLLIGCAATHPPDGTLDLPHFARQAPSEAMSPSHRSRVSAEANSAR
Ga0193722_111528413300019877SoilMRPNEGIWRAAVVFLQATLVVGFLSGCVVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEDIWVP
Ga0193723_105318623300019879SoilMRQDGRLQHAATVLLQATLAIGILVGCGVARPPDGHLDLPHFANHAPSKEWTLVH
Ga0193723_119161513300019879SoilMRQNEGLQRAAAVLVQATLAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0193713_101480733300019882SoilMLLQATLAMGILIGCGAATRPADGTFDLPHFANQAPSMGMEPSPRLSVATEAIWAQ
Ga0193725_101142233300019883SoilMRQDGGLQRAAVMLLQATLAMGILIGCGAATRPADGTFDLPHFANQAPSMGMEPAPRLSVATEAIWAQ
Ga0193727_119332013300019886SoilMRQDGGLQRAAVMLLQATLAMGILIGCGAATRPADGTFDLPHFANQAPMGMEPSPRLSVATEAIWAQ
Ga0193710_100840933300019998SoilMRPDGRLQHAATVLLQATLAIGILVGCGVARPPDGHLDLPHFANHAPSKEWTLVH
Ga0193739_102320143300020003SoilMRRNEGLQRAAAVLVQAALVMGFLTGCGVVPRPPDGTLDLPYFASQAPSKAMDPSRRSSVSAEAI
Ga0193755_110429433300020004SoilMRQHEGLQRVAAVLLQATLAMGFLFGCGVATRPPDGTLDLPHFASQVPSKATEPLTLVPKRDKS
Ga0193755_118000513300020004SoilMRQHEGLQRTAAGLLQATLVMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVSAEAIWAQ
Ga0180118_105922413300020063Groundwater SedimentMRQYECLQRAAAGFLQATLVMGFLIGCGVATRPPDGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ
Ga0210378_1012278433300021073Groundwater SedimentMRQNEGLQRAAAVLVQAALAMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSRRLSVSAEAIWAQ
Ga0210379_1038680913300021081Groundwater SedimentLLLATLTMGFLTGCGVAPRPPDGTLDLPHFASQAPSKEMGPSPRLSVSAEAIWAQ
Ga0210377_1002521843300021090Groundwater SedimentMRQNEGLQRAAAVLLQGTLALGLLVGCGATPRPPDGTLDLPYFASHVPSKPMDPSPRSSVSAEAIWAQ
Ga0193737_102447323300021972SoilMRRNEGLQRAAAVLVQAALVMGFLTGCGVVPRPPDGTLDLPYFASQAPSKAMDPSRR
Ga0222623_1014551233300022694Groundwater SedimentMRRNEGLQRAAAGLVQAALVMGFLTGCGVVPRPPDGTLDLPHFASQAPSKAMDPSRRLSVSAEAIWAQ
Ga0222622_1050567933300022756Groundwater SedimentMRQDGGLQRCRQRAAVVLLQATLVMGILIGCGVAARPVDGTLDLPHFANQAPAMGMETF
Ga0209640_1012663623300025324SoilMRQHEGLQRAAAGLLQATLAMGFLIGCGVATRQTDGTLDLPHFASQAPSREMEPSPRLSVSAEAIWAQ
Ga0207684_1007462923300025910Corn, Switchgrass And Miscanthus RhizosphereMRPNEGIRRAAVVFLQATLVVGFLSGCMVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWIP
Ga0207646_1163280013300025922Corn, Switchgrass And Miscanthus RhizosphereMRQHEGIQRAAASLLRATLVVGLLTGCGVAARPDDGARDLPHFASQAPSKEMGPSPRLSVSAEAGPVISV
Ga0207687_1028438623300025927Miscanthus RhizosphereMRRLRRAAAGLLQTTLAMGILVGCVGAPRPGNDIRDLPHIANQAPSREMEP
Ga0209438_105750223300026285Grasslands SoilMRPNEGIRRAAVVFLQATLVVGSLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSV
Ga0257162_100010543300026340SoilMRPNEGIRRAAVVFLQATLVVGFLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWVP
Ga0257180_103951513300026354SoilMRPNEGIRRAAVVFLQATLVVGSLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWVP
Ga0257160_105117713300026489SoilMRPNEGIRRAAVVFLQATLVVGSLSGCAVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEA
Ga0257157_108736713300026496SoilMRQPEGLRRAAAVLLQATVVVGLLSGCGVAARPDDGARDLPHFASQAPSKEMGPSPR
Ga0257168_111533113300026514SoilMRQPEGLRCAAAVLLQATLAVGLLSGCGVAARPDDGARDLPHFASQAPSKEMGPSPRLSVSAEAGPVISGAPRSRGWG
Ga0209648_1079219113300026551Grasslands SoilMRQRGGFQRAAAVLLQATLAVGFLAGCGVATRPADGTLDLPHFASPAPSKEIEPSPPLIVSVEAIWV
Ga0209884_103099313300027013Groundwater SandSAIRQNEGLQRAVAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGYGVSRTWVQ
Ga0209117_102742423300027645Forest SoilMRPNEGIRRAAVVFLQATLVVGFLSGCVVASRPADGALDLPHFASQAPSKEKGSSPGLSVSAEAIWVP
Ga0208991_106707333300027681Forest SoilMRQHEGLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVSAEAIWAQ
Ga0209726_1003567173300027815GroundwaterMRQNEGLQRAAAVLLQATLAMGFLTGCGVAPRPPDGTLDLPHFASQAPSKEMEPSPRLSVSAETIWAQ
Ga0209069_1004822223300027915WatershedsMRQHEGLQRAAAVLLQATLAMGFLIGCGVATRPADDGALDLPHFASQAPSKEMEPSARLSVSAEAIWAQ
Ga0209868_100421823300027947Groundwater SandMRQNEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGYGVSRTWVQ
Ga0209889_103747433300027952Groundwater SandMRQNEGLQRAAAVLLQATLAMGFLTGCGVATRQPDGTLDLPHFASQAPSKEMEPPPRLSVSAAELATDVGYGVSRT
Ga0137415_1024939323300028536Vadose Zone SoilMRQHEGLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKEMESSPRLSVAAEAIWAQ
Ga0137415_1027547813300028536Vadose Zone SoilMRQDGGLQRAAVVLLQATLAMGILIGCGVAARPVDGTLDLPHFANQAPAKGMEPSPRLSVATEAIWAQ
Ga0307504_1022016813300028792SoilMRPDGRLQHAATVLLQATLAIGILVGCGVARPPDGHLDLPHFASHAPSK
Ga0307312_1033487813300028828SoilMRQHECLQRAAAGLLQATLAMGFLIGCGVATRPADGTLDLPHFASQAPSKAMDPSPRLSVSAEAIWAQ
(restricted) Ga0255311_103993213300031150Sandy SoilMRRDGGLQRAAVVLLQTTLAMGILVGCGVAPRPADGTLDLPHIANQAPSKEMEPSPRLSAARETIWAQ
(restricted) Ga0255311_105520223300031150Sandy SoilMRRDGGLQRAAAVLLQAALAMGILVGCVGAPRPADGTLDLPHFANRAPSKDIGFKNGW
Ga0307469_1193269513300031720Hardwood Forest SoilMRPDGRLPHAATVLLQATLAIGILVGCGVARPPDGHLDLPHFANHAPSKEWTLVH
Ga0307471_10073848933300032180Hardwood Forest SoilMRRDGGLQRATVVLLQTTLVMGILVGCVAVPRPADGIRDLPHIANQAPSPHVSVARETIWTQ
Ga0334722_1005227113300033233SedimentMRRDGGLQRAAAVLLQTALAMGILVGCAGAPRPADGTLDLPHFANQAPSKDTGFKKGR
Ga0326726_1002230463300033433Peat SoilMRQRGGFQRAAAVLLQATLAVGFLAGCGVATRPADGTLDLPHFASQAPSKQIEPSYPLTVSAEAIWVQ
Ga0370545_049589_546_7523300034643SoilMRQNEGLQRAAVVLLLATLTMGFLTGCGVAPRPPDGTLDLPHFASQAPSKEMGPSPRLSVSAEAIWAQ
Ga0364936_022282_553_7593300034773SedimentMRQNEGLQRAAAVLLQAALAMGFLTGCGVAPRPPDGTLDLPYFASQAPSKALDPSPRLSVSAEAIWAR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.