NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F061881



Overview

Basic Information
Family ID F061881
Family Type Metagenome / Metatranscriptome
Number of Sequences 131
Average Sequence Length 75 residues
Representative Sequence MSLSRDSRQERKLKAIRRAQFRLKQNLERIDFEEDVLLPEIEALKSGKSVLGLPAGAAFDIQIEDENPGQS
Number of Associated Samples 93
Number of Associated Scaffolds 131

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 94.66 %
% of genes near scaffold ends (potentially truncated) 20.61 %
% of genes from short scaffolds (< 2000 bps) 74.81 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.58

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Unclassified (68.702 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil
(21.374 % of family members)
Environment Ontology (ENVO) Unclassified
(41.221 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(38.931 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.38%, β-sheet: 0.00%, Coil/Unstructured: 61.62%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.58

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
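The low-confidence rule quoted above (pTM < 0.7) can be sketched as a simple check. This is only an illustration: the function and constant names are hypothetical, and the 0.7 cut-off is taken from the note above rather than from any published NMPFamsDB code.

```python
# Threshold quoted from the note above: pTM < 0.7 marks a low-confidence model.
# LOW_CONFIDENCE_PTM and is_low_confidence are hypothetical names for this sketch.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm_score, threshold=LOW_CONFIDENCE_PTM):
    """Return True when a model's pTM-score is below the threshold."""
    return ptm_score < threshold


print(is_low_confidence(0.58))  # True: the F061881 model is flagged as low confidence
```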



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 131 Family Scaffolds
PF07087    DUF1353    3.05
PF03412    Peptidase_C39    2.29
PF01555    N6_N4_Mtase    1.53
PF05014    Nuc_deoxyrib_tr    1.53
PF07589    PEP-CTERM    1.53
PF13539    Peptidase_M15_4    1.53
PF01464    SLT    0.76
PF13365    Trypsin_2    0.76
PF09588    YqaJ    0.76
PF14359    DUF4406    0.76
PF08241    Methyltransf_11    0.76
PF09594    GT87    0.76
PF12762    DDE_Tnp_IS1595    0.76
PF13560    HTH_31    0.76
PF01381    HTH_3    0.76
PF01973    MptE-like    0.76
PF02945    Endonuclease_7    0.76
PF01569    PAP2    0.76
PF13356    Arm-DNA-bind_3    0.76

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 131 Family Scaffolds
COG0863    DNA modification methylase    Replication, recombination and repair [L]    1.53
COG1041    tRNA G10 N-methylase Trm11    Translation, ribosomal structure and biogenesis [J]    1.53
COG2189    Adenine specific DNA methylase Mod    Replication, recombination and repair [L]    1.53
COG3613    Nucleoside 2-deoxyribosyltransferase    Nucleotide transport and metabolism [F]    1.53
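The percent frequencies in the Pfam and COG tables above can be turned back into approximate scaffold counts. The sketch below assumes each percentage is a scaffold count divided by the 131 family scaffolds and rounded to two decimals; the page does not state this explicitly, and the helper name is hypothetical.

```python
FAMILY_SCAFFOLDS = 131  # "Number of Associated Scaffolds" for family F061881


def scaffolds_from_percent(percent, total=FAMILY_SCAFFOLDS):
    """Estimate the scaffold count behind a rounded percent frequency.

    Assumes percent == count / total * 100, rounded to two decimals.
    """
    return round(percent / 100 * total)


# The four distinct frequencies that appear in the tables above.
for percent in (3.05, 2.29, 1.53, 0.76):
    print(percent, scaffolds_from_percent(percent))
```

Under that assumption, 3.05 % corresponds to 4 scaffolds, 2.29 % to 3, 1.53 % to 2, and 0.76 % to a single scaffold.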



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
Unclassified    root    N/A    68.70 %
All Organisms    root    All Organisms    31.30 %


Associated Scaffolds


Scaffold    Taxonomy    Length
3300000033|ICChiseqgaiiDRAFT_c2105332    Not Available    554
3300000033|ICChiseqgaiiDRAFT_c2110026    Not Available    509
3300000033|ICChiseqgaiiDRAFT_c2110733    Not Available    541
3300001838|RCM33_1050645    Not Available    520
3300002223|C687J26845_10123261    All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions    956
3300002460|C687J35021_10125635    Not Available    1025
3300002460|C687J35021_10352208    Not Available    547
3300003267|soilL1_10121013    Not Available    1356
3300003313|P32013IDBA_1035353    All Organisms → cellular organisms → Bacteria → Spirochaetes → unclassified Spirochaetota → Spirochaetes bacterium GWB1_59_5    1533
3300003324|soilH2_10285833    Not Available    3150
3300003324|soilH2_10309150    Not Available    1427
3300003324|soilH2_10358895    All Organisms → cellular organisms → Bacteria    4860
3300004463|Ga0063356_103556249    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    671
3300005332|Ga0066388_102795058    Not Available    892
3300005341|Ga0070691_10339243    Not Available    831
3300005343|Ga0070687_100000106    All Organisms → cellular organisms → Bacteria    28120
3300005406|Ga0070703_10264246    Not Available    703
3300005406|Ga0070703_10330159    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    645
3300005438|Ga0070701_10951536    Not Available    596
3300005441|Ga0070700_100615668    Not Available    853
3300005444|Ga0070694_100119657    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1888
3300005468|Ga0070707_101241205    Not Available    712
3300005529|Ga0070741_10153655    All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Cellvibrionales → Cellvibrionaceae → Cellvibrio → unclassified Cellvibrio → Cellvibrio sp. KY-GH-1    2308
3300005529|Ga0070741_11157548    Not Available    655
3300005544|Ga0070686_100056763    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Giovannonibacteria → Candidatus Giovannonibacteria bacterium GW2011_GWB1_47_6b    2512
3300005544|Ga0070686_100721729    Not Available    797
3300005546|Ga0070696_100023346    Not Available    4204
3300006358|Ga0068871_100548072    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1047
3300006755|Ga0079222_11481842    Not Available    634
3300006804|Ga0079221_10569327    Not Available    755
3300006845|Ga0075421_100909436    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1004
3300006846|Ga0075430_100419194    Not Available    1105
3300006847|Ga0075431_101194068    Not Available    724
3300006853|Ga0075420_101648371    Not Available    549
3300006876|Ga0079217_10812874    Not Available    651
3300007004|Ga0079218_13581993    Not Available    527
3300007734|Ga0104986_1912    Not Available    45101
3300009147|Ga0114129_10290106    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    2184
3300009156|Ga0111538_10107423    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    3545
3300009162|Ga0075423_10400175    Not Available    1441
3300009162|Ga0075423_11623940    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium    695
3300009162|Ga0075423_12309688    Not Available    585
3300009285|Ga0103680_10000510    All Organisms → cellular organisms → Bacteria    41134
3300009285|Ga0103680_10015184    All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions    5324
3300009285|Ga0103680_10016656    All Organisms → cellular organisms → Bacteria    4993
3300009597|Ga0105259_1002417    Not Available    3295
3300009610|Ga0105340_1084627    Not Available    1261
3300009610|Ga0105340_1431294    Not Available    588
3300009678|Ga0105252_10068491    Not Available    1376
3300009678|Ga0105252_10091108    All Organisms → Viruses → Predicted Viral    1212
3300009678|Ga0105252_10157519    Not Available    951
3300009678|Ga0105252_10401299    Not Available    627
3300011400|Ga0137312_1037019    Not Available    766
3300011415|Ga0137325_1000147    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Phyllobacteriaceae → Mesorhizobium → Mesorhizobium caraganae    16067
3300011416|Ga0137422_1023582    Not Available    1453
3300011421|Ga0137462_1000192    All Organisms → cellular organisms → Bacteria    8099
3300011430|Ga0137423_1141831    Not Available    721
3300011431|Ga0137438_1099618    Not Available    884
3300011432|Ga0137428_1003365    All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage    5204
3300011435|Ga0137426_1152029    Not Available    680
3300011437|Ga0137429_1124359    Not Available    793
3300011439|Ga0137432_1019023    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1968
3300011439|Ga0137432_1275676    Not Available    539
3300011440|Ga0137433_1063364    Not Available    1114
3300011443|Ga0137457_1126254    Not Available    835
3300011443|Ga0137457_1180356    Not Available    712
3300011443|Ga0137457_1245179    Not Available    613
3300012022|Ga0120191_10018979    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium    958
3300012160|Ga0137349_1091143    Not Available    564
3300012210|Ga0137378_10821790    Not Available    842
3300012357|Ga0137384_11551582    Not Available    512
3300012530|Ga0136635_10367658    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    526
3300014148|Ga0180010_1029481    Not Available    526
3300014613|Ga0180008_1036156    Not Available    2002
3300014745|Ga0157377_10537595    Not Available    823
3300014876|Ga0180064_1091233    Not Available    641
3300015371|Ga0132258_12478981    Not Available    1298
3300018082|Ga0184639_10387504    Not Available    723
3300018083|Ga0184628_10001270    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    12558
3300018083|Ga0184628_10005236    Not Available    6197
3300018083|Ga0184628_10144159    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1238
3300018083|Ga0184628_10171818    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1131
3300018083|Ga0184628_10286614    Not Available    865
3300018083|Ga0184628_10496384    Not Available    632
3300018084|Ga0184629_10503875    Not Available    630
3300018084|Ga0184629_10707713    Not Available    507
3300019458|Ga0187892_10002122    All Organisms → cellular organisms → Bacteria    41434
3300019487|Ga0187893_10022730    Not Available    7643
3300020020|Ga0193738_1001839    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Ardenticatenia → Ardenticatenales → Ardenticatenaceae → Candidatus Promineofilum → Candidatus Promineofilum breve    8342
3300020020|Ga0193738_1005238    Not Available    4501
3300020020|Ga0193738_1014481    Not Available    2532
3300020202|Ga0196964_10000993    All Organisms → cellular organisms → Bacteria → Proteobacteria    15208
3300020202|Ga0196964_10003099    All Organisms → cellular organisms → Bacteria → Proteobacteria    7780
3300020202|Ga0196964_10143603    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    1077
3300020215|Ga0196963_10194806    Not Available    875
3300020215|Ga0196963_10210969    Not Available    842
3300020215|Ga0196963_10239666    Not Available    790
3300020215|Ga0196963_10290567    Not Available    719
3300021062|Ga0196974_1100166    Not Available    503
3300023179|Ga0214923_10000993    All Organisms → cellular organisms → Bacteria    41358
3300024187|Ga0247672_1023341    Not Available    991
3300024232|Ga0247664_1018542    Not Available    1618
3300024283|Ga0247670_1026352    Not Available    1042
3300024331|Ga0247668_1094326    Not Available    606
3300025324|Ga0209640_10552897    Not Available    933
3300025918|Ga0207662_10003177    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    8399
3300025922|Ga0207646_10077294    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    2975
3300025922|Ga0207646_10707752    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    900
3300026118|Ga0207675_102324996    Not Available    549
3300027362|Ga0208320_1002585    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Bacilli → Bacillales → Staphylococcaceae → Staphylococcus → Staphylococcus pettenkoferi    2056
3300027533|Ga0208185_1096933    Not Available    693
3300027533|Ga0208185_1105219    Not Available    661
3300027573|Ga0208454_1012671    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    2081
(restricted) 3300027799|Ga0233416_10055657    Not Available    1337
3300027909|Ga0209382_11153323    Not Available    796
3300028592|Ga0247822_11286440    Not Available    613
3300028809|Ga0247824_10703974    Not Available    616
3300031578|Ga0307376_10063303    All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium    2661
3300031787|Ga0315900_10455780    Not Available    987
3300031854|Ga0310904_10588862    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium    758
3300031857|Ga0315909_10025612    All Organisms → cellular organisms → Bacteria    5829
3300031858|Ga0310892_11390487    Not Available    504
3300031911|Ga0307412_11713401    Not Available    576
3300032144|Ga0315910_10859537    Not Available    706
3300032770|Ga0335085_11006453    Not Available    900
3300033551|Ga0247830_10462428    Not Available    995
3300033551|Ga0247830_10620975    Not Available    856
3300033551|Ga0247830_10738423    Not Available    782
3300033551|Ga0247830_11096865    Not Available    635
3300033551|Ga0247830_11369569    Not Available    565
3300034191|Ga0373909_0326014    Not Available    501
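Each scaffold identifier in the table above appears to combine a sample Taxon OID and a scaffold name, joined by "|"; the prefix 3300000033, for example, also appears as a Taxon OID in the Associated Samples table. A minimal sketch of splitting the two parts follows. The function name is hypothetical and the interpretation of the prefix is an inference from this page, not a documented format.

```python
from typing import Tuple


def split_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts.

    Assumes a single '|' separates the sample Taxon OID from the scaffold name.
    """
    taxon_oid, _, scaffold_name = scaffold_id.partition("|")
    return taxon_oid, scaffold_name


print(split_scaffold_id("3300000033|ICChiseqgaiiDRAFT_c2105332"))
```

Grouping scaffolds by the first element of the returned pair gives the number of family members contributed by each sample.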

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Soil    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil    21.37%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere    7.63%
Populus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere    7.63%
Groundwater Sediment    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment    6.87%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil    6.87%
Soil    Environmental → Terrestrial → Soil → Sand → Desert → Soil    6.11%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    3.05%
Agricultural Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil    3.05%
Sugarcane Root And Bulk Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil    3.05%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    3.05%
Switchgrass Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere    3.05%
Soil    Environmental → Terrestrial → Soil → Loam → Unclassified → Soil    3.05%
Groundwater    Environmental → Aquatic → Freshwater → Drinking Water → Chlorinated → Groundwater    2.29%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    2.29%
Groundwater    Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater    1.53%
Freshwater    Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater    1.53%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    1.53%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    1.53%
Freshwater    Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater    0.76%
Freshwater    Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater    0.76%
Sediment    Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment    0.76%
Marine Plankton    Environmental → Aquatic → Freshwater → Lotic → Unclassified → Marine Plankton    0.76%
Polar Desert Sand    Environmental → Aquatic → Freshwater → Ice → Unclassified → Polar Desert Sand    0.76%
Terrestrial    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial    0.76%
Ore Pile And Mine Drainage Contaminated Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Ore Pile And Mine Drainage Contaminated Soil    0.76%
Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil    0.76%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil    0.76%
Soil    Environmental → Terrestrial → Soil → Clay → Unclassified → Soil    0.76%
Microbial Mat On Rocks    Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks    0.76%
Bio-Ooze    Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze    0.76%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    0.76%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere    0.76%
Arabidopsis Thaliana Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere    0.76%
Miscanthus Rhizosphere    Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere    0.76%
Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere    0.76%
Miscanthus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere    0.76%
Sediment Slurry    Engineered → Bioremediation → Metal → Unclassified → Unclassified → Sediment Slurry    0.76%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID    Sample Name    Habitat Type
3300000033    Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil    Environmental
3300001838    Marine plankton microbial communities from the Amazon River plume, Atlantic Ocean - RCM33, ROCA_DNA217_0.2um_bLM_C_2a    Environmental
3300002223    Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_1.2    Environmental
3300002460    Soil microbial communities from Rifle, Colorado - Rifle CSP2_plank highO2_1.2    Environmental
3300003267    Sugarcane bulk soil Sample L1    Environmental
3300003313    Ore pile and mine drainage contaminated soil microbial communities from Mina do Sossego, Brazil - P3 sample    Environmental
3300003324    Sugarcane bulk soil Sample H2    Environmental
3300004463    Combined assembly of Arabidopsis thaliana microbial communities    Host-Associated
3300005332    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)    Environmental
3300005341    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG    Environmental
3300005343    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG    Environmental
3300005406    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG    Environmental
3300005438    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaG    Environmental
3300005441    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG    Environmental
3300005444    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG    Environmental
3300005468    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG    Environmental
3300005529    Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1    Environmental
3300005544    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG    Environmental
3300005546    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG    Environmental
3300006358    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2    Host-Associated
3300006755    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter    Environmental
3300006804    Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200    Environmental
3300006845    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5    Host-Associated
3300006846    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4    Host-Associated
3300006847    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5    Host-Associated
3300006853    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4    Host-Associated
3300006876    Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200    Environmental
3300007004    Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost    Environmental
3300007734    Freshwater viral communities from Lake Soyang, Gangwon-do, South Korea - SYL_2015Jan    Environmental
3300009147    Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)    Host-Associated
3300009156    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2)    Host-Associated
3300009162    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2    Host-Associated
3300009285    Microbial communities from groundwater in Rifle, Colorado, USA - 2A_0.1um    Environmental
3300009597    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299    Environmental
3300009610    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700    Environmental
3300009678    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100    Environmental
3300011400    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT133_2    Environmental
3300011415    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT469_2    Environmental
3300011416    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT551_2    Environmental
3300011421    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT769_2    Environmental
3300011430    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT600_2    Environmental
3300011431    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT157_2    Environmental
3300011432    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT718_2    Environmental
3300011435    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2    Environmental
3300011437    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2    Environmental
3300011439    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2    Environmental
3300011440    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT840_2    Environmental
3300011443    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2    Environmental
3300012022    Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - C6    Environmental
3300012160    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT630_2    Environmental
3300012210    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG    Environmental
3300012357    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG    Environmental
3300012530    Polar desert sand microbial communities from Dry Valleys, Antarctica - metaG UQ85 (21.06)    Environmental
3300014148    Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - OS_PW_MetaG    Environmental
3300014613    Groundwater microbial communities from the Aspo Hard Rock Laboratory (HRL) deep subsurface site, Sweden - MM_PW_MetaG    Environmental
3300014745    Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaG    Host-Associated
3300014876    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200_16_10D    Environmental
3300015371    Combined assembly of cpr5 and col0 rhizosphere and soil    Host-Associated
3300018082    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2    Environmental
3300018083    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1    Environmental
3300018084    Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1    Environmental
3300019458    Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG    Environmental
3300019487    White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG    Environmental
3300020020    Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a1    Environmental
3300020202    Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10    Environmental
3300020215    Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_5    Environmental
3300021062    Soil microbial communities from Anza Borrego desert, Southern California, United States - S1_10-13C    Environmental
3300023179    Freshwater microbial communities from Lake Lanier, Atlanta, Georgia, United States - LL-1510    Environmental
3300024187    Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK13    Environmental
3300024232    Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK05    Environmental
3300024283    Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK11    Environmental
3300024331    Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09    Environmental
3300025324    Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)    Environmental
3300025918    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes)    Environmental
3300025922    Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)    Environmental
3300026118    Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)    Host-Associated
3300027362    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 (SPAdes)    Environmental
3300027533    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 (SPAdes)    Environmental
3300027573    Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 (SPAdes)    Environmental
3300027799 (restricted)    Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MG    Environmental
3300027909    Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)    Host-Associated
3300028592    Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30    Environmental
3300028809    Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48    Environmental
3300031578    Soil microbial communities from Risofladan, Vaasa, Finland - TR-2    Environmental
3300031787    Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 12 MA114    Environmental
3300031854    Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C48D1    Environmental
3300031857    Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125    Environmental
3300031858    Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C8D2    Environmental
3300031911    Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-1    Host-Associated
3300032144    Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soil    Environmental
3300032770    Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5    Environmental
3300033551    Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5    Environmental
3300034191    Uranium-contaminated sediment microbial communities from bioreactor in Oak Ridge, Tennessee, United States - B4A4.1    Engineered

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the restricted entries below requires obtaining a license from the corresponding author(s) of the respective dataset.

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiDRAFT_21053322 | 3300000033 | Soil | VSLSRDSRTERKLKAIRMAQHRLRQRLETIDWIEDHLLPEIEALKSGKSVLGLDAGSAFDIRIEPDDHSDPKQ*
ICChiseqgaiiDRAFT_21100261 | 3300000033 | Soil | MSSLSRDSRQERKLKAVRRAQFALKQRLEKIDWEEDELLPEVMALKAGNSVLGLDAGSAFDIRIEPDDHSDPKQ*
ICChiseqgaiiDRAFT_21107331 | 3300000033 | Soil | VSLSRDSRTERKLKAIRMAQHRLRQRLETIDWIEDHLLPEIEALKSGKSVLGLDAGSAFDIRIEPDDHSDPK*
RCM33_10506452 | 3300001838 | Marine Plankton | MSLSRDSRTEKKLKAFLRAKFRLKQQFEYIDYCEDVLIPEVEKLKQGKSVLGLDAGTAFDLVIDDSRPDSPEAQDAE*
C687J26845_101232611 | 3300002223 | Soil | MSLSRDSRQERKLKALRRAQFRLTQRFEYIDWEEDVLIPEVEALKAGRPVFGLTDGRMFEIEIEESHADPDQTPP
C687J35021_101256352 | 3300002460 | Soil | MSLSRDSRQERKLKALRRAQFRLTQRFEYIDWEEDVLIPEVEALKAGRPVFGLTDGRMFEIEIEESHADPDQTPPDHTEPDPGTR*
C687J35021_103522081 | 3300002460 | Soil | MSLSRDSRQERKLKAIRMAQFRLKQRLEAIDFLEDVLMPEIEALKAGKSVLGLPENVAFDIQVITDADPRYPATGESD*
soilL1_101210132 | 3300003267 | Sugarcane Root And Bulk Soil | MSLSRDSRQERKLKALRRAQFRLQQTFQRIDWEEDVLLPEIEALKAGKSVLGLPEGAAFDIQIDHADSDQKSPDGSK*
P32013IDBA_10353532 | 3300003313 | Ore Pile And Mine Drainage Contaminated Soil | VLSRDSRAERKLKAVRRAQFRLKQSLERIDFEEDVLLPEIEALKAGKPVLGLPEGAAFDIQVVADEDPNPRSAISTE*
soilH2_102858334 | 3300003324 | Sugarcane Root And Bulk Soil | VSLSRDSRVERKLKAIRRAEFALKQRLERIDFEEDVLLPEIEALKAGKSVLELPENTAFDIIVEHDADPRSRKTRRAR*
soilH2_103091503 | 3300003324 | Sugarcane Root And Bulk Soil | LPLSRDSRSERKWKLWRRTEFRLKQGLERIDWEEDVLLPEIEAMKAGKAVGELPAGAAFDIKVIEDGSEI*
soilH2_103588954 | 3300003324 | Sugarcane Root And Bulk Soil | VPLSRDSRQIREWKLIRRTQFRLRQGLERIRWEEEELIPEVEALLSGAKSVAGLPAGAAFDIVIEDDAHPDSTPTDPAGE*
Ga0063356_1035562492 | 3300004463 | Arabidopsis Thaliana Rhizosphere | VSLSRDSRAERKLKALRRAQFRLKQNFERIDWEEDELLPEVEALKSGKSVLGLPEGVAFDIQIEHATPDPAKTSDG*
Ga0066388_1027950582 | 3300005332 | Tropical Forest Soil | MSLSRDSRQERKMKALMRAQFRLQQNFDRIDWEEDVLLPEIQNLKAGKSVLGLDAGSAFDLRIEPDDHQGPTQ*
Ga0070691_103392431 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | VSLSRDSRQERKLKAIRLAQFRLKQRLEAIDWIEDVLQPEIEALKAGKSVLGLPEGAAFDIVIEAEDADPNPATASDTGE*
Ga0070687_10000010628 | 3300005343 | Switchgrass Rhizosphere | MSLSRDSRQERKLKALRRAQFALKQRFERIDFEEDVLMPEVEALKAGKSVLGLEAGTAFDIKVGVE*
Ga0070703_102642461 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MALSRDSRQERKLKAYRRAQFRLQQHLERIDWEEDFLLPELEALKSGKSVLSLPEGAVFDIQIEHEVQADVSPKADG*
Ga0070703_103301591 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLSRDSRQERKLKALRRAQFRLQQNFQRIDWEEDVLLPEIEALKAGKSVLGLPEGAAFDIQIEDDADRSQDQNRAE*
Ga0070701_109515361 | 3300005438 | Corn, Switchgrass And Miscanthus Rhizosphere | VSLSRDSRQERKLKAVRRTMFRLKQNLERIDWEEDVLLPEIEALKAGKSVLGLPDGSAFDIQIVNDANPSHSQTPETD*
Ga0070700_1006156682 | 3300005441 | Corn, Switchgrass And Miscanthus Rhizosphere | VSLSRDSRQERKLKALRWAQFRLKQRLEAIDWQEDVLLPEIEALKAGRSVLGLPEGVAFDIQIESDASPRQDTTEN*
Ga0070694_1001196573 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVIEHEVQNPSEAAKPDTDA*
Ga0070707_1012412051 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | VSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVI
Ga0070741_101536552 | 3300005529 | Surface Soil | MPLSRDSRSERKWKAYRRAEFRLKQALERIDWEEEILMPEIEALKSGKAVAGLPAGAAFDIRIEQE*
Ga0070741_111575483 | 3300005529 | Surface Soil | MPLSRDSRSERKWKAYRRAEFRLKQALERIDWEEETLMPEIEALKSGKAVAGLPAGAAFDIRVEQE*
Ga0070686_1000567631 | 3300005544 | Switchgrass Rhizosphere | MPLSRDSRSERKLKAIRRAEFRLRQNLDRIDWEEEVLMPEIEALKSGKSVLGLPANTAFEIEVVNEVSHPRKARRSK*
Ga0070686_1007217292 | 3300005544 | Switchgrass Rhizosphere | LSLSRDSRQERKLKAFRRAQFALKQRLERIDWEEDKLMPEIEGLKAGRSVLGLPEGTVFDIKIEAE*
Ga0070696_1000233465 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | LALSRDSRQERKLKAFRRAQFRLKQSLERIDWEEDVLMPEIEALKGGRSVLGLPEGAAFDITIEQDEVQNPRANEPN*
Ga0068871_1005480722 | 3300006358 | Miscanthus Rhizosphere | VSIARDSRAEKKLKAYRRAQFALKQRFERIDFEEDVLYPEIEALKAGKSVLGLPANSAFDMVVVDGADPVTRQKPSDG*
Ga0079222_114818422 | 3300006755 | Agricultural Soil | SLSRDSRQERKLKALRRAQFRLKQSLERIDWEEDTLLPEIEALKSGKSVLGLPDGAAFDICIEDENHDPAKDTH*
Ga0079221_105693272 | 3300006804 | Agricultural Soil | VLSRDSRQERKLKALRRAQFRLKQSLERIDWEEDTLLPEIEALKSGKSVLGLPEGAAFDICIEDHDPSKDAQ*
Ga0075421_1009094362 | 3300006845 | Populus Rhizosphere | MSLSRDSRQERKLKAIRRAQFRLKQNLERIDFEEDVLLPEIEALKSGKSVLGLPAGAAFDIQIEDENPGQS*
Ga0075430_1004191942 | 3300006846 | Populus Rhizosphere | MSLSRDSRQERKLKAVRRAQFRLKQTIERIDWEEDILLPEIEALKAGKSILGLPEGAAFDIVIESSDAHPAAATPGDARE*
Ga0075431_1011940682 | 3300006847 | Populus Rhizosphere | MSLSRDSRQERKLKAIRRAQFRLKQNLERIDWEEDELLPEIEALKSGKSVLGLPAGAAFDIQIEDENPGQS*
Ga0075420_1016483711 | 3300006853 | Populus Rhizosphere | MALSRDSRQEQKLKAIRRAQFRLAQRLATIDWENDVLMPEIEALKAGKPVLGLSEGTAFEIVIDHANHDQGQADRGGSAKRSRRR*
Ga0079217_108128742 | 3300006876 | Agricultural Soil | MSLSRDSRAERKLKAVRRAQFNLKQALERIDWIEDQLLPEVEALKAGKPVLGLPEGVAFDIQVVSDEDSNTPSANHASE*
Ga0079218_135819931 | 3300007004 | Agricultural Soil | VSLSRDSRQERKLKALRRAQFRLKQALERIDWEEDTLLPEIEALKSGKSVLGLPEGAAFD
Ga0104986_191222 | 3300007734 | Freshwater | VPLSRDSRQERKLKALRRAQFRLKQSLERIDWEEDTLLPEIEALKSGKSVLGLPDGAAFDICIEDHDPAKDAH*
Ga0114129_102901066 | 3300009147 | Populus Rhizosphere | VSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVIEHEVQNPSEAAKPDADA*
Ga0111538_101074238 | 3300009156 | Populus Rhizosphere | MSLSRDSRQEKKLKAIRLAQHRLRQRLETIDWIEDHLLPEIEALKSGKSVLGLEAGSAFDIRIESDDHHNQDAPASSDKGHHD*
Ga0075423_104001753 | 3300009162 | Populus Rhizosphere | VSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVIEHEAIQESTEDADN*
Ga0075423_116239401 | 3300009162 | Populus Rhizosphere | PCKAEAQLLSRDSRQERKLKAVRRAQFRLKQALERIDFEEDVLLPEIEALKSGKGVLGLPDGVAFDIEIVHETPDPAKTDSAD*
Ga0075423_123096881 | 3300009162 | Populus Rhizosphere | MSLSRDSRQERKLKAVRRAQFRLKQAIERIDWEEDYLLPEVEALKAGKSVLGLEAGSAFDIQVENQKPVKSKRR*
Ga0103680_1000051012 | 3300009285 | Groundwater | MSLSRDSRQERKLKALRRAQFRLKQRFESIDFEEDVLLPEIEALKSGRSVLGLAEGTAFDLQIVDDAHPDSTKIDA*
Ga0103680_100151846 | 3300009285 | Groundwater | MSLSRDSRQERKLKALRRAQFRLTQRFEYIDWEEDVLIPEVEALKAGRPVFGLTDGRMFEIEIEESHADPDQTPPDHTEPDPRTR*
Ga0103680_100166567 | 3300009285 | Groundwater | LSLSRDSRQESKLKALRRAQFRLKQRFEYIDWVEDHLLPEIEALKSGKSVLGLPEGAVFDIQVDDENQNPQPPKTDEPA*
Ga0105259_10024173 | 3300009597 | Soil | MSLSRDSRQERKLKLFLRTQFRFKQGIERIDWEEDVLLPEVEKLKSGGSVLGLPEGMAFDIQVVDQSEPEKD*
Ga0105340_10846272 | 3300009610 | Soil | MLSRDSRQERKLKAFRRAQFNLKQLFERIDWTEDVLLPEVEALKSGKSILGLPEGAAFDLVIEHENPDPPAPSDTGQ*
Ga0105340_14312941 | 3300009610 | Soil | MSLSRDSRQERKLKLFLRTQFRFKQGIERIDWEEDVLLPEVEKLKSGGSVLGLPDGMAFDIQVVDANSTHPKADNSSE*
Ga0105252_100684914 | 3300009678 | Soil | LSLSRDSRQERKLKLLLRTQFRFKQGLERIDFEEDVLLPEIEKLKSGGSVLGLPEGVAFDIHIDHENPDPPASTPEHRPRT
Ga0105252_100911082 | 3300009678 | Soil | VSLSRDSKQERKLKLLLRTQFRLKQGFERIDFEENELLPQIAAMKAGKSVLGLEPGAAFDIQIEPEKPKRRRT*
Ga0105252_101575191 | 3300009678 | Soil | LSRDSRTERKLKLLRRTEVRLKQGFERIDFEEDVLGPEIEALKAGKPILGLEAGSVFDIQIVDDADPDRS*
Ga0105252_104012991 | 3300009678 | Soil | NSLGPLTLTSKRGDVSLSRDSRTEKKLKALRRAEFGLKQRFERIDWEEDVLLPEVEKLKSGGSVLGLPEGMAFDIQVVDADPDHSKTDKPTE*
Ga0137312_10370192 | 3300011400 | Soil | VSLSRDSRIEQKLKAVRLAEKRLEQRLEMIDWREDVLLPELEAFKSGKSVLGLPAGVAFDIRIDHVDPNHPKTDDTER*
Ga0137325_10001477 | 3300011415 | Soil | MLSRDGRQERKLKAIRWAQFRLKQRLEAIDFMEDVLGPEVEALKSGKSVLGLAEGMAFDIKVVDHADSDHPQTDDAE*
Ga0137422_10235822 | 3300011416 | Soil | MSLSRDSRTEKKLKALLRTKFRLKQQFEYIDYCEDVLIPEVEKLKAGKPVLGLDAGMAFDLVIDDDANPNPPQTNDTH*
Ga0137462_10001929 | 3300011421 | Soil | MRSEFRLKQGLERIDWEEDVLLPEIEAYKSGKSVLGMPAGSAFDIKVEHVAQVQDGSKAADPNTAADNATRSEN*
Ga0137423_11418312 | 3300011430 | Soil | MSLSRDSRQERKLKALRRAEFRLKQSFERIDWEEDCLLPEITALKSGKSVLGLGEGMAFDIRVE
Ga0137438_10996182 | 3300011431 | Soil | MSLSRDSRQERKLKALRRTQFRLRQAFERIDWEEDELLPEIEALKSGKSILGLPEGAAFDIQIQHENTDSIKSDENQ*
Ga0137428_10033652 | 3300011432 | Soil | LSLSRDSRAERQLKAIRRAQFRLKQTLERIDWEEDKLLPEIEALKSGKPVLGLPEGSAFEIVIDDADQNPS*
Ga0137426_11520291 | 3300011435 | Soil | LSLSRDSRQERKLKLLLRTQFRFKQGLERIDFEEDVLLPEIEKLKSGGSVLGLPEGVAFDIHIDHEN
Ga0137429_11243592 | 3300011437 | Soil | LSLSRDSRQERKLKLLLRTQFRFKQGLERIDFEEDVLLPEIEKLKSGGSVLGLPEGVAFDIHIDHENPDPPASTPEHRPRTP
Ga0137432_10190233 | 3300011439 | Soil | MLSRDGRQERKLKAIRWAQFRLKQRLEAIDFMEDELGPEVEALKSGKSVLGLADGMAFDIKVVDADSNHPKTDDAGE*
Ga0137432_12756762 | 3300011439 | Soil | MSLSRDSRQERKLKAIRMAQWRLKQRLETIDFLEDELQPEVEALKSGKSVLGLPEGVAFDIVIEAEEKK*
Ga0137433_10633641 | 3300011440 | Soil | VLSRDSRQERKLKAIRWAQHRLAQRLQTIDWIEDVLTPEIEAIKSGGSVLGLPAGEAFDIQI
Ga0137457_11262542 | 3300011443 | Soil | MSLSRDSRQERKFKAVRRAQFRLKQNLERIDWEEDELLPEIAAMKSGESFMGLPAGTAFDIQIESHADQNHPAPADCTDNATRSEN*
Ga0137457_11803562 | 3300011443 | Soil | VPLSRDSRTEKKLKALRRAEFKLRQQFEYIDWSEDVLFPEIEALKSGRPVLGLEAGAVFDLQVTDAHTSRDEARKPAG
Ga0137457_12451791 | 3300011443 | Soil | VSLSRDSKQERKLKLLLRTQFRLKQGFERIDFEENELLPQIAAMKAGKSVLGLEPGAAFDIQIEPEKPKRKR*
Ga0120191_100189792 | 3300012022 | Terrestrial | MPLNRDNRAEKKLKALRRAQWRLKQRLETIDWEEDDFIPEMEALKSGKSVLGLPAGAAFDIKVENEIPNHEPTDTDE*
Ga0137349_10911432 | 3300012160 | Soil | VSLSRDSRQERKLKAIRWAQFRLKQRLEAIDFMEDVLGPEIEQLKSSGSVLDLPPCEAFDIVLEADVATPDQTTGEN*
Ga0137378_108217902 | 3300012210 | Vadose Zone Soil | MSLSRDSRQEKKVKALLRAQYGLKVRLERIDWEEDYFLPLLEDLKAGKSVLGLPAGAAFDVEIVDENSSDNKTK*
Ga0137384_115515821 | 3300012357 | Vadose Zone Soil | MSLSRDSRQEKKVKALLRAQYGLKVRLERIDWEEDYFLPLLEDLKAGKSVLGLPAGAAFDVEIVDENSS
Ga0136635_103676581 | 3300012530 | Polar Desert Sand | SRDSRQERKLKAIRMAQFRLRQRLETIDWLEDVLQPEVEALKSGKSVLGLPDGVAFDIQIEPDENPH*
Ga0180010_10294811 | 3300014148 | Groundwater | MSLSRDSRQERKLKALRRAQFRLKQRFEVIDFEEDVLLPEIEALKSGKSVLGLLEGAAFDIQVVDEGMTN*
Ga0180008_10361563 | 3300014613 | Groundwater | MSLSRDSRQERKLKAIRRAQIRLKQRFELIDWEEDVLLPEIEALKSGKAVLGLTEGAAFDIQVVDETSHENPDPPTTD*
Ga0157377_105375953 | 3300014745 | Miscanthus Rhizosphere | MSLSRDSRTERKLKAIRMAQWRLKQRLETIDFLEDELQPEIEKLKSGQSVLGLPDGVAFDLVI
Ga0180064_10912331 | 3300014876 | Soil | VSLSRDSRQERKLKLFLRTQFRFKQGIERIDWEEDVLLPEVEKLKSGGSVLGLPEGMAFDIQVVDSADSDHPKTDTPTE*
Ga0132258_124789811 | 3300015371 | Arabidopsis Rhizosphere | VLSRDSRQERKLKAVRRTMFRLKQNLERIDWEEDVLLPEIEALKAGKSVLGLPDGSAFDIQIVTDAHPDHSQTPETD*
Ga0184639_103875042 | 3300018082 | Groundwater Sediment | MSLSRDSRQERKLKAIRWAQHRLKQRIETIDWMDDSLGPEVEALKSGKSVLGLPAGVAFDIQVVDENSDH
Ga0184628_1000127011 | 3300018083 | Groundwater Sediment | VSLSRDSRQERKLKALRRTQFRLKQAFDRIDWEEDVLMPEIEALKSGKSVLGLDAGSAFDIQIEDGPSDHQNQGE
Ga0184628_100052367 | 3300018083 | Groundwater Sediment | VSLSRDSKQERKLKLLLRTQFRLKQGFERIDFEENELLPQIAAMKAGKSVLGLEPGAAFDIQIEPEKPKRKR
Ga0184628_101441594 | 3300018083 | Groundwater Sediment | VSLSRDSMQERKLKLLLRTQFRLKQGFQRIEFEENELLPQIAAMKAGKSVLGLEPGAAFDIQVEPEKPKRKR
Ga0184628_101718183 | 3300018083 | Groundwater Sediment | MSLSRDSRAERKLKAIRRAQFRLRQAMDRIDWEEDTLQPEIEALKSGRSVLGLPEGSAFDIEIVHEVTDPTTPQ
Ga0184628_102866142 | 3300018083 | Groundwater Sediment | LSLSRDSRQERKLKALRRTQFRLKQAFDRIDWEEDVLMPEIEALKSGKSVLGLEPGSAFNIQIEDDHQDQGE
Ga0184628_104963841 | 3300018083 | Groundwater Sediment | VLSRDSRQERKLKAFRRAQFRLKQQFEYIDWTEDVLLPEVEALKSGKSILGLPEGAAFDLVIDENPDPSPTPNSGE
Ga0184629_105038751 | 3300018084 | Groundwater Sediment | MSLSRDSRQERKLKAYRRAQFALKQRMERIDWEEDSLAPEIATLKAGGSVLGLPEGAAFDITVEDENQNNDNGNA
Ga0184629_107077131 | 3300018084 | Groundwater Sediment | VSLSRDSRAERKLKLFRRTQFRLKQGLERIDFEEDVLGPEIEALKAGRAVLGLPEGVVFDITIEHEVPAEKPRPDTD
Ga0187892_1000212252 | 3300019458 | Bio-Ooze | MSLSRDSRQERKLKTLRRAQFRLKQRFESIDWEEDVLTPEVLALKAGKPVLGLPAGAAFDIQVEDENSNKE
Ga0187893_100227305 | 3300019487 | Microbial Mat On Rocks | MSLSRDSRQERKLKALRRAQFRLKQRFESIDWEEDVLTPEVLALKAGKPVLGLPAGAAFDIQVEDENSNKE
Ga0193738_10018393 | 3300020020 | Soil | VLSRDSRAEKKLKAFRRAQFRLKQQFEYIDWTEDSLLPEVEALKSGKSVLGLPEGAAFDLVIDANPDPATPTNPGE
Ga0193738_10052388 | 3300020020 | Soil | VLSRDSRAEKKLKAFRRAQFRLKQQFEYIDWTEDILLPEVEALKSGKSVLGLPDGAAFDLVIDASPDPAPADHTE
Ga0193738_10144814 | 3300020020 | Soil | VLSRDSRAEKKLKLYRRAQFRLKQQFEYVDWCEDFLLPEVEALKSGKSVLGLPEGAAFDLVIEGADANPSPTAPTDPSE
Ga0196964_100009931 | 3300020202 | Soil | VLSRDSRQERKLKAIRMAQWRLKQRLETIDFLEDVLQPEVEALKAGKSVLGLDEGVAFDIRIVDADSNHPAKP
Ga0196964_100030993 | 3300020202 | Soil | VSLSRDSRQEKKLKAIRLAQFRLKQRLEAIDFVEDVLLPEIEALKSGKSVLGLPEGAAFDIQIVDDAHSHQADRADAK
Ga0196964_101436032 | 3300020202 | Soil | MSLSRDSRAERKLKAFRRAQFRLKQALERIDFEEDVLLPEIAALKSGKSVLGLPEGAAFDIQIVNDADSDPNTSASGQ
Ga0196963_101948063 | 3300020215 | Soil | MSLSRDSRQERKLKAFRRAQFRLKQALQRIDWEEDVLMPEIEALKSGRSVLGLPEGAAFDITIEDDAVPAPVDHDTH
Ga0196963_102109692 | 3300020215 | Soil | VPLSRDSRQERKLKALRRAQFRLQQTLQRIDWEEDVLLPEIEALKAGKSVLGLPDGAAFDILIEHENPHPGQAADAD
Ga0196963_102396661 | 3300020215 | Soil | VLSRDSRQERKLKAIRMAQWRLKQRLETIDFLEDVLQPEVEALKAGKSVLGLDEGVAFDIRIVDADSNHPAKPDAE
Ga0196963_102905672 | 3300020215 | Soil | MPLSRDSRAEKKLKALRRAQFALQQRLQRIDWEEDVLYPEVEALKSGKSVLGLPEGVAFEIVIEQDDTK
Ga0196974_11001662 | 3300021062 | Soil | MSLSRDSRAERKLKAFRRAQFRLKQALERIDFEEDVLLPEIAALKSGKSVLGLPEGAAFDIQIVNDADS
Ga0214923_1000099345 | 3300023179 | Freshwater | MPLSRDSRAERKLKAIRRAQFRLKQTMERIDWEEDTLLPEIEALKSGKSVLGLPEGAAFDICIEDDADHDPNQAAH
Ga0247672_10233412 | 3300024187 | Soil | MSLSRDSRAERKLKAFRRAEFRLQQALERIDWEEDKLLPELEAFKSGKPVLGLPAGTMFDIQIEGDENPNHPARKASRKRSSHRR
Ga0247664_10185426 | 3300024232 | Soil | MPLSRDSRQERKLKAVRRAQFRLQQNLERIDWEEDVLMPEIEALKSGKSVLGLGDGVAFDIVIEHENSDSTEADGAGTD
Ga0247670_10263523 | 3300024283 | Soil | MPLSRDSRQERKLKAVRRAQFRLQQNLERIDWEEDVLMPEIEALKSGKSVLGLGDGVAFDIVIEHENPDSTEADGAGTD
Ga0247668_10943262 | 3300024331 | Soil | LSRDSRQERKLKAVRRAQFRLQQNLERIDWEEDVLMPEIEALKSGKSVLGLGDGVAFDIVIEHENSDSTEADGAGTD
Ga0209640_105528972 | 3300025324 | Soil | MSLSRDSRQERKLKALRRAQFRLKQRFESIDWEEEVLLPEIEALKSGRSVLGLTEGAAFDIQIVDEAPADEDSRPVKAKSRSK
Ga0207662_1000317712 | 3300025918 | Switchgrass Rhizosphere | MSLSRDSRQERKLKALRRAQFALKQRFERIDFEEDVLMPEVEALKAGKSVLGLEAGTAFDIKVGVE
Ga0207646_100772942 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVIEHEVQNPSEAAKPDTDA
Ga0207646_107077521 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | VSLSRDSRQERKLKAVRRAQFRLKQNLERIDWEEDSLLPEIEALKSGKSVLGLPDGMAFDIVIEHEVQNPSEAAKPDTDA
Ga0207675_1023249961 | 3300026118 | Switchgrass Rhizosphere | MSLSRDSRQERKLKALRRAQFRLQQNFQRIDWEEDVLLPEIEALKAGKSVLGLPEGAAFDIQIEDDADRSQDQNRAE
Ga0208320_10025852 | 3300027362 | Soil | MSLSRDSRQERKLKLFLRTQFRFKQGIERIDWEEDVLLPEVEKLKSGGSVLGLPEGMAFDIQVVDQSEPEKD
Ga0208185_10969332 | 3300027533 | Soil | VSLSRDSRQERKLKAIRWAQFRLKQRLEAIDFMEDVLGPEIEQLKSSGAVLGLPAGEAFDIQIVADANPDHTTS
Ga0208185_11052192 | 3300027533 | Soil | MSLSRDSRQERKLKLFLRTQFRFKQGIERIDWEEDVLLPEVEKLKSGGSVLGLPDGMAFDIQVVDANSTHPKADNSSE
Ga0208454_10126712 | 3300027573 | Soil | VSLSRDSKQERKLKLLLRTQFRLKQGFERIDFEENELLPQIAAMKAGKSVLGLEPGAAFDIQIEPEKPKRRRT
(restricted) Ga0233416_100556572 | 3300027799 | Sediment | MSLSRDSRQERKLKAFRRAQFRLKQRLESIDFEEDVLLPEIEALKAGRSVLGLPPGAAFDITIENDAIQTHANTPDDADEN
Ga0209382_111533231 | 3300027909 | Populus Rhizosphere | MSLSRDSRQERKLKAIRRAQFRLKQNLERIDFEEDVLLPEIEALKSGKSVLGLPAGAAFD
Ga0247822_112864402 | 3300028592 | Soil | MSLSRDSRQERKLKAFRRAQFRLQQAIQRIDWEEDILQPEIEALKSGKSVLGLPAGAAFDIVIEAS
Ga0247824_107039742 | 3300028809 | Soil | MSLSRDSRQERKLKAFRRAQFRLQQAIQRIDWEEDILQPEIEALKSGKSVLGLPAGAAFDIVIEASDDQVQAAPADTSDDSY
Ga0307376_100633035 | 3300031578 | Soil | MSLSRDSRQERKLKAILRAQFRLKQRFESIDWEEDVLMPEVEALKSGRSVLGLEAGTAFDLQVVDDRGDDAHPHTDDNGTQH
Ga0315900_104557801 | 3300031787 | Freshwater | VSLSRDSRTEKKLKALRRAEFRLRQQFEYIDWTEDVLYPEIEALKAGRPVLGLEAGAVFDLRVADAHPHSAVPHAEPGSDP
Ga0310904_105888622 | 3300031854 | Soil | VSLSRDSRQERKLKALRRAQFRLQQNFQRIDWEEDVLLPEIEALKAGKSVLGLPEGAAFDIQIEDDADRSQDQNRAE
Ga0315909_100256121 | 3300031857 | Freshwater | VSLSRDSRTEKKLKALRRAEFRLKQQFEYIDWTEDVLIPEIDSLKAGRPVLGLEAGALFDLQVAD
Ga0310892_113904872 | 3300031858 | Soil | MALSRDSRQERKLKAVRRAQFRLKQSLERIDFEEDVLMPEIEALKSGKSILGLPAGAAFDIQVDHADSHHSPTTGRPKRSKRVSH
Ga0307412_117134011 | 3300031911 | Rhizosphere | VSLSRDSRAEKKLKALRLAQFRLKQRFDMIDFVEDVLMPEVEAMKAGKPVLGLDAGAAFDLVVKHED
Ga0315910_108595371 | 3300032144 | Soil | MSLSRDSRQERKLKAFRRSQFRLKQAIERIDWEEDFLLPEIEALKAGRPVLGLPEGVAFDIQVET
Ga0335085_110064532 | 3300032770 | Soil | RDTREEKKLKALRRAEFALKQRFERIDWEEDELIPEVEALKAGHAVLGLPENTSFDIEVIDEDSDLDGS
Ga0247830_104624282 | 3300033551 | Soil | LSLSRDSRAERKLKAIRWAQHRLAQRLQTIDWQEEVLMPEIEALKSGKAVLGLVEGTAFDIEIVNETQNPGDNGKSGT
Ga0247830_106209751 | 3300033551 | Soil | MSLSRDSLAERQLKLILRTKFRLKQGLERLQWEEDILIPQIEALKSGKSVMGLPEGSSIDIEIVHEDPNPSNS
Ga0247830_107384232 | 3300033551 | Soil | MSLSRDSRQERKLKAFRRAQFRLQQAIQRIDWEEDILQPEIEALKSGKSVLGLPAGAAFDIVIEASDDQVQAAPDNTSDESY
Ga0247830_110968651 | 3300033551 | Soil | MSLSRDSRQERKLKAFRRAQFRLQQAIQRIDWEEDILQPEIEALKSGKSVLGLPAGAAFDIVIEASDDQVQAAPDNTSDDSY
Ga0247830_113695691 | 3300033551 | Soil | MSLSRDSRQERKLKAFRRAQFRLQQAIQRIDWEEDILQPEIEALKSGKSVLGLPAGAAFDIVIEALDDQVQAAPDNTSDESY
Ga0373909_0326014_41_250 | 3300034191 | Sediment Slurry | MSLSRDSRQERKMKAIRMAQWRLKQRLETIDFQEDVLQPEVEALKSGKSVLGLEAGSAFDIQVVVDNEQ




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice