NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F093448

Metagenome / Metatranscriptome Family F093448

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F093448
Family Type Metagenome / Metatranscriptome
Number of Sequences 106
Average Sequence Length 83 residues
Representative Sequence MRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV
Number of Associated Samples 88
Number of Associated Scaffolds 106

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 71.15 %
% of genes near scaffold ends (potentially truncated) 32.08 %
% of genes from short scaffolds (< 2000 bps) 82.08 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.67

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (64.151 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere
(10.377 % of family members)
Environment Ontology (ENVO) Unclassified
(23.585 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(39.623 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 59.82%    β-sheet: 0.00%    Coil/Unstructured: 40.18%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.67
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 106 Family Scaffolds
PF06863DUF1254 8.49
PF01594AI-2E_transport 6.60
PF00497SBP_bac_3 4.72
PF13505OMP_b-brl 3.77
PF13557Phenol_MetA_deg 3.77
PF13533Biotin_lipoyl_2 2.83
PF00484Pro_CA 2.83
PF00529CusB_dom_1 1.89
PF03458Gly_transporter 1.89
PF00924MS_channel 0.94
PF01464SLT 0.94
PF01103Omp85 0.94
PF13502AsmA_2 0.94
PF02321OEP 0.94
PF00076RRM_1 0.94
PF16868NMT1_3 0.94
PF01636APH 0.94
PF03781FGE-sulfatase 0.94
PF02390Methyltransf_4 0.94
PF04392ABC_sub_bind 0.94
PF00873ACR_tran 0.94

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 106 Family Scaffolds
COG5361Uncharacterized conserved proteinMobilome: prophages, transposons [X] 8.49
COG0628Predicted PurR-regulated permease PerMGeneral function prediction only [R] 6.60
COG0288Carbonic anhydraseInorganic ion transport and metabolism [P] 2.83
COG1538Outer membrane protein TolCCell wall/membrane/envelope biogenesis [M] 1.89
COG2860Uncharacterized membrane protein YeiHFunction unknown [S] 1.89
COG003016S rRNA A1518 and A1519 N6-dimethyltransferase RsmA/KsgA/DIM1 (may also have DNA glycosylase/AP lyase activity)Translation, ribosomal structure and biogenesis [J] 0.94
COG0220tRNA G46 N7-methylase TrmBTranslation, ribosomal structure and biogenesis [J] 0.94
COG0668Small-conductance mechanosensitive channelCell wall/membrane/envelope biogenesis [M] 0.94
COG1262Formylglycine-generating enzyme, required for sulfatase activity, contains SUMF1/FGE domainPosttranslational modification, protein turnover, chaperones [O] 0.94
COG2226Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenGCoenzyme transport and metabolism [H] 0.94
COG2984ABC-type uncharacterized transport system, periplasmic componentGeneral function prediction only [R] 0.94
COG3264Small-conductance mechanosensitive channel MscKCell wall/membrane/envelope biogenesis [M] 0.94
COG4122tRNA 5-hydroxyU34 O-methylase TrmR/YrrMTranslation, ribosomal structure and biogenesis [J] 0.94


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms64.15 %
UnclassifiedrootN/A35.85 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001686|C688J18823_10650862All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria671Open in IMG/M
3300002568|C688J35102_120966974All Organisms → cellular organisms → Bacteria → Proteobacteria3507Open in IMG/M
3300003267|soilL1_10009224All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria4632Open in IMG/M
3300003319|soilL2_10182059All Organisms → cellular organisms → Bacteria → Proteobacteria1593Open in IMG/M
3300003324|soilH2_10004733All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria2862Open in IMG/M
3300003995|Ga0055438_10223183All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae578Open in IMG/M
3300004009|Ga0055437_10185818Not Available660Open in IMG/M
3300004058|Ga0055498_10009113All Organisms → cellular organisms → Bacteria → Proteobacteria1255Open in IMG/M
3300004114|Ga0062593_100649015All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales1018Open in IMG/M
3300004114|Ga0062593_101635300All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales701Open in IMG/M
3300004114|Ga0062593_102909252All Organisms → cellular organisms → Bacteria → Proteobacteria547Open in IMG/M
3300004463|Ga0063356_100152078All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria2640Open in IMG/M
3300004463|Ga0063356_101271680All Organisms → cellular organisms → Bacteria1074Open in IMG/M
3300004463|Ga0063356_103708429All Organisms → cellular organisms → Bacteria → Proteobacteria658Open in IMG/M
3300004479|Ga0062595_100165877All Organisms → cellular organisms → Bacteria → Proteobacteria1312Open in IMG/M
3300004798|Ga0058859_11167887Not Available507Open in IMG/M
3300005336|Ga0070680_100121524All Organisms → cellular organisms → Bacteria → Proteobacteria2181Open in IMG/M
3300005336|Ga0070680_101104844Not Available685Open in IMG/M
3300005340|Ga0070689_100134295All Organisms → cellular organisms → Bacteria → Proteobacteria1986Open in IMG/M
3300005341|Ga0070691_10015542All Organisms → cellular organisms → Bacteria → Proteobacteria3496Open in IMG/M
3300005341|Ga0070691_10496689All Organisms → cellular organisms → Bacteria705Open in IMG/M
3300005364|Ga0070673_100185518All Organisms → cellular organisms → Bacteria → Proteobacteria1783Open in IMG/M
3300005364|Ga0070673_102165845All Organisms → cellular organisms → Bacteria → Proteobacteria528Open in IMG/M
3300005438|Ga0070701_10361156All Organisms → cellular organisms → Bacteria → Proteobacteria910Open in IMG/M
3300005440|Ga0070705_100703653All Organisms → cellular organisms → Bacteria → Proteobacteria794Open in IMG/M
3300005444|Ga0070694_100191457All Organisms → cellular organisms → Bacteria → Proteobacteria1519Open in IMG/M
3300005444|Ga0070694_100294525All Organisms → cellular organisms → Bacteria → Proteobacteria1241Open in IMG/M
3300005454|Ga0066687_10423929All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria774Open in IMG/M
3300005468|Ga0070707_101120259All Organisms → cellular organisms → Bacteria → Proteobacteria753Open in IMG/M
3300005518|Ga0070699_101223477Not Available689Open in IMG/M
3300005536|Ga0070697_101189711All Organisms → cellular organisms → Bacteria → Proteobacteria679Open in IMG/M
3300005545|Ga0070695_100512649All Organisms → cellular organisms → Bacteria → Proteobacteria929Open in IMG/M
3300005764|Ga0066903_103367959Not Available863Open in IMG/M
3300006031|Ga0066651_10536014All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pseudomonadales620Open in IMG/M
3300006844|Ga0075428_102064326All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae590Open in IMG/M
3300006854|Ga0075425_100125520All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria2931Open in IMG/M
3300006880|Ga0075429_100162981All Organisms → cellular organisms → Bacteria → Proteobacteria1953Open in IMG/M
3300006894|Ga0079215_10039619All Organisms → cellular organisms → Bacteria → Proteobacteria1755Open in IMG/M
3300006953|Ga0074063_12038271Not Available582Open in IMG/M
3300010362|Ga0126377_12081134Not Available644Open in IMG/M
3300010362|Ga0126377_12610328Not Available580Open in IMG/M
3300010398|Ga0126383_13651196Not Available503Open in IMG/M
3300010400|Ga0134122_10216434All Organisms → cellular organisms → Bacteria → Proteobacteria1598Open in IMG/M
3300010401|Ga0134121_10275161All Organisms → cellular organisms → Bacteria1482Open in IMG/M
3300011409|Ga0137323_1002520All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria4701Open in IMG/M
3300011437|Ga0137429_1018618All Organisms → cellular organisms → Bacteria → Proteobacteria1977Open in IMG/M
3300011442|Ga0137437_1004188All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax → unclassified Variovorax → Variovorax sp. URHB00205119Open in IMG/M
3300011445|Ga0137427_10046430All Organisms → cellular organisms → Bacteria → Proteobacteria1714Open in IMG/M
3300012232|Ga0137435_1125956All Organisms → cellular organisms → Bacteria → Proteobacteria777Open in IMG/M
3300012469|Ga0150984_108936535Not Available531Open in IMG/M
3300012922|Ga0137394_10882462Not Available747Open in IMG/M
3300012944|Ga0137410_10001086All Organisms → cellular organisms → Bacteria → Proteobacteria18754Open in IMG/M
3300012971|Ga0126369_11519692Not Available759Open in IMG/M
3300014166|Ga0134079_10063995Not Available1326Open in IMG/M
3300014326|Ga0157380_13383606Not Available510Open in IMG/M
3300015245|Ga0137409_11039237All Organisms → cellular organisms → Bacteria → Proteobacteria657Open in IMG/M
3300015371|Ga0132258_10732363All Organisms → cellular organisms → Bacteria2490Open in IMG/M
3300015371|Ga0132258_11518079All Organisms → cellular organisms → Bacteria → Proteobacteria1692Open in IMG/M
3300015373|Ga0132257_104394215Not Available513Open in IMG/M
3300018083|Ga0184628_10012364All Organisms → cellular organisms → Bacteria → Proteobacteria4134Open in IMG/M
3300018084|Ga0184629_10081045All Organisms → cellular organisms → Bacteria1553Open in IMG/M
3300018469|Ga0190270_11414971All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax742Open in IMG/M
3300018481|Ga0190271_11710279Not Available742Open in IMG/M
3300018481|Ga0190271_13239245All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria546Open in IMG/M
3300021082|Ga0210380_10146580All Organisms → cellular organisms → Bacteria → Proteobacteria1058Open in IMG/M
3300024219|Ga0247665_1064249Not Available526Open in IMG/M
3300024254|Ga0247661_1035519Not Available898Open in IMG/M
3300025569|Ga0210073_1120931Not Available568Open in IMG/M
3300025917|Ga0207660_10790804Not Available774Open in IMG/M
3300025941|Ga0207711_11611153Not Available593Open in IMG/M
3300025942|Ga0207689_10355371Not Available1218Open in IMG/M
3300025954|Ga0210135_1001215All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium SCGC AG-212-J231901Open in IMG/M
3300025955|Ga0210071_1009061All Organisms → cellular organisms → Bacteria1163Open in IMG/M
3300026075|Ga0207708_10534525All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria987Open in IMG/M
3300026095|Ga0207676_10270070Not Available1540Open in IMG/M
3300026118|Ga0207675_101523653Not Available689Open in IMG/M
3300026320|Ga0209131_1095003All Organisms → cellular organisms → Bacteria1581Open in IMG/M
3300027907|Ga0207428_10023097All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria5235Open in IMG/M
3300027910|Ga0209583_10206781All Organisms → cellular organisms → Bacteria → Proteobacteria841Open in IMG/M
3300028812|Ga0247825_10070576All Organisms → cellular organisms → Bacteria → Proteobacteria2337Open in IMG/M
3300028812|Ga0247825_10167832All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae1510Open in IMG/M
3300028812|Ga0247825_10423090All Organisms → cellular organisms → Bacteria942Open in IMG/M
3300031199|Ga0307495_10217357Not Available534Open in IMG/M
3300031226|Ga0307497_10136498All Organisms → cellular organisms → Bacteria → Proteobacteria1003Open in IMG/M
3300031226|Ga0307497_10427079Not Available639Open in IMG/M
3300031366|Ga0307506_10357033Not Available588Open in IMG/M
3300031548|Ga0307408_101490955Not Available639Open in IMG/M
3300031720|Ga0307469_11926685All Organisms → cellular organisms → Bacteria573Open in IMG/M
3300031740|Ga0307468_101546597Not Available617Open in IMG/M
3300031820|Ga0307473_10558489Not Available783Open in IMG/M
3300032012|Ga0310902_10873662Not Available617Open in IMG/M
3300032144|Ga0315910_10030152All Organisms → cellular organisms → Bacteria → Proteobacteria3978Open in IMG/M
3300032157|Ga0315912_10002485All Organisms → cellular organisms → Bacteria → Proteobacteria19537Open in IMG/M
3300032157|Ga0315912_10059842All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria3006Open in IMG/M
3300032174|Ga0307470_11542106All Organisms → cellular organisms → Bacteria554Open in IMG/M
3300032174|Ga0307470_11861963Not Available511Open in IMG/M
3300032180|Ga0307471_104007201All Organisms → cellular organisms → Bacteria520Open in IMG/M
3300032205|Ga0307472_100845553All Organisms → cellular organisms → Bacteria → Proteobacteria841Open in IMG/M
3300033550|Ga0247829_11391136Not Available580Open in IMG/M
3300033551|Ga0247830_10793226Not Available753Open in IMG/M
3300033551|Ga0247830_10868224All Organisms → cellular organisms → Bacteria → Proteobacteria718Open in IMG/M
3300034115|Ga0364945_0207933Not Available598Open in IMG/M
3300034664|Ga0314786_189020Not Available505Open in IMG/M
3300034818|Ga0373950_0112115All Organisms → cellular organisms → Bacteria596Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere10.38%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil9.43%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil7.55%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil6.60%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands5.66%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil4.72%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.72%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil3.77%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.77%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil2.83%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil2.83%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere2.83%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere2.83%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere2.83%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.89%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.89%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.89%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.89%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.89%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere1.89%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.94%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.94%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.94%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.94%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil0.94%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil0.94%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.94%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.94%
Host-AssociatedHost-Associated → Human → Digestive System → Large Intestine → Fecal → Host-Associated0.94%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere0.94%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.94%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere0.94%
Rhizosphere SoilHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil0.94%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere0.94%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001686Grasslands soil microbial communities from Hopland, California, USAEnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300003267Sugarcane bulk soil Sample L1EnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300003995Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2EnvironmentalOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300004798Switchgrass rhizosphere and bulk soil microbial communities from Kellogg Biological Station, Michigan, USA for expression studies - roots SR-2 (Metagenome Metatranscriptome)Host-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005340Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaGEnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005364Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaGHost-AssociatedOpen in IMG/M
3300005438Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-2 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006953Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHMB (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011409Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT423_2EnvironmentalOpen in IMG/M
3300011437Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2EnvironmentalOpen in IMG/M
3300011442Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT138_2EnvironmentalOpen in IMG/M
3300011445Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2EnvironmentalOpen in IMG/M
3300012232Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2EnvironmentalOpen in IMG/M
3300012469Combined assembly of Soil carbon rhizosphereHost-AssociatedOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300014166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015EnvironmentalOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300018083Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300021082Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redoEnvironmentalOpen in IMG/M
3300024219Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK06EnvironmentalOpen in IMG/M
3300024254Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK02EnvironmentalOpen in IMG/M
3300025569Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025954Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025955Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026095Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S7-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300027907Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)Host-AssociatedOpen in IMG/M
3300027910Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes)EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300031366Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 25_SEnvironmentalOpen in IMG/M
3300031548Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3Host-AssociatedOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032012Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3EnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032157Garden soil microbial communities collected in Santa Monica, California, United States - V. faba soilEnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300034115Sediment microbial communities from East River floodplain, Colorado, United States - 29_s17EnvironmentalOpen in IMG/M
3300034664Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300034818Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_3Host-AssociatedOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
C688J18823_1065086223300001686SoilLRHILLIAACAALVSGCESMNQKIDTSRQDRCQRAEWSQVGERDGIEAAQGMAERYARICGDMFQPDPYKEGYQKGNARRARPPV*
C688J35102_12096697443300002568SoilMRVLIAAALVVPLLVACASMNEKIDASRQDRCARANWSDVGQRDGIEAAQGMAERYAHICGDMFQPEPYKEGYQTGLARRARPPV*
soilL1_1000922433300003267Sugarcane Root And Bulk SoilLRNLLLIAACAALASGCESMNQKIDASRQDRCQRANWAEVGERDGVEGAQGMAERYARICGDMFEAAPYQEGLKKGSSRRGRAPA*
soilL2_1018205933300003319Sugarcane Root And Bulk SoilMRRELLIAACAALVCGCESMNQRIDASRQDRCQRANWSEVGERDGVEGAQGMAERYARICGDMFQPEPYKEAYQKGNARRARPPV*
soilH2_1000473333300003324Sugarcane Root And Bulk SoilLRHVLLIAVYAVLASGCESMNQKIDASRQDRCQRANWAEVGERDGVEGAQGMAERYARICGDMFEAAPYQEGLKKGSSRRGRAPA*
Ga0055438_1022318323300003995Natural And Restored WetlandsMRLMLTLAACALVTGCESFSERVDASRQDRCQRADWAKVGERDGLQGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV*
Ga0055437_1018581813300004009Natural And Restored WetlandsMRLMLTLAACALVTGCESFSEKVDASRQDRCQRADWAKVGERDGLQGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV*
Ga0055498_1000911333300004058Natural And Restored WetlandsMRLILTLAACVLVTGCESFNTRLDASRQDRCQRADWAQVGERDGVEGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV*
Ga0062593_10064901513300004114SoilMRKLPVVILVLAVSGCEAMGEKIDASRQDRCARAEWALVGERDGVTGALLQAERYQEICGDMFQPGPYREGLQKGLARRPKPAV*
Ga0062593_10163530023300004114SoilMRAPAAVALVLLVSGCEGMQEKIHDSRQDRCARADWAQVGERDGVEGIQNQAERYQMICGDMYQPGPYQEGLQKGAARRPRPPV*
Ga0062593_10290925213300004114SoilMRQALLVASGAVLVWGCESMNQKLDASRQDRCQRADWAQVGERDGVEGAQGMAERYARICGDMFQPGPYQEGLKKGSSRRGR
Ga0063356_10015207843300004463Arabidopsis Thaliana RhizosphereVRAVLMIAAVLVSGCESMNQKIDASRQDRCQRAEWSQVGERDGVEGVKDMAERYARICGDMFQAAPYQEGFQKGFARRGRSPV*
Ga0063356_10127168033300004463Arabidopsis Thaliana RhizosphereMRKFIIAAGACLPLVSACTGLNEKIDASRQDRCQRADWALVGERDGVEGAATQAERYQYICGELFQPGPYKEGLRKGLARRPRPPV*
Ga0063356_10370842923300004463Arabidopsis Thaliana RhizosphereMRILAAVALVILVSGCESMKEKVHDSRMDRCARADWAQVGERDGVTGALMQADRYAEICGDMFQPAPYKEGLQKGFARRPKPAAL*
Ga0062595_10016587733300004479SoilVRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLKK
Ga0058859_1116788723300004798Host-AssociatedMKLFLVGACAVLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPYREGLQK
Ga0070680_10012152433300005336Corn RhizosphereMRVVTATVLCAALGAGCTAMEEKVAASRQDRCQRADWVLVGERDGVEGVMSAADRYQYICGDMFQPGPYREGLQKGQARRPRPPV*
Ga0070680_10110484413300005336Corn RhizosphereMRLILATACALLVTGCESFSEKLDASRQDRCQRADWAQVGERDGVEGASMADRYAHICGDQFQPDAYKQGLQKGLARRPRPPV*
Ga0070689_10013429523300005340Switchgrass RhizosphereMKLFLVGACAVLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPYREGLQKGAARRPRPPV*
Ga0070691_1001554233300005341Corn, Switchgrass And Miscanthus RhizosphereMRLLLVTACAVLVAGCEAMGEKIDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPGPYREGLQKGTARRPRPPV*
Ga0070691_1049668913300005341Corn, Switchgrass And Miscanthus RhizosphereLLVTGCESFSEKLDASRQDRCQRADWAQVGERDGVEGASMADRYAHICGDQFQPDAYKQGLQKGLARRPRPPV*
Ga0070673_10018551833300005364Switchgrass RhizosphereMRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGEMFQPGPYQEGLRKGQARRPRPPV*
Ga0070673_10216584523300005364Switchgrass RhizosphereMRQALLVASGAVLVWGCESMNQKLDASRQDRCQRADWAQVGERDGVEGAQGMAERYARICGDMFQPGPYQEGLKKGSSRRGRAPA*
Ga0070701_1036115623300005438Corn, Switchgrass And Miscanthus RhizosphereLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPYREGLQKGAARRPRPPV*
Ga0070705_10070365323300005440Corn, Switchgrass And Miscanthus RhizosphereEAAVVRLVLLAGACALLAGCESMSQQIDASRQERCHKADWAMVGERDGVEGATTMGDRYAHICGDAYNDVAYKEGLQKGMARRPRPPV*
Ga0070694_10019145723300005444Corn, Switchgrass And Miscanthus RhizosphereVRLLLAVACALLAGCESMSQQIDASRQERCHKADWAMVGERDGVEGATTMGDRYAHICGDAYNDVAYKEGLQKGMARRPRPPV*
Ga0070694_10029452533300005444Corn, Switchgrass And Miscanthus RhizosphereMRILTATVLCAALGAGCTAMEEKVAASRQDRCQRADWVLVGERDGVEGVMNAAERYQHICGDMFQPGPYKEGLQKGQARRPRPPV*
Ga0066687_1042392913300005454SoilCASMNEKIDASRQDRCARANWSDVGQRDGIEAAQGMAERYAHICGDMFQSEPYKEGYQAGFARRARPPV*
Ga0070707_10112025913300005468Corn, Switchgrass And Miscanthus RhizosphereMRLLLAVVGCALVAGCETFNEKLDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPGPYREGLQKGTARRPRPPV*
Ga0070699_10122347723300005518Corn, Switchgrass And Miscanthus RhizosphereVRPVLLIAACAVLVSGCESMNQKIDASRQDRCQRAEWALVGERDGVEGAQGMAERYSRICGDMFQAAPYQEGFQK
Ga0070697_10118971123300005536Corn, Switchgrass And Miscanthus RhizosphereMRLALVVSACALAAGCESMSQKVDASRQERCQQADWAMVGERDGVEGATTMGDRYAHICGDAYNDVAYKEGLQKGMARRPRPPV*
Ga0070695_10051264913300005545Corn, Switchgrass And Miscanthus RhizosphereAMRLLLVTACAVLVAGCEAMGEKIDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPGPYREGLQKGTARRPRPPV*
Ga0066903_10336795943300005764Tropical Forest SoilMRLLIASACIVLVAGCESFSERVDASRQDRCQRADWAQVGERDGVEGASQAERYAHICGDLFQPAPYGEGL
Ga0066651_1053601413300006031SoilVRPVLLIAACAVLVSGCESMNQKIDASRQDRCQRAEWSQVGERDGVEGAQGMADRYSHICGDMFQPGPYQEGFQKGAARRGRSPA*
Ga0075428_10206432623300006844Populus RhizosphereLLLATACAVLVTGCESFGEKLDASRQGRCARADWAQIGERDGVEGASGMAARYAHICGDVFQSAPYQEGLRKGAARRPRPPI*
Ga0075425_10012552053300006854Populus RhizosphereMRLLLVVASAALVTGCESMSNRIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0075429_10016298123300006880Populus RhizosphereMRLLLVIASAALVMGCESMSNRIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0079215_1003961923300006894Agricultural SoilMRALIAVSIVFLVSGCESMGEKIDASRQDRCARAEWATVGERDGFAGNAMQAERYAEICGDMFQAGPYREGLQKGLARRPKPVV*
Ga0075426_1110990733300006903Populus RhizosphereMKLFLVGACAVLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPY
Ga0074063_1203827123300006953SoilMRLLLAVLGCTLVAGCESFNTKLDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPEPYKEGLQKGTARRPRPPV*
Ga0126377_1208113423300010362Tropical Forest SoilMRLLIASACVVLVAGCESFSERVDASRQDRCQRADWAQVGERDGVEGTSQADRYAHICGDLFQSAPYGEGLRKGAARRPRPPV*
Ga0126377_1261032813300010362Tropical Forest SoilMRLLLVTACAVLATGCESFSEKLDASRQDRCQRADWAQVGERDGVEGAGMMAERYAHICGDLFQPGPYREGLQKGMARRPRPPV*
Ga0126383_1365119623300010398Tropical Forest SoilMRLLIVSACVMLVAGCESFSERVDASRQDRCQRADWAQVGERDGVEGTSQADRYAHICGDLFQSAPYGEGLRKGAARR
Ga0134122_1021643433300010400Terrestrial SoilPAAGVSSDTSRVNDNGGRMRVMLAVAACAALVSGCESMNQKVDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPSPYQEGLRKGQARRPRPPV*
Ga0134121_1027516123300010401Terrestrial SoilMRLLLVIAGAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0137323_100252023300011409SoilMRLMLAVAACAALVSGCESMSEKIDASRQDRCQRADWALVGERDGVAGAHPQLLIDRYQYICGDMFQPGPYKEGYQKGFARRPKPAA*
Ga0137429_101861823300011437SoilLVLLVSGCESLSEKVHDSRQDRCARADWAQVGERDGVEGAQTQAERYQMICGDMFQPGPYKEGLQKGLARRPKPTA*
Ga0137437_100418863300011442SoilMRFFPLLAACALLAGCESFSTKLDASRQDRCQRADWALVGERDGVEGATTMAERYERICGDMFQPEPYRQGLQKGTARRPRPAV*
Ga0137427_1004643023300011445SoilMRLLLAALAGCALVSGCESFNTKLDASRQDRCQRAEWAKVGERDGLEGASTMAERYAHICGDMFQPGPYKEGLQKGLARRPRPPV*
Ga0137435_112595613300012232SoilLLVAALAGCALVSGCESFNTKLDASRQDRCQRADWVQVGERDGTEGATTMAERYAHICGDMFQPGPYREGLQKGMARRPRPPV*
Ga0150984_10893653523300012469Avena Fatua RhizosphereMRLAALAVAWCAVMASGCQSLDVSRQERCQKADWALVGERDGVEGASSMAERYSSICGDLYQDAAYKEGLQKGLARRPRP
Ga0137394_1088246223300012922Vadose Zone SoilMRMLLVVGACLPLLSACEGMQQKVDDSRQERCNKADWAMVGERDGFENIKNAADRYQMICGDLFKPEPYSEGLKKGAVRRPTPPV*
Ga0137410_1000108663300012944Vadose Zone SoilMRFLLAIVSCTLVSGCESFSQKVDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPGPYQEGLQKGTARRPRPPV*
Ga0126369_1151969213300012971Tropical Forest SoilMRLLIASACIVLVAGCESFSERVDASRQDRCQRADWAQVGERDGVEGAGMMAERYAHICGDLFQPGPYREGLQKG
Ga0134079_1006399523300014166Grasslands SoilMRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMAERYSHVCGDMFQPGPYQEGLKKGQARRPRPPV*
Ga0157380_1212717123300014326Switchgrass RhizosphereMTAACLVQAMGDKIDASRLDRCARADWAQVGERDGVEGANTMAERYAHICGDMFQPGPYREGLQKGAARRPRPPV*
Ga0157380_1338360623300014326Switchgrass RhizosphereMRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0137409_1103923723300015245Vadose Zone SoilMRLFLVGACAVLVSGCEAMGEKIDASRQDRCARANWALVGERDGVEGATMMTERYQHVCGDMFQPAPYQEGLQKGTARRPRPPV*
Ga0132258_1073236333300015371Arabidopsis RhizosphereVIPGFVIPEANAMRLLLVAACAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMSMTDRYAHICGDMFQPGPYQEGLKKGQARRPRPPV*
Ga0132258_1151807933300015371Arabidopsis RhizosphereRPPAEAGIVITEKDTMRLLLVLASAALVTGCESFSERVDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0132257_10439421523300015373Arabidopsis RhizosphereIVITEKDTMRLLLVLASAALVTGCESFSERVDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV*
Ga0184628_1001236443300018083Groundwater SedimentMRFFPLLAACALLAGCESFSTKLDASRQDRCQRADWALVGERDGVEGATTMAERYERICGDMFQPEPYRQGLQKGTARRPRPSV
Ga0184629_1008104523300018084Groundwater SedimentMRFFPLLAACALLAGCESFSTKLDASRQDRCQRADWALVGERDGVEGATTMAERYERICGDMFQPEPYRQGLQKGTARRPRPAV
Ga0190270_1141497123300018469SoilMRVLIAAAVCTALLSGCESMQEKIHDSRQDRCARADWALVGERDGVEGVQNQADRYQTICGDMFQPAPYKEGLQKGLARRPRPPV
Ga0190271_1171027923300018481SoilMRLFLVTACALLTTGCESMSNKIDASRQDRCQRADWAQIGERDGVEGANTMAERYAHICGDLFQPGPYKEGLQKGAARRPRPPV
Ga0190271_1323924523300018481SoilMRTATALALLLLLAGCEAMGDKLAASRQDRCARADWKDVGLRDGVEGVSTMAGRYEHFCGEMFKPGPYQEGVR
Ga0210380_1014658023300021082Groundwater SedimentMRLLLAALAGCALVSGCESFNTKLDASRQDRCQRAEWAKVGERDGLEGASTMAERYAHICGDMFQPGPYKEGLQKGLARRPRPPV
Ga0247665_106424913300024219SoilVRLLLVIASAALVTGCESMTNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGEMFQLGPYQEGLR
Ga0247661_103551923300024254SoilVIPEANARRLLLVAACAALVTGCESMSNKVDASRQDRCQRADWAQVGERDGVEGAMSMTERYAHICGDLFQPGPYQEGLKKGQARRPRPPV
Ga0210073_112093113300025569Natural And Restored WetlandsMRLMLTLAACALVTGCESFSERVDASRQDRCQRADWAKVGERDGLQGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV
Ga0207660_1079080423300025917Corn RhizosphereMRVVTATVLCAALGAGCTAMEEKVAASRQDRCQRADWVLVGERDGVEGVMSAADRYQYICGDMFQPGPYREGLQKGQARRPRPPV
Ga0207711_1161115323300025941Switchgrass RhizosphereMKLFLVGACAVLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPYREGLQKGAAR
Ga0207689_1035537123300025942Miscanthus RhizosphereMKLFLVGACAVLVTGCEAMGEKIDASRQDRCARADWAQVGERDGVEGANMMAERYQHICGDMFQPGPYREGLQKGAARRPRPPV
Ga0210135_100121533300025954Natural And Restored WetlandsMRLILTLAACVLVTGCESFNTRLDASRQDRCQRADWAQVGERDGVEGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV
Ga0210071_100906123300025955Natural And Restored WetlandsMRLLLALAVCAPLLAGCESFNTKLDASRQDRCQRADWAQVGERDGVEGATMMAERYAHICGDMFQPDPYREGLQKGTARRPRPPV
Ga0207708_1053452523300026075Corn, Switchgrass And Miscanthus RhizosphereMRKLPVVILVLAVSGCEAMGEKIDASRQDRCARAEWALVGERDGVTGALLQAERYQEICGDMFQPGPYREGLQKGLARRPKPAV
Ga0207676_1027007023300026095Switchgrass RhizosphereMRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV
Ga0207675_10152365323300026118Switchgrass RhizosphereMRLLLVTACAVLVAGCEAMGEKIDASRQDRCQRAEWAKVGERDGLEGATTMADRYQHICGDMFQPGPYQEGLRKGQARRPRPPV
Ga0209131_109500323300026320Grasslands SoilVRLLLAVAACALVTGCESMGNKIDASRQDRCQRADWAQVGERDGSEGATTMAERYAHICGDMFQPAPYREGLQKGMARRPRPPV
Ga0207428_1002309723300027907Populus RhizosphereMRLLLVVASAALVTGCESMSNRIDASRQDRCQRADWAQVGERDGVEGAMTMTERYAHICGDMFQPGPYQEGLRKGQARRPRPPV
Ga0209583_1020678113300027910WatershedsMRLLLAIVGCALVSGCESFSQKVDASRQDRCQRADWAQVGERDGVEGATTMASRYQHICGDMFQPDPYN
Ga0247825_1007057643300028812SoilMRTATALALLLLLAGCEAMGDKLDASRQDRCAHADWKDVGLRDGVEGVSTMAGRYEHFCGEMFKPGPYQEGVREGLARRPRPPA
Ga0247825_1016783213300028812SoilEVDAMRLFLVTACALLTTGCESMGNKIDASRQDRCQRADWAQIGERDGVEGANTMAERYAHICGELFQPGPYQEGLRKGAARRPRPPV
Ga0247825_1042309023300028812SoilMRLILATACAVLVTGCESFSEKLDASRQDRCQRADWAQVGERDGVEGASMADRYAHICGDQFQPGPYGEGLRKGAARRPRPPV
Ga0307495_1021735713300031199SoilGHVITEGDVMRLLLVTACAMLVTGCESFNTKLDASRQDRCQRADWAQVGERDGVEGATMMTERYAHICGDLFQPAPYQEGLQKGVARRPRPPV
Ga0307497_1013649823300031226SoilRLLLAIVCCALVSGCESFNTKLDASRQERCQRADWAQVGERDGTEGATMAERYAHICGDMFQPEPYQQGLQKGMARRPRPPV
Ga0307497_1042707913300031226SoilMRLLLITACAVLATGCESFNTKLDASRQDRCQRADWAQVGERDGVEGATTMADRYQHICGDMFQPAPYKEGLQKGVARRPRPPV
Ga0307506_1035703313300031366SoilLLLAVSACTVLLGCEAMGDKIDASRQDRCQRADWAQVGERDGVEGAMTMGDRYSHICGDMFQPGPYQEGLKKGQARRPRPPV
Ga0307408_10149095523300031548RhizosphereVRPVLLIATCAALVAGCESMNQKIDASRQDRCQRAVWAEVGERDGVEGAQGMAERYSRICGDMFQPGPYQQGFDKGFARRPKPSV
Ga0307469_1192668523300031720Hardwood Forest SoilVKLLLLAGACALLAAGCESMSQKIDASRQERCHKADWAMVGERDGVEGATAMAERYAHICGDLYNGTAYKEGLQKGMARRPRPPV
Ga0307468_10154659723300031740Hardwood Forest SoilVRLLLVTACAVLATGCESFSAKVDASRQDRCQRADWAQVGERDGVEGASMADRYQHICGDMFQPAPYQQGLQKGAARRPRPPV
Ga0307473_1055848923300031820Hardwood Forest SoilMRLLLITACAMLATGCESFSNKVDASRQDRCQRADWAQVGERDGVEGAAQGERYAHICGDLFQPGPYKEGLQKGMARRPRPPV
Ga0310902_1087366223300032012SoilVLIWGCEGMSQKVDASRQERCQRADWAQVGERDGLEGAQGMAERYSSICGDLFEAGPYQEGVKKGSSRRGRAPA
Ga0315910_1003015223300032144SoilVRTAMALALLLLLSACESMSEKLAASRQDRCARADWKDVGLRDGIEGASTMAQRYEHFCGEMFQPGPYKEGLQEGLARRPRPPV
Ga0315912_1000248543300032157SoilMRLLLAVVGCTLIAGCESFNTKLDASRQDRCQRAEWAKVGERDGLEGATTMAERYQHICGDMFQPGPYQEGLQKGTARRPRPPV
Ga0315912_1005984223300032157SoilMRMPIAVGLVLVLAGCESFQERVHDSRMDRCARADWALVGERDGVEGASGQASRYQEICGEMFQAGPYREGLQKGLARRPRPPV
Ga0307470_1154210613300032174Hardwood Forest SoilGACALLVAGCESMSQKIDASRQERCHKADWAMVGERDGVEGATAMAERYAHICGDLYNGTAYKEGLQKGMARRPRPPV
Ga0307470_1186196323300032174Hardwood Forest SoilMRLLLVIASAALVTGCESMSNKIDASRQDRCQRADWAQVGERDGVEGAMTMADRYSHVCGDMFQPGPYQEGLRKGQARRPRPPV
Ga0307471_10400720133300032180Hardwood Forest SoilRLLLVTACAVLATGCESFSAKVDASRQDRCQRADWAQVGERDGVEGASMADRYQHICGDMFQPAPYQQGLQKGAARRPRPPV
Ga0307472_10084555313300032205Hardwood Forest SoilCAVLATGCESFNTKLDASRQDRCQRADWAQVGERDGVEGATTMADRYQHICGDLFQPGPYQEGLQKGVARRPRPPV
Ga0247829_1139113613300033550SoilMRLLLVTACAVLATGCESFSNRVDASRQDRCQRADWAQIGERDGVEGANTMAERYAHICGELFQPGPYQEGLRKGAARRPRPPV
Ga0247830_1079322623300033551SoilMRLLLVTACAVLATGCESFSNRVDASRQDRCQRADWAQVGERDGVEGANTMAERYAHICGELFQPGPYQEGLRKGAARRPRPPV
Ga0247830_1086822423300033551SoilMRALIAVSIVFLVSGCESMGEKIDASRQDRCARAEWATVGERDGFAGNAMQAERYAEICGDMFQAGPYREGLQKGLARRPKPVV
Ga0364945_0207933_275_5293300034115SedimentMRFFPLLAACALLAGCESFSTKLDASRQDRCQRADWAQVGERDGVEGATTMAERYQRICGDMFQPDPYRQGLQKGTARRPRPPV
Ga0314786_189020_1_2283300034664SoilVRQGFLVAASAVLIWGCEGMSQKVDASRQERCQRADWAQVGERDGLEGAQGMAERYSSICGDLFEAGPYQEGVKKG
Ga0373950_0112115_345_5963300034818Rhizosphere SoilQALLVASGAVLVWGCESMNQKLDASRQDRCQRADWAQVGERDGVEGAQGMAERYARICGDMFQPGPYQEGLKKGSSRRGRAPA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.