NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F103972



Overview

Basic Information
Family ID F103972
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 45 residues
Representative Sequence MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLF
Number of Associated Samples 89
Number of Associated Scaffolds 101
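
As a quick check on the length figures above, a sequence's residue count can be computed directly from its string. The sketch below is illustrative only: the helper name `residue_count` is hypothetical, and it assumes that a trailing `*` (as seen on many entries in the Family Sequences table) is an end-of-translation marker rather than a residue.

```python
def residue_count(sequence: str) -> int:
    """Count residues, ignoring an optional trailing '*' (assumed stop marker)."""
    return len(sequence.rstrip("*"))


# Representative sequence as listed under Basic Information.
representative = "MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLF"

print(residue_count(representative))        # length of the representative sequence
print(residue_count(representative + "*"))  # same value: the marker is not counted
```

Applied to every family member, the mean of these counts would be expected to land near the reported average of 45 residues.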

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 83.17 %
% of genes near scaffold ends (potentially truncated) 26.73 %
% of genes from short scaffolds (< 2000 bps) 83.17 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.49
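
The percentages in this block can be turned back into approximate gene counts. This is a minimal sketch that assumes each percentage is taken over the 101 sequences of the family (the page does not state the denominator explicitly); `percent_to_count` is a hypothetical helper name.

```python
# Number of sequences reported under "Basic Information".
FAMILY_SIZE = 101


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a rounded percentage back to the nearest whole number of members."""
    return round(percent * total / 100)


# Values copied from the Quality Assessment block; the denominator is an assumption.
quality_percentages = {
    "valid RBS motif": 83.17,
    "near scaffold end (potentially truncated)": 26.73,
    "short scaffold (< 2000 bps)": 83.17,
}

for label, percent in quality_percentages.items():
    print(f"{label}: {percent_to_count(percent)} of {FAMILY_SIZE} genes")
```

Under that assumption, 83.17 % corresponds to 84 genes and 26.73 % to 27 genes.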


Most Common Taxonomy
Group Unclassified (60.396 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere
(11.881 % of family members)
Environment Ontology (ENVO) Unclassified
(29.703 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(39.604 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 52.70%, β-sheet: 0.00%, Coil/Unstructured: 47.30%

Predicted 3D Structure

pTM-score: 0.49

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
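
The rule stated in the note above can be written as a one-line check. This is only a sketch of the stated cut-off (pTM < 0.7); the function name is hypothetical, and the page does not say how models at or above the cut-off are labelled.

```python
LOW_CONFIDENCE_PTM = 0.7  # cut-off quoted in the note on this page


def is_low_confidence(ptm_score: float, cutoff: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score falls strictly below the cut-off."""
    return ptm_score < cutoff


print(is_low_confidence(0.49))  # pTM-score reported for family F103972
```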



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF07238 | PilZ | 53.47
PF00196 | GerE | 11.88
PF13537 | GATase_7 | 0.99




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 60.40 %
All Organisms | root | All Organisms | 39.60 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2088090014|GPIPI_16661545 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2055
3300000652|ARCol0yngRDRAFT_1008710 | Not Available | 733
3300001305|C688J14111_10017351 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 2102
3300001686|C688J18823_10017111 | All Organisms → cellular organisms → Bacteria | 4837
3300001686|C688J18823_10920852 | Not Available | 555
3300002568|C688J35102_118126595 | Not Available | 532
3300004022|Ga0055432_10193809 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 582
3300004114|Ga0062593_103489644 | Not Available | 505
3300004157|Ga0062590_100449168 | Not Available | 1078
3300004157|Ga0062590_101000555 | Not Available | 795
3300004463|Ga0063356_101533469 | Not Available | 988
3300004479|Ga0062595_101827016 | Not Available | 579
3300004480|Ga0062592_101344582 | Not Available | 677
3300004643|Ga0062591_102629061 | Not Available | 531
3300005160|Ga0066820_1020945 | Not Available | 512
3300005164|Ga0066815_10096993 | Not Available | 549
3300005289|Ga0065704_10235512 | Not Available | 1030
3300005294|Ga0065705_10218509 | Not Available | 1331
3300005295|Ga0065707_10714551 | Not Available | 628
3300005332|Ga0066388_100178903 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2728
3300005332|Ga0066388_102271997 | All Organisms → cellular organisms → Bacteria | 981
3300005345|Ga0070692_10655712 | Not Available | 701
3300005526|Ga0073909_10017889 | Not Available | 2264
3300005535|Ga0070684_100463036 | All Organisms → cellular organisms → Bacteria | 1172
3300005549|Ga0070704_101789724 | Not Available | 568
3300005616|Ga0068852_100812446 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 949
3300005617|Ga0068859_100026987 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 5761
3300005713|Ga0066905_100059123 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2399
3300005713|Ga0066905_101218199 | Not Available | 674
3300005719|Ga0068861_100363298 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1274
3300005764|Ga0066903_100865790 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1629
3300005844|Ga0068862_102440324 | Not Available | 535
3300006049|Ga0075417_10027522 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2328
3300006163|Ga0070715_10223836 | Not Available | 969
3300006605|Ga0074057_11618165 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1292
3300006845|Ga0075421_100327474 | Not Available | 1856
3300006847|Ga0075431_100361081 | Not Available | 1459
3300006852|Ga0075433_11229563 | Not Available | 650
3300006853|Ga0075420_100318253 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1348
3300007076|Ga0075435_101905753 | Not Available | 522
3300007255|Ga0099791_10023029 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2691
3300009156|Ga0111538_11331432 | Not Available | 906
3300009156|Ga0111538_13621962 | Not Available | 535
3300009162|Ga0075423_11559117 | Not Available | 709
3300009176|Ga0105242_11623034 | Not Available | 681
3300009609|Ga0105347_1073081 | All Organisms → cellular organisms → Bacteria | 1259
3300009609|Ga0105347_1429885 | Not Available | 570
3300010046|Ga0126384_10057360 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 2714
3300010359|Ga0126376_11047981 | Not Available | 820
3300010362|Ga0126377_11995116 | All Organisms → cellular organisms → Bacteria | 656
3300010397|Ga0134124_10006286 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 9556
3300010397|Ga0134124_11100446 | Not Available | 811
3300010399|Ga0134127_10814813 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 983
3300010399|Ga0134127_12635749 | Not Available | 582
3300010403|Ga0134123_11538519 | Not Available | 711
3300011432|Ga0137428_1041380 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1200
3300011439|Ga0137432_1020417 | All Organisms → cellular organisms → Bacteria | 1906
3300011442|Ga0137437_1123154 | Not Available | 892
3300011443|Ga0137457_1110380 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 883
3300012173|Ga0137327_1120655 | Not Available | 580
3300012469|Ga0150984_115846518 | Not Available | 1216
3300012510|Ga0157316_1002356 | Not Available | 1276
3300012685|Ga0137397_10059364 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2758
3300012896|Ga0157303_10313432 | Not Available | 506
3300012948|Ga0126375_10145689 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1485
3300012984|Ga0164309_10889014 | Not Available | 725
3300012987|Ga0164307_10176238 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1434
3300018083|Ga0184628_10114243 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1394
3300020082|Ga0206353_10179788 | Not Available | 540
3300021082|Ga0210380_10156234 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1024
3300024181|Ga0247693_1036719 | Not Available | 687
3300024224|Ga0247673_1035723 | Not Available | 689
3300025271|Ga0207666_1037684 | Not Available | 752
3300025922|Ga0207646_11286375 | Not Available | 639
3300025931|Ga0207644_10966264 | Not Available | 714
3300026088|Ga0207641_10491739 | Not Available | 1190
3300026088|Ga0207641_11008397 | Not Available | 829
3300026116|Ga0207674_12300264 | Not Available | 501
3300026118|Ga0207675_100403360 | Not Available | 1348
3300027056|Ga0209879_1058835 | Not Available | 618
3300027513|Ga0208685_1046433 | All Organisms → cellular organisms → Bacteria | 942
3300027655|Ga0209388_1199122 | Not Available | 555
3300027821|Ga0209811_10007696 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 3450
3300027821|Ga0209811_10277537 | Not Available | 643
3300027873|Ga0209814_10109421 | Not Available | 1175
3300027880|Ga0209481_10598522 | Not Available | 571
3300027909|Ga0209382_10284521 | Not Available | 1868
3300028380|Ga0268265_11328958 | Not Available | 719
3300028592|Ga0247822_10250011 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1341
3300028792|Ga0307504_10001485 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 4566
3300028812|Ga0247825_10570284 | Not Available | 809
3300031152|Ga0307501_10000261 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3871
3300031184|Ga0307499_10001744 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 5946
3300031199|Ga0307495_10087925 | Not Available | 714
3300031547|Ga0310887_10142370 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1246
3300031720|Ga0307469_10607113 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 979
3300031740|Ga0307468_100005195 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 4470
3300031740|Ga0307468_100623638 | Not Available | 885
3300031740|Ga0307468_101051668 | Not Available | 721
3300031820|Ga0307473_10218023 | All Organisms → cellular organisms → Bacteria | 1146
3300032075|Ga0310890_10237324 | All Organisms → cellular organisms → Bacteria | 1275
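
Each scaffold identifier in this table has the form `<number>|<scaffold name>`. The sketch below assumes that the number before the `|` is the Taxon OID used in the Associated Samples table (the values match, but the page does not state this), and shows how scaffolds could be grouped per sample. The helper name `split_scaffold_id` is hypothetical, and only a few identifiers from the table are used as input.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first '|' (assumed layout)."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


# A handful of identifiers copied from the Associated Scaffolds table.
scaffold_ids = [
    "3300001305|C688J14111_10017351",
    "3300001686|C688J18823_10017111",
    "3300001686|C688J18823_10920852",
    "3300002568|C688J35102_118126595",
]

# Count how many family scaffolds come from each sample.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in scaffold_ids)

for taxon_oid, count in sorted(scaffolds_per_sample.items()):
    print(taxon_oid, count)
```

Run over the full table, a grouping like this would explain why 101 scaffolds map to only 89 samples: some samples contribute more than one scaffold.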




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 11.88%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 7.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.93%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 6.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 5.94%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 4.95%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.95%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 4.95%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 3.96%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 3.96%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.97%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 2.97%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.97%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.98%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.98%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.98%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Rhizosphere | 1.98%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.99%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.99%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.99%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.99%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.99%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.99%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.99%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.99%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090014 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000652 | Arabidopsis rhizosphere microbial communities from the University of North Carolina - sample from Col-0 young rhizosphere DNA | Host-Associated
3300001305 | Grasslands soil microbial communities from Hopland, California, USA | Environmental
3300001686 | Grasslands soil microbial communities from Hopland, California, USA | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300004022 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004157 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005160 | Soil and rhizosphere microbial communities from Laval, Canada - mgLMB | Environmental
3300005164 | Soil and rhizosphere microbial communities from Laval, Canada - mgLAC | Environmental
3300005289 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005526 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 | Environmental
3300005535 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaG | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005719 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 | Host-Associated
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006605 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAB (Metagenome Metatranscriptome) | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006853 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4 | Host-Associated
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300009156 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300009609 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011432 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT718_2 | Environmental
3300011439 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT820_2 | Environmental
3300011442 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT138_2 | Environmental
3300011443 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2 | Environmental
3300012173 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT517_2 | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012510 | Arabidopsis rhizosphere microbial communities from North Carolina - M.Col.9.old.080610 | Host-Associated
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012896 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S118-311C-2 | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300018083 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1 | Environmental
3300020082 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-4 (Metagenome Metatranscriptome) (v2) | Environmental
3300021082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo | Environmental
3300024181 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK34 | Environmental
3300024224 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14 | Environmental
3300025271 | Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with spike-in - S5 (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025931 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes) | Host-Associated
3300026088 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S6-2 (SPAdes) | Host-Associated
3300026116 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) | Host-Associated
3300026118 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes) | Host-Associated
3300027056 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 (SPAdes) | Environmental
3300027513 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027821 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes) | Environmental
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes) | Host-Associated
3300027909 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes) | Host-Associated
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028592 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30 | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300031152 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_S | Environmental
3300031184 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 13_S | Environmental
3300031199 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S | Environmental
3300031547 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPIPI_02939730 | 2088090014 | Soil | LTPLETEGAAQLRGPDADLWRTLAAWLWMAYLMLMSGAVLWWLF
ARCol0yngRDRAFT_10087101 | 3300000652 | Arabidopsis Rhizosphere | MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWL
C688J14111_100173511 | 3300001305 | Soil | MALTPVETEGAGELRGPDADMWRTLAAWLWMTYLGLLTGAVLWWLL*
C688J18823_100171119 | 3300001686 | Soil | MALTPVETEGAGQLRGPDADMWRTLAAWLWMTYLGLLTGAVLWWLL*
C688J18823_109208521 | 3300001686 | Soil | MALTPVETEGTAQLRGPDADMWRTLAAWLWMAYLMLLTGAVLWWLL*
C688J35102_1181265951 | 3300002568 | Soil | MALTPVETEGAAQLGGPDADMWRTLAAWLWMTYLMLLTGAVLWWLL*
Ga0055432_101938091 | 3300004022 | Natural And Restored Wetlands | KGSMAMRQLDTEGAPLAGPDGDMWRTAAAWLWTTYLMLVTGAVLYWVF*
Ga0062593_1034896441 | 3300004114 | Soil | MAMTPLETEVAAQLRGPDADLWRTLAAWLWMTYLLLLTGAVLWWLF*
Ga0062590_1004491682 | 3300004157 | Soil | MALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWWLF*
Ga0062590_1010005551 | 3300004157 | Soil | MALTPLETEGAAQLRGPDADLWRTLAAWLWMSYLVLVTGAVLWWLF*
Ga0063356_1015334691 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MAFRSLDTEGAAGTPRLREPDADMWRTLAAWLWTSYLVLVTGAILWWVF*
Ga0062595_1018270162 | 3300004479 | Soil | MALTPLETEGAAQLGGPEADVWRTFAAWLWMAYLFLVTGAVLWWFF*
Ga0062592_1013445822 | 3300004480 | Soil | MAMTPLETEGAAQLRGPDADMWRTLAAWLWMTYLLLVTGAVLWWVF*
Ga0062591_1026290611 | 3300004643 | Soil | MALTPLETEGAAQLGGPEADVWRTFAAWLWMAYLFLVTGTVLWWFF*
Ga0066820_10209451 | 3300005160 | Soil | MAMTPLETEGTAQLRGPDADMWRTLAAWLWMTYLLLVTGAVLWWVF*
Ga0066815_100969931 | 3300005164 | Soil | MAMTPLETEGAAQLRGPDADLWRTLAAWLWMSYLVLVTGAILWWLF*
Ga0065704_102355123 | 3300005289 | Switchgrass Rhizosphere | MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLF*
Ga0065705_102185093 | 3300005294 | Switchgrass Rhizosphere | MAFTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLL*
Ga0065707_107145511 | 3300005295 | Switchgrass Rhizosphere | MAMTPLETEGAAQLRGPDADLWRTLAAWLWMSYLVLVTGAVLWWLF*
Ga0066388_1001789034 | 3300005332 | Tropical Forest Soil | MAMRPLDTEGAQLRGPDADMWRTLAAWLWTSYLMLVTGAILWWFF*
Ga0066388_1022719972 | 3300005332 | Tropical Forest Soil | MALTPLETEGAAQLGGPEADVWRTFAAWLWMAYLFLVAGAVLWWFF*
Ga0070692_106557122 | 3300005345 | Corn, Switchgrass And Miscanthus Rhizosphere | MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLFLVTGAVLWWLL*
Ga0073909_100178892 | 3300005526 | Surface Soil | MALTPLETEGAAPLSGPDADTWRTLAAWLWMTYLLLVTGAVLWWLG*
Ga0070684_1004630362 | 3300005535 | Corn Rhizosphere | MALTPLETEGAAQLGGPEADVWRTFAAWLWMAYLFLVTGALLWWFF*
Ga0070704_1017897241 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MAFRSLDTEGAAGTPRLREPAADMWRTLAAWLWTSYLVLVTGAILWWVF*
Ga0068852_1008124463 | 3300005616 | Corn Rhizosphere | MALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWW
Ga0068859_10002698712 | 3300005617 | Switchgrass Rhizosphere | ASRERSMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWWLF*
Ga0066905_1000591233 | 3300005713 | Tropical Forest Soil | MALRRLDTEAAAQLDGADADMWRTLAAWLWTTYLAVVTGAVLWWVF*
Ga0066905_1012181992 | 3300005713 | Tropical Forest Soil | MTIRPLDTEGAPLRGPDADMWRTAAAWLWTAYLTLVTGAILWWVF*
Ga0068861_1003632981 | 3300005719 | Switchgrass Rhizosphere | MALTPLETEGAAQLGGPEADVWRTLAAWLWMAYLFLVTGAVLWWFF*
Ga0066903_1008657901 | 3300005764 | Tropical Forest Soil | MALTPLEAEGAAHLRGPDADLWRTLAAWLWMTYLALVTGAVLWWLS*
Ga0068862_1024403242 | 3300005844 | Switchgrass Rhizosphere | MAFRSLDTEEAAGTPRLRGPDADMWRTLAAWLWTSYLVLVTG
Ga0075417_100275221 | 3300006049 | Populus Rhizosphere | MALTPLETDGAAQLRGQDADLWRTLAAWLWMTYLFLVTGAVLWWLL*
Ga0070715_102238362 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MALTPLETEGTAQLGGPEADLWRTFAAWLWMAYLFLVSGAVLWWFF*
Ga0074057_116181651 | 3300006605 | Soil | MAMTRLETESAAQLRGPDADMWRTLAAWLWMSYLVLVTGAVLWWVF*
Ga0075421_1003274741 | 3300006845 | Populus Rhizosphere | DGAAQLRGQDADLWRTLAAWLWMTYLFLVTGAVLWWLL*
Ga0075431_1003610811 | 3300006847 | Populus Rhizosphere | MALTPFDTEGVAPLRGPEADMWRTLAAWLWTTYLALVTGAVLWWVF*
Ga0075433_112295631 | 3300006852 | Populus Rhizosphere | MALTPLDTEGAAQLRGPDADLWRTLAAWLWMTYLLLVAGAVLWWLF*
Ga0075420_1003182531 | 3300006853 | Populus Rhizosphere | VTSAEQEDTPKVGSMALTPFDTEGVAPLRGPEADMWRTLAAWLWTTYLALVTGAVLWWVF
Ga0075435_1019057532 | 3300007076 | Populus Rhizosphere | MALTPLDTEGAAQLRGPDADLWRTLAAWLWMTYLLLVAGA
Ga0099791_100230293 | 3300007255 | Vadose Zone Soil | MALTPLETEGAAHLGGPDADLWRTLAAWLWMAYLVLVSGAVLWWVL*
Ga0111538_113314322 | 3300009156 | Populus Rhizosphere | MALTPLDTEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLF*
Ga0111538_136219621 | 3300009156 | Populus Rhizosphere | MAFTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTG
Ga0075423_115591171 | 3300009162 | Populus Rhizosphere | TPLETEVAAQLRGPDADLWRTLAAWLWMTYLFLVTGAVLWWLL*
Ga0105242_116230342 | 3300009176 | Miscanthus Rhizosphere | MALTPLETEGAAQLGGPEADVWRTLAAWLWMAYLFLVTGTVLWWFF*
Ga0105347_10730812 | 3300009609 | Soil | MTPVGRREGRSMALRSLDTEEAPPLSGPDADMWRTLAAWLWATYLMLVTGAILWWVF*
Ga0105347_14298851 | 3300009609 | Soil | MRTLDTEGAPLAGPDADMWRTAAAWLWTTYLMLVTGAVLYWVF*
Ga0126384_100573605 | 3300010046 | Tropical Forest Soil | MALTPLETEATAQLGGPEADVWRTFAAWLWMAYLFLVTGAVLWWFL*
Ga0126376_110479811 | 3300010359 | Tropical Forest Soil | MALTPLETEGAAQLGGPEADVWRTYAAWLWMAYLFLVTGAVLWWFF*
Ga0126377_119951161 | 3300010362 | Tropical Forest Soil | RGGLMTIRPLDTEGAPLRGPDADMWRTAAAWLWTAYLTLVTGAILWWVF*
Ga0134124_100062861 | 3300010397 | Terrestrial Soil | MALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLLLVTGAVLWWLL*
Ga0134124_111004461 | 3300010397 | Terrestrial Soil | MTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLF*
Ga0134127_108148133 | 3300010399 | Terrestrial Soil | MAFRSLDTEGAAGTPRLRGPDADMWRTLAAWLWTSYLVLVTGA
Ga0134127_126357491 | 3300010399 | Terrestrial Soil | MTPLETEGAGQLRGPDADMWRTLAAWLWMTYLFLVTGAVLWWLF*
Ga0134123_115385191 | 3300010403 | Terrestrial Soil | MALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLLLVTGAVPWWLL*
Ga0137428_10413801 | 3300011432 | Soil | MALRSLDTEEAPPLSGPDADMWRTLAAWLWATYLMLVTGA
Ga0137432_10204173 | 3300011439 | Soil | MALRSLDTEEAPRLRGPDADMWRTLAAWLWATYLMLVTGAILWWVF*
Ga0137437_11231541 | 3300011442 | Soil | MALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLVLVTSAVLWWLF*
Ga0137457_11103802 | 3300011443 | Soil | DTEGAPLAGPDADMWRTAAAWLWTTYLMLVTGAVLYWVF*
Ga0137327_11206551 | 3300012173 | Soil | MTPVGRREGRSMALRSLDTEEAPPLSGPDADMWRTLAAWLWATYLMLVTGAI
Ga0150984_1158465182 | 3300012469 | Avena Fatua Rhizosphere | MALTPVETEGAVQLGGPDADMWRTLAAWLWMTYLMLLTGAVLWWLL*
Ga0157316_10023563 | 3300012510 | Arabidopsis Rhizosphere | MALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLL*
Ga0137397_1005936443300012685Vadose Zone SoilMALTPLETEGAAHLGGRDADLWRTLAAWLWMAYLVLVSGAVLWWVL*
Ga0157303_1031343223300012896SoilMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVT
Ga0126375_1014568913300012948Tropical Forest SoilMRPLDTEGAQLRGPDADMWRTLAAWLWTAYLMLITGAILWWFF*
Ga0164309_1088901413300012984SoilEGAAPLSGPDADTWRTLAAWLWMTYLLLVTGAVLWWLG*
Ga0164307_1017623833300012987SoilMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLG*
Ga0184628_1011424323300018083Groundwater SedimentMALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLVLVTSAVLWWLF
Ga0206353_1017978823300020082Corn, Switchgrass And Miscanthus RhizosphereETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWWLF
Ga0210380_1015623433300021082Groundwater SedimentMALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLVLVTSAVL
Ga0247693_103671913300024181SoilMALTPLETEGTAQLGGPEADVWRTFAAWLWMAYLFLVSGAVLWWFF
Ga0247673_103572313300024224SoilMALTPLETEGAAQLGGPEADVWRTFAAWLWMAYLFLVSGAVLWWFF
Ga0207666_103768413300025271Corn, Switchgrass And Miscanthus RhizosphereSMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWWLF
Ga0207646_1128637523300025922Corn, Switchgrass And Miscanthus RhizosphereMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLL
Ga0207644_1096626413300025931Switchgrass RhizosphereMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTG
Ga0207641_1049173943300026088Switchgrass RhizosphereMALTPLETEGAAQLRGPDADLWRTLAAWLWMSYLVLVTGAVLWWLF
Ga0207641_1100839723300026088Switchgrass RhizosphereMALTPLETEGAAQLGGPDADLWRTLAAWLWMTYLLLVTGAVLWWLL
Ga0207674_1230026423300026116Corn RhizosphereMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFL
Ga0207675_10040336023300026118Switchgrass RhizosphereMALTPLETEGAAQLGGPEADVWRTLAAWLWMAYLFLVTGAVLWWFF
Ga0209879_105883513300027056Groundwater SandMAMRTLDTEGAPLAGPDADMWRTAAAWLWTTYLMLVTGAVLYWVF
Ga0208685_104643323300027513SoilMALRSLDTEEAPPLSGPDADMWRTLAAWLWATYLMLVTGAILWWVF
Ga0209388_119912213300027655Vadose Zone SoilMALTPLETEGAAHLGGPDADLWRTLAAWLWMAYLVLVSGAVLWWVL
Ga0209811_1000769653300027821Surface SoilMALTPLETEGAAPLSGPDADTWRTLAAWLWMTYLLLVTGAVLWWLG
Ga0209811_1027753723300027821Surface SoilMAMTPLETEGTAQLRGPDADMWRTLAAWLWMTYLLLVTGAVLWWVF
Ga0209814_1010942133300027873Populus RhizosphereMALTPLETDGAAQLRGQDADLWRTLAAWLWMTYLFLVTGAVLWWLL
Ga0209481_1059852213300027880Populus RhizosphereMALTPFDTEGVAPLRGPEADMWRTLAAWLWTTYLALVTGAVLWWVF
Ga0209382_1028452163300027909Populus RhizosphereDGAAQLRGQDADLWRTLAAWLWMTYLFLVTGAVLWWLL
Ga0268265_1132895813300028380Switchgrass RhizosphereMALTPLETEAAAELRGPDADLWRTLAAWLWMTYLLLVTGAFLWWL
Ga0247822_1025001133300028592SoilMRLLDTEGAPLAGPDADMWRTAAAWLWTTYLMLVTGAVLYWVF
Ga0307504_1000148533300028792SoilMALSPLEAEGAAELREPDADLWRTLAAWLWMTYLLLVTGAVLWWLS
Ga0247825_1057028423300028812SoilMALTPLETEGAAQLRGPDADLWRTLAAWLWMTYLMLVTGAVLWWLL
Ga0307501_1000026153300031152SoilMAMTPLETEGAAQLRGPDADMWRTLAAWLWMTYLSLVTSAVLWWLF
Ga0307499_1000174453300031184SoilMAMTPLEIEGAGQLRGPDADMWRTLAAWLWMAYLILVTGAVLWWLF
Ga0307495_1008792523300031199SoilMTPLETEGAAHLHGPDADLWRTLAAWLWMTYLALVTGAVLWWVF
Ga0310887_1014237013300031547SoilMAFTPLETEGAAQLRGPDADLWRTLAAWLWMTYLLLVTGAVLWWLL
Ga0307469_1060711333300031720Hardwood Forest SoilMALTPLDTAGTAQLRGPDADLWRTLAAWLWMTYLLLVAGAFLWWLF
Ga0307468_10000519593300031740Hardwood Forest SoilMALTPLETDGPAQLRGPDADMWRTLAAWLWMTYLLLVTGAVLWWVF
Ga0307468_10062363813300031740Hardwood Forest SoilMALTPLETEGTAQLRGPDADLWRTLAAWLWMTYLLLVAGAFLWWLF
Ga0307468_10105166813300031740Hardwood Forest SoilMAMRQLDTEGAPLRGPDADLWRTLAAWLWTTYLMLVTGAILWWVF
Ga0307473_1021802323300031820Hardwood Forest SoilMRLDTEGAPLAGPDADMWRTAAAWLWTTYLMLVTGAVLYWVF
Ga0310890_1023732423300032075SoilMALRRLDTESAPHLHGADADMWRTLAAWLWTTYLAVVTGAVVWWVF




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice