NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104821

Overview

Basic Information
Family ID F104821
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 73 residues
Representative Sequence MNWPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLL
Number of Associated Samples 84
Number of Associated Scaffolds 100
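As a quick sanity check on the record above, the following minimal Python sketch measures the representative sequence and tallies its most frequent residues. The helper name `residue_summary` is hypothetical and is not part of NMPFamsDB; only the sequence string comes from this page.

```python
from collections import Counter

# Representative sequence copied verbatim from the Basic Information block.
REPRESENTATIVE = (
    "MNWPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLL"
)


def residue_summary(seq: str) -> dict:
    """Return the sequence length and its three most frequent residues."""
    counts = Counter(seq)
    return {"length": len(seq), "most_common": counts.most_common(3)}


summary = residue_summary(REPRESENTATIVE)
print(summary["length"])       # 73, matching the reported average length
print(summary["most_common"])  # list of (residue, count) pairs
```

Note that the page reports 73 residues as the family average; the representative happening to have the same length is an observation from this record, not a rule.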

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 67.86 %
% of genes near scaffold ends (potentially truncated) 13.00 %
% of genes from short scaffolds (< 2000 bps) 20.00 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.43


Most Common Taxonomy
Group Unclassified (82.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (30.000 % of family members)
Environment Ontology (ENVO) Unclassified (38.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (42.000 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 59.41%, β-sheet: 0.00%, Coil/Unstructured: 40.59%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.43

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
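The two confidence measures in this section can be applied programmatically. The sketch below assumes the 0.7 pTM cutoff stated above and the four pLDDT bands shown in the viewer legend; the function names are hypothetical, and how the site treats fractional pLDDT values between bands (for example 50.5) is an assumption made here.

```python
def is_low_confidence(ptm: float, threshold: float = 0.7) -> bool:
    """True when the model's pTM-score falls below the page's stated cutoff."""
    return ptm < threshold


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT value (0-100) to one of the legend's bands.

    Assumption: fractional values between two bands go to the higher band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


print(is_low_confidence(0.43))  # True for this family's model
print(plddt_band(65))           # "51-70"
```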



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name             % Frequency in 100 Family Scaffolds
PF00005   ABC_tran         20.00
PF04012   PspA_IM30        7.00
PF01594   AI-2E_transport  2.00
PF03091   CutA1            1.00
PF07755   DUF1611          1.00
PF13407   Peripla_BP_4     1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 100 Family Scaffolds
COG1842  Phage shock protein A  Transcription [K]  14.00
COG0628  Predicted PurR-regulated permease PerM  General function prediction only [R]  2.00
COG1324  Divalent cation tolerance protein CutA  Inorganic ion transport and metabolism [P]  1.00
COG3367  Uncharacterized conserved protein, NAD-dependent epimerase/dehydratase family  General function prediction only [R]  1.00



Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
Unclassified   root  N/A            82.00 %
All Organisms  root  All Organisms  18.00 %

Associated Scaffolds


Scaffold  Taxonomy  Length  IMG/M Link
3300003319|soilL2_10142976  Not Available  1366  Open in IMG/M
3300004114|Ga0062593_102005963  Not Available  643  Open in IMG/M
3300005177|Ga0066690_10081970  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  2041  Open in IMG/M
3300005178|Ga0066688_10613644  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  699  Open in IMG/M
3300005179|Ga0066684_10192996  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  1314  Open in IMG/M
3300005184|Ga0066671_10057325  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  2000  Open in IMG/M
3300005440|Ga0070705_100156965  Not Available  1516  Open in IMG/M
3300005518|Ga0070699_100549465  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  1051  Open in IMG/M
3300006046|Ga0066652_101124941  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  744  Open in IMG/M
3300006797|Ga0066659_10037675  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  2954  Open in IMG/M
3300010304|Ga0134088_10429701  Not Available  646  Open in IMG/M
3300010336|Ga0134071_10022332  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  2689  Open in IMG/M
3300012200|Ga0137382_10306875  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  1108  Open in IMG/M
3300012204|Ga0137374_10590133  Not Available  849  Open in IMG/M
3300012206|Ga0137380_10015138  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  7038  Open in IMG/M
3300012209|Ga0137379_10102901  All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → asterids → campanulids → Asterales → Asteraceae → Asteroideae → Anthemideae → Anthemidinae → Tanacetum → Tanacetum cinerariifolium  2748  Open in IMG/M
3300012285|Ga0137370_10006884  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → Gemmatirosa → Gemmatirosa kalamazoonesis  5247  Open in IMG/M
3300012349|Ga0137387_10575320  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  817  Open in IMG/M
3300012349|Ga0137387_10812271  Not Available  677  Open in IMG/M
3300012918|Ga0137396_10151227  Not Available  1690  Open in IMG/M
3300012927|Ga0137416_10176871  Not Available  1694  Open in IMG/M
3300012976|Ga0134076_10489310  Not Available  560  Open in IMG/M
3300018433|Ga0066667_11834529  Not Available  550  Open in IMG/M
3300025921|Ga0207652_11298086  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  630  Open in IMG/M
3300026325|Ga0209152_10024455  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  2067  Open in IMG/M
3300027775|Ga0209177_10248727  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  656  Open in IMG/M
3300027903|Ga0209488_11054209  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  559  Open in IMG/M
3300031199|Ga0307495_10004142  All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes  1764  Open in IMG/M
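Each entry in the Scaffold column joins a numeric taxon OID and a scaffold name with a `|` character. A minimal Python sketch for splitting such identifiers and applying the 2000 bp short-scaffold cutoff quoted in the Quality Assessment block might look like this; `ScaffoldRef`, `parse_scaffold_ref` and `is_short_scaffold` are hypothetical names, not an NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric sample identifier, e.g. "3300005177"
    scaffold_id: str  # scaffold name, e.g. "Ga0066690_10081970"


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split a 'taxon_oid|scaffold_id' string as shown in the Scaffold column."""
    taxon_oid, sep, scaffold_id = text.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a 'taxon_oid|scaffold_id' string: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


def is_short_scaffold(length_bp: int, cutoff_bp: int = 2000) -> bool:
    """True for scaffolds below the cutoff used in the quality assessment."""
    return length_bp < cutoff_bp


ref = parse_scaffold_ref("3300005177|Ga0066690_10081970")
print(ref.taxon_oid, ref.scaffold_id)
print(is_short_scaffold(2041))  # False: 2041 bp is not below 2000 bp
```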




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  30.00%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  18.00%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  13.00%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  6.00%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  5.00%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  5.00%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  3.00%
Soil  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil  2.00%
Groundwater Sediment  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment  2.00%
Sugarcane Root And Bulk Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil  2.00%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  2.00%
Soil  Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil  2.00%
Sediment  Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment  2.00%
Groundwater Sediment  Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment  1.00%
Surface Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil  1.00%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  1.00%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  1.00%
Soil  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  1.00%
Corn Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere  1.00%
Soil  Environmental → Terrestrial → Soil → Loam → Unclassified → Soil  1.00%
Miscanthus Rhizosphere  Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere  1.00%
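The Taxonomy column of the habitat table encodes each ecosystem as a path whose levels are separated by `→`. The following Python sketch splits such a path into its levels; the function name `split_ecosystem_path` is hypothetical, and the levels are deliberately left unnamed because this page does not label them.

```python
from typing import List


def split_ecosystem_path(path: str) -> List[str]:
    """Split an arrow-separated ecosystem path into its individual levels."""
    return [level.strip() for level in path.split("→")]


# Path copied from the first row of the habitat table.
path = "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil"
levels = split_ecosystem_path(path)
print(levels[0])   # broadest level
print(levels[-1])  # most specific level
```

Grouping rows by `levels[0]` or `levels[1]` is one way to aggregate the distribution percentages at a coarser level.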




Associated Samples

Taxon OID  Sample Name  Habitat Type  IMG/M Link
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005179Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005334Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2Host-AssociatedOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300006031Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010134Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010141Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_5_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300011427Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT418_2EnvironmentalOpen in IMG/M
3300012140Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT690_2EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012396Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012401Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012403Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300019257Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT660_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026524Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300034177Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17EnvironmentalOpen in IMG/M
3300034773Sediment microbial communities from East River floodplain, Colorado, United States - 4_s17EnvironmentalOpen in IMG/M


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
soilL2_1014297633300003319Sugarcane Root And Bulk SoilVSGPEFAAMVFVVGGGFWVVRPVAMAIAKRIAGEHRKAELDPEDRDEILAELHQVRQEVAELAERMDFAERMLAKPRE*
soilH2_1038973923300003324Sugarcane Root And Bulk SoilMVFVVGGGFWVVRPVAMAIAKRIAGEHRKAELDPEDRDEILAELHQVRQEVAELAERMDFAERMLAKPRE*
Ga0062593_10200596313300004114SoilMSGPEMIAAVVFFGGLFTVLRPVAGAVAKRISGEARRNEMEAGDRDEIVAELQQMRQEMSELAERVDFTERLLAKQGDTGR
Ga0063455_10139189823300004153SoilMVFGGAFWVLRPIGAAIAKRIAGEHRKPGMDAADRDEILSELHAVREEVAELAERMDFAERMLAKPKNG*
Ga0066679_1016540533300005176SoilMFFGGTFWVLRPVAAAVAKRIAGEHRRPSMEPAERDEILNELQQVRQELTELAERMDFADRLLAKQSEVKR*
Ga0066690_1008197053300005177SoilAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELHQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0066688_1061364413300005178SoilVSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELHQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0066684_1019299623300005179SoilMFFGGAFWVLRPIGAAIAKRIAGEHRKPGMDAADRDEILSELHAVREEVAELAERMDFAERMLAKPKNG*
Ga0066671_1005732513300005184SoilAAVIFFGGAFWVLRPIGAAIAKRIAGEHRKPGMDAADRDEILSELHAVREEVAELAERMDFAERMLAKPKNG*
Ga0068869_10110999213300005334Miscanthus RhizosphereMSGPEMIAAVVFFGGLFTVLRPVAGAVAKRISGEARRNEMEAGDRDEIVAELQQMRQEMSELAERVDF
Ga0070705_10015696513300005440Corn, Switchgrass And Miscanthus RhizosphereMNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELRQVREEVAELAERMDFAER
Ga0066686_1060858913300005446SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEV
Ga0066682_1008966913300005450SoilVSPQDAAAVFLIFGGGFWVIRPVAAAIAKRIAGEHRRPGIEPAERDEILDELQRVRAELTELAERMDFAER
Ga0066682_1082137213300005450SoilTFWVLRPVAAAVAKRIAGEHRKPGMDAAESDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0066681_1006223543300005451SoilMSGPEAVAAFMFFGGAFWVIRPVAAALAKRIAGEHRRPGMEPAERDEILGEVQQMRQELSELAERVDFTERLLAKQSEIKGRP*
Ga0070699_10054946513300005518Corn, Switchgrass And Miscanthus RhizosphereMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAELAERM
Ga0073909_1029603513300005526Surface SoilMNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILSELRQVREEVAELAERMDFAERL
Ga0066692_1100087613300005555SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEIMSELRQVREEVAELAERMDFAERL
Ga0066651_1025337343300006031SoilMSGPEAVAAFMFFGGAFWVIRPVAAALAKRIAGEHRRPGMEPAERDEILGEVQQMRQELSELAERVDFTERLLAKQSEIKG
Ga0066696_1023568633300006032SoilVSGPEAVAAFMFFGGAFWVLRPIGAAIAKRIAGEHRKPGMDAADRDEILSELHVVREEVAELAERMDFAERMLAKPKNG*
Ga0066652_10112494123300006046SoilMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPSMEPAERDEILNELQQVRQEVTELAERMAFAERLLAKQSEVKR*
Ga0066659_1003767553300006797SoilMTGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPSMEPAERDEILNELQSVRQELTELAERMDIAERLLAKQSEVKR*
Ga0075428_10067714013300006844Populus RhizosphereMSGPEAIIAFVFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILSELHEVRQEVAELAE
Ga0075436_10078405433300006914Populus RhizosphereMSGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGIDPAERDEILNQLHAVREEVAELAERM
Ga0075436_10113624733300006914Populus RhizosphereMSGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGIDPAERDEILTELHAVREEVAELAERMD
Ga0075435_10112749633300007076Populus RhizosphereMNVTEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILSELQAVRQEVAELAERMDFA
Ga0099793_1013555423300007258Vadose Zone SoilMNAPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGIGAEERDEILTELQQVRHEVAELAERMDFAERMLAKPRE*
Ga0075423_1119043033300009162Populus RhizosphereMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMDPDEKEEILNELQHVRQEVAELAE
Ga0127484_116652113300010134Grasslands SoilTFWVLRPVAAAVAKRIAGEHRKPGMDAAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0127499_109032443300010141Grasslands SoilGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0134088_1042970133300010304Grasslands SoilMSGPEAVAAVVFFGGVFTILRPVAAAVAKRISGEHRRTGLESAERDEILSQLQAVREEVAELAER
Ga0134071_1002233233300010336Grasslands SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0134062_1043671933300010337Grasslands SoilMNGPEAIAAFMFFGGTFWVLRPVAGAIAKRIAGEHRKPGIDPAERDEILSELQRMRQEVAELAERMDFAERLL
Ga0134066_1000369033300010364Grasslands SoilMFFGGAFWVLRPIGAAVAKRIAGEHRKPGLDAADRDEILTELHAVREEVAELAERMDFAERMLAKPKNG*
Ga0137448_110906733300011427SoilMSGPEALAMFAFLGGSFWVLRPVAAAVAKRIAGEHRRPGMDHEERDEILTELQQVRHEVAELAERMDFAERMLAKPRE*
Ga0137351_102844613300012140SoilMSGPEALAAVVFFGGVFTVLRPVAAAVAKRISGEHRQAGIDPAERDEILTELQQVRQELTELAERVEFTERLLARQQQDALPKPGR*
Ga0137382_1001577523300012200Vadose Zone SoilVNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0137382_1030687543300012200Vadose Zone SoilVNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGLDPAERDEILSELQAVRQEVADLAERMDFAE
Ga0137382_1070172313300012200Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGLDPAERDEILSELQAVRQEVADLAERMDFAE
Ga0137365_1002720913300012201Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAIAKRIAGEHRRPGLEKEERDEILTELQQVRQEVAELAE
Ga0137374_1059013333300012204Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMETAEREEILSELQQVREEVAELAERMDFA
Ga0137380_1001513843300012206Vadose Zone SoilMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0137379_1010290113300012209Vadose Zone SoilMNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGIDPAERDEILNELQQVREEVAELAERMDFAERLL
Ga0137379_1125199513300012209Vadose Zone SoilMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGIDPAERDEILGELRAVREEVAELAERMDFA
Ga0137378_1100076813300012210Vadose Zone SoilMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILTQLQQVREEVAELAERMDFAERLLAK
Ga0137377_1052977333300012211Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPGERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0137370_1000688463300012285Vadose Zone SoilVNGPEAVAAFMFFGGAFWVLRPIGAAVAKRIAGEHRRPEMDAADREEILSELHAVREEVAELAERMDFAERMLAKPKDG*
Ga0137387_1057532033300012349Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAQLAERLDFAERLLSKQSEVKR*
Ga0137387_1081227113300012349Vadose Zone SoilMSGPEAVAAFMFFGGTFWVLRPDAAAGAKRIAGEHRRPGMEPAERDEILTELQQVRQEVAELAERMDFAERLLAKPRDG*
Ga0134057_107494613300012396Grasslands SoilMNWPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLL
Ga0134055_126708233300012401Grasslands SoilMNGPEAIAAFMFFGGTFWVLRPVAAAIAKRIAGEHRKPGMDPAERDEILSELQAVREEVAELA
Ga0134055_127702313300012401Grasslands SoilGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0134049_140142933300012403Grasslands SoilVSPQDAAAVFLIFGGGFWVIRPVAAAIAKRIAGEHRRPGIEPAERDEIIEELQRVREEL
Ga0137396_1015122753300012918Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILNELQQVRQEVAELAERLDFAERLLSKQSEVKR*
Ga0137396_1060602633300012918Vadose Zone SoilMNGPEAVAAFMFFGGAFWVLRPVAAAVAKRIAGEHRRPTMEPAERDEILNELQSVRQELAELAE
Ga0137396_1060660633300012918Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPTMEPAERDEILNELQSVRQELAELAE
Ga0137394_1030082613300012922Vadose Zone SoilMSGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGLELADRDEILTELQQLRREVAELAERVDFTERMLPRETDRK
Ga0137394_1035264643300012922Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGLEPAEHDEILTELQQLRREVAELAERVDFTER
Ga0137359_1127515633300012923Vadose Zone SoilTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0137416_1017687113300012927Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGLDPAERDEILSELQHVREEVAEL
Ga0137416_1054097243300012927Vadose Zone SoilMNGPEAVAAFVFFGGTFWVLRPVAAAVAKRIAGEHRRATLDPGEREEILSELQQVRQELSDLAERMDFAERLLAK
Ga0137416_1153708323300012927Vadose Zone SoilFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILSELQEMRSEISELAERMDFAERLLSKQSEVKR*
Ga0134077_1047666513300012972Grasslands SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGLEPSEREEILSELQHVREEVAELAERMDFAERL
Ga0134076_1048931013300012976Grasslands SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEAAERDEIMSELQHVREEVAELAERMDF
Ga0134076_1055692313300012976Grasslands SoilMSGPEAVAAVVFFGGVFTILRPVAAAVAKRISGEHRRTGLESAERDEILSQLQAVREEVAELAEWL
Ga0137420_104891313300015054Vadose Zone SoilVNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAELAERMDLAERLLSK
Ga0137409_1103610613300015245Vadose Zone SoilNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILNELHQVRQEVAELAERMDFAERLLSKQSEVKR*
Ga0184618_1022948433300018071Groundwater SedimentVNPPEAVAAFAFFGGVFWVLRPVAAAVAKRIAGEHRRPGLEKEDREEILTELQQVRHEVAELAERMDFAERMLAKPRE
Ga0184640_1014570733300018074Groundwater SedimentMSGPEALAMFAFLGGSFWVLRPVAAAVAKRIAGEHRRPGMDHEERDEILTELQQVRHEVAELAERMDFAERMLAKPRG
Ga0066667_1032705533300018433Grasslands SoilMSGPEAVAAFMFFGGAFWVIRPVAAALAKRIAGEHRRPGMEPAERDEILGEVQQMRQELSELAERVDFTERLLAKQSEIKGRP
Ga0066667_1183452913300018433Grasslands SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILRELQQVREEVAELAERMEFAERLLATPRVGSGK
Ga0180115_111191343300019257Groundwater SedimentFFGGVFTVLRPVAAAVAKRISGEHRQAGIDPAERDEILTELQQVRQELTELAERVDFTERLLARQQQDALPKPGR
Ga0193710_100106143300019998SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILDELHQVRQEVAEL
Ga0193739_1000065143300020003SoilMSGPEALAMFAFLGGSFWVLRPVAAAVAKRIAGEHRRPGMDHEERDEILTELQQVRHEVAELAERMDFAERMLAKPRE
Ga0179594_1022953313300020170Vadose Zone SoilVNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILSQLQQVREEVAEL
Ga0193719_1020501533300021344SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGLEKDDRDEILTELQQVRHEVAELAERMDFAERLL
Ga0209640_1076248933300025324SoilMSGPEAVAMFVFLGGSFWVLRPVATAIAKRIAGEHRRPPEMDREERDAILAELQQVRREVSELAE
Ga0207653_1031537213300025885Corn, Switchgrass And Miscanthus RhizosphereVSPGEAAAFVMIFGGGFWVIRPVAAAIAKRIAGEHRRPGIETAERDELLQELQQVREELTELA
Ga0207652_1129808623300025921Corn RhizosphereMIAAVVFFGGMFTILRPVAGAVAKRISGEARRNEMEAGDRDEILAELQQMRQEMSELAERVDFTERLLARHQQDALPKPGG
Ga0209438_102500333300026285Grasslands SoilMNPPEAVAAFAFFGGVFWVLRPVAAAVAKRIAGEHRRPGLEKEDREEILSELQQVRHEVAELAERMDFAERMLAKPRE
Ga0209237_103988113300026297Grasslands SoilMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILSELQAMRTEISELAERMDFAER
Ga0209131_114315843300026320Grasslands SoilGTFWVLRPLAAAVAKRIAGEHRPPATDAGEREEILTELQQLRHEVGELAERVDFTERLLAREREMAKLDRGH
Ga0209152_1002445523300026325SoilVSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELHQVRQEVAELAERMDFAERLLSKQSEVKR
Ga0209690_109050853300026524SoilMNGPETIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPADRDEILSELQAVRQEVAELAERMDFAERLLA
Ga0209058_104392143300026536SoilMSGPEAVAAFVFFGGTFWVLRPVAAAIAKRIAGEHRRPSMEPAERDEILNELQSVRQELTELA
Ga0209157_102520643300026537SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMDPAERDEILNELQQVRQEVAELAERMDFAERLLSKQSEVKR
Ga0209076_119221323300027643Vadose Zone SoilMNAPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGIGAEERDEILTELQQVRHEVAELAERMDFAERMLAKPRE
Ga0209177_1024872733300027775Agricultural SoilMSGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGIDPAERDEILTELQAVREEVAELA
Ga0209488_1105420913300027903Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAEHDEILNELQQVREEVAELAERL
Ga0137415_1122060223300028536Vadose Zone SoilMNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILSELQAMRTEISELAERMDFAERLLSK
Ga0137415_1149872413300028536Vadose Zone SoilMNAPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGIGAEERDEILTELQQVRHEVAELAERMDFA
Ga0307501_1014510623300031152SoilMSGPEAVAAFMFFGGAFWVLRPLAGAVAKRIAGEHRRPGLEKEDREEILTELQQVRHEVAELAERMDFAERMLAKPRE
Ga0307495_1000414243300031199SoilMSGPEMVAAVVFFGGMFTILRPVAGALAKRISGEARRNEMDAGDRDEILTELQQVRHEVAELAERVDFAERLLSKQSEVKR
Ga0307469_1183573413300031720Hardwood Forest SoilMNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRRPGMEPAERDEILSELQAVRQEVAELAERMDFAER
Ga0307469_1225956213300031720Hardwood Forest SoilMNGPEAIAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILSELQQVRAEVAELAERMDFA
Ga0326597_1045574833300031965SoilMSGPEAVAAFMFFGGAFWVLRPVAAAIAKRIAGEHRPPGMDKEERDEILTELQEVRAELGELAERMDFAERMLAKPRE
Ga0214472_1034570213300033407SoilAFMFFGGAFWVLRPVAAAIAKRIAGEHRPPGMDKEERDEILTELQEVRAELGELAERMDFAERMLAKPRE
Ga0214471_1108358633300033417SoilFFGGAFWVLRPVAAAIAKRIAGEHRPPGMDKEERDEILTELQEVRAELGELAERMDFAERMLAKPRE
Ga0364932_0324913_2_2143300034177SedimentMFAFLGGSFWVLRPVAAAVAKRIAGEHRRPGMDHEERDEILTELQQVRHEVAELAERMDFAERMLAKPRE
Ga0364936_040374_39_2753300034773SedimentMSGPEALAMFAFLGGSFWVLRPVAAAVAKRIAGEHRRPGMDHEERDEILTELQQVRHEVAELAERMDFAERMLAQPRG
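Many entries in the Sequence column end with a `*` character while others do not. The sketch below separates that suffix from the residues so that lengths can be compared consistently. Treating `*` as a terminal marker (for example a translated stop codon) is an assumption here, since the page does not define it, and `split_terminal_marker` is a hypothetical helper name.

```python
from typing import Tuple


def split_terminal_marker(raw: str) -> Tuple[str, bool]:
    """Return (residues, had_marker) for one entry from the Sequence column."""
    had_marker = raw.endswith("*")
    residues = raw[:-1] if had_marker else raw
    return residues, had_marker


# Two entries copied from the table above (Ga0066679 and Ga0070699 rows).
rows = [
    "MFFGGTFWVLRPVAAAVAKRIAGEHRRPSMEPAERDEILNELQQVRQELTELAERMDFADRLLAKQSEVKR*",
    "MNGPEAVAAFMFFGGTFWVLRPVAAAVAKRIAGEHRKPGMEPAERDEILNELQQVRQEVAELAERM",
]
for raw in rows:
    residues, had_marker = split_terminal_marker(raw)
    print(len(residues), had_marker)
```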




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice