NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F096949

Metagenome / Metatranscriptome Family F096949

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F096949
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 77 residues
Representative Sequence TPYLGVAGGADWIFAPPQACRQVVERVGAARKMLAVAPGLSHRGLVLSERARISCWPNVVAWLKETL
Number of Associated Samples 93
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.96 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction Yes
3D model pTM-score0.49

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.038 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(29.808 % of family members)
Environment Ontology (ENVO) Unclassified
(32.692 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(43.269 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 33.68%    β-sheet: 11.58%    Coil/Unstructured: 54.74%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.49
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 104 Family Scaffolds
PF13419HAD_2 62.50
PF01571GCV_T 23.08
PF08669GCV_T_C 7.69
PF01370Epimerase 3.85
PF01074Glyco_hydro_38N 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 104 Family Scaffolds
COG0383Alpha-mannosidaseCarbohydrate transport and metabolism [G] 0.96


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.04 %
All OrganismsrootAll Organisms0.96 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300009038|Ga0099829_10023450All Organisms → cellular organisms → Bacteria4281Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil29.81%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil12.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil11.54%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.65%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere7.69%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.81%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.85%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.88%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.88%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil1.92%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.92%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil1.92%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere1.92%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.96%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.96%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.96%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.96%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005576Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157EnvironmentalOpen in IMG/M
3300005615Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-3 metaGEnvironmentalOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006844Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2Host-AssociatedOpen in IMG/M
3300006845Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5Host-AssociatedOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300006853Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300009799Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_0_30EnvironmentalOpen in IMG/M
3300009803Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011443Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2EnvironmentalOpen in IMG/M
3300011444Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012392Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_4_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012409Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_16_1 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300012910Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S198-509B-2EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015054Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015359Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015EnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019866Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H1m1EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300024347Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026277Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028796Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_141EnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034354Sediment microbial communities from East River floodplain, Colorado, United States - 23_s17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1003056633300002558Grasslands SoilAAVSTPYFGVAGGADWIFAPPRACRQIVDRVGAARKMLAVEPGLSHRGLVLSERARTACWPNVVAWLKETL*
JGI25385J37094_1003797213300002558Grasslands SoilAALSTPYLGVAGGADWIFAPPKACRQVVERVGAADTTLAVAPGLSHRGLVLSEHARTSCWPKVVAWLKETL*
JGI25383J37093_1002673833300002560Grasslands SoilGTADRIFAPPAACRQVVDRVGSRRKLLAVEPGLTHRGLVLSDRARQSCWPNVVAWLKETL
JGI25382J37095_1018340113300002562Grasslands SoilTPYLGVAGGADWIFAPPQACRQVVERVGAARKMLAVAPGLSHRGLVLSERARISCWPNVVAWLKETL*
JGI25382J43887_1012699013300002908Grasslands SoilAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPQACRQVVERVGAARKMLAVAPGLSHRGLVLSERARISCWPNVVAWLKETL*
Ga0062594_10249381523300005093SoilGLAALNIPYLGVAGGADWIFAPAKACQQVVDRVGATRKALAVEPGLTHRGLVLSPRARLACWPNIVAWLKETL*
Ga0066683_1049002623300005172SoilRWLGMDGFDYFAGLAALNTPYLGVAGGADRIYAPPKACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPNVVAWLKETL*
Ga0066680_1074345413300005174SoilDGFDYFGGLAAVSVPYLGVAGTADRIFAPPAACRQVVDRVGSRRKLLAVEPGLTHRGLVLSDRARQSCWPNVVAWLKETL*
Ga0070694_10119888313300005444Corn, Switchgrass And Miscanthus RhizosphereIIAEWMEWNVRGAWLGTDAFDYFAGLASVSIPYLGVAGTADRIFAPPKACRQVVDRIGTARKALVVERGLSHAGLVLDPRARVTCWPNIVSWLKETL*
Ga0066682_1044709213300005450SoilDGFDYFTALAAVRTPYLAVAGGSDRVFAPATACRQVVERVGSDRKSLTVEAGLSHRGLVLSPKARDSCWPGVAAWLKETLG*
Ga0070699_10023451633300005518Corn, Switchgrass And Miscanthus RhizosphereTWVGSDGFDYFAALAAVRTPYLAVAGGSDRVFAPTAACRQVLERVGSARKTLAVEPGLSHSGLVLAPKARESCWPAVAAWLKETLA*
Ga0070699_10035828523300005518Corn, Switchgrass And Miscanthus RhizosphereGGWMGSDGFDYFTALGAVHTPYLAVAGGSDRLFAPAAACRQVVDRVGAERKKLTIETGLTHRGLVLSPRARDTCWLGVATWIKEILG*
Ga0066692_1004209113300005555SoilAAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACRQVVERVGAADTTLAVAPGLSHRGLVLSEHARTSCWPKVVAWLKETL*
Ga0066708_1008746533300005576SoilNVRGTWLGSDGFDYFAGLASITTPYLGVAGAADRVFAPPAACRQVVERLGAIRKTLAIEPGLSHRGLVLSERARSACWPNVVVWLKETL*
Ga0066708_1048299823300005576SoilGNEDEAAEVIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACRQVVERVGAPRKRLAVAPGLSHRGLVVSDRARTVCWPNVVAWLKETL*
Ga0070702_10028928333300005615Corn, Switchgrass And Miscanthus RhizosphereAADRIFAPPKACGQVVDRVGAVRKALVVERGLTHAGLVLNPRARETCWPQIVSWLKETL*
Ga0068862_10157114613300005844Switchgrass RhizosphereVRGAWMGTDAFDYFAGLASVSIPYLGVAGAADRIFAPPKACGQVVDRVGAARKALVVERGLTHAGLVLDPRARETCWPQIVSWLKETL*
Ga0066656_1018110533300006034SoilLGVAGGSDWLYAPPRACQQIVDRVGAARKALAVAPGLSHRGLVLSERARTTCWPNIVAWLKETW*
Ga0066659_1145379523300006797SoilGLAALSTPYLGVAGGSDWLYAPPQACRQVVDRVGATRKTLAVAPGLSHRGLVVSELARISCWPNIVAWLKETW*
Ga0075428_10256425623300006844Populus RhizosphereLAVVGGSDRLFAPETACRKVLERVGSERKLLTVEPGLSHRGLVLSPGARAACWPGVAAWIKDILG*
Ga0075421_10273925323300006845Populus RhizosphereDGFDYFAALTTVRTPYLAVVGAADRLFAPEAACRKVLERVGSERKLLTVETGLSHRGLVLAPQARNACWPGVAAWIKEVLG*
Ga0075431_10036266913300006847Populus RhizosphereGVAGASDRIFAPPTACRQVVDRVGTGRKRLAIEPGLSHRGLVLSPHARETCWPNIATWLKETL*
Ga0075420_10153373223300006853Populus RhizospherePATACRLVVEQVGSERKSLTVETGLSHRGLVLAPEARDSCWPAVAAWLKETLG*
Ga0079219_1064806723300006954Agricultural SoilGVAGGSDWIFAPPRACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPHVVAWLKETL*
Ga0075435_10149379613300007076Populus RhizosphereHTPYLAVAGGSDRVFAPVIACRHVIDRVGSERKTLTIESGLSHSGLVLSPKARESCWTGVATWLKETLA*
Ga0099791_1010009733300007255Vadose Zone SoilDGFDYFAGLAALSTPYLGVAGGADWVFAPPKACREVVERVGTGRKKLVIEPGLSHRGMVLSERARASCWPNIVAWLKETL*
Ga0099793_1007544033300007258Vadose Zone SoilDGFDYFAGLAALSAPYLGVAGGADWIFAPPKACRQVVERVGAAHKMLAVAPGLSHRGLVLSEHARTSCWPQVVAWLKETL*
Ga0099794_1011005333300007265Vadose Zone SoilWLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACRQVVERVGAAHKMLAVAPGLSHRGLVLSEHARTSCWPQVVAWLKETL*
Ga0099794_1052900323300007265Vadose Zone SoilYFAGLAALSTPYLGVAGGSDWMFAPPQACQQVVDRVGAARKTLAIAPGLSHRGLVLSEQARTSCWPAVVAWLKETL*
Ga0099795_1044815923300007788Vadose Zone SoilYLGVAGGADRIFAPPKACRQVVERVGSARKMLAVEPVLSHRGLVLSDRAKTSCWPNVVTWLKETL*
Ga0099829_1002345063300009038Vadose Zone SoilGGADWIFAPPKACRQVVERVGAAHKMLAVAPGLSHRGLVLSEYARTSCWPKVVAWLKETL
Ga0099829_1068050823300009038Vadose Zone SoilADWIFAPPKACRQVVERVGTGRKKLVIEPGLSHRGMVLSERARASCWPNIVTWLKETL*
Ga0099830_1015445333300009088Vadose Zone SoilEIIAEWMEWNVRGRWLGSDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACQQVVERVGTGRKKLVIEPGLSHRGMVLSERARASCWPNIVTWLKETL*
Ga0075418_1098323123300009100Populus RhizosphereAVVGAADRLFAPEAACRKVLERVGSERKLLTVETGLSHRGLVLAPQARDACWPGVAAWIKEVLG*
Ga0066709_10080078313300009137Grasslands SoilEDEAAEIIAEWMEWNVRGAWLGTDRFDYFGGLAAVSVPYLGVAGTADRIFAPPAACRQVVDRIGSRRKLLAVASGLTHRGLVLSDRARQSCWPNVVAWLKETL*
Ga0114129_1026990643300009147Populus RhizosphereGASDRLFAPVAACRQVVERVGSERKTLSVETGFSHSGLVLSPRARDVCWPGIAAWLKETLA*
Ga0075423_1066960113300009162Populus RhizosphereGLAALNTPYLGVAGGSDRIFAPPRACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPHVVAWLKETL*
Ga0105075_101504313300009799Groundwater SandEDEATEIISQWMEWNVRGAWLGSDGFDYFAALGAVTTPYLGIAGAADRIFAPPSACKQVVDRIGAAQKAFEVEPGLSHRGLVLSERGRSGCWANLAGWLKETL*
Ga0105065_102776723300009803Groundwater SandPYLGVAGEADRIFAPPAACRHLVDHVGTARKKLLIEPGLSHAGLVLGERAKTSCWPNIVAWLKETL*
Ga0099796_1005056833300010159Vadose Zone SoilEAAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGGADRIFAPPKACRQVVERVGSARKMLAVEPGLSHRGLVLSDRAKTSCWPNVVTWLKETL*
Ga0134088_1037483813300010304Grasslands SoilPARALRFGNEDEAAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALNTPYLGVAGGADRIYAPPKACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPNVVAWLKETL*
Ga0134109_1002427433300010320Grasslands SoilFDYFAGLAALSTPYLGVAGGSDWIFAPPNSCRQVVERVGAARKSLAIAPGLSHGGLVLSEQARMACWPNIVAWLQETL*
Ga0134111_1014478423300010329Grasslands SoilDGFDYFAALAAVRTPYLAVAGAADRIFAPPAACHQVVRRIGAERKTLSIVPGLSHRGLVLGGKARDACWPAVSTWLKEILTPGYISAR*
Ga0134111_1051976923300010329Grasslands SoilGVAGGADWIFAPPRACRQVVDRVGAARKTLAVAPRLSHRGLVVSEHARTNCWPNVVAWLRETL*
Ga0134122_1136555813300010400Terrestrial SoilDEAAEIIAEWMEWNVRGAWMGTDAFDYFAGLASVSIPYLGVAGAADRIFAPPKACGQVVDRVGAVRKALVVERGLTHAGLVLNPRARETCWPQIVSWLKETL*
Ga0134121_1202061313300010401Terrestrial SoilGAWMGTDAFDYFAGLASVSIPYLGVAGAADRIFAPPKACGQVVDRIGAARKALVVERGLTHAGLVLNPRARETCWPQIVSWLKETL*
Ga0134123_1008259443300010403Terrestrial SoilMEWNVRGAWMGTDAFDYFAGLASVSIPYLGVAGAADRIFAPPKACGQVVDRVGAARKALVVERGLTHAGLVLDPRARETCWPQIVSWLKETL*
Ga0137457_114717123300011443SoilGGADRIFAPPAACRQVVERVGAARKMLAVVPGLSHSGLVLSERARSSCWPNIVAWLKETL
Ga0137463_133082413300011444SoilLAVAGGSDRIFAPVTACRQLVDRVGSERKTLTVNAGLSHSGLVLSPRAREECWPGVATWLKETLA*
Ga0137389_1024021833300012096Vadose Zone SoilVAGGADWIFAPPKACRQVVERVGTGRKKLVIEPGLSHRGMVLSERARASCWPNIVTWLKETL*
Ga0137399_1020695033300012203Vadose Zone SoilGVAGGADWIFAPPKACRQVVERVGAARKMLAVAPGLSHRGLVLSDRARTSCWPNVVAWLKETL*
Ga0137399_1049695723300012203Vadose Zone SoilAPPAACRQVVDRVGSDRKKLTIETGLSHRGLVLSPRARDGCWLGIATWLKEILG*
Ga0137380_1058887833300012206Vadose Zone SoilAAVSVPYLGVAGTADRIFAPPAACRQVVDRIGSRRKLLAVASGLTHRGLVLSDRARQSCWPNVVAWLKETL*
Ga0137376_1000531973300012208Vadose Zone SoilAGGADWIFAPPRACRQVVDRVGGARKTLAVAPRLSHRGLVVSERARTNCWPNVVAWLRETL*
Ga0137376_1174532213300012208Vadose Zone SoilTPYLAVAGASDRVFAPAAACRQVVERIGAERKALSVAPGLSHRGLVLGEKARDACWPHVATWLREILTPGYISAE*
Ga0137377_1075923123300012211Vadose Zone SoilLGVAGTADRIFAPPAACRQVVDRVGSRRKRLAVESGLTHRGLVLSDRARQSCWPNVVGWLKETL*
Ga0137377_1156734513300012211Vadose Zone SoilTPYLGVAGGADWIFAPPRACRQVVERVGGANKKLAVEAGLSHRGLVLNERARLSCWPNVVAWLKETL*
Ga0137387_1010217033300012349Vadose Zone SoilFAPATACRQVVERVGSDRKSLTVEAGLSHRGLVLSPKARDSCWPGVAAWLKETLG*
Ga0137386_1128044613300012351Vadose Zone SoilYFAGLAALSTPYLGGAGGADWIFAPARACRQVVERVGAAHKMLAVAPGLSHRGLVLSEHARTSCWPNVVAWLKETL*
Ga0134043_113733123300012392Grasslands SoilGADWIFAPPRACRQVVERVGGANKKLAVEAGLSHRGLVLSERARLSCWPNVVAWLKETL*
Ga0134045_105499623300012409Grasslands SoilLAALSTPYLGVAGGADWIFAPPRACRQVVERVGGANKKLAVEAGLSHRGLVLSERARLSCWPNVVAWLKETL*
Ga0157308_1020831823300012910SoilASVSIPYLGVAGAADRIFAPPKACGQVVDRVGAVRKALVVERGLTHAGLVLNPRARETCWPQIVSWLKETL*
Ga0137395_1106701123300012917Vadose Zone SoilALNTPYLGVAGGADRIFAPPRACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPTVVAWLKETL*
Ga0137359_1010256513300012923Vadose Zone SoilDGFDYFAGLAALNTPYLGVAGGSDWIFAPPKACQQVVDRVGAARKTLTIAPGLSHRDLVLGEQARTTCWPAVVAWLKETL*
Ga0137413_1119786623300012924Vadose Zone SoilDEAAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALNTPYLGVAGGSDWIFAPPKACQEVVDRVGAARKTLTIAPGLSHRDLVLGEQARTTCWPAVVAWLKETL*
Ga0137419_1086403423300012925Vadose Zone SoilGADWIFAPPRACRQVVERVGGANKKLAVEPGLSHRGLVLSERARLSCWPNVVAWLKETL*
Ga0137416_1163197413300012927Vadose Zone SoilDYFAALGAVRTPYLAVAGGSDRVFAPVVACRHVIDRVGSERKTLTVEAGLSHRGLVLSPKARDSCWTGVAGWLKETLA*
Ga0137420_117232533300015054Vadose Zone SoilRFGNEDEAAEIIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACRQVVERVGAAHKMLAVAPGLSHRGLVLSEHARTSCWPQVVAWLKETL*
Ga0134089_1010392533300015358Grasslands SoilWNVRGRWLGIDGFDYFAGLAALNTPYLGVAGGADRIYAPPKACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPNVVAWLKETL*
Ga0134085_1000704713300015359Grasslands SoilFGGLAAVSIPYLGVAGTADRIFAPPAACRQVVDRVGSRRKLLAVEPGLTHRGLVLSDRARQSCWPNVVAWLKETV*
Ga0134085_1047758023300015359Grasslands SoilGVRTPYLAVAGAADRIFAPPAACHQVVRRIGAERKTLSIVPGLSHRGLVLGGKARDACWPAVSTWLKEILTPGYISAR*
Ga0184634_1005590913300018031Groundwater SedimentTPYLAVAGGSDRVFAPPAACRHLVEQVGSERKSLLVEAGLSHRGLVLAPRARDSCWPAVAAWLKETLG
Ga0184640_1006372533300018074Groundwater SedimentGSDRVFAPPAACRHLVEQVGSERKSLLVEAGLSHRGLVLAPRARDSCWPAVAAWLKETLG
Ga0066655_1065825123300018431Grasslands SoilLGVAGGSDWLYAPPQACRQVVDRVGATRKTLAVAPGLSHRGLVVSERARISCWPNIVAWLKETW
Ga0066667_1060820313300018433Grasslands SoilFDYFAGLAALNTPYLGVAGGADRIYAPPKACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPNVVAWLKETL
Ga0066669_1114663323300018482Grasslands SoilGVAGGSDWIFAPPKACRQVVDRVGAARKTLAIAPGLSHRDLVLGEQARTACWPAVVAWLKETL
Ga0193756_100687133300019866SoilMRWNVRGAWVGSDGFDYFAALAAVRTPYLAVAGGSDRLFAPTAACRQVVERVGSARKTLAVEPGLSHSGLVLAPQARESCWPAVAAWLKETLA
Ga0193727_112093123300019886SoilMRWNVRGEWTGSDGFDYFTALGAVHTPYLAVAGGSDRLFAPAAACRQVVDRVGAERKKLTIETGLTHSGLVLSPRARDTCWLGVATWIKEILE
Ga0210382_1014830913300021080Groundwater SedimentMDGFDYFAGLAALSTPYLGVAGGADWIFAPPKACRQVVERVGGANKKLAVEAGLSHRGLVLSERAKLSCWPNVVAWLKETL
Ga0179596_1026633213300021086Vadose Zone SoilADWIFAPPRACRQIVDRVGAARKMLAVEPGLSHRGLVLSERARTACWPNVVAWLKETL
Ga0224452_110335213300022534Groundwater SedimentTAVRTPYLAVVGGSDRIFAPAAACQHVVEQVGSERKSLTVAAGLSHRGLVLAPQARDSCWPAVAAWLEETLG
Ga0179591_116230113300024347Vadose Zone SoilGGSDRVFAPTAACRQVVERVGSARKTLAVEPGLSHSGLVLAPKARESCWPAVAAWLKETL
Ga0209640_1045912723300025324SoilTPLLSVAGAADRVFAPPAACRQIVEQIGATRKALVVEPGLTHRGLVLSERARSSCWPNIVAWLKETL
Ga0207675_10103289213300026118Switchgrass RhizosphereARALRFGNEDEAAEIIAEWMEWNVRGAWMGTDAFDYFAGLASVSIPYLGVAGAADRIFAPPKACGQVVDRVGAARKALVVERGLTHAGLVLNPRARETCWPQIVSWLKETL
Ga0209350_103045333300026277Grasslands SoilADRIFAPPAACRQVVDRVGSRRKLLAVEPGLTHRGLVLSDRARQSCWPNVVAWLKETL
Ga0209235_100600413300026296Grasslands SoilDRIFAPPAACRQVVDRVGSRRKLLAVEPGLTHRGLVLSDRARQSCWPNVVAWLKETL
Ga0209240_110926913300026304Grasslands SoilALNTPYLGVAGGSDWIFAPPKACQQVVDRVGAARKTLTIAPGLSHRDLVLGEQARTTCWPAVVAWLKETL
Ga0209761_126191723300026313Grasslands SoilLGMDGFDYFAGLAALSTPYLGVAGGADWIFAPPQACRQVVERVGAARKMLAVAPGLSHRGLVLSERARISCWPNVVAWLKETL
Ga0209266_123854023300026327SoilGFDYFAGLAALSVPYLGVAGAADRIFAPAAACRQVVERVGAARKALAIEPGLSHRGLVVSEQARSSCWPNIVAWLKETL
Ga0209378_100503673300026528SoilGLAAVSTPYFGVAGGADWIFAPPRACRQIVDRVGAARKMLAVEPGLSHRGLVLSERARTACWPNVVAWLKETL
Ga0209058_104800813300026536SoilDYFAGLAALNTPYLGVAGGADRIYAPPKACRQVVDRVGAARKMLAIAPGLSHRGLVLSDRARVACWPNVVAWLKETL
Ga0209376_128491123300026540SoilGFDYFAGLAALSVPYLGVAGAADRIFAPAAACRQVVERVGAGRKALAIEPGLSHRGLVVSEQARSSCWPNIVAWLKETL
Ga0209076_110093923300027643Vadose Zone SoilRFGNEDEAAEVIAEWMEWNVRGRWLGMDGFDYFAGLAALSTPYLGVAGCADWIFAPPKACRQVVERVGAARKMLAVAPGLSHRGLVLSDRARTSCWPNVVAWLKETL
Ga0209388_123284013300027655Vadose Zone SoilFAGLAALSTPYLGVAGGADWVFAPPKACREVVERVGTGRKKLVIEPGLSHRGMVLSERARASCWPNIVAWLKETL
Ga0209177_1014473013300027775Agricultural SoilDGFDYFAGLAALNTPYLGVAGGSDWMFAPAESCRQVVDRVGAARKTLAIAPGLTHRGLVLSEHARTSCWPAVVAWLKETL
Ga0137415_1053299213300028536Vadose Zone SoilEDEAAEIIAEWMEWNARGRWLGMDGFDYFAGLAALTTPYLGVAGGADWIFAPPKACRQVVERVGAARKTLAVAPGLSHRGLVLSDRARTACWPNVVAWLKETL
Ga0137415_1094556623300028536Vadose Zone SoilDYFAALGAVRTPYLAVAGGSDRVFAPVVACRHVIDRVGSERKTLTVEAGLSHRGLVLSPKARDSCWTGVAGWLKETLA
Ga0307287_1014218413300028796SoilGLAALNIPYLGVAGGADWIFAPPNACRQVVDRVGAARKMLAVAPGLSHRGLVSSERARTTCWPNVVAWLKETL
Ga0307495_1002593723300031199SoilPEIIAEWMEWNVRGAWMGNDAFDYFAGLASVSIPYLGVAGTADRIFAPPKACRQVVDRVGATRKALVVERGLTHAGLVLDPRARETCWPHIVSWLKDTL
Ga0307468_10149319513300031740Hardwood Forest SoilEWMEWNVRGAWMGTDAFDYFAGLAAVSIPYLGVAGTADRIFAPPKACRQVVDRIGTARKALMVERGLSHAGLVLSPRAREACWPHIVSWLKETL
Ga0307473_1104934223300031820Hardwood Forest SoilGADWIFAPPRACRQIVDRVGAARKMLAVEPGLSHRGLVLSEHARTACWPNVVAWLKETL
Ga0214473_1179196613300031949SoilGFDYFAALVTVRTPYLAVAGGSDRMFAPAAACRQVVERVGAERKALAVEPGLSHRGLVLAPRAREACWPAVAAWLKETLA
Ga0307471_10215409323300032180Hardwood Forest SoilLAALSTPYLGVAGAADRIFAPPAACQQVVDRIGAARKKLVIEAGLSHRGLVLDPRARESCWPNIVAWLKETL
Ga0364943_0116080_708_9443300034354SedimentFDYFAGLAALSIPYLGVAGASDRIFAPPAACRQVVERVGTARKALAIEPGLSHRGLVLSPRARASCWPNIVRWLKEIL


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.