NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104900

Metagenome Family F104900

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104900
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 69 residues
Representative Sequence AARFDVIAATNAAAAIQDERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG
Number of Associated Samples 88
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score0.54

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(32.000 % of family members)
Environment Ontology (ENVO) Unclassified
(38.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(45.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 43.75%    β-sheet: 0.00%    Coil/Unstructured: 56.25%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.54
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF08487VIT 52.00
PF13768VWA_3 17.00
PF13519VWA_2 5.00
PF07883Cupin_2 3.00
PF13490zf-HC2 2.00
PF04542Sigma70_r2 1.00
PF00753Lactamase_B 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG0568DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)Transcription [K] 1.00
COG1191DNA-directed RNA polymerase specialized sigma subunitTranscription [K] 1.00
COG1595DNA-directed RNA polymerase specialized sigma subunit, sigma24 familyTranscription [K] 1.00
COG4941Predicted RNA polymerase sigma factor, contains C-terminal TPR domainTranscription [K] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300007265|Ga0099794_10129578All Organisms → cellular organisms → Bacteria1273Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil32.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil21.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment4.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil4.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere4.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere3.00%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil2.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere2.00%
Natural And Restored WetlandsEnvironmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.00%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere1.00%
Groundwater SandEnvironmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand1.00%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.00%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000881Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soilEnvironmentalOpen in IMG/M
3300002557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002917Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cmEnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004479Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAsEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005544Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaGEnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005844Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2Host-AssociatedOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300007076Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4Host-AssociatedOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010362Tropical forest soil microbial communities from Panama - MetaG Plot_22EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300011435Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT660_2EnvironmentalOpen in IMG/M
3300012035Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012976Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaGEnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017659Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300024224Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK14EnvironmentalOpen in IMG/M
3300026089Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026307Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 (SPAdes)EnvironmentalOpen in IMG/M
3300026323Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026327Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300027395Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M2 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027490Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028072Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK16EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031226Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_SEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10215J12807_100107113300000881SoilVIAATNPVVAIQDEREFLKWIAEHQTSFPDAYRTIKEANLGLVELSDADAEVLESGPNQCAVG*
JGI25381J37097_102217523300002557Grasslands SoilNVAAAIQDERTFLQWINEHTTSFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK*
JGI25384J37096_1007153133300002561Grasslands SoilRFDVVAATNVAAAIQDETAFLHWVADHAPAAPDAYRTIKLANLGLLELSDADAEVLESGPNQCAVPGAA*
JGI25384J37096_1015572813300002561Grasslands SoilVIAATNGAAAIQDERAFLRWITEHTTSFPDAYRAIKETNLGLADLSDSEAEMLESGPNQCAVK*
JGI25382J43887_1021258713300002908Grasslands SoilAHYAGEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVV*
JGI25382J43887_1025406513300002908Grasslands SoilARFDVVAATNEVAAIQDRATFLRWIEDHTTTPPDSYRTIKLANLGLLELSEVDAELVESGPNQCAVG*
JGI25616J43925_1028518213300002917Grasslands SoilIQDQQVFLKWIADHATNFPDGYRTIKETNLGLVGLSDADAEILESGPNQCAVI*
Ga0055498_1002078923300004058Natural And Restored WetlandsIAGTNPVVAIQDERAFLKWITDHQTSFPDAYRTIKEINLGLVDVSDADIEVLESGPNQCAIG*
Ga0062595_10014194213300004479SoilEDERRADRVIAARFDVITATNPPAAIQDERVFLQWIAEHQTSFPDAYRTIKEANLGLVDLSDADAEVIESGPNQCAVG*
Ga0066674_1019751313300005166SoilAARFDVVAATNVAAAIQDEAAFLRWVADHTPVAPDAYRTIKLANLGLVALSDADAEVLESGPNQCAVPGAA*
Ga0066677_1027646113300005171SoilASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWINAHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK*
Ga0066683_1019796523300005172SoilDRAVAARFDVISATNTAAAIQDERVFLKWIADHAMTSPDAYRMIKEANLGLVQLSDEDAEILESGPNQCAVM*
Ga0066676_1092240513300005186SoilIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVV*
Ga0070708_10005351613300005445Corn, Switchgrass And Miscanthus RhizosphereHYASEGERRADRAVAARFDVIAATNAAAVIQDEGVFLKWIADRMTPFPEAYRTIKEANLGLVDPSDSDTEILESGANQCAIG*
Ga0066689_1007371313300005447SoilAHYASEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAETLESGPNQCAVV*
Ga0066681_1006732413300005451SoilDVVAATNVAAAIQDKTAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA*
Ga0068867_10037650923300005459Miscanthus RhizosphereRFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV*
Ga0070706_10023085723300005467Corn, Switchgrass And Miscanthus RhizosphereDRAVAARFDVISATNSAAAIQDERVFLKWIADHATAFPDAYRTIKEANLGLTQLSDADAEVVESGPNQCAIV*
Ga0070699_10020539113300005518Corn, Switchgrass And Miscanthus RhizosphereIQDEGVFLKWIADRVTPFPEAYRTIKEANLGLVDPSDSDTEILESGANQCAIG*
Ga0070686_10104430113300005544Switchgrass RhizosphereATNPVVAIQDEREFLRWIADHQTSFPDAYRTIKEANLGLVELSDADAEVLESGPNQCAVG
Ga0066695_1002482513300005553SoilAIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG*
Ga0066707_10002081103300005556SoilATNAAAVIQDERVFLNWIADRATPFPEAYRTIKEANLGLVDTSDSDAEALESGPNQCAIG
Ga0066698_1026566213300005558SoilAARFDVIAATNAAAAIQDERTFLKWIEDHASVFPDAYRMIKETNLGLMDISDADAEVLESGPNQCAVR*
Ga0066700_1041694313300005559SoilHYASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWIKEHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVQ*
Ga0066670_1010083323300005560SoilIAATNVAAAIQDERTFLQWINEHTTSFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVK*
Ga0066699_1018748623300005561SoilDRAVAARFDVISATNTAAAIQDERIFLQWIADHATNFPDAYRTIKEANLGLVELTDADAEVLESGPNQCAIG*
Ga0068859_10069408223300005617Switchgrass RhizosphereHYSSETERRADRAVAARFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV*
Ga0068862_10219386323300005844Switchgrass RhizosphereETERRADRAVAARFDVIAATNPVVAIQDEREFLKWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV*
Ga0066656_1035720713300006034SoilDERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG*
Ga0066653_1010444533300006791SoilSGRRADRSVAARFDVVAATNVAATIQDETAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA*
Ga0075425_10078102723300006854Populus RhizosphereVISATNSAAAIQDERVFLKWIADHATTFPDAYRTIKEANLGLAQVSDADAEVVESGPNQCAIG*
Ga0075425_10279139223300006854Populus RhizosphereAAAIQDQHVFLKWIADHATTFPDAYRTIKETNLGLAQISDEDAELLESGPNQCAVI*
Ga0075436_10086847623300006914Populus RhizospherePVARVQDERQFLRWIAEHVTPFPDAYRTIKEANLGLVTLSEADAEIVESGPNQCAIA*
Ga0075435_10181333113300007076Populus RhizosphereAGESERRADRAIAARFDVISATNTAAAIQDQHVFLKWIADHATSFPEAYRTIKETNLGLVELSDADAEQLESGPNQCAVI*
Ga0099793_1001938213300007258Vadose Zone SoilRFDVISATNTAAAIQDERVFLKWIADHTTNFPDAYRTIKETNLGLVDVAEADAEILESGPNQCAVM*
Ga0099793_1044228713300007258Vadose Zone SoilNTAAAIQDERVFLQWIADHATNFPDAYRTIKDTNLGLVDVSEADAEILESGPNQCAVA*
Ga0099794_1012957813300007265Vadose Zone SoilESERRADRSVAARFDVVAATNVAAAIQDETAFLRWVADHAPVAPDAYRTIKLANLGLLEISDADAEVLEAGPNQCAVPGAA*
Ga0066710_10013534443300009012Grasslands SoilRFDVISATNGAAAIQDERQFLQWVKDHVTTFPDAYRTIKEANLGLVRLTEPDIDILESGPNQCAVG
Ga0066710_10389240213300009012Grasslands SoilHYESEAERRADRAVAARFDEVVTTNAAAAIQDEREFLNWVADHFGPIPDAYRTIKEANLGLLDLSDSDAGMLESGPNQCAVR
Ga0099829_1074884713300009038Vadose Zone SoilIQDERVFLKWIADRQTAFPDAYRTIKEANLGLVDLSDADAELAESGPNQCAVA*
Ga0099827_1147998313300009090Vadose Zone SoilSESERRADRAIAARFDVISATNAAAAIQDERVFLKWIAEHATNFPDAYRTIKETNLGLVQLSDPDAEMLESGPNQCAVI*
Ga0066709_10069161313300009137Grasslands SoilESERRADRAIAARFDVISATNTAAPIQDQQVFLKWIADHATNFPDAYRTIKETNLGLVGLSDADADILESGPNQCAVI*
Ga0099792_1025281313300009143Vadose Zone SoilDVIAATNGAAAIQDERAFLRWIADHQTSFPDAYKTIKEANLGLVDVSDPDAEILESGPNQCAV*
Ga0126382_1246627213300010047Tropical Forest SoilDRAVASRFDVILATNHVAAIQEEREFLRWIAEHATTFPDAYRTIKEANLGLVTLSDADAEIVESGPNQCAIA*
Ga0126377_1069444523300010362Tropical Forest SoilSGEGERRADRAVAARFDVVAATNAPVAIQDEGTFLQWIADHTTTFPDAYRTIKEVNLGLTEVPDADAEILESGPNQCAIV*
Ga0134128_1183250113300010373Terrestrial SoilNEAARIQGEGDFLQWVADHQTTPPDAYRTIKLANLGLVDLTDSDAATLEAGPNQCAVK*
Ga0137426_102569523300011435SoilDRAVAARFDVIAATNPIASIQDELQFLTWIADHQASFPDAYRTIKEANLGLVELSEADAEVLESGPNQCAVG*
Ga0137445_106364713300012035SoilAHRQRRATAAVAARAVAARFDVIAATNPVASIQDEHQFLKSIADHQASFPDAYRTIKEANLGLAELSDADAEVLESGPNQCAVA*
Ga0137364_1131973113300012198Vadose Zone SoilAARFDVVAATNAAAAIQDEAAFLRWVAEHTSVAPEAYRTIKLANLGLVTLSDADAEVLESGPNQCAVPGTV*
Ga0137382_1017944823300012200Vadose Zone SoilVILATNPVAAIQDERQFLQWIGDHATAFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG*
Ga0137399_1056100123300012203Vadose Zone SoilVAAAIQDETAFLRWVADHAPVAPDAYRTIKLANLGLLEISDADAEVLESGPNQCAVPGAA
Ga0137374_1022617223300012204Vadose Zone SoilAATNAAAAIQDEREFLGWVADHQMTPPAAYRTIKQANLGLVDVAESDADLLESGPNQCAVR*
Ga0137380_1010773623300012206Vadose Zone SoilAHYARELERRADRAIAARFDVIAATNAAAAIQDERAFLQWIAEHTTSFPDAYRTIKETNLALADLSDSEAEMLESGPNQCAVK*
Ga0137381_1062083223300012207Vadose Zone SoilRFDVIAATNGAAAIQDERAFLGWITEHTATFPDAYRTIKETNLGLADLSDSEAEMLESGPNQCAVK*
Ga0137376_1023010713300012208Vadose Zone SoilSATNTAAAIQDERVFLKWIADHATTFPDAYRTIKEANLGLVQLSDEDAEILESGPNQCAIF*
Ga0137376_1072847413300012208Vadose Zone SoilLATNPVAAIQDERQFLQWIGDHATAFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG*
Ga0137376_1107834123300012208Vadose Zone SoilAIHDERLFLQWIADHHTTPPAAYRTIKLANLGLIEVSDADAEVLESGPNQCAIG*
Ga0137378_1033251423300012210Vadose Zone SoilEGERRADRAVAARFDVITATNAAAAIQDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDSDAEVLESGPNQCAIA*
Ga0137369_1049681213300012355Vadose Zone SoilRAIAARFDVITATNAAAAIQDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDADAEVLESGPNQCAIA*
Ga0137384_1013513423300012357Vadose Zone SoilHYASEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG*
Ga0137384_1050823623300012357Vadose Zone SoilAIAARFDVIAATNGAAAIQDERVFLQWLVDHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG*
Ga0137385_1106701913300012359Vadose Zone SoilSEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLVDHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG*
Ga0137373_1115954413300012532Vadose Zone SoilEVERRADRAVAARFDVISATNAAAAIQDERAFLRWVADHQTPPPDAYRSIKLANLGLVAVPDSDAEVLESGPSQCAVG*
Ga0137358_1032159923300012582Vadose Zone SoilDVISATNTAAAIQDERVFLEWIADHQSSFPDAYRTVKEANLGLVELSDPDAETLESGANQCAVM*
Ga0137398_1115228423300012683Vadose Zone SoilISATNAAAAIQDERVFLKWVADHATTFPDAYRTIKEANLGLAQLSDADAEVVESGPNQCAIV*
Ga0137396_1105935823300012918Vadose Zone SoilDRAIAALFDVISATNEATAIQDERVFLRWIADHSTTFPDAYRTIKEANLGLVDVADPDAEILESGPNLCAVM*
Ga0137359_1179680913300012923Vadose Zone SoilFDVILATSPAAAIQDERQFLQWIGDHGTTFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG*
Ga0137419_1178216323300012925Vadose Zone SoilWAVAARFDVIAATNAAAAIQDERQFLKWIADHSSTFPDAYRTIKEANLGLTDLSEADAEVLESGPNQCAIA*
Ga0137419_1193667423300012925Vadose Zone SoilDVISATNTAAAIQDERVFLKWIADHATDFPDAYRTIKEANLGLVDVAEADAEILESGPNQCAIV*
Ga0137416_1031022213300012927Vadose Zone SoilDEPAFLRWLSEHTSSFPDAYRTIKATNLGLADLSDSEAETLESGPNQCAVK*
Ga0137416_1074283913300012927Vadose Zone SoilRADRAVAARFDVISATNTAAAIQDERVFLQWIADHATAFPEAYRTIKESNLGFVELSDADAEILESGPNQCAVV*
Ga0137416_1193274313300012927Vadose Zone SoilVILATNPAAAIQDERQFLQWIGDHATSLPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG*
Ga0134076_1014947813300012976Grasslands SoilYASEGERRADRAIAARFDVIAATNGAAAIQDERVFLQWLADHVTSFPDAYRTIKEVNLGLLEVSDADAEILEVGPNQCAIG*
Ga0134089_1008323723300015358Grasslands SoilDVITATNAAAAIQDERVFLQWIADRQTGFPDAYRTIKETNLGLADLSDSEAEMLESGPNQCTVK*
Ga0134083_1056948723300017659Grasslands SoilAARFDVIAATNAAAAIQDERVFLRWIADHATSFPDAYRTIKETNLGLAEVSNADAEVLESGPNQCAVG
Ga0184608_1010191623300018028Groundwater SedimentLATNPVAAIQDKRQFLQWIGDHATTFPDAYKTIKEANLGLVTVSDLDAEILESGPNQCAV
Ga0184634_1029199123300018031Groundwater SedimentGEGERRADRAIAARFDVIAATNAAASIQDEAVFLKWIADHQTSFPEAYRTIKEANLGLADLSDADAEVLESGPNQCAVG
Ga0184633_1015873013300018077Groundwater SedimentFDVIAATNAAASIQDEAAFLQWIADHRTTFPEAYRRIKEANLGLVDVSEADAEVLESGPNQCAVK
Ga0184627_1067374723300018079Groundwater SedimentYAGEGERRADRAIAARFDVIAATNAAASIQDEGVFLKWIADHPTSFPDAYRTIKETNLGLVDLSDADAEVLESGPNQCAVG
Ga0066667_1006803813300018433Grasslands SoilQDERVFLQWIADHQTSFPDAYRTIKEANLGLVQISDADAEVLESGPNQCAIA
Ga0247673_106619223300024224SoilERRADRAIAARFDVISATNTAAAIQDQQVFLKWIADHATTFPDAYRTIKETNLGLVQISDEDAELLESGPNQCAVV
Ga0207648_1204313423300026089Miscanthus RhizosphereAARFDVIAATNPVVAIQDEREFLTWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV
Ga0209469_105155523300026307SoilASESERRADRAIAARFDVILATNPVAAIQDERQFLQWIGDHATTFPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG
Ga0209472_104273833300026323SoilDVVAATNVAAAIQDKTAFLRWVADHAPAAPDAYRSIKLANLGLLELSDADAEVLESGPNQCAVPGAA
Ga0209470_136358623300026324SoilAVAARFDVISATNTAAAIQDESVFLKWIADHQSNFPDAYRTIKEANLGLVELSDADAEILESGANQCAVV
Ga0209266_117999913300026327SoilRRADRAVAARFDVISATNTAAAIQDERVFLKWIADHAMTSPDAYRMIKEANLGLVQLSDEDAEILESGPNQCAVM
Ga0209266_120888523300026327SoilELERRADRALAARFDVISATNAAAAIQDERLFLQWIADHQATPPDAYRTIKLANLGLVDVSDADAEALESGPNQCAVA
Ga0209267_129013423300026331SoilFDVIAATNVAAAIQDERTFLQWIKEHTTTFPDAYRTIKEVNLGLADLSDAEAEMLESGPNQCAVQ
Ga0209805_143587313300026542SoilHYASESERRADRAIAARFDVIAATNVAAAIQDERTFLQWINEHTTAFPDAYRTIKEVNLGLADLSDAESEMLESGPNQCAVK
Ga0209996_107263823300027395Arabidopsis Thaliana RhizosphereDRVVAARFDVIAATNAVVAIQDERTFLKWIVDHETSFPDAYRTIKEANLGLVELSDADVEILESGPNQCAIA
Ga0209899_111121723300027490Groundwater SandNPAAAIQDERAFLQWIVDHQTSFPDTYRRIKEANLGLVDVPDADAEVLESGPNQCAVR
Ga0209701_1049189013300027862Vadose Zone SoilYVGVSSYSANRTNEAAAIQDERAFLQWIAAHTPIFPDSYRTIKTANLGLVDVGEADAEILEFGPNQCAIR
Ga0247675_102628613300028072SoilTNPVVAIQDEREFLKWIADHQTSFPEAYRTIKEANLGLVELSDADAEVLESGPNQCAV
Ga0137415_1131904913300028536Vadose Zone SoilVILATNPAAAIQDERQFLQWIGDHATSLPDAYKTIKEANLGLVDVSDLDAELLESGPNQCAVG
Ga0307302_1006766023300028814SoilYAGEDERRADRAVAARFDVISATNSAAAIQDERVFLRWVADHATTFPDAYRTIKEANLGLAQLSDADAEVVESGPNQCAIV
Ga0307278_1048792713300028878SoilAHYASEAERRADRAVAARFDVISATNTAAAIQDERVFLKWIADHQSSFPDAYRTIKEANLGLVELSEPDAETLESGANQCAVM
Ga0307495_1007178613300031199SoilAHYASESERRADRAVAARFDVIAATNPVASIQDQQQFVTWIADHQARFPDAYRTIKEANLGLVELSDPDADVLESGPNQCAVA
Ga0307497_1018715913300031226SoilYASESERRADRAVASRFDVILATNPVAAIQDEREFLRWIANRATTFPDAYRTIKEANLGLVTLSEPDVEILESGPNQCAVA
Ga0307471_10288547813300032180Hardwood Forest SoilVVAATNTAAAIQDERTFLKWIADHTSPFPDAYRTIKEANLGLVDVSDADAEILESGPNQCAIV
Ga0364934_0345316_355_5643300034178SedimentIAARFDVIAATNAAAAIQDERTFLGWIADHTTTFPDAYRTIKEANLGLVDPSDADIEVLESGPNQCAVG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.