NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F104596

Metagenome Family F104596

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F104596
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 70 residues
Representative Sequence MPSKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Number of Associated Samples 94
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 50.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 85
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(30.000 % of family members)
Environment Ontology (ENVO) Unclassified
(38.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(36.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 27.03%    β-sheet: 0.00%    Coil/Unstructured: 72.97%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF13407Peripla_BP_4 19.00
PF01292Ni_hydr_CYTB 8.00
PF12697Abhydrolase_6 1.00
PF04909Amidohydro_2 1.00
PF03466LysR_substrate 1.00
PF13419HAD_2 1.00
PF00156Pribosyltran 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG1969Ni,Fe-hydrogenase I cytochrome b subunitEnergy production and conversion [C] 8.00
COG2864Cytochrome b subunit of formate dehydrogenaseEnergy production and conversion [C] 8.00
COG3038Cytochrome b561Energy production and conversion [C] 8.00
COG3658Cytochrome b subunit of Ni2+-dependent hydrogenaseEnergy production and conversion [C] 8.00
COG4117Thiosulfate reductase cytochrome b subunitInorganic ion transport and metabolism [P] 8.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001661|JGI12053J15887_10009735All Organisms → cellular organisms → Bacteria5233Open in IMG/M
3300010047|Ga0126382_11426740Not Available633Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil30.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil12.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment10.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere3.00%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment2.00%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil2.00%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil2.00%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere2.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere2.00%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere2.00%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere2.00%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil1.00%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.00%
Populus EndosphereHost-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere1.00%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.00%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere1.00%
SimulatedEngineered → Modeled → Simulated Communities (Sequence Read Mixture) → Unclassified → Unclassified → Simulated1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2166559005Simulated microbial communities from Lyon, FranceEngineeredOpen in IMG/M
2170459002Grass soil microbial communities from Rothamsted Park, UK - March 2009 direct MP BIO 1O1 lysis 0-21 cmEnvironmentalOpen in IMG/M
2170459010Grass soil microbial communities from Rothamsted Park, UK - December 2009 direct MP BIO1O1 lysis 0-9cm (no DNA from 10 to 21cm!!!)EnvironmentalOpen in IMG/M
2228664021Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001661Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)EnvironmentalOpen in IMG/M
3300002568Grasslands soil microbial communities from Hopland, California, USA - 2EnvironmentalOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005434Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005436Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaGEnvironmentalOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005543Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M1-3 metaGHost-AssociatedOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006028Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-3 metaGEnvironmentalOpen in IMG/M
3300006038Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5Host-AssociatedOpen in IMG/M
3300006880Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3Host-AssociatedOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300009162Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2Host-AssociatedOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010159Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012911Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S088-202R-2EnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012961Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MGEnvironmentalOpen in IMG/M
3300012984Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300012989Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_237_MGEnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015372Soil combined assemblyHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300017792Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaGHost-AssociatedOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018067Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_coexEnvironmentalOpen in IMG/M
3300018072Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2EnvironmentalOpen in IMG/M
3300018073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_5_b1EnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018083Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_b1EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3c2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c2EnvironmentalOpen in IMG/M
3300019996Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3a2EnvironmentalOpen in IMG/M
3300020000Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3a1EnvironmentalOpen in IMG/M
3300020005Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3m2EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U2a2EnvironmentalOpen in IMG/M
3300021418Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3s2EnvironmentalOpen in IMG/M
3300021968Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3c1EnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025898Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027181Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028709Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_118EnvironmentalOpen in IMG/M
3300028711Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_150EnvironmentalOpen in IMG/M
3300028720Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_357EnvironmentalOpen in IMG/M
3300028784Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121EnvironmentalOpen in IMG/M
3300028802Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_SEnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028875Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_143EnvironmentalOpen in IMG/M
3300028876Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_140EnvironmentalOpen in IMG/M
3300028884Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_195EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300031170Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 12_SEnvironmentalOpen in IMG/M
3300031198Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 14_SEnvironmentalOpen in IMG/M
3300031199Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_SEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_024957202088090014SoilMPSKTALAALLVFGSASAVLADTTNRTDHQGSVVRRAVAVTMSNSRHRAVDHAVKPFTAEEKGWFDRASRVF
cont_0170.000007902166559005SimulatedMPSKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
E1_094325402170459002Grass SoilMPSKTTTALAILLVFGSGSAALAGTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEE
F62_068870402170459010Grass SoilMPCKTTTALAILLVFGFGSAALADTNTRTGHHASVARRAVPVTMPNRRSGTVEHAVKPFTAEEKGWFDRASRVF
ICCgaii200_068164412228664021SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNHAQ
ICCgaii200_068229312228664021SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPSTAEEKAA
ICChiseqgaiiDRAFT_056571023300000033SoilMPSKTKTALAALLVFGSASAVLADTTNRTXHXGSVVRRAVAVTMSNSRHRAVDHAVKPFT
INPhiseqgaiiFebDRAFT_10507402423300000364SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNRAQK
JGI11643J11755_1175549913300000787SoilMPSKTKTALAALLVFGSASAVLADTTNRTDHQGSVVRRAVAVTMSNSRHRAVDHAVKPFTAEEK
JGI1027J12803_10032454113300000955SoilMPSKTALAALLVFGSASAVLADTTNRTDHQGSVVRRAVAVTMSNSRHRAVDHAVKPFTAEEKGWFDRASRVF*
JGI12053J15887_10009735103300001661Forest SoilMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKRFTAEEKGWFDRASRVF*
C688J35102_12034173623300002568SoilMPCKTTAALAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTIPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0062592_10048606433300004480SoilMPRKTTTALAALLFFGSASAALANTNYRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPS
Ga0070668_10141687213300005347Switchgrass RhizosphereMPSKTTTALAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0070709_1014879923300005434Corn, Switchgrass And Miscanthus RhizosphereMPSKTKTALAALLVFGSASAVLADTTNRTDHHGSVVRRAVSVTMSNSRNTAVDHAVKPFTAE
Ga0070714_10037311313300005435Agricultural SoilMPSKTKTALAALLVFGSASAVLADTTNRTDHHGSVVRRAVSVTMSNSRNRAVDHAVKPFTAEEKGWF
Ga0070714_10169018413300005435Agricultural SoilMPSKTTTTLAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPSTAEEKAGFG
Ga0070713_10011942033300005436Corn, Switchgrass And Miscanthus RhizosphereMPSKTKTALAALLVFGSASAVLADTTNRTDHRVQLSDVRLSNSRSRAVDHAVKPFTAEQKGWFDRASRVF*
Ga0070711_10021630423300005439Corn, Switchgrass And Miscanthus RhizosphereMPSKTKTALAALLVFGSASAVLADTTNRTDHHGSVVRRAVSVTMSNSRNRAVDHAVKPFTAEQKGWFDRASRVF*
Ga0070672_10094089313300005543Miscanthus RhizosphereMPSKTTTTLAALLFFGSASAALANTNYRNDHQGSVVGRAVSVTMSSGGAGAVNYEQKPS
Ga0068854_10118062723300005578Corn RhizosphereMPRKTTTALAALLFFGSASAALANTNYRNDHQGSVVGRAVSVTMSSGGAGAVN
Ga0068861_10185565023300005719Switchgrass RhizosphereMPCKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY*
Ga0066903_10150374033300005764Tropical Forest SoilMLSKTKMALTTLLVLGSASAVLADSNDRTDHHGSTARRAVSVTMSNDHIGAVNHAVKPFTAEEKAWFAGP
Ga0070717_1042610913300006028Corn, Switchgrass And Miscanthus RhizosphereMPSKTKTALAALLVFGSASAVLADTTNRTDHRVQLSDVRLSNSRNRAVDHAVKPFTAEEKGWFDRASRVF*
Ga0075365_1024279423300006038Populus EndosphereMPSKTTTALAVLLVFGSGSAALADTNNRTGHHGSVARRAVPVTMSNGRNGAVNHAVKPFTAEEKAWFDRASKVF*
Ga0075429_10153168523300006880Populus RhizosphereMPSKTKTALAALLVFGSASAALADTTHRTDHYGSVVRRAVSVTMSNSRNRAVDHAVKPFTAEEKGWFDRASRVF*
Ga0068865_10182040223300006881Miscanthus RhizosphereMPSKTTTALAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAE
Ga0099795_1002773813300007788Vadose Zone SoilMPSKTTTALAILLVFGSGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0099792_1013544813300009143Vadose Zone SoilMPCKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMLNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0075423_1076417523300009162Populus RhizosphereMPSKTKTALAALLVFGSASAVLADTKNRTDHHGSVVRRAVSVTMSNSRNRAVDHAVNPFSAEEKGWTSAC*
Ga0126382_1142674013300010047Tropical Forest SoilMALATLLVLGSASAVLADSNDRTDHHGSAARRAVSLTMFNDRVGAVNHAVKPFAADKKAWFAGPQLAQ
Ga0099796_1003674923300010159Vadose Zone SoilMPSKTKTALTILLVFGFGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0099796_1042138423300010159Vadose Zone SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRAVWVTMSHGRNGAVNHDVRPFTAEEKAWFAAPQPALVQ
Ga0134122_1115859813300010400Terrestrial SoilMPCKTTTALAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFT
Ga0137382_1120840713300012200Vadose Zone SoilMPSKTKTALAALLVFGSASAVLADTTNRGAHHGAVVRRAVSVTMSNSRNGAVDHAVKPFTAEEKAWFDRASRVF*
Ga0137398_1007260423300012683Vadose Zone SoilMPSKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0137397_1012215923300012685Vadose Zone SoilMPSKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF*
Ga0157301_1040036313300012911SoilMPSKTTTFLAALLVFGSASAVLADTNNRTYHHYSSDARRAGSVAMPTGRPGAVNHA
Ga0137396_1118916223300012918Vadose Zone SoilMPSKTTTALAILLVFGSGSAALAQTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0137413_1005121613300012924Vadose Zone SoilMPSKTTTALAILLVFGSGSAALADTNNRTGHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0137419_1051060123300012925Vadose Zone SoilMPSKTKTALTILLVFGFGSAALADTNNRTGHHASVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0137410_1010361133300012944Vadose Zone SoilMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0164302_1045993413300012961SoilTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0164309_1024675723300012984SoilMPCKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0164307_1158940713300012987SoilLAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0164305_1123159713300012989SoilMPSKTTTALAILLVFGFGSAALADTNNRIGHHASVARRAVPVTMPNGRNGTVEHAVKPFT
Ga0163162_1053271023300013306Switchgrass RhizosphereMPSKTTTALAILLVFGFGSAALADTNNRIGHHASVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF*
Ga0137412_1008023043300015242Vadose Zone SoilMPSKTTTALAILLVFGSGSAALADTNNRTGHHGSVARRAVPVTMSNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0132258_1086525613300015371Arabidopsis RhizosphereMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY*
Ga0132256_10055117813300015372Arabidopsis RhizosphereMPSKTTTALAALLFFGSASAALANTNNRNDHHGSFIGRAESVTMSSGRAGAVNHAPKPFTAEEKAWFATPQLALAE
Ga0132257_10030526033300015373Arabidopsis RhizosphereMPSKTTTALAVLLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY*
Ga0132255_10169498823300015374Arabidopsis RhizosphereMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVLVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF*
Ga0163161_1059031723300017792Switchgrass RhizosphereMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNYEQKPSTAEEKAGFGTP
Ga0184610_131608813300017997Groundwater SedimentMPSKTTTALAILLVFGSGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0184604_1000634523300018000Groundwater SedimentMPSKTTTALAILLVFGSGSAALADTNNRTGHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0184621_1000351123300018054Groundwater SedimentMPSKTTTALAILLVFGSGSAALAQTNNRTGHHGSVARRAVPVTMSNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0184619_1051059513300018061Groundwater SedimentMPSKTTTALAILLVFGSGSAALADTNNRTSQHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY
Ga0184611_105644613300018067Groundwater SedimentMPSKTTTALAALLFFGSASAAWANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPSTAEEKAGF
Ga0184611_114910423300018067Groundwater SedimentMPSKTTTALAILLVFGFGSAALADTNNRIGHHASVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0184635_1010503023300018072Groundwater SedimentMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPATMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0184624_1007681313300018073Groundwater SedimentMPSKTTTALAILLVFGFGSAALADTNNRTGHHASVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0184633_1017317223300018077Groundwater SedimentMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0184628_1031415923300018083Groundwater SedimentMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGARAVNH
Ga0193707_119900513300019881SoilFSKTTTALAILLVFGSGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0193727_102544723300019886SoilMPRKTTTALAILLVFGFGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0193693_102608023300019996SoilMPSKTTTALAILLVFGSGSAALADTNNRTSHHDPVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0193692_104791823300020000SoilMPSKTTTALAILLVFGFGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY
Ga0193697_107478813300020005SoilMPSKTTTALAILLVFGFGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0210381_1012557523300021078Groundwater SedimentMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGPGAVNHAQKPSTAEEKAGFGTP
Ga0210382_1035079913300021080Groundwater SedimentMPSKTTTALAILLVFGSGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY
Ga0193719_1008683823300021344SoilMPSKTTTALAILLVFGSGSAALANTNNRTSHQGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0193695_107567013300021418SoilMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0193698_101223623300021968SoilMPCKTTTALAILLVFGSGSAALADTNNRTSHHDPVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY
Ga0224452_101596323300022534Groundwater SedimentMPSKTTTALAILLVFGFGSAALAQTNNRTSHDGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0207692_1017939523300025898Corn, Switchgrass And Miscanthus RhizosphereMPSKTKTALAALLVFGSASAVLADTTNRTDHRVQLSDVRLSNSRSRAVDHAVKPFTAEQKGWFDRASRVF
Ga0207668_1058660033300025972Switchgrass RhizosphereMPSKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0207640_1098216223300025981Corn RhizosphereMPRKTTTALAALLFFGSASAALANTNYRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPSTAEEKAGFGT
Ga0207675_10047834033300026118Switchgrass RhizosphereMPCKTTTAFAILLVFGSGSAALANTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY
Ga0208997_105295323300027181Forest SoilMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKRFTAEEKGWFDRASRVF
Ga0307279_1002582213300028709SoilMPSKTTTALAILLVFGCGSAALADTNNRTRHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0307293_1022531223300028711SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRAVSVTMSHGRNGAVNHDVRPFTAEEKAWFAAP
Ga0307293_1029456413300028711SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGPGAVNHAQKPST
Ga0307317_1020145833300028720SoilMPSRTKTALAALLFFGSASAVLADTNNRTDHHGSVARRAVSVTMSEGRTGAINHAVKPFTAE
Ga0307282_1027764623300028784SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRAVSVTISHDRNGAVNHDVRPFTAE
Ga0307503_1056657023300028802SoilMPSKTTTALAILLVFGFGSAALADTNNRTGHHASVARRAEPVTMPNGRNGTVEHAVKPFTAEEKGWFDRASRVF
Ga0307292_1012349423300028811SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRAVSVTMSHGRNGAVNHDVRPFTAEEKAWFAAPQPA
Ga0307302_1033775013300028814SoilMPSRTKTALAALLFFGSTSAILADTNNRTDHYGSVARRAVSVIMSNDRTGAVNHAVRPFIAEESAWFAALQPALV
Ga0307312_1031830813300028828SoilMPSRTKTALAALLFVGSTSAILADTNNRTDHYGSVARRAVSVIMSNDRTGAVNHAVRPFIAEESAWFAALQP
Ga0307289_1002699413300028875SoilMPSKTTMALVILLVFGSDSAALADTNNRTGHHGSVARRAVSVTMSNDRTGAVNHAVEPFTAEQNAWFAAPQSTLVQSAPF
Ga0307289_1007366033300028875SoilPSKTTTALAILLVFGFGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0307286_1002149643300028876SoilKTTTALAILLVFGSGSAALADTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0307308_1039531213300028884SoilMPSKTTMTLAALLFFGSASAALANTNNRNDHHGSVIGRAESVTMSSGRAGAVNHALQPF
Ga0307304_1009275023300028885SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRAVSVTMSHGRNGAVNHDVRPFTAEEKAWFAA
Ga0307498_1002405213300031170SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGPGAVNDAQKPSTAEEKAGFGTPQ
Ga0307500_1020703913300031198SoilMPSKTTTALAILLVFGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFERASRVF
Ga0307495_1019247513300031199SoilMPSKTTTTLAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNYEQKPSTGEEKA
Ga0170820_1444280423300031446Forest SoilMPNRTKSALAALLFFGSASAVLADTNNRTDHYGSVGRRPVSVTMSHGRNGAVNHDVRPFTAEEK
Ga0310887_1023103813300031547SoilMPSKTTTALAALLFFGSASAALANTNNRNDHQGSVVGRAVSVTMSSGGAGAVNHAQKPSTAEEKAG
Ga0307470_1016265823300032174Hardwood Forest SoilMPSKTTTALAILLVCGSGSAALAQTNNRTSHHGSVARRAVPVTMPNGRNGTVDHAVKPFTAEEKGWFDRASRVF
Ga0307472_10015385423300032205Hardwood Forest SoilMPCKTTTALAILLVFGSGSAALANTNNRTSHQGSVARRAVPVAMPNGRNGTVDHAVKPFTAEEKGWFDRASRVY


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.