NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F099046

Metagenome / Metatranscriptome Family F099046

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F099046
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 43 residues
Representative Sequence TIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR
Number of Associated Samples 89
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.97 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 83
AlphaFold2 3D model prediction Yes
3D model pTM-score0.46

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.029 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(28.155 % of family members)
Environment Ontology (ENVO) Unclassified
(37.864 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(55.340 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 38.03%    β-sheet: 0.00%    Coil/Unstructured: 61.97%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.46
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 103 Family Scaffolds
PF01434Peptidase_M41 63.11
PF02355SecD_SecF 8.74
PF07549Sec_GG 1.94
PF00004AAA 0.97
PF13419HAD_2 0.97
PF14690zf-ISL3 0.97
PF03006HlyIII 0.97
PF13518HTH_28 0.97
PF04055Radical_SAM 0.97
PF13683rve_3 0.97
PF02272DHHA1 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 103 Family Scaffolds
COG0465ATP-dependent Zn proteasesPosttranslational modification, protein turnover, chaperones [O] 63.11
COG0341Preprotein translocase subunit SecFIntracellular trafficking, secretion, and vesicular transport [U] 10.68
COG0342Preprotein translocase subunit SecDIntracellular trafficking, secretion, and vesicular transport [U] 10.68
COG1272Predicted membrane channel-forming protein YqfA, hemolysin III familyIntracellular trafficking, secretion, and vesicular transport [U] 0.97


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.03 %
All OrganismsrootAll Organisms0.97 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005177|Ga0066690_10051575All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium2510Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil28.16%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil17.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil5.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.83%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.83%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil3.88%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost3.88%
WastewaterEnvironmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater2.91%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment2.91%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.91%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.94%
Glacier Forefield SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil1.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.94%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.94%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.94%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil0.97%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.97%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.97%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil0.97%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere0.97%
SoilEnvironmental → Terrestrial → Soil → Loam → Unclassified → Soil0.97%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment0.97%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.97%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere0.97%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.97%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001538Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A10-PF 4A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005293Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005536Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaGEnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005840Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M6-2Host-AssociatedOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006904Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3Host-AssociatedOpen in IMG/M
3300006918Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100EnvironmentalOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300009777Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking waterEnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300012003Permafrost microbial communities from Nunavut, Canada - A20_80cm_0.25MEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012358Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013427Permafrost microbial communities from Nunavut, Canada - A15_35cm_18MEnvironmentalOpen in IMG/M
3300013772Permafrost microbial communities from Nunavut, Canada - A10_80_0.25MEnvironmentalOpen in IMG/M
3300014154Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015EnvironmentalOpen in IMG/M
3300015079Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-6b, vegetation/snow interface)EnvironmentalOpen in IMG/M
3300015203Arctic soil microbial communities from a glacier forefield, Storglaci?ren, Tarfala, Sweden (Sample st-3c, vegetated patch on medial moraine)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300021046Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depthEnvironmentalOpen in IMG/M
3300022195Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025167Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025173Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025936Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3H metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026540Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028719Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_182EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
A10PFW1_1166030523300001538PermafrostVSHPGTIVPAERAAIEALLGAFDALRYDRDAGPVRFVTPAQLAKAYRP*
JGI25382J43887_1014033213300002908Grasslands SoilATERAAIEALFEAFAPVRYDRDAGPVHFVTLAQLAKAYGR*
soilH2_1027096323300003324Sugarcane Root And Bulk SoilATIDATERAAIETLFAAFEPYRWDRQHGPLRFVTLSQLAKAIGP*
Ga0063356_10030246513300004463Arabidopsis Thaliana RhizosphereRAMTLVSHPSTIDATERAAIEKLFQAFEPYRLDRDKGPLRFVTLSQLAKAYGP*
Ga0062592_10021321323300004480SoilGTIDTIERAAIEALFHAFDPYRYDRDSGPLRFVTLAQLAKAYGR*
Ga0066683_1063128713300005172SoilRAVTIVSHPSTIDATERGAIAALFSALAPLRYDLDNGPVRFVTLAQLAQAWR*
Ga0066680_1010006013300005174SoilPGTINAAEGAAINSLFDALATLRYDLDKGPLRFVTLAQLAQAWR*
Ga0066673_1034322113300005175SoilIDATERAAITALFDAFAPLRYDRDAGPVRFVTLAQLAKAWR*
Ga0066690_1005157513300005177SoilGTIDATERAAITELFDALASLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066671_1062920613300005184SoilIDATERGAIEALFTAFDPYRYDRDAGPLRFVTAAELAQAYR*
Ga0066676_1025810523300005186SoilTIDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0066676_1064220313300005186SoilRRAITIVSHPGTIDATERAAITALFEALTPLRYDQDKGPVRFVTLAQLAQAWP*
Ga0066675_1119822813300005187SoilIDASERAAIEALLGAFAPLRYDRDTGPLRFVTLAQLAKAYGL*
Ga0065715_1015823423300005293Miscanthus RhizosphereIDTIERAAIEALFHAFDPYRYDRDAGPLRFVTLAQLAKAYGR*
Ga0070691_1062020713300005341Corn, Switchgrass And Miscanthus RhizosphereTERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYRP*
Ga0066686_1047347613300005446SoilRAITIVSHPGTIDATECAAITALFNAFAPLRYDLDNGPVRFVTLAQLAQAWR*
Ga0070706_10032257413300005467Corn, Switchgrass And Miscanthus RhizosphereTIDATERAAIEALFDAFGPLRYDADRGPVRFVTLAELAEAWR*
Ga0070706_10033175513300005467Corn, Switchgrass And Miscanthus RhizosphereGTIDATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0070697_10067870913300005536Corn, Switchgrass And Miscanthus RhizosphereSHPATIDAIERDAIESLFTAFAPLRYDADAGPVRFVTLAQLAQALK*
Ga0070697_10199916323300005536Corn, Switchgrass And Miscanthus RhizosphereLVSHPGTITATERAAIEALFQAFEPVRYDRDAGPVRFVTLAQLAKAYGR*
Ga0066704_1013930623300005557SoilNERAAITALFDAFTPLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066700_1028114523300005559SoilITIVSHPGTIDATERAAITALFDALAALRYDQDKGPVRFVTLAQLAQAWP*
Ga0066699_1021428433300005561SoilSHPGTIGPAERAAIEALLGAFGPLRYDADAGPVRFVTLAQLATAWGR*
Ga0066699_1036635613300005561SoilAIETLFAAFEPLRYDRGNGPLRFVTLAQLATAYSR*
Ga0066703_1022361813300005568SoilERAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG*
Ga0066691_1040096023300005586SoilERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0066706_1009858013300005598SoilTIDATERAAITELFDALASLRYDLDNGPLRFVTLAQLAQAWR*
Ga0066706_1023372023300005598SoilTIDATERAAITELFDALASLRYDLDNGPLRFITLAQLAQAYPY*
Ga0066706_1143993713300005598SoilTIVASERAAIETLFAAFEPLRYDRGNGPLRFVTLAQLAEAYSR*
Ga0068870_1051706513300005840Miscanthus RhizospherePATVNATERAAIESLFRSFEPYRYDRDGGPLRFVTLAQLAQAFK*
Ga0066653_1059691413300006791SoilAAIEALFLAFDPVRYDRDAGPVRFVTLAQLAKAYGR*
Ga0066659_1011764613300006797SoilTLVSHPGTIVPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP*
Ga0075425_10031028823300006854Populus RhizosphereERGAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0079215_1041346613300006894Agricultural SoilIEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYGP*
Ga0075424_10236020813300006904Populus RhizosphereGTIDATERAAIETLFKAFFPLRYDQDSGPLRFVTLAQLAAAYSR*
Ga0079216_1158370713300006918Agricultural SoilHPSTIDATERAAIEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYGP*
Ga0079218_1082555023300007004Agricultural SoilEKLFQAFEPYRWDRDKGPLRYVTLSQLAKAYSSP*
Ga0066710_10012366953300009012Grasslands SoilERAAIEALVKAFAPLRYDRDVGPLRFVTLAEVAKAYSK
Ga0066710_10064985013300009012Grasslands SoilIVSHPGTIDATERAAIETLLGAFAPVRYDRDTGPLRFVTLAQLAKAYGL
Ga0066710_10128892133300009012Grasslands SoilGAIEALLGAVGPLRYDADAGPVRFVTLAQLATAWAR
Ga0099827_1188906323300009090Vadose Zone SoilIVSHPGTLDATEPAAITALFEALAPLRYDLDRGPVRFVTLAQLAQAWR*
Ga0066709_10220668623300009137Grasslands SoilDATERDAIEALFRAFEAYRYDRDAGALRFVTLAQLAKAYAR*
Ga0105164_1012176113300009777WastewaterPGTIDAAERAAIETLLHAFDPFRYDQDRGPVRSITLRELAQVWK*
Ga0134088_1027304023300010304Grasslands SoilIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0134111_1022940313300010329Grasslands SoilSHPGTIDATERAAITELFDALASLRYDLDNGPLRFITLAQLAQAYPY*
Ga0134126_1309954123300010396Terrestrial SoilRAAITALFDALAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0134124_1270907213300010397Terrestrial SoilTLVSHPATIDATERAAIEKLFAAFEPYRWDRDMGPLRYVTLAQLAKAFGP*
Ga0120163_108590613300012003PermafrostPGTIDATERAAIAALFNALAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0137399_1056071923300012203Vadose Zone SoilIDVTERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWR*
Ga0137399_1098390113300012203Vadose Zone SoilAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0137380_1133244723300012206Vadose Zone SoilTVVSHPGTNDATERAAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0137381_1075527623300012207Vadose Zone SoilVSHPGTIDATERAAITAIFNALAPFRYDLDQGPLRFVTLAQLAQAWR*
Ga0137376_1031715013300012208Vadose Zone SoilIDATERAAITALFDAFAPLRYDRDAGPVRFVTLAQLAQAWR*
Ga0137377_1002388713300012211Vadose Zone SoilHPGTIDATERAATEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR*
Ga0137377_1064127723300012211Vadose Zone SoilHPGTIDATERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWP*
Ga0137367_1015648513300012353Vadose Zone SoilALTLVSHPGTIDATERAAIEALLGAFTPYRYDSDAGPLRFVTLAQLAKAYSR*
Ga0137367_1074685213300012353Vadose Zone SoilTIDATERAAIETLFRAFEPLRYERDTGPLRFVTLAQLAKAYSP*
Ga0137368_1008816933300012358Vadose Zone SoilTIDATERGAIEKLFAAFEPYRWDRDMGPLRYVTLAQLAKAYGP*
Ga0137397_1004973013300012685Vadose Zone SoilRAAIEALFKAFGPLRYDADSGPVRFVTLAQVAQAFR*
Ga0137396_1040465413300012918Vadose Zone SoilIDATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR*
Ga0137410_1037434313300012944Vadose Zone SoilEVKVARDATERAAITALFNALAPLRYDLDKGPVRFVTLAQLAQAWR*
Ga0120106_102604633300013427PermafrostAIEALLGAFDPLRYDLDSGPVRFVTLAQLAKAYGP*
Ga0120158_1028502813300013772PermafrostIEALLGAFDALRYDKDSGPVRFVTAAQLAKAYRQ*
Ga0134075_1001329823300014154Grasslands SoilVSHPGTIDATERAAIAALFNALAPLRYDRDSGPLRFVTLAQLAQAWR*
Ga0167657_103438223300015079Glacier Forefield SoilGTIDATERAAIESLFQSFEPYRYDRDHGPVRFVTLAQLAQALK*
Ga0167650_111115813300015203Glacier Forefield SoilTIVPAERAAIEALLGAFDGLRYDKDAGPVRFVTAAQLAKAYRQ*
Ga0137409_1004294113300015245Vadose Zone SoilGTIDATERAAITALFNALAPLRYDLDKGPVRFVTLAQLAQAWR*
Ga0132258_1311849313300015371Arabidopsis RhizosphereLTLVSHPATIDPIERDAIESLFKAFGPLRYEADSGPLRFVTLAQLAQAMK*
Ga0134112_1039172513300017656Grasslands SoilRAITLVSHPGTIDDTERVAIESLFAAFEPLRYDRDEGPLRFVTLAQLARAYAP
Ga0184610_106753013300017997Groundwater SedimentERAAIEKLFQAFEPYRWDRDKGPLRFVTLSQLAKAYNR
Ga0184604_1037342223300018000Groundwater SedimentTESAAIAALFNALGPLRYDRDSGPVRFVTLAQLAQAWR
Ga0184632_1018619523300018075Groundwater SedimentVSHPSTIDATERGAIETLFRAFDQYRYDRDAGPLRFVTLAQLARAYGP
Ga0215015_1046055423300021046SoilSHPGTFDATERAAIEALFTAFAPLRYDDDAGPVRFVTLTQFAAAYK
Ga0222625_164506013300022195Groundwater SedimentAPLALTIVSHPGTIDATERAAITALFNAFAPLRYDLDKGPVRFVTLAQLAQAWR
Ga0137417_135298823300024330Vadose Zone SoilVITVVSHPGTIVPAERAAIEALLGAFGPLRYDADAGPVRFVTLAQLAKAYNR
Ga0209642_1024854833300025167SoilGAIEALFRAFDPFRYDRDAGPLRFVTLAQLAKAYAR
Ga0209824_1005617213300025173WastewaterPGTIDATERAAIESLFRALEPYRFDRDRGPVRFITLAQLGKALK
Ga0209824_1021818723300025173WastewaterIVSHPGTIDAAERAAIETLLHAFDPFRYDQDKGPLRFVTLQQLARAWK
Ga0207699_1085764133300025906Corn, Switchgrass And Miscanthus RhizospherePAERAAIEALLMAFGPLRYDTDSGPVRFVTLAQLAKAYAH
Ga0207670_1085757323300025936Switchgrass RhizosphereTERAAIEALFKSFEPYRYDRDNGPLRFVTLAQLAQAFK
Ga0207704_1031108623300025938Miscanthus RhizosphereVSHPGTIDTIERAAIEALFHAFDPYRYDRDSGPLRFVTLAQLAKAYGR
Ga0209238_127318223300026301Grasslands SoilTIDATERAAITALFEALAPLRYDQDKGPLRFVTLAQLAQAWR
Ga0209471_115386413300026318SoilVPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP
Ga0209473_105599413300026330SoilGPAERAAVEALLGAFGPLRYDADAGPVRFVTLAQLATAWGR
Ga0209803_110203913300026332SoilATERAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG
Ga0209160_114021813300026532SoilVSHPGTIDANERAAITALFDAFRPLRYDLDNGPLRFVTLAQLAQAWR
Ga0209157_107945213300026537SoilRAAIESLFAAFAPLRYDRDIGPVRFVTLAQLARAYG
Ga0209056_1004372553300026538SoilAITIVSHPGTIDATERAAITALFEALAPLRYDQDKGPVRFVTLAQLAQAWP
Ga0209056_1011993133300026538SoilDATERAAIEALFRAFEPYRYDRDAGPLRFITLAQLAKAYSR
Ga0209376_110083713300026540SoilVVSHPGTIDATERAAIEALFRAFEPYRYDRDAGPLRFITLAQLAKAYSR
Ga0209161_1011129023300026548SoilSHPGTIDATERAAITALFEALAPLRYDQDKGPVRFVTLAQLAQAWP
Ga0209474_1004821753300026550SoilVSHPGTIVPAERAAIESLFSAFAPLRYDRDAGPVRFITLAQLATAYGP
Ga0209011_110187313300027678Forest SoilDATERAAIEALFRAFDPYRYDRDAGPLRFVTLAQLAKAYAR
Ga0209590_1033240123300027882Vadose Zone SoilITATERAAIEALFQAFDPVRYDRDAGPVHFVTLAQLAKAYGR
Ga0209590_1060059123300027882Vadose Zone SoilHPGTIDATERAAITALFDAFSPLRYDRHAGPVRFVTLAQLAQAWR
Ga0307301_1012156213300028719SoilIDATERAVITALFNAFAPLRHDLDNGPVRFVTLAQLAQAWR
Ga0307305_1004577713300028807SoilAAITALFEAFAPLRYDRDAGPVRFVTLAQLAQAWR
Ga0307292_1009514313300028811SoilAIEALFRAFEPYRYDRDAGPLRFVTLAQLAKAYSR
Ga0307304_1022038113300028885SoilHPGTIDATERAAIEALFQSFEPYRYDRDNGPLRFVTLAQLERAFK
Ga0307473_1065947723300031820Hardwood Forest SoilIDATERAAITALFDAFAPLRYDQDNGPLRFVTLAQLADAWR
Ga0326597_1049574513300031965SoilSTIGATERGAIETLFRAFEPLRYDRDAGPLRFVTLAQLAKAYAR
Ga0307471_10261254713300032180Hardwood Forest SoilTVVSHPGTIDATERAAIESLFAALEPLRYDRDSGPVRFVTLAQLARAYAR
Ga0364934_0425100_357_5033300034178SedimentVSHPSTIDATERGAIETLFRAFDPYRYDRDAGPLRFVTLAQLAKAYSR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.