NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104923



Overview

Basic Information
Family ID F104923
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 89 residues
Representative Sequence MDTGVRSTVKFRTVSIALTLGIVACAALSLAAQKKAAVTSGPVVFVQDKGKLTIKLDGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGASK
Number of Associated Samples 81
Number of Associated Scaffolds 100
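
As a quick sanity check on the figures above, the following Python sketch computes the length and residue counts of the representative sequence. The helper name `residue_summary` is illustrative only and is not part of NMPFamsDB. The reported average length (89 residues) is taken over all 100 family members, so the representative need not match it exactly.

```python
from collections import Counter

# Representative sequence of family F104923, copied from the listing above.
REPRESENTATIVE = (
    "MDTGVRSTVKFRTVSIALTLGIVACAALSLAAQKKAAVTSGPVVFVQDKGKLTIKLDGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGASK"
)


def residue_summary(sequence: str) -> dict:
    """Return the length and per-residue counts of a protein sequence."""
    counts = Counter(sequence)
    return {"length": len(sequence), "counts": dict(counts)}


summary = residue_summary(REPRESENTATIVE)
print(summary["length"])
```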

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 14.00 %
% of genes from short scaffolds (< 2000 bps) 14.00 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.37


Most Common Taxonomy
Group Unclassified (86.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (50.000 % of family members)
Environment Ontology (ENVO) Unclassified (46.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (51.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 25.83%, β-sheet: 26.67%, Coil/Unstructured: 47.50%

Predicted 3D Structure

pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
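
The screening rule in the note above can be written as a small check. This is a minimal sketch: the 0.7 cut-off comes from the note, while the function name `model_confidence` and the label "high" for scores at or above the cut-off are assumptions made for illustration.

```python
PTM_THRESHOLD = 0.7  # cut-off quoted in the note above


def model_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> str:
    """Label a predicted model as 'low' or 'high' confidence by its pTM-score.

    The 'high' label for scores >= threshold is an assumption; the page only
    states that models with pTM < 0.7 are treated as low confidence.
    """
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError(f"pTM-score must lie in [0, 1], got {ptm_score}")
    return "low" if ptm_score < threshold else "high"


# Family F104923 has a pTM-score of 0.37.
print(model_confidence(0.37))
```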



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF01202 | SKI | 68.00
PF13349 | DUF4097 | 9.00
PF01336 | tRNA_anti-codon | 5.00
PF08281 | Sigma70_r4_2 | 4.00
PF13345 | Obsolete Pfam Family | 3.00
PF01262 | AlaDh_PNT_C | 1.00
PF12704 | MacB_PCD | 1.00




Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 86.00 %
All Organisms | root | All Organisms | 14.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300005557|Ga0066704_10265438 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1165
3300009012|Ga0066710_103483918 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 596
3300009089|Ga0099828_10410679 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1221
3300012198|Ga0137364_10249043 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1312
3300012202|Ga0137363_10392190 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1155
3300012203|Ga0137399_10417981 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1120
3300012205|Ga0137362_11020959 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 704
3300017823|Ga0187818_10075552 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1451
3300021086|Ga0179596_10126440 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1193
3300026332|Ga0209803_1099419 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1191
3300026482|Ga0257172_1019274 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1190
3300027643|Ga0209076_1064726 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1038
3300027875|Ga0209283_10183379 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1391
3300031753|Ga0307477_10324586 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Candidatus Acidoferrales → Candidatus Acidoferrum → Candidatus Acidoferrum panamensis | 1061
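
Each scaffold identifier above appears to combine a numeric prefix with a scaffold name, joined by `|`; the prefixes match the Taxon OIDs in the Associated Samples table. The sketch below splits an identifier on that assumption. The names `ScaffoldRef` and `parse_scaffold` are hypothetical and not part of any NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric sample identifier before the '|'
    scaffold_id: str  # scaffold name after the '|'


def parse_scaffold(identifier: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_id>' into its two parts.

    Assumes the prefix before '|' is the all-digit Taxon OID of the sample.
    """
    taxon_oid, sep, scaffold_id = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_id)


ref = parse_scaffold("3300005557|Ga0066704_10265438")
print(ref.taxon_oid, ref.scaffold_id)
```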




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 50.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 14.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 8.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.00%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 3.00%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.00%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.00%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 2.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.00%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.00%
Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland | 1.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.00%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006102 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015 | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300017823 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3 | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300017955 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2 | Environmental
3300017961 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300019284 | Metatranscriptome of tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MT (Metagenome Metatranscriptome) | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300022508 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-19-M (Metagenome Metatranscriptome) | Environmental
3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024288 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16fungal | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 (SPAdes) | Environmental
3300026332 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026481 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-A | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027645 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027910 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12053J15887_100092631 | 3300001661 | Forest Soil | MDTGVRSTVKFRTVSIALALGIVACAALSLAAQKKAAETSSPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADI
JGI25388J43891_10620662 | 3300002909 | Grasslands Soil | MDTGVTSTVKFRTVSLALALGIAVCAALSLAAQKKAAESSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEITPPEGAASKVSGSL
JGI25390J43892_100040591 | 3300002911 | Grasslands Soil | MDAGVRGTVKFRIISIALTFGIVACAVSIAAQKKSAATSGPVVFVQDKGKLTIKLGGQTVGHEDFEIAPSGGGWLAKGLAEIKPPEGAGSKVSGSLTLQGN
JGI25617J43924_100428701 | 3300002914 | Grasslands Soil | MDAEVRSTVKFRTAWIALTFGIVASAALSVAAQKKAAVTSGAVVFTPDEGKLTIKLDGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGAVSKISGSLTL
Ga0066672_108933582 | 3300005167 | Soil | MDTGVRGTVKFRIISIALTFGMAACAVLAIAAQKKSPATPGPVVFVQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGLAEI
Ga0066683_100030981 | 3300005172 | Soil | VKLRISIALTFGIVVCAVLSIAAQKKAAATSGPVVFVQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGL
Ga0066679_100852541 | 3300005176 | Soil | VKFRTASIALVLGIVVCARLSLAAQKKAAETSGPVVLAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGSSKVSGSLTLQADGA
Ga0066675_101020673 | 3300005187 | Soil | VKFRTASLALTIGIVALSLAAQKKAAETSGPVVFAEDKGKLTIKLGGQIVGHEEFEIARSGGGWLAKGTAEIKPPEG
Ga0070708_1009481751 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MDAGVRSTVKFKTTFIALTFGIAAVATLSLAAQKKGAQTSGARVFAADKGKFIIKLGGQTVGHEDFEIAPSGGGWLAKGTAEIKPPESSGSKV
Ga0066682_104764022 | 3300005450 | Soil | MDTEVRSAVKFRTASLALTLGIVAWSLAAQKKAAETSGPVVFAQDKGRLTIKLGGQTVGHEEFEIARSGGGWLAKGTAEIKPPEGESSKVSGSLTLQA
Ga0066704_102654381 | 3300005557 | Soil | MVTRVRSTVKFRTAWIGIAFGIVTCAAFTLAAQKKGAATSGGSIFAQDKGKFAIKLDGQTVGHEEFEIAPSGGGWLAKGTTD
Ga0066691_100067201 | 3300005586 | Soil | VKFRTASLALTLGMVALSLAAQKKAAETSGPVVFAQDKGKLTIKLGGQTVGHEEFEIARSGGGWLAKGTAEIKPP
Ga0066691_104270362 | 3300005586 | Soil | VKFRTGSIALTLGIAACAALSLAAQKGGQSSGTVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGTAEIKPPEGNSSKVSGSLTMRADGAPISY
Ga0066706_100417325 | 3300005598 | Soil | MDTEVRSAVKFRTASLALTLGIVAWSLAAQKKAAETSGPVVFAQDKGRLTIKLGGQTVGHEEFEIARSGGGWLAKGTAEIKPP
Ga0075015_1009167252 | 3300006102 | Watersheds | MDTGVEAPVKFTTISFALTLGIVACAALSLAAQKKTAAPSGAVVFSQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGTSSKVSGSLTLQA
Ga0079222_111465251 | 3300006755 | Agricultural Soil | MDTGVRGTVKFRIISIALTLGIVVCAVLSMAAQKKAAVTSGPVVFVQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGLAEIKPPEGGSSKV
Ga0066659_100136707 | 3300006797 | Soil | MILGAAACTVLSLAAQKKAAGGSGPVVLAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGSSKVSGSLTLQADG
Ga0099794_1000122010 | 3300007265 | Vadose Zone Soil | MDTGVRSTVKSKTVSIALTLGIVACAAFSLAAQKKAAETSGPVVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGIVEIKP
Ga0066710_1034839181 | 3300009012 | Grasslands Soil | MDAGVTSAVKFRTASVALTLGIVACAALFLAAQKKAAEASGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPTGGGWLAKGTAEIKPPEGAASKVSGSLTLQGDGALAMA
Ga0099829_114396672 | 3300009038 | Vadose Zone Soil | MDAGVRSTVKFRIISISLTFGIAACAALSLAAQKKSAAAPSPVVFAQDKGKFTIRLGGQTVGHEEFEIAPSGGGWLAKGIAQ
Ga0099828_104106791 | 3300009089 | Vadose Zone Soil | MVLGVRSIVKFRKVSIALTLGIVACAALSLAAQKKAAQTSGPVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDG
Ga0099792_100177855 | 3300009143 | Vadose Zone Soil | MVLGVRSIVKFRKVSIALTLGIVACAALSLAAQKKAAQTSGLVVFAQDKGKFTIKLGGQTVGHEEFEIAPCGGGWLAKGTAEIKPPDGT
Ga0134063_103154051 | 3300010335 | Grasslands Soil | MDTGVTSTVKFRTVSLALALGIAVCAALSLAAQKKAAESSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGA
Ga0126378_114646781 | 3300010361 | Tropical Forest Soil | MIAGVSSTVKFRIVSLGTALALIACVGLGVGAQKKPATGSVFLQDKGKFTIKLAGQTVGHEEFEIAPSGGGWLAKGSA
Ga0137392_108515041 | 3300011269 | Vadose Zone Soil | MDTGVRSAVKFKTVSIALTLGIVACAAFSLAAQKKAAETSGPVVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGIVEIKPPEGDASKVS
Ga0137391_109953182 | 3300011270 | Vadose Zone Soil | MNTGVTSTVKFRTVSLALALGIVACAALSLAAQKKAAESSGPVVFVLDKGKFTVKLDGQTVGHEEFEIAPSSGGWLAKGTVEIKPPEGAASKVSGSLTLQGDGA
Ga0137391_114122121 | 3300011270 | Vadose Zone Soil | MDAGVRSTVKFRTVSIALTLGIVACAALSLAAQKGGQPSGAVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGTAEIKPPEGSSSKVSGSLTM
Ga0137389_116627682 | 3300012096 | Vadose Zone Soil | MDAGVRSTVKFRTTSIALIFGVAACAALSLAAQKKSAAPSAPVVFAQDKGKFTIRLGGQTVGHEEFEIAPSGGGWLAKGIAQIKPPDGGSSKVSGSLTLQ
Ga0137388_102335391 | 3300012189 | Vadose Zone Soil | MVSGVRSIVKFRTVSIALTLGIVACAALALAAQKKAAQTSGPVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDSGASKVSGSLT
Ga0137364_102490431 | 3300012198 | Vadose Zone Soil | MDTGVTSTVKFKTVSLALALGIVVCTALSLAAQKKAAESSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGAASKVSGSLTL
Ga0137382_100113467 | 3300012200 | Vadose Zone Soil | MDTGVTSTVKFKTVSLALALGIVVCTALSLAAQKKAAESSGPVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEG
Ga0137382_104210361 | 3300012200 | Vadose Zone Soil | MDAGVTSTVKFRTASVALTLGIVACAALSLAAQKKAAEASGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIK
Ga0137363_103921901 | 3300012202 | Vadose Zone Soil | MDAGVTSTVNFRTASLALTLGIVACAALSLAAQKKAAESPGPVVFAQDKGKFTVKLGGQTVGHEEFEIAPSGGGW
Ga0137399_104179811 | 3300012203 | Vadose Zone Soil | MNKGVRSTVKFRTVPIAVTLGIVACAGLSLAGQKKGGPASGPVVLVPDRGKLTIKLAGQTVGHEEFEIASSGGGWLAKGTAEIKPPQGASSKVSG
Ga0137399_105599503 | 3300012203 | Vadose Zone Soil | MDTGVRSTVKFRTVSIALTLGIVACAALSLAAQKKAAETSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGW
Ga0137399_111751262 | 3300012203 | Vadose Zone Soil | VKFRTVSIALALGIVACAALSLAAQKKAAETSAPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPPDGGASKVSGSLTLQG
Ga0137362_110209592 | 3300012205 | Vadose Zone Soil | MDAGVRNTVKFRTVSIALTLGIVACAALSLAAQKGGQPSGTVLFAQDKGKLTIKLGGQTVGHEEFEITPSGGGWVAKGTAEIKPPEGSSSKVSGSLTMRADGAPIS
Ga0137362_112055921 | 3300012205 | Vadose Zone Soil | MDAEVRSTVKFKTVFIALTFGIVACAALSLAAQKKGAQISGARVFAADKGKLTIKLGGQTVGHEEFEIAPSGGGWL
Ga0137381_100217447 | 3300012207 | Vadose Zone Soil | MDTGVRSAVKFRTASLALTLGIVALSLAAQKKAAETSGAVVFVQDKGKLTIKLGGQTVGHEDFEIARSGGGWLA
Ga0137381_108835392 | 3300012207 | Vadose Zone Soil | MDAGVRSTVKFKTVFVALTFGIVACGALSLAAQKKGAQISGPRVFAADKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPESSGSKVSGS
Ga0137376_110133431 | 3300012208 | Vadose Zone Soil | MSTGVRNTVNFRTACIGLTLGLAACAALSFAAQKKTAPASGVLVQDKGKFTIKLAGQTVGREEFEIAPAEGGWLAKGTTDIKPPEGAASKVTGSLT
Ga0137378_111123102 | 3300012210 | Vadose Zone Soil | MDAGVRSTVKFKTVFVALTFGIVACGVLSLAAQKKGAQISGPRVFAADKGKLTIKLAGQTVGHEEFEIAPSGGGWLAKGTAEI
Ga0137377_101665381 | 3300012211 | Vadose Zone Soil | VKFRTVSLALTLGIVAWSLAAQKKSASGPVVFAQDKGKLTIKLGGQTVGHEEFEIARSGGGWLAKGTAEIKPPEGSSSKV
Ga0137387_103265163 | 3300012349 | Vadose Zone Soil | MDTGVRGTVKFRTVSVALTLGIVACAVLTFAAQKKGTATGGPVVLAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGSASKVSGSLTLQADGA
Ga0137387_103410382 | 3300012349 | Vadose Zone Soil | MDTGVRGTVKFRTVSVALTLGIVACAVLTFAAQKKGTSAPVVLAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGSASKVSGSLTLQADGA
Ga0137366_101324581 | 3300012354 | Vadose Zone Soil | MVTGVRGTVKFRTAWLGIAFGIVTCAAFTLAAQKKGAASSGGSILAQDKGKFAIKLDGQTIGHEEFEIAPSGGGWLAKGTTDLKPPEGGASKVT
Ga0137360_100665451 | 3300012361 | Vadose Zone Soil | MDAGVTSTVNFRTASLALTLGIVACAALSLAAQKKAAESPGPVVFAQDKGKFTVKLGGQTVGHEEFEIAPSGGGWL
Ga0137360_110078582 | 3300012361 | Vadose Zone Soil | MDAGVRNTVKFRTVSIALTLGIVACAALSLAAQKGGQPSGTVLFAQDKGKLTIKLGGQTVGHEEFEITPSGGGWVAKGTAEIKPPEGSSSKVSGSLT
Ga0137360_116039581 | 3300012361 | Vadose Zone Soil | MDAGVRSTVKFRTVSIALTLGIVACAALSFAAQKGGQPSGGVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWV
Ga0137390_107294851 | 3300012363 | Vadose Zone Soil | MDAGVRSTVKFRTTSIALIFGVAACAALSLAAQKKSAAPSAPVVFAQDKGRLTIKLGGQTVGHEEFEIAPSGGGWLAKGVAEIKPPDGGSSKVSGSLTLQ
Ga0137398_100885184 | 3300012683 | Vadose Zone Soil | VTLGIVACAGLALADQKKGGPASGPVVLVPDRGKLTIKLAGQTVGHEEFEIAPSGGSWLAKGIAEIKLPQGASSKVSGSLALQAN
Ga0137398_104255342 | 3300012683 | Vadose Zone Soil | MDAGVRSIVKFRTVSIALTLGIVACAALSLAAQKGGQPSGAVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGSAE
Ga0137397_105551922 | 3300012685 | Vadose Zone Soil | LSLAAQKKAAEASGPVVLAQDKGKLIVKLGGQTVGHEEFEIAPSSGGWLAKGTAEIKPPEGGSSKISGSLTLQ
Ga0134081_102891572 | 3300014150 | Grasslands Soil | MDAGVRGTVKFRIISIALTFGIVACAVSIAAQKKSAATSGPVVFVQDKGKLTIKLGGQTVGHEDFEISPSGGGWLAKGLAEIKP
Ga0134079_103939551 | 3300014166 | Grasslands Soil | MDTGVTSTVKFRTVSLALALGIAVCAALSLAAQKKAAESSGPVVFAQDKGKLTIKLAGQTVGHEEFEIAPSGGGWLAKGTAEITPPEGAASKVSGSLTL
Ga0137420_11585333 | 3300015054 | Vadose Zone Soil | VKFRTVSIALALGIVACAALSLAAQKKAAETSAPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSSGGWLAKGTADINPPDG
Ga0137420_12231862 | 3300015054 | Vadose Zone Soil | MDAGVRSTVKFRTVSIALTLGIVACAALSFAAQKGGQPSGGVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGTAEIKPPEGSSSKVSG
Ga0182035_113403172 | 3300016341 | Soil | VKLRALSIALVFVVAAGAVLTLAAQKKVAAAPSILAQDKGKFTIKLAGQTVGHEEFEIAPSGGGWLAKGTADIKPPEGSPSKVTGSLTLQPD
Ga0187818_100755523 | 3300017823 | Freshwater Sediment | MAEGDHLAVKFRTAIITLTVGLAAYATITVAAQKKAASGVFTQDKGKLNIKIDGQTVGHEEFEIVPSGGGWLARGTSEFKPPEGAASKVSGT
Ga0187801_105223582 | 3300017933 | Freshwater Sediment | VKFRTVIFSLTIGLAACAAITVAAQKKAAASVFTQDKGRLDIKLNGQTVGHEEFEIAPSDGGWLAKGTSEFKP
Ga0187817_108132942 | 3300017955 | Freshwater Sediment | MKRISSLKSPDPEREWAGDHLAVKFRTAIISLTIGLAACAAFTVAAQKKGAASVFIQDKGKLVIKLDGQTVGHEEFEIVPSGGGWLAKGTSEFKPPEGAASKVTGTL
Ga0187778_100172991 | 3300017961 | Tropical Peatland | VKSKAAIFSFAIGVAVFATFSLAAEKKAAGSVFAQDKGKLVIKLDGQTVGHEEFEITPSADGWLAKGTSEFKPPEGAASKVTGSLNMQPDGVPVSY
Ga0187769_100218966 | 3300018086 | Tropical Peatland | VKFRSSFFALVFVFAAGVALTLEAQKKPAASPSVLAQDKGKFTIKLAGQTVGHEEFEIAPSEGGWLAKGTADIKTPEGPASKVT
Ga0187797_12483521 | 3300019284 | Peatland | VKFRSSFLALVFVFAAGVALTLEAQKKPAASPSVLAQDKGKFTIKLAGQTVGHEEFEIAPSEGGWLAKGTADIKTPEGPASKVTGA
Ga0179592_100123771 | 3300020199 | Vadose Zone Soil | MGTGVRSIVKFRTVSFGLTLAILAWAACSFAAQKKAAATSGAVVFAQDKGKLTIKLGGQTVGHEEFEIAPFGGGWLAKGTAEIKPPAGAASKVSGS
Ga0179592_100879873 | 3300020199 | Vadose Zone Soil | MDTGVSGTVKFRIIVIALTFGVVACAVLSIAAQKKAAATSGPVVFVQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGLAEIKPPDGGSS
Ga0179596_101264403 | 3300021086 | Vadose Zone Soil | MDTGVRNTVKFRTVSFALTFGIVACGGLSLAAQKKSTPTSGPVVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPE
Ga0210400_100656021 | 3300021170 | Soil | MDTGVRNTVKFRTALLALTLGIVACGAFSLAAQKKAAEASGAVVFAQDKAKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDGPASKVSGSL
Ga0210394_113845222 | 3300021420 | Soil | LKFKTASFGLALTISACAALTFAAQKKSTANSAAAVFAQDKAKFTIKLDGQTVGHEEFEIAPNGGGWLAKGASDIKLPA
Ga0210409_102013621 | 3300021559 | Soil | VKFRTVPIAVTLGIVVCAGLSLADQKKGGPASGPVVLVPDRGKLTIKLGGQTVGHEEFEIAPSGGSWLAKGTAE
Ga0210409_113618602 | 3300021559 | Soil | VKFRTVSFALTLAILACAAFSFAAQKKTAASSGPVIFAQDKGKFTIKLGGQTVGHEEFEIAPSGSGWLAKGTAEIKPPDGTASKVSGS
Ga0222728_10723633 | 3300022508 | Soil | VKFRTVSFALTLAILACAAFSFAAQKKTAASSGPVIFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPPDGGASK
Ga0242660_12119491 | 3300022531 | Soil | MDAGVRSTVKFKTVSFTLALAILACAAFSIAAQKKPAATSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTA
Ga0179589_100196664 | 3300024288 | Vadose Zone Soil | VKFRTASVALTLGIVTCAALSLAAQKKAAEASGPVFAEDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGIAEIKPPEGAASKVSGS
Ga0209235_12405572 | 3300026296 | Grasslands Soil | VKFRATCVGLILGFAACAALSFAAQKKVAAGSGVLAQDKGKFTIKLAGQTVGHEEFEIAPSGGGWLAKGTADIKPPEGAASKVT
Ga0209375_10375611 | 3300026329 | Soil | VKFRTASLALTLGIVAWSLAAQKKAAETSGPVVFAQDKGRLTIKLGGQTVGHEEFEIARSGGGWLAKGTAEIKPPEGESSKVSGSLTL
Ga0209803_10994191 | 3300026332 | Soil | VKLRISIALTFGIVVCAALSIAAQKKAAATSGPVVFVQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGLAEIKPPEGGSSKVSGSLTLQADGAPISYD
Ga0209377_10587563 | 3300026334 | Soil | MVTGIRRTVKFRTAWIGIAFGIVTCAAFTLAAQKKGAASSGGSILAQDKGKFAIKLDGQTIGHEEFEIAPS
Ga0257155_10590061 | 3300026481 | Soil | MDTGVRSTVKFRTVSFALTLGVVVCAALSLAAQKKPAPTSGSVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGTA
Ga0257172_10192743 | 3300026482 | Soil | VKFRTVSIALTLGIVACAALSLAAQKKAAETSGPIVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPPDGGASKVSGSLTLQGDGAPISYDW
Ga0209157_13152511 | 3300026537 | Soil | MVTGVRGTVKFRTAWLGIAFGIVTCAAFTLAAPKKGAASSGGGILAQDKGKFTIKLDGQTVGHEEFEIAPSG
Ga0209648_100756551 | 3300026551 | Grasslands Soil | VKFKTVSIALTLGIVACAAFSLAAQKKAAETSGPVVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKSIVEIKPPEG
Ga0209076_10647263 | 3300027643 | Vadose Zone Soil | VKFRIGSIALALGIVACAALSLAAQKKAAPSSGTVVFAQDKGKFTIKLGGQTVGHEEFEITPSGGGWLAKGTTEIKPPEGSASRVSGSLTLQGNGAPISYQWS
Ga0209076_12008192 | 3300027643 | Vadose Zone Soil | MDTGVRSTVKFRTVSIALALGIVACAALSLAAQKKAAETSAPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPPDGGTSKVSGSLTLQGDG
Ga0209117_11995721 | 3300027645 | Forest Soil | MDAGVTSTVKFRTASLALTLGIVACAALALAAQKKAAVTSGPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGASKISGSLTLQ
Ga0209588_11508272 | 3300027671 | Vadose Zone Soil | MDAGVRSIVKFRTVSIALTLGIVACAALSLAAQKGGQPSGAVLFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWVAKGTAEIKPPEGSSSKVSGSLTMRADGAPI
Ga0209701_103378711 | 3300027862 | Vadose Zone Soil | MDTGVRSTVKFRTVSIALTLGIVACAALSLAAQKKAAVTSGPVVFVQDKGKLTIKLDGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGGASK
Ga0209701_103798301 | 3300027862 | Vadose Zone Soil | MDAGVRSTVKFRTAWIALAFGIVASAAWSVAAQKKAAVTSGAVVFTPDKGKLTIKLDGQTVGHEEFEIAPSGGGWLAKGT
Ga0209283_101359851 | 3300027875 | Vadose Zone Soil | MVLGVRSIVKFRKVSIALTLGIVACAALSLAAQKKAAQTSGPVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDGTASKV
Ga0209283_101833791 | 3300027875 | Vadose Zone Soil | MVSGVRSIVKFRTVSIALTLGIVACAALALAAQKKAAQTSGPVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDSGASKVSGSLTL
Ga0209068_106977472 | 3300027894 | Watersheds | VKSKSISVVLTLAIVACAALSLAAQKKTAQTSGPVVLAQDKGKLTIKLAGQTVGHEEFEIAPSGGGWLAKGTAEIKPPDGGASKVSGSLTMQAYGAPISYD
Ga0209488_100070841 | 3300027903 | Vadose Zone Soil | MDAGVTSTVKFRTASVALTLGIVACAALSLAAQKKAGEASGPVVFAQDKGKLTIKLGGQTGGHEEFEIAPSGGGWLAKG
Ga0209488_101372091 | 3300027903 | Vadose Zone Soil | MDAGVRSTVKFRIGSIALALGIVACAAMSLAAQKKAAPSSGAVVFAQDKGKFTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPE
Ga0209583_102604531 | 3300027910 | Watersheds | VKFRTVSFSLALGIVACAALSLAAQKKPAASSGAIVFSQDRGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTAEIKPPEGASSKVSGSLILQANG
Ga0137415_107483692 | 3300028536 | Vadose Zone Soil | MDTGVRSTVKFRTVSIALALGIVACAALSLAAQKKAAETSAPVVFAQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPP
Ga0137415_107483702 | 3300028536 | Vadose Zone Soil | MDTGVRSTVRFRTVSIALTLGIVACAALSLAAQKKAAETSGPVVFTQDKGKLTIKLGGQTVGHEEFEIAPSGGGWLAKGTADIKPP
Ga0307469_116812962 | 3300031720 | Hardwood Forest Soil | VKFKTSFVALGLGIMACAAFSLAAQKKGAPKSGASVFTQDKGKFSIKLAGQTVGHEEFEIAPAGGGWLAKGTAEIKPPEGAASKVSGS
Ga0307477_103245863 | 3300031753 | Hardwood Forest Soil | VHAALSFAAQKKAAQTSGPVVFAQDKGKLTIKLGGQTVGHEEFEIVPSGGGWLAKGTAEIKPPDSGASKVSGSLTLQANGTPIS
Ga0307479_104057993 | 3300031962 | Hardwood Forest Soil | MDTGVRSAVKFRTISLALTLGIAACAGFSLAAQKKAAAASGPVVFTQDKGKLTVKLAGQTVGQEEFEIAPSGG
Ga0307470_113969682 | 3300032174 | Hardwood Forest Soil | VKFKTSFVALGLGIMACAAFSLAAQKKGAPKSGASVFTQDKGKFSIKLAGQTVGHEEFEIAPAGGGWLAKGSAEIKPPEGAASKVSGSLTLQADGAPISYE
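
For downstream tools it can be convenient to hold the family sequences as FASTA. The sketch below formats one record from the listing above. It assumes the Sample Taxon ID is the 10-digit number beginning with 3300 and that the characters before it form the protein ID; the header layout (`taxon_oid=`, `habitat=`) and the function name `to_fasta` are illustrative choices, not an NMPFamsDB export format.

```python
from typing import Iterable, Tuple

Record = Tuple[str, str, str, str]  # (protein_id, taxon_oid, habitat, sequence)


def to_fasta(records: Iterable[Record], width: int = 60) -> str:
    """Format records as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for protein_id, taxon_oid, habitat, sequence in records:
        # Header layout is an illustrative assumption, not a database format.
        lines.append(f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}")
        for start in range(0, len(sequence), width):
            lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


example = [
    (
        "Ga0307479_104057993",
        "3300031962",
        "Hardwood Forest Soil",
        "MDTGVRSAVKFRTISLALTLGIAACAGFSLAAQKKAAAASGPVVFTQDKGKLTVKLAGQTVGQEEFEIAPSGG",
    ),
]
fasta = to_fasta(example)
print(fasta, end="")
```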




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice