NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100037

Overview

Basic Information
Family ID F100037
Family Type Metagenome / Metatranscriptome
Number of Sequences 103
Average Sequence Length 78 residues
Representative Sequence LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELKQGLAEY
Number of Associated Samples 94
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 88
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.30


Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(25.243 % of family members)
Environment Ontology (ENVO) Unclassified
(46.602 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(67.961 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 55.24%, β-sheet: 0.00%, Coil/Unstructured: 44.76%

Predicted 3D Structure

pTM-score: 0.30

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
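
The low-confidence rule stated above (pTM < 0.7) can be written as a small check. This is a minimal sketch: the function name and the "not low" label are illustrative assumptions, since the page only says which models count as low confidence.

```python
# Threshold taken from the page text: models with pTM < 0.7 are low confidence.
PTM_THRESHOLD = 0.7

def model_confidence(ptm: float, threshold: float = PTM_THRESHOLD) -> str:
    """Return 'low' for pTM below the threshold, otherwise 'not low'.

    The 'not low' label is a placeholder; the page does not name the
    category for models at or above the threshold.
    """
    return "low" if ptm < threshold else "not low"

# Family F100037 reports a pTM-score of 0.30.
print(model_confidence(0.30))
```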



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 103 Family Scaffolds
PF01553  Acyltransferase  34.95
PF00211  Guanylate_cyc    14.56
PF00801  PKD              3.88
PF13191  AAA_16           1.94
PF13828  DUF4190          0.97
PF01066  CDP-OH_P_transf  0.97
PF01464  SLT              0.97
PF12704  MacB_PCD         0.97
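
The frequencies above can be turned back into approximate scaffold counts. This is a sketch under one assumption not stated on the page: that each percentage is a count of scaffolds divided by the 103 family scaffolds and rounded to two decimals. The function name is hypothetical.

```python
TOTAL_SCAFFOLDS = 103  # "Number of Associated Scaffolds" for this family

def scaffold_count(percent: float, total: int = TOTAL_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a '% Frequency' value corresponds to.

    Assumes percent = 100 * count / total, rounded to two decimals.
    """
    return round(percent / 100 * total)

# Pfam neighbours and their reported frequencies, copied from the table.
pfam_frequency = {
    "PF01553": 34.95,
    "PF00211": 14.56,
    "PF00801": 3.88,
    "PF13191": 1.94,
    "PF13828": 0.97,
}

for pfam_id, percent in pfam_frequency.items():
    print(pfam_id, scaffold_count(percent))
```

Under that assumption, a value of 0.97 corresponds to a single scaffold, so the rarer neighbours are weak evidence on their own.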

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name                                                               Functional Category                 % Frequency in 103 Family Scaffolds
COG2114  Adenylate cyclase, class 3                                         Signal transduction mechanisms [T]  14.56
COG0558  Phosphatidylglycerophosphate synthase                              Lipid transport and metabolism [I]  0.97
COG1183  Phosphatidylserine synthase                                        Lipid transport and metabolism [I]  0.97
COG5050  sn-1,2-diacylglycerol ethanolamine- and cholinephosphotranferases  Lipid transport and metabolism [I]  0.97



Phylogeny

NCBI Taxonomy

Name          Rank  Taxonomy  Distribution
Unclassified  root  N/A       100.00 %


Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  25.24%
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  21.36%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  13.59%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  7.77%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  4.85%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  4.85%
Surface Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil  2.91%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  2.91%
Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil  1.94%
Soil  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  1.94%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  1.94%
Watersheds  Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds  0.97%
Terrestrial Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil  0.97%
Glacier Forefield Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil  0.97%
Arctic Peat Soil  Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil  0.97%
Permafrost  Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost  0.97%
Soil  Environmental → Terrestrial → Soil → Unclassified → Permafrost → Soil  0.97%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  0.97%
Soil  Environmental → Terrestrial → Soil → Wetlands → Permafrost → Soil  0.97%
Soil  Environmental → Terrestrial → Soil → Clay → Unclassified → Soil  0.97%
Arabidopsis Rhizosphere  Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere  0.97%
Attine Ant Fungus Gardens  Host-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens  0.97%




Associated Samples

Taxon OID  Sample Name  Habitat Type
2140918006  Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Permafrost Layer P1  Environmental
3300001089  Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3  Environmental
3300001154  Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1  Environmental
3300001414  Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 shallow-072012  Environmental
3300002560  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm  Environmental
3300002908  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm  Environmental
3300002912  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm  Environmental
3300002914  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm  Environmental
3300002916  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm  Environmental
3300005171  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126  Environmental
3300005174  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129  Environmental
3300005175  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122  Environmental
3300005178  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137  Environmental
3300005439  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG  Environmental
3300005451  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130  Environmental
3300005542  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1  Environmental
3300005546  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG  Environmental
3300005553  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144  Environmental
3300005554  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110  Environmental
3300005555  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141  Environmental
3300005566  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_142  Environmental
3300005569  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154  Environmental
3300005586  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140  Environmental
3300006031  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100  Environmental
3300006032  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145  Environmental
3300006034  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105  Environmental
3300006041  Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014  Environmental
3300006046  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101  Environmental
3300006755  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter  Environmental
3300006800  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109  Environmental
3300006806  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100  Environmental
3300007255  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1  Environmental
3300009088  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG  Environmental
3300009089  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG  Environmental
3300009090  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG  Environmental
3300009137  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158  Environmental
3300010159  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3  Environmental
3300010303  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015  Environmental
3300010304  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015  Environmental
3300010320  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015  Environmental
3300010322  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010337  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015  Environmental
3300010400  Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2  Environmental
3300010905  Grasslands soil microbial communities from Angelo Coastal Reserve, California, USA - 15_R_Wat_40_2_8_1 metaT (Metagenome Metatranscriptome)  Environmental
3300012096  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG  Environmental
3300012180  Attine ant fungus gardens microbial communities from Georgia, USA - TSGA058 MetaG  Host-Associated
3300012198  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG  Environmental
3300012349  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG  Environmental
3300012353  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG  Environmental
3300012356  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG  Environmental
3300012359  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG  Environmental
3300012373  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_2_0_1 metaT (Metagenome Metatranscriptome)  Environmental
3300012922  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG  Environmental
3300012925  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)  Environmental
3300012929  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)  Environmental
3300012944  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)  Environmental
3300012972  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG  Environmental
3300012975  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015  Environmental
3300013766  Permafrost microbial communities from Nunavut, Canada - A26_65cm_6M  Environmental
3300014166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015  Environmental
3300015052  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)  Environmental
3300015079  Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-6b, vegetation/snow interface)  Environmental
3300015241  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)  Environmental
3300015242  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)  Environmental
3300015264  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)  Environmental
3300015358  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaG  Environmental
3300015359  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015  Environmental
3300015373  Combined assembly of cpr5 rhizosphere  Host-Associated
3300018468  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111  Environmental
3300020006  Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2  Environmental
3300021432  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M  Environmental
3300024323  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK07  Environmental
3300026223  Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Permafrost soil replicate 2 DNA2013-190 (SPAdes)  Environmental
3300026277  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)  Environmental
3300026324  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)  Environmental
3300026326  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)  Environmental
3300026330  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)  Environmental
3300026333  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)  Environmental
3300026335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)  Environmental
3300026536  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)  Environmental
3300026542  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)  Environmental
3300027565  Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)  Environmental
3300027633  Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M1 (SPAdes)  Environmental
3300027645  Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)  Environmental
3300027842  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)  Environmental
3300027857  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)  Environmental
3300027875  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)  Environmental
3300028138  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK25  Environmental
3300028673  Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B  Environmental
3300030743  Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assembly  Environmental
3300031421  Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)  Environmental
3300031670  Soil microbial communities from Risofladan, Vaasa, Finland - OX-3  Environmental
3300031820  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515  Environmental


Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
P1_C_01292000  2140918006  Soil  LLPLFRAVLVWAVILASVILVRIDPPTAVTQAVAVTQLSAYDRLSVVQQTRNQRIQAAQVAQVKAWSAELKQGLADYQAQQDALAAAQAQAA
JGI12683J13190_10081421  3300001089  Forest Soil  LLPLFRAVLVWTVILAGVILPRLEQPVDVKYAVAVTQLNAYDRASVVQQTRTQRVQAAQMAQVNAWTQDLKQGLADYQAQLDALAAAEAESQRIAALNNHP
JGI12636J13339_10344581  3300001154  Forest Soil  LLPLFRAVLVWVVILASVILPRMAPPAQQTASVAVTQLSAYDRDSVVQQTRDQRIQAAQAAQAQSWTEQLKQGLAAYAAQQEALAAAQ
JGI20174J14864_10057162  3300001414  Arctic Peat Soil  LLPLFRAILVWTVILAGVILVRMDPPTAVTQAVAVTQLSAYDRLSVVQQTRNQRVQAAQMAQVDAWSAELKQGLADYQAQQDALAAAQAQAAR
JGI25383J37093_101733561  3300002560  Grasslands Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELK
JGI25382J43887_100139453  3300002908  Grasslands Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAVANAIAARNNHP
JGI25386J43895_101201422  3300002912  Grasslands Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAV
JGI25617J43924_101302801  3300002914  Grasslands Soil  LGLVPLLRAVLVWAVILASVVLVRLDPPASATNAVAVTQLSAYDRLSVVQQTRAQRVQAAQLAQVTAWAAELKQGLADYQAQQD
JGI25389J43894_10383781  3300002916  Grasslands Soil  LLPLFRAVLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEYQA
Ga0066677_101976592  3300005171  Soil  LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELKQGLAEY
Ga0066680_108740452  3300005174  Soil  LLPLFRAVLVWVVILVAVILPRMDPPAPQSVAVAVTQLSSFDRSSAVQQARTQRVQAAQMAQVQSWSQELKQGLAE
Ga0066673_106645271  3300005175  Soil  VWVVILVTVVLPRLAPPAQQSVTVAVSQLTAFDRSSALQQAHAQRVQAAQMAQVQSWSQELKQGLAEYQ
Ga0066688_101077552  3300005178  Soil  LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELK
Ga0070711_1014258521  3300005439  Corn, Switchgrass And Miscanthus Rhizosphere  LGLVPLLRAVLVWAVILASVVLVRLDPPTSVANAVAVTQLSAYDRLSVVQQTRTQRAQAAQMAQVSAWAAELKQGLADYQAQQDALA
Ga0066681_106989851  3300005451  Soil  VLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLA
Ga0070732_103784291  3300005542  Surface Soil  LLPLLRAVLVWVVILVIVVLPRLSPPQQPSVAIAVTQLTAFDRSSVVQQSRSARVQAAQMAQVQSWTQELKQGLADYEAKQEAAAAAQA
Ga0070696_1011723091  3300005546  Corn, Switchgrass And Miscanthus Rhizosphere  LLPLFRAVLVWIAILVTVVLPRIDPPAPDTASVAVTQLSAFDRNSVMQQTRNQRVQAAQMAQVQSWSLELK
Ga0066695_100666821  3300005553  Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAVANAIAARNNHPGPP
Ga0066695_105355181  3300005553  Soil  LRAVLVWVVLLVSVIAPRMAPPVVHPIQVAVTQLSAFDRNSVVQQNRDARVEAAQMAQVATWSNELKQGLAEYEAKQQALAAAQAQAAA
Ga0066695_108433101  3300005553  Soil  VWSVILVSVILPRLDPPIPRSVAVAVTQLSAFDRNSVVQQNRNDRVEAAQMAQVQSWSAELKQGLAEYQAKQQALAAAQAYAAQIAARSNHPAPPPEIA
Ga0066661_100038867  3300005554  Soil  LLPLFRAVLVWVVILVTVILPHLTPHQQQTVSIAVTQLSAFDRNSVVKQTVDQRVQAAQMAQVQSWSLELKQGLANYEAEKQAEAAAQAQAAAIAARSNHP
Ga0066692_107890392  3300005555  Soil  LLPLFRAVLVWVVILVTVVLPRLAPPAQQSVTVAVSQLTAFDRSSALQQAHAQRVQAAQMAQVQ
Ga0066692_107951061  3300005555  Soil  VWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELKQGLAEYQ
Ga0066693_100512542  3300005566  Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQHSVAVAVTQLSAFDRNSVMQQSKNQRVQAAQMAQVQSW
Ga0066705_100070261  3300005569  Soil  VWVVILVTVILPRLTPHQQQSVSVAVTQLSAFDRNSVVQQSVNQRVQAAQMAQVQ
Ga0066705_101421253  3300005569  Soil  VLVWVVILVSVVLPRLDPAPQQTVTVAVTQLTAFDRNSVVQQAKAQKVRAAQMAQVQSWSAELKQGLADYQ
Ga0066691_102543951  3300005586  Soil  LLPLFRAVLVWVVILVTVILPRLTPHQEQSVSVAVTQLSAFDRNSVVQQAVNQRIQAAQMAQVQSWSLELKQGLANYEA
Ga0066651_102305473  3300006031  Soil  VWAVILVSVILPRLDPPVQRSVAVAVTQLSAFDRNSVVHQNRNDRVEAAQMAQVQSWSAELKQGLAEYQAK
Ga0066696_102254061  3300006032  Soil  LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELKQGLA
Ga0066656_102628232  3300006034  Soil  VWVVILVTVVLPRLAPPAQQSVTVAVSQLTAFDRSSALQQAHAQRVQAAQMAQVQSWSQELKQGLAEYQAK
Ga0075023_1005297202  3300006041  Watersheds  LGLVPLLRAVLVWAVILASVVLVRLDPPAATTNAVAVTQLSAYDRLSVVQQTRAQRVQAAQVAQVTAWAAELK
Ga0066652_1018762071  3300006046  Soil  VWAVILVSVILPRLDPPVQRSVAVAVTQLSAFDRNAVVHQNRNDRVEAAQMAQVQSWSAELKQGLAEYQAKQ
Ga0079222_110386512  3300006755  Agricultural Soil  LLPLFRAVLVWVVVLVTIVLPRLAPPQQQSVPVAVTQLTAFDRTSVVQQARNQKVQAAQMAQVQSWSLELKQGLADYQAKLEAQA
Ga0066660_100243954  3300006800  Soil  LLPLFRAVLVWVVILVTVVLPRLAPPAQQSVTVAVSQLTAFDRSSALQQAHAQRVQAAQMAQVQSWSQELKQGLAEYQAK
Ga0079220_104633461  3300006806  Agricultural Soil  LLPLFRAVLVWVVILVSVVLPRLAPAPEQTLTAAVTTVTVFDRNSVVQDAKNQRVRAAQMAQVQSWSAEL
Ga0079220_116987452  3300006806  Agricultural Soil  LPLLRAVLVWVVILVIVVLPRLSPPQQHSVAIAVTQLPAFDRSSVVQQSRTARVQAAQMAQVQSWSQEL
Ga0099791_102654522  3300007255  Vadose Zone Soil  LLPLFRAVLVWTVILATVILPRLEPPVSMKYAVAVTQLSAYDRASVVQQTRTQRVHAAQLAQVSAWS
Ga0099830_111728072  3300009088  Vadose Zone Soil  LLPLFRAVVVWVVILAGVILPRLDAPAAVTHAVAVTQLNAYDRLSVVQQTRNERVQASQMAQ
Ga0099828_104210362  3300009089  Vadose Zone Soil  LGFVPLLRAVLVWAVILASVVLVRLDPPPALANAAAVTQLSAYDRLSVVQQTRAQRVQAAQIAQV
Ga0099827_103574631  3300009090  Vadose Zone Soil  LLPLFRAVLVWVVILVTVILPRLTPHQEQSVSVAVTQLSAFDRNSVVQQTVNQRVQAAQMAQVQSWSLELKQGLANYEAEKQAEAAAQAQAAAIAARSN
Ga0099827_111253611  3300009090  Vadose Zone Soil  LLPLFRAVLVWVVILVTVMLPRLAPPAQQSVTVAVSQLTAFDRDSAVQQARNQRVQAAQMAQ
Ga0066709_1045624781  3300009137  Grasslands Soil  VWAVILVSVILPRLDPPVQRSVAVAVTQLSAFDRNSVVHQNRDDRVEAAQMAQVQSWSAELKQGLAEYQAKQQALAAAQAVAAQIAARNNHPPPPP
Ga0099796_103307112  3300010159  Vadose Zone Soil  VVWVVILAGVILPRMDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVSVWSQELKQGLADYQAQQDALAAAQAQAAHRR
Ga0134082_102461742  3300010303  Grasslands Soil  LLPLFRAVLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEYQAKQVAEAAAQAQAEAIA
Ga0134088_101266333  3300010304  Grasslands Soil  LLPLFRAVLVWALILVSAIMPRLAPPAVHPIQVAVTQLSAFARNSVVQQNRDERVQAAQMAQVARW
Ga0134109_102982282  3300010320  Grasslands Soil  LLPLFRAVLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQV
Ga0134084_104144421  3300010322  Grasslands Soil  VLVWVVILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAQAAAIAVRYNHPG
Ga0134063_100999901  3300010335  Grasslands Soil  LLPLFRAVLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEY
Ga0134063_106530232  3300010335  Grasslands Soil  LRAVLVWVVLLVSVIAPRMAPPVVHPIQVAVTQLSAFDRNSVVQQNRNDRVQAAQMAQVAAWSSELKQGLAEYE
Ga0134062_106051601  3300010337  Grasslands Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQHSVAVAVTQLSAFDRNSVMQQSKNQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAVAA
Ga0134122_130888891  3300010400  Terrestrial Soil  VILVTVVLPRLDPPVQRSVAVAVTQLSAFDRNAVAQQNRNERVEAAQMAQVQAWSA
Ga0138112_10249602  3300010905  Grasslands Soil  LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAHMAQVQSWSKELKQGLA*
Ga0137389_103637272  3300012096  Vadose Zone Soil  LVPLFRAVLVWVVILVSVVLPRLDPPSQPSVSVAVTQLTAFDRNSVVQQARNQRVQAAQMAQVQSWTLELKQGLADYQA
Ga0153974_10939221  3300012180  Attine Ant Fungus Gardens  LLALLRAVLVWVVILVSIVLPRLDPAPEQTLTAAVTSVTAFDRNSVVQEAKSQRVKAAQMAQVETWSAELKQGLADYQA
Ga0137364_100274072  3300012198  Vadose Zone Soil  VLVWVVILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAEMAQVQSWSLELKQGLAEYQARQAALAAAQAQAAAIAVRYNHPG
Ga0137387_105105361  3300012349  Vadose Zone Soil  LLPLFRAVFVWALILVSVVLPRLDPPVQRSVAVAVTQLSAFDRNSVVHQNRNDRVEAAQMAQVQSWS
Ga0137367_106240362  3300012353  Vadose Zone Soil  LLPLFRAVLVWVVILVTVVLPRLEPPAQRTFAVAVTQLSAFDRSSAVQQAKNDRVQAARMAEVQSWSDELKQGLAEYQAKQQALAAAQALAAA
Ga0137371_101918413  3300012356  Vadose Zone Soil  LRAVLVWVVLLVSVIAPRMAPPVVHPIQVAVTQLSAFDRNSVVQQNRDARVEAAQMAQVATWSNELKQGLAEYEAKQQALAAA
Ga0137385_100394644  3300012359  Vadose Zone Soil  LLPLFRAVLVWVVILVTVILPRLTPHQEQSVSVAVTQLSAFNRNSVVQQTVNQRVQAAQMAQVQSWSLELKQGLANYEAEKQAEAAA
Ga0134042_10305891  3300012373  Grasslands Soil  VLVWVVILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQAVANAIAARNNHPGPPPEIAK
Ga0137394_100284535  3300012922  Vadose Zone Soil  LLPLFRAVLVWVVILAGVILPRLDAPASVARAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVNAWS
Ga0137394_112172492  3300012922  Vadose Zone Soil  VVWVVILAGVILPRMDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVNAWS
Ga0137419_115299051  3300012925  Vadose Zone Soil  LVPLFRAVLVWVVILVSVVLPRLDPPSQPSVSVAVTQLSAFDRNSVVQQARNQRIQAAQMAQVQSWTLELKQ
Ga0137404_114272912  3300012929  Vadose Zone Soil  LLPLFRAVLVWVVILAGVILPRMDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVSVWSQELKQGLADYQAQQDALAA
Ga0137410_106597291  3300012944  Vadose Zone Soil  LLPLFRAVVVWVVILAAVILPRLDAPASVSRVVAVTQLSAFDRLSVVQQTRNQRVQAAQMAQVNAWTQDLKKGLAEYQAQQEA
Ga0134077_103812251  3300012972  Grasslands Soil  LLPLFRAVLVWALILVSAIMPRLAPPAVHPIQVAVTQLSAFDRNSVVQQNRDERVQAAQMAQVARWSSELKQGLAEYEA
Ga0134110_104487832  3300012975  Grasslands Soil  LLPLFRAVLVWVVILVSVVLPRLDPAPQQTVTVAVTQLTAFDRNSVVQQAKAQKVRAAQMAQVQSWSAELKQGLADYQAKQEALAAA
Ga0120181_11171961  3300013766  Permafrost  LLPLFRAVLVWTVILASVILPRLEPPVSMKYAVAVTQLSAHDRASVVQQTRAQRVEAAQMAQVTAWAAELKQGLADYLGENLW
Ga0134079_100587682  3300014166  Grasslands Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQHSVAVAVTQLSAFDRNSVMQQSKNQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAA
Ga0137411_11375551  3300015052  Vadose Zone Soil  LLPLFRAVVVWVVILAAVILPRLDAPASVSRVVAVTQLSAFDRLSVVQQMRNQRVQAAQMAQVNAWTQDLKKGLA
Ga0167657_10065831  3300015079  Glacier Forefield Soil  LLPLFRAVLVWTVILAGVILPRLEQPVEVKYAVAVTQLSAYDRASVVQQTRTQRVQAAQMAQ
Ga0137418_109360501  3300015241  Vadose Zone Soil  LVPLFRAVLVWVVILVSVVLPRLDPPSQPSVSVAVTQLSAFDRNSVVQQARNQRIQAAQMAQVQSWTLELKQGLADYQAKQEAEAAAQAIAAQI
Ga0137412_104588212  3300015242  Vadose Zone Soil  VVWVVILAGVILPRMDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVSVWSQELKQCLADYQAQQDALA
Ga0137403_102622651  3300015264  Vadose Zone Soil  VVWVVILAGVILPRMDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQAAQMAQVSVWSQELKQGLADYQAQQDALAA
Ga0134089_100026251  3300015358  Grasslands Soil  LLPLFRAVLVWTLILVSVILPRLAPPAVHPIQVAVTQLSAFDRNSVVQQNRDQRVQAAQMAQVAR
Ga0134085_100319793  3300015359  Grasslands Soil  LLPLFRAVLVWTLILVSVILPRLAPPAVHPIQVAVTQLSAFDRNSVVQQNRDQRVQAAQMAQVA
Ga0132257_1014093152  3300015373  Arabidopsis Rhizosphere  LLPLFRAVLVWAVILVTLVLPRLDPPVQRSVAVAVTQLNAFDRNSVLQQNRTERVEAAQMAQAQTWSAELKQG
Ga0066662_103208131  3300018468  Grasslands Soil  VWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQSWSKELKQG
Ga0193735_11606041  3300020006  Soil  LLPLFRAVVVWVVILVGVILPRLDAPAAVTHAVAVTQLSAFDRLSVVQQTRNERVQATQMAQVNAWSQELKQGLADYQ
Ga0210384_101182875  3300021432  Soil  LVPLLRAVLVWAVILASVVLVRMDPPAAASNAVAVTQLSAYDRLSVVQQTRAQRVQAAQMAQVTAWAA
Ga0247666_10060643  3300024323  Soil  LVPLLRAVLVWAVILASVVLVRLDPPTSVANAVAVTQLSAYDRLSVVQQTRTQRVQAAQMAQVSAWAAELKQG
Ga0209840_10635701  3300026223  Soil  LLPLFRAILVWTVILASVILVRMDPPTAVTQAVAVTQLSAYDRLSVVQQTRNQRVQAAQMAQVDAWSAELKQGLAD
Ga0209350_10819861  3300026277  Grasslands Soil  LLPLFRAVLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLE
Ga0209470_10888362  3300026324  Soil  LLPLFRAVLVWVAILVTVILPRLDPPAQRSVAVAVTQLSAFDRNSVMQQTKNPQRVQAAQMAQVQSWSLELKQGLAEYQARQAALAAAQ
Ga0209801_12742152  3300026326  Soil  VLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEYQAKQVAEAAAQAQAEAIAARGNHPAPPPEIAR
Ga0209473_12467241  3300026330  Soil  LLPLFRAVLVWVALLVSVVLPRLDPPAMQPVQVAVTQLSAFDRNSVTQQAQSQRVQAARMAQVASWSSELKQGLADYQAKQQALAAAQA
Ga0209158_11694172  3300026333  Soil  VLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEYQAKQVAEAAAQAQAEAIAAR
Ga0209804_10365833  3300026335  Soil  VLVWVVILVTVILPRLAPPAQQSVTVAVSQLTAFDRDSAVQHARNQRVQAAQMAQVQSWSLELKQGLAEYQAKQV
Ga0209804_11331472  3300026335  Soil  VWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAQMAQVQS
Ga0209058_13008191  3300026536  Soil  VWSVILVSVILPRLDPPIPRSVAVAVTQLSAFDRNSVVQQNRNDRVEAAQMAQVQSWSAELKQGLAEYQAKQQALAAAQAYAAQIAA
Ga0209805_10393061  3300026542  Soil  LLPLFRAVLVWVALLVSVVLPRLDPPAMQPVQVAVTQLSAFDRNSVTQQAQSQRVQAARMAQVASWSSELKQGLADYQAKQQALAAAQ
Ga0209219_11583272  3300027565  Forest Soil  LVPLLRAILVWAVILASVVLVRMDPPAAVTNAVAVTQLSAYDRLSVVQQTRAQRVQAAQMAQVTAWAAELKQGLADYQAQ
Ga0208988_11555861  3300027633  Forest Soil  VILPRLDPPTTVKYAVAVTQLSAYDRTSVVQQTRTQRVQAAQMAQVDAWSQQLKQGLADYQAQLDALARVW
Ga0209117_10195474  3300027645  Forest Soil  LVPLFRAVLVWTVILVSVILPRLEPSMPIKYAVAVTQLSAYDKASVVQQTRTQRVQAAQMAQVNAWSQELKQGLADYQAKLDAIAAAE
Ga0209580_101851441  3300027842  Surface Soil  VLVWVVILVIVVLPRLSPPQQHSVAIAVTQLSAFDGSSVVQQNRTARVQAAQMAQVQSWSQELKQGLADYQAKQEAV
Ga0209166_100672922  3300027857  Surface Soil  LFPLFRAVLVWVVILTVVVVAKVDAPAARSATVAVTQVSAFDRLSVVQQAKTQRVQAAQMAQVQAWSAELQQ
Ga0209283_103929031  3300027875  Vadose Zone Soil  VWVVILASVILPRMAPPAQLTAAVAVTQLSAYDRNSVVQQTRDQRIQAAQAAQAQTWADQLKQGLAAY
Ga0247684_10597292  3300028138  Soil  LVPLLRAVLVWAVILASVVLVRLDPPTSVANAVAVTQLSAYDRLSVVQQTRTQRVQAAQM
Ga0257175_10307721  3300028673  Soil  LLPLFRAVLVWVVVLVSVILPRLSPPAQQTVSLAVTQVSAFDRNSVVQQAKDHRVQAAQMAQVQS
Ga0265461_139255121  3300030743  Soil  LGLVPLLRAVLVWAVILASVVLVRMDPPAGVTNAVAVTQLSAYDRLSVVQQTRTQRVQAAQMAQVAAWAKGKE
Ga0308194_103285461  3300031421  Soil  LLPLFRAVVVWVVILAGVILPRMDAPAAVAHAVAVTQLSAFDRLSVVQQTRNERVQATQMAQVNAWSQELKQGLADYQAQQDALA
Ga0307374_103356372  3300031670  Soil  LLPLFRAVVVWVVILVTVIAPRLVPPAQTYTAVAVTQLSSYDRLSVVNQTRTARVHAAQMAQSLAWSQELKQGLADYEAQQQALAAAQARAA
Ga0307473_113576582  3300031820  Hardwood Forest Soil  LLPLFRAVLVWVVILVSVVLPRLAPAPQQTLTAAVTTVTAFDRNSTVLDAKNQRVRAAQMAQVQS
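
Rows of the family sequence table can be exported to FASTA for use with alignment tools. This is a minimal sketch: the function name and the header layout (`taxon_oid=`, `habitat=`) are illustrative choices, not an NMPFamsDB format. It also assumes each row splits into a protein ID, a 10-digit sample taxon ID, a habitat and a sequence, which is how the two example rows are read here.

```python
def to_fasta(records, width=60):
    """Render (protein_id, taxon_oid, habitat, sequence) tuples as FASTA text."""
    lines = []
    for protein_id, taxon_oid, habitat, sequence in records:
        # Header layout is an illustrative choice, not a database convention.
        lines.append(f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}")
        residues = sequence.rstrip("*")  # drop a trailing stop-codon marker, if present
        # Wrap the residues at a fixed line width.
        for start in range(0, len(residues), width):
            lines.append(residues[start:start + width])
    return "\n".join(lines) + "\n"

# Two rows copied from the table above; the ID/taxon split is an assumption.
records = [
    ("Ga0066705_100070261", "3300005569", "Soil",
     "VWVVILVTVILPRLTPHQQQSVSVAVTQLSAFDRNSVVQQSVNQRVQAAQMAQVQ"),
    ("Ga0138112_10249602", "3300010905", "Grasslands Soil",
     "LLPLFRAVLVWVVILVSVVLPRLAPPAQQSVSIAVTQLSSFDRSSVVQQTKNQRVQAAHMAQVQSWSKELKQGLA*"),
]

print(to_fasta(records), end="")
```

Stripping the trailing `*` keeps the output limited to amino-acid letters, which some downstream tools expect.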




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice