NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F101952


Overview

Basic Information
Family ID F101952
Family Type Metagenome
Number of Sequences 102
Average Sequence Length 124 residues
Representative Sequence MKSILMMVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA
Number of Associated Samples 81
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 84.31 %
% of genes near scaffold ends (potentially truncated) 22.55 %
% of genes from short scaffolds (< 2000 bps) 70.59 %
Associated GOLD sequencing projects 71
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.60

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
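The percentages in the Quality Assessment block can be turned back into gene counts. The sketch below assumes that each of the 102 family members contributes exactly one gene, so the family size is the denominator; the page does not state this explicitly, and the helper name `implied_count` is hypothetical.

```python
def implied_count(percent: float, total: int) -> int:
    """Return the whole number of members that a reported percentage implies."""
    return round(percent / 100 * total)


# "Number of Sequences" from the Basic Information block.
# Assumption: one gene per family member, so this is the denominator.
FAMILY_SIZE = 102

# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 84.31,
    "near scaffold end (potentially truncated)": 22.55,
    "short scaffold (< 2000 bp)": 70.59,
}

counts = {label: implied_count(pct, FAMILY_SIZE) for label, pct in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to 86, 23 and 72 genes respectively.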

Most Common Taxonomy
Group Bacteria (70.588 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (26.471 % of family members)
Environment Ontology (ENVO) Unclassified (31.373 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (60.784 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 67.97%, β-sheet: 0.00%, Coil/Unstructured: 32.03%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.60

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
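The two confidence rules quoted in this section (the pTM cutoff of 0.7 and the four pLDDT bands) can be sketched as simple lookups. This is only an illustration: the function names are hypothetical, and assigning non-integer pLDDT values such as 50.5 to the next band up is an assumption, since the page lists integer ranges only.

```python
from bisect import bisect_left

# Cutoff quoted in the low-quality note: models with pTM < 0.7 are low confidence.
PTM_CUTOFF = 0.7

# Upper bounds and labels of the pLDDT bands listed in the structure legend.
PLDDT_UPPER_BOUNDS = [50, 70, 90, 100]
PLDDT_LABELS = ["0-50", "51-70", "71-90", "91-100"]


def is_low_confidence(ptm: float) -> bool:
    """Apply the page's rule: a model is low confidence when pTM < 0.7."""
    return ptm < PTM_CUTOFF


def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0 to 100) to its legend band."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must lie in [0, 100], got {score}")
    # bisect_left returns the index of the first upper bound >= score.
    return PLDDT_LABELS[bisect_left(PLDDT_UPPER_BOUNDS, score)]


print(is_low_confidence(0.60))  # pTM-score reported for this family
print(plddt_band(85))
```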



Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name                  % Frequency in 102 Family Scaffolds
PF11950  DUF3467               7.84
PF02276  CytoC_RC              6.86
PF13677  MotB_plug             4.90
PF01618  MotA_ExbB             3.92
PF01066  CDP-OH_P_transf       1.96
PF04014  MazE_antitoxin        1.96
PF01098  FTSW_RODA_SPOVE       0.98
PF00376  MerR                  0.98
PF00753  Lactamase_B           0.98
PF07804  HipA_C                0.98
PF16491  Peptidase_M48_N       0.98
PF01638  HxlR                  0.98
PF07995  GSDH                  0.98
PF01887  SAM_HAT_N             0.98
PF07748  Glyco_hydro_38C       0.98
PF05199  GMC_oxred_C           0.98
PF00588  SpoU_methylase        0.98
PF00440  TetR_N                0.98
PF07805  Obsolete Pfam Family  0.98
PF03551  PadR                  0.98
PF01979  Amidohydro_1          0.98
PF00873  ACR_tran              0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 102 Family Scaffolds
COG0558  Phosphatidylglycerophosphate synthase  Lipid transport and metabolism [I]  1.96
COG1183  Phosphatidylserine synthase  Lipid transport and metabolism [I]  1.96
COG1733  DNA-binding transcriptional regulator, HxlR family  Transcription [K]  1.96
COG5050  sn-1,2-diacylglycerol ethanolamine- and cholinephosphotransferases  Lipid transport and metabolism [I]  1.96
COG0219  tRNA(Leu) C34 or U34 (ribose-2'-O)-methylase TrmL, contains SPOUT domain  Translation, ribosomal structure and biogenesis [J]  0.98
COG0383  Alpha-mannosidase  Carbohydrate transport and metabolism [G]  0.98
COG0565  tRNA C32,U32 (ribose-2'-O)-methylase TrmJ or a related methyltransferase  Translation, ribosomal structure and biogenesis [J]  0.98
COG0566  tRNA G18 (ribose-2'-O)-methylase SpoU  Translation, ribosomal structure and biogenesis [J]  0.98
COG0772  Peptidoglycan polymerase FtsW/RodA/SpoVE  Cell cycle control, cell division, chromosome partitioning [D]  0.98
COG1695  DNA-binding transcriptional regulator, PadR family  Transcription [K]  0.98
COG1846  DNA-binding transcriptional regulator, MarR family  Transcription [K]  0.98
COG1912  Stereoselective (R,S)-S-adenosylmethionine hydrolase (adenosine-forming)  Defense mechanisms [V]  0.98
COG2133  Glucose/arabinose dehydrogenase, beta-propeller fold  Carbohydrate transport and metabolism [G]  0.98
COG2303  Choline dehydrogenase or related flavoprotein  Lipid transport and metabolism [I]  0.98
COG3550  Serine/threonine protein kinase HipA, toxin component of the HipAB toxin-antitoxin module  Signal transduction mechanisms [T]  0.98



Phylogeny

NCBI Taxonomy

Name  Rank  Taxonomy  Distribution
All Organisms  root  All Organisms  70.59 %
Unclassified  root  N/A  29.41 %

Associated Scaffolds


Scaffold  Taxonomy  Length (bp)
3300002906|JGI25614J43888_10093665  Not Available  807
3300002907|JGI25613J43889_10187066  Not Available  551
3300004479|Ga0062595_100044376  All Organisms → cellular organisms → Bacteria  1975
3300005167|Ga0066672_10315670  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1017
3300005167|Ga0066672_10677012  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  663
3300005175|Ga0066673_10158133  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1266
3300005186|Ga0066676_10864675  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  609
3300005332|Ga0066388_100064558  All Organisms → cellular organisms → Bacteria  3917
3300005332|Ga0066388_101340568  Not Available  1237
3300005332|Ga0066388_101458139  All Organisms → cellular organisms → Bacteria  1194
3300005332|Ga0066388_108583491  Not Available  508
3300005434|Ga0070709_10045198  All Organisms → cellular organisms → Bacteria → Proteobacteria  2731
3300005451|Ga0066681_10262373  All Organisms → cellular organisms → Bacteria → Acidobacteria  1050
3300005531|Ga0070738_10144964  All Organisms → cellular organisms → Bacteria  1170
3300005537|Ga0070730_10000085  All Organisms → cellular organisms → Bacteria  119447
3300005537|Ga0070730_10273074  All Organisms → cellular organisms → Bacteria  1112
3300005556|Ga0066707_10408754  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  883
3300005557|Ga0066704_10591911  Not Available  715
3300005559|Ga0066700_10580540  Not Available  782
3300005569|Ga0066705_10131064  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1522
3300005574|Ga0066694_10115153  All Organisms → cellular organisms → Bacteria → Acidobacteria  1263
3300005586|Ga0066691_10349050  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  876
3300005598|Ga0066706_10223942  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  1455
3300006163|Ga0070715_10532319  All Organisms → cellular organisms → Bacteria → Acidobacteria  678
3300006173|Ga0070716_100070067  All Organisms → cellular organisms → Bacteria → Proteobacteria  2057
3300006791|Ga0066653_10042052  Not Available  1864
3300006796|Ga0066665_10589536  Not Available  896
3300006797|Ga0066659_11304340  Not Available  606
3300006854|Ga0075425_100301219  All Organisms → cellular organisms → Bacteria  1845
3300006903|Ga0075426_10012353  All Organisms → cellular organisms → Bacteria  6073
3300006903|Ga0075426_10323887  All Organisms → cellular organisms → Bacteria  1129
3300006903|Ga0075426_11112959  Not Available  598
3300009012|Ga0066710_100082852  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis  4193
3300009012|Ga0066710_100866012  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1388
3300009137|Ga0066709_102415098  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  714
3300009792|Ga0126374_10097188  All Organisms → cellular organisms → Bacteria  1660
3300009792|Ga0126374_10794857  All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria  722
3300010043|Ga0126380_10002311  All Organisms → cellular organisms → Bacteria  7529
3300010043|Ga0126380_10049328  All Organisms → cellular organisms → Bacteria  2263
3300010043|Ga0126380_10282274  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1170
3300010043|Ga0126380_12054858  Not Available  524
3300010048|Ga0126373_13135357  Not Available  515
3300010159|Ga0099796_10181316  Not Available  845
3300010329|Ga0134111_10106271  All Organisms → cellular organisms → Bacteria → Acidobacteria  1081
3300010335|Ga0134063_10320876  Not Available  748
3300010358|Ga0126370_12035025  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  562
3300010359|Ga0126376_10000036  All Organisms → cellular organisms → Bacteria  54438
3300010359|Ga0126376_10007782  All Organisms → cellular organisms → Bacteria  6544
3300010360|Ga0126372_10018662  All Organisms → cellular organisms → Bacteria  4023
3300010360|Ga0126372_10839847  All Organisms → cellular organisms → Bacteria → Acidobacteria  914
3300010362|Ga0126377_10273948  All Organisms → cellular organisms → Bacteria  1653
3300010364|Ga0134066_10108007  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  818
3300010366|Ga0126379_12946998  Not Available  570
3300011271|Ga0137393_10441936  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1115
3300012200|Ga0137382_10835308  Not Available  664
3300012202|Ga0137363_10381049  All Organisms → cellular organisms → Bacteria  1172
3300012203|Ga0137399_11456694  Not Available  571
3300012207|Ga0137381_11245461  Not Available  637
3300012211|Ga0137377_10351572  Not Available  1411
3300012359|Ga0137385_10506501  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1021
3300012582|Ga0137358_10554323  Not Available  773
3300012685|Ga0137397_10830420  Not Available  685
3300012917|Ga0137395_10461281  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  913
3300012923|Ga0137359_10273764  Not Available  1505
3300012925|Ga0137419_10600082  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  884
3300012927|Ga0137416_11336995  Not Available  648
3300012929|Ga0137404_10150466  All Organisms → cellular organisms → Bacteria → Acidobacteria  1935
3300012929|Ga0137404_11150659  Not Available  713
3300012944|Ga0137410_10004730  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Acidobacterium  9274
3300012944|Ga0137410_10007177  All Organisms → cellular organisms → Bacteria  7571
3300015052|Ga0137411_1352726  All Organisms → cellular organisms → Bacteria → Acidobacteria  2025
3300015053|Ga0137405_1135203  All Organisms → cellular organisms → Bacteria  1258
3300015053|Ga0137405_1274817  All Organisms → cellular organisms → Bacteria → Acidobacteria  1834
3300015241|Ga0137418_10264131  All Organisms → cellular organisms → Bacteria  1454
3300015242|Ga0137412_10018686  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  5637
3300015264|Ga0137403_10001119  All Organisms → cellular organisms → Bacteria  33905
3300020170|Ga0179594_10413798  Not Available  512
3300020199|Ga0179592_10004660  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  5805
3300021478|Ga0210402_10079187  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  2919
3300021478|Ga0210402_10348492  All Organisms → cellular organisms → Bacteria  1374
3300024178|Ga0247694_1000012  All Organisms → cellular organisms → Bacteria  118720
3300024179|Ga0247695_1026497  All Organisms → cellular organisms → Bacteria  820
3300024182|Ga0247669_1000728  All Organisms → cellular organisms → Bacteria  11865
3300024182|Ga0247669_1003424  All Organisms → cellular organisms → Bacteria  3619
3300024246|Ga0247680_1010577  All Organisms → cellular organisms → Bacteria  1355
3300024279|Ga0247692_1077248  Not Available  523
3300024290|Ga0247667_1021175  All Organisms → cellular organisms → Bacteria  1256
3300024325|Ga0247678_1029856  Not Available  852
3300024331|Ga0247668_1004573  All Organisms → cellular organisms → Bacteria  2985
3300025905|Ga0207685_10845475  Not Available  506
3300025906|Ga0207699_10365880  All Organisms → cellular organisms → Bacteria  1021
3300025928|Ga0207700_10231030  All Organisms → cellular organisms → Bacteria  1573
3300025939|Ga0207665_10039641  All Organisms → cellular organisms → Bacteria → Acidobacteria  3141
3300026301|Ga0209238_1012927  All Organisms → cellular organisms → Bacteria  3245
3300026301|Ga0209238_1027574  All Organisms → cellular organisms → Bacteria  2126
3300026319|Ga0209647_1023135  All Organisms → cellular organisms → Bacteria → Acidobacteria  3796
3300026319|Ga0209647_1035362  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  2866
3300026557|Ga0179587_10036532  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  2760
3300027654|Ga0209799_1077239  Not Available  751
3300027857|Ga0209166_10000212  All Organisms → cellular organisms → Bacteria  75344
3300031754|Ga0307475_10690737  Not Available  814
3300032180|Ga0307471_100184640  All Organisms → cellular organisms → Bacteria → Acidobacteria  2062
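Each scaffold identifier above joins a numeric prefix and a scaffold name with a `|`; the prefixes match the Taxon OIDs listed under Associated Samples. A minimal sketch of splitting the two parts so that a scaffold can be linked back to its sample follows. The names `ScaffoldRef` and `parse_scaffold_id` are hypothetical and not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric sample identifier before the "|"
    scaffold_name: str  # scaffold name after the "|"


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two components."""
    taxon_oid, separator, scaffold_name = identifier.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300002906|JGI25614J43888_10093665")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key.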




Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  26.47%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil  13.73%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  13.73%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  10.78%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  8.82%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  6.86%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil  4.90%
Surface Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil  3.92%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  3.92%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  2.94%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  1.96%
Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil  0.98%
Soil  Environmental → Terrestrial → Soil → Loam → Grasslands → Soil  0.98%
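The habitat rows above are reported at the deepest level of the ecosystem path. A rough sketch of rolling them up to a shallower level, by summing the percentages that share the first `depth` path components, is shown below. The function name `aggregate_by_depth` is hypothetical, and the totals inherit the rounding of the two-decimal percentages on the page.

```python
from collections import defaultdict

# (ecosystem path, % of family members) copied from the habitat table above.
HABITATS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 26.47),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil", 13.73),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil", 13.73),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 10.78),
    ("Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil", 8.82),
    ("Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere", 6.86),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil", 4.90),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil", 3.92),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 3.92),
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil", 2.94),
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil", 1.96),
    ("Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil", 0.98),
    ("Environmental → Terrestrial → Soil → Loam → Grasslands → Soil", 0.98),
]

SEPARATOR = " → "


def aggregate_by_depth(rows, depth):
    """Sum percentages over rows that share the first `depth` path levels."""
    totals = defaultdict(float)
    for path, percent in rows:
        key = SEPARATOR.join(path.split(SEPARATOR)[:depth])
        totals[key] += percent
    return dict(totals)


by_root = aggregate_by_depth(HABITATS, depth=1)
for name, percent in sorted(by_root.items(), key=lambda item: -item[1]):
    print(f"{name}: {percent:.2f}%")
```

Raising `depth` to 4 separates the Loam rows from the Unclassified soil rows while keeping the two top-level branches distinct.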




Associated Samples

Taxon OID  Sample Name  Habitat Type
3300002906  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm  Environmental
3300002907  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm  Environmental
3300004479  Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs  Environmental
3300005167  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121  Environmental
3300005175  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122  Environmental
3300005186  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125  Environmental
3300005332  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)  Environmental
3300005434  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG  Environmental
3300005451  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130  Environmental
3300005531  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2  Environmental
3300005537  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1  Environmental
3300005556  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156  Environmental
3300005557  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153  Environmental
3300005559  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149  Environmental
3300005569  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154  Environmental
3300005574  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143  Environmental
3300005586  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140  Environmental
3300005598  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155  Environmental
3300006163  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG  Environmental
3300006173  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG  Environmental
3300006791  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102  Environmental
3300006796  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114  Environmental
3300006797  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108  Environmental
3300006854  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4  Host-Associated
3300006903  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5  Host-Associated
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009137  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158  Environmental
3300009792  Tropical forest soil microbial communities from Panama - MetaG Plot_12  Environmental
3300010043  Tropical forest soil microbial communities from Panama - MetaG Plot_26  Environmental
3300010048  Tropical forest soil microbial communities from Panama - MetaG Plot_11  Environmental
3300010159  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3  Environmental
3300010329  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010358  Tropical forest soil microbial communities from Panama - MetaG Plot_3  Environmental
3300010359  Tropical forest soil microbial communities from Panama - MetaG Plot_15  Environmental
3300010360  Tropical forest soil microbial communities from Panama - MetaG Plot_6  Environmental
3300010362  Tropical forest soil microbial communities from Panama - MetaG Plot_22  Environmental
3300010364  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015  Environmental
3300010366  Tropical forest soil microbial communities from Panama - MetaG Plot_24  Environmental
3300011271  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG  Environmental
3300012200  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG  Environmental
3300012202  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG  Environmental
3300012203  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG  Environmental
3300012207  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG  Environmental
3300012211  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG  Environmental
3300012359  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG  Environmental
3300012582  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG  Environmental
3300012685  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG  Environmental
3300012917  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG  Environmental
3300012923  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG  Environmental
3300012925  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)  Environmental
3300012927  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)  Environmental
3300012929  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)  Environmental
3300012944  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)  Environmental
3300015052  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)  Environmental
3300015053  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)  Environmental
3300015241  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)  Environmental
3300015242  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)  Environmental
3300015264  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)  Environmental
3300020170  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)  Environmental
3300020199  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)  Environmental
3300021478  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M  Environmental
3300024178  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK35  Environmental
3300024179  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK36  Environmental
3300024182  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10  Environmental
3300024246  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21  Environmental
3300024279  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK33  Environmental
3300024290  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK08  Environmental
3300024325  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK19  Environmental
3300024331  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09  Environmental
3300025905  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG (SPAdes)  Environmental
3300025906  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)  Environmental
3300025928  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes)  Environmental
3300025939  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)  Environmental
3300026301  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)  Environmental
3300026319  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes)  Environmental
3300026557  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal  Environmental
3300027654  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 MoBio (SPAdes)  Environmental
3300027857  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)  Environmental
3300031754  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515  Environmental
3300032180  Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515  Environmental


Family Sequences

Protein ID  Sample Taxon ID  Habitat  Sequence
JGI25614J43888_100936651  3300002906  Grasslands Soil  MKSILMMVSGVLCVAGGLLHGSGFWVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGIVAAFACLPDQKTA*
JGI25613J43889_101870661  3300002907  Grasslands Soil  MKSILMIVSGVLCVAGGLLHGSGFRVIHGEIAKANVVGDLAEVVDLAWIFMSMGFLTFGAILITCGLQMRKRNYGGKMFTGWAAGCLTIFSAGAMIRFGFNFHFLYFLIVGVVATFACLPDQKTA*
Ga0062595_1000443762  3300004479  Soil  MKSILMLISGGLCITGGLLHASGFPVIHGEIAKATATGHLADVVDLAWVFMSMGLLTFGAILITCGLQMRKRNYGGKAMAGWVALCLTLFSGGAMIRFGFNWHFLYFLIVGVTAALACLPQRTTS*
Ga0066672_103156702  3300005167  Soil  HGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066672_106770121  3300005167  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066673_101581332  3300005175  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFDFHFLYFLIVGVVAAFACVPDQKTA*
Ga0066676_108646751  3300005186  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAKVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWVAGCLTLFSAGAMIRFGFDFHFLYFLIVGV
Ga0066388_1000645584  3300005332  Tropical Forest Soil  MKSILMMVSGVLCVAGGLLHTSGFPVIHGEVAKAGVTGNLAEIVDLAWVFMSMGFLTFGAILITYGLQMRKKNYGGRALAGWATACLTLFSGGATIRFGFNWHFLYFLIVGLLAAYACIPDKRATG*
Ga0066388_1013405682  3300005332  Tropical Forest Soil  MKSILMLVSGIVCVAGGLLHSSGFRVIHGEVANAGITGDLAEIVDLAWVFMSMGFLTCGAILITCGVQMRKKNYGGRAFAGWVAAFLTVFSGWAAIRFGFNWHFLYFLFVGLLAAVACIPDKTAST*
Ga0066388_1014581393  3300005332  Tropical Forest Soil  MKSILMMVSGILCVAGGLLHASGLRVIHGEVAKANVTGDLAEVVDLAWVFMSMGFLTFGAILITGGLQMRKRNYGGRAMAGWTAACLTLFSGGAMIRFGYNHHFLYFLIVGVVAAFACLPDGKTA*
Ga0066388_1085834911  3300005332  Tropical Forest Soil  MKSILMLVSGIICVGGGLLHSTGFRVIHGEVAKAGITGDLAEIVDLAWVFMSMGFLTCGAILITCGIQMRRKNYGGRAFGGWVAAFLMVFSGWATVRFGFNWHFLYFLIVGLLAGYASIADKTASA
Ga0070709_100451981  3300005434  Corn, Switchgrass And Miscanthus Rhizosphere  MKSILILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA*
Ga0066681_102623731  3300005451  Soil  MKSILMMVGGVLCVAGGLLHGSGFSVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFNAGAMIRFGFDFHFLYFLIVGVAAAFACVPDQKTA*
Ga0070738_101449642  3300005531  Surface Soil  MKSILMLISGVLCLGGGLLHGSGIRIIHGEAVKANVDAHLTTVIDLAWVFMSMSILAFAAILITCGLQMRKKNYGGRSSAAWAAGCLTLFSGGSIIHFGYDSHFLFFLIVGVLAAFACIPDKAAA*
Ga0070730_1000008565  3300005537  Surface Soil  MKSILMLVSGVLCTGGGLLHGSGLLVIHGEVAKANVTGDLADIVDLAWVFMSMGFLTFGAILITCGIQMRKKNYAGRVFAGWAAACLTLFSAGAMIRFGFNLHFLYFLIVGVVAGLACLPHQKTA*
Ga0070730_102730741  3300005537  Surface Soil  MNSILMMASGVLCVAGGLLHASGFPIIHEEIAKANVAGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKKNYGGKAMAGWAAACLTMFSGGAMIRFGFNWHFLYFLIVGLLAAYACVPDKQATA*
Ga0066707_104087542  3300005556  Soil  GSALYSSPSVNVLTRNEEDEMKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066704_105919111  3300005557  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGIQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066700_105805402  3300005559  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAKVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066705_101310642  3300005569  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066694_101151534  3300005574  Soil  MKSILMMVGGVLCVAGGLLHGSGFSVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWVAGCLTLFSAGAMIRFGFDFHFLYFLIVGVVAAFACLPDQKTA*
Ga0066691_103490502  3300005586  Soil  MKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVSVVAAFACVPDQKTA*
Ga0066706_102239422  3300005598  Soil  MKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0070715_105323192  3300006163  Corn, Switchgrass And Miscanthus Rhizosphere  LILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA*
Ga0070716_1000700673  3300006173  Corn, Switchgrass And Miscanthus Rhizosphere  MMVSGVLCVAGGLLHGSGFWVIHGELAKTGVTGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRASVGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPD
Ga0066653_100420522  3300006791  Soil  MMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAKVVDLAWVFMSMGFVTFGAIWITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA
Ga0066665_105895361  3300006796  Soil  MKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFDSHFLYFLIVGVVAAFACVPDQKTA*
Ga0066659_113043401  3300006797  Soil  MKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGIQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVV
Ga0075425_1003012193  3300006854  Populus Rhizosphere  MKSILMMVSGALCVAGGLLHGSGFWVIHREIAKANVIGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKKNYGGRASGGWAASCLTLFSGGAMIRFGFNGHFLYFLIVGLLAAYACVADKKASA*
Ga0075426_100123532  3300006903  Populus Rhizosphere  MKSILMLVSGVLCVAGGLLHGSGFWVIHGEFAKIGVTGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRASVGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGLLAAYACVPDKQTA*
Ga0075426_1032388713300006903Populus RhizosphereLYSSLVVEWKRERGWLAMKSILILLSGVLCVGGGLLHGSGFWVIHREFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA*
Ga0075426_1111295913300006903Populus RhizosphereMKSILMMVSGALCVAGGLLHGSGFWVIHREIAKANVIGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKKNYGGRASGGWAAACLTLFSGGAMIRFGFNGHFLYFLIVGLLAAYACVADKKASA*
Ga0066710_10008285283300009012Grasslands SoilEEDEMKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAKVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA
Ga0066710_10086601223300009012Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWVAGCLTLFSAGAMIRFGFDFHFLYFLIVGVAAAFACVPDQKTA
Ga0066709_10241509823300009137Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFSVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA
Ga0126374_1009718823300009792Tropical Forest SoilMKSILMLVSGILCVGGGLLHASGFRVILGEVAKAGVTGELAEVVDLSWVFMSMGFLTCGAILITYGLQMRKKNYGGRVFGGWVAAFLAVFSGWAMVRFGFNWHFLYFLIVGLLAAYGCIPDKRAIA*
Ga0126374_1079485723300009792Tropical Forest SoilMKSILMLLSGVICLGGGLLHASGFRVIHGEVAKAGVTGDLAEIVDLAWVFMSMGFLTCGAMLITCGLQMRRKNYGGRTFAGWVAAFLMVFTGWAMIRFGFNWHFLYFLVVGLLAAYASISEKQTVS*
Ga0126380_1000231123300010043Tropical Forest SoilMKSILMLVSGVVCAAGGLLHSSGFRVIHGEIAKAGVTGNLAEVVDLAWVFMSMGILTFGAILITCGLQMRRKNYGGRAFAGWAAACLALFTGWGMIRFGFNWHFLYFLIVGLLAAYACIPDKRATA*
Ga0126380_1004932823300010043Tropical Forest SoilMKSILMLLSGVICLAGGLLHASGFRLIHGEVAKAGVTGDLAEIVDLAWVFMSMGFLTFGAILITCGLQMRKKNYGGRSFGGWAAAFLTVFSGWATIRFGFNWHSLYFLIVGLLAAFACIPEKRASV*
Ga0126380_1028227423300010043Tropical Forest SoilMKSILMLVSGVLCVAGGLLHTSGFPVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTYGAILITCGLQMRKKNFGGRALAGWAALSLTLFSGGAMIRFGFNWHFLYFLVVGVLAAISCLPEKQTAS*
Ga0126380_1205485813300010043Tropical Forest SoilMKSILMLVSGILCVGGGLLHASGFRIIHGETAKAGVTGDLAEIVDLAWVFMSLGFLTCGAILITYGLQMRKKNYGGRVFGGWVAAFLTVFSGWAMVRFGFNWHFLYFLIVGLLAAYACMPDRRAIA*
Ga0126373_1313535713300010048Tropical Forest SoilMKSILMLVSGVVCFAGGLLHASGFRVIHGEVAKAGITGELAEIVDLAWVFMSMGFLSFGTILITCGLQMRKKNYGGRAFGGWAAAFFIVFTGWAMIRFGFNWHFLYFLIVGLLAAYAIIPDQTAPACGP*
Ga0099796_1018131613300010159Vadose Zone SoilMKSILMMVGGALCVAGGLLHGSGFPVIHGEIAKANVPGDLANVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACVPDQKTA*
Ga0134111_1010627133300010329Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFSVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVA
Ga0134063_1032087623300010335Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFSVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWVAGCLTLFSAGAMIRFGFDFHFLYFLIVGVAAAFACVPDQKTA*
Ga0126370_1203502523300010358Tropical Forest SoilVGGGLLHASGFRVILGEVAKAGVTGELAEVVDLSWVFMSMGFLTFGAILITYGLQMRKKNYGGRALAGWATACLTLFSGGATIRFGFNWHFLYFLIVGLLAAYACIPDKRATG*
Ga0126376_10000036103300010359Tropical Forest SoilMRMMMKSILMMVSGVLCVAGGLLHTSGFPVIHGEVAKAGVTGNLAEIVDLAWVFMSMGFLTFGAILITYGLQMRKKNYGGRALAGWATACLTLFSGGATIRFGFNWHFLYFLIVGLLAAYACIPDKRATG*
Ga0126376_1000778223300010359Tropical Forest SoilMKSILMLVSGILCVGGGLLHASGFRIIHGEAAKAGVTGDLAEIVDLAWVFMSLGFLTCGAILITYGLQMRKKNYGGRVFGGWVAAFLTVFSGWAMVRFGFNWHFLYFLIVGLLAAYACMPDRRAIA*
Ga0126372_1001866243300010360Tropical Forest SoilMVSGVLCVAGGLLHTSGFPVIHGEVAKAGVTGNLAEIVDLAWVFMSMGFLTFGAILITYGLQMRKKNYGGRALAGWATACLTLFSGGATIRFGFNWHFLYFLIVGLLAAYACIPDKRATG
Ga0126372_1083984723300010360Tropical Forest SoilMKSILMLISGIICVGGGLLHASGFRVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTCGAILITCGLQMRRKKYGGRAFSAWVAAFLMAFTGWAMIRFGFNWHFLYFLIVGLLAAFACIPDKRASA*
Ga0126377_1027394823300010362Tropical Forest SoilMKSILMLVCGILCVAGGLLHASGFRVIHGEVAKAGVTGELAEVVDLAWVFMSMGFLTFGIILITCGLQMRKTNYGGRAFGGWVAAGLTLFSGWAMIRFGFNWHFLYFLIVGLLAAYACIPDKKASAQ*
Ga0134066_1010800723300010364Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFNAGAMIRFGFDFHFLYFLIVGVAAAFACVPDQKTA*
Ga0126379_1294699823300010366Tropical Forest SoilMRMMMKSILMMVSGVLCVAGGLLHTSGFPVIHGEVAKAGVTGNLAEIVDLAWVFMSMGFLTFGAILITYGLQMRKKNYGGRALAGWATACLTLFSGGATIRFGFNWHFLYFLIVGLLA
Ga0137393_1044193623300011271Vadose Zone SoilMKSILMMVGGVLCAAGGLLHGSGFLVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRERNYGGKIFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137382_1083530813300012200Vadose Zone SoilMKRILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGIVAAFACLPDQKTA*
Ga0137363_1038104923300012202Vadose Zone SoilMKSILMMVSGVLCVAGGLLHGSGFRVIHGEIAKANVVGDLAEVVDLAWIFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACLPEQKAA*
Ga0137399_1145669413300012203Vadose Zone SoilMKSILMMVGGALCVAGGLLHGSGFPVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACVPDQKTA*
Ga0137381_1124546113300012207Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGLRVVHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYRGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137377_1035157213300012211Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGLRVVHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137385_1050650123300012359Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFLVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYRGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137358_1055432323300012582Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACVPEQKAA*
Ga0137397_1083042013300012685Vadose Zone SoilMKGILMMMSGVLCVAGGLLHGSGFWVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMKRKNYGGRAFGGWAAACLTIFSGGAMIRFGFNWHFLYFLI
Ga0137395_1046128123300012917Vadose Zone SoilVIHGEIAKANVVGDLAEVVDLAWIFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137359_1027376423300012923Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACVPEQKAA*
Ga0137419_1060008213300012925Vadose Zone SoilMKSILMMVGGVLRVAGGLLHGSGFWVIHGEIAKVGVTGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137416_1133699513300012927Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137404_1015046633300012929Vadose Zone SoilMKSILMMVGGVLCVAGGLLHGSGFLVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAGFACLPDRKTA*
Ga0137404_1115065923300012929Vadose Zone SoilMKSILMMVSGALCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFVAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACLPEQKAA*
Ga0137410_1000473063300012944Vadose Zone SoilMKGILMMMSGVLCVAGGLLHGSGFWVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMKRKNYGGRAFGGWAAACLTIFSGGAMIRFGFNWHFLYFLIVGVLAAYACVPDKQTA*
Ga0137410_1000717793300012944Vadose Zone SoilMKSILMLVSGVLCIGGGLLHGSGLRVVHGEVVKANITGNLANIVDLAWVFMSMGFLTFGAILITCGLQMRKKNYEGRVFAGWAAACLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTT*
Ga0137411_135272643300015052Vadose Zone SoilMKSILMLVSGVLCIGGGLLHGSGLRVVHGEVVKANITGNLANIVDLAWVFMSMGFLTFGAILITCGLQMRKKNYEGRVFAGWAAACLTLFSAGAMIRFGFNFHFLYFLIV
Ga0137405_113520323300015053Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACLPEQKAA*
Ga0137405_127481743300015053Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFH
Ga0137418_1026413113300015241Vadose Zone SoilMKSILMMVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANIVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA*
Ga0137412_1001868633300015242Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACVPEQKAA*
Ga0137403_10001119283300015264Vadose Zone SoilMKSILMMVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACLPEQKAA*
Ga0179594_1041379813300020170Vadose Zone SoilMIVSGVLCVAGGLLHGSGFPVIHGEIAKANVTGDLANVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVA
Ga0179592_1000466063300020199Vadose Zone SoilMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACVPEQKAA
Ga0210402_1007918743300021478SoilMKSILMLVSGVLCIAGGLLHGSGLRVIHGEMVKASASGDLTRVLDLAWVFMSMAILTFGAILLTYGLQMRKKSYGGRAPAAWVAACLTLFSGGAIILLGYNSHFLYFLIVGVVAAFACVPDQKAA
Ga0210402_1034849223300021478SoilMKSILMLVSGVLCIGGGLLHESGFRIVHGEVAKANVMGNLANIVDLAWVFMSMGFLTFGAILITCGLQMRKKNYGGRVFAGWAAACLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTAEA
Ga0247694_1000012413300024178SoilMKSILMLISGGLCITGGLLHASGFPVIHGEIAKATATGHLADVVDLAWVFMSMGLLTFGAILITCGLQMRKRNYGGKAMAGWVAVCLTLFSGGAMIRFGFNWHFLYFLIVGVTAALACLPDKRTTS
Ga0247695_102649723300024179SoilMKSILILLSGVLCVGGGLLHGSGFWVIHREFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0247669_1000728153300024182SoilVEWKRERGWLAMKSILILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0247669_100342443300024182SoilMLISGGLCITGGLLHASGFPVIHGEIAKATATGHLADVVDLAWVFMSMGLLTFGAILITCGLQMRKRNYGGKAMAGWVALCLTLFSGGAMIRFGFNWHFLYFLIVGVTAALACLPQRTTS
Ga0247680_101057723300024246SoilLYSSLVVEWKRERGWLAMKSILILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0247692_107724813300024279SoilLMLVSGVLCVGGGLLHGSGFWVIHGEFAKAGVTGDLAEIGDLAWVFMSMGFLTFGAILVTCGIQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGLLTASACVPDKQTAQ
Ga0247667_102117523300024290SoilGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0247678_102985613300024325SoilMKSILMMVSGVLCVAGGLLHGSGFWVIHGELAKTGVTGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLALFSGWAMIRFGFNWHFLYFLIVGLLAASACVPDKQTA
Ga0247668_100457343300024331SoilMKSILMLISGGLCITGGLLHASGFPVIHGEIAKATATGHLADVVDLAWVFMSMGLLTFGAILITCGLQMRKRNYGGKAMAGWVALCLTLFSGGAMIRFGFNWHFLYFLIVGVTAAFACLPQRTTS
Ga0207685_1084547513300025905Corn, Switchgrass And Miscanthus RhizosphereKSILILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0207699_1036588023300025906Corn, Switchgrass And Miscanthus RhizosphereEWKRERGWLAMKSILILLSGVLCVGGGLLHGSGFWVIHGEFAKTGATGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0207700_1023103023300025928Corn, Switchgrass And Miscanthus RhizosphereMKSILMMVSGVLCVAGGLLHGSGFWVIHGELAKTGVTGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRAFAGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGVLAGYACVPDKQTA
Ga0207665_1003964133300025939Corn, Switchgrass And Miscanthus RhizosphereMMVSGVLCVAGGLLHGSGFWVIHGELAKTGVTGELAEIVDLAWVFMSMGFLTFGAILITCGLQMRRKNYGGRASVGWAAASLTLFSGWAMIRFGFNWHFLYFLIVGLLAAYACVPDKQTA
Ga0209238_101292723300026301Grasslands SoilMKSILMMVSGVLCAAGGLLHGSGFPVIHREIAKANVTGDLANIVDLAWVFMSMGFLTFGAILITGGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGVVAAFACLPDQKTA
Ga0209238_102757413300026301Grasslands SoilMKSILMMVGGVLCVAGGLLHGSGFPVIHGEIAKANVIGNLAEVVDLAWVFMSMGFVTFGAILITCGLQMRRKNYGGRAMAGWAAGCLTLFSAGAMIRFGFDFHFLYFLIVGVVA
Ga0209647_102313543300026319Grasslands SoilMKSILMMVSGVLCVAGGLLHGSGFWVIHGEIAKAGVTGDLAEVVDLAWVFMSMGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSAGAMIRFGFNFHFLYFLIVGIVAAFACLPDQKTA
Ga0209647_103536223300026319Grasslands SoilMKSILMIVSGVLCVAGGLLHGSGFRVIHGEIAKANVVGDLAEVVDLAWIFMSMGFLTFGAILITCGLQMRKRNYGGKMFTGWAAGCLTIFSAGAMIRFGFNFHFLYFLIVGVVATFACLPDQKTA
Ga0179587_1003653243300026557Vadose Zone SoilVRIRNEEDEMKSILMIVSGVLCVAGGLLHGSGFPVIHGEVAKANVTGDLANVVDLAWVFMSLGFLTFGAILITCGLQMRKRNYGGKMFAGWAAGCLTLFSTGAMIRFGFNFHFLYFLIVGVVAAFACVPEQKAA
Ga0209799_107723913300027654Tropical Forest SoilMKSILMLLSGVICLGGGLLHASGFRVIHGEVAKAGVTGDLAEIVDLAWVFMSMGFLTCGAMLITCGLQMRRKNYGGRTFAGWVAAFLMVFTGWAMIRFGFNWHSLYFLIVGLLAAFACIPEKRASV
Ga0209166_10000212103300027857Surface SoilMKSILMLVSGVLCTGGGLLHGSGLLVIHGEVAKANVTGDLADIVDLAWVFMSMGFLTFGAILITCGIQMRKKNYAGRVFAGWAAACLTLFSAGAMIRFGFNLHFLYFLIVGVVAGLACLPHQKTA
Ga0307475_1069073713300031754Hardwood Forest SoilMGGSLYSSATVEWDYEREGAMKSILMIVSGVLCVAGGLLHMSGFWVIHGEVAKAGVTGDLAEVIDLAWVFMSMGFLTFGAILITCWLQMRKKNYGGRAMAGWAAACLTLFSGGAMIRFGFNWHFLYFLIVGVVAALASLPGERTTL
Ga0307471_10018464023300032180Hardwood Forest SoilMKSILMMVCGVFCIVGGMLHGSGFRIIHGEVAKANVTGDLADIVDLTWVFMSMGFLTFGAMLITCGLQMRKKNYGGRALASWAAGCLTLFSGGAMIRFGFNWHFLYFLIVGLAAAYACFPDRRTAS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice