NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F087629

Metagenome Family F087629

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F087629
Family Type Metagenome
Number of Sequences 110
Average Sequence Length 110 residues
Representative Sequence LLVSVPTGEHDDQGWQVVRPPEDWVALFERAGFLVYEDELYVRGADGWRTASLAEARGARYNAGGPGAGAVLLAELHPGTVGEKVRLAVRDVRHRDVVRRSTRVT
Number of Associated Samples 96
Number of Associated Scaffolds 110

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 2.73 %
% of genes near scaffold ends (potentially truncated) 97.27 %
% of genes from short scaffolds (< 2000 bps) 95.45 %
Associated GOLD sequencing projects 92
AlphaFold2 3D model prediction Yes
3D model pTM-score0.74

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (73.636 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(10.000 % of family members)
Environment Ontology (ENVO) Unclassified
(22.727 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.727 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 21.80%    β-sheet: 27.07%    Coil/Unstructured: 51.13%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.74
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
c.66.1.41: UbiE/COQ5-liked2p7ia_2p7i0.54723
c.66.1.15: Arylamine N-methyltransferased2g72a_2g720.53048
b.2.3.4: Fibrinogen-binding domaind1n67a21n670.52485
b.2.3.0: automated matchesd2okma_2okm0.51499
c.66.1.15: Arylamine N-methyltransferased2a14a12a140.5111


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 110 Family Scaffolds
PF05977MFS_3 7.27
PF13535ATP-grasp_4 4.55
PF07690MFS_1 3.64
PF02786CPSase_L_D2 3.64
PF02222ATP-grasp 2.73
PF01071GARS_A 0.91
PF13946DUF4214 0.91
PF00571CBS 0.91
PF01850PIN 0.91
PF07021MetW 0.91
PF12704MacB_PCD 0.91

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 110 Family Scaffolds
COG2814Predicted arabinose efflux permease AraJ, MFS familyCarbohydrate transport and metabolism [G] 7.27
COG0458Carbamoylphosphate synthase large subunitAmino acid transport and metabolism [E] 1.82
COG0026Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)Nucleotide transport and metabolism [F] 0.91
COG0027Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)Nucleotide transport and metabolism [F] 0.91
COG0045Succinyl-CoA synthetase, beta subunitEnergy production and conversion [C] 0.91
COG0151Phosphoribosylamine-glycine ligaseNucleotide transport and metabolism [F] 0.91
COG0439Biotin carboxylaseLipid transport and metabolism [I] 0.91
COG1038Pyruvate carboxylaseEnergy production and conversion [C] 0.91
COG1181D-alanine-D-alanine ligase or related ATP-grasp enzymeCell wall/membrane/envelope biogenesis [M] 0.91
COG4770Acetyl/propionyl-CoA carboxylase, alpha subunitLipid transport and metabolism [I] 0.91


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A73.64 %
All OrganismsrootAll Organisms26.36 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300001535|A3PFW1_10724090All Organisms → cellular organisms → Archaea → Euryarchaeota1208Open in IMG/M
3300005175|Ga0066673_10488679Not Available725Open in IMG/M
3300005176|Ga0066679_10363516Not Available947Open in IMG/M
3300005176|Ga0066679_11041034All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium508Open in IMG/M
3300005336|Ga0070680_100719985Not Available859Open in IMG/M
3300005339|Ga0070660_101541464Not Available565Open in IMG/M
3300005435|Ga0070714_102076298Not Available554Open in IMG/M
3300005435|Ga0070714_102172000Not Available541Open in IMG/M
3300005437|Ga0070710_11434832Not Available517Open in IMG/M
3300005440|Ga0070705_101783515Not Available522Open in IMG/M
3300005445|Ga0070708_100751274Not Available917Open in IMG/M
3300005451|Ga0066681_10294244All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → unclassified Thermoleophilia → Thermoleophilia bacterium992Open in IMG/M
3300005454|Ga0066687_10318247All Organisms → cellular organisms → Bacteria885Open in IMG/M
3300005454|Ga0066687_10672949Not Available615Open in IMG/M
3300005458|Ga0070681_10163171All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales2152Open in IMG/M
3300005471|Ga0070698_101751739Not Available574Open in IMG/M
3300005537|Ga0070730_10379782Not Available916Open in IMG/M
3300005539|Ga0068853_100459602Not Available1198Open in IMG/M
3300005552|Ga0066701_10761760Not Available578Open in IMG/M
3300005561|Ga0066699_10158308Not Available1551Open in IMG/M
3300005587|Ga0066654_10902942Not Available507Open in IMG/M
3300005713|Ga0066905_101346534Not Available644Open in IMG/M
3300005764|Ga0066903_106450923All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium612Open in IMG/M
3300005764|Ga0066903_108512070Not Available523Open in IMG/M
3300005885|Ga0075284_1017887All Organisms → cellular organisms → Bacteria855Open in IMG/M
3300005898|Ga0075276_10086826Not Available680Open in IMG/M
3300005900|Ga0075272_1001724All Organisms → cellular organisms → Bacteria4776Open in IMG/M
3300006057|Ga0075026_100960251Not Available529Open in IMG/M
3300006173|Ga0070716_101289461Not Available590Open in IMG/M
3300006755|Ga0079222_10489506All Organisms → cellular organisms → Bacteria896Open in IMG/M
3300006804|Ga0079221_10874037Not Available656Open in IMG/M
3300006806|Ga0079220_11162502Not Available632Open in IMG/M
3300006903|Ga0075426_10465272Not Available937Open in IMG/M
3300006914|Ga0075436_100655553Not Available776Open in IMG/M
3300009012|Ga0066710_101242129All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1155Open in IMG/M
3300009090|Ga0099827_11005144All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium723Open in IMG/M
3300009174|Ga0105241_10710682All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium918Open in IMG/M
3300009551|Ga0105238_12648717Not Available538Open in IMG/M
3300010329|Ga0134111_10464475All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria551Open in IMG/M
3300010373|Ga0134128_11694704Not Available696Open in IMG/M
3300010373|Ga0134128_11734256Not Available687Open in IMG/M
3300010379|Ga0136449_101015244Not Available1333Open in IMG/M
3300010396|Ga0134126_10283720Not Available1949Open in IMG/M
3300010398|Ga0126383_12609770Not Available589Open in IMG/M
3300011270|Ga0137391_11528588Not Available512Open in IMG/M
3300012004|Ga0120134_1092861Not Available560Open in IMG/M
3300012202|Ga0137363_10972205All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium721Open in IMG/M
3300012532|Ga0137373_10553591All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria872Open in IMG/M
3300012923|Ga0137359_10069416All Organisms → cellular organisms → Bacteria3080Open in IMG/M
3300012929|Ga0137404_11876864Not Available558Open in IMG/M
3300012930|Ga0137407_10759964Not Available914Open in IMG/M
3300012960|Ga0164301_11165312All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium617Open in IMG/M
3300012971|Ga0126369_12982092Not Available554Open in IMG/M
3300012986|Ga0164304_11835844Not Available507Open in IMG/M
3300012987|Ga0164307_10967755Not Available690Open in IMG/M
3300013105|Ga0157369_11463068Not Available695Open in IMG/M
3300013307|Ga0157372_12366594Not Available610Open in IMG/M
3300015245|Ga0137409_10801723Not Available777Open in IMG/M
3300017939|Ga0187775_10405748Not Available564Open in IMG/M
3300017944|Ga0187786_10551196Not Available533Open in IMG/M
3300017959|Ga0187779_10386405Not Available910Open in IMG/M
3300017959|Ga0187779_11335362Not Available509Open in IMG/M
3300017961|Ga0187778_10849663All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria625Open in IMG/M
3300018032|Ga0187788_10179742All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria811Open in IMG/M
3300018032|Ga0187788_10288263Not Available662Open in IMG/M
3300018064|Ga0187773_11193388Not Available511Open in IMG/M
3300018468|Ga0066662_12868772Not Available512Open in IMG/M
3300019873|Ga0193700_1003768All Organisms → cellular organisms → Bacteria2266Open in IMG/M
3300020059|Ga0193745_1119980Not Available547Open in IMG/M
3300021363|Ga0193699_10482480Not Available507Open in IMG/M
3300021478|Ga0210402_10953163Not Available785Open in IMG/M
3300025505|Ga0207929_1109518Not Available514Open in IMG/M
3300025906|Ga0207699_10393892Not Available985Open in IMG/M
3300025921|Ga0207652_11411503Not Available600Open in IMG/M
3300025921|Ga0207652_11421665Not Available597Open in IMG/M
3300025929|Ga0207664_10295511Not Available1424Open in IMG/M
3300025929|Ga0207664_11852704Not Available526Open in IMG/M
3300025939|Ga0207665_10311736All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1179Open in IMG/M
3300025944|Ga0207661_10956109Not Available789Open in IMG/M
3300026109|Ga0208774_1048366Not Available649Open in IMG/M
3300027826|Ga0209060_10582836Not Available506Open in IMG/M
3300027857|Ga0209166_10703973Not Available508Open in IMG/M
3300027894|Ga0209068_10541643Not Available674Open in IMG/M
3300028653|Ga0265323_10291671Not Available505Open in IMG/M
3300028718|Ga0307307_10115091Not Available827Open in IMG/M
3300028768|Ga0307280_10302260Not Available584Open in IMG/M
3300028802|Ga0307503_10517239Not Available646Open in IMG/M
3300028807|Ga0307305_10124014All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1194Open in IMG/M
(restricted) 3300031248|Ga0255312_1118272Not Available652Open in IMG/M
3300031251|Ga0265327_10514212Not Available515Open in IMG/M
3300031716|Ga0310813_11861591Not Available566Open in IMG/M
3300031754|Ga0307475_10988784Not Available662Open in IMG/M
3300031820|Ga0307473_10094585Not Available1573Open in IMG/M
3300031938|Ga0308175_100275141All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1703Open in IMG/M
3300031938|Ga0308175_102632442Not Available563Open in IMG/M
3300031938|Ga0308175_103252910Not Available503Open in IMG/M
3300031962|Ga0307479_10519637Not Available1174Open in IMG/M
3300031996|Ga0308176_11850642Not Available644Open in IMG/M
3300032010|Ga0318569_10487111Not Available575Open in IMG/M
3300032133|Ga0316583_10049881All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria1474Open in IMG/M
3300032180|Ga0307471_102701556Not Available630Open in IMG/M
3300032180|Ga0307471_103983260All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium522Open in IMG/M
3300032261|Ga0306920_102417568Not Available725Open in IMG/M
3300032828|Ga0335080_10829336Not Available954Open in IMG/M
3300032828|Ga0335080_11670318All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium625Open in IMG/M
3300032892|Ga0335081_11738498Not Available678Open in IMG/M
3300032892|Ga0335081_12162038All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium587Open in IMG/M
3300032955|Ga0335076_11402625All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium584Open in IMG/M
3300033233|Ga0334722_10036312All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia4012Open in IMG/M
3300033806|Ga0314865_112891Not Available716Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil10.00%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil7.27%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil7.27%
Tropical PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland7.27%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere6.36%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.55%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil4.55%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere4.55%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil3.64%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil3.64%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil3.64%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.73%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.73%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil2.73%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil2.73%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.82%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.82%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost1.82%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil1.82%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.82%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.82%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.82%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.82%
RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere1.82%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment0.91%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil0.91%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil0.91%
Arctic Peat SoilEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil0.91%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.91%
PeatlandEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Peatland0.91%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.91%
Sandy SoilEnvironmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil0.91%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.91%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere0.91%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.91%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
3300001535Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A3-PF-15A)- 1 week illuminaEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005339Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-3 metaGHost-AssociatedOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005437Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005454Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136EnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005539Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C3-2Host-AssociatedOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005713Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300005885Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_401EnvironmentalOpen in IMG/M
3300005898Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_80N_405EnvironmentalOpen in IMG/M
3300005900Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_0N_404EnvironmentalOpen in IMG/M
3300006057Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012EnvironmentalOpen in IMG/M
3300006173Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaGEnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006804Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200EnvironmentalOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009090Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaGEnvironmentalOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009551Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaGHost-AssociatedOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010373Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4EnvironmentalOpen in IMG/M
3300010379Sb_50d combined assemblyEnvironmentalOpen in IMG/M
3300010396Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2EnvironmentalOpen in IMG/M
3300010398Tropical forest soil microbial communities from Panama - MetaG Plot_35EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300012004Permafrost microbial communities from Nunavut, Canada - A30_5cm_6MEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300012987Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MGEnvironmentalOpen in IMG/M
3300013105Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017944Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0815_BV2_10_20_MGEnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018064Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300019873Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? U3s1EnvironmentalOpen in IMG/M
3300020059Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L1a2EnvironmentalOpen in IMG/M
3300021363Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3c2EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300025505Arctic peat soil from Barrow, Alaska - NGEE Surface sample 53-1 deep-072012 (SPAdes)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025921Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025939Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025944Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.1-3L metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026109Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_80N_405 (SPAdes)EnvironmentalOpen in IMG/M
3300027826Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300028653Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-12-25 metaGHost-AssociatedOpen in IMG/M
3300028718Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_194EnvironmentalOpen in IMG/M
3300028768Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_119EnvironmentalOpen in IMG/M
3300028802Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 17_SEnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031251Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-16-21 metaGHost-AssociatedOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031938Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R1EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300031996Soil microbial communities from UC Gill Tract Community Farm, Albany, California, United States - DLSLS.C.R2EnvironmentalOpen in IMG/M
3300032010Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22EnvironmentalOpen in IMG/M
3300032133Rhizosphere microbial communities from salt marsh grasses in Alabama, United States - J_170502JBrBrAHost-AssociatedOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300032828Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.4EnvironmentalOpen in IMG/M
3300032892Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5EnvironmentalOpen in IMG/M
3300032955Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_2.5EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033806Tropical peat soil microbial communities from peatlands in Loreto, Peru - MAQ_50_20EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
A3PFW1_1072409023300001535PermafrostVSVPTGERDDQGWQLQRTPEDWIGVFERAGFVIFEDELYVRDSDGWRTATLAEAGAARYRDDDGAGAVLLAELRPASVGTKLRLAVRDVRHRDAARRSTQ*
Ga0066673_1048867913300005175SoilLRRVLAKDGRLLVSVPTGEHDDQGWQLQRTPGDWIAVFERAGFLVYEDELYVHATDGWRSATHEEAEGARYGGGGPGAGAVLVAELHPESVGGKLRLAVRDARHGDSPRRSTRK*
Ga0066679_1036351623300005176SoilLIVSVPTGQHDDQGWQVQRTPEGWIAVFERSGFLVYEDELYLRAANGWRRASLAEVGAARYGSGGPGAGAVLLAELHPDSFAGKMRLAVRDVRHRDDVRRSTVA*
Ga0066679_1104103413300005176SoilDEAALRELHRVLGRDGRLLVSVPTGERDDQGWQLQRTPDDWIVVFERAGFLVFEDELYVRGPDGWRTGTLAQAGSARYGANGPGADAVLLAELRPARVAEKLRLAVRDVRHRDAVRRSTAGQ*
Ga0070680_10071998523300005336Corn RhizosphereGERDDQGWQLQRTPEDWIAVFEGAGFVVYEDELYLHTTEGWRTATLDEARGARYGEHTAGAVLVAELHQSSAGEKLRLAVRDIRHRGEIRRSTAA*
Ga0070660_10154146423300005339Corn RhizosphereDQGWQLQRTPEDWVAVFEGAGFVVYEDELYVHTPEGWRTATLDEARGARYGEHTAGAVLLAELHPSSAGEKLRLAVRDVRHRGEIRRSTSA*
Ga0070714_10207629823300005435Agricultural SoilKDGRLIVSVPTGVHDDQGWQLVREPSDWVEQFERAGFVVYEDELYVHTDEGWRTATLDEARAARYGEHSAGAVLLAELRPGTVGEKLRLAVRDVRYRDDIRRSTVP*
Ga0070714_10217200023300005435Agricultural SoilVASRVLVSVPVGVADDQGRQIIRPPLEWIELFERCGFLVYEDELYVRGDDGWRTASVAEANAARYREDGAGAVLLAELHAAGLVERARLAVRDVRHRDTARRSTTA*
Ga0070710_1143483213300005437Corn, Switchgrass And Miscanthus RhizosphereDNSAYGVDATRDEDGQKAALRELHRVLARYGRLLVTVPTGKRDDQGWQLQRAPGEWISLFERSGFLVFEDELYVHTEDGWRTATLAEAEAARYGEPGPGAGAVLLAELPPKRIGERVRLAVRDARHGDEPRRST*
Ga0070705_10178351513300005440Corn, Switchgrass And Miscanthus RhizospherePTGTHDDQGWQVVRTPEEWIELFERAGFLVYEDELYVRAPEGWRTASLDEARAAPYREDGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTVVS*
Ga0070708_10075127413300005445Corn, Switchgrass And Miscanthus RhizosphereGVDDDQGWQIVRPPLAWIELFERSGFLVYEDELYVCADDGWRTATTAEADAARYRDDRASAVLLAELRPARVAERVRLAVRDVRHRDEPRRSTSA*
Ga0066681_1029424413300005451SoilRVLARDGRLLVSVPTGRREDHGWHLQRPPLEWIELFERNGFVVFEDELYKHGLDGWRTASLAEAEAARYGSTGPGAGAVLLAELRPSRLSEKIRLAVRDRRLPDEPRRSTAT*
Ga0066687_1031824723300005454SoilEHVGRDNAVYGVENAHDDEGHATALRELHRVLARTGRLLFSVPTGVHADHGWHLQRAPLEWIELFEQNGFVVFEDELYLHTADGWRTATLTEAEAARYESGGPGAGAVLLAELRPRRLSEKIRLAVRDVRRPDEPRRSTAAKASEQAVSNPATTRKNTRSET*
Ga0066687_1067294913300005454SoilQGWQLQREPLAWIGVFERCGFLVYEDELYVRRDDGWRTATLEEAEHATYAENGAGAVLLAELRPGTVGEKLRLAVRDARHRDVVRRSTRVA*
Ga0070681_1016317133300005458Corn RhizosphereLAKDGRLLVTVPTGAGVDQGWQVVRTPEQWIERFERSGFVVYEDELYVRDGDGWRTASLQEARDAAYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTLS*
Ga0070698_10175173923300005471Corn, Switchgrass And Miscanthus RhizosphereLLVSVPTGVRDDQGWQLQRPPEEWIDVFDSAGFFVFEDELYVYRDGWRTATLDEARAARYEGPGAGAVLLAELRPETFGGKLRLAVRDIRHRDEVRRSTVP*
Ga0070730_1037978213300005537Surface SoilDQGWQIVRAPADWIALFERCGFLVYEDELYVHEADGWRTATLADASRARYGGAGPGAGAVLLAELHPDTLGGKMRLAVRDVRHRDEIRRSTVA*
Ga0068853_10045960213300005539Corn RhizosphereQGWQLVRPPADWIGLFERAGFIVYEDELYARDGTGWHAATLTEASAARYEGPGAGAVLLAELHPRTLGSKLRLAVRDVRYRDDVRRSTVA*
Ga0066701_1076176023300005552SoilGWQVVRPPEEWIALFERAGFLVYEDELYHRDADGWRTASLDEARAARYAGNAAGAVLLAELHPGSLGGKLRLAVRDVRHADDVRRSTAP*
Ga0066699_1015830813300005561SoilGRLLVSVPTGEHDDQGWQLQRTPGDWIAVFERAGFLVYEDELYVHAADGWRSATHEEAEGARYGGGGPGAGAVLLAELHPESVGGKLRLAVRDARHGDAPRRSTRK*
Ga0066654_1090294213300005587SoilRVLARDGRLLVSVPTGVRDDQGWQLQRSPEDWVAVFERAGFLVFEDELYVRGADGWRTGSLAEARAAHYGAGGPGAGAVLLAELRPARVAEKVRLAVRDVRHRDDVRRSTAGQ*
Ga0066905_10134653423300005713Tropical Forest SoilYAVDAPRTADGDAAALRELRRVASRVLVSVPTGIHDDQGWQLQRTPDEWIELFERAGFLVFEDELYVRNDMRWRTATLAEARDARYRDCAGAVLLAELRPDTFGGKLRLAVRDVRHRDAPRRSTSAQ*
Ga0066903_10645092313300005764Tropical Forest SoilLHRVLAKDGRLLVSVPTGEHDDQGWQLQRTPEDWVSVFEGAGFLVYEDELYVHTDAGWRSASLAEARAARYGDASAGAVLLAELHPDTLGGKLRLAVRDVRHRDDVRRSTAP*
Ga0066903_10851207013300005764Tropical Forest SoilDNAVYHVDTAREDAGEERALRELHRVASRVLVTVPVGFPDDQGWQVVRAPLEWISIFERAGFVVFEDELYVHADDGWRTADLAEAEAARYGEPGPGAGAVLLAELRSRRLSERVRLAVRDARHSGEPRRSTRQ*
Ga0075284_101788723300005885Rice Paddy SoilLEHVGRDNDVYAVDAPRERSGDEAALHELRRVLAPEGRLLVSVPTGVEDDQGWQVQRPPEDWIARFERAGFIVFEDELYVSAGDGWRSGSLDEARAARYGEPGPGAGAVLLAELRPRRLGERVKLAIRDARHRDMPRRSTV*
Ga0075276_1008682613300005898Rice Paddy SoilGRDNDVYAVDAPRERSGDEAALHELRRVLAPEGRLLVSVPTGVEDDQGWQVQRPPEDWIARFERAGFIVFEDELYVSAGDGWRSGSLDEARAARYGEPGPGAGAVLLAELRPRRLGERVKLAIRDARHRDMPRRSTV*
Ga0075272_100172413300005900Rice Paddy SoilGWQLVRAPLEWIELFERSGFVVFEDELYLHTTQGWRTATLAEAEAAHYGHPGPGAGAVLLAELRPDRLTEKIRLAVRDARHSNEPRRSTAH*
Ga0075026_10096025123300006057WatershedsLRRVLTKDGRLLISVPTGEHEDQKWQIVRTPADWIGLFERCGFLVYEDELYVHEAGGWRTATPADADRARYGATGPGAGAVLLAELHPDTFGGKMRLAVRDVRHRDEIRRSTVA*
Ga0070716_10128946113300006173Corn, Switchgrass And Miscanthus RhizosphereHGDEAALRELRRVAGRVIVSVPVGERDDQGWQIIRPPLEWIELFERCGFLVYEDELYVRGDDGWRTATVADANAARYREDGAGAVLLAELHAAGLAERARLAVRDVRHRDTARRSTTA*
Ga0079222_1048950623300006755Agricultural SoilRRVLAPEGKLLVSVPTGVADDQGWQVQREPLAWTARFEEAGFLVYEDELYVRREDGWRTATLAEAAAARYGDHGAGAVLLAELRPDTFGEKLRLAVRDVRHRDVARRSTRPA*
Ga0079221_1087403713300006804Agricultural SoilVGRDNTRYDIDAARDEDGDLVALRELRRVSERVIVSVPTGVHDDQGWQLVREPSDWVEQFERAGFVVYEDELYVHTDEGWRTATLDEARAARYGEHSAGAVLLAELRPGTVGGKLRLAVRDVRYRDDIRRSTVP*
Ga0079220_1116250223300006806Agricultural SoilRVLAGDGRLLVTVPTGAAEDQGWQVVRPPDEWIALFERTGFLVYEDELYVRDPSGWRTASSAEASSAGYGESSAGAVLLAELHAGSLAEKLRLGVRDVRYRDEIRRSTLAAR*
Ga0075426_1046527223300006903Populus RhizosphereELHRVLARDGRLLVTVPTGERDDQGWQLQRAPHEWISLFEQSGFLVFEDELYLHTEDGWRTATVAEAATASYGEPGPGAGAVLLAELRPKRIGERVRLAVRDARHGDEPRRST*
Ga0075436_10065555323300006914Populus RhizosphereGQAEDQGWQVVRTPEQWIALFERSGYLVYEDELYVRTADGWRTGSVAEAGETASVDSGAGAALLAELRPGSLGGKLRLAVRDVRYRDDIRRSTLA*
Ga0066710_10124212913300009012Grasslands SoilHDDQGWQLQRTPDDWVAVFERAGFLVYEDELYVRSADGWRSATLEEARDARYGEGGPGAGAVLLAELHPETVGGKLRLAVRDVRHGDAPRRNTRK
Ga0099827_1100514413300009090Vadose Zone SoilTALRELRRVASRVLVSVPTGVHDDQGWQLQRTPDDWIDLFERAGFLVFEDELYVRGEDGWRSGTLAEARRSCYDSCAGAVLLAELRPGSVAGKLRLAVRDVRHRDAPRRSTTAQ*
Ga0105241_1071068223300009174Corn RhizosphereRELHRVLTKDGRLLVTVPTGAAVDQGWQVVRMPEQWIERFERSGFVVYEDELYVRDGDGWRTASLQEARDAAYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTLS*
Ga0105238_1264871713300009551Corn RhizosphereLHRVLTKDGRLLVTVPTGAAVDQGWQVVRMPEQWIERFERSGFVVYEDELYVRDGDGWRTASLQEAGDAAYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTMS*
Ga0134111_1046447523300010329Grasslands SoilLRELHRVLAKDGRLLVTVPTGVPENHGWQVQRAPEEWVDVFEQAGFLVFEDELYVYGEGWGSATPDEARRARYEGPGAGAVLLAELRPGSVGGKVRLAVRDVRHRNEVRRTTVSRNQ*
Ga0134128_1169470423300010373Terrestrial SoilWQVQREPLAWTARFEEAGFLVYEDELYVRREDGWRTATLAQAAAARYGDHGAGAVLLAELRPDSFGEKLRLAVRDVRHRDVARRSTRPA*
Ga0134128_1173425613300010373Terrestrial SoilLLVTVPTGAAVDQGWQVVRMPEQWIERFERSGFVVYEDELYVRDGDGWRTASLQEARDAKYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTVVS*
Ga0136449_10101524423300010379Peatlands SoilDEAALRELHRVLTGEGRLLVSVPTGEHDDQGWQAQRTVDDWVALFERSGFLVYEDELYLHGEDGWRSATPAEVRGVAYGSAGPGAGAVLLAELRPRTIGARLRLVIRDAKHRDEPRRSTTLL*
Ga0134126_1028372033300010396Terrestrial SoilVLAKDGRLLVSMPTGEHDDQGWQVVRPPADWVALFERAGFLVYEDELYVRGGVGWRTGSLAEASGARYNAGGPGAGAVLLAELHPDSFGGKIRLAVRDVRHRDEVRRSTVA*
Ga0126383_1260977023300010398Tropical Forest SoilRLIVSVPTGEPEDQDWQVVRAPAEWIGRFERSGFLVFEDELYARTASGWRSATPAEAGAARYGDGGPGAGAVLLAELRPGTIGGKVRLAVRDVRHRDVRRSTAP*
Ga0137391_1152858823300011270Vadose Zone SoilSQDGDEAALRELHRVLARGGRLLVSVPTGERDDQGWQLQRTPEDWVAVFERAGFLVFEDELYLRDADGWRATTLARARAARYGASGPGADAVLLAELRPSRATEKLRLAVRDVRHRDAVRRSTAGQ*
Ga0120134_109286123300012004PermafrostVPTGEHDDQGWQIQRTPEEWIAVFERAGFIVYEDELYLRAANGWRRASLAEARTARYGSGGPGAGAVLLAELHPDSFGGKMRLAMRDVRHRDEVRRSTVA*
Ga0137363_1097220513300012202Vadose Zone SoilRSGRLLVSVPTGAPEDQGWQIIRTPEDWIAVFERSGFLVYEDELYVRGDDGWRTASLDEARAASYGTGGPGAGAVLLAELHPESVGGKLRLAVRDVRHRDEVRRSTVT*
Ga0137373_1055359113300012532Vadose Zone SoilRDDQGWQLQRPPEEWVDLFERAGFFIFEDELYVYGDGWRTATLDEAREARYEGPGAGAVLLAELRPDTLGGKLRLAVRDVRHRDDVRRSTVP*
Ga0137359_1006941613300012923Vadose Zone SoilDNAVYEVDAAREEHGDEAALRELHRVAARVLVSVPVGETDDQGWQIVRPALEWIELFERCGFVVYEDELYVCNDEGWRTASVAEASVARYREDRAGAVLLAELHPAGLGERVRLAVRDVRHRKVARRSTLAGT*
Ga0137404_1187686413300012929Vadose Zone SoilLLVSVPTGAPEDQGWQIIRTPEDWIAVFERSGFLVYEDELYVRGDDGWRTASLDEARAASYGTGGPGAGAVLLAELHPESVGGKLRLAVRDVRHRDEIRRSTVT*
Ga0137407_1075996413300012930Vadose Zone SoilAALRELHRVLAKDGRLIISVPTGEPEDQGWQIVRTPEDWIALFERCGFLVYEDELYVHEADGWRTATPADASRARYGAAGPGAGAVLLAELHPDTFGGKMRLAVRDVRHRDEIRRSTIA*
Ga0164301_1116531223300012960SoilLIVSVPTGEHDDQGWQLQRTPDDWIAVFERAGFLVYEDELYVRSADGWRSATLEEARDARYGESGPGAGAVLLAELHPETIGGKVRLVVRDVRHGDAPRRNTK*
Ga0126369_1298209223300012971Tropical Forest SoilYGVDSSRDESGDVNALRELHRVLAKDGRLLVSVPTGEHDDQGWQLQRTPDDWIAVFERAGFLVYEDELYVHSDAGWRSASLAETQAVRYGESSAGAVLLAELHPDTLGGKLRLAVRDVRHRDDVRRSTAP*
Ga0164304_1183584413300012986SoilVVRTPEEWIELFERAGFLVYEDELYVRAPEGWRTASLDEARAAPYREDGAGAVLLAELRPGSFGGKVRLAVRDVRHAGDIRRSTLS*
Ga0164307_1096775513300012987SoilVVALGELHRVLTADGRLLVSVPTGVLDDQGWQIVRPPEDWIALFEQTGFIVFEDELYVRADGGWRTASLDEARDAAYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGDIRRSTLS*
Ga0157369_1146306823300013105Corn RhizosphereVLAKDGRLLVSVPTGERDDQGWQLQRTPEDWIAVFERAGFVVYEDELYLRTPDGWRTATLAEACAARYGEHTAGAVLLAELHPSSAGEKLRLAVRDVRHRGEIRRSTAA*
Ga0157372_1236659413300013307Corn RhizosphereSDGRLLVSVPTGVREDQGWQLQREPLEWIELFESCGFLVYEDELYVRGDDGWRTATLAEAQAARYAENGRCAGAGAVLLAELRPGTVGEKLRLAVRDVRHRDVARRSTRLP*
Ga0137409_1080172323300015245Vadose Zone SoilVSVPTGEHDDQGWQLQRTPAEWIAVFERAGFVVYEDELYVHDDAGWRNGSLAEASAARYREDGAGAVLLAELHPASIGEKL
Ga0187775_1040574813300017939Tropical PeatlandIAAARGEGADDAALSELHRVLAREGRLLVSVPTGEHDDQEWQLQRTPLEWVGLFERSGFVVFEDELYVHGADGWRTATLAEAESARYGCSGPGAGAVLLAELLPKRIGERIRLAVRDVRHADEPRRSTARH
Ga0187786_1055119623300017944Tropical PeatlandVSVPTGEHDDQEWQLQRTPLEWIGLFERCGFVVFEDELYVHGVDGWRTATLAEAESARYGASGPGAGAVLLAELRPRRIGERIRLAVRDTRFPDEPRRSTTRQ
Ga0187779_1038640523300017959Tropical PeatlandALHELHRVLTPGGRLLVSVPTGAAEDQGWQVQRTPDAWVAAFERAGFVVFEDELYLHTPDGWRSATPDEVRDVGYGVAGPGAGAVLLAELHPRTLGERVRLAVRDDRHRGEPRRSTRPS
Ga0187779_1133536213300017959Tropical PeatlandAVDAPRDDAGDEAALRELHRVLAGDGRLLVSVPVGEPDDQGWQVQRTPEEWLERFERAGFLVFEDELYVHAADGWRGAALDEVRGVRYGDGGPGAGAVLLAELHPRTLGVRVRVAVRDLLHHDEPRRSTRMR
Ga0187778_1084966313300017961Tropical PeatlandGDEAALRELHRVLAPGGRLLVSVPTGADDDQGWQVQRPPAEWTARFERSGYLVYEDELYLHGDDGWRAATLAEAEDAAYGAAGPGAGAILLAELRPRTILERIRLVLRDAKHRDTPRRSTVEA
Ga0187788_1017974223300018032Tropical PeatlandAALRELHRVLTPRGRLLVTVPTGAEDDQGWQVQRTPKEWIARFERSGFLVFEDELYLHGADGWRTATLAEVESVPYGATGPGAGAILLAELRPRTILERIRLVVRDARHRDTPRRSTAAH
Ga0187788_1028826323300018032Tropical PeatlandGRDNEVYAVEAPREEAGDEAALHELRRVITGDGRLVVTVPTGEHDDQGWQVQRTPEDWVSLFERAGFLVFEDELYVRGEDGWRSADLEQARRARYGEGGPGAGAILLAELRPRRLGERVKLAVRDARHRDAPRRSTLAS
Ga0187773_1119338823300018064Tropical PeatlandGGEGADDAALRELHRVLAREGRLLVSVPTGEHDDQEWQLQRTPLEWVDLFERCGFLVFEDELYVHGTDGWRTATLVEAETARYGTAGPGAGAVLLAELRPRHVGERIRLAVRDARFPDEPRRSTTRH
Ga0066662_1286877213300018468Grasslands SoilPLEWIELFERNGFVVFEDELYLHAADGWRTASLPEAQAARYESSGPGAGAVLLAELRPARLGEKIRLAVRDVRLRGEPRRSTAP
Ga0193700_100376843300019873SoilTGVPDDQGWQVVLAPEQWIERFERAGFVVYEDELYVRGDDGWRTATLGEARAAPYLENGAGAVLLAELRPGSFGGKVRLAVRDARHAGEIRRSTLA
Ga0193745_111998013300020059SoilVYDVDAPRDDEGDEAALGELRRVLDKNGRLLISVPTGVADDQGWQVVRTPEQWIERFERVGFVVYEDELYVRRDDGWRTATLDEARAARYLENGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTLA
Ga0193699_1048248013300021363SoilWQLVRTPDDWITVFERAGFVVYEDELYVCETEGWRSATVAEASDARYGEHGAGAVLLAELHPSSVGEKLRLAVRDVRYRDEPRRSTLG
Ga0210402_1095316323300021478SoilDNEIYDVDAPRDDTGDEAALRELHRVLSGEGRLLVSVPTGAHEDQGWQVQRTVDDWVGLFERCGFLVYEDELYLHGEDGWRSATPDEVRGVAYGSTGPGAGAVLLAELRPRTIGARLRLVVRDAKHRDEPRRSTTLL
Ga0207929_110951823300025505Arctic Peat SoilERDDAGDEAALRELHRVLTKDGRLLVSVPTGEHDDQGWQIQRTPEEWIEVFERAGFIVYEDELYLRAANGWRRASLAEARTARYGSGGPGAGAILLAELHPDSFGGKMRLAMRDVRHRDEVRRSTVA
Ga0207699_1039389223300025906Corn, Switchgrass And Miscanthus RhizosphereLLVSVPTGEHDDQGWQVVRPPEDWVALFERAGFLVYEDELYVRGADGWRTASLAEARGARYNAGGPGAGAVLLAELHPGTVGEKVRLAVRDVRHRDVVRRSTRVT
Ga0207652_1141150313300025921Corn RhizosphereSDGDLAALRELRRVGGRVLVTVPTGVRDDQGWQLQRAPEEWIALFERSGFTVFEDELYVRAGDGWRTATLGEAREARYAPDGAGAVLLAELRPDTVGEKLRLAVRDARHRDVVRRSTRVP
Ga0207652_1142166523300025921Corn RhizosphereVVRMPEQWIERFERSGFVVYEDELYVRDGDGWRTASLQEARDAAYLEGGAGAVLLAELRPGSFGGKVRLAVRDVRHAGEIRRSTLS
Ga0207664_1029551123300025929Agricultural SoilLLVTVPTGKRDDQGWQLQRAPGEWISLFERSGFLVFEDELYVHTEDGWRTATLAEAEAARYGEPGPGAGAVLLAELRPKRIGERVRLAVRDARHGDEPRRST
Ga0207664_1185270413300025929Agricultural SoilDDSGDEAALRELHRVLAKDGRLLVSVPTGEHDDQGWQVVRPPEDWVALFERAGFLVYEDELYVRGDDGWRTGSLAEASGARYNAGGPGAGAVLLAELHPESFGGKIRLAVRDVRHRDEVRRSTVA
Ga0207665_1031173613300025939Corn, Switchgrass And Miscanthus RhizospherePTGVDEDQGWQVQRSPLAWVERFERAGFLVFEDELYLRTADGWRTATLAEAEAARYGEPGPGAGAVLPAELRPKRIGEPVRLAVRDARHGDEPRRST
Ga0207661_1095610923300025944Corn RhizosphereGVRDDQGWQLQREPLDWIDVFERAGFAVFEDELYVRDGESWHTATLDAARQATYGEHGAGAVLLAELRPGTVGGKLRLAVRDVRHRDVVRRSTRVA
Ga0208774_104836613300026109Rice Paddy SoilAPRERSGDEAALHELRRVLAPEGRLLVSVPTGVEDDQGWQVQRPPEDWIARFERAGFIVFEDELYVSARDGWRSGSLDEARAARYGEPGPGAGAVLLAELRPRRLGERVKLAIRDARHRDMPRRSTV
Ga0209060_1058283613300027826Surface SoilGWQVVRRPEEWVDLFERAGFLVFEDELYARGEEGWRAVDAAAAGAARYGEGGPGAGAVLLAELRPRRLAERVRLAVRDARHRDAPRRSV
Ga0209166_1070397313300027857Surface SoilIEVMIRAHAVHPELHRVLTKDGRLLISVPTGEHEDQGWQIVRTPADWIALFERCGFLVYEDELYVHEADGWRTATLADASRARYGGAGPGAGAVLLAELHPDTLGGKMRLAVRDVRHRDEIRRSTVA
Ga0209068_1054164323300027894WatershedsMGSLPGHSSGRVLISVPTGEPEDQGWQVQRTPEEWLERFERCGFLVFEDELYTRGEDGWRMATLAEARGKRLGDGGPGAGAVLLAELRPARLGERLRLAVRDARHRDAPRRSTLT
Ga0265323_1029167123300028653RhizosphereAGDGRLLVSVPAGEPEDQGWQVQRTPQEWVERFERAGFLVFEDELYARGEDGWRTATLADAAGKRLGEGGPGAGAILLAELRPARIGERLRLAVRDARHRDAPRRSTLT
Ga0307307_1011509113300028718SoilQVVRAPEEWIERFERAGFVVYEDELYVRGDDGWRTATLGEARTAPYLENGAGAVLLAELRPGSFGGKVRLAVRDARHAGEIRRSTLA
Ga0307280_1030226023300028768SoilKDGRLLVSVPTGEHDDQGWQIQRTPEEWIAVFERAGFIVYEDELYLRAANGWRRASLAEARTARYGSGGPGAGAVLLAELHPDSFGGKMRLAMRDVRHRDEVRRSTVA
Ga0307503_1051723913300028802SoilGRLLVSVPTGERDDQGWQLQRAPEEWIVLFQRAGFVVFEDELYLRDDAEGWRSATLAEVGTAEYRDDGAGAVLLAELRPSTVGTKLRLAVRDVRHREAPRRSTLG
Ga0307305_1012401423300028807SoilIVSVPTGQHDDQGWQLQRTPDDWVAVFERAGFLVYEDELYVRSADGWRSATLEEAQGARYGVGGPGAGAVLLAELHAETVGGKLRLVVRDVRHGDAPRRSTRK
(restricted) Ga0255312_111827213300031248Sandy SoilLDDQGWQIVRPPEEWVELFERAGFIVFEDELYLRSDDGWRSASLAEARAARYLENAAGAVLLAELRPGSFGGKMRLAVRDVRHAGDIRRSTLS
Ga0265327_1051421213300031251RhizosphereGRLLLSVPTGVRDDQGWQVQRAPLEWIGLFEDAGFVVYEDELYLHTDDGWRTATLAEAESARYGSSGPGAGAVLLAELRPARLSEKVRLAVRDLRLPDEPRRSTAAKASAQTVSPAAIATNTTRNDT
Ga0310813_1186159113300031716SoilLLVTVPTGTRDDQGWQIVRTPEDWVELYERAGFLVYEDELYVRGEDGWRTATLAQAAAARYGDHGAGAVLLAELRPDSFGEKLRLAVRDVRHRDVARRSTRPA
Ga0307475_1098878413300031754Hardwood Forest SoilGVAEDQGWQIVRPPADWIELFERSGFVVFEDELYVRGADGWHSATVAEAEAARYGSSRQGAGAVLLAELRPSRPSEKLRLAVRDRRYPGEPRRST
Ga0307473_1009458513300031820Hardwood Forest SoilNSAYGVDATRDEDGQEAALRELHRVLARDGRLLVTVPTGKRDDQGWQLQRAPGEWISLFDRSGFLVFEDELYVHTGDGWRTATLAEAETARYGEPGPGAGAVLLAELRPKRIGERVRLAVRDARHGDEPRRST
Ga0308175_10027514123300031938SoilGRVLVTVPTGVHDDQGWQVQREPLAWIELFERCGFLVYEDELYVRHDDGWRTATLHEAERATYEEHGAGAVLLAELRHGTVGEKLRLAVRDVRHREVVRRSTRVAQ
Ga0308175_10263244223300031938SoilQHAADGDAAALRELRRVSSRVIVSVPTGERDDQGWQLQRTPDDWIAVFEAAGFVVYEDELYIHAPDGWRTATLDEARSARYGEHTAGAVLLAELHPSSAGAKLRLAVRDVRHRGEIRRSTSA
Ga0308175_10325291023300031938SoilQGWQLQRAPLEWIELFERSGFTVYEDELYVHDDAGWRTATRAEAEGARYGEHHAGAVLLAELRRRTVGETLRLAVRDARHRDVVRRSTLVP
Ga0307479_1051963713300031962Hardwood Forest SoilLARDGRLFVTVPTGERDDQGWQLQRPPREWIALFERSGFIVFEDELYVHSEDGWRSATLAEADAARYGEPGPGAGAVLLAELRPTRIGERVRLAVRDARHGDEPRRST
Ga0308176_1185064213300031996SoilRDNSGYAVDAARDDAGDLAALRELRRVADRVLVTVPTGVRDDQGWQLQRAPLEWIALFERAGFTVFEDELYVRDDAGWRTATLAEASARRYEESSAGAVLLAELRPGTFGGKLRLAVRDARHGNVVRRSTRVP
Ga0318569_1048711113300032010SoilLLSVPTGKPDDQGWQVQRSPQEWVERFERAGFLVFEDELYGRSDQGWRSVDAATAALTKYGDGGPGAGAVLVAELRPARLSERIRLAVRDARHREVARRSV
Ga0316583_1004988123300032133RhizosphereGWQLQRPPQDWIALFERCGFLVFEDELYVRGEEGWRSAGLAAASAARYGGDDSCPGAGAVLVAELRPRSVGERLRLALRDARHRDAPRRSTLA
Ga0307471_10270155623300032180Hardwood Forest SoilDQGWQIVRTPTDWVALFERCGYLVYEDELYVHETGGWRTATPADASRARYGAAGPGAGAVLLAELHPDTFGGKMRLAVRDVRHRDEIRRSTVA
Ga0307471_10398326013300032180Hardwood Forest SoilVYAIDAARDEGGDEAALRELHRVLANDGRLLISVPTGEHEDQGWQIVRTPEEWVALFERAGLLVYEDELYVRGEDGWRTGTLAEARAARYNAGGPGAGAVLLAELHPDTFGGKIRLAVRDVRHRDEVRRTTVA
Ga0306920_10241756813300032261SoilVAAAAHAARVDRLFERCGFVVFEDELYVHGADGWRSAALAEAETTRYGASGPGAGAVLLAELRPRRVGERIRLAVRDARFPDEPRRSTTRH
Ga0335080_1082933623300032828SoilESDDQGWQVQRPVDDWIALFERCGFLVYEDELYLHSEDGWRSATPPEVRGVGYGAGGPGAGAVLLAELRPRTIGARLRLVVRDAKHRDEPRRSTTLL
Ga0335080_1167031823300032828SoilAGDEAALHELRRVLAPGGRLLVSVPTGAAEDQGWQVQRTPDEWVAGFERAGFVVFEDELYLHAPDGWRSATPAEVGDIAYGAAGPGAGAVLVAELHPRTLGERVRLAVRDVRHRGDPRRSTRPS
Ga0335081_1173849823300032892SoilTDLDWQNQRAPLEWIELFERTGFVVFEDELYVRTTGGWRSATLSEAEPARYGAGGPGAGAVLLAELRPARLRERVRLALRDTRRPDEPRRSTAH
Ga0335081_1216203823300032892SoilEVGDETALHELHRVLAHDGRLLVSMPTGERDDQGWQLQRPPLEWVDLFERCGFVVFEDELYVHDAGGWRSATLAEAETARYGEGGAGAGAVLLAELRPQRLSEKLRLAVRDARHTDEPRRSTRQ
Ga0335076_1140262513300032955SoilAALHELRRVLAPGGRLLVSVPTGAAEDQGWQVQRTPDEWVAGFERAGFVVFEDELYLHAPDGWRSATPAEVGDIAYGAAGPGAGAVLVAELHPRTLGERVRLAVRDVRHRGDPRRSTRPS
Ga0334722_1003631213300033233SedimentQGWQVLRPPLEWIELFERCGLLVFEDELYVRGEDGWRSGSLAEAGAARYGTSAGAVLLAELHPTGLGERVRLAVRDVRHRDVPRRSTLAAS
Ga0314865_112891_394_7143300033806PeatlandLLVTVPTGAEDDQGWQVQRPPGEWIARFERSGFLVFEDELYLHDVEGWRTATPAQVEGVPYGTPGPGAGAILLAELRPRTILERIRLVVRDARHRDTPRRSTATGG


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.