NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F079041

Overview

Basic Information
Family ID F079041
Family Type Metagenome / Metatranscriptome
Number of Sequences 116
Average Sequence Length 74 residues
Representative Sequence MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVMTGRLLFRKHRARPRAGNLLGDQA
Number of Associated Samples 99
Number of Associated Scaffolds 116
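
The representative sequence can be compared with the reported average length of 74 residues. The following is a minimal sketch of such a check; the helper name `summarize` and the hydrophobic residue set are illustrative assumptions and are not part of NMPFamsDB.

```python
# Representative sequence copied from the Basic Information block above.
REPRESENTATIVE = (
    "MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVMTGRLLFRKHRARPRAGNLLGDQA"
)

# Residues counted as hydrophobic here; this set is an assumption chosen
# for illustration, not a definition used by the database.
HYDROPHOBIC = set("AVILMFWC")


def summarize(seq: str) -> dict:
    """Return the length and hydrophobic fraction of a protein sequence."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    hydrophobic = sum(1 for aa in seq if aa in HYDROPHOBIC)
    return {"length": len(seq), "hydrophobic_fraction": hydrophobic / len(seq)}


if __name__ == "__main__":
    print(summarize(REPRESENTATIVE))
```

A high hydrophobic fraction would be consistent with the alpha-helical transmembrane classification reported under Structure & Topology, although this crude count is no substitute for a topology predictor.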

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 84.48 %
% of genes near scaffold ends (potentially truncated) 22.41 %
% of genes from short scaffolds (< 2000 bps) 78.45 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.44
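
The percentages above can be turned back into approximate gene counts. This is a small sketch that assumes each percentage is computed over the 116 family members; the function names are hypothetical.

```python
FAMILY_SIZE = 116  # "Number of Sequences" reported in Basic Information


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of members that a reported percentage implies."""
    return round(percent * total / 100)


def matches_report(count: int, percent: float,
                   total: int = FAMILY_SIZE, digits: int = 2) -> bool:
    """Check that count/total reproduces the reported percentage after rounding."""
    return round(100 * count / total, digits) == round(percent, digits)


if __name__ == "__main__":
    # Percentages taken from the Quality Assessment block.
    for label, pct in [("valid RBS motif", 84.48),
                       ("near scaffold end", 22.41),
                       ("short scaffold", 78.45)]:
        n = implied_count(pct)
        print(label, n, matches_report(n, pct))
```

If the assumption about the denominator holds, each reported value should round-trip through a whole number of genes.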

Hidden Markov Model (logo rendered with Skylign)

Most Common Taxonomy
Group Bacteria (93.103 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (24.138 % of family members)
Environment Ontology (ENVO) Unclassified (36.207 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (69.828 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 55.77%, β-sheet: 0.00%, Coil/Unstructured: 44.23%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.44

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
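
The confidence rules stated in this section can be expressed as a short sketch. The 0.7 pTM cutoff and the four pLDDT bins come from the page; the function names, and the choice to place non-integer scores in the next bin up, are assumptions.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted in the note above

# Inclusive upper bound and label of each pLDDT bin listed in this section.
PLDDT_BINS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def plddt_bin(score: float) -> str:
    """Return the label of the pLDDT bin that contains the score."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {score}")
    for upper, label in PLDDT_BINS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score already validated")


def is_low_confidence(ptm: float) -> bool:
    """True when the model falls below the pTM cutoff."""
    return ptm < LOW_CONFIDENCE_PTM


if __name__ == "__main__":
    print(is_low_confidence(0.44))  # pTM-score reported for F079041
    print(plddt_bin(63.0))
```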


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 116 Family Scaffolds
PF00232 | Glyco_hydro_1 | 57.76
PF00730 | HhH-GPD | 18.97
PF07386 | DUF1499 | 6.03
PF11026 | DUF2721 | 2.59
PF01266 | DAO | 0.86
PF01694 | Rhomboid | 0.86
PF00067 | p450 | 0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 116 Family Scaffolds
COG2723 | Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase | Carbohydrate transport and metabolism [G] | 57.76
COG0122 | 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 18.97
COG0177 | Endonuclease III | Replication, recombination and repair [L] | 18.97
COG1059 | Thermostable 8-oxoguanine DNA glycosylase | Replication, recombination and repair [L] | 18.97
COG1194 | Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs | Replication, recombination and repair [L] | 18.97
COG2231 | 3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily | Replication, recombination and repair [L] | 18.97
COG4446 | Uncharacterized conserved protein, DUF1499 family | Function unknown [S] | 6.03
COG0705 | Membrane-associated serine protease, rhomboid family | Posttranslational modification, protein turnover, chaperones [O] | 0.86
COG2124 | Cytochrome P450 | Defense mechanisms [V] | 0.86


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 93.10 %
Unclassified | root | N/A | 6.90 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002245|JGIcombinedJ26739_100408823 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1236
3300002245|JGIcombinedJ26739_100621498 | All Organisms → cellular organisms → Bacteria | 958
3300002568|C688J35102_118188910 | All Organisms → cellular organisms → Bacteria | 537
3300002914|JGI25617J43924_10040673 | All Organisms → cellular organisms → Bacteria | 1678
3300003368|JGI26340J50214_10008113 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 3379
3300003505|JGIcombinedJ51221_10218583 | All Organisms → cellular organisms → Bacteria | 773
3300003505|JGIcombinedJ51221_10312060 | All Organisms → cellular organisms → Bacteria | 640
3300004082|Ga0062384_100324790 | All Organisms → cellular organisms → Bacteria | 965
3300004092|Ga0062389_101483076 | All Organisms → cellular organisms → Bacteria | 862
3300004152|Ga0062386_100025329 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4367
3300005167|Ga0066672_10100265 | All Organisms → cellular organisms → Bacteria | 1765
3300005167|Ga0066672_10489276 | All Organisms → cellular organisms → Bacteria | 800
3300005171|Ga0066677_10004050 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5781
3300005175|Ga0066673_10279403 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 971
3300005332|Ga0066388_102606560 | All Organisms → cellular organisms → Bacteria | 921
3300005435|Ga0070714_100307691 | All Organisms → cellular organisms → Bacteria | 1479
3300005435|Ga0070714_101890034 | All Organisms → cellular organisms → Bacteria | 583
3300005437|Ga0070710_10018441 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3590
3300005437|Ga0070710_10320326 | All Organisms → cellular organisms → Bacteria | 1018
3300005439|Ga0070711_100367826 | All Organisms → cellular organisms → Bacteria | 1160
3300005454|Ga0066687_10053343 | All Organisms → cellular organisms → Bacteria | 1894
3300005554|Ga0066661_10396705 | All Organisms → cellular organisms → Bacteria | 845
3300005559|Ga0066700_10979718 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 557
3300005568|Ga0066703_10441624 | All Organisms → cellular organisms → Bacteria | 780
3300005602|Ga0070762_10012600 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4280
3300005610|Ga0070763_10042341 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2135
3300005610|Ga0070763_10426947 | All Organisms → cellular organisms → Bacteria | 749
3300005713|Ga0066905_102189928 | All Organisms → cellular organisms → Bacteria | 515
3300005921|Ga0070766_10043750 | All Organisms → cellular organisms → Bacteria | 2491
3300005921|Ga0070766_10352682 | Not Available | 955
3300006175|Ga0070712_100068273 | All Organisms → cellular organisms → Bacteria | 2532
3300006175|Ga0070712_100110386 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 2051
3300006755|Ga0079222_11431014 | All Organisms → cellular organisms → Bacteria | 642
3300006755|Ga0079222_12243263 | All Organisms → cellular organisms → Bacteria | 544
3300006791|Ga0066653_10333566 | All Organisms → cellular organisms → Bacteria | 766
3300006796|Ga0066665_10271243 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1351
3300006800|Ga0066660_10644123 | All Organisms → cellular organisms → Bacteria | 880
3300006804|Ga0079221_11627671 | All Organisms → cellular organisms → Bacteria | 524
3300006893|Ga0073928_10456290 | All Organisms → cellular organisms → Bacteria | 924
3300006893|Ga0073928_10547062 | All Organisms → cellular organisms → Bacteria | 824
3300006954|Ga0079219_10086742 | All Organisms → cellular organisms → Bacteria | 1491
3300009012|Ga0066710_102622761 | All Organisms → cellular organisms → Bacteria | 723
3300009012|Ga0066710_103903234 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 559
3300009143|Ga0099792_10163578 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1237
3300010359|Ga0126376_11997382 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 622
3300010361|Ga0126378_10036648 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0088 | 4454
3300010376|Ga0126381_104786955 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 521
3300011120|Ga0150983_12037671 | All Organisms → cellular organisms → Bacteria | 662
3300012205|Ga0137362_10742914 | All Organisms → cellular organisms → Bacteria | 842
3300018431|Ga0066655_10988271 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 581
3300018468|Ga0066662_10691217 | All Organisms → cellular organisms → Bacteria | 972
3300020579|Ga0210407_10042550 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3380
3300020580|Ga0210403_10028797 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium URHD0088 | 4434
3300020580|Ga0210403_10096445 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 2394
3300020580|Ga0210403_11177729 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 591
3300020581|Ga0210399_10826036 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 755
3300020582|Ga0210395_10567247 | All Organisms → cellular organisms → Bacteria | 853
3300020583|Ga0210401_10174631 | All Organisms → cellular organisms → Bacteria | 2003
3300020583|Ga0210401_10331328 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1383
3300021168|Ga0210406_10682814 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 793
3300021171|Ga0210405_10111708 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2157
3300021178|Ga0210408_11187126 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 583
3300021180|Ga0210396_10183654 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1872
3300021372|Ga0213877_10304799 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 540
3300021374|Ga0213881_10051307 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1747
3300021403|Ga0210397_10006579 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 7060
3300021405|Ga0210387_10188668 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1783
3300021441|Ga0213871_10086880 | All Organisms → cellular organisms → Bacteria | 902
3300021474|Ga0210390_10296426 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1370
3300021479|Ga0210410_11496645 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 568
3300021560|Ga0126371_10058965 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3696
3300022531|Ga0242660_1049649 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 911
3300022532|Ga0242655_10117695 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 748
3300022557|Ga0212123_10353983 | All Organisms → cellular organisms → Bacteria | 1003
3300022712|Ga0242653_1077915 | All Organisms → cellular organisms → Bacteria | 579
3300022724|Ga0242665_10050825 | All Organisms → cellular organisms → Bacteria | 1102
3300022726|Ga0242654_10429071 | All Organisms → cellular organisms → Bacteria | 512
3300025906|Ga0207699_11037266 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 607
3300025915|Ga0207693_10429550 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1033
3300025916|Ga0207663_10209126 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1413
3300025928|Ga0207700_10699059 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 905
3300026538|Ga0209056_10288707 | All Organisms → cellular organisms → Bacteria | 1140
3300026538|Ga0209056_10572995 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 571
3300026547|Ga0209156_10160460 | Not Available | 1087
3300026551|Ga0209648_10057718 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3301
3300026999|Ga0207949_1000539 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2679
3300027069|Ga0208859_1004433 | All Organisms → cellular organisms → Bacteria | 1488
3300027174|Ga0207948_1000168 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 5060
3300027297|Ga0208241_1037069 | All Organisms → cellular organisms → Bacteria | 760
3300027605|Ga0209329_1002531 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2907
3300027635|Ga0209625_1003302 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3529
3300027651|Ga0209217_1053999 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1205
3300027701|Ga0209447_10030367 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1507
3300027729|Ga0209248_10112685 | Not Available | 818
3300027767|Ga0209655_10143221 | Not Available | 792
3300027768|Ga0209772_10063582 | Not Available | 1109
3300027812|Ga0209656_10002900 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 11127
3300027855|Ga0209693_10015712 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales | 3594
3300027903|Ga0209488_10031233 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3881
3300027908|Ga0209006_10470503 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1051
3300031231|Ga0170824_106467587 | Not Available | 811
3300031231|Ga0170824_118808879 | All Organisms → cellular organisms → Bacteria | 703
3300031549|Ga0318571_10091122 | Not Available | 983
3300031715|Ga0307476_11193542 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 557
3300031718|Ga0307474_11072181 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 638
3300031753|Ga0307477_10296183 | All Organisms → cellular organisms → Bacteria | 1117
3300031754|Ga0307475_10817225 | All Organisms → cellular organisms → Bacteria | 739
3300031754|Ga0307475_10881642 | All Organisms → cellular organisms → Bacteria | 707
3300031771|Ga0318546_11121954 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 553
3300031833|Ga0310917_10862121 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 610
3300031890|Ga0306925_12087976 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 531
3300031896|Ga0318551_10777745 | All Organisms → cellular organisms → Bacteria | 556
3300032051|Ga0318532_10375510 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 505
3300032054|Ga0318570_10477686 | Not Available | 569
3300032055|Ga0318575_10711586 | All Organisms → cellular organisms → Bacteria | 508
3300032205|Ga0307472_101044853 | All Organisms → cellular organisms → Bacteria | 769
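
Each scaffold identifier above appears to combine a numeric prefix with a scaffold name, separated by `|`, and the prefix matches the Taxon OID values listed under Associated Samples. The sketch below splits identifiers on that basis and counts scaffolds per sample; the type and function names are hypothetical, and the interpretation of the prefix is an assumption drawn from this page alone.

```python
from collections import Counter
from typing import Dict, Iterable, NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str      # numeric prefix, assumed to be the sample's Taxon OID
    scaffold_name: str  # remainder after the "|" separator


def parse_scaffold_id(identifier: str) -> ScaffoldId:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two parts."""
    taxon_oid, sep, name = identifier.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldId(taxon_oid, name)


def count_by_sample(identifiers: Iterable[str]) -> Dict[str, int]:
    """Count how many scaffolds share each Taxon OID prefix."""
    return dict(Counter(parse_scaffold_id(i).taxon_oid for i in identifiers))


if __name__ == "__main__":
    ids = [
        "3300002245|JGIcombinedJ26739_100408823",
        "3300002245|JGIcombinedJ26739_100621498",
        "3300002568|C688J35102_118188910",
    ]
    print(count_by_sample(ids))
```

Grouping this way is one possible explanation for 116 scaffolds mapping to 99 associated samples, since several samples contribute more than one scaffold.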



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 24.14%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 11.21%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.34%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 7.76%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.76%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 5.17%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 5.17%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 5.17%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.45%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 3.45%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 2.59%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 2.59%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 2.59%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.72%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.72%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 1.72%
Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Bulk Soil | 0.86%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.86%
Exposed Rock | Environmental → Terrestrial → Rock-Dwelling (Subaerial Biofilms) → Unclassified → Unclassified → Exposed Rock | 0.86%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.86%
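
The habitat distribution can be rolled up to a coarser level of the ecosystem path by summing the percentages of rows that share a prefix. The following is a minimal sketch using four rows from the table above; `aggregate_by_level` is a hypothetical helper, not a feature of the site.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

SEP = " → "  # separator used in the ecosystem paths above


def aggregate_by_level(rows: Iterable[Tuple[str, float]], depth: int) -> Dict[str, float]:
    """Sum percentages of rows whose paths share the first `depth` levels."""
    if depth < 1:
        raise ValueError("depth must be at least 1")
    totals: Dict[str, float] = defaultdict(float)
    for path, percent in rows:
        key = SEP.join(path.split(SEP)[:depth])
        totals[key] += percent
    return dict(totals)


# A few (path, percent) rows copied from the table; not the full list.
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 24.14),
    ("Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil", 11.21),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 7.76),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere", 0.86),
]

if __name__ == "__main__":
    for key, total in aggregate_by_level(ROWS, depth=2).items():
        print(f"{key}: {total:.2f}%")
```

Because the published percentages are already rounded, totals computed this way may differ slightly from values derived from the underlying counts.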




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300003368 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300004082 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3 | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300004152 | Coassembly of ECP12_OM1, ECP12_OM2, ECP12_OM3 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005602 | Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300020582 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021178 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021372 | Vellozia epidendroides bulk soil microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - BS_R01 | Environmental
3300021374 | Barbacenia macrantha exposed rock microbial communities from rupestrian grasslands, the National Park of Serra do Cipo, Brazil - ER_R08 | Environmental
3300021403 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O | Environmental
3300021405 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O | Environmental
3300021441 | Rhizosphere microbial communities from Vellozia epidendroides in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R1 | Host-Associated
3300021474 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022531 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-28-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022532 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300022712 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-32-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022726 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025928 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300026547 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026999 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF044 (SPAdes) | Environmental
3300027069 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF002 (SPAdes) | Environmental
3300027174 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF040 (SPAdes) | Environmental
3300027297 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaG HF047 (SPAdes) | Environmental
3300027605 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes) | Environmental
3300027635 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M2 (SPAdes) | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027701 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM3 (SPAdes) | Environmental
3300027729 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP04_OM1 (SPAdes) | Environmental
3300027767 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM2 (SPAdes) | Environmental
3300027768 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP03_OM1 (SPAdes) | Environmental
3300027812 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP12_OM2 (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031549 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f24 | Environmental
3300031715 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental
3300031833 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178 | Environmental
3300031890Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2)EnvironmentalOpen in IMG/M
3300031896Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19EnvironmentalOpen in IMG/M
3300032051Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f26EnvironmentalOpen in IMG/M
3300032054Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f23EnvironmentalOpen in IMG/M
3300032055Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f23EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ26739_1004088231 | 3300002245 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRVRPRAGNLLGDQA*
JGIcombinedJ26739_1006214981 | 3300002245 | Forest Soil | MDSFPNWAWWTIAGGXLLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHR
C688J35102_1181889101 | 3300002568 | Soil | MDRVPNWAWWTIAGGVLLSPVFAFLLAVLVEILIGVLTQGGVPALLLAAVVVSGGLLLRKHRARPRAGNLLGDQA*
JGI25617J43924_100406732 | 3300002914 | Grasslands Soil | MDRFPNWAWWTVAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLIVATAVSGWLMLRKHRTRPRVGDPLSDQA*
JGI26340J50214_100081134 | 3300003368 | Bog Forest Soil | MDSLPTWAWWVIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLFLAATVICGRLLFRKYWTRPAASNLLGDQA*
JGIcombinedJ51221_102185832 | 3300003505 | Forest Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFXKHRTRARADNLLGDQA*
JGIcombinedJ51221_103120602 | 3300003505 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLLGDQA*
Ga0062384_1003247902 | 3300004082 | Bog Forest Soil | MDSFPNWAWWTIVGGILLSPVFAFLLAVVVEILIGVLTQGGAPALLILAAAVISGRVLLRRRRTRPRAGNLLGDQA*
Ga0062389_1014830762 | 3300004092 | Bog Forest Soil | MDSVPNWAWWTIAGGILLSPVFAFLLAVVVEILIGVLTQGGVPALLLAAAMVMSGRLLLRKHRTRARAGNLLGDQA*
Ga0062386_1000253295 | 3300004152 | Bog Forest Soil | MDSLPNWAWWTIAGGMLLSPVFAFLLAVLVEILIGIVTEGGLPALLVLAVAVTAGRLLFRKHRRRRPAGGLLGDQA*
Ga0066672_101002651 | 3300005167 | Soil | MDSVPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLLGAQA*
Ga0066672_104892762 | 3300005167 | Soil | MDSLSNWSWWIIAGGILLSPVFAFLLAILAEILIGLLSQGGAPGLLILAAAVMSGLSLIRKRRTRTLARNLLGDQA*
Ga0066677_100040502 | 3300005171 | Soil | MDSVPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLLGDQA*
Ga0066673_102794032 | 3300005175 | Soil | MDSLPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLLGDQA*
Ga0066388_1026065602 | 3300005332 | Tropical Forest Soil | MDKLPTWAWWTIAGGILLSPVIAFLLAVLVEIVIGCLVQGGLPAILILTVAVISGRLLMRKRWPRLPA
Ga0070714_1003076912 | 3300005435 | Agricultural Soil | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLVQGGMPGLLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA*
Ga0070714_1018900341 | 3300005435 | Agricultural Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAVVVMSGGLLLRKHRARPRAGNLLGDQA*
Ga0070710_100184418 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA*
Ga0070710_103203261 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | MDRVPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAVVVMSGGLLLRKHRAR
Ga0070711_1003678262 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MDRVPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAVVVMSGGLLLRKHRARPRAGNLLGDQA*
Ga0066687_100533431 | 3300005454 | Soil | NWSWWIIAGGILLSPVFAFLLAILAEILIGLLSQGGAPGLLILAAAVMSGLSLIRKRRTRTLARNLLGDQA*
Ga0066661_103967052 | 3300005554 | Soil | MDRFPNWAWWTIAGGILLSPVFAFLLAVAAEILIGVVTQGGVPALLIVVGAAISGLLLLRKHRTRPRAGDLLGDQA*
Ga0066700_109797181 | 3300005559 | Soil | MDSLPNWAWWTIAGEMLLSPVFAFLLALLIEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA*
Ga0066703_104416242 | 3300005568 | Soil | MDSVPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLSRRRWTRLRAGNLLGDQA*
Ga0070762_100126007 | 3300005602 | Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFYKHRTRARADNLLGDQA*
Ga0070763_100423412 | 3300005610 | Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLFRKHRARPRAGNLLGDQA*
Ga0070763_104269472 | 3300005610 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGGLLLRKHRTRPRAGDLLGDQA*
Ga0066905_1021899281 | 3300005713 | Tropical Forest Soil | MDKLPTWAWWTIAGGILLSPVIAFLLAVLVEIVIGCLVQGGLPAILILTVAVISGRLLMRKRWPRLPARNLLGDQA*
Ga0070766_100437502 | 3300005921 | Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLFRKHRARPRAGNLLGDQA*
Ga0070766_103526821 | 3300005921 | Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFHKHRTRARADNLLGDQA*
Ga0070712_1000682732 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | MDRVPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLVAVVVMSGGLLLRKHRARPRAGNLLGDQA*
Ga0070712_1001103865 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | PRQKMDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA*
Ga0079222_114310142 | 3300006755 | Agricultural Soil | MNSLSNWAWWIIAGGILLSPVFAFLLAILAEILIGLLSQGGAPGLLILAAAVTSGLLLIRRRRTRTLACNLLGDQA*
Ga0079222_122432632 | 3300006755 | Agricultural Soil | MDSLPNWAWWTIAGGILLSPVFAFLFAVMVEILIGGLTQGGVPALLIAAAAIISGWLLLRRHWTRHRAGNLLGDQA*
Ga0066653_103335662 | 3300006791 | Soil | MDRVSNWAWWTIAGGVLLSPVFAFLVAVVVEILINIIAQGGIPALLVVAAAVMLGWSLFRRHRARSLAGDLLGDQA*
Ga0066665_102712432 | 3300006796 | Soil | MDSLPNWAWWTIAGGMLLSPVFAFLLALLVEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA*
Ga0066660_106441232 | 3300006800 | Soil | MDSVPNWAWWTVAGGILLSPVFAFLFVVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLLGDQA*
Ga0079221_116276712 | 3300006804 | Agricultural Soil | MDRFPNWAWWTIVGGILLSPVFAFLLAVVVEILIGVLTQGGVPALLLIVAVVMSGGLLLRKQRTRA
Ga0073928_104562902 | 3300006893 | Iron-Sulfur Acid Spring | MDRFPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLIVATGVSGWLMLRKHRTHPRVGEPLSDQA*
Ga0073928_105470622 | 3300006893 | Iron-Sulfur Acid Spring | MDRVPNWAWWTIAGGVLLSPVFAFLLAVLVEILIGVLTQGGVPALLLAVVVMSGGLLLRKHRARPRAGNLLGDQA*
Ga0079219_100867422 | 3300006954 | Agricultural Soil | MNSLSNWAWWIIAGGILLSPVFAFLLAILAEILIGLLSQGGAPALLILAAAVTSGLLLIRRRRTRTLACNLLGDQA*
Ga0066710_1026227612 | 3300009012 | Grasslands Soil | MDSLPNWAWWTIAGVMLLSPVFAFLLALLVEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA
Ga0066710_1039032341 | 3300009012 | Grasslands Soil | MDSLSNWSWWIIAGGILLSPVFAFLLAILAEILIGLLSQGGAPGLLILAAAVMSGLSLIRKRRTRTLARNLLGDQA
Ga0099792_101635783 | 3300009143 | Vadose Zone Soil | MDRVPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAAVVVSGGLLLRKHRARPRAGNLLGDQA*
Ga0126376_119973821 | 3300010359 | Tropical Forest Soil | MDSLPNWAWWLIAGGILLSAVFAFLLAVAVEVLIGVLTQASIPALVIIGLSVTCGGLLLRKYRTRPPTRNLLGDQA*
Ga0126378_100366483 | 3300010361 | Tropical Forest Soil | MDSLPNWAWWIIAGGILLSAVFAFLLAVAVEMLIGVLTQASIPALVIIALSVTCGGLLLRKYRTRSPTGNLLGDQA*
Ga0126381_1047869551 | 3300010376 | Tropical Forest Soil | MDSLPNWAWWIIAGGILLSAVFAFLLAVAVEMLIGVLTQASIPALVIIGLSVTCGGLLLRKYRTRPPTRNLLGDQA*
Ga0150983_120376712 | 3300011120 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKH
Ga0137362_107429142 | 3300012205 | Vadose Zone Soil | MLLSPVFAFLLALLIEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA*
Ga0066655_109882712 | 3300018431 | Grasslands Soil | MDSVPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLLGDQA
Ga0066662_106912172 | 3300018468 | Grasslands Soil | MDRVSNWAWWAIAGGILLSPAFAFLVAVVVEILINIVAQGGIPALLVVAAAVMSGRSLFRKHRARSRAGDLLGDQA
Ga0210407_100425504 | 3300020579 | Soil | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNPLGDQA
Ga0210403_100287977 | 3300020580 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRLLLRKHRTRPRAGDLLGDQA
Ga0210403_100964456 | 3300020580 | Soil | WWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0210403_111777292 | 3300020580 | Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0210399_108260362 | 3300020581 | Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0210395_105672472 | 3300020582 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRRLLRKHRARPRAGDLLGDQA
Ga0210401_101746312 | 3300020583 | Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGILMQGGVPALLVVAAALMSGRLLFHKHRTRARADNLLGDQA
Ga0210401_103313283 | 3300020583 | Soil | GGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRRLLRKHRARPRAGDLLGDQA
Ga0210406_106828141 | 3300021168 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGGLLLRKHRTRPRAGDLLGDQA
Ga0210405_101117082 | 3300021171 | Soil | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0210408_111871261 | 3300021178 | Soil | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLSQGGMPALLIVAVAVTTGRLLFRKHRARPRAGNLLGDQA
Ga0210396_101836542 | 3300021180 | Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLIVAAALMSGRLLFHKHRTRARADNLLGDQA
Ga0213877_103047992 | 3300021372 | Bulk Soil | RNWAWWTIAGGILLSPVFAFLLAVMIEILVGVVTQGGMPALPMVAVAVMSGRLLLRKHRTRPRAGDLLGDQA
Ga0213881_100513072 | 3300021374 | Exposed Rock | MDSLSNWAWWVIAGGILLSPVFAFLLAVAVEILIGVLTQAGPPVLIIVALSVAGGWSLRKYRTRPPVGNLLGDQA
Ga0210397_100065796 | 3300021403 | Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0210387_101886681 | 3300021405 | Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLVRKH
Ga0213871_100868802 | 3300021441 | Rhizosphere | MDRFRNWAWWTIAGGILLSPVFAFLLAVMIEILVGVVTQGGMPALLMVAVAVMSGRLLLRKHRTRPRAGDLLGDQA
Ga0210390_102964261 | 3300021474 | Soil | QGPRQKMDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLPGDQA
Ga0210410_114966451 | 3300021479 | Soil | CSGASTEMDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRRLLRKHRARPRAGDLLGDQA
Ga0126371_100589655 | 3300021560 | Tropical Forest Soil | MDSLPNWAWWIIAGGILLSAVFAFLLAVAVEVLIGVLTQASIPALVIIALSVTCGGLLLRKYRTRPPTRNLLGDQA
Ga0242660_10496492 | 3300022531 | Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA
Ga0242655_101176952 | 3300022532 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRLLRKHRARPRAGDLLGDQA
Ga0212123_103539832 | 3300022557 | Iron-Sulfur Acid Spring | MDRFPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLIVATGVSGWLMLRKHRTHPRVGEPLSDQA
Ga0242653_10779152 | 3300022712 | Soil | MDRLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGIVTQGGVPALLIVAGAAISGGLLLRKHRTRPRAGDLLGDQA
Ga0242665_100508252 | 3300022724 | Soil | MDSFPNWAWWTIAGGVLLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRVRPRAGNLLGDQA
Ga0242654_104290712 | 3300022726 | Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARP
Ga0207699_110372662 | 3300025906 | Corn, Switchgrass And Miscanthus Rhizosphere | MDRLPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAAVVMLGGLLLRKHRARPRAGNLLGDQA
Ga0207693_104295502 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | PRQKMDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA
Ga0207663_102091262 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLVQGGMPGLLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA
Ga0207700_106990591 | 3300025928 | Corn, Switchgrass And Miscanthus Rhizosphere | AWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAVVVMSGGLLLRKHRARPRAGNLLGDQA
Ga0209056_102887071 | 3300026538 | Soil | MDSLPNWAWWTIAGGMLLSPVFAFLLALLVEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA
Ga0209056_105729951 | 3300026538 | Soil | MDSLPNWAWWTIAGEMLLSPVFAFLLALLIEILIGALTEAGLPALLILAAAITSGWLLFDQHRRRRRAGDLLGDQA
Ga0209156_101604602 | 3300026547 | Soil | MDSVPNWAWWTIAGGILLSPVFAFLFAVMVEILIGVLTQGGVPALLIVAAAIISGGSLFRRRWTRPRAGNLMG
Ga0209648_100577182 | 3300026551 | Grasslands Soil | MDRFPNWAWWTVAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLIVATAVSGWLMLRKHRTRLRVGDPLSDQA
Ga0207949_10005393 | 3300026999 | Forest Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFYKHRTRARADNLLGDQA
Ga0208859_10044332 | 3300027069 | Forest Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVAVAVTTGRVLFRKHRARPRAGNLLGDQA
Ga0207948_10001688 | 3300027174 | Forest Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVALAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0208241_10370692 | 3300027297 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRA
Ga0209329_10025312 | 3300027605 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRVRPRAGNLLGDQA
Ga0209625_10033026 | 3300027635 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNLLGDQA
Ga0209217_10539991 | 3300027651 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVMTGGLLFRKHRARPRAGNLLGDQA
Ga0209447_100303672 | 3300027701 | Bog Forest Soil | MDSFPNWAWWTIVGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAVSGRLLLRKHRTCPQAGDLLGDQA
Ga0209248_101126852 | 3300027729 | Bog Forest Soil | MNSLPNWAWWTIAGGILLSPVFAFLLAVVVEILIGVLTQGGVPALLLAAAMVMSGRLLLRKHRTRARAGNLLGDQA
Ga0209655_101432211 | 3300027767 | Bog Forest Soil | MNSLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRLLLRKHRTRPQAGDLLGDQA
Ga0209772_100635822 | 3300027768 | Bog Forest Soil | MNSLPNWAWWTIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRLLLRKHRTRPQAGDLLGDLNDPT
Ga0209656_100029009 | 3300027812 | Bog Forest Soil | MDSLPTWAWWVIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLFLAATVICGRLLFRKYWTRPAASNLLGDQA
Ga0209693_100157122 | 3300027855 | Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAVAVVTGRLLFRKHRARPRAGNLLGDQA
Ga0209488_100312331 | 3300027903 | Vadose Zone Soil | MDRVPNWAWWTIAGGILLSPVFAFLLAVLVEILIGVLTQGGVPALLLAAVVVSGGLLLRKHRARPRAGNLLGDQA
Ga0209006_104705032 | 3300027908 | Forest Soil | MDSFPNWAWWTIAGGVLLSPVFAFFLAVMVEILIGVLTQGGMPALLIVAVAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0170824_1064675872 | 3300031231 | Forest Soil | MDSLPNWAWWTIAGGMLLSPVVAFLLAVLVEILIGIVTEAGLPALLVLAVAVTAGWLLFRKHRRRRPAGGL
Ga0170824_1188088792 | 3300031231 | Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRLLFRKHRARPRAGNPLGD
Ga0318571_100911222 | 3300031549 | Soil | MDRLRNWAWWAIAGGILLSPVFAFLLAVVVEILIGVLVQCGASALLILAAGVMSGRLLFRKHRTRGRAGNLLGDQA
Ga0307476_111935422 | 3300031715 | Hardwood Forest Soil | TIAGGILLSPVFAFLLAVVAEILIGVVTQGGVPALLIVAGAAISGRLLLRKHRTRPRAGDLLGDQA
Ga0307474_110721812 | 3300031718 | Hardwood Forest Soil | KMDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFHKHRTRARADNLLGDQA
Ga0307477_102961832 | 3300031753 | Hardwood Forest Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVVAAALMSGRLLFHKHRTRARADNLLGDQA
Ga0307475_108172252 | 3300031754 | Hardwood Forest Soil | MDRVPNWVWWTIAGGILLSPVFAFLLAVVVEISIGVLMQGGVPALLVAAAALMSGRLLFHKHRTRARADNLLGDQA
Ga0307475_108816422 | 3300031754 | Hardwood Forest Soil | MDSFPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVMTGRLLFRKHRARPRAGNLLGDQA
Ga0318546_111219541 | 3300031771 | Soil | AWWTIAGGILLSPVLAFLLAVVAEILIGLVTEGGVPALLIVAGAAISGRLLLRKHRTRPRARDLLGDQA
Ga0310917_108621211 | 3300031833 | Soil | MDSFPNWAWWTIAGGILLSPVLAFLLAVVAEILIGLVTEGGVPALLIVAGAAISGRLLLRKHRTRPRARDLLGDQA
Ga0306925_120879761 | 3300031890 | Soil | MDSLPNWAWWMIAGGILLSPAFAFLLAVVVEILIGVLTQADMPALIVIALLVACGGWLLRKYRTRSPAGNLLGDQA
Ga0318551_107777451 | 3300031896 | Soil | MDSLPNWAWWTMVGGVLLSSVFAFLLAVVTEILIGVLIQASVPALIIIALSIACGCLLLRKYRTRPPAGNLLGD
Ga0318532_103755101 | 3300032051 | Soil | MDSLPNWAWWTMVGGVLLSSVFAFLLAVATEILIGVLIQASMPALIIIALSIACGCLLLRKYRTRPPAGNLLGDQA
Ga0318570_104776861 | 3300032054 | Soil | MDSLSNWAWWMIAGGILLSPVFAFLLAVVVEILLGVLTQLGMPALVVILVAVACARLLLRKYR
Ga0318575_107115862 | 3300032055 | Soil | MDRLRNWAWWAIAGGILLSPVFAFLLAVVVEILIGVLVQCGASALLILAAGVMSGRLLFRKHRTRGR
Ga0307472_1010448532 | 3300032205 | Hardwood Forest Soil | MDSLPNWAWWTIAGGILLSPVFAFLLAVMVEILIGVLTQGGMPALLIVAAAVVTGRSLFRKHRARPRAGNLLGDQA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice