NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F100523



Overview

Basic Information
Family ID F100523
Family Type Metagenome / Metatranscriptome
Number of Sequences 102
Average Sequence Length 85 residues
Representative Sequence MTFVSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Number of Associated Samples 79
Number of Associated Scaffolds 102

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 97.06 %
% of genes near scaffold ends (potentially truncated) 18.63 %
% of genes from short scaffolds (< 2000 bps) 64.71 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.26
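
The percentages in the Quality Assessment table can be converted back to approximate gene counts. The sketch below assumes each percentage is taken over the 102 family sequences (one gene per sequence); the page does not state the denominator, so treat that as an assumption. The names `FAMILY_SIZE`, `reported_percent`, and `percent_to_count` are illustrative only.

```python
# Assumption: each percentage is computed over the 102 family sequences.
FAMILY_SIZE = 102  # "Number of Sequences" reported in Basic Information

# Percentages copied from the Quality Assessment table.
reported_percent = {
    "valid RBS motif": 97.06,
    "near scaffold end": 18.63,
    "short scaffold (< 2000 bp)": 64.71,
}


def percent_to_count(percent: float, total: int) -> int:
    """Return the whole-number count closest to `percent` of `total`."""
    return round(percent * total / 100)


counts = {
    label: percent_to_count(p, FAMILY_SIZE)
    for label, p in reported_percent.items()
}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, each recovered count rounds back to the percentage shown in the table, which is a quick consistency check on the reported figures.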


Most Common Taxonomy
Group Bacteria (99.020 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(29.412 % of family members)
Environment Ontology (ENVO) Unclassified
(42.157 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(38.235 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 20.54%, β-sheet: 14.29%, Coil/Unstructured: 65.18%

Predicted 3D Structure

pTM-score: 0.26

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
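
The low-confidence rule quoted above (pTM < 0.7) can be written as a one-line check. This is a minimal sketch; the function name `is_low_confidence` and the constant name are hypothetical and not part of any NMPFamsDB tooling.

```python
# Threshold quoted on this page: models with pTM < 0.7 are low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score falls strictly below the threshold."""
    return ptm < threshold


# pTM-score reported for family F100523.
print(is_low_confidence(0.26))
```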



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 102 Family Scaffolds
PF03704 | BTAD | 52.94
PF00072 | Response_reg | 4.90
PF12833 | HTH_18 | 4.90
PF04820 | Trp_halogenase | 3.92
PF00196 | GerE | 1.96
PF04632 | FUSC | 1.96
PF07730 | HisKA_3 | 1.96
PF06114 | Peptidase_M78 | 0.98
PF13533 | Biotin_lipoyl_2 | 0.98
PF05145 | AbrB | 0.98
PF02803 | Thiolase_C | 0.98
PF07969 | Amidohydro_3 | 0.98
PF08450 | SGL | 0.98
PF13360 | PQQ_2 | 0.98
PF04970 | LRAT | 0.98
PF01435 | Peptidase_M48 | 0.98
PF00486 | Trans_reg_C | 0.98
PF06071 | YchF-GTPase_C | 0.98
PF06628 | Catalase-rel | 0.98
PF02771 | Acyl-CoA_dh_N | 0.98
PF00174 | Oxidored_molyb | 0.98
PF13426 | PAS_9 | 0.98
PF12821 | ThrE_2 | 0.98

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 102 Family Scaffolds
COG3629 | DNA-binding transcriptional regulator DnrI/AfsR/EmbR, SARP family, contains BTAD domain | Transcription [K] | 52.94
COG3947 | Two-component response regulator, SAPR family, consists of REC, wHTH and BTAD domains | Transcription [K] | 52.94
COG1289 | Uncharacterized membrane protein YccC | Function unknown [S] | 1.96
COG3850 | Signal transduction histidine kinase NarQ, nitrate/nitrite-specific | Signal transduction mechanisms [T] | 1.96
COG3851 | Signal transduction histidine kinase UhpB, glucose-6-phosphate specific | Signal transduction mechanisms [T] | 1.96
COG4564 | Signal transduction histidine kinase | Signal transduction mechanisms [T] | 1.96
COG4585 | Signal transduction histidine kinase ComP | Signal transduction mechanisms [T] | 1.96
COG0012 | Ribosome-binding ATPase YchF, GTP1/OBG family | Translation, ribosomal structure and biogenesis [J] | 0.98
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 0.98
COG0753 | Catalase | Inorganic ion transport and metabolism [P] | 0.98
COG1960 | Acyl-CoA dehydrogenase related to the alkylation response protein AidB | Lipid transport and metabolism [I] | 0.98
COG2041 | Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductases | Energy production and conversion [C] | 0.98
COG3180 | Uncharacterized membrane protein AbrB, regulator of aidB expression | General function prediction only [R] | 0.98
COG3386 | Sugar lactone lactonase YvrE | Carbohydrate transport and metabolism [G] | 0.98
COG3391 | DNA-binding beta-propeller fold protein YncE | General function prediction only [R] | 0.98
COG3915 | Uncharacterized conserved protein | Function unknown [S] | 0.98



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 99.02 %
Unclassified | root | N/A | 0.98 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000546|LJNas_1031530 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 570
3300001471|JGI12712J15308_10106781 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 712
3300002245|JGIcombinedJ26739_100545702 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1036
3300002245|JGIcombinedJ26739_101118087 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 675
3300002245|JGIcombinedJ26739_101685887 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 533
3300002917|JGI25616J43925_10016419 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3281
3300006059|Ga0075017_100134857 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1747
3300006059|Ga0075017_100281426 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1224
3300006162|Ga0075030_100187072 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1670
3300006893|Ga0073928_10003110 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 26789
3300007258|Ga0099793_10096288 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1365
3300007265|Ga0099794_10121218 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1316
3300007788|Ga0099795_10006228 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3241
3300009038|Ga0099829_10782801 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 792
3300010880|Ga0126350_11324659 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 710
3300010880|Ga0126350_11773100 | All Organisms → cellular organisms → Bacteria | 637
3300011269|Ga0137392_11086678 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 656
3300011271|Ga0137393_11324699 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 609
3300012096|Ga0137389_11551343 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 558
3300012189|Ga0137388_10447683 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1198
3300012202|Ga0137363_11032998 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 698
3300012205|Ga0137362_10398316 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1192
3300012359|Ga0137385_10728053 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 827
3300012363|Ga0137390_11808911 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 542
3300012923|Ga0137359_10520393 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1048
3300012924|Ga0137413_11000669 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 655
3300012925|Ga0137419_10015234 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4320
3300013306|Ga0163162_11620624 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 738
3300015051|Ga0137414_1143335 | All Organisms → cellular organisms → Bacteria | 3921
3300015242|Ga0137412_10004605 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 11003
3300015242|Ga0137412_10275429 | All Organisms → cellular organisms → Bacteria | 1325
3300019789|Ga0137408_1297895 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1346
3300019886|Ga0193727_1000318 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 18208
3300019886|Ga0193727_1000519 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 15025
3300019886|Ga0193727_1000696 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 13363
3300019886|Ga0193727_1004251 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae | 5875
3300019886|Ga0193727_1004469 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae | 5716
3300019886|Ga0193727_1006844 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 4564
3300019887|Ga0193729_1084327 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1234
3300019887|Ga0193729_1152824 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 832
3300020001|Ga0193731_1042307 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1203
3300020002|Ga0193730_1162059 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 582
3300020004|Ga0193755_1031089 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1767
3300020021|Ga0193726_1105537 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1271
3300020034|Ga0193753_10293152 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 703
3300020060|Ga0193717_1025665 | All Organisms → cellular organisms → Bacteria | 2396
3300020061|Ga0193716_1071426 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1560
3300020199|Ga0179592_10000016 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 81374
3300020199|Ga0179592_10000149 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 28978
3300020199|Ga0179592_10019259 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3020
3300020579|Ga0210407_10010391 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae | 6947
3300020580|Ga0210403_10054674 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3194
3300020580|Ga0210403_10158676 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1851
3300020583|Ga0210401_10021648 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Variovorax | 6163
3300021168|Ga0210406_10019380 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 6373
3300021170|Ga0210400_10319551 | All Organisms → cellular organisms → Bacteria | 1276
3300021433|Ga0210391_11108025 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae | 615
3300021475|Ga0210392_11196780 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 569
3300021475|Ga0210392_11219573 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae | 563
3300021478|Ga0210402_10196543 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae | 1852
3300021478|Ga0210402_10468994 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1169
3300021478|Ga0210402_10933876 | All Organisms → cellular organisms → Bacteria | 794
3300021479|Ga0210410_10136877 | All Organisms → cellular organisms → Bacteria | 2179
3300022557|Ga0212123_10007792 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 16783
3300022726|Ga0242654_10336109 | Not Available | 565
3300024330|Ga0137417_1090028 | All Organisms → cellular organisms → Bacteria | 2657
3300026304|Ga0209240_1036489 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1869
3300026319|Ga0209647_1084779 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1554
3300026320|Ga0209131_1030372 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3172
3300026475|Ga0257147_1023899 | All Organisms → cellular organisms → Bacteria | 863
3300026514|Ga0257168_1088201 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 689
3300026551|Ga0209648_10315090 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1107
3300026555|Ga0179593_1071828 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2757
3300026555|Ga0179593_1151204 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2052
3300026557|Ga0179587_10521136 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 780
3300027521|Ga0209524_1052137 | All Organisms → cellular organisms → Bacteria | 863
3300027528|Ga0208985_1067791 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 689
3300027605|Ga0209329_1014512 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1541
3300027660|Ga0209736_1022233 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1937
3300027671|Ga0209588_1038070 | All Organisms → cellular organisms → Bacteria | 1549
3300027674|Ga0209118_1099474 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 823
3300027684|Ga0209626_1020324 | All Organisms → cellular organisms → Bacteria | 1581
3300027684|Ga0209626_1162834 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Syntrophobacterales → Syntrophobacteraceae → Syntrophobacter → unclassified Syntrophobacter → Syntrophobacter sp. SbD1 | 590
3300027829|Ga0209773_10315827 | All Organisms → cellular organisms → Bacteria | 651
3300027862|Ga0209701_10055140 | All Organisms → cellular organisms → Bacteria | 2540
3300027903|Ga0209488_10051276 | All Organisms → cellular organisms → Bacteria | 3036
3300027908|Ga0209006_10021063 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5909
3300027908|Ga0209006_10088574 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2751
3300027908|Ga0209006_10314090 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 1334
3300027911|Ga0209698_10824283 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 700
3300028047|Ga0209526_10044651 | All Organisms → cellular organisms → Bacteria | 3118
3300028536|Ga0137415_10257919 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1551
3300031057|Ga0170834_111786023 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 942
3300031231|Ga0170824_110737705 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1406
3300031236|Ga0302324_100150962 | All Organisms → cellular organisms → Bacteria | 3808
3300031446|Ga0170820_14959289 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 650
3300031474|Ga0170818_108343631 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → unclassified Gammaproteobacteria → Gammaproteobacteria bacterium | 786
3300031823|Ga0307478_10052227 | All Organisms → cellular organisms → Bacteria | 3035
3300032174|Ga0307470_10000481 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 19611
3300032174|Ga0307470_10001004 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales | 11687
3300032174|Ga0307470_10039943 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2316
3300032180|Ga0307471_102559355 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 646




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 29.41%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 15.69%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 14.71%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 14.71%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 4.90%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.90%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.92%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 3.92%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 1.96%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 1.96%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.98%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 0.98%
Quercus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Quercus Rhizosphere | 0.98%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.98%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000546 | Quercus rhizosphere microbial communities from Sierra Nevada National Park, Granada, Spain - LJN_Illumina_Assembled | Host-Associated
3300001471 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002917 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm | Environmental
3300006059 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2012 | Environmental
3300006162 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 | Environmental
3300006893 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaG | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300010880 | Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300013306 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG | Host-Associated
3300015051 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction) | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300019789 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction) | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental
3300019887 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c2 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020034 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1c2 | Environmental
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental
3300020061 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1 | Environmental
3300020199 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300021475 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-O | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021479 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300022726 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-30-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300026304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm (SPAdes) | Environmental
3300026319 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_60cm (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026475 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-A | Environmental
3300026514 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-B | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027521 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M1 (SPAdes) | Environmental
3300027528 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_O1 (SPAdes) | Environmental
3300027605 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes) | Environmental
3300027660 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027684 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes) | Environmental
3300027829 | Bog forest soil microbial communities from Calvert Island, British Columbia, Canada - ECP14_OM1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
LJNas_10315301 3300000546 Quercus Rhizosphere MTFVSQRMSEYYLAGNSEATGFWSGTKNPRWVRTLSRRALVICITASTSVLARSFTTRGSRAVCAGDQWKKFLSLISRADIASG*
JGI12712J15308_101067811 3300001471 Forest Soil MIFVSQGLSVRYLAGNSEATGSWSGTKNPRXVRTLSHLGAAISITASTSVLARSFTTRDSRTVYAEDRWKKFLSLISRADSASG*
JGIcombinedJ26739_1005457022 3300002245 Forest Soil MMYVSQGVSVRYLAGNSEATGSWSGTKNLRWVRTLSHLGVAICITASTSVLAMSFTTRGARTVCAENRWKRFLSLISRADSASG*
JGIcombinedJ26739_1011180872 3300002245 Forest Soil MTFVFQGVSVRYFAGDSVATGSWSGTKNLRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISR
JGIcombinedJ26739_1016858871 3300002245 Forest Soil MTIDSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRDSRAVCAVDRWKKFLSLISRADSASG*
JGI25616J43925_100164193 3300002917 Grasslands Soil MTFVSQGVSVRYLAGNSEAIGYWSGTKNPRWVRTLSHPGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0075017_1001348571 3300006059 Watersheds MTFVSQGVGVRYLAGNSEATGPWWGTKNLRWVRTLSHPGAAISIMASTSELSRSFTTRGSRTVCAEDRWKKFLSL
Ga0075017_1002814261 3300006059 Watersheds MMFVSQNVSVRYLSGNLAATGSWSIAKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLVSRADITSG*
Ga0075030_1001870721 3300006162 Watersheds MTFVSQGVGVRYLAGNSEATGPWWGTKNLRWVRTLSHPGAAISIMASTSELSRSFTTRGSRTVCAEDRWKKFLSLISRADITSG*
Ga0073928_100031108 3300006893 Iron-Sulfur Acid Spring MTFVSQGVSVRYLAGNSEATGFWSGSKNPRWVRTLSHLGAAICITASTSVLARSCTTRGSRTVCAEDRWKKFLSLISRAESASG*
Ga0099793_100962881 3300007258 Vadose Zone Soil MTFVSQGVSVRYLAGNSVATGSWSGTRNSRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRSDSASGCDRTRHPISMFERSSAEHDPV
Ga0099794_101212181 3300007265 Vadose Zone Soil MTLVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSTSG*
Ga0099795_100062281 3300007788 Vadose Zone Soil MTLVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSACG*
Ga0099829_107828012 3300009038 Vadose Zone Soil MTFVSQGVSVRYLDGNSEASGFWSGTRNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSACG*
Ga0126350_113246592 3300010880 Boreal Forest Soil MTCISQGVIVRYLAENLAATGSWSGTKNPRWVRTLSHLGAAICITASTSVRARSFTIRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0126350_117731001 3300010880 Boreal Forest Soil MTFVSQSVSVRYLAGNSEATGLWSGTKNPRWVRTLSHLGAAICITASTSALARSFTTQGSRRVCVEDRWKKFLSLISRADS
Ga0137392_110866782 3300011269 Vadose Zone Soil MTFVSQGVNVRYLAGNSEATGAWSGTKNPRWDRTLSHLGGAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRAHSGSG*
Ga0137393_113246992 3300011271 Vadose Zone Soil MTFVSQGVSVRCLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0137389_115513432 3300012096 Vadose Zone Soil MTLVSQGVSVRYLAGNSEATGSWSGTKNLRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADS
Ga0137388_104476832 3300012189 Vadose Zone Soil MTIVSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHRGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0137363_110329981 3300012202 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0137362_103983162 3300012205 Vadose Zone Soil MTFVSQGVSVRSLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSACG*
Ga0137385_107280532 3300012359 Vadose Zone Soil MTFVSQGVSVRDLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG*
Ga0137390_118089111 3300012363 Vadose Zone Soil MTFVSQGVNVRYLAGNSEATGAWSGTKNPRWDRTLSHLGAAICITASTSVPARSFTTRGSHTVCAEDRWKKFLSLISRADSASG*
Ga0137359_105203931 3300012923 Vadose Zone Soil MTFVSQGVSVRYLAVDSEATGFWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSTSG*
Ga0137413_110006691 3300012924 Vadose Zone Soil MTFVSPSVSVRYLTGKSEPTGSWSATKNPRWVRTLSHRGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLWLISRGGTPELHLAQLKG*
Ga0137419_100152342 3300012925 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGSWPGTRNPRWVRTLSHLGAAICITASTSVLARSCTTQGSRTVCAEDRWKRFLSLISRANSTSG*
Ga0163162_116206242 3300013306 Switchgrass Rhizosphere VSECYLADHSEATSYWSGTRNPRWVRTLSHPGAAVCITTSTSALAKSFTTRGSRTVCAEGRWKRFLSLISRADSALGRDRTRHLSFDQ*
Ga0137414_11433354 3300015051 Vadose Zone Soil MTSVSQGVSVRYLAGNPEAAGFWSETKNPRWVRTLSHLGAAICITASTSVLARSCTTQGSRTVCAEDRWRRFLSLISRANSTSG*
Ga0137412_1000460511 3300015242 Vadose Zone Soil MTFVSQDVSVRYLAGNSEATGSWPGTKNPRWVRTLSHLGAAICITASTSVLARSFTTQGSRTVCAEDRWKKFLSLISRADSTSG*
Ga0137412_102754291 3300015242 Vadose Zone Soil MTFVSQDVNARYLAAVDSEATGFWSATENLRWVRTLSHLGAAICITASTSVPARSFTTRGSRTVCAEGRWKKFLSLISRADSVSG*
Ga0137408_12978951 3300019789 Vadose Zone Soil MTFVSQGVSVLYLAGNSEATGFWSGTRNRRWVRTLSHPGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0193727_10003186 3300019886 Soil MTLASETMGYWSGTKNPRWVRTLSRPGAAICITASMLVLARSFTTRGSRTVCAEDRWKKFLSFISRPESASG
Ga0193727_100051910 3300019886 Soil MTFVSSCVSVRYLAGSLEATGSWSGTKNPRWVRTLSHLGAAIGITASTSALARSFTTRGSHTVYAEERWKKFLSLISRADSASG
Ga0193727_100069616 3300019886 Soil MTFVSQGVSVRYLPGNGEATGSCSGTKNPRWVRTLSHLGAVICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSTYWCDRTRHPISTFERSSAEHDPVWGKTAIGC
Ga0193727_10042515 3300019886 Soil MTLVSQGVSVCHLAGSSEATGFWSGTKSPRWVRTLSRLGVAICITASTPVLARSFTIRGSRTVCAEDRWKKFLSLISRARSASG
Ga0193727_10044696 3300019886 Soil MMLVSQCVSVRYLADHAVVTVSWSGTKNPRWVRTLSHLGADICITASTSVLARSFTTRGSGTVCAEDRWKKFLSLISRADIASG
Ga0193727_10068443 3300019886 Soil MIFVSQRMSEYYLAGNSEATGFWSGTKNPRWVRTLSRRALVICITASTSVLARSFTTRGSRAVCAGDQWKKFLSLISRADIASG
Ga0193729_10843271 3300019887 Soil MTFVSQGASVRYLAGNSEATGSWSGTRNPRWVRTLSHLGAAICITASTSVLARSFTTRGSHTVCAEDRWKKFLSLISRAASASG
Ga0193729_11528242 3300019887 Soil MTFVSQCVSVHYLAENSEATGWSGSKNPRWVRTLSHLGAAICITASTPVLATSFTARGSRTVCAEERWKKFLSLISRADIASG
Ga0193731_10423071 3300020001 Soil MTIDSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRAASASG
Ga0193730_11620591 3300020002 Soil MTFVSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTPVLATSFTARGSRTVCAEERWKKF
Ga0193755_10310892 3300020004 Soil MTLVSQGVSVCHLAGSSEATGFWSGTKSPRWVRTLSRLGVAICITASTPVLARSFTIRGSRTVCAEDRWKKFLSLISRADSASG
Ga0193726_11055372 3300020021 Soil MTFDSQSVSVHCLAGNTEATGTWSGIKNPRWVRTLSHLGAAICITASPSALARSFTTRGARTVCAGERWKKFLSLISRADIAS
Ga0193753_102931522 3300020034 Soil MAFVSQRVSVCYLAGKSEAAGFWSETKSPPWVRTLSRRGLVICITASTSVLARSFTTRGSRMVCAGDQWKKFLSLISRADIASG
Ga0193717_10256654 3300020060 Soil MMLVSQCVSARCPAGNSVAAGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISHADTAS
Ga0193716_10714262 3300020061 Soil MTVVSQGLSTRHPDGDSGATCYWSGTKNPRWVRTLSHPGAAISITASMSVLARSFTTLGSPTVCAEGRWKRFLSIISRTDSASG
Ga0179592_1000001640 3300020199 Vadose Zone Soil MTFVSPSVSVRYLTGKSEATGSWSATKNPRWVRTLSHRGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLWLISRGGTPELHLAQLKG
Ga0179592_1000014912 3300020199 Vadose Zone Soil MTFVSQGVSVCYLAGNSVATGSWSAIKNPRWVCTLSRLGAAICITASTSVLAKSFTTRGSRTVCAEDRWKKFLSLVSRASSGSG
Ga0179592_100192592 3300020199 Vadose Zone Soil MTFISQGASVRYLAGNSEATGSWSGTKNPRWVHTLSHLGAAICITASTSVLARSFITRGSLTVCAEDRWKKFLSLISRADGESG
Ga0210407_100103912 3300020579 Soil VSACYLSGNAEATGFWSEPKNPRWVRTLSHLGAAVYITASTSVLARSFTAWGSPTVCAGDQWKKFLSFLALISSADIASR
Ga0210403_100546743 3300020580 Soil MTFVSHGVSVLYLAGNSPATGYWSGNKNPRWVRTLSHPGAAICITASTLVSARSFTIRGSRTVCAEGRWKKFLSLISRADCASG
Ga0210403_101586761 3300020580 Soil MMFVSQGVSVRYLAGNSEATGFWSGTKNPRWVRTWSLLGAAICITASPSVLARSFTTRDSRTVCAEDRWKKFLSLISRADSASG
Ga0210401_100216487 3300020583 Soil VSACFLAGHSEATGYWPGTKNPRWVRTLSHPGAAICITASTSVLARSFTTRGSGTAYAEDRWKKFLSLISHGDSASTEAAAKPLRS
Ga0210406_100193806 3300021168 Soil MFISQNASARYLAGNSEAAGVWTGTTNPRWVRTLSHLGAAICITASTSVLAWSFTTRGSRMVWAEDRWKKFLWLISRAGSESGCDPTRHPIWTFER
Ga0210400_103195511 3300021170 Soil MTFVSQCVSVHYQAGNSEATGFWSGTKSPRWVRTLSHLGAAICITASTSVPARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0210391_111080252 3300021433 Soil VSACFLAGHSEATGYWPGTKNPRWVRTLSHPGAAICITASTSVLARSFTTRGSGTAYAEDRWKKFLSLISHGDSASTEAAAK
Ga0210392_111967801 3300021475 Soil MRAYQGVSVRCQTGNSEATGYWSGTKNPRWVRTLSHPGAAICITASTSVLAKLFITQGSRAVCVEDRWKKFLSLVSLAGSASG
Ga0210392_112195731 3300021475 Soil VSACYLSGNAEATGFWSEPKNPRWVRTLSHLGAAVYITASTSVLARSFTAWGSPTVCAGDQWKKFLS
Ga0210402_101965432 3300021478 Soil MMFVSQGVSVRYLAGNSEATGSWSGTKNSRWVRTLSHLGEAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSRISRADSACG
Ga0210402_104689941 3300021478 Soil MTFVSQGVSVRYLAGNSEATGSWSGIRNPRWVRTLSHLGAAICITASTSAPARLFTTRGSRMVCAEDRWKKFLSLISRADSTSG
Ga0210402_109338761 3300021478 Soil MTFVSQGVSARDLAGNSEAIGSWSGTTNLRWVRTLSHLGAAICITASTSVLAWSCTTRGSRTVCAEDRWKKFLSLIS
Ga0210410_101368772 3300021479 Soil MTLDTQGVSAPYLAGNSEVTGYWSGTKNPRWVRTLSHPAAAICITASTSVLARSFTTRGTHTVCAEDRWKKFLSLVSRADITSG
Ga0212123_100077928 3300022557 Iron-Sulfur Acid Spring MTFVSQGVSVRYLAGNSEATGFWSGSKNPRWVRTLSHLGAAICITASTSVLARSCTTRGSRTVCAEDRWKKFLSLISRAESASG
Ga0242654_103361092 3300022726 Soil SQRARARRLPRARRRAVSACYLSGNAEATGFWSEPKNPRWVRTLSHLGAAVYITASTSVLARSFTAWGSPTVCAGDQWKKFLSFLALISSADIASR
Ga0137417_10900282 3300024330 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGSWPGTRNPRWVRTLSHLGAAICITASTSVLARSCTTQGSRTVCAEDRWKRFLSLISRANSTSG
Ga0209240_10364892 3300026304 Grasslands Soil MTFVSQGVSVRYLAGNSEAIGYWSGTKNPRWVRTLSHPGAAICITSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0209647_10847792 3300026319 Grasslands Soil MTFVSQGVSVRYLAGNSEATGSWSGSKNPRWVRTLSHLGAAICITASTSVLAKSFTTRGSRTVCVEDRWKKFLSLISRADSASG
Ga0209131_10303723 3300026320 Grasslands Soil MTSVSQGVSVRYLAGNPEAAGFWSETKNPRWVRTLSHLGAAICITASTSVLARSCTTQGSRTVCAEDRWRRFLSLISRANSTSG
Ga0257147_10238991 3300026475 Soil MMFVSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRAVCAEDR
Ga0257168_10882012 3300026514 Soil MTFVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADGASGCDRTRHPISTFERSSAEHDPVWGKTAIGC
Ga0209648_103150902 3300026551 Grasslands Soil MTFVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADGASG
Ga0179593_10718283 3300026555 Vadose Zone Soil MTLISQSVSVRHLDGNSEASGFWSGTKNPRWARTLSHLGAAICITASTSVPATLFTTRDSRTVCAEDRWKRFLSLISRADSASG
Ga0179593_11512041 3300026555 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGFWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSACG
Ga0179587_105211362 3300026557 Vadose Zone Soil MTLVSQGVSVRYLTGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWK
Ga0209524_10521372 3300027521 Forest Soil MTFVFQGVSVRYFAGDSVATGSWSGTKNPRWVRTLSHLGAAICITASTSVPARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0208985_10677912 3300027528 Forest Soil MTIVSQSASVRHLTGNSEATGCWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTQGSRTVCAEDRWKKFLSLISRTDSASG
Ga0209329_10145123 3300027605 Forest Soil MTFVSQDVSVHYLAGNSEATGSWSGTKNLRWARTLSHLGAAICITASTSVPARSFTTRGSPTVCAEDRWKKFLSLISRADSTSG
Ga0209736_10222332 3300027660 Forest Soil MRAYQGVSVRCQTENSEATGYWSGTKNPRWVRTLLHPGAAICITASTSALAKSFITQGSRAVCVEDRWKKFLSLISLAGNASG
Ga0209588_10380702 3300027671 Vadose Zone Soil MTLVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSTSG
Ga0209118_10994741 3300027674 Forest Soil MTFVSQGVSVRYLAGSSVANGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGARAVCAENRWKKFLSLISRADSASWC
Ga0209626_10203242 3300027684 Forest Soil MKLLRLCSDEETTVTFVSQRVSVRYQAGNSDATSYWSGTEDPRWVRTLSHPGAAICITASTSVLARSFITRGSRTVCAEDRWKKFLSLISRSGSASG
Ga0209626_11628342 3300027684 Forest Soil MTIDSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSALARSFTTRGSRTVCAEDRWKRFLSLISRADITS
Ga0209773_103158272 3300027829 Bog Forest Soil MSFISQGVSVRYLAGKSDATGYWSGSKNPRWGRTLSPPGAAICITASTSVLTRSFTTRDSRTVCTEDRWKRFLSLI
Ga0209701_100551404 3300027862 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0209488_100512763 3300027903 Vadose Zone Soil MTLVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSACG
Ga0209006_100210634 3300027908 Forest Soil MTIVSQGASVRLLGGNAEATCSWSGTKNPRWARTLSHVGEAICITASTSALARSFTTRGSHTVFAEGRWKKFLSLISRAGGASGCDRTRHPISTFERSFAGRNPVWGKTAIGC
Ga0209006_100885742 3300027908 Forest Soil MIFVSQGLSVRYLAGNSEATGSWSGTKNPRWVRTLSHLGAAISITASTSVLARSFTTRDSRTVYAEDRWKKFLSLISRADSASG
Ga0209006_103140902 3300027908 Forest Soil MMFISQGVSVRCLAGNSEATGSWSGTKNLRWVRTLSHLGAAICITASTSVLATSFTTRDACTVCAENRWKKFLSLISRADSASGSDRTRHPISTFERSSAGHDPVWGKTVIGC
Ga0209698_108242832 3300027911 Watersheds MTFVSQGVSVRQLAGNSETSAFWSTTKNPRWVRTLSHPGAAIYIMVSTSVLARSFITRGSRTVCAEDRWKKFLSLISRADITSG
Ga0209526_100446512 3300028047 Forest Soil MMYVSQGVSVRYLAGNSEATGSWSGTKNLRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSASG
Ga0137415_102579192 3300028536 Vadose Zone Soil MTFVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRADSVSG
Ga0170834_1117860231 3300031057 Forest Soil MTFVSQGVSVRYLAGNSEATGSWSGTENPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSVISRADSASG
Ga0170824_1107377054 3300031231 Forest Soil MYASHDPSVRYQAGNSVVTGSWSETRNPRWVRTLSHLGAANCITASTSVLAKSFTTRGSRTVCAEDRWKTFLSLISRTDSASG
Ga0302324_1001509622 3300031236 Palsa MTFVSQGVSAGYPGGKSEATGSWSGTKNPRWVRTLSHLGAAICITGSTSVLARSFTTRGSHTVCAEDRWKKFLSIISRAGGASG
Ga0170820_149592892 3300031446 Forest Soil MMFVSQGVSARCLAGNSEATGYWSGTKNPRWVRTLSHPGADICITASTSVLVRSFTTRGLRTVCAEDRWKKFLSLLSRAESPDHPCADIFVVPG
Ga0170818_1083436312 3300031474 Forest Soil MTFVSQSVSMRHLAGNSEVTGSWSGTRNPRWVRTLSHLGAAICITASTSGLAKSFTTRDSRTACAEDRWKKFLSLISRADSASG
Ga0307478_100522273 3300031823 Hardwood Forest Soil MTFVSQGVSARCLAGNSEATGAWSGTRNPRWVRTLSHLGAAICITASTSVLARSFTTRGSRTVCAEDRWKKFLSLISRAASASGCDRTRPPISTFERSSAEHDPVWGKPAIGC
Ga0307470_1000048111 3300032174 Hardwood Forest Soil MTFVSQGVSVRYPAGNSVAAGSWSGTKNPRWVRTLSHLGAAICITASTSVLARSFTTRDSRTVCAEDRWKKFLSLISRAASASG
Ga0307470_100010044 3300032174 Hardwood Forest Soil MTFVTQGVSVRYLAGNSEVTGAWSGTKNPRWVRTLSHLGAAICITASTSVPARSFTTRGSRAVCAEDRWKKFLSLISRADSVSGRDPTPHPISTFERSSAEHDPVWARTAIDC
Ga0307470_100399434 3300032174 Hardwood Forest Soil MSASTWAAMTPTKNRRWVRTLSHPGAATCITASTVLARSFTIRGSRTVCAEGRWKKFLSLISRAGSASG
Ga0307471_1025593551 3300032180 Hardwood Forest Soil MTFVSQGVSVRYLAGHPEWTGSWSGTTNPRWVRTLSHLGAAICITASTSVLARSFTTRDSRTVCAEDR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice