NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F079374

Overview

Basic Information
Family ID F079374
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 74 residues
Representative Sequence MAKKDKPEEHGLPSLSLVFGYIAIKELQRLEDRVRVLSRLGYGNAEIAAICDTTPASVRTLKSGLKKSKRPRRRK
Number of Associated Samples 98
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 86.92 %
% of genes near scaffold ends (potentially truncated) 13.79 %
% of genes from short scaffolds (< 2000 bps) 61.21 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.63

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (87.069 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (12.069 % of family members)
Environment Ontology (ENVO) Unclassified (18.966 % of family members)
Earth Microbiome Project Ontology (EMPO) Unclassified (25.862 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 40.78%, β-sheet: 0.00%, Coil/Unstructured: 59.22%

Predicted 3D Structure

pTM-score: 0.63

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
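
The cutoff stated above can be written as a small check. This is only a sketch: the 0.7 threshold comes from the note on this page, while the function name and the "high" label for scores at or above the cutoff are assumptions made for illustration, not NMPFamsDB terminology.

```python
# Cutoff taken from the page's note: pTM < 0.7 is reported as a low confidence model.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7


def model_confidence(ptm_score: float) -> str:
    """Return 'low' when pTM is below the cutoff, otherwise 'high'.

    The 'high' label is an assumption; the page only names the low-confidence case.
    """
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM scores are expected to lie between 0 and 1")
    return "low" if ptm_score < PTM_LOW_CONFIDENCE_CUTOFF else "high"


# Family F079374 reports a pTM-score of 0.63.
print(model_confidence(0.63))
```

With the reported score of 0.63 the check returns "low", which matches the label shown for this family.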


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 116 Family Scaffolds
PF13384 | HTH_23 | 3.45
PF01381 | HTH_3 | 2.59
PF14338 | Mrr_N | 1.72
PF02735 | Ku | 1.72
PF06271 | RDD | 0.86
PF00278 | Orn_DAP_Arg_deC | 0.86
PF01850 | PIN | 0.86
PF01867 | Cas_Cas1 | 0.86
PF13005 | zf-IS66 | 0.86
PF10137 | TIR-like | 0.86
PF13641 | Glyco_tranf_2_3 | 0.86
PF01939 | NucS | 0.86
PF00082 | Peptidase_S8 | 0.86
PF13683 | rve_3 | 0.86
PF04326 | AlbA_2 | 0.86
PF01568 | Molydop_binding | 0.86
PF05960 | DUF885 | 0.86
PF10047 | DUF2281 | 0.86
PF00112 | Peptidase_C1 | 0.86
PF06439 | 3keto-disac_hyd | 0.86
PF00325 | Crp | 0.86
PF05362 | Lon_C | 0.86
PF02844 | GARS_N | 0.86
PF08843 | AbiEii | 0.86
PF01597 | GCV_H | 0.86
PF00873 | ACR_tran | 0.86
PF01979 | Amidohydro_1 | 0.86
PF09549 | RE_Bpu10I | 0.86
PF00583 | Acetyltransf_1 | 0.86
PF03129 | HGTP_anticodon | 0.86
PF01936 | NYN | 0.86
PF09509 | Hypoth_Ymh | 0.86
PF07751 | Abi_2 | 0.86
PF16277 | DUF4926 | 0.86
PF00557 | Peptidase_M24 | 0.86
PF02371 | Transposase_20 | 0.86
PF01406 | tRNA-synt_1e | 0.86
PF04480 | DUF559 | 0.86
PF12686 | DUF3800 | 0.86
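
The frequencies above are percentages of the 116 family scaffolds, so they can be converted back to scaffold counts. The sketch below assumes each value is a whole-number count divided by 116 and rounded to two decimals; the function name is hypothetical and used only for illustration.

```python
TOTAL_FAMILY_SCAFFOLDS = 116  # from "% Frequency in 116 Family Scaffolds"


def scaffold_count(percent: float, total: int = TOTAL_FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count.

    Assumes the listed percentage is count / total * 100, rounded to two decimals.
    """
    return round(percent / 100 * total)


# The four distinct frequencies that appear in the Pfam table.
for pct in (3.45, 2.59, 1.72, 0.86):
    print(f"{pct:.2f} % -> {scaffold_count(pct)} scaffold(s)")
```

Under that assumption, 3.45 % corresponds to 4 scaffolds, 2.59 % to 3, 1.72 % to 2, and 0.86 % to a single scaffold, so most neighboring domains were observed only once.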

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 116 Family Scaffolds
COG1273 | Non-homologous end joining protein Ku, dsDNA break repair | Replication, recombination and repair [L] | 1.72
COG1166 | Arginine decarboxylase (spermidine biosynthesis) | Amino acid transport and metabolism [E] | 0.86
COG4823 | Abortive infection bacteriophage resistance protein | Defense mechanisms [V] | 0.86
COG4805 | Uncharacterized conserved protein, DUF885 family | Function unknown [S] | 0.86
COG3547 | Transposase | Mobilome: prophages, transposons [X] | 0.86
COG3480 | Predicted secreted protein YlbL, contains PDZ domain | Signal transduction mechanisms [T] | 0.86
COG2865 | Predicted transcriptional regulator, contains HTH domain | Transcription [K] | 0.86
COG2253 | Predicted nucleotidyltransferase component of viral defense system | Defense mechanisms [V] | 0.86
COG1750 | Predicted archaeal serine protease, S18 family | General function prediction only [R] | 0.86
COG1714 | Uncharacterized membrane protein YckC, RDD family | Function unknown [S] | 0.86
COG1637 | Endonuclease NucS, RecB family | Replication, recombination and repair [L] | 0.86
COG1518 | CRISPR-Cas system-associated integrase Cas1 | Defense mechanisms [V] | 0.86
COG1432 | NYN domain, predicted PIN-related RNAse, tRNA/rRNA maturation | General function prediction only [R] | 0.86
COG0018 | Arginyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG1067 | Predicted ATP-dependent protease | Posttranslational modification, protein turnover, chaperones [O] | 0.86
COG0509 | Glycine cleavage system protein H (lipoate-binding) | Amino acid transport and metabolism [E] | 0.86
COG0466 | ATP-dependent Lon protease, bacterial type | Posttranslational modification, protein turnover, chaperones [O] | 0.86
COG0442 | Prolyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0441 | Threonyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0423 | Glycyl-tRNA synthetase, class II | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0215 | Cysteinyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0151 | Phosphoribosylamine-glycine ligase | Nucleotide transport and metabolism [F] | 0.86
COG0143 | Methionyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0124 | Histidyl-tRNA synthetase | Translation, ribosomal structure and biogenesis [J] | 0.86
COG0019 | Diaminopimelate decarboxylase | Amino acid transport and metabolism [E] | 0.86


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 89.66 %
Unclassified | root | N/A | 10.34 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000364|INPhiseqgaiiFebDRAFT_105857382 | All Organisms → cellular organisms → Bacteria | 4980
3300001213|JGIcombinedJ13530_103217872 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 610
3300001380|JGI1356J14229_10199606 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 573
3300001870|JGI24129J20441_1091690 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 566
3300002908|JGI25382J43887_10040393 | All Organisms → cellular organisms → Bacteria | 2536
3300003312|P12013IDBA_1033864 | All Organisms → cellular organisms → Bacteria | 1523
3300003319|soilL2_10315546 | All Organisms → cellular organisms → Bacteria | 1365
3300004063|Ga0055483_10128140 | All Organisms → cellular organisms → Bacteria | 800
3300004463|Ga0063356_100249039 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 2151
3300004463|Ga0063356_101334230 | All Organisms → cellular organisms → Bacteria | 1051
3300005172|Ga0066683_10120958 | All Organisms → cellular organisms → Bacteria | 1597
3300005174|Ga0066680_10654149 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 651
3300005529|Ga0070741_11376273 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 587
3300005832|Ga0074469_10162615 | All Organisms → cellular organisms → Bacteria | 1162
3300005836|Ga0074470_11830455 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1113
3300006755|Ga0079222_10053069 | All Organisms → cellular organisms → Bacteria | 1896
3300006865|Ga0073934_10000663 | All Organisms → cellular organisms → Bacteria | 95563
3300006871|Ga0075434_100444724 | All Organisms → cellular organisms → Bacteria | 1317
3300007351|Ga0104751_1099613 | All Organisms → cellular organisms → Bacteria | 1174
3300009038|Ga0099829_10005610 | All Organisms → cellular organisms → Bacteria | 7653
3300009083|Ga0105047_11463672 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 513
3300009089|Ga0099828_10992728 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 748
3300009090|Ga0099827_10439125 | All Organisms → cellular organisms → Bacteria | 1118
3300009091|Ga0102851_10403735 | All Organisms → cellular organisms → Bacteria | 1376
3300009137|Ga0066709_100339837 | All Organisms → cellular organisms → Bacteria | 2058
3300009147|Ga0114129_11898252 | All Organisms → cellular organisms → Bacteria | 722
3300009529|Ga0114919_10114922 | All Organisms → cellular organisms → Bacteria | 1949
3300010302|Ga0116202_10449219 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 573
3300010324|Ga0129297_10034687 | All Organisms → cellular organisms → Bacteria | 2450
3300010324|Ga0129297_10056851 | All Organisms → cellular organisms → Bacteria | 1872
3300010391|Ga0136847_10372608 | All Organisms → cellular organisms → Bacteria | 3859
3300010933|Ga0137936_1027015 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 658
3300011269|Ga0137392_10486799 | All Organisms → cellular organisms → Bacteria | 1025
3300011271|Ga0137393_10735380 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 844
3300012201|Ga0137365_10499873 | All Organisms → cellular organisms → Bacteria | 894
3300012204|Ga0137374_10177894 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1861
3300012206|Ga0137380_10057491 | All Organisms → cellular organisms → Bacteria | 3562
3300012208|Ga0137376_11174999 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 656
3300012211|Ga0137377_11246648 | All Organisms → cellular organisms → Bacteria | 673
3300012360|Ga0137375_11043948 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 639
3300012925|Ga0137419_10910961 | All Organisms → cellular organisms → Bacteria | 724
3300012929|Ga0137404_11119725 | All Organisms → cellular organisms → Bacteria | 723
3300012964|Ga0153916_10015432 | All Organisms → cellular organisms → Bacteria | 6302
3300012964|Ga0153916_10674688 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1112
3300014262|Ga0075301_1122646 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 582
3300014839|Ga0182027_10548342 | All Organisms → cellular organisms → Bacteria | 1253
3300014881|Ga0180094_1028354 | All Organisms → cellular organisms → Bacteria | 1125
3300015359|Ga0134085_10437246 | All Organisms → cellular organisms → Bacteria | 591
3300017941|Ga0187850_10014221 | All Organisms → cellular organisms → Archaea | 4871
3300018063|Ga0184637_10028940 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 3344
3300018070|Ga0184631_10043539 | All Organisms → cellular organisms → Bacteria | 1618
3300018077|Ga0184633_10350307 | All Organisms → cellular organisms → Bacteria | 745
3300018079|Ga0184627_10038089 | All Organisms → cellular organisms → Bacteria | 2454
3300018086|Ga0187769_10621330 | All Organisms → cellular organisms → Bacteria | 817
3300018088|Ga0187771_10043656 | All Organisms → cellular organisms → Bacteria | 3467
3300018090|Ga0187770_10349068 | All Organisms → cellular organisms → Bacteria | 1156
3300019788|Ga0182028_1551986 | All Organisms → cellular organisms → Bacteria | 1917
3300020074|Ga0194113_10185271 | All Organisms → cellular organisms → Bacteria | 1685
3300021090|Ga0210377_10262210 | All Organisms → cellular organisms → Bacteria | 1073
3300022551|Ga0212089_10028360 | All Organisms → cellular organisms → Bacteria | 3499
3300022551|Ga0212089_10052126 | All Organisms → cellular organisms → Bacteria | 2379
3300025164|Ga0209521_10530086 | All Organisms → cellular organisms → Bacteria | 611
3300025310|Ga0209172_10002716 | All Organisms → cellular organisms → Bacteria | 29958
3300025313|Ga0209431_11012589 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 586
3300025952|Ga0210077_1086073 | All Organisms → cellular organisms → Bacteria | 653
3300026296|Ga0209235_1191488 | All Organisms → cellular organisms → Bacteria | 737
3300026552|Ga0209577_10705541 | All Organisms → cellular organisms → Bacteria | 570
3300027725|Ga0209178_1012470 | All Organisms → cellular organisms → Bacteria | 2660
3300027740|Ga0214474_1048468 | All Organisms → cellular organisms → Bacteria | 1643
3300027819|Ga0209514_10056961 | All Organisms → cellular organisms → Bacteria | 2642
3300027846|Ga0209180_10009752 | All Organisms → cellular organisms → Bacteria | 4935
3300027877|Ga0209293_10182620 | All Organisms → cellular organisms → Bacteria | 1019
3300027896|Ga0209777_10213520 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1542
3300027896|Ga0209777_10235842 | All Organisms → cellular organisms → Bacteria | 1448
3300027900|Ga0209253_10854500 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 642
3300027917|Ga0209536_101414464 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 848
3300027964|Ga0256864_1025915 | All Organisms → cellular organisms → Bacteria | 1542
3300028804|Ga0268298_10162709 | All Organisms → cellular organisms → Bacteria | 1250
3300028814|Ga0307302_10561582 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 567
3300029288|Ga0265297_10216496 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1388
3300030613|Ga0299915_10000002 | All Organisms → cellular organisms → Bacteria | 314007
3300030616|Ga0272442_10554299 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 733
3300031576|Ga0247727_10003363 | All Organisms → cellular organisms → Bacteria | 35370
3300031576|Ga0247727_10010686 | All Organisms → cellular organisms → Bacteria | 16205
3300031576|Ga0247727_10021772 | All Organisms → cellular organisms → Bacteria | 9653
3300031576|Ga0247727_10034606 | All Organisms → cellular organisms → Bacteria | 6788
3300031576|Ga0247727_10035826 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 6617
3300031576|Ga0247727_10078678 | Not Available | 3647
3300031707|Ga0315291_10026790 | All Organisms → cellular organisms → Bacteria | 6773
3300031707|Ga0315291_10041784 | All Organisms → cellular organisms → Bacteria | 5247
3300031772|Ga0315288_10573922 | All Organisms → cellular organisms → Bacteria | 1094
3300031949|Ga0214473_10031149 | All Organisms → cellular organisms → Bacteria | 6257
3300031952|Ga0315294_10000756 | Not Available | 34949
3300031952|Ga0315294_10234153 | All Organisms → cellular organisms → Archaea → Asgard group → Candidatus Thorarchaeota → Candidatus Thorarchaeota archaeon | 1803
3300031965|Ga0326597_10037405 | All Organisms → cellular organisms → Bacteria | 6141
3300031965|Ga0326597_10842318 | Not Available | 946
3300031999|Ga0315274_10007510 | All Organisms → cellular organisms → Bacteria | 15783
3300031999|Ga0315274_10173373 | All Organisms → cellular organisms → Bacteria | 2708
3300031999|Ga0315274_11855449 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 550
3300032118|Ga0315277_10735146 | All Organisms → cellular organisms → Bacteria | 943
3300032163|Ga0315281_11022631 | All Organisms → cellular organisms → Bacteria | 836
3300032163|Ga0315281_11268015 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 732
3300032173|Ga0315268_10079577 | All Organisms → cellular organisms → Bacteria | 3087
3300032893|Ga0335069_11287359 | All Organisms → cellular organisms → Bacteria | 796
3300033417|Ga0214471_10020622 | All Organisms → cellular organisms → Bacteria | 5241
3300033433|Ga0326726_10425567 | All Organisms → cellular organisms → Bacteria | 1262
3300033487|Ga0316630_11594110 | All Organisms → cellular organisms → Bacteria | 591



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 12.07%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 10.34%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 6.03%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 5.17%
Lake Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Lake Sediment | 3.45%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 3.45%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.45%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.45%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 3.45%
Freshwater Lake Sediment | Environmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Lake Sediment | 2.59%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.59%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 1.72%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 1.72%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 1.72%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 1.72%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 1.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.72%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.72%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.72%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.72%
Fen | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Fen | 1.72%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.72%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.72%
Wetland | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Wetland | 0.86%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.86%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.86%
Anoxic Lake Water | Environmental → Aquatic → Freshwater → Lake → Unclassified → Anoxic Lake Water | 0.86%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.86%
Peatland | Environmental → Aquatic → Freshwater → Wetlands → Bog → Peatland | 0.86%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.86%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 0.86%
Freshwater | Environmental → Aquatic → Freshwater → Ice → Glacial Lake → Freshwater | 0.86%
Marine Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment | 0.86%
Deep Subsurface | Environmental → Aquatic → Marine → Oceanic → Sediment → Deep Subsurface | 0.86%
Wetland | Environmental → Aquatic → Marine → Wetlands → Sediment → Wetland | 0.86%
Marine Sediment | Environmental → Aquatic → Marine → Wetlands → Sediment → Marine Sediment | 0.86%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.86%
Marine Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Marine Sediment | 0.86%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.86%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.86%
Ore Pile And Mine Drainage Contaminated Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Ore Pile And Mine Drainage Contaminated Soil | 0.86%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.86%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 0.86%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.86%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.86%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.86%
Deep Subsurface Aquifer | Environmental → Terrestrial → Deep Subsurface → Aquifer → Unclassified → Deep Subsurface Aquifer | 0.86%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.86%
Landfill Leachate | Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate | 0.86%
Activated Sludge | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Activated Sludge | 0.86%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300001213 | Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly) | Environmental
3300001380 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW37 contaminated, 5.8 m | Environmental
3300001870 | Arctic peat soil from Barrow, Alaska - Barrow Graham LP Ref core NGADG0002-211 | Environmental
3300002104 | Groundwater microbial communities from Rifle, Colorado - Rifle CSP2_plank lowO2_0.2 | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300003312 | Ore pile and mine drainage contaminated soil microbial communities from Mina do Sossego, Brazil - P1 sample | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300004063 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_CattailNLB_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005832 | Microbial communities from Baker Bay sediment, Columbia River estuary, Washington - S.41_BBB | Environmental
3300005836 | Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300007351 | Combined Assembly of Gp0115775, Gp0115815 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009083 | Freshwater microbial communities from Lake Fryxell liftoff mats and glacier meltwater in Antarctica - MAT-04 (megahit assembly) | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009091 | Freshwater wetland microbial communities from Ohio, USA, analyzing the effect of biotic and abiotic controls - Mud 3 Core 4 Depth 3 metaG (Illumina Assembly) | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009529 | Deep subsurface microbial communities from Black Sea to uncover new lineages of life (NeLLi) - Black_00105 metaG | Environmental
3300010302 | Anoxic lake water microbial communities from Lake Kivu, Rwanda to study Microbial Dark Matter (Phase II) - Lake Kivu water 325m metaG | Environmental
3300010324 | Lake sediment bacterial and archeal communities from Gulf of Boni, Indonesia to study Microbial Dark Matter (Phase II) - ?I18A1 metaG | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300010933 | Marine sediment microbial communities from North Pond, Atlantic Mid-Ocean Ridge - NP_1383E | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012964 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 4 metaG | Environmental
3300014262 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1 | Environmental
3300014839 | Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (Illumina Assembly) | Environmental
3300014881 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1Da | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300017941 | Peatland microbial communities from SPRUCE experiment site at the Marcell Experimental Forest, Minnesota, USA - June2016WEW_4_150 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018070 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_90_b1 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018086 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_10_MG | Environmental
3300018088 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP15_10_MG | Environmental
3300018090 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0116_SJ02_MP02_20_MG | Environmental
3300019788 | Permafrost microbial communities from Stordalen Mire, Sweden - 712E1D metaG (PacBio error correction) | Environmental
3300020074 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015017 Mahale Deep Cast 200m | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300022551 | Boni_combined assembly | Environmental
3300025159 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 3 | Environmental
3300025164 | Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4 | Environmental
3300025310 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG (SPAdes) | Environmental
3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental
3300025952 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushMan_CattailNLB_D2 (SPAdes) | Environmental
3300026296 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes) | Environmental
3300026498 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-49-A | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027725 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes) | Environmental
3300027740 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT95D214 HiSeq | Environmental
3300027819 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW37 contaminated, 5.8 m (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027877 | Wetland microbial communities from Old Woman Creek Reserve in Ohio, USA - Mud_0915_D1 (SPAdes) | Environmental
3300027896 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies -HBP12 HB (SPAdes) | Environmental
3300027900 | Freshwater lake sediment microbial communities from the University of Notre Dame, USA, for methane emissions studies - BRP12 BR (SPAdes) | Environmental
3300027917 | Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-2-8_12 (SPAdes) | Environmental
3300027964Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeqEnvironmentalOpen in IMG/M
3300028804Activated sludge microbial communities from WWTP in Nijmegen, Gelderland, Netherland - WWTP WeurtEngineeredOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300029288Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 137-91EngineeredOpen in IMG/M
3300030613Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT92D227EnvironmentalOpen in IMG/M
3300030616Marine sediment archaeal communities from Little Sippewissett salt marsh, Falmouth, MA, United States - SSM-Form-2OEnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031707Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G12_20EnvironmentalOpen in IMG/M
3300031772Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G11_20EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300031952Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G13_40EnvironmentalOpen in IMG/M
3300031965Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185EnvironmentalOpen in IMG/M
3300031999Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G02_20EnvironmentalOpen in IMG/M
3300032118Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G05_15EnvironmentalOpen in IMG/M
3300032163Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - YL17G07_0EnvironmentalOpen in IMG/M
3300032173Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C1_topEnvironmentalOpen in IMG/M
3300032893Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.1EnvironmentalOpen in IMG/M
3300033407Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033487Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_May_M1_C1_D6_AEnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
INPhiseqgaiiFebDRAFT_1058573826 | 3300000364 | Soil | MAKKLNDEHGLPSLALVFGYIAVKELPRLEDKIRILARLGYGNPEIELICDTTAATVRTLKAKAKKERKK*
JGIcombinedJ13530_1032178722 | 3300001213 | Wetland | MAKTQAQDEHGLPSLSLVFGYIAVKELRLLEDRVRVLARLGYGNAEIAAICDTSPAVVRTLKSAAKRRPAKKSRRRK*
JGI1356J14229_101996062 | 3300001380 | Groundwater | MPTNQSAEEHGLPSLSLVFGYIAVKELRLLEDRVRVLARLGYGNAEIATICDTSPGAVRTLKSAAKRKPTNRSQRRKK*
JGI24129J20441_10916901 | 3300001870 | Arctic Peat Soil | MAKKPISEEHALPSLSLVFGYIAVKDLQRLEDRVALLARLGYGNIEIAKICGTTSDTVSTLKARSKRAKAKKSKTASKKVRGEE
C687J26621_100507434 | 3300002104 | Groundwater | MPRRASAKDKEHGLPSLALVFGYIAVKDLQRLEDRIAVLNRLGYGNAEMALICDTTPGSISTLKSRGAARRRTRR*
JGI25382J43887_100403933 | 3300002908 | Grasslands Soil | MAKKDKPEEHGLPSLSLVFGYIAIKELQRLEDRVRVLSRLGYGNAEIAAICDTTPASVRTLKSGLKKSKRPRRRK*
P12013IDBA_10338644 | 3300003312 | Ore Pile And Mine Drainage Contaminated Soil | MALTTGRKRKPPASEEHALPSLSLVFGYIAVKELQRLEDKVAVLARLGYGNSEIATICNTTPGTVAPIKSRLKTRR*
soilL2_103155463 | 3300003319 | Sugarcane Root And Bulk Soil | MAGKTVEDHGLPSLALVFGYIAVKELQRLEDRIDVLTRLGYGNAEVATICNTTPGTVRTIKSTTKGKKVGRKK*
Ga0055483_101281403 | 3300004063 | Natural And Restored Wetlands | MIKKARTEEHGLPSLALVFGYIAVKELQGLADRVAVLSRLGYGNEEIARICGKNPNSIRAMKSKLGKGKQSRGTK*
Ga0063356_1002490393 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MARNDEGEEPKLPSLALVFGYIAVKELQRIEDRIPVLSRLGYGNAEIAAICGTTPGSVRTIKSNIKKTKRPKRSRK*
Ga0063356_1013342303 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MARNDEGEEHKLPSLALVFGYIAVKELQRIEDRIPVLTRLGYGNAEIAAICGTTPGSVRTIKSNIKKTKRPKRRRK*
Ga0066683_101209583 | 3300005172 | Soil | MAKPKPDEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNAEIAKICDSTPAAVATLKVR
Ga0066680_106541493 | 3300005174 | Soil | MPKQAVKKNEEHGLPSLALVFGYIAVKELQRLEDRVAVLSRLGYGNIEIARICGSTPASIAVLK
Ga0070694_1013880712 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MPKKASKKDKEHGLPSLSLIFGYIAVKDLQRLEDRISVLSRLGYGNAEMALICDTTPGSISTLKSRGASRRTRRTRR*
Ga0070741_113762731 | 3300005529 | Surface Soil | MAKKDAKNGEEHGLPSLALVFGYIAVKELQRLEDRVGVLSRLGYGNIEIARICGSTPASIATLKHKYH
Ga0074469_101626153 | 3300005832 | Sediment (Intertidal) | MKKAPSSDEHGLPSLSLVFGYIAVKELQRLEDRIGVLGRLGYGNAEIAIICNTTPATVRTLKSATKSKRSKALRRRK*
Ga0074470_118304552 | 3300005836 | Sediment (Intertidal) | MAKKDKAEEYGLPSLAVVFGYIAVKELQRIEDQVRVLARLGFGNAEIATICDTSAGVVRTYKSSLKKNRRSRRRQ*
Ga0079222_100530695 | 3300006755 | Agricultural Soil | MPKSDSEDHGLPSLALVFGYIAVKELQRTQDRVAVLSRLGFGNSEIALICDTTPAVVRTLKALAKKKPRKGRGKRK*
Ga0073934_1000066365 | 3300006865 | Hot Spring Sediment | MTKKQMNEDHALPSLSLVFGYTAVKELQRLEDRIAVLTRLGYGISEIATICDTTPATVRTIKSSLNKKKRNRR*
Ga0075434_1004447241 | 3300006871 | Populus Rhizosphere | MAKKQAKKKSEDRGLPSLALVFGYIAVKDLQRMDDRIKVLSRLGYGNIEMAIICGTKPATVATLKHRAKGGRE*
Ga0104751_10996132 | 3300007351 | Deep Subsurface Aquifer | MAKQSSDEHGLPSLSLVFGYIAVKELQSIEDRVRVLSRLGYGNAEIATICDTTPASVRTLKYTGKKKPARKSGRRKE*
Ga0066710_1006480182 | 3300009012 | Grasslands Soil | MAKKAPSKDVAGLPSLALVFGYIAVKDLKRLEDRIAVLSRLGYGNAEMALICDASPGTIATLKSRAARQHRRG
Ga0099829_100056106 | 3300009038 | Vadose Zone Soil | MAKKPKVKVEEHGLPSLALVFGYIAVKDLQRTEDRVVVLSRLGYGNVEIAKICDTTPAVVATLKSMAKKKPRKGRRKKQ*
Ga0105047_114636721 | 3300009083 | Freshwater | MATKNDVDEHALPSLSLIFGYIAVKELQRLEDRIAVLARLGYGNAEIAKICDTTPATVRTIKSTTKKAKNAKRQK*
Ga0099828_109927282 | 3300009089 | Vadose Zone Soil | MTKKLDNEEHGLPSLSLVFGYIAIKDLGRLDDRVKVLDRLGYGNAEIARICDTTSGTVSTLKYASKKGKKK*
Ga0099827_104391252 | 3300009090 | Vadose Zone Soil | MAKVKPTEQGLPSLSLVFGYIAVKELQRLEDRVVVLSRLGYGNAEIATICGKSPQVVATLKARAKRRTK*
Ga0102851_104037351 | 3300009091 | Freshwater Wetlands | MTRIPSSEEHGLPSLSLVFGYIAVKELRLLEDRVRVLARLGYGNAEIATICDTTPAVVRTLKSAAKRKPAKRSKRRKT*
Ga0066709_1003398371 | 3300009137 | Grasslands Soil | MAKPKPDEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNAEIAKICDSTPAAVATLKVRA
Ga0114129_118982521 | 3300009147 | Populus Rhizosphere | MPTKRAGKTEELGLPSLALVFGYIAVKDLQRMDDRVAVLARLGYGNEEMAKICGTTSATVATLKHRTNKGRRR*
Ga0114919_101149221 | 3300009529 | Deep Subsurface | GLPSLSLVFGYIAVKDLQRLEDRVTVLSRLGYGANEIATICDTTPASVHTLRSVAKKSKHSWRSR*
Ga0116202_104492191 | 3300010302 | Anoxic Lake Water | MAKDNLSEEHALPSLSLVFGYIAVKELQRLEDRVRILSRLGYGNAEIAIICDTTPAAVRTLKSAAKKKKTAKKPKEAQE*
Ga0129297_100346874 | 3300010324 | Lake Sediment | MSKVKPDKEHGLPSLSLVFGYIAVKELQDKIEQVGVLSRLGYGTAEIARICDTTPESVRVLKSVGKKKVRRRARKRKKTR*
Ga0129297_100568512 | 3300010324 | Lake Sediment | MSKVKSDKEHGLPSLSLVFGYIAVKEIQDKIEQVGVLSRLGYGTAEIARICDTTPESVRVLKSVGKKKVRRRARKRKKTR*
Ga0136847_103726084 | 3300010391 | Freshwater Sediment | MATKRDNEEHALPSLSLVFGYIAVKELQRLEDRIAVLARLGYGNAEIAAICGTTPATVRTIKSSTKKAKNTRRQK*
Ga0137936_10270152 | 3300010933 | Marine Sediment | MRKKKPLEEHGLPSLSLVFGYIAIKEMQRLEDRVKVLARIGYGNAEIARICDTTPATVRTLKSAIKRGSKK*
Ga0137392_104867991 | 3300011269 | Vadose Zone Soil | MAKPKGKVEEHGLPSLALVFGYIAVKELQRLEDRIVVLSRLGYGTAEIAAICDTTPAAVRTLRSVAKKSKKPRPGKRGRKK*
Ga0137393_107353802 | 3300011271 | Vadose Zone Soil | MTKKPDNEEHGLPSLSLVFGYIAIKDLGRLDDRVKVLDRLGYGNAEIARICDTTSGTVSTLKYASKKGKKK*
Ga0137365_104998732 | 3300012201 | Vadose Zone Soil | MAKAEPNEDGLPSLSLVFGYIAVKELQRMEDRVAVLARLGYGNIGIAKICGSTPAAVATLKVRAKRRRSK*
Ga0137374_101778943 | 3300012204 | Vadose Zone Soil | MKEIQMVKQSADEHGLPSLSLVFGYIAVKELQSIEDRVRVLSRLGYGNAEIAIICDTTPASVRTLKYTGKKKPAKKSGRRKA*
Ga0137380_100574914 | 3300012206 | Vadose Zone Soil | MPKATPQEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNVEIAKICGSTPAAVGSLKVRAKRRRMK*
Ga0137376_111749991 | 3300012208 | Vadose Zone Soil | MENKDEHALPSLALVFGYIAVKELQRLEDRIAVLTRLGYGNAEIARICDTTPASVRTLKSKAKKSRRK*
Ga0137377_112466481 | 3300012211 | Vadose Zone Soil | MAKVTPQEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNVEIAKICGSTPAAVGSLKVRAKRRRMK*
Ga0137375_110439483 | 3300012360 | Vadose Zone Soil | VKQSADEHGLPSLSLVFGYIAVKELQSIEDRVRVLSRLGYGNAEIAIICDTTPASVRTLKYTGKKKPAKKSGRRKA*
Ga0137419_109109613 | 3300012925 | Vadose Zone Soil | MAKAKPKEDGLPSLSLVFGYIAVKELQRMEDRVAVLARLGYGNIGIATICGSTPAAVATLKVRAKRRRSK*
Ga0137404_111197252 | 3300012929 | Vadose Zone Soil | MAKAKTKEHGLPSLSLVFGYIAVKELQRMEDRVVVLSRLGYGNVEIATICGSTPAAVATLKVRAKSRRTK*
Ga0153916_100154323 | 3300012964 | Freshwater Wetlands | MEKKKKNEEHGLPSLSLVFGYLATKELQRLEDRVAVLSRLGYGNDEIAKICDTNVDSVRSLKSRISKRRRVRGRK*
Ga0153916_106746882 | 3300012964 | Freshwater Wetlands | MAKKEEHGLPSLSLVFGYIAVKELQRLEDRVAVLSRLGYGNAEIATICGTTPASVRTLKSKRTRRAK*
Ga0075301_11226461 | 3300014262 | Natural And Restored Wetlands | MQKNKKNEEHGLPSLALVFGYIAIKELQRPEDRVSVLSRLGYGNVEIAKICNTSPAAVAVYKHRGKGRKRRSK*
Ga0182027_105483421 | 3300014839 | Fen | MAKKKEPNEHGLPSLALVFGYIAVKELQRLEDRVAVLSRLGYGNAEIAQICGTTSASVATLKSSSNKSKKARGKR*
Ga0180094_10283543 | 3300014881 | Soil | MAKTKSIEEHGLPSLSLVFGYIAVKELQRIEDRVRVLARLGYGNAEIAKICNTTPASVRTIKSAAKNKPAKKTKGRRR*
Ga0134085_104372462 | 3300015359 | Grasslands Soil | MAKPKPDEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNAEIAKICDSTPAAVATLKVRAKKRPGKGRRKSK*
Ga0187850_100142215 | 3300017941 | Peatland | MPKQDKDEFLPSLSRVFGYVAVKELRNKKDRVKVLARLGYPNKEIAIICGTTPASVATLKALPSKKKGKHK
Ga0184637_100289402 | 3300018063 | Groundwater Sediment | MAKKVYDQEHALPSLALVFGYIAVKELQRLEDRIAVLARLGYGNAEIAKICGTTPESVSTLKSKAKKTKKRKA
Ga0184631_100435393 | 3300018070 | Groundwater Sediment | MAKIERTDDHGLPSLSLVFGYIAIKDLQRLDDRVKVLTRLGYGANEISKICDTSPATVHVMRSVAKKNKTPRRSM
Ga0184633_103503071 | 3300018077 | Groundwater Sediment | KVYDQEHALPSLALVFGYIAVKELQRLEDRIAVLARLGYGNAEIAKICGTTPESVSTLKSKAKKTKKRKA
Ga0184627_100380896 | 3300018079 | Groundwater Sediment | VYDQEHALPSLALVFGYIAVKELQRLEDRIAVLARLGYGNAEIAKICGTTPESVSTLKSKAKKTKKRKA
Ga0187769_106213302 | 3300018086 | Tropical Peatland | MKIRKIKKEHDLPSLALVFSYVAVKDLQRLEDRVAVLSRLGYGAAEIATICATTPATVRTLKSKTKGRKR
Ga0187771_100436566 | 3300018088 | Tropical Peatland | MRRTTAQDEHALPSLSLVFGYFAIKELQRLEDQVKVLARLGYGNAEIARICDTTPATVRTLKSAKKKGSRK
Ga0187770_103490681 | 3300018090 | Tropical Peatland | MKIRKIKKEHDLPSLALVFGYVAVKDLQRLEDRVAVLSRLGYGAAEIATICATTPATVRTLKSKTKGRKR
Ga0182028_15519863 | 3300019788 | Fen | MAKKKEPNEHGLPSLALVFGYIAVKELQRLEDRVAVLSRLGYGNAEIAQICGTTSASVATLKSSSNKSKKARGKR
Ga0194113_101852712 | 3300020074 | Freshwater Lake | MVKKEKIEEHGLPSLSLVFGYIAVKELQRLEDRVAVLSRLGYGNAEIAIICDTTPATVRTLKSGLSKGKRSRRGK
Ga0210377_102622101 | 3300021090 | Groundwater Sediment | PGEHGLPSLSLVFGYLAVKELQSIEDRVRVLSRLGYGNAEIAIICDTTPASVRTLKYTGKKKPTKKSGRRKA
Ga0210384_100466258 | 3300021432 | Soil | MAKKASKKKASKKDKEHGLPSLSLVFGYIAVKDLQRLEDRIAVLSRLGYGNAEMALICDTTPGSISTLKSRGAARRTRRTRR
Ga0212089_100283604 | 3300022551 | Lake Sediment | MSKVKPDKEHGLPSLSLVFGYIAVKELQDKIEQVGVLSRLGYGTAEIARICDTTPESVRVLKSVGKKKVRRRARKRKKTR
Ga0212089_100521262 | 3300022551 | Lake Sediment | MSKVKSDKEHGLPSLSLVFGYIAVKEIQDKIEQVGVLSRLGYGTAEIARICDTTPESVRVLKSVGKKKVRRRARKRKKTR
Ga0209619_100927372 | 3300025159 | Soil | MPRRASAKDKEHGLPSLALVFGYIAVKDLQRLEDRIAVLNRLGYGNAEMALICDTTPGSISTLKSRGAARRRTRR
Ga0209521_105300861 | 3300025164 | Soil | KKTKEDVPGLPSLSLVFGYIAVKELQRLEDRVDVLTRLGYGAAETAKICGTTAGTVHTLRSRARRGGRRR
Ga0209172_1000271617 | 3300025310 | Hot Spring Sediment | MTKKQMNEDHALPSLSLVFGYTAVKELQRLEDRIAVLTRLGYGISEIATICDTTPATVRTIKSSLNKKKRNRR
Ga0209431_110125892 | 3300025313 | Soil | MKKQSSEEHGLPSLSLVFGYIAVKELQSVEDRVRVLSRLGYGNAEIATICDTTPATVRTLKYTGKKKPARKSGRRKV
Ga0210077_10860733 | 3300025952 | Natural And Restored Wetlands | MIKKARTEEHGLPSLALVFGYIAVKELQGLADRVAVLSRLGYGNEEIARICGKNPNSIRAMKSKLGKGKQSRGTK
Ga0209235_11914883 | 3300026296 | Grasslands Soil | LPSLALVFGYIAVKELQRLEDRVVVLSRLGYGNAEIAKICDSTPAAVATLKVRAKKRPGKGRRKSK
Ga0257156_11327242 | 3300026498 | Soil | MAKKASKKDKEHGLPSLSLVFGYIAVKDLQRLEDRIAVLSRLGYGNAEMALICDTTPGSISTLKSRGAARRTRRTRR
Ga0209577_107055412 | 3300026552 | Soil | MAKKKAKKKKTARGEESGLPSLSLVFGYIAVKELQRLEDRVNVLWRLGYGNPEIATICDTTPATVATLKARIKKSK
Ga0209178_10124702 | 3300027725 | Agricultural Soil | MPKSDSEDHGLPSLALVFGYIAVKELQRTQDRVAVLSRLGFGNSEIALICDTTPAVVRTLKALAKKKPRKGRGKRK
Ga0214474_10484682 | 3300027740 | Soil | MAKKEKIEEHGLPSLSLVFGYIAVKELQRLEDRVAVLNRLRYGNAEIATICGTTPATVRTLKSGLSRQKRSRRSK
Ga0209514_100569612 | 3300027819 | Groundwater | MPTNQSAEEHGLPSLSLVFGYIAVKELRLLEDRVRVLARLGYGNAEIATICDTSPGAVRTLKSAAKRKPTNRSQRRKK
Ga0209180_100097524 | 3300027846 | Vadose Zone Soil | MAKKPKVKVEEHGLPSLALVFGYIAVKDLQRTEDRVVVLSRLGYGNVEIAKICDTTPAVVATLKSMAKKKPRKGRRKKQ
Ga0209293_101826202 | 3300027877 | Wetland | MTRIPSSEEHGLPSLSLVFGYIAVKELRLLEDRVRVLARLGYGNAEIATICDTTPAVVRTLKSAAKRKPAKRSKRRKT
Ga0209777_102135202 | 3300027896 | Freshwater Lake Sediment | MIKNSEEHGLPSLSLVFGYIAVKELQRLEDRVVVLSRLGYGNAEIARICDTKPSSVRALRSIHRKKDTSAREK
Ga0209777_102358422 | 3300027896 | Freshwater Lake Sediment | MAKEDSLGDHALPSLSLVFGYIAVKELQRLEDKVRVLARLGYGNAEIAKICNTTLPSVRTMKSAGKNKRPSRKKKGV
Ga0209253_108545002 | 3300027900 | Freshwater Lake Sediment | MAKKQLFEEHGLPSLSLVFGYIAVKELQRLDDRIRVLARLGYGNAEIAQICNTTPAVVRTLKSAAKKKPSRKLKRRK
Ga0209536_1014144643 | 3300027917 | Marine Sediment | MPRPTENDHGLPSLALVFGYIAVKELQRLEDRIAVLARLGYGNAEIAAICDTTPGVVRTLKSARKGRKTRRK
Ga0256864_10259154 | 3300027964 | Soil | MAKKRTQPKDEHGLPSLALVFGYIAVKELQRLQDKIRILSRLGYGNAEIAAICDASPAVVAALKYRSAKSPRRRTT
Ga0268298_101627094 | 3300028804 | Activated Sludge | MATKNDNDEHALPSLSLVFGYIAVKELQRLEDRISVLARLGYGNAEIAKICDTTPATVRTIKSSTKKSKKERKQK
Ga0307302_105615821 | 3300028814 | Soil | SLRMAKVKVEEHGLPSLALVFGYIAVKELQRLEDRVVVLSRLGYGTAEIAAICDTTPATVRTLRSGAKKAKKPRPGKKGRKK
Ga0265297_102164963 | 3300029288 | Landfill Leachate | MAKEQSSDEHALPSLSLVFGYIAVKELRLLEDRIRILARLGYGNAEIATICDTTPAVVRTYKSVSKKKKTSRK
Ga0299915_10000002254 | 3300030613 | Soil | MPKKIPDEEHGLPSLSLVFGYIAVKELQRLEDRIRVLARLGYGNAEIATICNTTAATVSTLKSVAKKNRAQKARGAQQ
Ga0272442_105542991 | 3300030616 | Marine Sediment | MAKRTDTELHALPSLALVFGYIAVKELQRLEDRVSVLTRLGYGNAETATICGTTPATVSTLKSGLKKRRKKK
Ga0247727_1000336334 | 3300031576 | Biofilm | MSKKLTNEDHALPSLSLVFGYTAVKELQRLEDRIAVLVRLGYGIAEIATICDTTPATVRTIKSALNKKKKDRRTQ
Ga0247727_1001068617 | 3300031576 | Biofilm | MAIKRNNSKERALPSLALICGYIAVKELQRPQDRIAILDRLGYGIAEIATISGTTPAAVRAIKSDANKTKIVRRLKRKKVF
Ga0247727_100217729 | 3300031576 | Biofilm | MAKKQQIDDHGLPSLSLVFGYIAVKELQRLEDRIAVLERLGYGNAEIAKICDTTMATVRTIKSASKKTKRTWRQK
Ga0247727_100346064 | 3300031576 | Biofilm | MATKRDQNADHALPSLSLIFGYIAVKELQRLEDRITVLDRLGYGSAEIAQICDTTPATVHTLKSRTKKERIMRKKR
Ga0247727_100358269 | 3300031576 | Biofilm | MATKRDNSKEHALPSLSLIFGYIAVKELQRLEDRITVLDRLGYGIAEIATICDTTPAAVRAIKSGAKKTKIVRRLN
Ga0247727_100786784 | 3300031576 | Biofilm | MTNKKQNEEHGLPSLSLVFGYIAVKELQRLEDRVKVLTRLGYGNPEIAQICDTKPAVIATMKSSIKRKGKQR
Ga0315291_100267905 | 3300031707 | Sediment | MAKKLKEEDHGLPSLSLVFGYIAVKELQRLEDRVRVLSRLGYGNPEIAIICGTTAASVATVKSAAKKKPAAKSKAKGHKK
Ga0315291_100417843 | 3300031707 | Sediment | MRSVQKTDDQGLPSLSLVFGYIAIKDLQRLEDRVTVLTRLGYGANEIAKICDTSPATVFTLRSIAKNTKLSRRSR
Ga0315288_105739222 | 3300031772 | Sediment | MAKALSSEEHGLPSLSLVFGYIAVKELRLLEDRIRVLARLGYGNAEIATICDTKPTVVRTLKSRVKRKPVKKSKRRIK
Ga0214473_100311499 | 3300031949 | Soil | MANKATDQEHALPSLALVFGYIAVKELQRLEDRIAVLARLGYGNAEIATICGTTPESVSTLKARAKKARKHRRSK
Ga0315294_1000075615 | 3300031952 | Sediment | MTKTNRKKSEEHGLPSLSLVAGYIAVKELGRLEDSVEVLARLGYGNAEIATICNTTARKVKGVKIER
Ga0315294_102341532 | 3300031952 | Sediment | MKRKDKKHGLPSLSLVFGYIAVKELGRLEDRVAVLARLGYGNAEIAKICGTTPASVGTLKSRSRKKRGGISSE
Ga0326597_100374053 | 3300031965 | Soil | MAKKNEPEHALPSLALVFGYIAVKDLQRVEDRIAVLARLGYGNAEIAAICDTSTGVVRTLKSAVKKKRSKGK
Ga0326597_100777171 | 3300031965 | Soil | MAKKKARKKDEEHGLPSLALVFGYIAVKDLQSITDRVAVLRRLGYGNVEMAAICGTTPQSIATFKHLGAKRRRRL
Ga0326597_104897794 | 3300031965 | Soil | MARKASRKDEEHGLPSLALVFGYIAVKELQRLEDRVAVLSRLGYGNAEMALICGSTPASIATLKHRGATRKGRRARR
Ga0326597_108423181 | 3300031965 | Soil | MARKDRIEEQGLPSLALVFGYIAVKELQRIEDRVGVLSRLGYGSAEIAKICNTTPATVHTLRSKLKKNKQSRGSK
Ga0315274_1000751016 | 3300031999 | Sediment | MAKIVRADEHGLPSLSLIFGYIAIKELQRLEDRVAVLSRLGFGNAEIAKICDTTPATVRTLKYEIGKHKRFRRNR
Ga0315274_101733731 | 3300031999 | Sediment | MATKRDNEEHALPSLSLVFGYIAVKELQRLEDRIAVLVRLGYGNAEIATICDTTPATVRTIKSSTKKARNTRRQK
Ga0315274_118554492 | 3300031999 | Sediment | MAKIKPSEEHGLPSLSLVFGYIAIKELRLLEDRVRVLARLGYGNAEIAAICGTTPAVVGTLKSAIKRKQVKRSKRRKK
Ga0315277_107351463 | 3300032118 | Sediment | MKGKDKKHGLPSLSLVFGYIAVKELGRLEDRVAVLARLGYGNAEIAKICGTTPASVGTLKSRSRKKRGGISSE
Ga0315281_110226312 | 3300032163 | Sediment | MPKTKPPDEHGLPSLSLVFGYIAVKELQRLEDRIAVLSRLGYGNAEIATICGTTTPTVATLKSVLKKNRRTGGTR
Ga0315281_112680151 | 3300032163 | Sediment | MAKAKAEADEHGLPSLALVFGYIAVKELRRLEDRVDVLSRLGYGNMEIARICGTTTNTVAVLKSKRRHR
Ga0315268_100795777 | 3300032173 | Sediment | MAKAKAEADEHGLPSLALVFGYIAIKELRRLEDRVDVLSRLGYGNMEIARICGTTTNTVAVLKSKRRHR
Ga0335069_112873593 | 3300032893 | Soil | MARKAAIEDQGLPSLSLVFGYVAVKELQRREDQVRVLSRLGYGIGAIATICDTTPASVRSLRHLAVKKTRAKKAKGNSK
Ga0214472_107321211 | 3300033407 | Soil | AGGGRMARKASRKDEEHGLPSLALVFGYIAVKELQRLEDRVAVLSRLGYGNAEMALICGSTPASIATLKHRGATRKGRRARR
Ga0214471_100206222 | 3300033417 | Soil | MAKKPTTEDHGLPSLSLVFGYIAVKDLQRLEDRVALLARLGYGNAEIAKICGTTTDTVSTLKARAKRTKKKRKK
Ga0326726_104255672 | 3300033433 | Peat Soil | MAKKEKAEEYGLPSLALVFGYMAVKELQRIEDRVRVLSRLGYGNAEIATICDTSPAVVRTYKSSLKKNRRSRRGQ
Ga0316630_115941101 | 3300033487 | Soil | EEHGLPSLSLVFGYIAVKELQRLEDRVAVLSRLGYGNAEIATICDTTPATVRTLKSGLSKQKRSRRSK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice