NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F067156


Overview

Basic Information
Family ID F067156
Family Type Metagenome
Number of Sequences 126
Average Sequence Length 55 residues
Representative Sequence MRRRYRRLARRFRSIAFIATVLRSYREGRRRGQTRLAALRFGRAVAHWRRHNGAAHS
Number of Associated Samples 118
Number of Associated Scaffolds 126
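
As a quick sanity check on the representative sequence listed above, the following minimal Python sketch counts its residues and tallies its amino-acid composition. The helper name `composition` is illustrative only and is not part of NMPFamsDB.

```python
from collections import Counter
from typing import Dict

# Representative sequence copied from the Basic Information block above.
REPRESENTATIVE = "MRRRYRRLARRFRSIAFIATVLRSYREGRRRGQTRLAALRFGRAVAHWRRHNGAAHS"


def composition(seq: str) -> Dict[str, float]:
    """Return the fraction of each residue in seq, most frequent first.

    Hypothetical helper written for this example.
    """
    counts = Counter(seq)
    return {aa: n / len(seq) for aa, n in counts.most_common()}


if __name__ == "__main__":
    comp = composition(REPRESENTATIVE)
    print("length:", len(REPRESENTATIVE))
    # Show the three most frequent residues with their fractions.
    for aa, frac in list(comp.items())[:3]:
        print(aa, f"{frac:.1%}")
```

The representative sequence is 57 residues long, close to the family average of 55, and arginine (R) is its most frequent residue.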

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 76.07 %
% of genes near scaffold ends (potentially truncated) 30.16 %
% of genes from short scaffolds (< 2000 bps) 69.84 %
Associated GOLD sequencing projects 108
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.33


Most Common Taxonomy
Group Bacteria (61.905 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (11.111 % of family members)
Environment Ontology (ENVO) Unclassified (25.397 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (29.365 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix 55.29%, β-sheet 0.00%, Coil/Unstructured 44.71%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.33

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 126 Family Scaffolds
PF07746 | LigA | 13.49
PF13561 | adh_short_C2 | 3.97
PF01593 | Amino_oxidase | 2.38
PF02900 | LigB | 2.38
PF00535 | Glycos_transf_2 | 1.59
PF04392 | ABC_sub_bind | 1.59
PF13450 | NAD_binding_8 | 1.59
PF00248 | Aldo_ket_red | 0.79
PF13432 | TPR_16 | 0.79
PF00581 | Rhodanese | 0.79
PF00583 | Acetyltransf_1 | 0.79
PF07687 | M20_dimer | 0.79
PF02668 | TauD | 0.79
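
The frequencies above are percentages of the family's 126 scaffolds, so each one can be converted back to an approximate scaffold count. The sketch below does this arithmetic in Python. It assumes the percentages were computed over all 126 scaffolds and rounded to two decimals, and the function name `scaffolds_from_percent` is hypothetical.

```python
# "Number of Associated Scaffolds" from the Basic Information block.
FAMILY_SCAFFOLDS = 126


def scaffolds_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count.

    Assumption: the percentage was computed as count / total * 100.
    """
    return round(percent / 100 * total)


if __name__ == "__main__":
    # A few (Pfam domain, % frequency) pairs taken from the table above.
    rows = [
        ("PF07746 LigA", 13.49),
        ("PF13561 adh_short_C2", 3.97),
        ("PF00248 Aldo_ket_red", 0.79),
    ]
    for name, pct in rows:
        print(name, scaffolds_from_percent(pct))
```

Under that assumption, LigA neighbors 17 of the 126 scaffolds, and each 0.79 % entry corresponds to a single scaffold.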

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 126 Family Scaffolds
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 1.59
COG2175 | Taurine dioxygenase, alpha-ketoglutarate-dependent | Secondary metabolites biosynthesis, transport and catabolism [Q] | 0.79



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 61.90 %
Unclassified | root | N/A | 38.10 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000550|F24TB_10069139 | All Organisms → cellular organisms → Bacteria | 1854
3300002245|JGIcombinedJ26739_101527651 | Not Available | 563
3300002886|JGI25612J43240_1016194 | All Organisms → cellular organisms → Bacteria | 1102
3300002914|JGI25617J43924_10124812 | Not Available | 904
3300002914|JGI25617J43924_10350364 | Not Available | 512
3300003319|soilL2_10300968 | All Organisms → cellular organisms → Bacteria | 1253
3300003324|soilH2_10032295 | All Organisms → cellular organisms → Bacteria | 4848
3300003994|Ga0055435_10009611 | All Organisms → cellular organisms → Bacteria | 1803
3300003995|Ga0055438_10006677 | All Organisms → cellular organisms → Bacteria | 2201
3300004019|Ga0055439_10044715 | Not Available | 1182
3300004024|Ga0055436_10025314 | All Organisms → cellular organisms → Bacteria | 1472
3300004052|Ga0055490_10014256 | All Organisms → cellular organisms → Bacteria | 1764
3300004463|Ga0063356_102396319 | Not Available | 807
3300005332|Ga0066388_100351065 | All Organisms → cellular organisms → Bacteria | 2121
3300005406|Ga0070703_10025453 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1756
3300005434|Ga0070709_10199835 | All Organisms → cellular organisms → Bacteria | 1415
3300005467|Ga0070706_100145012 | All Organisms → cellular organisms → Bacteria | 2217
3300005468|Ga0070707_102037225 | All Organisms → cellular organisms → Bacteria | 541
3300005529|Ga0070741_10003015 | All Organisms → cellular organisms → Bacteria | 43130
3300005536|Ga0070697_101694807 | Not Available | 565
3300005545|Ga0070695_100049702 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 2685
3300005557|Ga0066704_10620549 | Not Available | 692
3300005713|Ga0066905_100002337 | All Organisms → cellular organisms → Bacteria | 7045
3300005833|Ga0074472_10480797 | All Organisms → cellular organisms → Bacteria | 575
3300005877|Ga0075296_1005799 | All Organisms → cellular organisms → Bacteria | 828
3300005878|Ga0075297_1001528 | All Organisms → cellular organisms → Bacteria | 1664
3300005880|Ga0075298_1027875 | Not Available | 562
3300005937|Ga0081455_10001437 | All Organisms → cellular organisms → Bacteria | 29462
3300006047|Ga0075024_100018889 | All Organisms → cellular organisms → Bacteria | 2780
3300006163|Ga0070715_10428781 | All Organisms → cellular organisms → Bacteria | 741
3300006804|Ga0079221_10187160 | All Organisms → cellular organisms → Bacteria | 1121
3300009053|Ga0105095_10176189 | Not Available | 1170
3300009090|Ga0099827_10168063 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1806
3300009098|Ga0105245_10903178 | All Organisms → cellular organisms → Bacteria | 925
3300009148|Ga0105243_11544907 | Not Available | 689
3300009148|Ga0105243_12063705 | All Organisms → cellular organisms → Bacteria | 605
3300009157|Ga0105092_10642254 | Not Available | 615
3300009171|Ga0105101_10092431 | All Organisms → cellular organisms → Bacteria | 1469
3300009177|Ga0105248_11209136 | Not Available | 854
3300009820|Ga0105085_1130775 | Not Available | 510
3300009822|Ga0105066_1108657 | Not Available | 616
3300010359|Ga0126376_12792101 | Not Available | 538
3300010362|Ga0126377_12675031 | Not Available | 574
3300010371|Ga0134125_10007130 | All Organisms → cellular organisms → Bacteria | 12631
3300010400|Ga0134122_10136027 | All Organisms → cellular organisms → Bacteria | 1984
3300010400|Ga0134122_11310335 | Not Available | 732
3300010401|Ga0134121_12487275 | Not Available | 560
3300012199|Ga0137383_10638457 | Not Available | 778
3300012360|Ga0137375_10140388 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2377
3300012685|Ga0137397_10154004 | All Organisms → cellular organisms → Bacteria | 1703
3300012925|Ga0137419_10279917 | All Organisms → cellular organisms → Bacteria | 1268
3300014884|Ga0180104_1204673 | Not Available | 589
3300015245|Ga0137409_10098050 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2722
3300015259|Ga0180085_1155692 | Not Available | 686
3300017936|Ga0187821_10047427 | All Organisms → cellular organisms → Bacteria | 1537
3300017939|Ga0187775_10015517 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2015
3300017993|Ga0187823_10256473 | Not Available | 594
3300018054|Ga0184621_10201453 | Not Available | 713
3300018063|Ga0184637_10049901 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2552
3300018422|Ga0190265_10699768 | All Organisms → cellular organisms → Bacteria | 1135
3300018422|Ga0190265_12206989 | Not Available | 653
3300018429|Ga0190272_10069093 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2134
3300019487|Ga0187893_10274398 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1221
3300019882|Ga0193713_1201654 | Not Available | 507
3300019883|Ga0193725_1049115 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1082
3300020004|Ga0193755_1031139 | All Organisms → cellular organisms → Bacteria | 1765
3300020060|Ga0193717_1110046 | All Organisms → cellular organisms → Bacteria | 859
3300021080|Ga0210382_10047902 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1673
3300021170|Ga0210400_10276756 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1375
3300021445|Ga0182009_10010496 | All Organisms → cellular organisms → Bacteria | 3199
3300025160|Ga0209109_10035771 | All Organisms → cellular organisms → Bacteria | 2670
3300025324|Ga0209640_10042532 | All Organisms → cellular organisms → Bacteria | 3954
3300025324|Ga0209640_10046769 | All Organisms → cellular organisms → Bacteria | 3765
3300025535|Ga0207423_1018364 | All Organisms → cellular organisms → Bacteria | 1135
3300025549|Ga0210094_1003778 | All Organisms → cellular organisms → Bacteria | 2193
3300025560|Ga0210108_1056767 | All Organisms → cellular organisms → Bacteria | 754
3300025885|Ga0207653_10018498 | All Organisms → cellular organisms → Bacteria | 2193
3300025910|Ga0207684_10030819 | All Organisms → cellular organisms → Bacteria | 4564
3300025922|Ga0207646_10029858 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4949
3300025924|Ga0207694_11254342 | Not Available | 627
3300025927|Ga0207687_10440202 | All Organisms → cellular organisms → Bacteria | 1079
3300025933|Ga0207706_11554229 | Not Available | 538
3300026005|Ga0208285_1000974 | All Organisms → cellular organisms → Bacteria | 1473
3300026118|Ga0207675_101971649 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 602
3300026480|Ga0257177_1090368 | All Organisms → cellular organisms → Bacteria | 502
3300026482|Ga0257172_1063890 | All Organisms → cellular organisms → Bacteria | 675
3300027266|Ga0209215_1044925 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 606
3300027273|Ga0209886_1033328 | All Organisms → cellular organisms → Bacteria | 789
3300027671|Ga0209588_1018391 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2175
3300027671|Ga0209588_1027146 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1821
3300027738|Ga0208989_10254977 | Not Available | 570
3300027765|Ga0209073_10029518 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1687
(restricted) 3300027799|Ga0233416_10066310 | All Organisms → cellular organisms → Bacteria | 1224
3300027882|Ga0209590_10561203 | Not Available | 735
3300027894|Ga0209068_10002888 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 8204
3300027954|Ga0209859_1076716 | All Organisms → cellular organisms → Bacteria | 529
3300027955|Ga0209078_1210474 | Not Available | 520
(restricted) 3300028043|Ga0233417_10352299 | All Organisms → cellular organisms → Bacteria | 672
3300028047|Ga0209526_10447489 | Not Available | 850
3300028771|Ga0307320_10436949 | Not Available | 527
3300028792|Ga0307504_10192457 | Not Available | 718
3300028889|Ga0247827_10356842 | Not Available | 874
3300030006|Ga0299907_10136532 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2024
3300030619|Ga0268386_10428142 | Not Available | 923
3300030620|Ga0302046_11005377 | Not Available | 663
(restricted) 3300031197|Ga0255310_10050918 | All Organisms → cellular organisms → Bacteria | 1082
3300031716|Ga0310813_10124870 | All Organisms → cellular organisms → Bacteria | 2037
3300031740|Ga0307468_100712454 | All Organisms → cellular organisms → Bacteria | 841
3300031949|Ga0214473_12329219 | Not Available | 512
3300032012|Ga0310902_11116886 | Not Available | 552
3300032770|Ga0335085_10162790 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2781
3300033233|Ga0334722_10118806 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1995
3300033432|Ga0326729_1002811 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 3544
3300033432|Ga0326729_1033972 | Not Available | 804
3300034178|Ga0364934_0086080 | All Organisms → cellular organisms → Bacteria | 1177
3300034773|Ga0364936_072600 | All Organisms → cellular organisms → Bacteria | 651
3300034817|Ga0373948_0013646 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1470
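
Each scaffold identifier above joins two parts with a `|`: the numeric prefix appears to be the IMG/M taxon OID of the source sample (it matches the Taxon OID values listed under Associated Samples), followed by the scaffold name. The Python sketch below splits an identifier on that basis and computes the share of scaffolds under a length cutoff, mirroring the "short scaffolds (< 2000 bps)" metric in the Quality Assessment block. The names `ScaffoldId`, `split_scaffold_id`, and `short_fraction` are hypothetical, and the four example rows are a small subset of the table, not the full family.

```python
from typing import Iterable, NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str  # assumed to be the IMG/M taxon OID of the sample
    scaffold: str   # scaffold name within that sample


def split_scaffold_id(identifier: str) -> ScaffoldId:
    """Split 'taxon_oid|scaffold' into its two parts."""
    taxon_oid, sep, scaffold = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldId(taxon_oid, scaffold)


def short_fraction(lengths: Iterable[int], cutoff: int = 2000) -> float:
    """Fraction of scaffolds strictly shorter than cutoff base pairs."""
    lengths = list(lengths)
    return sum(1 for n in lengths if n < cutoff) / len(lengths)


if __name__ == "__main__":
    # Four rows taken from the table above: (identifier, length).
    rows = [
        ("3300000550|F24TB_10069139", 1854),
        ("3300002245|JGIcombinedJ26739_101527651", 563),
        ("3300003324|soilH2_10032295", 4848),
        ("3300005529|Ga0070741_10003015", 43130),
    ]
    for identifier, _ in rows:
        print(split_scaffold_id(identifier))
    print(short_fraction(length for _, length in rows))
```

Applied to all scaffolds of the family rather than this four-row subset, the same calculation would be expected to land near the 69.84 % reported in the Quality Assessment block, assuming that metric uses the same strict cutoff.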

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 11.11%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 7.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 7.94%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 6.35%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 4.76%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 3.97%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.97%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 3.17%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 3.17%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 3.17%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 3.17%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 2.38%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.38%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.38%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 2.38%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.59%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.59%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.59%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.59%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.59%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 1.59%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 1.59%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.59%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 1.59%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.59%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.79%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.79%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 0.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.79%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.79%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.79%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.79%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.79%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.79%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.79%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.79%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.79%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.79%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.79%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.79%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.79%
Rhizosphere Soil | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere Soil | 0.79%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.79%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000550 | Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemly | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300003994 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 | Environmental
3300003995 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 | Environmental
3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental
3300004024 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 | Environmental
3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005545 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005713 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2) | Environmental
3300005833 | Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.174_CBK | Environmental
3300005877 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_404 | Environmental
3300005878 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_104 | Environmental
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300009053 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009098 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG | Host-Associated
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009177 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG | Host-Associated
3300009820 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 | Environmental
3300009822 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010371 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-1 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300014884 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1Da | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017939 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MG | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300021080 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redo | Environmental
3300021170 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M | Environmental
3300021445 | Bulk soil microbial communities from the field in Mead, Nebraska, USA - 072115-187_1 MetaG | Environmental
3300025160 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2 | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025535 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes) | Environmental
3300025549 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 (SPAdes) | Environmental
3300025560 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 (SPAdes) | Environmental
3300025885 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025924 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C3-4 metaG (SPAdes) | Host-Associated
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300025933 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes) | Host-Associated
3300026005 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_0N_101 (SPAdes) | Environmental
3300026118 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes) | Host-Associated
3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental
3300026482 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B | Environmental
3300027266 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_M2 (SPAdes) | Environmental
3300027273 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_40_50 (SPAdes) | Environmental
3300027663 | Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_M3 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027675 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 (SPAdes) | Environmental
3300027738Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM3_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300027955Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 (SPAdes)EnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028771Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_369EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031716Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - YN3EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032012Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D3EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300033233Sediment microbial communities from Yellowstone Lake, YNP, Wyoming, USA - C3_bottomEnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M
3300034773Sediment microbial communities from East River floodplain, Colorado, United States - 4_s17EnvironmentalOpen in IMG/M
3300034817Populus rhizosphere microbial communities from soil in West Virginia, United States - GW9791_WV_N_1Host-AssociatedOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations; powered by OpenStreetMap]




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any feature of a restricted sequence listed below requires a license from the corresponding author(s) of its dataset.

Protein ID | Sample Taxon ID | Habitat | Sequence
F24TB_100691393 | 3300000550 | Soil | MTRRFRRLTRRYGSIAFVVAVLRSYREGRRRGQTRLSALRFGRAVARWRRQNGAAHS*
JGIcombinedJ26739_1015276511 | 3300002245 | Forest Soil | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPIRAARFGRAVARWRRQNGGAHS*
JGI25612J43240_10161941 | 3300002886 | Grasslands Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
JGI25617J43924_101248122 | 3300002914 | Grasslands Soil | MTRRFRRLARRSRAVAFAWVVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHP*
JGI25617J43924_103503641 | 3300002914 | Grasslands Soil | MTRRFRRLAXRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
soilL2_103009682 | 3300003319 | Sugarcane Root And Bulk Soil | MTRRMRRLTRRSRALAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWHQRNGAAHS*
soilH2_100322952 | 3300003324 | Sugarcane Root And Bulk Soil | MMTRRMRRLTRRSRALAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWHQRNGAAHS*
Ga0055435_100096113 | 3300003994 | Natural And Restored Wetlands | MRRRYSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS*
Ga0055438_100066771 | 3300003995 | Natural And Restored Wetlands | GGDMRRRYSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS*
Ga0055439_100447152 | 3300004019 | Natural And Restored Wetlands | MRRRYSYRRLARRFRSIAFIATVLRTYRAGRRRGQTRLAALRFGRAVAQWRRHNGPAHS*
Ga0055436_100253143 | 3300004024 | Natural And Restored Wetlands | YSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS*
Ga0055490_100142562 | 3300004052 | Natural And Restored Wetlands | MRRRYRRLARRFRSVAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWQRHNGAAHS*
Ga0063356_1023963191 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MMTRRMRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAAHS*
Ga0066388_1003510653 | 3300005332 | Tropical Forest Soil | MRRRFRRLSRRFRTIAFLAAVLRSYREGRRRGKSRLGAFRFGRAVARWHRHNGHAHS*
Ga0070703_100254531 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSRAVAFAWVVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
Ga0070709_101998353 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPIDAARFGRAVARWRRQNGAEHP*
Ga0070706_1001450124 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPINAARFGRAVARWRRQNGAAHS*
Ga0070707_1020372251 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | RLARRSRAVRFMAVMLRSYREGRRRGQSRLAALRFGRAVARWRRHNGASHP*
Ga0070741_100030156 | 3300005529 | Surface Soil | MTRRMRRLTRRSRALAFAWAVLRSYREGRRRGQSPLNAWRFGRAVARWHQRNGPAHS*
Ga0070697_1016948071 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MFRTDENARKGGAMRRRYRRLARRFRSIVFITTVVRSYREGRRRGQTRLAALRFGRGVAQWRRHNGTAHS*
Ga0070695_1000497024 | 3300005545 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
Ga0066704_106205491 | 3300005557 | Soil | MTRRFRRLARRSRAVAFAWVVLRSYREGRRRGQSPLRALRFGRAVARWRRQ
Ga0066905_1000023379 | 3300005713 | Tropical Forest Soil | LRRLTRRYGSIAFVVTVLRSYREGRRRGQTRLSALRFGRAVARWRRQNGAAHS*
Ga0074472_104807971 | 3300005833 | Sediment (Intertidal) | MRRRYRRLARRFRSIAFIDTVFRTYREGRRRGQTRLAALRFGHAVAQWRRHNGAAHS*
Ga0075296_10057991 | 3300005877 | Rice Paddy Soil | DVIFPFGGKIPMTRRFRRLARRSRALAFLTDVLRTYREGRRRGRSRLDALRFGRAVARWHRHNGAAQP*
Ga0075297_10015282 | 3300005878 | Rice Paddy Soil | MTRRFRRLARRSRALAFLTDVLRTYREGRRRGRSRLDALRFGRAVARWHRHNGAAQP*
Ga0075298_10278752 | 3300005880 | Rice Paddy Soil | MTRRFRRLARRSRALAFLTDVLRTYREGRRRGRSRLDALRFGRAVARWHRHNGAPQS*
Ga0081455_1000143721 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MTRRFRRLTRRYGTLGFVVSVLRSYREGRRRGQTRLTALRFGRAVARWRRQNGAAHS*
Ga0075024_1000188893 | 3300006047 | Watersheds | MTRRFRRLARHSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
Ga0070715_104287812 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPIDAARFGRAVARWRRQNGAAHS*
Ga0079221_101871601 | 3300006804 | Agricultural Soil | MTRRMRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAAHS*
Ga0079220_119312342 | 3300006806 | Agricultural Soil | MMTRRMRRLTRRSRALAFAWAVLRSYREGRRRGQSPLHAWRF
Ga0105095_101761891 | 3300009053 | Freshwater Sediment | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWQRHNGAAH
Ga0099827_101680635 | 3300009090 | Vadose Zone Soil | MRLQRRYRRLSRRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNGTAHS*
Ga0105245_109031783 | 3300009098 | Miscanthus Rhizosphere | RRLARRFRSLAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWRRYNGPAHP*
Ga0105243_115449072 | 3300009148 | Miscanthus Rhizosphere | MRRRYRRLARRFRSLAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWRRYNGPAHP*
Ga0105243_120637052 | 3300009148 | Miscanthus Rhizosphere | MRLRRRYRRLARRFRSIVFITTVVRSYREGRRRGQTRLAALRFGRGVAQWRRHNGTAHS*
Ga0105092_106422542 | 3300009157 | Freshwater Sediment | MRRRYRRLARRFRSIAFITTVFRTYQEGRRRGQTRLAALRFGRAVARWRRHNGAAHS*
Ga0105101_100924314 | 3300009171 | Freshwater Sediment | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWQRHNGAAHS*
Ga0105248_112091361 | 3300009177 | Switchgrass Rhizosphere | MRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVAR
Ga0105085_11307751 | 3300009820 | Groundwater Sand | MRRRYSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHHGIAHS*
Ga0105066_11086572 | 3300009822 | Groundwater Sand | MTRRFKRLARRSRAIRFMAAVLRSYREGRRRGQSPLAALRFGRAVARWRRHNGVAHL*
Ga0126376_127921011 | 3300010359 | Tropical Forest Soil | MRRLRRRSRALAFAWAVLRSYREGRRRGQSPLRAWRFGRAVARWHQRNNAAHS*
Ga0126377_126750311 | 3300010362 | Tropical Forest Soil | MRRLTRRSRALAFAWAVLRSYREGRRRGQSPLRAWRFGRAVARWHQRNGAAHS*
Ga0134125_1000713011 | 3300010371 | Terrestrial Soil | MRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAAHS*
Ga0134122_101360274 | 3300010400 | Terrestrial Soil | MRRRYRRLARRFRTIAFIATMLRTYREGRRRGQTHLAALRFGRAVAQWRRHNGAAHS*
Ga0134122_113103352 | 3300010400 | Terrestrial Soil | MTRRFRRLARRSRAVAFAWTVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
Ga0134121_124872752 | 3300010401 | Terrestrial Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHP*
Ga0137383_106384571 | 3300012199 | Vadose Zone Soil | MTRRFRRLARRSRAVAFAWVVLRSYRERRRRGQSPLRA
Ga0137375_101403883 | 3300012360 | Vadose Zone Soil | MFHVLKGGEMRRYRRLTRRFRSIAFIATVFRSYREGRRRGQTRLTALRFGRAVAQWRRHNGPAHS*
Ga0137397_101540043 | 3300012685 | Vadose Zone Soil | MKGGEMRLQRRYRRLARRFRSIAFITTVFRSYREGRRRGQTRLAALRFGRAVARWRHHNGTAHS*
Ga0137419_102799174 | 3300012925 | Vadose Zone Soil | MTRRFRRLARRSRAVVFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS*
Ga0180104_12046732 | 3300014884 | Soil | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQSRLAALRFGHAVAQW
Ga0137409_100980506 | 3300015245 | Vadose Zone Soil | MFHVLKGGEMRLRRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFARAVAQW
Ga0180085_11556922 | 3300015259 | Soil | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQSRLAALRFGHAVAQWRRHNGAAHS*
Ga0187821_100474273 | 3300017936 | Freshwater Sediment | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLHALRFGRAVARWRRKNGAAHS
Ga0187775_100155171 | 3300017939 | Tropical Peatland | MFQETIMTGRFRRLARRHRTLALLAAVLRSYREGRRRGQTRLGAFRFGRAVARWRRQNGAAHL
Ga0187823_102564731 | 3300017993 | Freshwater Sediment | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSRLHALRFGRAVARWRRQNGAAHS
Ga0184621_102014531 | 3300018054 | Groundwater Sediment | MRLQRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNGAARS
Ga0184637_100499014 | 3300018063 | Groundwater Sediment | MTRRFKRLARRSRAIRFMAAVLRSYREGRRRGQSPLAALRFGRAVARWRRHNGAAHP
Ga0190265_106997682 | 3300018422 | Soil | MRRRYRRLARRFRSIAFITTVFRTYREGRRRGQTHLAALRFGRAVARWRRHNGAAHS
Ga0190265_122069892 | 3300018422 | Soil | MRRRYRRLTRRFRSIAFIATVFRTYREGRRRGQTHLAALRFGRAVAHWRRHNGAAHS
Ga0190272_100690934 | 3300018429 | Soil | MRRHYRRLARRFRSIAFIATVFRTYREGRRRGQTRLAALRFGRAVAQWRRHNGAAHS
Ga0187893_102743983 | 3300019487 | Microbial Mat On Rocks | MIRRYRRLARRFRSIAFIATVLRSYREGRRRGQSRLGALRFGHAVAQWRRQNAAHS
Ga0193707_12031642 | 3300019881 | Soil | MRLQRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQ
Ga0193713_12016541 | 3300019882 | Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0193725_10491151 | 3300019883 | Soil | LARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVARWRRHNGAAHS
Ga0193755_10311393 | 3300020004 | Soil | MRLRRHYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNGTAHS
Ga0193717_11100462 | 3300020060 | Soil | MRRRYRRLARKFRSIAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWRRHNGAAHS
Ga0179594_103884051 | 3300020170 | Vadose Zone Soil | MRLRRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRF
Ga0210382_100479023 | 3300021080 | Groundwater Sediment | VLKGGEMRLQRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNGTAHS
Ga0210400_102767564 | 3300021170 | Soil | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPIDAARFGRAVARWRRQNGA
Ga0182009_100104962 | 3300021445 | Soil | MMTRRMRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAAHS
Ga0209109_100357713 | 3300025160 | Soil | MRRRYRRLARRFRSIAFIATMLRTYREGRRRGQSRLDALRFGRAVAHWRRHNGAAHP
Ga0209640_100425326 | 3300025324 | Soil | MRRRYRRLARRFRSIAFIATVLRSYREGRRRGQTRLAALRFGRAVAHWRRHNGAAHS
Ga0209640_100467692 | 3300025324 | Soil | MRRRYRRLARRFRSIAFIATMLRTYREGRRRGQSRLDALRFGRAVAHWRRHNGVAHP
Ga0207423_10183643 | 3300025535 | Natural And Restored Wetlands | MRRRYSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS
Ga0210094_10037784 | 3300025549 | Natural And Restored Wetlands | RRRYSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS
Ga0210108_10567673 | 3300025560 | Natural And Restored Wetlands | YSYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVAQWRRHNGPAHS
Ga0207653_100184983 | 3300025885 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSRAVAFAWVVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0207684_100308197 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPINAARF
Ga0207646_100298584 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPINAARFGRAVARWRRQNGAAHS
Ga0207694_112543422 | 3300025924 | Corn Rhizosphere | MMTRRMRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQ
Ga0207687_104402023 | 3300025927 | Miscanthus Rhizosphere | RRLARRFRSLAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWRRYNGPAHP
Ga0207706_115542291 | 3300025933 | Corn Rhizosphere | MTRRFRRLARRYGSIAFVVAVLRSYREGRRRGQTRLSALRFGRAVARWRRQNGAA
Ga0208285_10009742 | 3300026005 | Rice Paddy Soil | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0207675_1019716492 | 3300026118 | Switchgrass Rhizosphere | TRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0257179_10478041 | 3300026371 | Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRAL
Ga0257167_10379971 | 3300026376 | Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVA
Ga0257177_10903681 | 3300026480 | Soil | RRYRRLARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVARWRRHNGAAHS
Ga0257172_10441662 | 3300026482 | Soil | MTRRFRRLARRSRAVAFAWVVLRSYREGRRRGQSPLRALR
Ga0257172_10638901 | 3300026482 | Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHP
Ga0209215_10449252 | 3300027266 | Forest Soil | MGFLKGAEMRRRYRRLARRFRSIAFIATVFRNYREGRRRGQTRLAALRFGRAVAQWRRHNGAAHS
Ga0209886_10333281 | 3300027273 | Groundwater Sand | RRLARRFRSIAFIATMFRTYRDGRRQGQTHLAALRFGRAVAQWRRHNGVAHS
Ga0208990_10950282 | 3300027663 | Forest Soil | MRLRRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAAL
Ga0209588_10183911 | 3300027671 | Vadose Zone Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQN
Ga0209588_10271464 | 3300027671 | Vadose Zone Soil | LARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHP
Ga0209077_11674172 | 3300027675 | Freshwater Sediment | MRRRYRRLARRFRSIAFITTVFRTYQEGRRRGQTRLAALRFGRA
Ga0208989_102549771 | 3300027738 | Forest Soil | MRLRRRYRRLARRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQ
Ga0209073_100295184 | 3300027765 | Agricultural Soil | MMTRRMRRLTRRSRALAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAA
(restricted) Ga0233416_100663102 | 3300027799 | Sediment | MRRGYRRLARRFRSIAFIAIVVRSYREGRRRGRTRLDALRCGREVARWRRHNGAAYR
Ga0209590_105612031 | 3300027882 | Vadose Zone Soil | MRLQRRYRRLSRRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNG
Ga0209068_100028885 | 3300027894 | Watersheds | MTRRFRRLARHSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0209859_10767161 | 3300027954 | Groundwater Sand | TRRCKRLARRSRAIRFMAAVLRSYREGRRRGQSPLAALRFGRAVARWRRHNGVAHL
Ga0209078_12104741 | 3300027955 | Freshwater Sediment | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQTRLTALRFGRAVAQWQRHNGAAHS
(restricted) Ga0233417_103522992 | 3300028043 | Sediment | MRRGYRRLARRFRSIAFIAIVVRSYREGRRRGRTRLDALRCGREIARWRRHNGAAYL
Ga0209526_104474892 | 3300028047 | Forest Soil | MTRRFRRLARRSPAMAFAWAFLLSYREGRRRGQSPIRAARFGRAVARWRRQNGGAHS
Ga0307320_104369491 | 3300028771 | Soil | MRLQRRYRRLTRRFRSIAFITTVVRSYREGRRRGQTRLAALRFGRAVAQWRRHNGTAHS
Ga0307504_101924571 | 3300028792 | Soil | MTRRFRRLARRSPTMAFAWAFLLSYREGRRRGQSPIDAARFGRAVARWRRQNGAAHS
Ga0247827_103568423 | 3300028889 | Soil | MRRRYRRLARRFRSIAFMATVFRTYREGRRRGQSRLAALRFGRAVAHCGAAT
Ga0299907_101365325 | 3300030006 | Soil | MRRRYRRLARRFRSIAFIGTVLRSYREGRRRGQSRLAALRFGRAVAQWRHHDGAAHP
Ga0268386_104281422 | 3300030619 | Soil | MRRRYRRLARRFRSIAFIATVLRSYREGRRRGQSRLAALRFGRAVAQWRHHDGAAHP
Ga0302046_110053771 | 3300030620 | Soil | MRRRYRRLARRFRSIAFIATVLRSYREGRRRGQTRLAALRFGRAVA
(restricted) Ga0255310_100509183 | 3300031197 | Sandy Soil | MRRRYRRLARRFRSIAFIATVLRTYRAGRRRGQTRLTALRFGRAVAQWQRHNGAAHS
Ga0310813_101248704 | 3300031716 | Soil | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHP
Ga0307468_1007124543 | 3300031740 | Hardwood Forest Soil | KGAEMRRRYRRLARRFRSIAFIATVFRNYREGRRRGQTRLAALRFGRAVAQWRRHNGAAH
Ga0214473_123292192 | 3300031949 | Soil | MRRRYRRLARRFRSIAFIATVFRTYREGRRRGQSRLAALRFGHAVAQWRRHNGAAHS
Ga0310902_111168861 | 3300032012 | Soil | MTRRFRRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRR
Ga0335085_101627901 | 3300032770 | Soil | MTRRFKRLTRRYGTIAFVAALLRSYREGRRRGQTRLGAFRFGRAVARWRRENGAAHS
Ga0334722_101188062 | 3300033233 | Sediment | MRRRYRRLARRFRSIAFMAILFRTYREGRRRGQTRLTALRFGRAVAQWRRHNGAAHS
Ga0326729_10028114 | 3300033432 | Peat Soil | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRAVARWRRKNGAAHS
Ga0326729_10339722 | 3300033432 | Peat Soil | MTRRFKRLARRSRAVAFAWTVLLSYREGRRRGQSPLRALRFGRAVARWRRQNGAAHS
Ga0326723_0454110_454_585 | 3300034090 | Peat Soil | MTRRFKRLARRSRAVAFAWAVLRSYREGRRRGQSPLRALRFGRA
Ga0364934_0086080_2_151 | 3300034178 | Sediment | ARRFRSIAFIATVLRTYREGRRRGQTRLAALRFGRAVARWRRHNGAAHS
Ga0364936_072600_486_650 | 3300034773 | Sediment | RYRRLARRFRSIAFIATVLRTYREGRRRGQTRLGALRFGRAVARWRRHNGAAHS
Ga0373948_0013646_16_177 | 3300034817 | Rhizosphere Soil | MRRLTRRSRTLAFAWAVLRSYREGRRRGQSPLHAWRFGRAVARWRQRNGAAHS




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice