NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome Family F098572



Overview

Basic Information
Family ID F098572
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 100 residues
Representative Sequence MVGEVTGFLHIVRVDAPVDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Number of Associated Samples 73
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 27.18 %
% of genes near scaffold ends (potentially truncated) 21.36 %
% of genes from short scaffolds (< 2000 bps) 74.76 %
Associated GOLD sequencing projects 66
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.82


Most Common Taxonomy
Group Unclassified (60.194 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (33.010 % of family members)
Environment Ontology (ENVO) Unclassified (43.689 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (51.456 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 23.97%, β-sheet: 26.45%, Coil/Unstructured: 49.59%

Predicted 3D Structure

pTM-score: 0.82

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.177.1.1: FAH | d1nkqa_ | 1nkq | 0.5507
d.177.1.1: FAH | d1hyoa2 | 1hyo | 0.53612
d.177.1.1: FAH | d1gtta2 | 1gtt | 0.53321
d.177.1.0: automated matches | d5d2ka_ | 5d2k | 0.53097
d.177.1.0: automated matches | d6j5ya1 | 6j5y | 0.52627



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 103 Family Scaffolds
PF01068 | DNA_ligase_A_M | 7.77
PF04392 | ABC_sub_bind | 2.91
PF13489 | Methyltransf_23 | 2.91
PF00011 | HSP20 | 2.91
PF07992 | Pyr_redox_2 | 1.94
PF01565 | FAD_binding_4 | 1.94
PF07883 | Cupin_2 | 1.94
PF05494 | MlaC | 1.94
PF13473 | Cupredoxin_1 | 0.97
PF13495 | Phage_int_SAM_4 | 0.97
PF01527 | HTH_Tnp_1 | 0.97
PF02913 | FAD-oxidase_C | 0.97
PF08386 | Abhydrolase_4 | 0.97
PF01255 | Prenyltransf | 0.97
PF13520 | AA_permease_2 | 0.97
PF00990 | GGDEF | 0.97
PF00296 | Bac_luciferase | 0.97
PF12773 | DZR | 0.97
PF00393 | 6PGD | 0.97
PF03446 | NAD_binding_2 | 0.97
PF12728 | HTH_17 | 0.97
PF00293 | NUDIX | 0.97
PF07452 | CHRD | 0.97
PF00072 | Response_reg | 0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 103 Family Scaffolds
COG1423 | ATP-dependent RNA circularization protein, DNA/RNA ligase (PAB1020) family | Replication, recombination and repair [L] | 7.77
COG1793 | ATP-dependent DNA ligase | Replication, recombination and repair [L] | 7.77
COG0071 | Small heat shock protein IbpA, HSP20 family | Posttranslational modification, protein turnover, chaperones [O] | 2.91
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 2.91
COG2854 | Periplasmic subunit MlaC of the ABC-type intermembrane phospholipid transporter Mla | Cell wall/membrane/envelope biogenesis [M] | 1.94
COG0020 | Undecaprenyl pyrophosphate synthase | Lipid transport and metabolism [I] | 0.97
COG0277 | FAD/FMN-containing lactate dehydrogenase/glycolate oxidase | Energy production and conversion [C] | 0.97
COG0362 | 6-phosphogluconate dehydrogenase | Carbohydrate transport and metabolism [G] | 0.97
COG1023 | 6-phosphogluconate dehydrogenase (decarboxylating) | Carbohydrate transport and metabolism [G] | 0.97
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.97



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 60.19 %
All Organisms | root | All Organisms | 39.81 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300004019|Ga0055439_10250463 | Not Available | 577
3300004463|Ga0063356_100032858 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 5034
3300004463|Ga0063356_100300640 | Not Available | 1989
3300005336|Ga0070680_100819192 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1 | 802
3300005444|Ga0070694_101084467 | Not Available | 667
3300005445|Ga0070708_100090175 | Not Available | 2790
3300005467|Ga0070706_100571230 | Not Available | 1051
3300005467|Ga0070706_101165678 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 708
3300005518|Ga0070699_100088130 | Not Available | 2710
3300005937|Ga0081455_10088682 | All Organisms → cellular organisms → Bacteria | 2512
3300006049|Ga0075417_10060968 | All Organisms → cellular organisms → Bacteria | 1651
3300009038|Ga0099829_10025756 | All Organisms → cellular organisms → Bacteria | 4121
3300009038|Ga0099829_10107902 | Not Available | 2175
3300009053|Ga0105095_10078122 | Not Available | 1787
3300009089|Ga0099828_10899013 | Not Available | 791
3300009089|Ga0099828_11531624 | Not Available | 588
3300009162|Ga0075423_10072873 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 3569
3300010399|Ga0134127_10004610 | All Organisms → cellular organisms → Bacteria | 9952
3300010400|Ga0134122_12311992 | Not Available | 583
3300011270|Ga0137391_10769635 | All Organisms → cellular organisms → Bacteria | 795
3300011270|Ga0137391_11055819 | Not Available | 659
3300011270|Ga0137391_11182594 | All Organisms → cellular organisms → Bacteria | 612
3300012189|Ga0137388_10336775 | Not Available | 1389
3300012189|Ga0137388_11619152 | Not Available | 583
3300012203|Ga0137399_10136891 | All Organisms → cellular organisms → Bacteria → FCB group → Candidatus Latescibacteria → unclassified Candidatus Latescibacteria → Candidatus Latescibacteria bacterium | 1942
3300012203|Ga0137399_10325534 | All Organisms → cellular organisms → Bacteria | 1273
3300012355|Ga0137369_10255303 | Not Available | 1323
3300012355|Ga0137369_10504299 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 855
3300012363|Ga0137390_10130580 | All Organisms → cellular organisms → Bacteria | 2491
3300012363|Ga0137390_10745443 | Not Available | 940
3300012363|Ga0137390_11912685 | Not Available | 521
3300012685|Ga0137397_10015791 | All Organisms → cellular organisms → Bacteria | 5280
3300012918|Ga0137396_10065709 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 | 2525
3300012922|Ga0137394_10136774 | All Organisms → cellular organisms → Bacteria | 2081
3300012922|Ga0137394_10365532 | Not Available | 1231
3300012925|Ga0137419_10399603 | Not Available | 1073
3300012925|Ga0137419_11408524 | Not Available | 588
3300012929|Ga0137404_10030311 | Not Available | 3993
3300012929|Ga0137404_10378623 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Chitinophagia → Chitinophagales → Chitinophagaceae → unclassified Chitinophagaceae → Chitinophagaceae bacterium | 1244
3300012930|Ga0137407_10521195 | Not Available | 1111
3300012944|Ga0137410_11418238 | Not Available | 604
3300014881|Ga0180094_1137543 | Not Available | 571
3300015241|Ga0137418_10066612 | All Organisms → cellular organisms → Bacteria | 3286
3300015241|Ga0137418_10126908 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → unclassified Hyphomicrobiales → Rhizobiales bacterium 12-66-7 | 2269
3300015264|Ga0137403_10035218 | All Organisms → cellular organisms → Bacteria | 5266
3300017997|Ga0184610_1131656 | Not Available | 812
3300017997|Ga0184610_1142409 | Not Available | 783
3300017997|Ga0184610_1199455 | Not Available | 668
3300018000|Ga0184604_10316105 | Not Available | 553
3300018028|Ga0184608_10154159 | Not Available | 991
3300018052|Ga0184638_1018996 | Not Available | 2417
3300018052|Ga0184638_1054223 | All Organisms → cellular organisms → Bacteria | 1460
3300018053|Ga0184626_10101016 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1222
3300018056|Ga0184623_10135255 | Not Available | 1140
3300018061|Ga0184619_10268021 | Not Available | 784
3300018071|Ga0184618_10218351 | Not Available | 801
3300018075|Ga0184632_10166475 | Not Available | 972
3300018076|Ga0184609_10007425 | All Organisms → cellular organisms → Bacteria | 3950
3300018076|Ga0184609_10078256 | Not Available | 1458
3300018078|Ga0184612_10043236 | Not Available | 2341
3300018078|Ga0184612_10281200 | Not Available | 856
3300018084|Ga0184629_10177710 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1092
3300018422|Ga0190265_10413134 | All Organisms → cellular organisms → Bacteria | 1450
3300018422|Ga0190265_10648223 | Not Available | 1176
3300018429|Ga0190272_10052110 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1 | 2376
3300018429|Ga0190272_10974410 | Not Available | 806
3300018429|Ga0190272_11842219 | Not Available | 633
3300019882|Ga0193713_1079333 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 925
3300020001|Ga0193731_1029973 | Not Available | 1431
3300020003|Ga0193739_1049528 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1081
3300020022|Ga0193733_1069744 | Not Available | 988
3300020170|Ga0179594_10123891 | Not Available | 943
3300021073|Ga0210378_10154212 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 886
3300021073|Ga0210378_10344440 | Not Available | 556
3300021081|Ga0210379_10426496 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 587
3300021344|Ga0193719_10045033 | All Organisms → cellular organisms → Bacteria | 1912
3300022694|Ga0222623_10037762 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1849
3300022694|Ga0222623_10356382 | Not Available | 558
3300024330|Ga0137417_1033351 | All Organisms → cellular organisms → Bacteria | 1651
3300025910|Ga0207684_10032941 | Not Available | 4407
3300025917|Ga0207660_10178490 | Not Available | 1648
3300026535|Ga0256867_10116280 | Not Available | 1021
3300027846|Ga0209180_10295033 | Not Available | 929
3300027846|Ga0209180_10582026 | Not Available | 620
3300028380|Ga0268265_10755407 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → unclassified Terrabacteria group → Terrabacteria group bacterium ANGP1 | 944
3300028536|Ga0137415_10081132 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_60_22 | 3102
3300028716|Ga0307311_10271705 | Not Available | 507
3300028787|Ga0307323_10166409 | Not Available | 797
3300028803|Ga0307281_10005193 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3572
3300028803|Ga0307281_10007766 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2959
3300028807|Ga0307305_10163148 | Not Available | 1028
3300028828|Ga0307312_10133143 | All Organisms → cellular organisms → Bacteria | 1566
3300030006|Ga0299907_10148449 | Not Available | 1938
3300030006|Ga0299907_10289861 | Not Available | 1337
3300030619|Ga0268386_10076820 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2606
3300030619|Ga0268386_10874802 | Not Available | 565
(restricted) 3300031150|Ga0255311_1004167 | All Organisms → cellular organisms → Bacteria | 2793
(restricted) 3300031150|Ga0255311_1023939 | Not Available | 1263
(restricted) 3300031197|Ga0255310_10021952 | Not Available | 1642
(restricted) 3300031237|Ga0255334_1018304 | Not Available | 851
3300031740|Ga0307468_101985158 | Not Available | 557
3300032180|Ga0307471_100766889 | Not Available | 1131
3300034257|Ga0370495_0236308 | Not Available | 595
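
The scaffold identifiers listed above appear to combine a numeric Taxon OID and a scaffold name joined by "|", optionally preceded by a "(restricted)" marker. A minimal Python sketch of splitting such an identifier follows; the names `ScaffoldRef` and `parse_scaffold_ref` are hypothetical helpers for illustration and are not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Parts of a scaffold identifier as it is printed in the table (assumed layout)."""
    taxon_oid: str     # numeric sample identifier, e.g. "3300004019"
    scaffold_id: str   # scaffold name, e.g. "Ga0055439_10250463"
    restricted: bool   # True when the entry carries the "(restricted)" marker


def parse_scaffold_ref(text: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_id>' and record an optional '(restricted)' prefix."""
    marker = "(restricted)"
    text = text.strip()
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold_id = text.partition("|")
    # Reject anything that does not look like '<digits>|<name>'.
    if not sep or not taxon_oid.isdigit() or not scaffold_id:
        raise ValueError(f"not a scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_id, restricted)


if __name__ == "__main__":
    print(parse_scaffold_ref("3300004019|Ga0055439_10250463"))
    print(parse_scaffold_ref("(restricted) 3300031150|Ga0255311_1004167"))
```

Keeping the restricted flag alongside the two identifiers makes it easy to filter out entries that fall under the JGI data usage policy before any further processing.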

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 33.01%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 20.39%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 16.50%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 5.83%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 3.88%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 2.91%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.94%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.94%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.94%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.94%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.94%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.94%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.97%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.97%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 0.97%
Untreated Peat Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil | 0.97%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.97%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.97%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009053 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010399 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300014881 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1Da | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018061 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1 | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300020001 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2 | Environmental
3300020003 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2 | Environmental
3300020022 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300021081 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redo | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300022694 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coex | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025917 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes) | Environmental
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300028380 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes) | Host-Associated
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028716 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_198 | Environmental
3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental
3300028803 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120 | Environmental
3300028807 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300030619 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq) | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031237 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_35cm_T3_129 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300034257 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17 | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
Ga0055439_102504631 | 3300004019 | Natural And Restored Wetlands | RICGRSESCMMAGAAGSPTIGTMVGEVTGFLHIVRIEASIDPVNAEYRLAFAPLGGRLHSRHLTVQGLDRLTAFLRRAHVPTLEIERAWRMLAKRPVHSIPRVGLTPAQIETLGL*
Ga0063356_1000328582 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MRAPGPTRRFFMKAGPVTPPTIGSAMVGEVTGFLHIVRVAAPVDPVNAEYRIGFAPMDGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0063356_1003006403 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MIAFAMVGEVTGFLHIVRVDGPIDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIARVGLTPAQIEALGL*
Ga0070680_1008191921 | 3300005336 | Corn Rhizosphere | MRAPGPTRRFFMKAGPVTPPTIGSAMVGEVTGFLHIVRVAAPVDPVNAEYRIGFAPMDGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPR
Ga0070694_1010844671 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | GEVTGFLHIVRVAAPVDPVNAEYRIGFAPMDGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0070708_1000901755 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MVGEVTGFLHMVRVQAPIDPVNAEYRISFTPAGGRLPSRHATVQGLDQLTALLRQVHVPTLEIERAWRTLAMRGVHSIARVWLTPAQLEAR*
Ga0070706_1005712301 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MAGTAGPPTIGWMVGEVTGFLHIVRIDAPIDPVTAEYRIAFAPVGGRLRSRHVMVQGLDRLTAFLRQAHVPTPEIERAWRALAQRRVHSIPRVGLTPAEIEALEL*
Ga0070706_1011656781 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MVGEVTGFLHMVRVQAPIDPVNAEYRISFTPAGGRLPSRHATVQGLDQLTALLRQVHVPTLEIERAWRTLAMRGVHSIARVWLTPAQLEALGL*
Ga0070699_1000881301 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | MVGEVTGFLHMVRVQAPIDPVNAEYRISFTPAGGRLPSRHATVQGLDQLTALLRQVHVPTLEIERAWRTLAMRGVHSIARVWLTPAQLEA
Ga0081455_100886823 | 3300005937 | Tabebuia Heterophylla Rhizosphere | MAGEMHGFLHIIRLDVPVDPVNAEYRIAFAQLGGRLRERHAVRHGFDGLTAFLRQAGVPTAEIERSWRTLAKRRVHSIPHVFLTPEQLDRLGL*
Ga0075417_100609683 | 3300006049 | Populus Rhizosphere | MVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALAPRRVHSIPRVGLTPTQIAALGL*
Ga0099829_100257564 | 3300009038 | Vadose Zone Soil | MVGEVTGFLHIVRIEAPIDPVNAEYRLAFAPLGGRLHSRHVTVQGLDRLTAFLRQAHVPTLEIERAWRMLAKRPVHSIPHVGLTPAQLETLGL*
Ga0099829_101079022 | 3300009038 | Vadose Zone Soil | MAGEVTGFLHIVRLTGAVDPVIAEYRIAFAPLGGRLRGRHVRCQGLDRLTDVLRQARVATPEIERAWRTLARHRFHAIRVTLTPAQLEAHGL*
Ga0105095_100781223 | 3300009053 | Freshwater Sediment | MVGEVTGFLHIVRVDGPIDPVIAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLRRARVPTPEIERAWRALAMRRVHSIARVGLTPAQIEALGL*
Ga0099828_108990131 | 3300009089 | Vadose Zone Soil | PMAGEVTGFLHIVRLTGAVDPVIAEYRIAFAPLGGRLRGRHVRCQGLDRLTDVLRQARVATPEIERAWRTLARHRFHAIRVTLTPAQLEAHGL*
Ga0099828_115316241 | 3300009089 | Vadose Zone Soil | MVGEVTGFLHIVRIEAPIDPVNAEYRLAFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTLEIERAWRMLAKRPVHSIPRVGLMPAQIETLGL*
Ga0075423_100728731 | 3300009162 | Populus Rhizosphere | MEGPALGFLHITRGRGPIDPVNAEYRIVFAPLGGRLRTRHAQCQGLDALTDFLRQAHVPLPEIARAWQNLAKRRIYSIPQVALTPAQIDALGL*
Ga0134127_100046102 | 3300010399 | Terrestrial Soil | MKAGPVTPPTIGSAMVGEVTGFLHIVRVAAPVDPVNAEYRIGFAPMDGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0134122_123119921 | 3300010400 | Terrestrial Soil | MIAFAMVGEVTGFLHIVRVDGPIDPVIAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRRAHVPTPEIERAWRALATRRVHSIARVGLTPAQIEALGL*
Ga0137391_107696352 | 3300011270 | Vadose Zone Soil | MVGEVTGFLHIVRIEAPIDPVNAEYRLAFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTLEIERAWRMLAKRPVHSIPHVGLTPAQLETLGL*
Ga0137391_110558191 | 3300011270 | Vadose Zone Soil | MAGEVTGFLHIVRLTGAVDPVIAEYRIAFAPLGGRLRGRHVRCQGLDRLTDVLRQARVATPEIERAWRTLARHRFHAIPRVTLTPAQLEALGL*
Ga0137391_111825941 | 3300011270 | Vadose Zone Soil | MKAGPDTPPIIGIAMVGEVTGFLHIVRVDAPVDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL*
Ga0137388_103367752 | 3300012189 | Vadose Zone Soil | MVGEVTGFLHIVRIEAPIDPVNAEYRLAFAPLGGRLHSRHVTVQGLDRLTAFLRQAHVPTLEIERAWRMLAKRPVHSIPRVGLMPAQIETLGL*
Ga0137388_116191521 | 3300012189 | Vadose Zone Soil | MAGEVTGFLHIVRLTGAVDPVIAEYRIAFAPLGGRLRGRHVRCQGLDRLTDVLRQARVATPDIERAWRTLARHRFHAIRATLTPAQLEAQ
Ga0137399_101368912 | 3300012203 | Vadose Zone Soil | MVGEVTEFLHIVRRDAPIDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVGTPEIERAWRALVRRRVHSISRVGLTPTQIEALGL*
Ga0137399_103255342 | 3300012203 | Vadose Zone Soil | MVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALAARRVHSIPRVGLTPAEIEALGL*
Ga0137369_102553032 | 3300012355 | Vadose Zone Soil | MVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALAMRRVHSIPRVEFTAADLEALGL*
Ga0137369_105042992 | 3300012355 | Vadose Zone Soil | MVGEVTGFLHIVRGEGPMDPVNAEYRIAFAPEGGRLHGRHARCQGLDRLTAFLRQAHVPTLEIERAWRRLAKRRVHSIPRVRLNPAEVATLGL*
Ga0137390_101305804 | 3300012363 | Vadose Zone Soil | MKAGPDTPPIIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLGGRLRSRHAMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL*
Ga0137390_107454432 | 3300012363 | Vadose Zone Soil | MVGEVTGFLHIVRVDASIDPVNAEYRLAFAALGGRLRSRHVIVQGLDRLTALLRQAHVSTPEIERTWRTLAKRRVHSIPRVWLTPAQIEALGL*
Ga0137390_119126851 | 3300012363 | Vadose Zone Soil | MVGAVTGFLHIVRVAAPVDPVNAEYRIGFAPIDGRLRSRHVIVHGLDRLTAFLRQAHVPTLEIQRAWRTLAKRRVHSIPRVGLTPTQIEALGL*
Ga0137397_100157918 | 3300012685 | Vadose Zone Soil | MVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL*
Ga0137396_100657093 | 3300012918 | Vadose Zone Soil | MVGEVTGFLHIVRLDAPIDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVGTPEIERAWRALVRRRLHSIPRVGLTPTQIEALGL*
Ga0137394_101367741 | 3300012922 | Vadose Zone Soil | MVGEITGFLHIVRLDASVDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVPTLEIQRAWRTLAKRRVHSIPRVGLTPTQIATLGL*
Ga0137394_103655321 | 3300012922 | Vadose Zone Soil | ITGFLHIVRIEASVDPVTAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFRRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0137419_103996031 | 3300012925 | Vadose Zone Soil | AMVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALAARRVHSIPRVGLTPAEIEALGL*
Ga0137419_1140852413300012925Vadose Zone SoilTRRSFMKAGPITAPTIGSAMVGEVTGFLHIVRVAASPVDPVNAEYRIGFAPMDGRLRSGHVMVHGFDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0137404_1003031153300012929Vadose Zone SoilMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALAARRVHSIPRVGLTPTQISALGL*
Ga0137404_1037862323300012929Vadose Zone SoilMVGEVAGFLHIVRLDAPIDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVPTLEIQRAWRTLAKRRVHSIPQVGLTPTQIATLGL*
Ga0137407_1052119513300012930Vadose Zone SoilMKAGPITAPTIASAMVGEVTGFLHIVRVAAAPVDPVNAEYRIGFAPMDGRLRSGHVMVHGFDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL*
Ga0137410_1141823823300012944Vadose Zone SoilMVGEITGFLHIVRLDAPVDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHGPTLEIQWAWRTLAKRRVHSIPRVGLTPTQIATLGL*
Ga0180094_113754313300014881SoilMVGEVTGSLHIVRVEAPIDPVNAEYRLAFAPVGGRLRSRHVTVQGLDRLTAFLRQAHVPTPEIERAWRTLAKRRVHTISHVGLTPAQTEALGL*
Ga0137418_1006661273300015241Vadose Zone SoilAMVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL*
Ga0137418_1012690843300015241Vadose Zone SoilMKAGTATPPTIGSAMVGEVTEFLHIVRRDAPIDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVGTPEIERAWRALVRRRLHSIPRVGLTPTQIEALGL*
Ga0137403_1003521823300015264Vadose Zone SoilMKAGTATAPTIGSAMVGEVTGFLHIVRLDAPIDPVNAEYRIAFAPLGGRLRSRHVTVQGLDRLTALLRQAHVPTPEIERAWRTLARRRVHSIPRVGLTAAEIDSLGL*
Ga0184610_113165623300017997Groundwater SedimentMVGEVSGFLHIVRVEAPIDPVNAEYRIAFAPVGGRLRSRHVMVQGLDRLTALLRQAHVPTPEIERAWRALATRRIHSIPRVGLTPAQIEALGL
Ga0184610_114240913300017997Groundwater SedimentMKVGTATPPTIGIAMGGEVTGFLHILRVEAPIDPVTAEYRLAFAPAGGRLRSQHVTVQGLDRLTALLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAQIEALGL
Ga0184610_119945523300017997Groundwater SedimentMKAGPDTPPIIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL
Ga0184604_1031610513300018000Groundwater SedimentMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIAALGL
Ga0184608_1015415913300018028Groundwater SedimentMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALAARRVHSIPRVGLTPTQMAALGL
Ga0184638_101899613300018052Groundwater SedimentTGFLHIVRVEAPVDPVTAEYRIAFAPVGGRLRSRHVMVQGLDRLTAFLRQAHVPTPDIERAWRTLATRRVHSIPRVGLTPAEIEALAL
Ga0184638_105422343300018052Groundwater SedimentMVGEVTGFLHIVRVDAPVDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Ga0184626_1010101623300018053Groundwater SedimentMKAGPDTPPIIGIAMVGEVTGFLHIVRIEAPIDPVNAEYRVAFAPVGGRLRRRHVTVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Ga0184623_1013525523300018056Groundwater SedimentMKVGTATPPTIGIAMGGEVTGFLHILRVEAPIDPVTAEYRLAFAPAGGRLRSQHVTVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAQIEALGL
Ga0184619_1026802123300018061Groundwater SedimentMRARGPTRRFFMKAGPITPPTIGSAMVGEITGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLSQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0184618_1021835123300018071Groundwater SedimentMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0184632_1016647523300018075Groundwater SedimentMKAGPATPPTIGIAMVGEVTGFLHIVRVDGPIDPVNAEYRIAFAPVGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTSTQIEALGL
Ga0184609_1000742523300018076Groundwater SedimentMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPVGGRLRSRHAMVHGLDRLTAFLRQAHVPTPDIERAWRTLATRRVHSIPRVGLTPAEIEALAL
Ga0184609_1007825623300018076Groundwater SedimentMRAGPDTAPIIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL
Ga0184612_1004323663300018078Groundwater SedimentMKAGPDTPPIIGIAMVGEVTGFLHIVRVDAPVDPVTAEYRIAFAPVGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRIHSIARVGLTPAQIEALGL
Ga0184612_1028120013300018078Groundwater SedimentIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Ga0184629_1017771023300018084Groundwater SedimentMKAGPITPPMIGIAMGGEVTGFLHILRVEAPVDPVTAEYRIAFAPVGGRFRSRHVMVQGLDRLTAFLRQAHVPTPEIEQAWRALATRRIHSIRHVGLTSAELAALGL
Ga0190265_1041313433300018422SoilMKGGSDTPAIIIFAMVGEVTGFLHIVRVDGPIDPVSAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLREAHVPIPEIERAWRVLATRRVHSIARVGLTPAQIQALGL
Ga0190265_1064822313300018422SoilMKGGSDTPPMIAFAMVGEVTGFLHIFRVDGPIDPVIAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLRQARVPIPEIERAWRALATRRVHSIARVLLKPAQIEAFGL
Ga0190272_1005211023300018429SoilMVGEVTGFLHIVRVEAPVDPVNAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPDIERAWRALATRRVHSIPRVGLTPAQLEALGL
Ga0190272_1097441013300018429SoilMVGEVTGFLHIVRIEAPIDPVNAEYRIAFAPMGGRLRSRHVMVLGLDRLTAFLRQAHVPTSDIERAWRVLAARPVHSVPRVGLTPAEIESLGL
Ga0190272_1184221913300018429SoilMKAGPTTPPTIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLAGRLRSRHAEVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Ga0193713_107933323300019882SoilMKAEPITPPTIGSAMVGEVTGFLHIVRADAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGFDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAAQIAALDL
Ga0193731_102997323300020001SoilMRARGPTRRFFMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0193739_104952813300020003SoilMVGLVTGFLHIVRVEASGDPVTAEYRLAFAPLGGRLRRRHVTVQGLDRLTAFLRQAHVPTSDIERAWRVLATRPVHSVPHVGLTPAEIEALGL
Ga0193733_106974433300020022SoilMRARGPTRRFFMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0179594_1012389113300020170Vadose Zone SoilLGALLSAAMVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL
Ga0210378_1015421213300021073Groundwater SedimentMKAGPDTPPIIGIAMVGEVTGFLHIVRVDGPIDPVTAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQIEALGL
Ga0210378_1034444013300021073Groundwater SedimentKVGTATPPTIGIAMGGEVTGFLHILRVEAPIDPVTAEYRLAFAPAGGRLRSQHVTVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAQIEALGL
Ga0210379_1042649623300021081Groundwater SedimentPLGPLLSASMVGEVTGFLHIVRIEAPIDPVNAEYRVAFAPVGGPLRRRHVMVQGLDRLTAFLRRAHVPTPEIERAWRALAKRRIHSIPHMGLTPAQIEELGL
Ga0193719_1004503323300021344SoilMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHATVQGLDWLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0222623_1003776223300022694Groundwater SedimentMVGEVTGFLHIVRVEASGDPVTAEYRLAFAPLGGRLRRRHVTVQGLDRLTAFLRQAHVPTSDIERAWRVLATRPVHSVPRVGLTPAEIESLGL
Ga0222623_1035638213300022694Groundwater SedimentPPTIGIPMGGEVTGFLHILRVEAPIDPVNAEYRIAFAPMGGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRRLATRRVHSIPRVGLTPAQLEALGL
Ga0137417_103335133300024330Vadose Zone SoilMVGELTGFLHIVRIEAPLDPVTAEYRLTFAPLGGRLHSRHVTVQGLDRLTALLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAEIEALGL
Ga0207684_1003294133300025910Corn, Switchgrass And Miscanthus RhizosphereMVGEVTGFLHMVRVQAPIDPVNAEYRISFTPAGGRLPSRHATVQGLDQLTALLRQVHVPTLEIERAWRTLAMRGVHSIARVWLTPAQLEALGL
Ga0207660_1017849043300025917Corn RhizosphereMIAFAMVGEVTGFLHIVRVDGPIDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIARVGLTPAQIEALGL
Ga0256867_1011628013300026535SoilMVGEVTGFLHVVRVDGPIDPVIAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLRRARVPTPEIERAWRALAMRRVHSIARVGLTPAQIEALGL
Ga0209180_1029503323300027846Vadose Zone SoilMAGEVTGFLHIVRLTGAVDPVIAEYRIAFAPLGGRLRGRHVRCQGLDRLTDVLRQARVATPEIERAWRTLARHRFHAIRVTLTPAQLEAHGL
Ga0209180_1058202613300027846Vadose Zone SoilTGFLHIVRIEAPIDPVNAEYRLAFAPLGGRLHSRHVTVQGLDRLTAFLRQAHVPTLEIERAWRMLAKRPVHSIPHVGLTPAQLETLGL
Ga0268265_1075540723300028380Switchgrass RhizosphereMRAPGPTRRFFMKAGPVTPPTIGSAMVGEVTGFLHIVRVAAPVDPVNAEYRIGFAPMDGRLRSRHVMVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTAADLEALGL
Ga0137415_1008113223300028536Vadose Zone SoilMAIYRPWSSPTRRFFMKAGTATPPTIGSAMVGEVTEFLHIVRRDAPIDPVNAEYRIAFAPLGGRLRSRHVMVQGLDQLTAFLRQAHVGTPEIERAWRALVRRRVHSISRVGLTPTQIEALGL
Ga0307311_1027170513300028716SoilTAREPSRTVRARGPTRRFFMKAGPITPPTIVSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0307323_1016640923300028787SoilMRARGPTRRFFMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHATVQGLDWLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0307281_1000519343300028803SoilMVGEVTGFLHIVRIEAPIDPVNAEYRVAFAPVGGPLRRRHVMVQGLDRLTAFLRRAHVPTPEIERAWRALAKRRIHSIPHMGLTPAQIEELGL
Ga0307281_1000776653300028803SoilMNAGPITPPMIGIAMGGEVTGFLHILRVEAPVDPVTAEYRIAFAPVGGRFRSRHVMVQGLDRLTAFLRQAHVPTPEIEQAWRALATRRIHSIRHVGLTSAELAALGL
Ga0307305_1016314823300028807SoilMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAVVHGLGRLTAFLREAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0307312_1013314313300028828SoilMKAGPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAVVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPTQMAALGL
Ga0299907_1014844923300030006SoilMVGEVTGFLHIVRVDGPIDPVIAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLRRARVPTPEIERAWRALAMRRVHSIARVGLTPAQIEALGL
Ga0299907_1028986113300030006SoilMVGEVTGFLHIVRIEAPIDPVTAEYRIAFAPLGGRLRRHHVTVHGLDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTLTEIEVLGL
Ga0268386_1007682033300030619SoilMVGEVTGFLHILRVDAPVDPVTAEYRIAFAPVGGRLRSRHVTVQELDRLTAFLRQAHVPTPTIEQAWRALATRRVHSIPRVGLTPTQIAALGF
Ga0268386_1087480213300030619SoilVRVDGPIDPVIAEYRIAFAPLGGRLRSRHATVHGLDRLTAFLRRARVPTPEIERAWRALAMRRVHSIARVGLTPAQIEALGL
(restricted) Ga0255311_100416733300031150Sandy SoilMVGEVSGFLHVVRIDVAVDPVRAQYQVAFAPVGGRLRSRHLVCHGLDGLTSFLRQARVPGPEIERAWRMLAQRQVHSIPRVALTPAQLEVFGL
(restricted) Ga0255311_102393923300031150Sandy SoilMAGAARSRTIGTMVGEVTGFLHIVRIEASVDPVNAEYRLAFAPSGGRLHSRHVTVQGLDRLTAFLRRAHVPTLEIERAWRMLAKRPVHSIPRVGLTPAQIETLGL
(restricted) Ga0255310_1002195233300031197Sandy SoilMMAGAARSRTIGTMVGEVTGFLHIVRIEASVDPVNAEYRLAFAPSGGRLHSRHVTVQGLDRLTAFLRRAHVPTLEIERAWRMLAKRPVHSIPRVGLTPAQIETLGL
(restricted) Ga0255334_101830413300031237Sandy SoilEVTGFLHIVRIEASVDPVNAEYRLAFAPSGGRLHSRHVTVQGLDRLTAFLRRAHVPTLEIERAWRMLAKRPVHSIPRVGLTPAQIETLGL
Ga0307468_10198515813300031740Hardwood Forest SoilGPTRRFFMKAEPITPPTIGSAMVGEVTGFLHIVRVDAPVDPVNAEYRIAFAPLGGRLRSRHAMVQGFDRLTAFLRQAHVPTPEIERAWRALATRRVHSIPRVGLTPAQIAALGL
Ga0307471_10076688923300032180Hardwood Forest SoilMIAFAMVGEVTGFLHIVRVDGPIDPVIAEYRIAFAPLGGRLRSRHAMVQGLDRLTAFLRRAHVPTPEIERAWRALATRRVHSIARVGLTPAQIEALGL
Ga0370495_0236308_107_4033300034257Untreated Peat SoilMIRTGMVGEVTGFLHIVRVDGPIDPVNAEYRIAFAPLGGRLRSRHAVIHGLDRLTALLRQAQVPTPEIERAWRALATRRVHSISRVGLTPAQLEALGL




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice