NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F089361


Overview

Basic Information
Family ID: F089361
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 109
Average Sequence Length: 106 residues
Representative Sequence: LLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGKFVKAYLIAEIVLAAPTAYYICVLAMRHGGDFAPGFKDLVLTILLFLVFSLVPGGLAAWRVLARRKGRS
Number of Associated Samples: 86
Number of Associated Scaffolds: 109

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 3.67 %
% of genes near scaffold ends (potentially truncated): 95.41 %
% of genes from short scaffolds (< 2000 bps): 94.50 %
Associated GOLD sequencing projects: 76
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.38
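
The percentages above can be read back as approximate gene counts. The following is a minimal sketch, assuming each percentage was computed as a count divided by the family's 109 sequences and then rounded to two decimals; the helper name `pct_to_count` is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SIZE = 109  # "Number of Sequences" from Basic Information


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Recover the nearest whole-gene count from a reported percentage.

    Assumption: the percentage is count / total * 100, rounded to 2 decimals.
    """
    return round(pct / 100 * total)


# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 3.67,
    "near scaffold end": 95.41,
    "short scaffold (< 2000 bp)": 94.50,
}

counts = {name: pct_to_count(pct) for name, pct in quality.items()}
for name, count in counts.items():
    print(f"{name}: ~{count} of {FAMILY_SIZE} genes")
```

Under that assumption, about 4 genes have a valid RBS motif, about 104 lie near a scaffold end, and about 103 come from short scaffolds.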


Most Common Taxonomy
Group: Unclassified (52.294 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (23.853 % of family members)
Environment Ontology (ENVO): Unclassified (26.606 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (38.532 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix 59.85%, β-sheet 0.00%, coil/unstructured 40.15%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.38

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
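
The two confidence measures on this page can be applied as simple checks. The sketch below uses the pTM cutoff of 0.7 quoted in the note and the four pLDDT bands from the viewer legend. The function names are hypothetical, and assigning fractional scores that fall between two bands (for example 50.5) to the upper band is an assumption made here, not something the page specifies.

```python
PTM_CUTOFF = 0.7  # threshold quoted in the low-quality model note


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to one of the four legend bands.

    Assumption: scores strictly between two bands go to the upper band.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    if plddt <= 50:
        return "0-50"
    if plddt <= 70:
        return "51-70"
    if plddt <= 90:
        return "71-90"
    return "91-100"


# The family's reported pTM-score of 0.38 is below the 0.7 cutoff.
print(is_low_confidence(0.38))
```

With the reported pTM-score of 0.38, `is_low_confidence` returns `True`, which matches the page's low-quality label for this family.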



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 109 Family Scaffolds
PF03783 | CsgG | 24.77
PF01039 | Carboxyl_trans | 24.77
PF16538 | FlgT_C | 2.75
PF02786 | CPSase_L_D2 | 1.83
PF13545 | HTH_Crp_2 | 0.92
PF01557 | FAA_hydrolase | 0.92
PF14366 | DUF4410 | 0.92
PF00923 | TAL_FSA | 0.92
PF00083 | Sugar_tr | 0.92
PF00289 | Biotin_carb_N | 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 109 Family Scaffolds
COG0777 | Acetyl-CoA carboxylase beta subunit | Lipid transport and metabolism [I] | 24.77
COG0825 | Acetyl-CoA carboxylase alpha subunit | Lipid transport and metabolism [I] | 24.77
COG1462 | Curli biogenesis system outer membrane secretion channel CsgG | Cell wall/membrane/envelope biogenesis [M] | 24.77
COG4799 | Acetyl-CoA carboxylase, carboxyltransferase component | Lipid transport and metabolism [I] | 24.77
COG0176 | Transaldolase/fructose-6-phosphate aldolase | Carbohydrate transport and metabolism [G] | 0.92



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 52.29 %
All Organisms | root | All Organisms | 47.71 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001867|JGI12627J18819_10162972 | Not Available | 906
3300002245|JGIcombinedJ26739_101220847 | Not Available | 641
3300004092|Ga0062389_101666890 | All Organisms → cellular organisms → Bacteria | 819
3300005338|Ga0068868_100561340 | All Organisms → cellular organisms → Bacteria | 1007
3300005436|Ga0070713_100014024 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter | 5934
3300005439|Ga0070711_101098174 | Not Available | 685
3300005538|Ga0070731_10929381 | Not Available | 576
3300005541|Ga0070733_10318207 | All Organisms → cellular organisms → Bacteria | 1029
3300005541|Ga0070733_10626014 | Not Available | 722
3300005541|Ga0070733_10962846 | Not Available | 573
3300005542|Ga0070732_10568449 | All Organisms → cellular organisms → Bacteria | 688
3300005610|Ga0070763_10807994 | Not Available | 554
3300005836|Ga0074470_10573281 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1329
3300005921|Ga0070766_11009968 | Not Available | 572
3300006041|Ga0075023_100466781 | Not Available | 561
3300006050|Ga0075028_100085159 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1587
3300006050|Ga0075028_100464388 | All Organisms → cellular organisms → Bacteria | 734
3300006086|Ga0075019_10110397 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1587
3300006086|Ga0075019_10578063 | All Organisms → cellular organisms → Bacteria | 703
3300006162|Ga0075030_100971546 | All Organisms → cellular organisms → Bacteria | 669
3300006172|Ga0075018_10719716 | Not Available | 541
3300006174|Ga0075014_100295188 | All Organisms → cellular organisms → Bacteria | 852
3300006174|Ga0075014_100878530 | All Organisms → cellular organisms → Bacteria | 535
3300006176|Ga0070765_100852024 | Not Available | 861
3300006354|Ga0075021_10266347 | All Organisms → cellular organisms → Bacteria | 1057
3300006354|Ga0075021_10417052 | All Organisms → cellular organisms → Bacteria | 843
3300006954|Ga0079219_11225550 | Not Available | 652
3300007258|Ga0099793_10131703 | Not Available | 1176
3300007258|Ga0099793_10323755 | Not Available | 751
3300009038|Ga0099829_11525754 | Not Available | 552
3300009093|Ga0105240_10261173 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1998
3300009174|Ga0105241_11793493 | Not Available | 599
3300009176|Ga0105242_13257339 | Not Available | 504
3300010379|Ga0136449_101028828 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1321
3300010397|Ga0134124_12911104 | Not Available | 522
3300011120|Ga0150983_13713452 | All Organisms → cellular organisms → Bacteria | 932
3300011269|Ga0137392_10120835 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2089
3300011269|Ga0137392_10651407 | Not Available | 873
3300011270|Ga0137391_10787704 | All Organisms → cellular organisms → Bacteria | 784
3300012010|Ga0120118_1063539 | All Organisms → cellular organisms → Bacteria | 917
3300012189|Ga0137388_10396180 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1278
3300012202|Ga0137363_10604819 | Not Available | 925
3300012206|Ga0137380_11599681 | Not Available | 536
3300012357|Ga0137384_10309319 | All Organisms → cellular organisms → Bacteria | 1313
3300012685|Ga0137397_10680366 | Not Available | 765
3300012685|Ga0137397_10828200 | Not Available | 686
3300012917|Ga0137395_10103574 | All Organisms → cellular organisms → Bacteria | 1894
3300012917|Ga0137395_10371551 | All Organisms → cellular organisms → Bacteria | 1021
3300012917|Ga0137395_11100245 | Not Available | 563
3300012925|Ga0137419_11198140 | Not Available | 636
3300012925|Ga0137419_11511003 | Not Available | 569
3300012927|Ga0137416_11480700 | Not Available | 616
3300012927|Ga0137416_11829765 | Not Available | 555
3300012931|Ga0153915_10502696 | All Organisms → cellular organisms → Bacteria | 1386
3300012944|Ga0137410_11832778 | Not Available | 536
3300013770|Ga0120123_1121152 | All Organisms → cellular organisms → Bacteria | 608
3300014501|Ga0182024_12860554 | Not Available | 512
3300015241|Ga0137418_10353648 | Not Available | 1212
3300015245|Ga0137409_11342330 | Not Available | 558
3300017933|Ga0187801_10395206 | Not Available | 574
3300017936|Ga0187821_10128909 | All Organisms → cellular organisms → Bacteria | 946
3300017936|Ga0187821_10197568 | Not Available | 772
3300017936|Ga0187821_10232106 | All Organisms → cellular organisms → Bacteria | 716
3300017993|Ga0187823_10277217 | Not Available | 576
3300017993|Ga0187823_10291182 | Not Available | 565
3300018006|Ga0187804_10237344 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 785
3300018007|Ga0187805_10618354 | Not Available | 513
3300020170|Ga0179594_10003866 | All Organisms → cellular organisms → Bacteria | 3661
3300020579|Ga0210407_10578346 | All Organisms → cellular organisms → Bacteria | 876
3300020579|Ga0210407_11083699 | All Organisms → cellular organisms → Bacteria | 608
3300020581|Ga0210399_11346312 | Not Available | 560
3300021171|Ga0210405_11313096 | Not Available | 531
3300021171|Ga0210405_11413611 | Not Available | 506
3300021180|Ga0210396_10522488 | All Organisms → cellular organisms → Bacteria | 1037
3300021407|Ga0210383_11033382 | Not Available | 696
3300021420|Ga0210394_10077787 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2868
3300021433|Ga0210391_10193421 | Not Available | 1601
3300021433|Ga0210391_10746228 | All Organisms → cellular organisms → Bacteria | 766
3300024330|Ga0137417_1305457 | All Organisms → cellular organisms → Bacteria | 3708
3300025934|Ga0207686_11005060 | Not Available | 677
3300026376|Ga0257167_1012583 | Not Available | 1148
3300027376|Ga0209004_1008676 | All Organisms → cellular organisms → Bacteria | 1465
3300027643|Ga0209076_1144328 | Not Available | 667
3300027674|Ga0209118_1074739 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 974
3300027676|Ga0209333_1088378 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Acidimicrobiia → Acidimicrobiales → Acidimicrobiaceae → Acidithrix → Acidithrix ferrooxidans | 847
3300027855|Ga0209693_10013135 | All Organisms → cellular organisms → Bacteria | 3905
3300027867|Ga0209167_10399893 | Not Available | 749
3300027867|Ga0209167_10719989 | Not Available | 545
3300027869|Ga0209579_10667370 | Not Available | 563
3300027894|Ga0209068_10333253 | All Organisms → cellular organisms → Bacteria | 857
3300027894|Ga0209068_10403374 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 780
3300027903|Ga0209488_11049266 | Not Available | 560
3300027908|Ga0209006_10271543 | All Organisms → cellular organisms → Bacteria | 1454
3300027911|Ga0209698_10476806 | Not Available | 968
3300027915|Ga0209069_10335399 | All Organisms → cellular organisms → Bacteria | 812
3300028792|Ga0307504_10147124 | All Organisms → cellular organisms → Bacteria | 796
3300029636|Ga0222749_10171638 | All Organisms → cellular organisms → Bacteria | 1068
3300029636|Ga0222749_10347802 | All Organisms → cellular organisms → Bacteria | 776
3300031057|Ga0170834_109804866 | Not Available | 560
3300031231|Ga0170824_117059431 | Not Available | 544
3300031236|Ga0302324_101102601 | All Organisms → cellular organisms → Bacteria | 1068
3300031525|Ga0302326_11159188 | All Organisms → cellular organisms → Bacteria | 1068
3300031525|Ga0302326_11939690 | All Organisms → cellular organisms → Bacteria | 764
3300031708|Ga0310686_112650837 | Not Available | 526
3300031820|Ga0307473_11299362 | Not Available | 544
3300031823|Ga0307478_11067552 | Not Available | 674
3300032180|Ga0307471_102639318 | Not Available | 637
3300032205|Ga0307472_101385317 | All Organisms → cellular organisms → Bacteria | 681
3300032515|Ga0348332_10570002 | Not Available | 1826
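
Each scaffold identifier in this table has a numeric prefix before the `|` character, and that prefix matches a Taxon OID listed under Associated Samples. The sketch below splits an identifier on that basis. It reflects a pattern observed on this page rather than a documented identifier format, and the names `ScaffoldRef` and `parse_scaffold_id` are hypothetical.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str  # numeric prefix, e.g. "3300005436"
    scaffold: str   # scaffold name, e.g. "Ga0070713_100014024"


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the identifier holds a numeric prefix, a '|' separator,
    and a non-empty scaffold name, as seen in the table above.
    """
    taxon_oid, sep, scaffold = raw.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold)


ref = parse_scaffold_id("3300005436|Ga0070713_100014024")
print(ref.taxon_oid, ref.scaffold)
```

Grouping scaffolds by `taxon_oid` would then show how many family members each sample contributes.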




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 23.85%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 13.76%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 11.93%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 7.34%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 7.34%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 6.42%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.67%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 3.67%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 2.75%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.83%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 1.83%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 1.83%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 1.83%
Bog Forest Soil | Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil | 0.92%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.92%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.92%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.92%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 0.92%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.92%
Permafrost | Environmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost | 0.92%
Plant Litter | Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter | 0.92%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.92%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001867 | Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300004092 | Coassembly of ECP03_0M1, ECP03_OM2, ECP03_OM3, ECP04_OM1, ECP04_OM2, ECP04_OM3, ECP14_OM1, ECP14_OM2, ECP14_OM3 | Environmental
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005538 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 | Environmental
3300005541 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005610 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 | Environmental
3300005836 | Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB | Environmental
3300005921 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006086 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 | Environmental
3300006162 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006174 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014 | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009093 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaG | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300010379 | Sb_50d combined assembly | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012010 | Permafrost microbial communities from Nunavut, Canada - A7_35cm_12M | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300013770 | Permafrost microbial communities from Nunavut, Canada - A15_5cm_18M | Environmental
3300014501 | Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015245 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly) | Environmental
3300017933 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_1 | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300018007 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_5 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300020579 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M | Environmental
3300020581 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M | Environmental
3300021171 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M | Environmental
3300021180 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O | Environmental
3300021407 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O | Environmental
3300021420 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M | Environmental
3300021433 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-O | Environmental
3300024330 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction) | Environmental
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300026376 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-B | Environmental
3300027376 | Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_RefH0_M1 (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027674 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes) | Environmental
3300027676 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_O3 (SPAdes) | Environmental
3300027855 | Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3 (SPAdes) | Environmental
3300027867 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1 (SPAdes) | Environmental
3300027869 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300027908 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes) | Environmental
3300027911 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2012 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031708 | FICUS49499 Metagenome Czech Republic combined assembly | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300032515 | FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data) | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12627J18819_101629722 | 3300001867 | Forest Soil | GIPVLLINHFSFDNKVIYPVEWITATWLVVAAAMIFFRGRLLIAYMVSEIVLAAPTAYYIGILAVHHGGHFAPAFIDLVVTVFLFLFFSLVPIGLAAIRIRARRQSPA*
JGIcombinedJ26739_1012208472 | 3300002245 | Forest Soil | RAGIPVLLINHFSFDNKVIYPVEWITAAWLVLLAAVIFFRGQFLKTYLISELVLAAPTAYYIGVLATQHGGDFAPGFRDLLLTTFLFTVFSIVPAGLASWEMTRSRTYPAR*
Ga0062389_1016668902 | 3300004092 | Bog Forest Soil | WPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAATMIFFRGRFLKPYLIAEIVLAAPTSYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPVALAARRILARRETRS*
Ga0068868_1005613402 | 3300005338 | Miscanthus Rhizosphere | NKAIYPVEWLTATWLVLAASMIFFRGKFLTAYLAAEIVLAAPTAYYICVLAIRHGGDFAPGFKDVALTAFLFLFFSVVPGLLAAWRLISGRPMRT*
Ga0070713_1000140244 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | VGIYRWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLALVAAMIFFRGRFLKGYVISEAVLAAPTAYYIGVLAVQHGGDFAPGFKDLLLTILLFCVFSVLPAGLAIRELAARRRNHR*
Ga0070711_1010981742 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | PVLLINHFSFDNKVIYPAEWITAAWLVFMAAMIFFRGRFLKTYLISELVLAAPTAYYIAVLAIQHGGDFAPGFKDLLLTIILFVLFSVVPAGLAAWEIARVRKTDG*
Ga0070731_109293812 | 3300005538 | Surface Soil | NHFSFDNKVIYPVEWITAAWLVIAAAMIFFRGKFLIPYAAAEVVLAAPSAYYIGLLAVRHGGDFAPAFKDLVLTMLLFFVFSLVPLAWAATRILARKRPGAAES*
Ga0070733_103182072 | 3300005541 | Surface Soil | LLINHFSFDNKIIYPVEWITATWLILAAAMIFFKGRLLMEYLIAEIVLAAPTAYYIGVLAVRHGGDFAPAFKDLVLTALLFLVFSLIPIALAATRLVGRKPA*
Ga0070733_106260142 | 3300005541 | Surface Soil | YPVEWITATWLVFAAAMLFFRGKLLRIYLAAEIVLAAPTAYYICVLAIRHGGHFAPARVDLVVTAVLFTVFSIVPVGLASQRLWAKSNKPN*
Ga0070733_109628461 | 3300005541 | Surface Soil | WITATWLVFAAAMIFFRGKFLKAYLIAEIVLAAPAAYYICILAIRHGGDFAPGFKDLALTAFLFAIFSMTAMGLAVQRLMSRARMQRRSDANCPRRPF*
Ga0070732_105684491 | 3300005542 | Surface Soil | FDNKVIYPVEWITATWLVIAAAMIFFRGKFVKAYLIAEIVLAAPTAYYICVLAMRHGGDFAPGFKDLVLTILLFLVFSLVPGGLAAWRVLARRKGRS*
Ga0070763_108079941 | 3300005610 | Soil | PGGPPFILDPRAGIAVLLINHFSFDNKIIYPVEWITATWLVFLAAMIFFRGRFLKTYLISEIILAAPTAYYIGVLAIQHGGDFAPGFKDLVLTALLFAIFSLAPIGLAVRRLRSKSDVRP
Ga0074470_105732812 | 3300005836 | Sediment (Intertidal) | RWPGGPPFILDPRAGIPVLLINHFSFDNKIIYPVEWITATWLVFLAAMIFFRGKFLKTYLISEMLLAAPTTYYIAVLTVQHEGHFAPGFKDVVLTALLFLVFSLIPVAWAASRILSRRKARA*
Ga0070766_110099682 | 3300005921 | Soil | DNKVIYPVEWTTATWLVIAAAMIFFRGKLIKAYLIAEVLLAAPTAYYICVLAIRHGGDFAPAFKDLVITLLLFLVFSVIPAGLAVHRILARRKLTA*
Ga0075023_1004667811 | 3300006041 | Watersheds | GPPFILDPRAGIPVLLINRFSFDNKVIYPVEWITATWLVFLAAMIFFRGKLLKTYLISEIVLAAPTAYYICVLAMRHGGDFAPGFKDLVLTAILFTVFSLVPMGLIIQRIWARKI*
Ga0075028_1000851591 | 3300006050 | Watersheds | YCWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAALIFFQGRLLKPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPAGLAASRVLARREARS*
Ga0075028_1004643881 | 3300006050 | Watersheds | YCWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPMEWISATWLVIAASMIFFKGRFLKAYVIVEIVLAAPTAYYIGVLAVRHGGDFAPGFKDLVLTVLLFLGFSLVPAGLALRRILARS*
Ga0075019_101103972 | 3300006086 | Watersheds | PFILDPRAGIPVFLINHFSFDNKVIYPVEWITATWLVIAAAMIFFKGRFLKAYLIGEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTLLLFLGFSLLPAGLAAWRIFAQRQAQS*
Ga0075019_105780632 | 3300006086 | Watersheds | NHFSFDNKVIYPVEWITATWLVIAAALIFFRGRLLKPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPAGLAASRVLARREARS*
Ga0075030_1009715461 | 3300006162 | Watersheds | YPVEWITATWLVIAAAMIFFKGRFLKVYLIGEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTLLLFLGFSLLPAGLAAWRIFAQRQAQS*
Ga0075018_107197162 | 3300006172 | Watersheds | GPPFILDPRAGIPVFLINHFSFDNKIIYPVEWITATWLVFLAAMIYFQGRLLRAYLISEILLAAPTAYYIAVLAIQGGGHFAPGFKDLVLTTVLFTIFSLAPIGLAIRRLWVRGK*
Ga0075014_1002951881 | 3300006174 | Watersheds | YPVEWITATWLVIAAAMIFFKGRFLKVYLIGEIVLAAPTAYYICLLAIRHGGDFAPGFKDLVLTLLLFLGFSLLPAGLAAWRIFAQRQAQS*
Ga0075014_1008785301 | 3300006174 | Watersheds | VEWITATWLVLVAAMIFFRGRFLKPYLIAEIFLAAPTAYYIGVLAIRHGGDFAPGFKDLVLTVVLFLVFSLVPLGLAAKRVLVRREEQS*
Ga0070765_1008520241 | 3300006176 | Soil | AGIPVLLINQFSFDNKVIYPVEWITATWLVFAAAMIFFRGKLLRTYLAVEIVLAAPTAYYICILAIQRGGHFAPAFVDLVVTVVLFSVFSLAPMALTVRCLWINSRACA*
Ga0075021_102663471 | 3300006354 | Watersheds | DPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVFMAAMICFRGTFLKTYLISELVLAAPTAYYIVVLAIQHGGDFAPGFKDLLLTIFLFLVFSAVPAGLAAWHIRQTKAVHA*
Ga0075021_104170522 | 3300006354 | Watersheds | INHFSFDNKVIYPVEWITATWLVIAAALIFFRGRLLKPYLIAEIVLGAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPAGLAASRVLARREARS*
Ga0079219_112255502 | 3300006954 | Agricultural Soil | LLINHFSFDNKVIYPVEWITAAWLALMAAMIFFRGRYLKGYAISEAVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTILLFCVFSVAPAAFALWELAARSGTHHASSNQR*
Ga0099793_101317032 | 3300007258 | Vadose Zone Soil | YRWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAGWLVLMAAMIFFRGRFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWEIGRTKKTPA*
Ga0099793_103237551 | 3300007258 | Vadose Zone Soil | PVEWITAAWLVLMAVMIFFRGKFLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFCVFSLAPIGLALHRIWQRSKTLV*
Ga0099829_115257541 | 3300009038 | Vadose Zone Soil | PVLLINHFSFDNKVIYPVEWITAAWLVFMAAMIFFRGKLLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFLVFSVVPAGLAGWEIGRHKEFGRPQ*
Ga0105240_102611733 | 3300009093 | Corn Rhizosphere | FDNKAIYPVEWLTATWLVLAASMIFFRGKFLTAYLAAEIVLAAPTAYYICVLAIRHGGDFAPGFKDVVLTALLFLLF*
Ga0105241_117934932 | 3300009174 | Corn Rhizosphere | LLINHFSFDNKVICPVEWLTATWLVFAASMIFFRGKFLIAYLAAEIVLAAPTAYYICILAIRHGGDFAPGFKDVVLTALLFLLFSVVPAGLAAGRVMVRGRLRT*
Ga0105242_132573392 | 3300009176 | Miscanthus Rhizosphere | EWITAGWLVLMAALIFFRGQFLKTYLISELVLAAPTAYYIAVLAIHHGGDFAPGFKDLQLTIILFVVFSVVPVGLAAREIVRSKKARA*
Ga0136449_1010288281 | 3300010379 | Peatlands Soil | KVIYPVEWITATWLVIAAAMILFRGRFLKAYLIAEVVLATPTAYYICILAIRHGGDFAPAFKDLVLTVVLFLVFSLFPAGLAARRILVRRDMRA*
Ga0134124_129111042 | 3300010397 | Terrestrial Soil | AGIPVLLINHFSFDNKVICPVEWLTATWLGFAASMIFFRGKFLIAYLAAEIVLAAPTAYYICILAIRHGGDFAPGFKDVVLTALLFLLFSVVPAGLAAGRVMVRGRLRT*
Ga0150983_137134522 | 3300011120 | Forest Soil | PVEWITATWLVIAAAMIFFQGKLLRAYLIAEIILAAPTAYYISILAVRHGGDFAPGFKDLVLTALLFLIFSFVPVGLAARRVLARKRLQS*
Ga0137392_101208353 | 3300011269 | Vadose Zone Soil | VEWITAAWLVLMAAMIFFRGKLLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFLVFSVVPAGLAVWEIGRHKELGAPQ*
Ga0137392_106514073 | 3300011269 | Vadose Zone Soil | HFSFDNKVIYPVEWITATWLVFAATIIFFRGKFLKSYLIAEIVLAWPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFFVFSLVPAGLATMRIFAQRATRS*
Ga0137391_1078770413300011270Vadose Zone SoilRWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGRLLKAYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLAFSLVPAGIAVRRILARRDARS*
Ga0120118_106353923300012010PermafrostWITATWLVIAAAMIFFRGKILKPYLIAEIVLAAPTAYYICVLSIRHGGDFAPGFKDLVLTTLLFLVFSLVPAGLAARRILARREARS*
Ga0137388_1039618013300012189Vadose Zone SoilLLINHFSFDNKIIYPVEWITAAWLVFLAGMLFFRGRLLKTYLISEMVLAAPTAYYIAVLAIQHGGHFAPAFKDVELTAILFTIFSLAPIALAIQRIRIGGKRALV*
Ga0137363_1060481913300012202Vadose Zone SoilRWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGKFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWEIGRTKKTPA*
Ga0137380_1159968113300012206Vadose Zone SoilKVIYPVEWITAAWLVLMAAMIFFRGKLLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFLVFSLVPASLAAWEIGRHREFGAPQ*
Ga0137384_1030931923300012357Vadose Zone SoilGGPPFILDPRAGIPVLLINYFSFDNKVIYPVEWITATWLVIAAAMIFFRGKLLKTYLITEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTALLFLAFSLVPAGIAARRILARREARS*
Ga0137397_1068036613300012685Vadose Zone SoilIPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGRFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWEIGRTKKTPA*
Ga0137397_1082820023300012685Vadose Zone SoilGPPFILDPRAGIPVFLINHFSFDNKVIYPVEWVTATWLVFLATMIFFQGRFLKTYLISEILLAAPTAYYIAVLAIQHGGHFAPGFKDLVLTAILFTIFSIAPIGLAVQRLGARKKHLV*
Ga0137395_1010357413300012917Vadose Zone SoilGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVWMAAMICFRGQFLKTYLISEVILAAPTAYYIAVLAIQHGGDFAPGFKDLLLTIILFAVFSIAPAGLAAWEIARARKTDG*
Ga0137395_1037155113300012917Vadose Zone SoilRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGRFLKAYLIAEIVLAAPTAYYICILAIRHGGDFAPGFKDLVLTVLLFLVFSVVPTGLALRRILVRREKRS*
Ga0137395_1110024523300012917Vadose Zone SoilRLFILDARAGISVLLINYFFFDNKVIYPVEWITAAWLVLMAAMIFFRGKLLKAYLISELVLAAPTAYYIVVLAIQHGGDFAPGFKDLLLTIFLFLVFSVVPAGLAAWEIGRARKSPG*
Ga0137419_1119814013300012925Vadose Zone SoilINHFSFDNKVIHPVEWITAAWLVLMAVMIFFRGKFLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFCVFSLAPIGLALHRIWQRSKTLV*
Ga0137419_1151100323300012925Vadose Zone SoilLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGKFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVVSAVPAGLAAWEIGRTKKTPA*
Ga0137416_1148070013300012927Vadose Zone SoilGIYRWPGGPPFILDPRAGIPVLLINHFSFENKVIYPVEWITAAWLVLMAVMIFFQGKFLKVYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFCVFSLAPIGLALHRIRRKSKFLV*
Ga0137416_1182976513300012927Vadose Zone SoilHFSFDNKVIYPVEWVTATWLVFLATMIFFQGRFLKTYLISEILLAAPTAYYIAVLAIQHGGHFAPGFKDLVLTAILFTIFSIAPMGLAVQRLGARKKQLI*
Ga0153915_1050269623300012931Freshwater WetlandsHAVEWITATWLVFVAAMIFFQGRFLKTYLISELVLGAPTVYYIGVLVTQQGGDFASGFNDLLLTTFLFLAFSVAPAGLAARCIWRARQEPA*
Ga0137410_1183277813300012944Vadose Zone SoilVEWITATWLVIAAAMIFFRGKLLKTYLITEIVLAAPTAYYICVLAIRHGGDFAPGFKDLLLTALLFLAFSLVPAGIAARRILARREARS*
Ga0120123_112115213300013770PermafrostNKVIYPVEWITATWLVLPAAMIFFRGRLLKPYLITELVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLVPAGLAASRVLARRQERC*
Ga0182024_1286055413300014501PermafrostPRAGIAVLLINHFAFDNKVIYPVEWITAAWLVLTAVMIFFRGKFVKAYLISEIILAAPTAYYIGVLAIQHGGDFAPGFKDLVLTTLLFATFSLVPIGLAIHRLKSTG*
Ga0137418_1035364813300015241Vadose Zone SoilPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGRFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWEIGRTKKTPA*
Ga0137409_1134233023300015245Vadose Zone SoilFDNKVIYPVEWITATWLVIAAAMIFFRGKLLKTYLITEIVLAAPTAYYICVLAIRHGGDFAPGFKDLLLTALLFLAFSLVPAGIAARRILARREARS*
Ga0187801_1039520613300017933Freshwater SedimentNKVIYPVEWITATWLVFLAAMIFFRGKLLKTYLISEIVLAAPTAYYISVLAMQHGGDFAPGFKDLVLTAILFTVFSLVPMGLAVQRLWARKI
Ga0187821_1012890923300017936Freshwater SedimentAAAMIFFRGKFLIPYAAAEVVLAAPSAYYIGLLAVRHGGDFAPAFKDLVLTMLLFFVFSLVPLAWAATRILARKRPGAAES
Ga0187821_1019756813300017936Freshwater SedimentDNKVIYPAEWITATWLVLVAAMIFFRGKFVKAYLISEIVLAAPTAYYIGVLAIQHGGDFAPGFKDLVLTALLFAIFSLVPISLAICRLRSTG
Ga0187821_1023210613300017936Freshwater SedimentATWLVFAATMIFFRGAFLKPYLAVEIVLAAPTAYYIGVLAVRHGGDFAPAFIDLLLTVILFLVFSVIPILLAARRILARRRARQE
Ga0187823_1027721723300017993Freshwater SedimentVLLINHFSFDNKVIHPVEWITATWLIIVAAMIFFRAKWILPYLLVELVLAAPTAYYIVVLAIRHGGDFAPAFIDLVITVLLFLVFSVIPIVLAARRLLAQQRTRSAPSEA
Ga0187823_1029118223300017993Freshwater SedimentAGIAILLINHLSFDDKVIYPAEWITATWLVLVAAMIFFRGKFVKAYLISEIVLAAPTAYYIGVLAIQHGGDFAPGFKDLVLTALLFAIFSLVPISLAICRLRSTG
Ga0187804_1023734423300018006Freshwater SedimentVLLINHFSFDNKVIYPVEWITATWLVFLAAMIFFRGKLLKTYLISEIVLAAPTAYYISVLAMQHGGDFAPGFKDLVLTAILFTVFSLVPMGLAVQRLWARKI
Ga0187805_1061835413300018007Freshwater SedimentDNKVIYPVEWITATWLVIAAAMIFFRGRFLKAYLFVEIVLAAPTAYYIGVLARRHGGDFAPAFMDLVLTAILFTVFSLAPVALAVQRIRVRRKSPA
Ga0179594_1000386613300020170Vadose Zone SoilGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGRFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWEIGRTKKTPA
Ga0210407_1057834623300020579SoilPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGRFLRPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSIVPAGLAARRVLARREARS
Ga0210407_1108369913300020579SoilEWITATWLVFLAAMIFFRGRFLKTYLISEIILAAPTAYYIGVLAIQHGGDFAPGFKDLVLTALLFAIFSLAPIGLAVRRLRSKSDVRP
Ga0210399_1134631213300020581SoilPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVFAAAMIFFRGWFLKAYLIAEIFLAAPTAYYIGVLAIRHGGDFAPGFKDLVLTVLLFLVFSLVPMGLATRLVLVRRETRF
Ga0210405_1131309613300021171SoilIPVLLINHFSLDNKVIYPVEWITATWLVIAAGMIFFRGRFLKPYLIAEIFLAAPTAYYICILAIRHGGDFAPGFKDLVLTTLLFLVFSLLPVGLAIMRILARRETPS
Ga0210405_1141361123300021171SoilDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGKLLRAYLVAEIILAAPTAYYICLLAIRHGGDFAPGFKDLVLTSLLFLIFSLIPVGLAARRVLAIRKARS
Ga0210396_1052248823300021180SoilDPRAGIPVLLINHFAFDNKVIYPVEWITATWLVLAAAMIFFRGRFLKAYLIAEVILAAPTAYYIGVLAVRHGGDFAPAFKDLILTSLLFLIFSVIPAWWTLSRLLLQKETRQ
Ga0210383_1103338223300021407SoilIYCWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAIIFFRGRFLRPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFSLFSLIPGGLAVSRILAQRQARL
Ga0210394_1007778713300021420SoilVIYPVEWITAAWLVLLAAVIFFRGQFLKTYLISELVLATPTAYYIGVLATQHGGDFAPGFRDLLLTTFLFTVFSIVPAGLASWEITRSRTCPAR
Ga0210391_1019342113300021433SoilLINHFSLDNKVIYPVEWITATWLVFLAAAIFVRGKLLKTYLISEIVLAAPTAYYIGILAARHGGDFAPGFKDLVITAILFTGFSLVPMALAIQRLRARKKMPA
Ga0210391_1074622813300021433SoilWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVVAAAMIFFRGKFLKAYLIVEIVLAAPTAYYISVLAIRHGGDFAPAFKDLVLTVLLFLVFSLVPATLAARRLLARREAR
Ga0137417_130545763300024330Vadose Zone SoilGIPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGRFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWGIGRTKKTPA
Ga0207686_1100506013300025934Miscanthus RhizosphereGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAGWLVLMAALIFFRGQFLKTYLISELVLAAPTAYYIAVLAIHHGGDFAPGFKDLQLTIILFVVFSVVPVGLAAREIVRSKKARA
Ga0257167_101258323300026376SoilNKVIYPVEWITAAWLVLMAAMIFFRGKFLKVYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFFVFSAVPASLAAWGIGRTKKTPA
Ga0209004_100867613300027376Forest SoilIYRWPGGPPFVLDPRAGIPVLLINHFSLDNKVIYPVEWITAIWLVVAATIIFFRGRLLLAYMISEIVLAAPTAYYIGILAVHRGGHFAPAFIDLVVTVFLFLFFSLVPIGLAAIRIRARRQSPA
Ga0209076_114432813300027643Vadose Zone SoilEWITAAWLVLMAVMIFFRGKFLKAYLISELVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIFLFCVFSLAPIGLALHRIWQRSKTLV
Ga0209118_107473923300027674Forest SoilGGPPFILDPRAGIPILLINHFSFDNKVIYPVEWTTAAWLFLMATMIFFRGKLLKAYLISELVLAAPTAYYIGVLAVQHGGDFAPGFKDLLLTIFLFSVFSAVPAVLAVWCIRRTGAMLA
Ga0209333_108837813300027676Forest SoilIGIYCWPGGPPFILDPRAGIPVLLINRFSFDNKVIYPVEWVTASWLVIAAAMIFFRGKLLKAYLIVEIILAAPTAYYIAILATRHGGDFAPGFKDLVLTAILFTGFSLVPLALAIQRVLRRRKEPV
Ga0209693_1001313553300027855SoilWPSGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFQGRFLKSYLLAEIILAAPTAYYISILAVRHGGDFAPGFKDLVLTALLFLIFSLVPVGLAARRVLARKRLQ
Ga0209167_1039989313300027867Surface SoilDYPVEWITATWLVFAAAMLFFRGKLLRIYLAAEIVLAAPTAYYICVLAIRHGGHFAPARVDLVVTAVLFTVFSIVPVGLASQRLWAKSNKPN
Ga0209167_1071998913300027867Surface SoilLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGKFVKAYLIAEIVLAAPTAYYICVLAMRHGGDFAPGFKDLVLTILLFLVFSLVPGGLAAWRVLARRKGRS
Ga0209579_1066737013300027869Surface SoilNHFSFDNKVIYPVEWITAAWLVIAAAMIFFRGKFLIPYAAAEVVLAAPSAYYIGLLAVRHGGDFAPAFKDLVLTMLLFFVFSLVPLAWAATRILARKRPGAAES
Ga0209068_1033325313300027894WatershedsPVLLINHFSFDNKVIYPVEWITATWLVIAAALIFFRGRLLKPYLIAEIVLGAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPAGLAASRVLARREARS
Ga0209068_1040337413300027894WatershedsIGIYRWPGGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVFMAAMICFRGTFLKTYLISELVLAAPTAYYIVVLAIQHGGDFAPGFKDLLLTIFLFLVFSAVPAGLAAWHIRQTKAVHA
Ga0209488_1104926613300027903Vadose Zone SoilLDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVLIAAMICFRGQFLKTYLISELILAAPTAYYIGVLAIQHGGDFAPGFKDLLLTIILFVVFSVVPAGLAAREIVRARTIQS
Ga0209006_1027154323300027908Forest SoilVLLINRFSFDNKVIYPVEWITATWLVIAAAMIFLRGRFLKPYLIAEIILAAPTAYYIFILAMRHGGDFAPGFKDLVLTAILFSVFSLVPMGWAVQRIVARSK
Ga0209698_1047680623300027911WatershedsLDPRAGIPVLLINHFGFDNKVIYPVEWITALWLVCVSVMIFFRGKFLKAYFISEILLGAPTLYYIGVLVARHGGDFAPGAKDVFLTLAIFAVFSVAPMVLAARRLLK
Ga0209069_1033539923300027915WatershedsVIYPVEWITATWLVIAAALIFFQGRLLKPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSLIPAGLAASRVLARREARS
Ga0307504_1014712423300028792SoilHFSFDNKVIYPVEWITATWLVLAAAIIFFRGTFVKAYLIAEIVLAAPTAYYIGVLAIRHGGDFAPGFKDLVLTLLLFLVFSLVPAVLAARRILAQRKARC
Ga0222749_1017163813300029636SoilLDPRAGIPVLLINHFSFDNKVIYPVEWITATWLVIAAAMIFFRGRFLRPYLIAEIVLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSIVPAGLAARRVLARREARS
Ga0222749_1034780213300029636SoilGIPVLLINHFSLDNKVIYPVEWITATWLVIAAGLIFFRRSFLKPYFIAEIFLAAPTAYYICILAIRHGGDFAPGFKDLVLTTLLFLVFSLLPVGLAIMRILARRETPS
Ga0170834_10980486623300031057Forest SoilYCWPGGPPFILDPRAGIPVLLINHFSFDNKIIYPVEWITATWLVIAAAMIFFKGRLVMEYLIAEIVLAAPTAYYIGVLAVRHGGDFAPAFKDLVLTALLFLVFSLIPIGLVVTRVVARKP
Ga0170824_11705943113300031231Forest SoilGIPVLLINHFSFDNKVIYPVEWITATWLVFAATIIFFRGGFLKSYLIAEIFLAAPTAYYICVLAIRHGGDFAPGFKDLVLTVLLFLVFSMIPMGIAVRRTLARREARS
Ga0302324_10110260113300031236PalsaAGIPVLLINHFSFDNKIIYPVEWITATWLAFLAGMIFFRGKLLKTYLISEIILAAPTAYYIGVLAIQHGGDFAPGFQDLLLTAILFTIFSLVPMALAVQRLLSRRKQRL
Ga0302326_1115918813300031525PalsaLDPRAGIPVLLINHFSFDNKVIYPVEWISATWLVIVAAMIFFRGKFVRAYLLAEILLAAPTAYYICVLAIRHGGDFAPAFKDLVITLLLFLVFSVIPGGLAIRRILARRRMPS
Ga0302326_1193969023300031525PalsaHFSFDNKIIYPVEWITVTWLVFVAAMIFFRGKLLKTYLISEIILAAPTAYYIGVLAIQHGGDFAPGFQDLLLTAILFTIFSLVPMALAVQRLLSRRKQRH
Ga0310686_11265083713300031708SoilLDPRAGIPVLLINHFAFDNKVIYPVEWITATWLVIAAAMIFFRGRLLKPYLIAEIVLAAPTAYYICVLAIRHGGDFAPAFKDLVLTVLLLLIFSLVPAGLAWRRVLARRPARS
Ga0307473_1129936213300031820Hardwood Forest SoilGIYCWPSGPPFILDPRAGIPVLLINHFTFDNKVIYPVEWITATWLVIAAAMIFFRGRFLKAYLIAEIVLATPTAYYICVLAIRHGGDFAPAFKDLVLTALLFLIFSVVPVGLAVRRVLAQRELQP
Ga0307478_1106755213300031823Hardwood Forest SoilNKVIYPVEWISATWLVIAAAMIFFRGRFLKPYLIAEIVLAAPTAYYMSVLAMRHGGDFAPGFKDLVLTAILFTGFSLVPMGLAVRLVIAKSGRRVFL
Ga0307471_10263931823300032180Hardwood Forest SoilGPPFILDPRAGIPVLLINHFSFDNKVIYPVEWITAAWLVLMAAMIFFRGKLLKAYLISEFVLAAPTAYYIGVLAIQHGGDFAPGFKDLLLTILLFCLFSLAPAGLAVWEIAGTK
Ga0307472_10138531723300032205Hardwood Forest SoilDPRAGIPVLLINHFSFDNKVIHPVEWITATWLVIAAAIIFFRGKLLRPYLIAEIVLGAPTAYYIFILAIRHGGDFAPGFKDLVLTVLLFLVFSVVPAGLAGWRVLARSDKRS
Ga0348332_1057000233300032515Plant LitterVIYPVEWITATWLVFLAAIIFFRGKLLKTYLVSEIVLAAPTAYYISVLAKRHGGDFAPGFKDLVLTAILFTIFSLAPLGFVILRLRARRKLPA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice