NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F088395

Metagenome Family F088395

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F088395
Family Type Metagenome
Number of Sequences 109
Average Sequence Length 90 residues
Representative Sequence MPTKSINIGGQPTDKTTAPVTASPAGDAEFCDSPGAFARFGLRRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRSYLASQMEAGK
Number of Associated Samples 71
Number of Associated Scaffolds 109

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 21.30 %
% of genes near scaffold ends (potentially truncated) 22.94 %
% of genes from short scaffolds (< 2000 bps) 86.24 %
Associated GOLD sequencing projects 62
AlphaFold2 3D model prediction Yes
3D model pTM-score0.53

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (54.128 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(25.688 % of family members)
Environment Ontology (ENVO) Unclassified
(38.532 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(65.138 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 26.27%    β-sheet: 8.47%    Coil/Unstructured: 65.25%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.53
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 109 Family Scaffolds
PF01807zf-CHC2 3.67
PF00589Phage_integrase 3.67
PF06778Chlor_dismutase 1.83
PF02801Ketoacyl-synt_C 0.92
PF00202Aminotran_3 0.92
PF00011HSP20 0.92
PF09250Prim-Pol 0.92
PF13548DUF4126 0.92
PF13155Toprim_2 0.92
PF13148DUF3987 0.92
PF04055Radical_SAM 0.92
PF03772Competence 0.92

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 109 Family Scaffolds
COG0358DNA primase (bacterial type)Replication, recombination and repair [L] 3.67
COG3253Coproheme decarboxylase/chlorite dismutaseCoenzyme transport and metabolism [H] 1.83
COG0071Small heat shock protein IbpA, HSP20 familyPosttranslational modification, protein turnover, chaperones [O] 0.92
COG0658DNA uptake channel protein ComEC, N-terminal domainIntracellular trafficking, secretion, and vesicular transport [U] 0.92


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A54.13 %
All OrganismsrootAll Organisms45.87 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090014|GPIPI_16535657All Organisms → cellular organisms → Bacteria4107Open in IMG/M
2088090014|GPIPI_17452081All Organisms → cellular organisms → Bacteria3011Open in IMG/M
2170459003|FZ032L002ICKEJNot Available531Open in IMG/M
2170459019|G14TP7Y02HXS96Not Available645Open in IMG/M
2228664022|INPgaii200_c0904374All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium1571Open in IMG/M
2228664022|INPgaii200_c0972676Not Available588Open in IMG/M
2228664022|INPgaii200_c0974310All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → unclassified Candidatus Udaeobacter → Candidatus Udaeobacter sp.1184Open in IMG/M
3300000033|ICChiseqgaiiDRAFT_c0697363All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium1293Open in IMG/M
3300000033|ICChiseqgaiiDRAFT_c2020601Not Available568Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_100502074Not Available573Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101653628All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Chthoniobacter → Chthoniobacter flavus517Open in IMG/M
3300000364|INPhiseqgaiiFebDRAFT_101771859Not Available1351Open in IMG/M
3300000550|F24TB_16312471Not Available1485Open in IMG/M
3300000787|JGI11643J11755_11728099Not Available931Open in IMG/M
3300000955|JGI1027J12803_100090776All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium 13_2_20CM_55_101842Open in IMG/M
3300000955|JGI1027J12803_100351378All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Opitutus → Opitutus terrae1199Open in IMG/M
3300000955|JGI1027J12803_100521100All Organisms → cellular organisms → Bacteria977Open in IMG/M
3300000955|JGI1027J12803_100741926All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Opitutae → Opitutales → Opitutaceae → Opitutus → Opitutus terrae588Open in IMG/M
3300000955|JGI1027J12803_100853851Not Available1026Open in IMG/M
3300000955|JGI1027J12803_101047389All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → unclassified Candidatus Udaeobacter → Candidatus Udaeobacter sp.2040Open in IMG/M
3300000955|JGI1027J12803_102111840All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia1335Open in IMG/M
3300000955|JGI1027J12803_102421245Not Available586Open in IMG/M
3300000955|JGI1027J12803_102929100Not Available686Open in IMG/M
3300000955|JGI1027J12803_103610833Not Available1619Open in IMG/M
3300000955|JGI1027J12803_105864891Not Available646Open in IMG/M
3300000956|JGI10216J12902_118862129All Organisms → cellular organisms → Bacteria891Open in IMG/M
3300002899|JGIcombinedJ43975_10008786All Organisms → cellular organisms → Bacteria1628Open in IMG/M
3300004156|Ga0062589_101364697All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae → Candidatus Udaeobacter → unclassified Candidatus Udaeobacter → Candidatus Udaeobacter sp.689Open in IMG/M
3300004156|Ga0062589_102026560Not Available584Open in IMG/M
3300004463|Ga0063356_103386658All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium687Open in IMG/M
3300004480|Ga0062592_101963692Not Available577Open in IMG/M
3300004643|Ga0062591_100857943Not Available846Open in IMG/M
3300004643|Ga0062591_102214421Not Available572Open in IMG/M
3300005172|Ga0066683_10287453All Organisms → cellular organisms → Bacteria1021Open in IMG/M
3300005289|Ga0065704_10521277Not Available653Open in IMG/M
3300005294|Ga0065705_10188872All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium1478Open in IMG/M
3300005332|Ga0066388_100000199All Organisms → cellular organisms → Bacteria24539Open in IMG/M
3300005332|Ga0066388_100511303All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium1836Open in IMG/M
3300005332|Ga0066388_100516288All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1829Open in IMG/M
3300005332|Ga0066388_103588444All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → unclassified Blastocatellia → Blastocatellia bacterium792Open in IMG/M
3300005332|Ga0066388_107227314Not Available558Open in IMG/M
3300005332|Ga0066388_108438489Not Available513Open in IMG/M
3300005332|Ga0066388_108720799Not Available504Open in IMG/M
3300005347|Ga0070668_101102932Not Available716Open in IMG/M
3300005439|Ga0070711_100523025Not Available981Open in IMG/M
3300005467|Ga0070706_100376082All Organisms → cellular organisms → Bacteria1323Open in IMG/M
3300005468|Ga0070707_100301083All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1558Open in IMG/M
3300005598|Ga0066706_10925859Not Available677Open in IMG/M
3300005764|Ga0066903_100117789All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium3676Open in IMG/M
3300005764|Ga0066903_100194432All Organisms → cellular organisms → Bacteria3021Open in IMG/M
3300005764|Ga0066903_100791346All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium 13_2_20CM_55_101694Open in IMG/M
3300005764|Ga0066903_101025079All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1512Open in IMG/M
3300005764|Ga0066903_101038249Not Available1503Open in IMG/M
3300005764|Ga0066903_101175022All Organisms → cellular organisms → Bacteria1422Open in IMG/M
3300006046|Ga0066652_100267644All Organisms → cellular organisms → Bacteria1504Open in IMG/M
3300006881|Ga0068865_102020810Not Available523Open in IMG/M
3300009137|Ga0066709_100414210All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1873Open in IMG/M
3300009137|Ga0066709_102456462Not Available705Open in IMG/M
3300009137|Ga0066709_103389004Not Available579Open in IMG/M
3300010046|Ga0126384_11608430Not Available612Open in IMG/M
3300010047|Ga0126382_10581783Not Available918Open in IMG/M
3300010358|Ga0126370_11875588Not Available582Open in IMG/M
3300010359|Ga0126376_12084739Not Available610Open in IMG/M
3300010360|Ga0126372_11118597All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium807Open in IMG/M
3300010361|Ga0126378_12757942Not Available561Open in IMG/M
3300010376|Ga0126381_100338846All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium2074Open in IMG/M
3300010376|Ga0126381_100451911All Organisms → Viruses → Predicted Viral1802Open in IMG/M
3300010376|Ga0126381_100732053Not Available1416Open in IMG/M
3300012198|Ga0137364_10846844Not Available691Open in IMG/M
3300012199|Ga0137383_11039053Not Available596Open in IMG/M
3300012200|Ga0137382_10949608Not Available618Open in IMG/M
3300012205|Ga0137362_11282505Not Available618Open in IMG/M
3300012208|Ga0137376_10406735All Organisms → cellular organisms → Bacteria1182Open in IMG/M
3300012208|Ga0137376_10646309Not Available914Open in IMG/M
3300012209|Ga0137379_10413437Not Available1258Open in IMG/M
3300012211|Ga0137377_11231047Not Available678Open in IMG/M
3300012353|Ga0137367_10336037Not Available1076Open in IMG/M
3300012356|Ga0137371_10906952All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium670Open in IMG/M
3300012357|Ga0137384_10658082Not Available852Open in IMG/M
3300012361|Ga0137360_10340974Not Available1253Open in IMG/M
3300012532|Ga0137373_10058954All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria3526Open in IMG/M
3300014501|Ga0182024_11831429All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Verrucomicrobiae → Verrucomicrobiales → Verrucomicrobia subdivision 3 → unclassified Verrucomicrobia subdivision 3 → Verrucomicrobia subdivision 3 bacterium678Open in IMG/M
3300015371|Ga0132258_10344898All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia3680Open in IMG/M
3300015374|Ga0132255_103861654All Organisms → cellular organisms → Bacteria637Open in IMG/M
3300016319|Ga0182033_10897533Not Available785Open in IMG/M
3300018051|Ga0184620_10299158Not Available543Open in IMG/M
3300018431|Ga0066655_10261848All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium1110Open in IMG/M
3300018482|Ga0066669_10036505All Organisms → cellular organisms → Bacteria2968Open in IMG/M
3300018482|Ga0066669_10762440All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium855Open in IMG/M
3300020579|Ga0210407_10000074All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → Chthoniobacteraceae132738Open in IMG/M
3300021168|Ga0210406_10058171All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium3360Open in IMG/M
3300021560|Ga0126371_13649137Not Available519Open in IMG/M
3300022756|Ga0222622_10005174All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia5832Open in IMG/M
3300025463|Ga0208193_1073204Not Available699Open in IMG/M
3300025910|Ga0207684_10420833All Organisms → cellular organisms → Bacteria1148Open in IMG/M
3300025915|Ga0207693_10955592Not Available656Open in IMG/M
3300025916|Ga0207663_10501724Not Available943Open in IMG/M
3300025938|Ga0207704_11886076Not Available514Open in IMG/M
3300026330|Ga0209473_1072298All Organisms → cellular organisms → Bacteria1466Open in IMG/M
3300027875|Ga0209283_10847146Not Available558Open in IMG/M
3300031231|Ga0170824_108191905Not Available1221Open in IMG/M
3300031231|Ga0170824_112000603Not Available873Open in IMG/M
3300031446|Ga0170820_14750554Not Available853Open in IMG/M
3300031573|Ga0310915_10990619Not Available587Open in IMG/M
3300031945|Ga0310913_10372841All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → Spartobacteria → Chthoniobacterales → unclassified Chthoniobacterales → Chthoniobacterales bacterium1011Open in IMG/M
3300031945|Ga0310913_10376611Not Available1006Open in IMG/M
3300032001|Ga0306922_12163113Not Available537Open in IMG/M
3300033405|Ga0326727_10129058All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium3114Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil25.69%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil12.84%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil11.93%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil9.17%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.50%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere5.50%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil4.59%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil4.59%
Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil2.75%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.83%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere1.83%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere1.83%
PeatlandEnvironmental → Aquatic → Freshwater → Wetlands → Unclassified → Peatland0.92%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.92%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.92%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.92%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.92%
Grass SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grass Soil0.92%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost0.92%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.92%
Peat SoilEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil0.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere0.92%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.92%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere0.92%
Switchgrass, Maize And Mischanthus LitterEngineered → Solid Waste → Grass → Composting → Unclassified → Switchgrass, Maize And Mischanthus Litter0.92%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090014Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
2170459003Grass soil microbial communities from Rothamsted Park, UK - March 2009 indirect MP BIO 1O1 lysis 0-21cmEnvironmentalOpen in IMG/M
2170459019Litter degradation MG4EngineeredOpen in IMG/M
2228664022Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000550Amended soil microbial communities from Kansas Great Prairies, USA - BrdU amended with acetate total DNA F2.4 TB clc assemlyEnvironmentalOpen in IMG/M
3300000787Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000955Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300002899Soil microbial communities from Manhattan, Kansas, USA - Combined assembly of Kansas soil 100-500um Nextera (ASSEMBLY_DATE=20140607)EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005289Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2Host-AssociatedOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005332Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300005347Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaGHost-AssociatedOpen in IMG/M
3300005439Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaGEnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300005764Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006881Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2Host-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010046Tropical forest soil microbial communities from Panama - MetaG Plot_36EnvironmentalOpen in IMG/M
3300010047Tropical forest soil microbial communities from Panama - MetaG Plot_30EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010360Tropical forest soil microbial communities from Panama - MetaG Plot_6EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300010376Tropical forest soil microbial communities from Panama - MetaG Plot_28EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016319Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00HEnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025463Peatland microbial communities from Minnesota, USA, analyzing carbon cycling and trace gas fluxes - June2015DPH_17_10 (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025915Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025916Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026330Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300031231Coassembly Site 11 (all samples) - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031446Fir Summer Coassembly Site 11 - Champenoux / Amance forestEnvironmentalOpen in IMG/M
3300031573Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300033405Lab enriched peat soil microbial communities from McLean, Ithaca, NY, United States - MB29MYEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
GPIPI_033080702088090014SoilMSKKPINAEGDTAKTIAPVTSLANDAEFCDSVGAYDRFGIKRSLLYELHAQGLIRGCSLRRRGLTRGKRLWSIDSIREFLNSQMERGAK
GPIPI_029549702088090014SoilMPTKSINIGGQPTDKTTAPVTASPAGDAEFCDSPGAFARFGLRRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRSYLASQMEAGK
E4A_082231202170459003Grass SoilMSAKSINIGEQLTDKTTAPVTASPANDAEFCDNPGAYVRFGLRRSLLYQLYSEGLIDGVSLRRRGAARGKRLWSCDSIRR
4MG_019378902170459019Switchgrass, Maize And Mischanthus LitterLLTDENASVTTKPVTASPAFDAEFCDSPGAFERFALRRSHLYQLHKDGLIQGVSLRRRGAARGKRLWSIDSIRSYLESQME
INPgaii200_090437432228664022SoilMSKMPINAAGDTAKTIAPVQTSPANDAEFCDSLGAYHRFGLKRSLLYELHTQGLIEGVSLRKRGKLRGKRLWSIDSIRAFLKSQMRRELR
INPgaii200_097267612228664022SoilMPLDTKKLGRQLTDKTTAPVVASPASDAEFCDSPGAFQRFGLRRSLLYELHKLGLIQGVSLRRRGALRGKRLWNCDSIRSYLHEQMEA
INPgaii200_097431022228664022SoilMTVKSDNAGGNAQNTVAPVTAFPANDAEFCDSPGAFVRFGVRRSLLYELYAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLTSQMENTK
ICChiseqgaiiDRAFT_069736323300000033SoilVKSTSYNIGGQSTDKTTSPVAASSAGDAEFCDSPGAFQRFGLRRSLLYELHKLGLIQGVSLRRRGALRGKRLWNCDSIRSYLHEQMEAGK*
ICChiseqgaiiDRAFT_202060113300000033SoilMVISNKRNIGGQQTETTTDPVQASPAFDAEFCDSPGAFYRFGMRRSLLYELHAEGLIDGCSLRRRGKQRGKRLWSIPSIRAYLATQMDGRAVK*
INPhiseqgaiiFebDRAFT_10050207423300000364SoilMQTSISVGGQTDTTTTAPVTASPDSNAEFCDTPSAYTRFGLRRSLLYELLRQNLIAGVSLRRRGALRGKRLWSIDSIRRYLVSQMEGGK*
INPhiseqgaiiFebDRAFT_10165362813300000364SoilMLLDIKKIGRQETDKTTXPVXASXXXDAXFCDSPGAFQRFGLRRSLLYELHKLGLIKGVSLRRRGTARGKRLWSIDSIRSYLRAQMDGRAAK*TRTPHQEKR
INPhiseqgaiiFebDRAFT_10177185913300000364SoilMLLDTKFGEQITETTTAPVQASPAFEAEFCDNGGAYLRFGIRRSLLYQLHKEGLIDGVSLRRRGAARGKRLWSCDSIRRYLASQMEAGK*
F24TB_1631247133300000550SoilMPTKSINIGAQPTDKTTGPVTASPAGDAEFCDSPGAFARFGLRRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRXXXXXXXXXVK*
JGI11643J11755_1172809923300000787SoilMNSIDYKAEGGTAKTTHPVAAWPANHAEFCDSPGAYHLFGLKRSKLYELSASGAIKGVSLRKRGATKGKRLWSCDSIRSYLASQMESKEAGQAS*
JGI1027J12803_10009077633300000955SoilMSKKPINAEGDTAKTIAPVTSLANDAEFCDSVGAYDRFGIKRSLLYELHAQGLIRGCSLRRRGLTRGKRLWSIDSIREFLNSQMERGAK*
JGI1027J12803_10035137833300000955SoilMSKKPINAAGETAKTIAPVQASPVNDAEFCDNPGAFVRFGLKRSFLYGLYEQGLIKGVSLRKRGAARGKRLWSIDSIRSYLASQMGSVE*
JGI1027J12803_10052110023300000955SoilMNSIDYKAEGGTAKITHPVAAWPANHAEFCDSPGAYHLFGLKRSKLYELSASGAIKGVSLRKRGATKGKRLWSCDSIRSYLASQMESKEAGQAS*
JGI1027J12803_10074192613300000955SoilMTSIESGGHPATTAPVQASPANDAEFCDSPGAFLRFGLRRSLLYELHAQGLIQGVSLRRRGAARGKRLWSIDSIRSYLASQMESTK*
JGI1027J12803_10085385123300000955SoilMTWWCVGILVCLVKSTRTKIGRQPTDKTTAPVAASPASDAEFCDSSGAFARFGLGRTLLYELKGLGLIEGVSLRRRGAARGKRLWSIDSIRSYLASQMDKGAAP*
JGI1027J12803_10104738923300000955SoilMTVKSDNAGGNAQNTVAPVTAFPANDAEFCDSPGAFVRFGVRRSLLYELYAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLTSQMENTK*
JGI1027J12803_10211184033300000955SoilMSRTPINAGGETAKTIAPVQASPANDAEFCDSLGAFVRFGLRRSLLYELYAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLHDQMEVAK*
JGI1027J12803_10242124523300000955SoilMPINAAGDTAKTIAPVQTSPANDAEFCDSLGAYHRFGLKRSLLYELHTQGLIEGVSLRKRGKLRGKRLWSIDSIRAFLKSQMRRELR*
JGI1027J12803_10292910023300000955SoilMHSIDYTNAEGGTAHTTAPVAASPASDAEFCDSPGAFYRFGLRRSMLYELDARGLIKSCSLRKRGATKGKRLWNCDSIRTYLSSQMHGAQ*
JGI1027J12803_10361083343300000955SoilMSLDTKKIGQQKTEKTTAPVTASPASDAEFCDSLGAFARFGLRRSLLYDLHAHGHIQGVSLRRRGAQRGKRLWSVDSIRSYLAAQMEGGKE*
JGI1027J12803_10586489123300000955SoilMLLDIKKIGRQETDKTTAPVIASLASDAEFCDSPGAFQRFGLRRSLLYELHHLGLIKGVSLRRRGTTRGKRLWSIDSIRD
JGI1027J12803_10752577713300000955SoilSQKTETTTNPVEASPDNDAEFCDSPGAFHRFGMRRSLLYELRAEGLIDGCSLRRKGRQRGKRLWSVPSI
JGI10216J12902_11886212913300000956SoilNKRIRDERTETVTTNPITASPTNGAAAEFCDSGGAFLRFGLRRSLLYDLHKLGLIKGVSLRRRGTARGKRLWSIDSIRKYLASQMENERK*
JGIcombinedJ43975_1000878633300002899SoilMLKNSINAAGETAKTIAPVQASAANDAEFCDSPGAFVRFGLRRSLLYELYALGLIKGVSLRRRGAARGKRLWSIDSIRSYLREEMETAK*
Ga0062589_10136469713300004156SoilHIMVSTRSSSRQLSDYTTSPVTVSPANDAEFCDSPGAFFRFGLKRSMLYELKARGLIKGVSLRKRGATKGKRLWSCDSIRSYLASQMESKEAGQAS*
Ga0062589_10202656023300004156SoilMPKNSINAEGGTAHTIAPVTASPANDAEFCDSPGAFARFGLRRSLLYELHAEGAIKGVSLRRRGAMRGKRLWSIDSIRCYLASQMSSAI
Ga0063356_10338665813300004463Arabidopsis Thaliana RhizosphereGILLLSVMKCKIADNGGRQRTATTTAPVAASPVSDAEFCDCAGAFARFGLRRSLLYELHSLGLIKGVSLRRRGAARGKRLWSIDSIRSYLESQMENGGRA*
Ga0062592_10196369213300004480SoilMAVMKNSHDKGGNARLTIAPLAASAANDAEFCDSPGAFCRFGLRRSMLYALSARGLIKGVSLRKRGAMKGKRLWSCDSIRSYLASQMEGDNEAT*
Ga0062591_10085794313300004643SoilMKNSYDKGGNTRITIAPLAASAANDAEFCDSPGAFCRFGLRRSMLYALSARGLIKGVSLRKRGAMKGKRLWSCDSIRSYLASQMEGDNEAT*
Ga0062591_10221442113300004643SoilMRKTSSQKHGGQQTATTTAPVQASLVNDAEFCDCAGAFARFGLRRSLLYELHSLGLIKGVSLRRRDTARGKRLWSIDSIRSYLHEQMGATK*
Ga0066683_1028745323300005172SoilMSIPSNNSGGHPQTIAPVTTSLSNDTEFCDSPGAFLRFGLKRSMLYELNARGLIKGCSLRKRGATKGKRLWSCDSIRSYLASAMKDAK*
Ga0065704_1052127713300005289Switchgrass RhizosphereMSAKSINKVGGQITDKTTAAVTASPANDVEFCDSPGAFLRFGLRRSLLYELHAEGLIQGVSLRRRGSVRGKRLWSIDSIRKYLASQMQNGGDA*
Ga0065705_1018887223300005294Switchgrass RhizosphereMSAKSINKVGGQITDKTTAAVTASPANDVEFCDSPGAFLRFGLRRSLLYELHAEGLIQGVSLRRRGTARGKRLWSIDSIRSYLNSQMNGRATK*
Ga0066388_10000019943300005332Tropical Forest SoilMKHSDRKLGRQLTDKTTALAQASPASDAEFCDSPGAFRRFGLRRSLLYDLHALGLIKGVSLRRRGTTRGKRLWSIDSIRSYLTSQMEGGDEIP*
Ga0066388_10051130323300005332Tropical Forest SoilMTAKSKDAGGNARVTIAPVAASPANDAEFCDSPGAFHRFGLKRSMLYDLSARGLIKGVSLRKVGSTKGKRLWACDSIRTYLLSQMESLK*
Ga0066388_10051628823300005332Tropical Forest SoilVKSTSQNIGWQPTDKTTAPVTASPFFDAEFCDSPGAFHRFARKRTYLYELERLGLIRGVSLRKKGAARGKKLWSIASIRAYLESQMKNEGEG*
Ga0066388_10358844423300005332Tropical Forest SoilMSKKPTNAGGETAKTIAPVKASPANDAEFCDSPGAFARFGLRRSLLYDLHSKRYIRGVSLRKRGTTRGKRLWSIDSIRSYLGSQMDAAIEGGGK*
Ga0066388_10722731423300005332Tropical Forest SoilMNSNDKVGNARLTIAPLAASAANDAEFCDSPGAFHRFGLKRSMLYALSGRGLIKGVSLRKRGSSKGKRLWSCDSIRAFLHKQMEGDNEAT*
Ga0066388_10843848913300005332Tropical Forest SoilMSLDTEKLQKLGEQIAEKTAAPVQASPASEAEFCDNGGAYLRFGIRRSLLYQLHKEGLIDGVSLRRRGAARGKRLWSCDSIRRYLASQMKGAK*
Ga0066388_10872079913300005332Tropical Forest SoilMPAKSINIGGQLTEQTTAPVQASPAYDAEFCDCAGAFARFGLKRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRSYLASQMENGGDAK*
Ga0070668_10110293223300005347Switchgrass RhizosphereMKRTSDTKIGGQRTDKTTAPVTASPASDAEFCDSPGAFARFGLRRSLLYELHSLGLIKGVSLRRRGAARGKRLWSIDSIRSYLDSQMDATAAK*
Ga0070711_10052302513300005439Corn, Switchgrass And Miscanthus RhizosphereMSLDSKKIGGEPTDTTTLPVTPSPANDGEFCDSPRAFQRFGLRRSLLYALAAEGHIAGCSLRRRGRQRGKRLWSIDSIRRYLASQMDGRAVK*
Ga0070706_10037608223300005467Corn, Switchgrass And Miscanthus RhizosphereMLANSLRNGGYSETTAPVTASPANDAEFCDSGGAFVRFGLRRSLLYDLHALGLIKGVSLRRRGAARGKRLWSIDSIRLYLASQMESGK*
Ga0070707_10030108333300005468Corn, Switchgrass And Miscanthus RhizosphereMRVDTKTSGMSNNISNIAVSGTAKTTAPVEASPANDAEFCDSPGAFVRFGLRRSLLYDLYGQGLIRGVSLRRRGAARGKRLWSIDSIRCYLASQMDSAKEVRNEAH*
Ga0066706_1092585923300005598SoilMSNSNNKGGNVRLTIAPLAASAANDAEFCDSPGAFYRFGLRRSMLYELSARGLIKGCSLRKRGA
Ga0066903_10011778943300005764Tropical Forest SoilMAVMKNSPDKGGNARLTIAPLAVSAANDAEFCDSPGAFYRFGLKRSMLYELSARGLIRGVSLRKRGSSKGKRLWSCDSIRAYLTSQMENPK*
Ga0066903_10019443213300005764Tropical Forest SoilMPAKSINNGGRKTDRTTEPIAASPASDAEFCDSPGAFQRFGLRRSLLYELHSLGLIRGVSLRRRGAARGKRLWSIDSIRSYLNSQME
Ga0066903_10079134633300005764Tropical Forest SoilMVLAMPKNSINAAGDTAKTIAPVQASPVNDAEFCDSPGAFLRFGLRRSLLYELDADGLIKGVSLRRRGAARGKRLWSIDSIRSYLASQMDSTIEGGAK*
Ga0066903_10102507923300005764Tropical Forest SoilVKTLSNKKSGGQQTANTTAPVQASPANGADAEFCDSLGAFLRFGLRRSLLYELHHLGLIKGVSLRRRGTARGKRLWSIESIRAYLAAQMDNGGEAK*
Ga0066903_10103824923300005764Tropical Forest SoilMSKMSLDTKKIGRQETDKTTAPVTASPASDAEFCDSPGAFARFGLRRSLLYELHHLGLIKGVSLRRRGTARGKRLWSIDSIRDYLVSQMENGSEP*
Ga0066903_10117502213300005764Tropical Forest SoilMSKMSLDTKKIGRQETGKTTAPVIASPVSDAEFCDSPGAFYRFGLRRSLLYELHHLGLIKGVSLRRRGAARGKRLWSIDSIRAYLASQMENGGTA*
Ga0066652_10026764423300006046SoilMPTKSINIGGQLTDKTTAPIAVSPVNDAEFCDSFGAFVRFGLRRSLLYELHAQGLIRGCSLRRRGAVRGKRLWSIDSIRSYLASQMDGRATP*
Ga0068865_10202081023300006881Miscanthus RhizosphereMSLDTKKIGGEPTDTTTLPVTPSPAHDGEFCDSPGAFQRFGLRRSLLYALAADGHIAGCSLRRRGRQRGKRLWSIASIRRYLASQMEADK*
Ga0066709_10041421013300009137Grasslands SoilMSNSNNKGGNVRLTIAPLAASAANDAEFCDSPGAFYRFGLRRSMLYELSARGLIKGCSLRKRGATKGKRLWSCDSIRAYIASQIEA
Ga0066709_10245646223300009137Grasslands SoilMSTKSIDLGGQLTEKTTAPVAASPANDAEFCDSPGAFLRFGLRRSLLYELHAQGLIQGVSLRRRGAARGKRLWSIDSIRSYLASQMQGAK*
Ga0066709_10338900413300009137Grasslands SoilMPRKFIGERQTETTTAPVQASPANDAEFCDCAGAFARFGLKRSLLYELHALGLIKGVSLRRRGTARGKRLWSIDSIRSYLQSQMERAE*
Ga0126384_1160843013300010046Tropical Forest SoilMAVMKNSPDKGGNARLTIAPLAVSAANDAEFCDSPGAFCRFGLKRSMLYELSARGLIRGVSLRKRGSSKGKRLWSCDSIRAYLTSQMENPK*
Ga0126382_1058178323300010047Tropical Forest SoilMLQKAISTGGHPKTTALVQASPANDAEFCDSLGAFVRFGLRRSLLYDLNAQGLIKGMSLRRRGAARGKHFGASDSIRSYLASQMEAAK*
Ga0126370_1187558813300010358Tropical Forest SoilMSKMSLDVKKIGRQETDKTTALLIASPASDAEFCDSPGAFARFGLRRSLLYELHHLGLIKGVSLRRRGTARGKRLWSIDSIRDYLVSQMENGSEP*
Ga0126376_1208473923300010359Tropical Forest SoilLVKTTSHLGRRQTDITTVPVQASPAYDAEFCDCAGAFARFGLRRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRAYLASQMENGGRA*
Ga0126372_1111859713300010360Tropical Forest SoilMNSNDKGDNARLTIAPLAASAANDAEFCDSPGAFYRFGLKRSMLYELSARGLIRGVSLRKRGSSKGKRLWSCDSIRAYLTSQMENPK*
Ga0126378_1275794223300010361Tropical Forest SoilMSKNINNFAEGGTAQTIAPVQASPANDAEFCDSLGAFVRFGLRRSLLYDLYAQGLIKGVSLRRRGAVRGKRLWSIDSIRAYLREEMEAAK
Ga0126381_10033884623300010376Tropical Forest SoilMSSNTINGGGGHPQITIAPVQASPANDAEYCDSFGAFVRFGLRRSLLYDLHAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLASQMEVAK*
Ga0126381_10045191133300010376Tropical Forest SoilMISAGGKTETTTAPVKASPAYDAEFCDCAGAFARFGLKRSLLYELHSLGLIKGVSLRRRGTARGKRLWSIDSIRNYLASQMEGAK*
Ga0126381_10073205323300010376Tropical Forest SoilVKSTSQKIGWQPTDKTTAPVTASPFFDAEFCDSPGAFHRFALKRTYLYQLERLGLIRGVSLRKKGAARGKKLWSIASIRAYLESQMKNEGEG*
Ga0137364_1084684423300012198Vadose Zone SoilVHIQVDTKMSGMSNNISNIAVSGTAKTTVPVQASPVNDAEFCDSPGAFLRFGLRRSLLYDLYGHGLIRGVSLRRRGAARGKRLWSIDSIRCYLASQMDSAIEGGTK*
Ga0137383_1103905333300012199Vadose Zone SoilAVMKNSYDTGGNARRTIAPVTASPANDAEFCDSPGAFHRFGLKRSLLYELSARGLIKGVSLRKRGATKGKRLWSCDSIRAYLAKQMQGAQ*
Ga0137382_1094960823300012200Vadose Zone SoilMSLDTEKLQKLGEQIAEKTAAPVQASPASEAEFCDNGGAYLRFGIRRSLLYQLHKEGLIDGVSLRRRGAARGKRLWSCDSI
Ga0137362_1128250523300012205Vadose Zone SoilMKNSNDKGGNARLTIAPLAASAANDAEFCDSPGAFYRFGLKRSMLYELSARGLIKGISLRKRGAAKGKRLWSCDSIRTYLSSQMKAAK*
Ga0137376_1040673513300012208Vadose Zone SoilMKKTSNVGAQSDMTTAPVVASPLNDAEFVDSPGAFVRFGLRRSLLYALHREGLISGVSLRRKGTVRGKRLWSCDSIREFLRRQMETENETS*
Ga0137376_1064630923300012208Vadose Zone SoilMRISKSRRQPDLTTAPVSASPVNGAEWCDSPGAKELFGLKRSMLYELLARGAIAGCSLRKRGAVKGKRLWNCDSIRRYLQSQMERAK*
Ga0137379_1041343733300012209Vadose Zone SoilMKNSYDTGGNARRTIAPVTASPANDAEFCDSPGAFHRFGLKRSLLYELSARGLIKGVSLRKRGATKGKRLWSCDSIRAYLAKQMQGAQ*
Ga0137377_1123104723300012211Vadose Zone SoilMKKTSNVGAQSDMTTAPVVASPLNDAEFVDSPGAFVRFGLRRSLLYALHREGLISGVSLRRKGTVRGKRLWSCDSIREFLLRQMETDNETS*
Ga0137367_1033603723300012353Vadose Zone SoilMPAKSIDLGGQRTEQTTAPVQASPANDAEFCDSPGAFARFGLKRTLLYELHSLGLIKGVSLRRRGKTRGKRLWSIDSIRSYLNSQMDNGGAP*
Ga0137371_1090695223300012356Vadose Zone SoilMKNSNDKGGNAQLTIAPLAASAANDAEFCDSPGAFYRFGLRRSLLYDLHAQGLIEGVSLRRRGAARGKRLWSIDSIRSYLAAQMESATK*
Ga0137384_1065808223300012357Vadose Zone SoilMKNSYDTGGNARRTIAPVTASPANDAEFCDSPGAFHRFGLKRSLLYELSARGLIKGVSLRKRGATKGKRLWSCDSIRAY
Ga0137360_1034097423300012361Vadose Zone SoilMPAKSIDLGGQRTEQTAAPVQASPANDAEFCDSPGAFIRFGLRRSLLYDLYAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLASQMDTAK*
Ga0137373_1005895433300012532Vadose Zone SoilMKGSSKQTGEQITGLTTAPVSASPENDAEFCDCKGAFSRFGLRRSLLYELHSLGLIKGVSLRRRGAVRGKRLWSVDSIRAFLREQMEGSNGAG*
Ga0182024_1183142923300014501PermafrostMTPNVTDAAGGTAQTTAPIAVSSAPDAEFCDSPGAKARFGLGRTYLYQLLEQGLIKGVSLRKRGQTKGKRLWYVDSIRRYLHSQMEAGN*
Ga0132258_1034489863300015371Arabidopsis RhizosphereMSLDSNKIGRQETDKTTAPVTVSPASDAEFCDSPGAFARFGLRRSLLYELHHLGLIKGVSLRRRGTTRGKRLWSIDSIRDYLVSQMENGGES*
Ga0132255_10386165413300015374Arabidopsis RhizosphereMPLDTKKLGGKQTDKTTSPVTASPTSDAEFCDSPGAFARFGLRRSLLYELHSLGLIKGVSLRRRGMARGKRLWSIDSIRAYLVSQMENGGEAQ*
Ga0182033_1089753313300016319SoilVKSTSYKIGRRETEQTTAAVTASPAYDAEFCDCAGAFARFGLKRSLLYELHGLGLIKGVSLRRRGTARGKRLWSIDSIRSYLASQMENEGEA
Ga0184620_1029915823300018051Groundwater SedimentMPLDTKKLGGQLTDKTTAPVIASPAFDAEFCDSSGAFARFGLRRSLLYELHHLGLIKGVSLRRRRAPRGKRLWSIDSI
Ga0066655_1026184823300018431Grasslands SoilMSIPSNNSGGHPQTIAPVTTSLSNDTEFCDSPGAFLRFGLKRSMLYELNARGLIKGCSLRKRGATKGKRLWSCDSIRSYLASAMKDAK
Ga0066669_1003650513300018482Grasslands SoilMSLDTKKLGEQITEKTTAPVTASPANDAEFCDSSGAFVRFGLRRSLLYELHAQGLIQGVSLRRRGAARGKRLWSIGSIRSFLAAQMEIAE
Ga0066669_1076244033300018482Grasslands SoilMAKSINIGGQLNDKITAPIAALPGNDAEFCDSPGAFWRFGLRRSMLYELNARGLIKGVSLRKRGATKGKRLWNCDSIRAYLASQIEAGK
Ga0210407_10000074193300020579SoilMVLDTKKLGGQLTEKTTAPVAASQANDAEFCDSPGAFMRFGLRRSLLYDLHALGLIRGVSLRRRGAQRGKRLWDVASIRTYLSSQMEASK
Ga0210406_1005817143300021168SoilMAKSINIGAQLTDKTTAPVAASSGNDAEFCDSSGAFMRFGLRRSLLYDLHALGLIRGVSLRRRGAQRGKRLWDVASIRTYLSSQMEASK
Ga0126371_1364913723300021560Tropical Forest SoilMAVMKNSHDKGGNARLTIAPLAASAANDAEFCDSPGAFYRFGLKRSMLYELSARGLIRGVSLRKRGSSKGKRLWSCDSIRAYLTSQME
Ga0222622_1000517443300022756Groundwater SedimentMPLDTKKLGGQLTDKTTAPVIASPAFDAEFCDSSGAFARFGLRRSLLYELHHLGLIKGVSLRRRGAPRGKRLWSIDSIRSYLASQMENGGQA
Ga0208193_107320423300025463PeatlandYFRPAVPIKSPMPVQEENKMGGNALTVAPVQASPSNDAEFCDSRGAERRFGLKRSLLYELLAEGLIRGVSLRRRGQMKGKRLWNCDSIRTYLNAQMQEQ
Ga0207684_1042083323300025910Corn, Switchgrass And Miscanthus RhizosphereMLANSLRNGGYSETTAPVTASPANDAEFCDSGGAFVRFGLRRSLLYDLHALGLIKGVSLRRRGAARGKRLWSIDSIRLYLASQMESGK
Ga0207693_1095559223300025915Corn, Switchgrass And Miscanthus RhizosphereMKKTSNVGAQSDMTTAPVVASPLNDAEFVDSPGAFLRFGLRRSLLYALHREGLISGVSLRRKGTVRGKRLWSCDSIRTFLRRQIETDNETS
Ga0207663_1050172423300025916Corn, Switchgrass And Miscanthus RhizosphereMSLDSKKIGGEPTDTTTLPVTPSPANDGEFCDSPRAFQRFGLRRSLLYALAAEGHIAGCSLRRRGRQRGKRLWSIDSIRRYLASQMDGRAVK
Ga0207704_1188607613300025938Miscanthus RhizosphereMSLDTKKIGGEPTDTTTLPVTPSPAHDGEFCDSPGAFQRFGLRRSLLYALAADGHIAGCSLRRRGRQRGKRLWSIASIRRYLASQMEADK
Ga0209473_107229833300026330SoilHPQTIAPVTTSLSNDTEFCDSPGAFLRFGLKRSMLYELNARGLIKGCSLRKRGATKGKRLWSCDSIRSYLASAMKDAK
Ga0209283_1084714623300027875Vadose Zone SoilMPAKSINTVGGQTTDEITAPVAASPANDAEFCDSPGAFLRFGLRRSLLYELHAQGLIQGVSLRRRGAARGKRLWSIASIRSYLASQMQGAKLMHTG
Ga0170824_10819190523300031231Forest SoilVKSTSLEIGGQLTDRTTATLRASTSGEAEFCDSDGAFARFGLGRPLLYELLNLGLIKGCSLRRHGALRGKRLWSIESIRGYLESQMDGGGHERAS
Ga0170824_11200060313300031231Forest SoilMSKKTDNAGGDTAKTIAPVQASPANDVEFCDNPGAFVRFGLKRSLLYELYAQGLIKGVSLRRRGAARGKRLWSIDSIRSYLREQMEAAK
Ga0170820_1475055423300031446Forest SoilMSAKSINKIGGQITNTTTAAVTASPANDVEFCDSPGAFVRFGLRRSLLYDLYGQGLIKGVSLRRRGAARGKRLWSIDSIR
Ga0310915_1099061923300031573SoilVKSTSYKIGRRETEQTTAAVTASPAYDAEFCDCAGAFARFGLKRSLLYELHGLGLIKGVSLRRRGTARGKRLWSIDSIRSYLASQMENEGEAT
Ga0310913_1037284123300031945SoilSSSKKIRDQRSETTAPVAASPASDAEFCDSPGAFQRFALRRSHLYQLHKDGLVKGVSLRRCGAARGKRLWSIDSIRSYLASQMDTKRKDL
Ga0310913_1037661113300031945SoilMPAKSINLGGQLTEQTTAPVQASPANDAEFCDCPGAFLRFGLRRSLLYELHKLGLIKGVSLRRRGTTRGKRLWSIDSIRSYLASQMEEDR
Ga0306922_1216311313300032001SoilVKTTSHKIGGRETETTTAPVQASPAFDAEFCDSPGAFVRFGLRRSLLYQLHAEGLIHGCSLRRKGRQRGKRLWSIASIRSYLVQMEDGE
Ga0326727_1012905823300033405Peat SoilMSLDTHNAEDGTAYTTDPVQASLANDAEFCDSYGAKARFGLGRTYLYQLLEQGLISGVSLRKRGARTGKRLWCVDSIRRYLHSQMGVN


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.