NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F072155


Overview

Basic Information
Family ID F072155
Family Type Metagenome
Number of Sequences 121
Average Sequence Length 81 residues
Representative Sequence MICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR
Number of Associated Samples 92
Number of Associated Scaffolds 121
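
The representative sequence can be exported for use with alignment or search tools. A minimal sketch, assuming a FASTA header built from the family ID; the header text, the `to_fasta` helper, and the 60-column line width are illustrative choices, not something NMPFamsDB prescribes:

```python
FAMILY_ID = "F072155"
REPRESENTATIVE = (
    "MICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIR"
    "RARELLARPLEFIDRLVESFR"
)


def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Return one FASTA record with the sequence wrapped at `width` columns."""
    # Slice the sequence into fixed-width chunks; the last chunk may be shorter.
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(chunks) + "\n"


# Hypothetical header: family ID plus a free-text label.
record = to_fasta(f"{FAMILY_ID} representative", REPRESENTATIVE)
print(record, end="")
```

The sequence is 81 residues long, which matches the average sequence length reported for the family.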

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 66.12 %
% of genes near scaffold ends (potentially truncated) 22.31 %
% of genes from short scaffolds (< 2000 bps) 83.47 %
Associated GOLD sequencing projects 84
AlphaFold2 3D model prediction No
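
The percentages in this block can be turned back into gene counts. A minimal sketch, assuming each percentage is taken over the 121 family sequences listed under Basic Information, with one gene per sequence; that denominator is an assumption, and `percent_to_count` is a hypothetical helper:

```python
N_SEQUENCES = 121  # "Number of Sequences" from Basic Information

# Percentages copied from the Quality Assessment block.
QUALITY_PERCENTAGES = {
    "valid RBS motif": 66.12,
    "near scaffold end": 22.31,
    "short scaffold (< 2000 bp)": 83.47,
}


def percent_to_count(percent: float, total: int) -> int:
    """Convert a rounded percentage back to the nearest whole count."""
    return round(percent / 100 * total)


counts = {
    label: percent_to_count(value, N_SEQUENCES)
    for label, value in QUALITY_PERCENTAGES.items()
}

for label, count in counts.items():
    print(f"{label}: {count} of {N_SEQUENCES}")
```

Under that assumption the three figures correspond to 80, 27, and 101 of the 121 genes.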


Most Common Taxonomy
Group Bacteria (56.198 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (39.669 % of family members)
Environment Ontology (ENVO) Unclassified (40.496 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (45.455 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.38%, β-sheet: 0.00%, Coil/Unstructured: 50.62%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF01872 | RibD_C | 4.13
PF14534 | DUF4440 | 4.13
PF13783 | DUF4177 | 1.65
PF02311 | AraC_binding | 1.65
PF03992 | ABM | 1.65
PF13649 | Methyltransf_25 | 1.65
PF12867 | DinB_2 | 1.65
PF02517 | Rce1-like | 0.83
PF07831 | PYNP_C | 0.83
PF13683 | rve_3 | 0.83
PF08241 | Methyltransf_11 | 0.83
PF12697 | Abhydrolase_6 | 0.83
PF02861 | Clp_N | 0.83
PF02899 | Phage_int_SAM_1 | 0.83
PF12710 | HAD | 0.83
PF01527 | HTH_Tnp_1 | 0.83
PF01906 | YbjQ_1 | 0.83
PF11225 | DUF3024 | 0.83
PF04892 | VanZ | 0.83
PF00144 | Beta-lactamase | 0.83
PF06271 | RDD | 0.83
PF00589 | Phage_integrase | 0.83
PF04191 | PEMT | 0.83
PF00903 | Glyoxalase | 0.83
PF02472 | ExbD | 0.83
PF14539 | DUF4442 | 0.83
PF11954 | DUF3471 | 0.83
PF12680 | SnoaL_2 | 0.83
PF05866 | RusA | 0.83
PF00190 | Cupin_1 | 0.83
PF04255 | DUF433 | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 4.13
COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 4.13
COG0213 | Thymidine phosphorylase | Nucleotide transport and metabolism [F] | 0.83
COG0393 | Uncharacterized pentameric protein YbjQ, UPF0145 family | Function unknown [S] | 0.83
COG0542 | ATP-dependent Clp protease, ATP-binding subunit ClpA | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG0848 | Biopolymer transport protein ExbD | Intracellular trafficking, secretion, and vesicular transport [U] | 0.83
COG1266 | Membrane protease YdiL, CAAX protease family | Posttranslational modification, protein turnover, chaperones [O] | 0.83
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 0.83
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG1714 | Uncharacterized membrane protein YckC, RDD family | Function unknown [S] | 0.83
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 0.83
COG2442 | Predicted antitoxin component of a toxin-antitoxin system, DUF433 family | Defense mechanisms [V] | 0.83
COG4449 | Predicted protease, Abi (CAAX) family | General function prediction only [R] | 0.83
COG4570 | Holliday junction resolvase RusA (prophage-encoded endonuclease) | Replication, recombination and repair [L] | 0.83
COG4973 | Site-specific recombinase XerC | Replication, recombination and repair [L] | 0.83
COG4974 | Site-specific recombinase XerD | Replication, recombination and repair [L] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 57.02 %
Unclassified | root | N/A | 42.98 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002558|JGI25385J37094_10006801 | All Organisms → cellular organisms → Bacteria | 4037
3300002561|JGI25384J37096_10082398 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1160
3300002561|JGI25384J37096_10123829 | Not Available | 862
3300002562|JGI25382J37095_10074843 | All Organisms → cellular organisms → Bacteria | 1254
3300002908|JGI25382J43887_10390855 | Not Available | 587
3300002909|JGI25388J43891_1019584 | All Organisms → cellular organisms → Bacteria | 1195
3300002911|JGI25390J43892_10160652 | Not Available | 525
3300004463|Ga0063356_101293750 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1066
3300004780|Ga0062378_10140876 | Not Available | 631
3300004808|Ga0062381_10440375 | Not Available | 507
3300005166|Ga0066674_10122574 | All Organisms → cellular organisms → Bacteria | 1217
3300005166|Ga0066674_10159342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 1067
3300005172|Ga0066683_10125584 | All Organisms → cellular organisms → Bacteria | 1566
3300005180|Ga0066685_10235479 | All Organisms → cellular organisms → Bacteria | 1260
3300005446|Ga0066686_10044577 | All Organisms → cellular organisms → Archaea → DPANN group → Candidatus Aenigmarchaeota → unclassified Aenigmarchaeota → Candidatus Aenigmarchaeota archaeon CG1_02_38_14 | 2678
3300005546|Ga0070696_101486304 | Not Available | 579
3300005553|Ga0066695_10084314 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11 | 1933
3300005553|Ga0066695_10505896 | Not Available | 742
3300005553|Ga0066695_10787454 | Not Available | 550
3300005556|Ga0066707_10220673 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11 | 1229
3300005556|Ga0066707_10240154 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Ktedonobacteria → Ktedonobacterales → Ktedonobacteraceae → Ktedonobacter | 1180
3300005556|Ga0066707_10328781 | Not Available | 1000
3300005557|Ga0066704_10064116 | All Organisms → cellular organisms → Bacteria | 2351
3300005561|Ga0066699_10061662 | All Organisms → cellular organisms → Bacteria | 2350
3300005568|Ga0066703_10078002 | Not Available | 1911
3300005569|Ga0066705_10488154 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium 13_1_20CM_69_28 | 772
3300006034|Ga0066656_10420643 | Not Available | 867
3300006034|Ga0066656_11003899 | Not Available | 535
3300006049|Ga0075417_10674532 | Not Available | 530
3300006796|Ga0066665_10748333 | All Organisms → cellular organisms → Bacteria | 773
3300006796|Ga0066665_11684152 | Not Available | 502
3300006800|Ga0066660_10570729 | All Organisms → cellular organisms → Bacteria | 948
3300006852|Ga0075433_10107945 | All Organisms → cellular organisms → Bacteria | 2468
3300006871|Ga0075434_102489971 | Not Available | 519
3300007255|Ga0099791_10399439 | Not Available | 662
3300007258|Ga0099793_10015742 | All Organisms → cellular organisms → Bacteria | 3023
3300007258|Ga0099793_10101802 | Not Available | 1330
3300007258|Ga0099793_10169883 | All Organisms → cellular organisms → Bacteria | 1038
3300009012|Ga0066710_101038676 | All Organisms → cellular organisms → Bacteria | 1266
3300009012|Ga0066710_101249379 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11 | 1151
3300009012|Ga0066710_101390801 | Not Available | 1088
3300009012|Ga0066710_104260678 | Not Available | 534
3300009137|Ga0066709_103300732 | Not Available | 587
3300009147|Ga0114129_12901069 | Not Available | 567
3300009162|Ga0075423_10503814 | Not Available | 1273
3300010301|Ga0134070_10013068 | All Organisms → cellular organisms → Bacteria | 2637
3300010329|Ga0134111_10070173 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1305
3300010329|Ga0134111_10109891 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae | 1064
3300010329|Ga0134111_10574001 | Not Available | 503
3300010391|Ga0136847_11799638 | All Organisms → cellular organisms → Bacteria | 1950
3300010400|Ga0134122_10245759 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1509
3300011429|Ga0137455_1078535 | All Organisms → cellular organisms → Bacteria | 953
3300011438|Ga0137451_1210161 | Not Available | 616
3300012096|Ga0137389_10024298 | All Organisms → cellular organisms → Bacteria | 4309
3300012189|Ga0137388_10008969 | All Organisms → cellular organisms → Bacteria | 6952
3300012198|Ga0137364_10121925 | All Organisms → cellular organisms → Bacteria | 1859
3300012198|Ga0137364_10457159 | All Organisms → cellular organisms → Bacteria | 958
3300012199|Ga0137383_10443927 | Not Available | 950
3300012201|Ga0137365_10037530 | All Organisms → cellular organisms → Bacteria | 3702
3300012201|Ga0137365_10311520 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1165
3300012201|Ga0137365_11011663 | Not Available | 602
3300012201|Ga0137365_11268985 | Not Available | 524
3300012203|Ga0137399_10312909 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1299
3300012203|Ga0137399_10384745 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → unclassified Gemmatimonadaceae → Gemmatimonadaceae bacterium | 1169
3300012203|Ga0137399_10405093 | Not Available | 1138
3300012204|Ga0137374_10386430 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 1117
3300012204|Ga0137374_10399078 | Not Available | 1094
3300012204|Ga0137374_10688730 | Not Available | 769
3300012206|Ga0137380_10024359 | All Organisms → cellular organisms → Bacteria | 5571
3300012207|Ga0137381_11561048 | Not Available | 551
3300012208|Ga0137376_10199930 | Not Available | 1729
3300012208|Ga0137376_10662187 | Not Available | 901
3300012208|Ga0137376_11159414 | Not Available | 661
3300012208|Ga0137376_11516359 | Not Available | 561
3300012211|Ga0137377_10068623 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 3316
3300012285|Ga0137370_10226902 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1102
3300012349|Ga0137387_10103827 | All Organisms → cellular organisms → Bacteria | 1984
3300012349|Ga0137387_10988906 | Not Available | 604
3300012350|Ga0137372_10459943 | Not Available | 953
3300012353|Ga0137367_10263883 | Not Available | 1237
3300012354|Ga0137366_10099346 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Epsilonproteobacteria → Campylobacterales → Sulfurovaceae → Sulfurovum → unclassified Sulfurovum → Sulfurovum sp. XGS-02 | 2206
3300012356|Ga0137371_10142685 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1877
3300012359|Ga0137385_11662580 | Not Available | 502
3300012360|Ga0137375_10708000 | All Organisms → cellular organisms → Bacteria | 821
3300012362|Ga0137361_10127748 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → unclassified Gemmatimonadaceae → Gemmatimonadaceae bacterium | 2248
3300012362|Ga0137361_10659186 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 957
3300012532|Ga0137373_10978629 | Not Available | 614
3300012685|Ga0137397_10193773 | All Organisms → cellular organisms → Bacteria | 1511
3300012922|Ga0137394_11441827 | Not Available | 548
3300012925|Ga0137419_10932260 | Not Available | 716
3300012927|Ga0137416_11977513 | All Organisms → cellular organisms → Bacteria | 534
3300012930|Ga0137407_10251737 | All Organisms → cellular organisms → Bacteria | 1601
3300012976|Ga0134076_10391346 | Not Available | 618
3300015054|Ga0137420_1212447 | All Organisms → cellular organisms → Bacteria | 1004
3300015264|Ga0137403_10915252 | Not Available | 726
3300017654|Ga0134069_1267343 | Not Available | 598
3300017657|Ga0134074_1188170 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4 | 730
3300018028|Ga0184608_10161896 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 969
3300018056|Ga0184623_10120374 | Not Available | 1215
3300018059|Ga0184615_10679562 | Not Available | 526
3300018071|Ga0184618_10000596 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 8443
3300018071|Ga0184618_10005366 | All Organisms → cellular organisms → Bacteria | 3569
3300018084|Ga0184629_10196380 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1041
3300018433|Ga0066667_10370074 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Ktedonobacteria → Ktedonobacterales → Ktedonobacteraceae → Ktedonobacter | 1147
3300018468|Ga0066662_10252477 | All Organisms → cellular organisms → Bacteria | 1443
3300018482|Ga0066669_10458043 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11 | 1096
3300019883|Ga0193725_1007711 | All Organisms → cellular organisms → Bacteria | 3078
3300020003|Ga0193739_1019908 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales | 1748
3300020170|Ga0179594_10282649 | Not Available | 628
3300026277|Ga0209350_1048497 | All Organisms → cellular organisms → Bacteria | 1234
3300026301|Ga0209238_1001236 | All Organisms → cellular organisms → Bacteria | 9826
3300026306|Ga0209468_1002787 | All Organisms → cellular organisms → Bacteria | 7168
3300026320|Ga0209131_1183746 | All Organisms → cellular organisms → Bacteria | 1003
3300026343|Ga0209159_1134582 | All Organisms → cellular organisms → Bacteria | 1010
3300026532|Ga0209160_1166001 | Not Available | 970
3300026538|Ga0209056_10316850 | Not Available | 1059
3300027655|Ga0209388_1154077 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → Gemmatimonadetes → Gemmatimonadales → Gemmatimonadaceae → unclassified Gemmatimonadaceae → Gemmatimonadaceae bacterium | 649
3300028536|Ga0137415_10048138 | All Organisms → cellular organisms → Bacteria | 4144
3300032180|Ga0307471_101286748 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 894
3300034165|Ga0364942_0250933 | Not Available | 578
3300034177|Ga0364932_0367221 | Not Available | 543
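
Each scaffold identifier above joins two parts with a `|` character. The numeric prefix matches the Taxon OID values listed under Associated Samples, so it appears to identify the IMG/M sample, and the remainder is the scaffold name. A minimal sketch of splitting the two, with `split_scaffold_id` as a hypothetical helper rather than part of any NMPFamsDB or IMG/M tooling:

```python
def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts.

    Assumes the prefix before '|' is a numeric Taxon OID, as the
    Associated Samples listing suggests.
    """
    taxon_oid, separator, scaffold_name = scaffold_id.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


taxon_oid, scaffold_name = split_scaffold_id("3300002558|JGI25385J37094_10006801")
print(taxon_oid, scaffold_name)
```

Grouping scaffolds by the returned Taxon OID gives a direct way to join them to the sample names and habitat types.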




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 39.67%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 19.83%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 14.88%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 5.79%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 4.96%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 4.13%
Wetland Sediment | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Wetland Sediment | 1.65%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.65%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.65%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.83%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.83%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.83%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.83%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002909 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004780 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T0Bare1Fresh | Environmental
3300004808 | Wetland sediment microbial communities from St. Louis River estuary, USA, under dissolved organic matter induced mercury methylation - T4Bare1Fresh | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300010301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09082015 | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300011429 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT600_2 | Environmental
3300011438 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012359 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300017654 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018028 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coex | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018071 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental
3300020003 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2 | Environmental
3300020170 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly) | Environmental
3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026306 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes) | Environmental
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026538 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300034165 | Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17 | Environmental
3300034177 | Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17 | Environmental

Geographical Distribution

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1000680123300002558Grasslands SoilMATTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDXAERKKAETIRRGRELLARPLEFIDRFVESLR*
JGI25384J37096_1008239813300002561Grasslands SoilTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR*
JGI25384J37096_1012382923300002561Grasslands SoilMICTRCGRRKAYNEAEAAALVAKGSPALPVGVCFLCAWQDPALQPALREYAERKKAETIRRTRELLARPLEFIDRLVARF
JGI25382J37095_1007484323300002562Grasslands SoilMATTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR*
JGI25382J43887_1039085513300002908Grasslands SoilCPLAQSPAMICTRCGRRKAYNEVEAAALAAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRGRELLARPLEFIDRLVERFR*
JGI25388J43891_101958423300002909Grasslands SoilMATTIPCTRCGKLMAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDXAERKKAETIRRGRELLARPLEFIDRFVESLR*
JGI25390J43892_1016065213300002911Grasslands SoilMATTIPCTRCGKLTAHTEADYAAIAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLARPLEFI
Ga0063356_10129375033300004463Arabidopsis Thaliana RhizosphereMICTRCGQRKANSEAEAAALVAKGGPVLPAGVCFPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0062378_1014087613300004780Wetland SedimentWGLVMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0062381_1044037513300004808Wetland SedimentMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0066674_1012257423300005166SoilMATTIPCTRCGKRTAYTEADYATLAAKGAPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLR*
Ga0066674_1015934233300005166SoilMICRRCGQRKANTQAEAAALAAKGGPVLPAGVCFPCAWQDTSLQPALREYAERKKAETIRGAREFLARPLEFIDRLMDSLR*
Ga0066683_1012558413300005172SoilMIDCTRCGQPEAHTEAECAALAAKGGPALPAGVCFPCAWKDPLLRPALREYAERKKAETIRRARELLARPLEFIDRLVDRLR*
Ga0066685_1023547923300005180SoilMATTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKVETIRRGRELLARPLEFIDRFVESLR*
Ga0066686_1004457733300005446SoilMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLRKYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0070696_10148630413300005546Corn, Switchgrass And Miscanthus RhizosphereMICTRCGRRKAYDEAEAAALVAKGGPALPVGVCFVCAWQDPALQPALREYTVRKKAETIRRARELLARPLEFIDRLVEKFR*
Ga0066695_1008431433300005553SoilMATTIPCTRCGKRTAYTEADYVALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKTETICRARELLVRPLEFIDRFVESLW*
Ga0066695_1050589613300005553SoilMIDCTRCGQPEAHTEAECAALAAKGGPALPAGVCFPCAWKDPLLRPALREYAERKKAETIRRARELLGRPLEFIDRLVDRLR*
Ga0066695_1078745413300005553SoilMICTRCGQRKAYTETEAAALSAKGGPLLPVGVCFLCAWKDPALQPALRAYAERKKAETILRARELLARPLEFIDRLVESFR*
Ga0066707_1022067323300005556SoilMATTIPCTRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLW*
Ga0066707_1024015413300005556SoilMATTIPCTRCGKLTAHTEADYAAIAAKGGPVLPGGVCFPCAWRDPALQPALREYAEREKAETIRRARELLARPLEFIDRFVESLR*
Ga0066707_1032878133300005556SoilMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLRKYAERKKAETIRRARVLLARPLEFIDRLVESFR*
Ga0066704_1006411623300005557SoilMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLRRYAERKKAETIRLARELLARPLEFIDRLVESFR*
Ga0066699_1006166243300005561SoilVATTIPCTRCGELTAYTEADYAVLASQGGPVLPGGVCFPCAWRDPTLQPALREYAERKKAETIRRARELLAKPLEFIDRFVESLRY*
Ga0066703_1007800253300005568SoilAALGAKGGPVLPAGVCFPCAWKDPALQPGLRRYAERKKAETIRLARELLARPLEFIDRLVESFR*
Ga0066705_1048815423300005569SoilMATTIPCIRCGELTAYTEADYAVLAAKGGPVLPGGVCFPCAWRDPTLQPALREYAERKKAETIRRARELLAKPLEFIDRFVESLRC*
Ga0066656_1042064313300006034SoilPKRLGMATTIPCTRCGKRTAYTEADYVALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKTETICRARELLVRPLEFIDRFVESLW*
Ga0066656_1100389913300006034SoilMICTRCGQRKAYTETEAAALAGKGGPLLPVGVCFLCAWKDPALQPALRAYVERKKAETIRRARELLARPLEFIDRFVESFR*
Ga0075417_1067453223300006049Populus RhizosphereRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESLR*
Ga0066665_1074833313300006796SoilMATTIPCTRCGKLTAHTEADYAAIAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLARPLEFIDRFVESLR*
Ga0066665_1168415213300006796SoilMTCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWQDGALQPALREYAQRKKAETIRRARELLVRPLEFIDRLVERFR*
Ga0066660_1057072923300006800SoilYAVLASQGGPVLPGGVCFPCAWRDPTLQLALREYAERKKAETIRRARELLAKPLEFIDRFVESLRY*
Ga0075433_1010794523300006852Populus RhizosphereMICTRCGRRKAYDEAEAAALVAKGGPALPVGVCFVCAWQDPTLQPALREYTVRKKAETIRRARELLARPLEFIDRLVERFR*
Ga0075434_10248997113300006871Populus RhizosphereMICTRCGRRKAYDEAEAAALVAKGGPALPVGVCFVCAWQDPALQPALREYTVRKKAETIRRARELLARPLEFIDRLVERFR*
Ga0099791_1039943923300007255Vadose Zone SoilETEAAALSAKGGPLLPVGVCFLCAWQDPALQPALRAFAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0099793_1001574213300007258Vadose Zone SoilTQLKRGPLGCPVAQSPAMICTRCGRRKAYNETEAAALVAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRARELVARPLEFIDRLVEKFR*
Ga0099793_1010180233300007258Vadose Zone SoilMICTRCGHREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0099793_1016988313300007258Vadose Zone SoilMSCTRCRQRKAYTEPEAAALAAKGGPVLPAGICFPCAWQDPTLQPALRAYADRKKAETIRRAREFLARPLECIDRLVESLR*
Ga0066710_10103867613300009012Grasslands SoilMICTRCGKRKAYTEAEAAALASKGGPVLPVGVCFLCAWQDPALQPELREYTQRKKAQTIRRARELLARPLELIDRLVGRFR
Ga0066710_10124937913300009012Grasslands SoilMATTIPCTRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKTETICRARELLVRPLEFIDRFVESLW
Ga0066710_10139080123300009012Grasslands SoilCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWQDGALQPALREYAQRKKAETIRRARELLARPLEFIDRLVERFR
Ga0066710_10426067813300009012Grasslands SoilMIDCTRCGQPKAHTEAECAALAAKGGPALPAGVCFACAWKDPLLQPALREYAERKKAETIRRARELLARPLEFIDRLVDRLG
Ga0066709_10330073213300009137Grasslands SoilMTCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWQDGALQPALREYAQRKKAETIRRARELLARPLEFIDRLVERFR*
Ga0114129_1290106913300009147Populus RhizosphereMICTRCGQREAKSEAEAAALAAKGGPVLPAGVCLPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESLR*
Ga0075423_1050381423300009162Populus RhizosphereMICTRCGRRKAYDEAEAAALVAKGGPALPVGVCFVCAWQDPTLQPALREYTVRKNTETIRRARELLARPLEFIDRLVERFR*
Ga0134070_1001306843300010301Grasslands SoilMIDCTRCGQPKAHTEAECAALAAKGGPALPAGVCFACAWKDPLLRPALREYAERKKAETIRRARELLARPLEFIDRLVDRLR*
Ga0134111_1007017333300010329Grasslands SoilMATTIPCTRCGKRTAYTEADYVALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLW*
Ga0134111_1010989113300010329Grasslands SoilLVAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRARELVARPLEFIDRLVERFR*
Ga0134111_1057400113300010329Grasslands SoilAAALAAKGGPVLPVGVCFPCAWRDGALQPALREYAQRKKAETIRRARELLVRPLEFIDRLVERFR*
Ga0136847_1179963823300010391Freshwater SedimentMICTRCGSSKAHTEAECTALAAKGGPTLPPGICFRCAWEDPALQKALGEWAEHRKAETFRRARELLVRPLEFIDRLVDSLR*
Ga0134122_1024575943300010400Terrestrial SoilRKAYDEAEAAALVAKGGPALPVGVCFVCAWQDPALQPALREYTVRKKAETIRRARELLARPLEFIDRLVEKFR*
Ga0137455_107853523300011429SoilMINCTRCGKPKAHTETECAALAAKGGPVLPAGVCFPCAWKDPLLQPALREYAERKKAETIRRARELLARPLEFIDRLVDSLR*
Ga0137451_121016113300011438SoilMINCMRCGQPKAHTEAECAALAAKGGPALPAGVCFPCAWKDPLLQPALKEYAERKKAETIRRARELLARPLEFIDRLVDSLR*
Ga0137389_1002429873300012096Vadose Zone SoilMATTIPCTRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALRDYAERKKAETIRRARELLARPLEFIDRFVESLQ*
Ga0137388_1000896953300012189Vadose Zone SoilMETTIPCTRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALRDYAERKKAETIRRARELLARPLEFIDRFVESLQ*
Ga0137364_1012192513300012198Vadose Zone SoilMATTIPCTRCGKRTAYTEADYATLAAKGAPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLW*
Ga0137364_1045715923300012198Vadose Zone SoilSMIDCTRCGQPKAHTEAECAALAAKGGPALPAGVCFACAWKDPLLQPALREYAERKKAETIRRARELLARPLEFIDRLVDRLR*
Ga0137383_1044392723300012199Vadose Zone SoilMICTRCGKRKAYTEAEAAALASKGGPVLPAGVCFLCAWQDPALQPELREYTQRKKAQTIRRARELLARPLELIDRLVERFR*
Ga0137365_1003753043300012201Vadose Zone SoilMIDCTRCGQPKAHTEAECAALAAKGGPALPAGVCFACAWKDPLLQPALREYAERKKAETIRRARELLARPLEFIDRLVDRLR*
Ga0137365_1031152023300012201Vadose Zone SoilMIDCTRCGQPKAHTEAECAALATKGGPALPAGVCFPCAWKDPLLRPALREYAERKKAETIRRARELLARPLEFIDRLVDRLR*
Ga0137365_1101166323300012201Vadose Zone SoilAEAAALAAKGGPVLPVGVCFPCAWQDGALQPALREYAQRKKAETIRRARELLVRPLAFIDRLVERFR*
Ga0137365_1126898513300012201Vadose Zone SoilMICTRCGHREAKTEAEAAALAAKGGPVLAAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0137399_1031290913300012203Vadose Zone SoilMINCMRCSQPKAHTEAECAALAAKGGPALPTGVCFPCAWKDPLLQPALREYAERKKAETIGRARELLARP
Ga0137399_1038474513300012203Vadose Zone SoilMICTRCGRRKAYNETEAAALVAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRARELVARPLEFIDRLVEKFR*
Ga0137399_1040509333300012203Vadose Zone SoilMICTRCGKRKAYTEAKAAALASKGGPVLPVGVCFLCAWQDPALQPELREYTQRKKAQTIQRARELLARPLELIDRLVERFR*
Ga0137374_1038643023300012204Vadose Zone SoilMINCMRCGQPKAHTEAECAALAAKAGPVLPAGVCFPCAWKDPLLQPALREYAERKKAETIRRARELLARPLESIDRLVDSLR*
Ga0137374_1039907813300012204Vadose Zone SoilMICTRCGHREAKTEAEAGALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETMRRARELLARPLEFIDRLVESFR*
Ga0137374_1068873013300012204Vadose Zone SoilMATTIPCTRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPVLRDYAERKKAETIRRAREFLARPLEIIDRFVESFR*
Ga0137380_1002435933300012206Vadose Zone SoilVAKGGPALPVGVCFVCAWQEPALQPALREYTERKKAETIRRTRELLARPLEFIDRLVERFR*
Ga0137381_1156104813300012207Vadose Zone SoilMICTRCGKRKAYTEAEASALASKGGPVLPAGVCFLCAWQDPALQPELREYTQRKKAQTIRRARELLARPLELIDRLVERFR*
Ga0137376_1019993033300012208Vadose Zone SoilMTCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCVWRDGALQPALREYAQRKKAETIRRARELLVRPLEFIDRLVERFR*
Ga0137376_1066218713300012208Vadose Zone SoilGKRKAYTEAEAAALASKGGPVLPAGVCFLCAWQDPALQPEFREYTQRKKAQTIRRARELLARPLELIDRLVERFR*
Ga0137376_1115941423300012208Vadose Zone SoilMICTRCGQRKANAEAEAAALAAKGGPVLPAGVCFPCAWQDPTLQPGLREYAERKKAETIRRARDLLARPLEFIDRLVESFR*
Ga0137376_1151635913300012208Vadose Zone SoilLVAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRTRELVARPLESIDRLVERFR*
Ga0137377_1006862363300012211Vadose Zone SoilMTCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWRDGALQPALREYAQRKKAETIRRARELLVRPLEFIDRLVERFR*
Ga0137370_1022690223300012285Vadose Zone SoilMICTRCSRRKAYNEAEAAALVAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRARELVARPLEFIDRLVESFR*
Ga0137387_1010382713300012349Vadose Zone SoilMICTRCGRRKAYNEAEAAALVAKGGPALPVGVCFVCAWQEPALQPALREYTERKKAETIRRTRELLARPLEFIDRLVERFR*
Ga0137387_1098890613300012349Vadose Zone SoilMATSIPCTRCGKLTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLR*
Ga0137372_1045994313300012350Vadose Zone SoilMICTRCGHREAKTEAEAAALAAKGGPVLAAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFQ*
Ga0137367_1026388323300012353Vadose Zone SoilMICTQCGHREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0137366_1009934653300012354Vadose Zone SoilMINCMRCGQPKAHTEAECAALAAKGGPVLPAGVCFPCAWKDPLLQPALREYAERKKAETIRRARELLARPLEFIDRLVDSLR*
Ga0137371_1014268513300012356Vadose Zone SoilRCGKRTAYTEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLW*
Ga0137385_1166258013300012359Vadose Zone SoilAHVLVMICTRCGKRKAYTEAEAAALASKGGPVLPVGVCFICAWQDPALQPELREYTQRKKAQTIRRARELLARPLELIDRLVERFR*
Ga0137375_1070800023300012360Vadose Zone SoilMINCMRCGQPKAHTEAECAALAAKGGPVLPAGVCFPCAWKDPLLQPALREYAERKKAETIRRARELLARPLESIDRLVDSLR*
Ga0137361_1012774843300012362Vadose Zone SoilMICTRCGQRKAYTETEAAALSAKGGPLLPVGVCFLCAWKDPALQPALRAYAERKKAETIRRARELLARPLEFIDRLVERFR*
Ga0137361_1065918613300012362Vadose Zone SoilLAAKGGPALPVGVCFLCAWQDPALQPALREYTERKKAETIRRGRELLARPLEFIDRLVERFR*
Ga0137373_1097862913300012532Vadose Zone SoilMICTQCGHREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFQ*
Ga0137397_1019377323300012685Vadose Zone SoilMINCMRCSQPKAHTEAECATLAAKGGPALPAGVCFPCAWKDPLLQPALREYAERKKAETIGRARELLARPLEFIDRLVESLR*
Ga0137394_1144182713300012922Vadose Zone SoilMICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVEGFR*
Ga0137419_1093226013300012925Vadose Zone SoilMICTRCGQREANTEAEAAVLAAKGGPALPAGVCFPCAWKDPALQSGLREYAERKKAETIRRARELLARPLELIDRVVESFR*
Ga0137416_1197751313300012927Vadose Zone SoilRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR*
Ga0137407_1025173723300012930Vadose Zone SoilMICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0134076_1039134613300012976Grasslands SoilMICTRCGHREATTEAEAVALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*
Ga0137420_121244723300015054Vadose Zone SoilVMICTRCGKRKAYTEAEAAALASKGGPVLPVGVCFVCAWQDPALQPELREYTQRKKAQTIRRARELLARPLELIDRLVEKFR*
Ga0137403_1091525223300015264Vadose Zone SoilMICTRCGHREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLELIDRLVESFR*
Ga0134069_126734323300017654Grasslands SoilAEAEAASLAAKGGPVLPAGVCFPCAWQDSTLQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR
Ga0134074_118817013300017657Grasslands SoilMICTRCGQRKAYTETEAAALSAKGGPLLPVGVCFLCAWKDPALQPALRAYAERKKAETILRARELLARPLEFIDRLVESFR
Ga0184608_1016189633300018028Groundwater SedimentRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWQDPALQPALREYARRKKAETIRRARELLVRPLEFIDRLVERFR
Ga0184623_1012037423300018056Groundwater SedimentMTCTRCGQRKAYTEAEAAALAAKGGPVLPVGVCFPCAWQDPALQPALREYARRKKAETIRRARELLARPLEFIDRLVESFR
Ga0184615_1067956213300018059Groundwater SedimentMICTRCGSSKAHTEAECTALAVKGGPTLPAGICFRCAWEDPALHKALSEWAERKKAETYRRVREFLVRPLEFIDRLVDGLR
Ga0184618_1000059643300018071Groundwater SedimentMINCTRCGEPKAHTEAECAALAAKGGPALPTGVCFPCAWKDPLLQPALREYAERKKAETIRRAREILARPLEFIDRLVDSLR
Ga0184618_1000536663300018071Groundwater SedimentMICTRCGKPKAHTEAECAALVGKGGPSLPAGVCFPCAWQDPLLQPGLRDYAERKKAETIRRARELLARPLEFIDRLVDSLR
Ga0184629_1019638013300018084Groundwater SedimentMINCMRCGQPKAHTEAECAALAAKGGPALPAGVCFPCAWKDPLLQPALKEYAERKKAETIRRARELLARPLEFIDRLVDSLR
Ga0066667_1037007423300018433Grasslands SoilMATTIPCTRCGKLTAHTEADYAAIAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLARPLEFIDRFVESLR
Ga0066662_1025247723300018468Grasslands SoilMATTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR
Ga0066669_1045804313300018482Grasslands SoilMATTIPCTRCGKRTAYTEADYVALAAKGGPVLPGGVCFPCAWRDPALQPALREYAERKKTETICRARELLVRPLEFIDRFVESLW
Ga0193725_100771153300019883SoilMICTRCGKPKAHTEAECAALVAKGGPLLPAGVCFPCAWEDPLLRPALRDYAERKKAETIRRARELLARPLEFIDRLVDSLR
Ga0193739_101990813300020003SoilMICTRCGSAEAHTEAEYTALAAKGGPTLPAGICFRCAWEDPALHKALSEWAERKKAETYRRVREFLVRPLEFIDRLVDGLR
Ga0179594_1028264913300020170Vadose Zone SoilMICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAETIRRARELLARPLEFIGRLVESFR
Ga0209350_104849723300026277Grasslands SoilMATTIPCTRCGKLMAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR
Ga0209238_1001236143300026301Grasslands SoilMATTIPCTRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKVETIRRGRELLARPLEFIDRFVESLR
Ga0209468_100278743300026306SoilMATTIPCTRCGKRTAYTEADYATLAAKGAPVLPGGVCFPCAWRDPALQPALREYAERKKAETIRRARELLVRPLEFIDRFVESLR
Ga0209131_118374623300026320Grasslands SoilMICTRCGQREANTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLREYAERKKAENIRRVRELLARPLEFIDRLVESFR
Ga0209159_113458213300026343SoilAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR
Ga0209160_116600133300026532SoilGAKGGPVLPAGVCFPCAWKDPALQPGLRRYAERKKAETIRLARELLARPLEFIDRLVESF
Ga0209056_1031685023300026538SoilMICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWKDPALQPGLRKYAERKKAETIRRARVLLARPLEFIDRLVESFR
Ga0209388_115407723300027655Vadose Zone SoilMICTRCGQRKAYTETEAAALSAKGGPLLPVGVCFLCAWKDPALQPALRAYAERKKAETIRRARELLARPLEFIDRLVERFR
Ga0137415_1004813813300028536Vadose Zone SoilRCGKLTAYTDADYAALAAKGGPVLPSGVCFPCAWRDPALQPALRDYAERKKAETIRRGRELLARPLEFIDRFVESLR
Ga0307471_10128674823300032180Hardwood Forest SoilMTKAIPCTRCGKQTAYSEADYAALAAKGGPVLPGGVCFPCAWRDPALQPALRDYAERKKAETVRRARELLARPLEFIDRLVESFR
Ga0364942_0250933_208_4533300034165SedimentMICPRCGKPKAHTEAECAKLVAKGGPSLPAGVCFPCAWQDPLLQPALRDYAERKKAETIRRARELLARPLEFIDRLVDSLR
Ga0364932_0367221_164_4273300034177SedimentLSKAFHDDLRAVRRAKGTKESECAALVGKGGPLLPDGICFRCAWEDPLLQPALREYAERKKAETIRRAREFLARPLEFIDRLVDSMR
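
Each row in the Family Sequences listing runs its four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) together without separators. The sketch below shows one way to split a row back into fields. It rests on assumptions inferred from the listing rather than on any documented NMPFamsDB format: the Sample Taxon ID is taken to be a 10-digit number beginning with `3300`, the habitat is taken to be title-case text ending in a lowercase letter, and the sequence is taken to be the trailing run of uppercase letters with an optional `*`. The names `ROW_PATTERN` and `parse_family_row` are hypothetical.

```python
import re

# Assumed row layout (inferred from the listing, not from site documentation):
#   <protein ID><taxon ID: "3300" + 6 digits><Habitat In Title Case><SEQUENCE>[*]
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"               # shortest prefix before the taxon ID
    r"(?P<taxon_id>3300\d{6})"             # assumed 10-digit sample taxon ID
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"  # habitat ends at the last lowercase letter
    r"(?P<sequence>[A-Z]+)"                # residues are all uppercase
    r"(?P<asterisk>\*?)$"                  # some rows end with '*'
)


def parse_family_row(row):
    """Split one run-together row into its fields, or return None if it does not fit."""
    match = ROW_PATTERN.match(row.strip())
    if match is None:
        return None
    return {
        "protein_id": match.group("protein_id"),
        "taxon_id": match.group("taxon_id"),
        "habitat": match.group("habitat"),
        "sequence": match.group("sequence"),
        "has_terminal_asterisk": match.group("asterisk") == "*",
    }


# Example row copied from the listing above.
EXAMPLE_ROW = (
    "Ga0062381_1044037513300004808Wetland Sediment"
    "MICTRCGQREAKTEAEAAALAAKGGPVLPAGVCFPCAWNDPALQPGLREYAERKKAETIRRARELLARPLEFIDRLVESFR*"
)

if __name__ == "__main__":
    record = parse_family_row(EXAMPLE_ROW)
    for field, value in record.items():
        print(f"{field}: {value}")
```

Rows that do not follow the assumed layout return `None`, so they can be reviewed by hand instead of being split incorrectly.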




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice