NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F045022

Overview

Basic Information
Family ID F045022
Family Type Metagenome
Number of Sequences 153
Average Sequence Length 109 residues
Representative Sequence MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGNTLFQL
Number of Associated Samples 106
Number of Associated Scaffolds 153

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 71.24 %
% of genes near scaffold ends (potentially truncated) 20.26 %
% of genes from short scaffolds (< 2000 bps) 75.82 %
Associated GOLD sequencing projects 95
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.84

Most Common Taxonomy
Group Unclassified (46.405 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Loam → Unclassified → Soil
(13.725 % of family members)
Environment Ontology (ENVO) Unclassified
(36.601 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(40.523 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 30.88%, β-sheet: 23.53%, Coil/Unstructured: 45.59%

Predicted 3D Structure

pTM-score: 0.84

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.58.4.11: PA3566-like | d1y0ha_ | 1y0h | 0.79043
d.58.4.0: automated matches | d3qmqa1 | 3qmq | 0.78583
d.58.4.0: automated matches | d4dpoa1 | 4dpo | 0.77053
d.58.4.0: automated matches | d2bbea_ | 2bbe | 0.76783
d.58.4.11: PA3566-like | d1tuva_ | 1tuv | 0.74312


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 153 Family Scaffolds
PF01613 | Flavin_Reduct | 7.19
PF06877 | RraB | 3.92
PF00903 | Glyoxalase | 3.27
PF01042 | Ribonuc_L-PSP | 2.61
PF10604 | Polyketide_cyc2 | 2.61
PF13442 | Cytochrome_CBB3 | 1.31
PF13495 | Phage_int_SAM_4 | 1.31
PF14534 | DUF4440 | 1.31
PF01904 | DUF72 | 1.31
PF09037 | Sulphotransf | 1.31
PF01434 | Peptidase_M41 | 1.31
PF02566 | OsmC | 0.65
PF00670 | AdoHcyase_NAD | 0.65
PF06068 | TIP49 | 0.65
PF01850 | PIN | 0.65
PF13655 | RVT_N | 0.65
PF00296 | Bac_luciferase | 0.65
PF01208 | URO-D | 0.65
PF07676 | PD40 | 0.65
PF09966 | DUF2200 | 0.65
PF01872 | RibD_C | 0.65
PF07336 | ABATE | 0.65
PF07883 | Cupin_2 | 0.65
PF11706 | zf-CGNR | 0.65
PF08240 | ADH_N | 0.65
PF04014 | MazE_antitoxin | 0.65
PF12867 | DinB_2 | 0.65
PF05168 | HEPN | 0.65
PF08922 | DUF1905 | 0.65
PF03824 | NicO | 0.65
PF15919 | HicB_lk_antitox | 0.65
PF14076 | DUF4258 | 0.65
PF02837 | Glyco_hydro_2_N | 0.65
PF00400 | WD40 | 0.65
PF04075 | F420H2_quin_red | 0.65
PF02538 | Hydantoinase_B | 0.65
PF03319 | EutN_CcmL | 0.65
PF12344 | UvrB | 0.65
PF05368 | NmrA | 0.65
PF04041 | Glyco_hydro_130 | 0.65
PF04255 | DUF433 | 0.65
PF00891 | Methyltransf_2 | 0.65
PF01642 | MM_CoA_mutase | 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 153 Family Scaffolds
COG1853 | FMN reductase RutF, DIM6/NTAB family | Energy production and conversion [C] | 7.19
COG3076 | Regulator of RNase E activity RraB | Translation, ribosomal structure and biogenesis [J] | 3.92
COG0251 | Enamine deaminase RidA/Endoribonuclease Rid7C, YjgF/YER057c/UK114 family | Defense mechanisms [V] | 2.61
COG0146 | N-methylhydantoinase B/oxoprolinase/acetone carboxylase, alpha subunit | Amino acid transport and metabolism [E] | 1.31
COG4576 | Carboxysome shell and ethanolamine utilization microcompartment protein CcmK/EutM | Energy production and conversion [C] | 1.31
COG0465 | ATP-dependent Zn proteases | Posttranslational modification, protein turnover, chaperones [O] | 1.31
COG4424 | LPS sulfotransferase NodH | Cell wall/membrane/envelope biogenesis [M] | 1.31
COG1801 | Sugar isomerase-related protein YecE, UPF0759/DUF72 family | General function prediction only [R] | 1.31
COG5516 | Uncharacterized conserved protein containing a Zn-ribbon-like motif, possibly RNA-binding | General function prediction only [R] | 0.65
COG3250 | Beta-galactosidase/beta-glucuronidase | Carbohydrate transport and metabolism [G] | 0.65
COG2442 | Predicted antitoxin component of a toxin-antitoxin system, DUF433 family | Defense mechanisms [V] | 0.65
COG2250 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 0.65
COG2152 | Predicted glycosyl hydrolase, GH43/DUF377 family | Carbohydrate transport and metabolism [G] | 0.65
COG2141 | Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase) | Coenzyme transport and metabolism [H] | 0.65
COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 0.65
COG1895 | HEPN domain protein, predicted toxin of MNT-HEPN system | Defense mechanisms [V] | 0.65
COG1884 | Methylmalonyl-CoA mutase, N-terminal domain/subunit | Lipid transport and metabolism [I] | 0.65
COG1765 | Uncharacterized OsmC-related protein | General function prediction only [R] | 0.65
COG1764 | Organic hydroperoxide reductase OsmC/OhrA | Defense mechanisms [V] | 0.65
COG1224 | DNA helicase TIP49, TBP-interacting protein | Transcription [K] | 0.65
COG0499 | S-adenosylhomocysteine hydrolase | Coenzyme transport and metabolism [H] | 0.65
COG0407 | Uroporphyrinogen-III decarboxylase HemE | Coenzyme transport and metabolism [H] | 0.65
COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 0.65


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 53.59 %
Unclassified | root | N/A | 46.41 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000559|F14TC_101214758 | Not Available | 599
3300002120|C687J26616_10039436 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1692
3300002120|C687J26616_10074964 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1122
3300002123|C687J26634_10009716 | All Organisms → cellular organisms → Bacteria | 3937
3300002243|C687J29039_10132944 | Not Available | 899
3300002243|C687J29039_10140397 | Not Available | 869
3300002407|C687J29651_10054234 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1429
3300002485|C687J35088_10071845 | All Organisms → cellular organisms → Bacteria | 1057
3300002530|C687J35503_10154739 | Not Available | 575
3300002561|JGI25384J37096_10140896 | Not Available | 781
3300002912|JGI25386J43895_10009623 | All Organisms → cellular organisms → Archaea | 2743
3300003319|soilL2_10201359 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 4103
3300004019|Ga0055439_10094970 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 877
3300004114|Ga0062593_100326612 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1325
3300004114|Ga0062593_100990941 | All Organisms → cellular organisms → Bacteria | 860
3300004156|Ga0062589_100651356 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 925
3300004156|Ga0062589_101141963 | All Organisms → cellular organisms → Bacteria | 740
3300005186|Ga0066676_10012838 | All Organisms → cellular organisms → Bacteria | 4187
3300005289|Ga0065704_10301155 | All Organisms → cellular organisms → Bacteria → Spirochaetes → Spirochaetia → Leptospirales → Leptospiraceae → Leptospira | 866
3300005332|Ga0066388_100055559 | All Organisms → cellular organisms → Bacteria | 4128
3300005446|Ga0066686_10021529 | All Organisms → cellular organisms → Archaea | 3625
3300005447|Ga0066689_10903432 | Not Available | 546
3300005559|Ga0066700_10044343 | All Organisms → cellular organisms → Archaea | 2697
3300005559|Ga0066700_10232554 | All Organisms → cellular organisms → Archaea | 1283
3300005586|Ga0066691_10493915 | Not Available | 731
3300005598|Ga0066706_10071171 | All Organisms → cellular organisms → Archaea | 2433
3300005598|Ga0066706_10520421 | All Organisms → cellular organisms → Archaea | 947
3300006794|Ga0066658_10032275 | All Organisms → cellular organisms → Archaea | 2136
3300006796|Ga0066665_11585718 | Not Available | 514
3300006845|Ga0075421_100086095 | All Organisms → cellular organisms → Bacteria | 3962
3300006969|Ga0075419_10011208 | All Organisms → cellular organisms → Bacteria | 5543
3300009038|Ga0099829_10398841 | All Organisms → cellular organisms → Archaea | 1137
3300009088|Ga0099830_11263989 | Not Available | 613
3300009089|Ga0099828_11427360 | Not Available | 611
3300009090|Ga0099827_10255801 | All Organisms → cellular organisms → Archaea | 1473
3300009090|Ga0099827_11078191 | Not Available | 697
3300009100|Ga0075418_11090888 | Not Available | 864
3300009100|Ga0075418_13102053 | Not Available | 506
3300009146|Ga0105091_10540359 | Not Available | 596
3300009168|Ga0105104_10424627 | Not Available | 741
3300009597|Ga0105259_1153609 | Not Available | 559
3300009678|Ga0105252_10331865 | All Organisms → cellular organisms → Bacteria | 685
3300009691|Ga0114944_1292087 | Not Available | 669
3300009822|Ga0105066_1173646 | Not Available | 502
3300010047|Ga0126382_10360117 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1119
3300010047|Ga0126382_11962125 | Not Available | 556
3300010359|Ga0126376_12014626 | Not Available | 619
3300010362|Ga0126377_10008860 | All Organisms → cellular organisms → Bacteria | 7727
3300010362|Ga0126377_11187663 | Not Available | 833
3300010362|Ga0126377_13442194 | Not Available | 512
3300010391|Ga0136847_10011128 | Not Available | 653
3300010391|Ga0136847_10863754 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1923
3300010391|Ga0136847_13691737 | Not Available | 726
3300011413|Ga0137333_1005457 | All Organisms → cellular organisms → Bacteria | 2804
3300011413|Ga0137333_1101572 | Not Available | 667
3300012039|Ga0137421_1138011 | Not Available | 715
3300012133|Ga0137329_1059974 | Not Available | 504
3300012160|Ga0137349_1010389 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1360
3300012166|Ga0137350_1014912 | All Organisms → cellular organisms → Bacteria | 1412
3300012168|Ga0137357_1000648 | All Organisms → cellular organisms → Bacteria | 5537
3300012168|Ga0137357_1095562 | Not Available | 611
3300012189|Ga0137388_11411540 | Not Available | 635
3300012225|Ga0137434_1041674 | All Organisms → cellular organisms → Bacteria | 679
3300012355|Ga0137369_10994962 | Not Available | 556
3300012532|Ga0137373_10405017 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1059
3300012675|Ga0137337_1051929 | Not Available | 644
3300012675|Ga0137337_1060030 | Not Available | 597
3300012676|Ga0137341_1030548 | Not Available | 885
3300012918|Ga0137396_10096282 | All Organisms → cellular organisms → Archaea | 2106
3300012927|Ga0137416_11278859 | Not Available | 662
3300012931|Ga0153915_12206446 | Not Available | 644
3300012948|Ga0126375_12015325 | Not Available | 510
3300012972|Ga0134077_10000031 | All Organisms → cellular organisms → Archaea | 19754
3300014150|Ga0134081_10321954 | Not Available | 560
3300014877|Ga0180074_1090498 | Not Available | 681
3300015255|Ga0180077_1031797 | Not Available | 996
3300015256|Ga0180073_1134279 | Not Available | 534
3300015371|Ga0132258_11217901 | Not Available | 1903
3300018029|Ga0187787_10118840 | All Organisms → cellular organisms → Bacteria | 867
3300018031|Ga0184634_10350309 | Not Available | 677
3300018032|Ga0187788_10094774 | All Organisms → cellular organisms → Bacteria | 1073
3300018053|Ga0184626_10099039 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1235
3300018059|Ga0184615_10117859 | Not Available | 1497
3300018059|Ga0184615_10376121 | Not Available | 783
3300018063|Ga0184637_10094189 | All Organisms → cellular organisms → Bacteria | 1841
3300018074|Ga0184640_10076100 | All Organisms → cellular organisms → Bacteria | 1429
3300018077|Ga0184633_10099686 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 1500
3300018078|Ga0184612_10064384 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1911
3300018079|Ga0184627_10276109 | Not Available | 883
3300018082|Ga0184639_10035637 | All Organisms → cellular organisms → Bacteria | 2548
3300018082|Ga0184639_10584923 | Not Available | 549
3300018084|Ga0184629_10055428 | All Organisms → cellular organisms → Bacteria | 1824
3300018084|Ga0184629_10084063 | All Organisms → cellular organisms → Bacteria | 1529
3300021090|Ga0210377_10027766 | All Organisms → cellular organisms → Bacteria | 4075
3300021090|Ga0210377_10071562 | All Organisms → cellular organisms → Bacteria | 2360
3300024241|Ga0233392_1038637 | Not Available | 528
3300025146|Ga0209322_10061771 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1787
3300025146|Ga0209322_10228980 | Not Available | 785
3300025160|Ga0209109_10150456 | Not Available | 1172
3300025164|Ga0209521_10328599 | All Organisms → cellular organisms → Bacteria | 859
3300025164|Ga0209521_10360171 | Not Available | 806
3300025289|Ga0209002_10168041 | All Organisms → cellular organisms → Bacteria | 1380
3300025313|Ga0209431_10017006 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 5575
3300025313|Ga0209431_10041332 | Not Available | 3573
3300025313|Ga0209431_10079732 | All Organisms → cellular organisms → Bacteria | 2560
3300025313|Ga0209431_10079814 | All Organisms → cellular organisms → Bacteria | 2559
3300025313|Ga0209431_10650617 | Not Available | 785
3300025313|Ga0209431_11147005 | Not Available | 536
3300025314|Ga0209323_10621332 | Not Available | 605
3300025322|Ga0209641_10021026 | Not Available | 5116
3300025324|Ga0209640_10265591 | All Organisms → cellular organisms → Bacteria | 1442
3300025324|Ga0209640_10612684 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 877
3300025324|Ga0209640_10766488 | Not Available | 763
3300025325|Ga0209341_10170452 | All Organisms → cellular organisms → Bacteria | 1828
3300025326|Ga0209342_10207231 | All Organisms → cellular organisms → Bacteria | 1744
3300025327|Ga0209751_10010415 | All Organisms → cellular organisms → Bacteria | 8356
3300026298|Ga0209236_1054511 | All Organisms → cellular organisms → Archaea | 1975
3300026313|Ga0209761_1035517 | All Organisms → cellular organisms → Archaea | 2939
3300026325|Ga0209152_10071273 | All Organisms → cellular organisms → Archaea | 1259
3300026326|Ga0209801_1198921 | All Organisms → cellular organisms → Archaea | 805
3300026332|Ga0209803_1022850 | All Organisms → cellular organisms → Bacteria | 3058
3300026332|Ga0209803_1236675 | Not Available | 636
3300026548|Ga0209161_10152043 | All Organisms → cellular organisms → Archaea | 1336
3300026548|Ga0209161_10204559 | All Organisms → cellular organisms → Archaea | 1086
(restricted) 3300027799|Ga0233416_10008968 | All Organisms → cellular organisms → Bacteria | 3238
(restricted) 3300027799|Ga0233416_10148087 | Not Available | 804
3300027875|Ga0209283_10473016 | Not Available | 809
3300027909|Ga0209382_10153941 | All Organisms → cellular organisms → Bacteria | 2661
(restricted) 3300027995|Ga0233418_10137844 | Not Available | 766
(restricted) 3300027995|Ga0233418_10212106 | Not Available | 643
(restricted) 3300028043|Ga0233417_10013707 | All Organisms → cellular organisms → Bacteria | 3040
(restricted) 3300028043|Ga0233417_10101452 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1210
(restricted) 3300028043|Ga0233417_10383786 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Ardenticatenia → Ardenticatenales → unclassified Ardenticatenales → Ardenticatenales bacterium | 645
3300028536|Ga0137415_10344715 | Not Available | 1293
3300030619|Ga0268386_10510894 | Not Available | 820
(restricted) 3300031248|Ga0255312_1153333 | Not Available | 574
3300031576|Ga0247727_10002381 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 43185
3300031576|Ga0247727_10004921 | All Organisms → cellular organisms → Bacteria | 27756
3300031576|Ga0247727_10016789 | All Organisms → cellular organisms → Bacteria | 11754
3300031576|Ga0247727_10067388 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4099
3300031576|Ga0247727_10070369 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 3965
3300031576|Ga0247727_10093245 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria | 3217
3300031576|Ga0247727_10141708 | All Organisms → cellular organisms → Bacteria | 2353
3300031576|Ga0247727_10209210 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1773
3300031576|Ga0247727_10504258 | Not Available | 939
3300031576|Ga0247727_10577100 | Not Available | 851
3300031576|Ga0247727_10582948 | Not Available | 845
3300031576|Ga0247727_10752282 | Not Available | 704
3300031949|Ga0214473_10525145 | All Organisms → cellular organisms → Bacteria | 1317
3300033417|Ga0214471_10232179 | All Organisms → cellular organisms → Bacteria | 1511
3300033814|Ga0364930_0325162 | Not Available | 516
3300034177|Ga0364932_0165156 | Not Available | 842
3300034177|Ga0364932_0292719 | Not Available | 615

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 13.73%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 11.11%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 10.46%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 8.50%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 7.84%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 7.84%
Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment | 4.58%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.58%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.58%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.27%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 2.61%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.61%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 1.96%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.96%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 1.31%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.31%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.31%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.31%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.31%
Freshwater Wetlands | Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands | 0.65%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.65%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 0.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.65%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.65%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.65%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.65%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.65%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.65%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 0.65%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.65%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere | 0.65%

Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000559 | Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly | Environmental
3300002120 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 | Environmental
3300002123 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_3 | Environmental
3300002243 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 | Environmental
3300002407 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 | Environmental
3300002485 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 | Environmental
3300002530 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 19_3 | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300003319 | Sugarcane bulk soil Sample L2 | Environmental
3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005289 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009100 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2 | Host-Associated
3300009146 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 10-12cm March2015 | Environmental
3300009168 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 | Environmental
3300009597 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT299 | Environmental
3300009678 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT100 | Environmental
3300009691 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP2 | Environmental
3300009822 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300011413 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT231_2 | Environmental
3300012039 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT534_2 | Environmental
3300012133 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT121_2 | Environmental
3300012160 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT630_2 | Environmental
3300012166 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT660_2 | Environmental
3300012168 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT860_2 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012225 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2 | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012675 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT333_2 | Environmental
3300012676 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT433_2 | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012948 | Tropical forest soil microbial communities from Panama - MetaG Plot_14 | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014877 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT366_16_10D | Environmental
3300015255 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT466_16_10D | Environmental
3300015256 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT333_16_10D | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300018029 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP06_20_MG | Environmental
3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental
3300018032 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MG | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018063 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b2 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300024241 | Subsurface microbial communities from Mancos shale, Colorado, United States - Mancos A_50_July_PB | Environmental
3300025146Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 1EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025164Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 4EnvironmentalOpen in IMG/M
3300025289Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2EnvironmentalOpen in IMG/M
3300025313Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes)EnvironmentalOpen in IMG/M
3300025314Soil microbial communities from Rifle, Colorado, USA - sediment 19ft 2EnvironmentalOpen in IMG/M
3300025322Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025325Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025326Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes)EnvironmentalOpen in IMG/M
3300025327Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 (SPAdes)EnvironmentalOpen in IMG/M
3300026332Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027909Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 (SPAdes)Host-AssociatedOpen in IMG/M
3300027995 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MGEnvironmentalOpen in IMG/M
3300028043 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0.5_MGEnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031576Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033814Sediment microbial communities from East River floodplain, Colorado, United States - 55_j17EnvironmentalOpen in IMG/M
3300034177Sediment microbial communities from East River floodplain, Colorado, United States - 17_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Interactive map (powered by OpenStreetMap)




Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences listed below requires obtaining a license from the corresponding author(s) of the datasets.

Protein ID | Sample Taxon ID | Habitat | Sequence
F14TC_1012147581 | 3300000559 | Soil | VSETVAIILRVRKEQGDKFEKLFKEEVLPLWHESKAQGKFIAASLTRVQDGNQQKAGIRDYSLHVERPSHAEHDEFDSSARFMEFLPKAQAMQPEEPLVWFGTTQFKVP*
C687J26616_100394363 | 3300002120 | Soil | MSETIAIILRFREEHVDEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPRARAMQPEEPLVWLGNTLFQL*
C687J26616_100749642 | 3300002120 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLG
C687J26634_100097162 | 3300002123 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGRTLFQV*
C687J29039_101329442 | 3300002243 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGNTLFQL*
C687J29039_101403973 | 3300002243 | Soil | SETVAIILRFREEDAEQFESAFKAEVYPLWEEFRTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
C687J29651_100542343 | 3300002407 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGPTLFQV*
C687J35088_100718452 | 3300002485 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGXTLFQV*
C687J35503_101547391 | 3300002530 | Soil | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
JGI25384J37096_101408961 | 3300002561 | Grasslands Soil | MTQTNAIILRFREGEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
JGI25386J43895_100096232 | 3300002912 | Grasslands Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNRRRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
soilL2_102013596 | 3300003319 | Sugarcane Root And Bulk Soil | MSETVAIILRFREEDAEQFESAFKAEIYPLWEEFKAQGKFISASLTPALDGSEKKDGFRDYILHVETPSRAEHSEFDSEPRFLPFLEKFKVLQPEEPKVWLGNTLFQI*
Ga0055439_100949701 | 3300004019 | Natural And Restored Wetlands | MSETVAIILRFQEKDVEQFETAFKAEVIPLWEEFKAQGKFISASLTPALEGSEKKRGFRDYILHVEVPSRAEHEEFDSEPRFLPFLDRFRAMQPEEPKVWLGNTLFQV*
Ga0062593_1003266122 | 3300004114 | Soil | MSQTIAIILRFRKEEADHFEELFRAEVYPIWEEFKAEGKFIAASLTPVQGGSEGKEGVRDYILHVEAPGMDAHNEFDTLPRFLTFLEKARKMQPEEPKVWFGDTLFQI*
Ga0062593_1009909411 | 3300004114 | Soil | MSETVAIILRFREEGAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFMPFLEKFRAMQPEEPKVWLGNTLFQV*
Ga0062589_1006513561 | 3300004156 | Soil | MSQTIAIILRFRKEEADHFEELFRAEVYPIWEEFKAEGKFIAASLTPVQGGSEGKEGVRDYILHVEAPGMDAHNEFDTLPRFLTFLEKARKMQPEEPKV
Ga0062589_1011419631 | 3300004156 | Soil | IILRFREEGAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFMPFLEKFRAMQPEEPKVWLGNTLFQV*
Ga0066676_100128386 | 3300005186 | Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGMIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0065704_103011552 | 3300005289 | Switchgrass Rhizosphere | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLAPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRGMQPEDPKVWLGNTLFQV*
Ga0066388_1000555594 | 3300005332 | Tropical Forest Soil | MSETVAIILRFREEDAERFEAAFKAEVYPLWEEFKTHGKFIAASLTPALDGSEKKDGFRDYILHVEVPSRADHTEFDSEPRFLPFLDTFKAMQPEEPKVWLGNTLFQI*
Ga0066686_100215295 | 3300005446 | Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGMIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPK
Ga0066689_109034321 | 3300005447 | Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEE
Ga0066700_100443433 | 3300005559 | Soil | MTQTNAIILRFREEEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPQAQAMQPEEPLVWLGNTLFQV*
Ga0066700_102325542 | 3300005559 | Soil | MTQTNAIILRFRDEEVGNFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNRRRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0066691_104939151 | 3300005586 | Soil | MTQTNAIILRFHEEEAGNFEALFKKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPNAQAMQPEEPLVWLGNTLFQV*
Ga0066706_100711712 | 3300005598 | Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRQFKARGKIIAASLTPVQDGNRGRKGVRDYILHVEVPGMAEHSEFDSNASFLKFLPRAQAMQPEEPLVWLGNTLFQV*
Ga0066706_105204211 | 3300005598 | Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0066658_100322753 | 3300006794 | Soil | MTQTNAIILRFREEEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPE
Ga0066665_115857182 | 3300006796 | Soil | TLLFGKSADVLSMTQTNAIILRFREEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHVEVPSMAEHSEFDSNASFLKFLAKAQAMQPEAPLVWLGNTLFQV
Ga0075421_1000860952 | 3300006845 | Populus Rhizosphere | MSETVAIILRFREEDAEQFESAFEAEVYHLWEEFKAQGKFISASLTPALDGSEKKDGFRDYILHVEVPSRAEHDEFDSEPRFLPFLEKFQALQPEEPKVWLGNTLFQI*
Ga0075419_100112082 | 3300006969 | Populus Rhizosphere | MSETVAIILRFREEDAEQFESAFEAEVYHLWEEFKAQGKFSSASLKPALDGSEKKDGFRDYILHVEVPSRAEHDEFDSEPRFLPFLEKFQALQPEEPKVWLGNTLFQI*
Ga0099829_103988411 | 3300009038 | Vadose Zone Soil | MTQTNAIILRFRQEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0099830_112639891 | 3300009088 | Vadose Zone Soil | VVSEARSRQNIVIWEISDVPNMTQTNAIVLRFREEEAEKFEALFRKEVLPLWRQFKARGNIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0099828_114273601 | 3300009089 | Vadose Zone Soil | VVSEARSRQNIVIWEISDVPNMTQTNAIVLRFREEEAEKFEALFRKEVHPLWRQFKARGKIIAASLTPVQDGNQARKGVRDYILHVEVPGMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0099827_102558012 | 3300009090 | Vadose Zone Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHEEVPSMAEHSEFDSNASLLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0099827_110781912 | 3300009090 | Vadose Zone Soil | MTQTNAIILRFREEEAEKFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNQVRKGVRDYILHVEVPGMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0075418_110908882 | 3300009100 | Populus Rhizosphere | MSETVAIILRFREEDAEQFESAFKAEVYLLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0075418_131020531 | 3300009100 | Populus Rhizosphere | MTETIAIILRFHKQDAEQFESAFRTEVYPLWEEFKTQGKFISASLTPILEGSEMKDEFQDYILHVEVPSRAQHDEFDSEPRFMPFLDKFQATQPEEPRVWLGNTLFQV*
Ga0105091_105403591 | 3300009146 | Freshwater Sediment | RFRQEDAEQFESAFKAEVYPLWEEFKTQGKFISGSLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0105104_104246271 | 3300009168 | Freshwater Sediment | VFLKAKRLTALGKKEVRMSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0105259_11536091 | 3300009597 | Soil | ARSDQHFLRHRCFIYFDKKLEVPMSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKAGYRDYILYVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0105252_103318651 | 3300009678 | Soil | MSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKAGYRDYILHVVVPSRAEHEEFDLEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0114944_12920872 | 3300009691 | Thermal Springs | MSETVAIVLRFREEDAGKFESAFKTEVYPLWQEFKAQGKFISASLTPAVDGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLKKFQALQPEEPKVWLGDTLFQV*
Ga0105066_11736462 | 3300009822 | Groundwater Sand | QFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0126382_103601171 | 3300010047 | Tropical Forest Soil | MSETVAIILRFREEDAEQFESAFKAEVFPLWEEFKAQGKFIAASLTPALDGSEKKGGFRDYILHVEVPSRAEHTEFDSEPRFLPFLDKFRALQPDEPKVWLGNTLFQI*
Ga0126382_119621251 | 3300010047 | Tropical Forest Soil | MSETVAIILRFHKEDAEQFESAFKAEVYPLWEEFKAQGKFISASLTPALDGSEKKDGFRDYILHVEVPSRAEHSEFDEEPRFLPFLKKFQALQPEESKVWLGNTLFQI*
Ga0126376_120146261 | 3300010359 | Tropical Forest Soil | MTQTNSIILRIGAEKTKEFEVMFEKEVLPLWRKFKDEGKFISASLTPVLDGNQSKDGIQDYILHVEVPSMSEHNEFDTNPLFTKFLPKAQALQPEEPLVWLGQTLFQV*
Ga0126377_100088608 | 3300010362 | Tropical Forest Soil | MSETVAIILRFREEDAEQFESSFKAEVYPLWEEFKALGKFISASLTPAIDGSEKKDGLRDYILHVEVPSRAEHDEFDSEPRFLPFLEKYKALQPEEPKVWLGNTLFQI*
Ga0126377_111876632 | 3300010362 | Tropical Forest Soil | MEVLMSETVAIILRFREEDAQRFEAAFAAEVYPLWEEFKAQGKFLSASLTPALDGSEKKEGFRDYILHVEVPSRAEHSEFDEEPRFLPFLEKFKALQPEEPKVWLGNTLFQI*
Ga0126377_134421942 | 3300010362 | Tropical Forest Soil | EQFESAFKVEVYPLWEEFKVQGKFIAASLTPALDGSEKKDGFRDYILHVEVPSRAEHTEFDSEPRFLPFLDTFKAMQPEEPKVWLGNTLFQI*
Ga0136847_100111281 | 3300010391 | Freshwater Sediment | MSQTNAIILRFREVDADRFEKLFEAEILPLWGTFKAQGKFIAASLTPVAGGSEIKKGVRDYILHVEVPSMAEHEEFDSSAPFLAFLAKAKLMQPEEPKVWLGTTRFQV*
Ga0136847_108637542 | 3300010391 | Freshwater Sediment | VSETIAIILRFHEEDAESFESAFKTEVYPLWEEFKSKGKFISASLTPVTDGSEKKAGFRDYILHVEVPSRSEHSEFDSEPRFLPFLEKFQAVQPEEPKVWLGNTLFQI*
Ga0136847_136917372 | 3300010391 | Freshwater Sediment | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFRTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0137333_10054571 | 3300011413 | Soil | MSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0137333_11015722 | 3300011413 | Soil | MSETIAIILRFREADAERFESAFKTEVYPLWEEFKAQGKFISASLTPAIAGSEKKAGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFKPLQPEEPKVWLGNTLFQI*
Ga0137421_11380112 | 3300012039 | Soil | MSETIAIILRFREEDAKRFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKAGYRDYILHVVVPSRAEHEEFDLEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0137329_10599741 | 3300012133 | Soil | PMSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSEGKFISASLTPVMDGNEKKAGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0137349_10103891 | 3300012160 | Soil | RFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0137350_10149122 | 3300012166 | Soil | MSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLENTLFQI*
Ga0137357_10006487 | 3300012168 | Soil | MSETIAIILRFREADAERFESTFKTEVYPLWEEFKAQGKFISASLTPAMAGSEKKAGFQDYILHVEVPSRAEHEEFDSEPRFLPFLEKFKPLQPEEPKVWLGNTLFHI*
Ga0137357_10955622 | 3300012168 | Soil | MIWAKPPRLWKNLGDFYKVEVVMSETIAIILRFREEDAGRFESAFKTEVYSLWEEFKSQGKFISASLTPVMDGSEKKKGFRDYILHVEVPSRAQHDEFDSEPRFLPFLKQFQALQPEEPKVWLGNTLFQI*
Ga0137388_114115401 | 3300012189 | Vadose Zone Soil | MTQTNAIILVFREEGAGNFEALFRKEVLPLWRQFKARGNIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0137434_10416741 | 3300012225 | Soil | MSETIAIILRFREEDAERFESAFKAEVYPLWEELKSQGKFISASLTPVMDGNEKKNGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRTMQPEEPKVWLGNTLFQI*
Ga0137369_109949621 | 3300012355 | Vadose Zone Soil | ATTTDNSEGHMSQTIAIILRFREAEANRFEEIFKAEVYPLWQEFKAQGKFITASLTPVQDGSEMKEGVRDYILHVEVPGMAEHDEFDSLPTFLTFLEKARPLQPEEPKVWFGDTLVQV*
Ga0137373_104050172 | 3300012532 | Vadose Zone Soil | VSWYPKQNYLASGNLLLTGQKELSMSETIAIILHFHEEDAEYFESAFKTNVYPLWEDFKAQGKFISASLTPVTDGSEMKEGIRDYILHVEVPSRAEHDEFDSEPRFLPFLEKFQPLQPEEPKVWLGNTLFQI*
Ga0137337_10519292 | 3300012675 | Soil | MSETIAIILRFREEHADEFEKLFQEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKAGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARALQPEEPLVWLGPTLFQL*
Ga0137337_10600301 | 3300012675 | Soil | KHRIARSDQHFLRHRCFIYFDKKLEVPMSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0137341_10305482 | 3300012676 | Soil | MSETIAIILRFREADAERFESAFKTEVYPLWEEFKAQGKFISASLTPAMAGSEKKAGFQDYILHVEVPSRAEHEEFDSEPRFLPFLEKFKPLQPEEPKVWLGNTLFQI*
Ga0137396_100962822 | 3300012918 | Vadose Zone Soil | MTQTNAIILRFREEEAEDFEALFKKEVLPLWRRFKAQGKIIAASLTPVQDGNRTKKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0137416_112788592 | 3300012927 | Vadose Zone Soil | LRFREEEAGNFEALFKKEVLPLWRQFKDRGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMADHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0153915_122064462 | 3300012931 | Freshwater Wetlands | MSGYFSVNGYARRYSRSCINVKSLQEVKMSQTVAIILRFRKDEAQRFEQIFEAEILPMWEQYKAQGKFLSASLTPVEDGSEVKEGVRDYILHVEVPSMAEHQEFDSSAPFLAFLQKAKPMQPEEPKVWLGNTLFQ
Ga0126375_120153252 | 3300012948 | Tropical Forest Soil | MEVLMSETVAIILRFREEDAQRFEAAFAAEVYPLWEEFKAQGKFISASLTPALDGSEKKEGFRDYILHVEVPSRAEHEEFDSAPRFLPFLKKFQALQPEEPKVWLGNTLFQI*
Ga0134077_100000317 | 3300012972 | Grasslands Soil | MTQTNAIILRLREEEAGNFEALFRKEVLPLWRQFKARGMIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0134081_103219541 | 3300014150 | Grasslands Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKRVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV*
Ga0180074_10904981 | 3300014877 | Soil | VFLKAKRLTALSKKEVRMSETVAIILRFREEDAKQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV*
Ga0180077_10317972 | 3300015255 | Soil | MWATLKAKHRIARSDQHFLRHRCFIYFDKKLEVPMSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI*
Ga0180073_11342791 | 3300015256 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKERVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEQPLVWLGPTLFQL*
Ga0132258_112179012 | 3300015371 | Arabidopsis Rhizosphere | MSQTIAIILRFRKDEADQFEQLFKAEVYPIWEAFKAEGKFIAASLTPVQGGSEGKEGVRDYILHIEAPGMDAHNEFDTLPRFLTFLEKARKMQPEEPKVWFGDTLFQI*
Ga0187787_101188402 | 3300018029 | Tropical Peatland | MSETIAIILRFREEDAQQFESAFKAQVYPLWEEFKALGKFISASLTPVLDGNEKKDGFRDYILHVEVPSRAEHDEFDSGQRFLPFLREFQAMQPEEPKVWLGNTLFQV
Ga0184634_103503091 | 3300018031 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGPTLIQL
Ga0187788_100947741 | 3300018032 | Tropical Peatland | MSETIAIILRFREEDAQQFEAAFKEQVYPLWEEFKALGKFISASLTPVLDGSEKKDGFRDYILHVEVPSRAEHDEFDSGQRFLPFLREFQAMQPEEPKVWLGNTLFQV
Ga0184626_100990392 | 3300018053 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEEFDSHAQFMEFLPKARAMQPEEPLVWLGPTLIQL
Ga0184615_101178591 | 3300018059 | Groundwater Sediment | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV
Ga0184615_103761212 | 3300018059 | Groundwater Sediment | MSETIAIILRFREEHADEFEKLFQEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHIEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGPTLFQL
Ga0184637_100941893 | 3300018063 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQNKEGVRDYILHVEVPSRAEHEEFDSHARFVEFLPKVRAMQPEEPLVWLGPTLFQL
Ga0184640_100761002 | 3300018074 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKERVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGPTLIQL
Ga0184633_100996862 | 3300018077 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEQPLVWLGPTLFQL
Ga0184612_100643843 | 3300018078 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARVMQPEEPLVWLGPTLIQL
Ga0184627_102761091 | 3300018079 | Groundwater Sediment | MLATRFRPPAAGLYPLFTIHSALFFGGSMSQTNAIILRFREGDAERFEKLFEAEILPLWKQFQAQGKFIAASLTPVDDGSEIKKGVKDYILHVEVPSMAEHEEFDSSAPFLAFLKKAKVMQPEEPKVWLGTTRFQV
Ga0184639_100356374 | 3300018082 | Groundwater Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEQPLVWLGPTLFQL
Ga0184639_105849232 | 3300018082 | Groundwater Sediment | MSETVAIILRFREEHADEFERLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEAFDTHARFVEFLPKARAMQPEEPLVWLGPTLFQL
Ga0184629_100554283 | 3300018084 | Groundwater Sediment | MSETIAIILRFREEDAERFESAFKAEVYPLWEEFKSQGKFISASLTPVMDGNEKKDGYRDYILHVVVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI
Ga0184629_100840633 | 3300018084 | Groundwater Sediment | MSETIAIILRFRKEDTEQFEAAFKTEVYSLWEEFKAQGKFISASLTPVTTGNEMKAEFQDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI
Ga0210377_100277663 | 3300021090 | Groundwater Sediment | MSETIAIILRFREEDTGRFESAFKTEVYPLWEEFKSQGKFISASLTPVEAGSEKKKGFRDYILHVEVPSRAEHEEFDSEPRFLPFLKQFQALQPEEPKVWLGNTLFQI
Ga0210377_100715624 | 3300021090 | Groundwater Sediment | MSETIAIILRFREEDAGRFESAFKTEVYPLWEEFKSQGKFISASLTPVMDGSEKKKGFRDYILHVEVPSRAEHDQFDSEPRFLPFLKQFQAMQPEEQKVWLGNTLFQI
Ga0233392_10386371 | 3300024241 | Deep Subsurface Sediment | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGSTLFQL
Ga0209322_100617712 | 3300025146 | Soil | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFRTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV
Ga0209322_102289802 | 3300025146 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGRTLFQV
Ga0209109_101504561 | 3300025160 | Soil | MSETIAIILRFREEHAAEFERLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEQPLVWLGPTLFQL
Ga0209521_103285992 | 3300025164 | Soil | LSQTVDTFRTVAIILRFREEESERFERLFTAEVYPLWEQFKAQGKFLAASLTPVQDGSEIKEGARDYILHVELPGMDEHHEFDAQPPFLKFMEKARPLQPEEPKVWLGDTLFRV
Ga0209521_103601711 | 3300025164 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARALQPEEPLVWLGPTLFQL
Ga0209002_101680412 | 3300025289 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGPTLFQV
Ga0209431_100170068 | 3300025313 | Soil | MSETIAIILRFREEHVDEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPRARAMQPEEPLVWLGNTLFQL
Ga0209431_100413321 | 3300025313 | Soil | MSETVAIILRFREEHADTFERLFQEEVLPLWHEFKAQGKFIAASLTPVQGGNRQTAGVRDYILHVEVPSRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWL
Ga0209431_100797322 | 3300025313 | Soil | MEGRNPLSETVAIILRFREEDTPRFEGLFREEVYPLWEEFKSEGRFISASLTPAIDGSETKPGIRDYILHVEVPGRAEHDQFDSDPRFTKFLEKVQPLQPEEPRVWLGNTLFQV
Ga0209431_100798145 | 3300025313 | Soil | MSETIAIILRFREEHADTFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGPTL
Ga0209431_106506172 | 3300025313 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRTEHEEFDSHARFMEFLPKARALQPEEPLVWLGPTLFQL
Ga0209431_111470051 | 3300025313 | Soil | MSETVAIILRFRKEDAEQFESAFKAEVYPLWQEFKAQGKFITASLTPAAEGNEKKAGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRALQPEEPKVWLGNTLFQV
Ga0209323_106213322 | 3300025314 | Soil | MSETIAIILRFREEHVDEFERLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGNTLFQL
Ga0209641_100210263 | 3300025322 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQEKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGNTLFQL
Ga0209640_102655912 | 3300025324 | Soil | MSETIAIILRFREEHADEFEKLFQEEVLPLWREFKTQGKFIAASLTPVQEGNQQKAGVRDYILHVEVPSRAEHEEFDSHARFVEFLPKARAMQPEEPLVWLGPTLFQL
Ga0209640_106126842 | 3300025324 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWYEFKAQGKFIAASLTPVQEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGSTLFQL
Ga0209640_107664881 | 3300025324 | Soil | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGPTLFQL
Ga0209341_101704521 | 3300025325 | Soil | AGRLSSRPLGAQVSAPHVTLSALRMEGRNPLSETVAIILRFREEDTPRFEGLFREEVYPLWEEFKSEGRFISASLTPAIDGSETKPGIRDYILHVEVPGRAEHDQFDSDPRFTKFLEKVQPLQPEEPRVWLGNTLFQV
Ga0209342_102072311 | 3300025326 | Soil | ILRFREEHADEFERLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKGGVRDYILHIEVPSWAEHEEFDSHARFVEFLPKARAMQPEEPLVWLGPTLFQL
Ga0209751_100104154 | 3300025327 | Soil | MSETIAIILRFREEHADTFERLFKEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTAGVRDYILHVEVPGRAEHEEFDSHPRFEEFLPKARAMQPEEPLVWLGPTLFQV
Ga0209236_10545112 | 3300026298 | Grasslands Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGMIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209761_10355172 | 3300026313 | Grasslands Soil | MTQTNAIILRFREGEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209152_100712731 | 3300026325 | Soil | MTQTNAIILRFREEEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNRRRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEE
Ga0209801_11989211 | 3300026326 | Soil | MTQTNAIILRFREEEAGNFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNRRRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209803_10228505 | 3300026332 | Soil | IILRFREEEAGNFEALFRKEVLPLWRQFKARGKIIAASLTPVQDGNRRRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209803_12366751 | 3300026332 | Soil | MTQTNAIILRFREEEAGDFEVLFRKEVLPLWRQFKARGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209161_101520431 | 3300026548 | Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRQFKARGKIIAASLTPVQDGNRGRKGVRDYILHVEVPGMAEHSEFDSNASFLKFLPRAQAMQPEEPLVWLGNTLFQV
Ga0209161_102045592 | 3300026548 | Soil | MTQTNAIILRFREEEAGNFEALFKKEVLPLWRRFKARGKIIAASLTPVQDGNQGRRGVRDYILHVEVPSMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
(restricted) Ga0233416_100089682 | 3300027799 | Sediment | MSETIAIILRFREENADAFERLFAEEVLPLWHEFKAQGKFIAASLTPVQEGNQRKEGVRDYILHVEVPSRAEHTEFDSHPRFEAFLPKARAMQPEEPLVWLGPTLHQV
(restricted) Ga0233416_101480872 | 3300027799 | Sediment | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNALFQV
Ga0209283_104730161 | 3300027875 | Vadose Zone Soil | MTQTNAIVLRFREEEAEKFEALFRKEVHPLWRQFKARGKIIAASLTPVQDGNQARKGVRDYILHVEVPGMAEHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0209382_101539414 | 3300027909 | Populus Rhizosphere | MSETVAIILRFREEDAEQFESAFEAEVYHLWEEFKAQGKFISASLTPALDGSEKKDGFRDYILHVEVPSRAEHDEFDSEPRFLPFLEKFQALQPEEPKVWLGNTLFQI
(restricted) Ga0233418_101378441 | 3300027995 | Sediment | MSETIAIILRFREERADEFERLFKDEVLPLWHEFKAQGKFIAASLTPVQEGNQNKEGVRDYILHIEVPSRAEHEEFDSHARFVEFLPKARAMRPEEQPVWLRPTLFQL
(restricted) Ga0233418_102121062 | 3300027995 | Sediment | MSETVAIILRFREEHADAFERLFAEEVLPLWHEFKAQGKFIAASLTPVQEGNQRKEGVRDYILHVEVPSRAEHTEFDSHPRFEAFLPTARAMQPEEPLVWLGPTLHQV
(restricted) Ga0233417_100137072 | 3300028043 | Sediment | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALDGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV
(restricted) Ga0233417_101014521 | 3300028043 | Sediment | MSETIAIILRFREERADEFERLFKDEVLPLWHEFKAQGKFIAASLTPVQEGNQNKEGVRDYILHVEVPSRAEHEEFDSHARFVEFLPKARAMRPEEPPVWLRPTLFQL
(restricted) Ga0233417_103837862 | 3300028043 | Sediment | MSETVAIILRFREEHADAFERLFAEEVLPLWHEFKAQGKFIAASLTPVQEGNQQTEGVRDYILHVEVPSRAEHTEFDSHPRFEAFLPKARAMQPEEPLVWLGPTLHQV
Ga0137415_103447151 | 3300028536 | Vadose Zone Soil | LRFREEEAGNFEALFKKEVLPLWRQFKDRGKIIAASLTPVQDGNQGRKGVRDYILHVEVPSMADHSEFDSNASFLKFLPKAQAMQPEEPLVWLGNTLFQV
Ga0268386_105108942 | 3300030619 | Soil | MSETVAIILRFREEDAEQFESAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFRL
(restricted) Ga0255312_11533332 | 3300031248 | Sandy Soil | MSETIAIILRFRGKDAEKFESAFKTEVVPLWEEFKAQGKFIAASLTPVQDGSEKKDGFRDYILHVEVPSRAEHEAFDSEPRFLPFLKQFQALQPEEPKVWLGNTLFQV
Ga0247727_100023812 | 3300031576 | Biofilm | MSETIAIILRFREEHADEFERLFKEEVLPLWHEFKTQGKFIAASLTLVQDGNQQKEGVRDYILHVEVPSRAEHEEFDTHARFVEFLPKARAMQPEEPLVWLGSTLFQL
Ga0247727_1000492111 | 3300031576 | Biofilm | MSETIAIILRFREEHADEFERLFKEDVLPLWHEFKTQGKFIAASLTPVQEGNRQKEGVRDYILHVEVPSRAEHEAFDTHARFVEFLPKARAMQPEEPLVWLGSTLFQL
Ga0247727_1001678911 | 3300031576 | Biofilm | MSETVAIILRFNEGDADRFESAFETEVYPLWQEFKAQGKFISASLTPVMDGSEMKDGFRDYILHVEVPSRAEHDEFDSEPRFLPFLEKFQAMQPEEP
Ga0247727_100673882 | 3300031576 | Biofilm | MSETIAIILRFREEHADEFEKLFKEEVLPLWHEFKMQGKFIAASLTPVQEGNQQKAGVRDYILHVEVPGRAEHEEFDSHARFMEFLPKARAMQPEEPLVWLGPTLFQL
Ga0247727_100703694 | 3300031576 | Biofilm | MSETIAIILRFRKEDAEQFEAAFKTEVYPLWEEFKAQGKFISASLTPVAEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFQAMQPEEPKVWLGNTLFQI
Ga0247727_100932458 | 3300031576 | Biofilm | MSETIAIILRFRAEDAERFESAFKAEVYPLWEEFKTRGKFISASLTPVTAGSEKKDGFRDYILHVEVPSRAEHKEFDSEPRFLPFLEKFRALQPEDPKVWLGNTLFQI
Ga0247727_101417082 | 3300031576 | Biofilm | MSETIAIILRFRKEDTGQFESAFKTEVYPLWEEFKAQGKFISASLTPVAEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFQAMQPEEPKVWLGNTLFQI
Ga0247727_102092104 | 3300031576 | Biofilm | MSETVAIILRFRKADAEEFERAFKAEVYPLWEEYKSQGKFISASLTPATDGSEKRPGFRDYILHVEAPSRAEHSEFDSEPRFLPFLEKFRAMQPEEPKVWLGNTLFQI
Ga0247727_105042581 | 3300031576 | Biofilm | MSQTNAIILRLRQEHADTFESLFRTEVLPLWHEFKDKGKFLSASLTPVHAGNQQRPGVRDYILHVEVPSMSEHEEFDSDPRFLDFLRKVRPALQAEDPLVWLGPTLFQV
Ga0247727_105771001 | 3300031576 | Biofilm | MSETIAIILRFRAEDAERFESAFKAEVYPLWEEFKTLGKFISASLTPVTAGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFFEKFRALQPEDPKVWLGNTLFQI
Ga0247727_105829481 | 3300031576 | Biofilm | MSETMAIILRFREEHADEFERLFKEEVLPLWHEFKAQGKFIAASLTPVHEGNQQKEGVRDYILHVEVPSRAEHEEFDSHARFMEFLPKARAMQPEQPLVWLGPTLFQL
Ga0247727_107522822 | 3300031576 | Biofilm | MSETIAIILRFRAEDAERFESAFKAEVYPLWEEFKTLGKFISASLTPVTAGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRALQPEDPKVWLGNTLFQI
Ga0214473_105251452 | 3300031949 | Soil | MSETIAIILRFREERADEFEKLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHIEVPSRAEHEEFDSHARFVEFLPKARAMQPEEPLVWLGPTLFQL
Ga0214471_102321792 | 3300033417 | Soil | MPQTIAIILRFREAEVDNFEQLFGAEVYPLWQEFKAQGRLITASLTPVEDGSEMQDGVRDYILHLELMGMEDHHAFDTDPRFISFLKKARPLQPAEAKVWFGEPRFTI
Ga0364930_0325162_3_269 | 3300033814 | Sediment | SAFKAEVYPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV
Ga0364932_0165156_427_753 | 3300034177 | Sediment | MSETVAIILRFREEDAEQFESAFKAEVHPLWEEFKTQGKFISASLTPALEGSEKKDGFRDYILHVEVPSRAEHEEFDSEPRFLPFLEKFRAMQPEDPKVWLGNTLFQV
Ga0364932_0292719_2_262 | 3300034177 | Sediment | MSETIAIILRFREERADEFEKLFKEEVLPLWHEFKTQGKFIAASLTPVQEGNQQKEGVRDYILHIEVPSRAEHEEFDSHARFVEFLP




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice