NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F043035


Overview

Basic Information
Family ID F043035
Family Type Metagenome / Metatranscriptome
Number of Sequences 157
Average Sequence Length 119 residues
Representative Sequence MASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Number of Associated Samples 116
Number of Associated Scaffolds 157

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 66.24 %
% of genes near scaffold ends (potentially truncated) 29.30 %
% of genes from short scaffolds (< 2000 bps) 64.97 %
Associated GOLD sequencing projects 104
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66
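
The percentages in this block can be converted back to approximate gene counts. The following is a minimal sketch, assuming each percentage is a rounded fraction of the family's 157 sequences (the page does not state how the values were computed); the dictionary labels and the function name are illustrative only.

```python
FAMILY_SIZE = 157  # "Number of Sequences" from Basic Information

# Percentages copied from the Quality Assessment block; labels are illustrative.
percentages = {
    "valid RBS motif": 66.24,
    "near scaffold end": 29.30,
    "short scaffold (< 2000 bps)": 64.97,
}

def approx_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a percentage of the family into the nearest whole gene count."""
    return round(percent / 100 * total)

counts = {label: approx_count(p) for label, p in percentages.items()}
for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE}")
```

Under that assumption, roughly 104, 46 and 102 genes fall into the three categories, respectively.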


Most Common Taxonomy
Group Bacteria (83.439 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (24.841 % of family members)
Environment Ontology (ENVO) Unclassified (23.567 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (70.701 % of family members)



Structure & Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 66.67%, β-sheet: 0.00%, Coil/Unstructured: 33.33%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
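
The cut-off quoted above can be written as a one-line check. This is a minimal sketch: the 0.7 threshold comes from the note itself, while the constant and function names are hypothetical and not part of NMPFamsDB.

```python
# Threshold from the note above: models with pTM < 0.7 are treated as low confidence.
LOW_CONFIDENCE_PTM = 0.7  # illustrative constant name

def is_low_confidence(ptm_score: float) -> bool:
    """Return True when a predicted model falls below the pTM cut-off."""
    return ptm_score < LOW_CONFIDENCE_PTM

# Family F043035 reports a pTM-score of 0.66.
print(is_low_confidence(0.66))
```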



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name              % Frequency in 157 Family Scaffolds
PF00160   Pro_isomerase     36.94
PF01613   Flavin_Reduct     26.11
PF06155   GBBH-like_N       9.55
PF02687   FtsX              5.10
PF07992   Pyr_redox_2       5.10
PF00005   ABC_tran          3.18
PF00135   COesterase        2.55
PF12704   MacB_PCD          2.55
PF02776   TPP_enzyme_N      0.64
PF13620   CarboxypepD_reg   0.64
PF00575   S1                0.64

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name   Functional Category   % Frequency in 157 Family Scaffolds
COG0652   Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family   Posttranslational modification, protein turnover, chaperones [O]   36.94
COG1853   FMN reductase RutF, DIM6/NTAB family   Energy production and conversion [C]   26.11
COG3536   Uncharacterized conserved protein, DUF971 family   Function unknown [S]   9.55
COG2272   Carboxylesterase type B   Lipid transport and metabolism [I]   2.55



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
All Organisms   root   All Organisms   83.44 %
Unclassified    root   N/A             16.56 %


Associated Scaffolds


Scaffold   Taxonomy   Length
3300001593|JGI12635J15846_10027913   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia   4546
3300001867|JGI12627J18819_10143796   All Organisms → cellular organisms → Bacteria   973
3300002245|JGIcombinedJ26739_100068903   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia   3259
3300002558|JGI25385J37094_10101778   All Organisms → cellular organisms → Bacteria → Proteobacteria   849
3300005167|Ga0066672_10065833   All Organisms → cellular organisms → Bacteria   2127
3300005174|Ga0066680_10004289   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   6669
3300005174|Ga0066680_10707236   All Organisms → cellular organisms → Bacteria   618
3300005176|Ga0066679_10019485   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3568
3300005526|Ga0073909_10587958   Not Available   548
3300005529|Ga0070741_10000796   All Organisms → cellular organisms → Bacteria   105609
3300005531|Ga0070738_10036638   All Organisms → cellular organisms → Bacteria   3307
3300005534|Ga0070735_10016401   All Organisms → cellular organisms → Bacteria   5458
3300005534|Ga0070735_10249524   All Organisms → cellular organisms → Bacteria   1076
3300005537|Ga0070730_10012335   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   6941
3300005538|Ga0070731_10185919   All Organisms → cellular organisms → Bacteria   1380
3300005541|Ga0070733_10238307   All Organisms → cellular organisms → Bacteria   1195
3300005542|Ga0070732_10197875   All Organisms → cellular organisms → Bacteria   1201
3300005552|Ga0066701_10374698   All Organisms → cellular organisms → Bacteria   881
3300005554|Ga0066661_10386783   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   856
3300005557|Ga0066704_10055705   All Organisms → cellular organisms → Bacteria   2509
3300005557|Ga0066704_10092763   All Organisms → cellular organisms → Bacteria   1972
3300005561|Ga0066699_10543716   All Organisms → cellular organisms → Bacteria   832
3300005568|Ga0066703_10091295   All Organisms → cellular organisms → Bacteria   1776
3300005568|Ga0066703_10290449   All Organisms → cellular organisms → Bacteria   990
3300005586|Ga0066691_10102365   All Organisms → cellular organisms → Bacteria   1609
3300005602|Ga0070762_11213492   Not Available   522
3300005712|Ga0070764_11054675   All Organisms → cellular organisms → Bacteria   514
3300006173|Ga0070716_100843088   All Organisms → cellular organisms → Bacteria   713
3300006176|Ga0070765_100270809   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   1562
3300006755|Ga0079222_10007720   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales   3725
3300006794|Ga0066658_10013942   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3019
3300006797|Ga0066659_10083078   All Organisms → cellular organisms → Bacteria   2131
3300006797|Ga0066659_10264101   All Organisms → cellular organisms → Bacteria   1296
3300006800|Ga0066660_10030182   All Organisms → cellular organisms → Bacteria   3320
3300006800|Ga0066660_10969932   All Organisms → cellular organisms → Bacteria   686
3300006804|Ga0079221_10001089   All Organisms → cellular organisms → Bacteria   8427
3300006806|Ga0079220_10003500   All Organisms → cellular organisms → Bacteria   5329
3300006806|Ga0079220_10702157   All Organisms → cellular organisms → Bacteria   743
3300006954|Ga0079219_10084820   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   1502
3300010048|Ga0126373_10802176   All Organisms → cellular organisms → Bacteria   1003
3300010048|Ga0126373_12737392   Not Available   550
3300010376|Ga0126381_100117168   All Organisms → cellular organisms → Bacteria   3438
3300010379|Ga0136449_100344200   All Organisms → cellular organisms → Bacteria   2684
3300011120|Ga0150983_11172520   All Organisms → cellular organisms → Bacteria   1118
3300011120|Ga0150983_11659569   All Organisms → cellular organisms → Bacteria   994
3300011120|Ga0150983_12622758   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   894
3300011120|Ga0150983_14633502   All Organisms → cellular organisms → Bacteria   764
3300011120|Ga0150983_16665142   All Organisms → cellular organisms → Bacteria   625
3300011271|Ga0137393_10647425   All Organisms → cellular organisms → Bacteria   906
3300012096|Ga0137389_11356692   Not Available   606
3300012169|Ga0153990_1096318   All Organisms → cellular organisms → Bacteria   662
3300012199|Ga0137383_10019519   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia   4674
3300012199|Ga0137383_10045673   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3118
3300012206|Ga0137380_10029935   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   5018
3300012206|Ga0137380_10479109   All Organisms → cellular organisms → Bacteria   1098
3300012209|Ga0137379_10385608   All Organisms → cellular organisms → Bacteria   1311
3300012211|Ga0137377_10099907   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   2748
3300012349|Ga0137387_10437150   All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11   949
3300012351|Ga0137386_10090009   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   2159
3300012351|Ga0137386_10519403   All Organisms → cellular organisms → Bacteria   858
3300012357|Ga0137384_10009805   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Koribacter → Candidatus Koribacter versatilis   7754
3300012359|Ga0137385_10052166   All Organisms → cellular organisms → Bacteria   3655
3300012361|Ga0137360_10101240   All Organisms → cellular organisms → Bacteria   2204
3300012362|Ga0137361_10618388   All Organisms → cellular organisms → Bacteria   992
3300012927|Ga0137416_10218029   All Organisms → cellular organisms → Bacteria   1541
3300012930|Ga0137407_11009518   Not Available   788
3300014158|Ga0181521_10427390   Not Available   647
3300014162|Ga0181538_10479511   Not Available   656
3300017927|Ga0187824_10000913   All Organisms → cellular organisms → Bacteria   6926
3300017927|Ga0187824_10035214   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   1516
3300017930|Ga0187825_10013244   All Organisms → cellular organisms → Bacteria   2763
3300017936|Ga0187821_10142071   All Organisms → cellular organisms → Bacteria   903
3300017961|Ga0187778_10703891   Not Available   683
3300017994|Ga0187822_10058826   All Organisms → cellular organisms → Bacteria   1097
3300018006|Ga0187804_10322864   Not Available   675
3300018012|Ga0187810_10236222   All Organisms → cellular organisms → Bacteria   748
3300018062|Ga0187784_11239961   Not Available   591
3300018468|Ga0066662_10838480   All Organisms → cellular organisms → Bacteria   896
3300018468|Ga0066662_12107138   Not Available   591
3300018468|Ga0066662_12315574   Not Available   564
3300020579|Ga0210407_10154895   All Organisms → cellular organisms → Bacteria   1767
3300020580|Ga0210403_10155194   All Organisms → cellular organisms → Bacteria   1872
3300020580|Ga0210403_10295029   All Organisms → cellular organisms → Bacteria   1328
3300020581|Ga0210399_10052164   All Organisms → cellular organisms → Bacteria   3287
3300020581|Ga0210399_11047744   All Organisms → cellular organisms → Bacteria   655
3300020582|Ga0210395_10022004   All Organisms → cellular organisms → Bacteria   4690
3300020583|Ga0210401_11563538   Not Available   517
3300021046|Ga0215015_10332109   All Organisms → cellular organisms → Bacteria   1918
3300021046|Ga0215015_10487943   All Organisms → cellular organisms → Bacteria → Acidobacteria   25067
3300021088|Ga0210404_10033262   All Organisms → cellular organisms → Bacteria   2325
3300021168|Ga0210406_11283267   Not Available   528
3300021171|Ga0210405_10028536   All Organisms → cellular organisms → Bacteria   4482
3300021171|Ga0210405_10575313   All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium 13_1_40CM_4_67_11   879
3300021171|Ga0210405_10765009   Not Available   742
3300021178|Ga0210408_10860385   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   707
3300021180|Ga0210396_10072993   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3122
3300021181|Ga0210388_11120685   All Organisms → cellular organisms → Bacteria   671
3300021403|Ga0210397_10347560   All Organisms → cellular organisms → Bacteria   1099
3300021405|Ga0210387_10338995   All Organisms → cellular organisms → Bacteria   1327
3300021407|Ga0210383_11099553   All Organisms → cellular organisms → Bacteria → Acidobacteria   671
3300021420|Ga0210394_10021975   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   5880
3300021420|Ga0210394_10138945   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   2107
3300021420|Ga0210394_10318089   All Organisms → cellular organisms → Bacteria   1366
3300021420|Ga0210394_10838280   Not Available   802
3300021432|Ga0210384_10061029   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3408
3300021432|Ga0210384_10915340   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia → Eubacteriales → Eubacteriales incertae sedis → Clostridiales Family XVII. Incertae Sedis → Sulfobacillus   777
3300021474|Ga0210390_10045970   All Organisms → cellular organisms → Bacteria   3580
3300021476|Ga0187846_10005634   All Organisms → cellular organisms → Bacteria   6348
3300021477|Ga0210398_11457566   Not Available   534
3300021559|Ga0210409_10073924   All Organisms → cellular organisms → Bacteria   3166
3300021559|Ga0210409_10268848   All Organisms → cellular organisms → Bacteria   1541
3300021559|Ga0210409_10281334   All Organisms → cellular organisms → Bacteria   1502
3300021559|Ga0210409_10782523   All Organisms → cellular organisms → Bacteria   826
3300021559|Ga0210409_10826329   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   799
3300021559|Ga0210409_10934727   Not Available   741
3300021559|Ga0210409_11243740   Not Available   620
3300022504|Ga0242642_1007365   All Organisms → cellular organisms → Bacteria   1291
3300022507|Ga0222729_1014831   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   865
3300022529|Ga0242668_1117336   Not Available   557
3300022709|Ga0222756_1094507   Not Available   504
3300022722|Ga0242657_1024231   All Organisms → cellular organisms → Bacteria   1176
3300022724|Ga0242665_10372997   Not Available   516
3300026298|Ga0209236_1022103   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   3533
3300026318|Ga0209471_1103646   All Organisms → cellular organisms → Bacteria   1227
3300026328|Ga0209802_1003569   All Organisms → cellular organisms → Bacteria   10400
3300026335|Ga0209804_1001792   All Organisms → cellular organisms → Bacteria   13565
3300026532|Ga0209160_1274532   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   572
3300026551|Ga0209648_10031803   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   4601
3300026552|Ga0209577_10095766   All Organisms → cellular organisms → Bacteria   2415
3300027565|Ga0209219_1001968   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes   3900
3300027605|Ga0209329_1029569   All Organisms → cellular organisms → Bacteria   1132
3300027706|Ga0209581_1006674   All Organisms → cellular organisms → Bacteria   8360
3300027725|Ga0209178_1008489   All Organisms → cellular organisms → Bacteria   3226
3300027775|Ga0209177_10038848   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   1293
3300027842|Ga0209580_10518993   Not Available   593
3300027965|Ga0209062_1265961   Not Available   592
3300028047|Ga0209526_10125626   All Organisms → cellular organisms → Bacteria   1802
3300028536|Ga0137415_10121182   All Organisms → cellular organisms → Bacteria   2461
3300028906|Ga0308309_10944254   Not Available   748
3300031708|Ga0310686_104543985   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   907
3300031715|Ga0307476_10154402   All Organisms → cellular organisms → Bacteria   1651
3300031754|Ga0307475_10008876   All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae   6605
3300031754|Ga0307475_10036320   All Organisms → cellular organisms → Bacteria   3621
3300031823|Ga0307478_10562058   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   953
3300031823|Ga0307478_10817137   Not Available   780
3300031962|Ga0307479_10057755   All Organisms → cellular organisms → Bacteria   3738
3300031962|Ga0307479_11017528   All Organisms → cellular organisms → Bacteria   797
3300031962|Ga0307479_11019127   All Organisms → cellular organisms → Bacteria   796
3300032160|Ga0311301_11056368   All Organisms → cellular organisms → Bacteria → Acidobacteria   1064
3300032180|Ga0307471_100067378   All Organisms → cellular organisms → Bacteria   3066
3300032180|Ga0307471_100341105   All Organisms → cellular organisms → Bacteria   1604
3300032180|Ga0307471_100986546   All Organisms → cellular organisms → Bacteria   1009
3300032180|Ga0307471_102900741   All Organisms → cellular organisms → Bacteria   609
3300032770|Ga0335085_10000021   All Organisms → cellular organisms → Bacteria   588913
3300032783|Ga0335079_10035815   All Organisms → cellular organisms → Bacteria   5699
3300032805|Ga0335078_10394975   All Organisms → cellular organisms → Bacteria   1818
3300032805|Ga0335078_11787338   All Organisms → cellular organisms → Bacteria → Acidobacteria   669
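
Each scaffold identifier in the table above appears to consist of a 10-digit number followed by `|` and a scaffold name, and the leading number matches the Taxon OID column of the Associated Samples table. The sketch below splits an identifier on that assumption and counts scaffolds per sample; the function name is hypothetical, and only three rows from the table are used as input.

```python
from collections import Counter
from typing import Tuple

def split_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split 'TaxonOID|ScaffoldName' into its two parts (assumed layout)."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name

# Three identifiers copied from the Associated Scaffolds table.
examples = [
    "3300005174|Ga0066680_10004289",
    "3300005174|Ga0066680_10707236",
    "3300005176|Ga0066679_10019485",
]

# Count how many family scaffolds come from each sample (Taxon OID).
per_sample = Counter(split_scaffold_id(s)[0] for s in examples)
print(per_sample)
```

Grouping by Taxon OID in this way is one route to explaining why 157 scaffolds map to only 116 associated samples: some samples contribute more than one scaffold.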




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   24.84%
Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil   14.01%
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil   11.46%
Surface Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil   7.64%
Hardwood Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil   7.64%
Forest Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil   7.01%
Freshwater Sediment   Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment   4.46%
Agricultural Soil   Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil   4.46%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil   3.82%
Soil   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil   2.55%
Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil   2.55%
Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil   1.91%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil   1.91%
Bog   Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog   1.27%
Peatlands Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil   1.27%
Tropical Peatland   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland   1.27%
Corn, Switchgrass And Miscanthus Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere   0.64%
Biofilm   Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm   0.64%
Attine Ant Fungus Gardens   Host-Associated → Fungi → Mycelium → Unclassified → Unclassified → Attine Ant Fungus Gardens   0.64%




Associated Samples

Taxon OID   Sample Name   Habitat Type
3300001593   Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2   Environmental
3300001867   Texas A ecozone_OM1H0_M2 (Combined assembly for Texas A ecozone Site metagenome samples, ASSEMBLY_DATE=20130705)   Environmental
3300002245   Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)   Environmental
3300002558   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm   Environmental
3300005167   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121   Environmental
3300005174   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129   Environmental
3300005176   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128   Environmental
3300005526   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1   Environmental
3300005529   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1   Environmental
3300005531   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2   Environmental
3300005534   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1   Environmental
3300005537   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1   Environmental
3300005538   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1   Environmental
3300005541   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen05_05102014_R1   Environmental
3300005542   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1   Environmental
3300005552   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150   Environmental
3300005554   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110   Environmental
3300005557   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153   Environmental
3300005561   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148   Environmental
3300005568   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152   Environmental
3300005586   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140   Environmental
3300005602   Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2   Environmental
3300005712   Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4   Environmental
3300006173   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-2 metaG   Environmental
3300006176   Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5   Environmental
3300006755   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter   Environmental
3300006794   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107   Environmental
3300006797   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108   Environmental
3300006800   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109   Environmental
3300006804   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200   Environmental
3300006806   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100   Environmental
3300006954   Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control   Environmental
3300010048   Tropical forest soil microbial communities from Panama - MetaG Plot_11   Environmental
3300010376   Tropical forest soil microbial communities from Panama - MetaG Plot_28   Environmental
3300010379   Sb_50d combined assembly   Environmental
3300011120   Combined assembly of Microbial Forest Soil metaT   Environmental
3300011271   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG   Environmental
3300012096   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG   Environmental
3300012169   Attine ant fungus gardens microbial communities from North Carolina, USA - TSNC074 MetaG   Host-Associated
3300012199   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG   Environmental
3300012206   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG   Environmental
3300012209   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG   Environmental
3300012211   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG   Environmental
3300012349   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG   Environmental
3300012351   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG   Environmental
3300012357   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG   Environmental
3300012359   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG   Environmental
3300012361   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG   Environmental
3300012362   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG   Environmental
3300012927   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)   Environmental
3300012930   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)   Environmental
3300014158   Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaG   Environmental
3300014162   Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin23_30_metaG   Environmental
3300017927   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4   Environmental
3300017930   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5   Environmental
3300017936   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1   Environmental
3300017961   Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MG   Environmental
3300017994   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2   Environmental
3300018006   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4   Environmental
3300018012   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5   Environmental
3300018062   Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG   Environmental
3300018468   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111   Environmental
3300020579   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-M   Environmental
3300020580   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M   Environmental
3300020581   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-M   Environmental
3300020582   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-O   Environmental
3300020583   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M   Environmental
3300021046   Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth   Environmental
3300021088   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M   Environmental
3300021168   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M   Environmental
3300021171   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-M   Environmental
3300021178   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-M   Environmental
3300021180   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O   Environmental
3300021181   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-O   Environmental
3300021403   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-O   Environmental
3300021405   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-O   Environmental
3300021407   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-O   Environmental
3300021420   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M   Environmental
3300021432   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M   Environmental
3300021474   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O   Environmental
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021477Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-OEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022504Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-2-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022507Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-27-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022529Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022709Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027565Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027605Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027706Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027965Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032160Sb_50d combined assembly (MetaSPAdes)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300032805Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2EnvironmentalOpen in IMG/M


Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12635J15846_1002791353300001593Forest SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRQPSWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
JGI12627J18819_1014379623300001867Forest SoilMASPGNLFRVLNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
JGIcombinedJ26739_10006890333300002245Forest SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRLAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
JGI25385J37094_1010177823300002558Grasslands SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0066672_1006583323300005167SoilLASSLHLFRVVNEIVFMLVGALLVLFALTGRYLFNPRRPEWIALSIVLILWGLGTWRRARFSRSAAERLVTKIGGGSLALGGLVMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066680_1000428943300005174SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSAVLILWGLGTWRRARLSRVAADRLVAKIGGGSLSLAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066680_1070723613300005174SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0066679_1001948523300005176SoilMASPSHLFRVVNEFVFMLVGALLVIFALTGRYLFNPRQPGWISLSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0073909_1058795823300005526Surface SoilTQLFRLVNEFVFILVGGLLVLFALTGRYLFNPRQPAWLVLSTVMILWGLRAWRRSRLSVLSADRLVGKIGGGSLALAGLILLSLAWAPFRWAGWLLLAAGAVFVLRGLASAAILARAASVRWPSAK*
Ga0070741_100007961003300005529Surface SoilMSNPGHMFRVMNEFVFMLVGALLALFALTGRYLFNPRRPAWIALSVVLVLWGVGTWSRAHHSRLANERLVTKIGGGSLIVAGLIMLSLAWAPFRYAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0070738_1003663813300005531Surface SoilMANPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSALLIVWGLGTWRRAPRFADKSDRLVTKIAGGSLSAAGLVMLSLAWAPFEYAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0070735_1001640123300005534Surface SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLVLWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMARSA*
Ga0070735_1024952423300005534Surface SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPSWIALSVVLILWGLGTWRRAHLSRVAADRLVTKIGGGSLAVAGLIMLSLAWAPRWAGWLLLATGGVFVLRGLVSAAIMARSA*
Ga0070730_1001233543300005537Surface SoilMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0070731_1018591923300005538Surface SoilMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA*
Ga0070733_1023830733300005541Surface SoilINEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARASRAAAERLVTKIGGGSLTAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0070732_1019787533300005542Surface SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066701_1037469813300005552SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIF
Ga0066661_1038678323300005554SoilFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSIVLILWGLGTWHRARLSRGAAERLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066704_1005570543300005557SoilMASPIHLFRVVNEFVFMLVGALLVLFALTGRYLSNPRRPGWIGLSVVLILWGLGTWRRAHLSRIAADRLVTKIGGGSLALGGLIMLSLAWAPFRWAEWLLLATGGVFVLRGLVSAAIMAR
Ga0066704_1009276323300005557SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAKIGGGSLSLAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066699_1054371613300005561SoilSRAKFPSSRATPTIARGRRSSCSTCASNACNAAMASPSHLFRVVNEFVFMLVGALLVIFALTGRYLFNPRQPGWIGLSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066703_1009129533300005568SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLVLWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0066703_1029044913300005568SoilMASPIHLFRVVNEFVFMLVGALLVLFALTGRYLSNPRRPGWIGLSVVLILWGLGTWRRAHLSRIAADRLVTKIGGGSLALGGLIMLSLAWAPFRWAEWLLLATGGVFVLRGLVSAAIMARSV*
Ga0066691_1010236533300005586SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALAVVLVLWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0070762_1121349213300005602SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATG
Ga0070764_1105467523300005712SoilEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA*
Ga0070716_10084308823300006173Corn, Switchgrass And Miscanthus RhizosphereMPSPGNLFRIVNEFVFMMVGALLVLFALTGRYLFNPRRPGWIALSIVIVLWGVGTWSRARADRVPAERLVTRIGGGSLVAAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAILARSA*
Ga0070765_10027080933300006176SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0079222_1000772023300006755Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRRARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGAVFVLRGLVSAAIMARSA*
Ga0066658_1001394233300006794SoilLASSLHLFRVVNEIVFMLVGALLVLFALTGRYLFNPRRPGWIALSIVLILWGLGTWHRARLSRGAAERLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066659_1008307833300006797SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLVLWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0066659_1026410113300006797SoilLASSLHLFRVVNEIVFMLVGALLVLFALTGRYLFNPRRPEWIALSIVLILWGLGTWRRARFSRSAAERLVTKIGGGSLALGGLVMLSLAWAPFRWAGWLL
Ga0066660_1003018213300006800SoilRATPTIARGRRSSCSTCASNACNAAMASPSHLFRVVNEFVFMLVGALLVVFALTGRYLFNPRQPGWISLSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0066660_1096993213300006800SoilMLVGALLVLFALTGRYLFNPRRPEWIALSIVLILWGLGTWRRARFSRSAAERLVTKIGGGSLALGGLVMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0079221_1000108983300006804Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRSARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGAVFVLRGLVSAAIMARSA*
Ga0079220_1000350063300006806Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRRARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATG
Ga0079220_1070215723300006806Agricultural SoilVVPSPGNLFRLVNEFVFMMVGALLVLFALTGRYLFNPRRPGWIALSIVIALWGVGTWSRARASRVAAERLVTKIGGGSLIASGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA*
Ga0079219_1008482033300006954Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRRARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGA
Ga0126373_1080217623300010048Tropical Forest SoilMPNPGHMFRVLNEFVFMLVGALLALFALTRVLFFNPRQPAWIALSVVLVLWGVGTWARARRSRLAAERLVTKIGGGSLIAAGLIMMSLAWAPFRYAGWLLLATGGVFVLRGLVSAAIMARSV*
Ga0126373_1273739223300010048Tropical Forest SoilMPNPGHMFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPAWIGLSVVLVLWGVGTWSRARHDRIAAERLVTKIGGGSLVAAGLIMLSLVWAPFRYAGWLLLATGGVFVLRGLVSAAIMARSA*
Ga0126381_10011716823300010376Tropical Forest SoilMPNPGHMFRVLNEFVFMLVGALLELFALTRVLFFNPRQPAWIALSVVLVLWGVGTWARARRSRLAAERLVTKIGGGSLIAAGLIMMSLAWAPFRYAGWLLLATGGVFVLRGLVSAAIMARSV*
Ga0136449_10034420043300010379Peatlands SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWILLSAVLVLWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMVRSA*
Ga0150983_1117252033300011120Forest SoilMASPSQLFRVMNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIAGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA*
Ga0150983_1165956923300011120Forest SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLILWGVGTWRRARRSVDNADRLVTKIAGGSLILAGIIMLSLAWAPFEWAGWMLLATGGIF
Ga0150983_1262275823300011120Forest SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0150983_1463350223300011120Forest SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSIVLILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0150983_1666514223300011120Forest SoilLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137393_1064742523300011271Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLVLWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137389_1135669223300012096Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGPYLFNPRRPGWIALSAVLILWGLGTWRRARLSRVAADRLVAKIGGGSLAAGGLIMLSLAWAPVRWAGWLLLATGSIFVLRGLVSAAIMVRSA*
Ga0153990_109631823300012169Attine Ant Fungus GardensNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSIVLILWGLGTWSRARLSREAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMVRSA*
Ga0137383_1001951943300012199Vadose Zone SoilVASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137383_1004567333300012199Vadose Zone SoilMASPTHLFRVVNEFVFMLVGALLVLFALVGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSGIAVDRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137380_1002993543300012206Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLILWGLGTWSRARRSRIAGDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137380_1047910933300012206Vadose Zone SoilRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLVLWGLGTWSRARRSGIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137379_1038560823300012209Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLVLWGLGTWSRARRSGIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137377_1009990733300012211Vadose Zone SoilVASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLVLWGLGTWSRARRSGIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137387_1043715023300012349Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLILWGLGTWSRARRSGIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137386_1009000923300012351Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137386_1051940323300012351Vadose Zone SoilMASPTHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSGIAVDRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*
Ga0137384_1000980573300012357Vadose Zone SoilMASPTHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLVLWGLGTWSRARRSGIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137385_1005216653300012359Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYFFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLR
Ga0137360_1010124023300012361Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGPYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137361_1061838823300012362Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGPYLFNPRRPGWIALSVVLILWRLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA*
Ga0137416_1021802923300012927Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLPATGSIFVLRGLVSAAIMARSA*
Ga0137407_1100951823300012930Vadose Zone SoilMASPTQLFRLVNEFVFILVGGLLVLFALTGRYLFNPRQPAWLVLSAVMILWGLSAWRRSRLSILSADRLVGKIGGGSLALAGLILLSLAWAPFRWAGWLLLAAGAIFVLRGLASAAILARAA*
Ga0181521_1042739023300014158BogMASPSHLFRVVNEFVFMLAGALLALFALTRPYLLNARQPGWIALSAVLVFWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMVRSA*
Ga0181538_1047951123300014162BogMVSPSHLFRVVNEFVFMLAGALLALFALTRPYLLNARQPGWIALSAVLVFWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWASFQWAGWMLLATGGIFVLRGLVSAAIMVRSA*
Ga0187824_1000091343300017927Freshwater SedimentMASPSHTFRVVNEFVFMMVGALLVFFALTGRYLFNPRRPSWIALSVVLILWGLATWSRARVSRLPAERLVAKIAGGSLTAAGLIMLSLAWAPFRWAGWLLVATGSIFVLRGLVSAAIMARSA
Ga0187824_1003521433300017927Freshwater SedimentMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0187825_1001324423300017930Freshwater SedimentMLVGGLLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0187821_1014207113300017936Freshwater SedimentMASPSHLFRLVNEFVFMLAGALLALFALTRPSLLNARQPSWIALSALLIVWGMGTWWRARRSGDNADRLVTRIAGGSLILAGLIMISLAWAPFQWAGWMLLATGGIFVLRGLISAAIMARSA
Ga0187778_1070389113300017961Tropical PeatlandMASPSHLFRVVNEFVFMLAGALLALFALTRPYLPNARQPGWIALSALLILWGLGTWWRAGRSTGSADRLVSKIAGASLTLAGLIMISLAWAPFEWAGWLLLATGGIFVLRGLISAAIMARSA
Ga0187822_1005882633300017994Freshwater SedimentRGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0187804_1032286413300018006Freshwater SedimentMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIVLSAVLVLWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGMIMLSLAWAPFQWAGWML
Ga0187810_1023622223300018012Freshwater SedimentMASPSHLFRVVNELVFMLVGALLVLFALTGRYLFNPRRPAWIALSVVLILWGLGTWRQAHLSRVAADRLVTKIGGGSLAVAGLIMLSLAWAPFRWAGWLLLAAGGIFVLRGLVSAAIMARSA
Ga0187784_1123996123300018062Tropical PeatlandMPNPGRLFRVINEFVFMLVGALLVMFALTGRYLFNPRRPGWIALSLVLVLWGLGTWSRARRSALKAERLVTKIAGGSLAAAGLIMLSLAWAPFQYAGWLLLATGSIFVLRGLISAAIMARSA
Ga0066662_1083848023300018468Grasslands SoilMASPIHLFRVVNEFVFMLVGALLVLFALTGRYLSNPRRPGWIGLSVVLILWGLGTWRRAHLSRIAADRLVTKIGGGSLALGGLIMLSLAWAPFRWAEWLLLATGGVFVLRGLVSAAIMARSV
Ga0066662_1210713813300018468Grasslands SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAKIGGGSLSLAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0066662_1231557423300018468Grasslands SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLYLAWPPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0210407_1015489533300020579SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210403_1015519443300020580SoilNEFVFMLVGVLLVLFALTGRYLFNPRRPGWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0210403_1029502923300020580SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210399_1005216423300020581SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210399_1104774423300020581SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210395_1002200443300020582SoilMASPGNLFRVVNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210401_1156353813300020583SoilNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0215015_1033210933300021046SoilMASPDHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRAELRRVAADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSD
Ga0215015_1048794333300021046SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRAARSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210404_1003326213300021088SoilSPGSLFRILNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRLAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMVRSA
Ga0210406_1128326713300021168SoilMASPGNLFRVMNEFVFMLVGVLLVLFALTGRYLFNPRRPGWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVL
Ga0210405_1002853653300021171SoilMASPGSLFRILNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210405_1057531323300021171SoilVASPSQLFRVVNEFVFMLAGALLALFALTSRYLFNPRRPGWIGLSVLLIVWGLVTWRRAQLSGVPAERLVAKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATG
Ga0210405_1076500913300021171SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSA
Ga0210408_1086038523300021178SoilMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210396_1007299353300021180SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWMVLSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210388_1112068513300021181SoilGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210397_1034756023300021403SoilMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210387_1033899513300021405SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLILWGVGTWRRARRSVDNADRLVTKIAGGSLILAGIIMLSLAWAPFEWAGWMLLATGGIFVLRGLVSAAIMA
Ga0210383_1109955313300021407SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLILWGVGTWRRARRSVDNADRLVTKIAGGSLILAGIIMLSLAWAPFEWAGWMLLATGGIFVLRGLVSAAIMARSA
Ga0210394_1002197543300021420SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRQPSWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210394_1013894533300021420SoilMASPGNLFRVVNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRAHLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210394_1031808923300021420SoilMASPSQLFRVMNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPVDRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0210394_1083828023300021420SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLILWGVGTWRRARRSVDNADRLVTKIAGGSLILAGLIMLSLAWAPFEWAGWMLLATGGIFVLRGLVSAAIMARSE
Ga0210384_1006102913300021432SoilRVVNEFVFMLAGALLALFALTSRYLFNPRRPGWIGLSVLLIVWGLVTWRRAQLSGVPAERLVAKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0210384_1091534013300021432SoilLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPLRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0210390_1004597043300021474SoilMASPGNLFRVVNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMVRSA
Ga0187846_1000563453300021476BiofilmMASPSQLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLASAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0210398_1145756613300021477SoilMASPSQLFRVMNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIAGGSLALGGLIMLSLAWAPFRWAGWLLLA
Ga0210409_1007392433300021559SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLVLWGLGTWRRARRSADNAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMARSA
Ga0210409_1026884833300021559SoilMASPSQLFRVVNEFVFMLAGALLALFALTSRYLFNPRRPGWIGLSLLLIVWGLVTWRRAQLSGVPAERLVAKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0210409_1028133423300021559SoilMASPGNLFRVVNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRAYVSRVAADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARST
Ga0210409_1078252323300021559SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0210409_1082632923300021559SoilGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIRGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210409_1093472723300021559SoilMWRTKFPSCLATRAIARARPSLCSMSASNASETPMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIGLSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0210409_1124374023300021559SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWMVLSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMVRSA
Ga0242642_100736523300022504SoilMLVGGLLVLFALTGRYLFNPRRPGWIALSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPLRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0222729_101483113300022507SoilMLERGLLFLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMVRSA
Ga0242668_111733613300022529SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAALILWGVGTWRRARRSVDNADRLVTKIASGSLILAGLIMLSLAWAPFEWAGWMLLATGGIFVLRGLVSAAIMARSA
Ga0222756_109450713300022709SoilMWRTKFPSCLATRVIARGRPSLCSMSASNASERQWPAPNLFRVVNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSKVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVS
Ga0242657_102423133300022722SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRQPSWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0242665_1037299723300022724SoilMMVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGRLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0209236_102210333300026298Grasslands SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0209471_110364613300026318SoilNEFVFMLVGALLVVFALTGRYLFNPRQPGWISLSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0209802_100356963300026328SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSAVLILWGLGTWRRARLSRVAADRLVAKIGGGSLSLAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0209804_100179283300026335SoilLASSLHLFRVVNEIVFMLVGALLVLFALTGRYLFNPRRPEWIALSIVLILWGLGTWRRARFSRSAAERLVTKIGGGSLALGGLVMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0209160_127453223300026532SoilGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0209648_1003180363300026551Grasslands SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRAQLSRVAADRLVTKIGGGSLALGGLIMLSLAWTPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0209577_1009576653300026552SoilHLFRVVNEFVFMLVGALLVIFALTGRYLFNPRQPGWIGLSVVLILWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0209219_100196823300027565Forest SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRQPSWIVLSIVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0209329_102956923300027605Forest SoilMASPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRLAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0209581_100667483300027706Surface SoilMSNPGHMFRVMNEFVFMLVGALLALFALTGRYLFNPRRPAWIALSVVLVLWGVGTWSRAHHSRLANERLVTKIGGGSLIVAGLIMLSLAWAPFRYAGWLLLATGGIFVLRGLVSAAIMARSA
Ga0209178_100848933300027725Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRRARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGAVFVLRGLVSAAIMARSA
Ga0209177_1003884833300027775Agricultural SoilMASPGHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGFGTWRRARASRFPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWL
Ga0209580_1051899313300027842Surface SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFV
Ga0209062_126596123300027965Surface SoilMANPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSALLIVWGLGTWRRAPRFADKSDRLVTKIAGGSLSAAGLVMLSLAWAPFEYAGWLLLATGSIFVLRGLVSAAIMARSA
Ga0209526_1012562623300028047Forest SoilMASPGNLFRVMNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRLAAERLVTKIGGGSLIAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0137415_1012118223300028536Vadose Zone SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWSRARRSRIAVDRLVTKIGGGSLALAGLIMLSLAWAPFRWAGWLLPATGSIFVLRGLVSAAIMARSA
Ga0308309_1094425413300028906SoilMASPSQLFRVMNEFVFMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIAGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0310686_10454398523300031708SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLILWGVGTWRRARRSVDNADRLVTKIAGGSLILAGLIMLSLAWAPFEWAGWMLLATGGIFVLRGLVSAAIMARSA
Ga0307476_1015440223300031715Hardwood Forest SoilMLVGALLALFALTGRYLFNPRRPGWIALSVVLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0307475_1000887633300031754Hardwood Forest SoilMASPSQLFRVVNEFVFMLAGALLALFALTSRYLFNPRRPGWIGLSVLLIVWGLVTWRRAQLSGVPAERLVAKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0307475_1003632023300031754Hardwood Forest SoilMTSPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFKPRRPGWIALSVVIIHWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMVRSA
Ga0307478_1056205813300031823Hardwood Forest SoilVLFALTGRYLFNPRRPGWIGLSVLLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0307478_1081713713300031823Hardwood Forest SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIVLSIVVVLWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0307479_1005775533300031962Hardwood Forest SoilMASPSQLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVLLIVWGLGTWRRAQLSRVPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0307479_1101752823300031962Hardwood Forest SoilMASPGNLFRVLNEFVFMMVGGLLVLFALTGRYLFNPRRPGWIVLSIVVVLWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVL
Ga0307479_1101912713300031962Hardwood Forest SoilMTSPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVIILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMVRSA
Ga0311301_1105636833300032160Peatlands SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWILLSAVLVLWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMVRSA
Ga0307471_10006737853300032180Hardwood Forest SoilMASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIGLSVVLILWGLGTWRRAQLSRIPADRLVTKIGGGSLALGGLIMLSLAWAPFRWAGWLLLATGGVFVLRGLVSAAIMARSA
Ga0307471_10034110533300032180Hardwood Forest SoilMASPGNLFRVVNELVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVVILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA
Ga0307471_10098654623300032180Hardwood Forest SoilMTSPGNLFRVVNEFVFMLVGGLLVLFALTGRYLFNPRRPGWIALSVVIILWGLGTWSRARLSRVAAERLVTKIGGGSLVAGGLIMLSLAWAPFRWAGWLLLATGGVFVIRGLVSAAIMVRSA
Ga0307471_10290074123300032180Hardwood Forest SoilQLFRMMNEFVFILVGGLLALFALTGRYLFNPRQPAWLVLSAVMILWGLRTWRQSRLSILGADRLVGKIGGGSLAVSGLLMLSLAWAPFRWAGWLLLATGAVFVLRGLASAAILARAA
Ga0335085_100000211973300032770SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPYLVNARQPGWIVLSAVLVLWGLGTWRRARRSVDHAERLVTKIAGGSLVLAGLIMLSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMVRSA
Ga0335079_1003581553300032783SoilMASPSHLFRVVNEFVFMLAGALLALFALTRPQLLNARQPAWIVLSAVLILWGLGTWWRARRSVGNAERLVTKIGGGSLTLAGLIMISLAWAPFEWAGWLLLATGGVFVLRGLVSAAIMVRAA
Ga0335078_1039497533300032805SoilMASPSHLFRVINEFVFMLAGALLALFALTRPYLVNARQPGWIALSAVLVVWGLGTWWRAGRSADHADRLVTKIAGGSLILAGVIMVSLAWAPFQWAGWMLLATGGIFVLRGLVSAAIMVRSA
Ga0335078_1178733813300032805SoilGYLFRVLNEFVFMLAGALLALFALTRPSLLNARQPGWIALSALVVVWGLGTWRRAQRSADRADRLVTKIAGGSLTVAGLVMLSLAWAPFTWAGWLLLATGGIFILRGLVSAAIMARAARS
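
The rows above run the four columns named in the header (Protein ID, Sample Taxon ID, Habitat, Sequence) together without separators. The following is a minimal sketch of one way to split such a row back into fields. It is not part of NMPFamsDB; the names `ROW` and `parse_row` are hypothetical. It assumes that every Sample Taxon ID is a 10-digit number beginning with `3300` (true of the rows listed here), that habitat names contain no digits and end in a lowercase letter, and that the sequence is the trailing run of uppercase letters, optionally followed by `*`.

```python
import re

# Hypothetical pattern for one fused row of the Family Sequences table.
# Assumptions (not confirmed by the database): the taxon ID is the last
# 10 digits before the habitat and starts with "3300"; the habitat has
# no digits; the sequence is uppercase letters with an optional "*".
ROW = re.compile(
    r"^(?P<protein_id>\S+?)"      # e.g. JGI25385J37094_101017782
    r"(?P<taxon_id>3300\d{6})"    # e.g. 3300002558
    r"(?P<habitat>\D+?)"          # e.g. Grasslands Soil
    r"(?P<sequence>[A-Z]+)"       # amino-acid letters
    r"(?P<star>\*?)$"             # trailing "*" if present
)


def parse_row(line):
    """Split one fused table row into a dict, or return None if it does not fit."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Keep the trailing "*" as a flag instead of as part of the sequence.
    fields["has_terminal_star"] = fields.pop("star") == "*"
    return fields


example = (
    "JGI25385J37094_1010177823300002558Grasslands Soil"
    "MASPSHLFRVVNEFVFMLVGALLVLFALTGRYLFNPRRPGWIALSVVLILWGLGTWRRARLSRVAADRLVAK"
    "IGGGSLALAGLIMLSLAWAPFRWAGWLLLATGGIFVLRGLVSAAIMVRSA*"
)
record = parse_row(example)
```

Rows that do not match the assumed layout return `None` rather than a guess, so they can be reviewed by hand.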




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice