NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F030006


Overview

Basic Information
Family ID F030006
Family Type Metagenome / Metatranscriptome
Number of Sequences 186
Average Sequence Length 90 residues
Representative Sequence MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR
Number of Associated Samples 162
Number of Associated Scaffolds 186

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 72.93 %
% of genes near scaffold ends (potentially truncated) 27.42 %
% of genes from short scaffolds (< 2000 bps) 73.12 %
Associated GOLD sequencing projects 154
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (89.247 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (18.817 % of family members)
Environment Ontology (ENVO) Unclassified (33.871 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline) (34.409 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 31.67%, β-sheet: 11.67%, Coil/Unstructured: 56.67%

Predicted 3D Structure

pTM-score: 0.70

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
a.4.5.34: SCF ubiquitin ligase complex WHB domain | d1ldda_ | 1ldd | 0.64398
a.4.5.0: automated matches | d2xiga_ | 2xig | 0.6379
a.4.5.0: automated matches | d3mwma_ | 3mwm | 0.6379
a.4.5.0: automated matches | d5nbca_ | 5nbc | 0.63682
a.4.5.14: Forkhead DNA-binding domain | d1vtnc1 | 1vtn | 0.63594



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 186 Family Scaffolds
PF16326 | ABC_tran_CTD | 13.98
PF07883 | Cupin_2 | 13.44
PF14833 | NAD_binding_11 | 8.60
PF08240 | ADH_N | 2.69
PF01177 | Asp_Glu_race | 2.15
PF00313 | CSD | 1.08
PF00005 | ABC_tran | 1.08
PF04392 | ABC_sub_bind | 0.54
PF12848 | ABC_tran_Xtn | 0.54
PF11249 | DUF3047 | 0.54
PF00589 | Phage_integrase | 0.54
PF10282 | Lactonase | 0.54
PF03446 | NAD_binding_2 | 0.54
PF00561 | Abhydrolase_1 | 0.54
PF08299 | Bac_DnaA_C | 0.54
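
The percentages in this table are frequencies over the 186 family scaffolds, so each one can be converted back to an approximate scaffold count. The short Python sketch below does that for a few rows copied from the table. The function name `scaffold_count` is illustrative, and the counts are an inference from rounded percentages, not values reported by the database.

```python
FAMILY_SCAFFOLDS = 186  # "Number of Associated Scaffolds" from the Overview

# A few rows copied from the Pfam table: (Pfam ID, name, % frequency).
pfam_rows = [
    ("PF16326", "ABC_tran_CTD", 13.98),
    ("PF07883", "Cupin_2", 13.44),
    ("PF14833", "NAD_binding_11", 8.60),
    ("PF08240", "ADH_N", 2.69),
    ("PF00313", "CSD", 1.08),
    ("PF08299", "Bac_DnaA_C", 0.54),
]


def scaffold_count(percent, total=FAMILY_SCAFFOLDS):
    """Convert a rounded percentage to the nearest whole scaffold count."""
    return round(percent / 100 * total)


# Map each Pfam ID to its inferred number of neighboring scaffolds.
counts = {pfam_id: scaffold_count(pct) for pfam_id, _name, pct in pfam_rows}

for pfam_id, name, pct in pfam_rows:
    print(f"{pfam_id} {name}: {pct:.2f} % ~ {counts[pfam_id]} scaffolds")
```

Read this way, the lowest frequency in the table (0.54 %) corresponds to a domain seen next to a single family scaffold.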

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 186 Family Scaffolds
COG0593 | Chromosomal replication initiation ATPase DnaA | Replication, recombination and repair [L] | 0.54
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 0.54



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 89.25 %
Unclassified | root | N/A | 10.75 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000891|JGI10214J12806_10538818 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4340
3300004019|Ga0055439_10007155 | All Organisms → cellular organisms → Bacteria | 2362
3300004058|Ga0055498_10021484 | All Organisms → cellular organisms → Bacteria | 962
3300004062|Ga0055500_10002235 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2641
3300004156|Ga0062589_100103058 | All Organisms → cellular organisms → Bacteria | 1823
3300004463|Ga0063356_100063264 | All Organisms → cellular organisms → Bacteria | 3805
3300004463|Ga0063356_101298590 | All Organisms → cellular organisms → Bacteria | 1064
3300004480|Ga0062592_100221349 | All Organisms → cellular organisms → Bacteria | 1357
3300004643|Ga0062591_100112858 | All Organisms → cellular organisms → Bacteria | 1794
3300005295|Ga0065707_10212508 | All Organisms → cellular organisms → Bacteria | 1241
3300005295|Ga0065707_10874519 | All Organisms → cellular organisms → Bacteria | 574
3300005336|Ga0070680_100240780 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1529
3300005336|Ga0070680_101219320 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 651
3300005440|Ga0070705_100374642 | All Organisms → cellular organisms → Bacteria | 1046
3300005445|Ga0070708_100013993 | All Organisms → cellular organisms → Bacteria | 6591
3300005467|Ga0070706_100528381 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1097
3300005518|Ga0070699_100182137 | All Organisms → cellular organisms → Bacteria | 1864
3300005530|Ga0070679_101779575 | All Organisms → cellular organisms → Bacteria | 564
3300005546|Ga0070696_100251780 | All Organisms → cellular organisms → Bacteria | 1336
3300005547|Ga0070693_100061861 | All Organisms → cellular organisms → Bacteria | 2178
3300005617|Ga0068859_102667580 | All Organisms → cellular organisms → Bacteria | 549
3300005844|Ga0068862_100148379 | All Organisms → cellular organisms → Bacteria | 2086
3300005937|Ga0081455_10079124 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 2700
3300006049|Ga0075417_10251483 | All Organisms → cellular organisms → Bacteria | 847
3300006845|Ga0075421_100695749 | All Organisms → cellular organisms → Bacteria | 1182
3300007255|Ga0099791_10260695 | All Organisms → cellular organisms → Bacteria | 823
3300007255|Ga0099791_10656019 | All Organisms → cellular organisms → Bacteria | 515
3300007265|Ga0099794_10283140 | All Organisms → cellular organisms → Bacteria | 857
3300009053|Ga0105095_10671139 | All Organisms → cellular organisms → Bacteria | 578
3300009078|Ga0105106_10822166 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 662
3300009088|Ga0099830_10934871 | Not Available | 717
3300009089|Ga0099828_11837116 | Not Available | 532
3300009147|Ga0114129_10040892 | All Organisms → cellular organisms → Bacteria | 6534
3300009157|Ga0105092_10381358 | All Organisms → cellular organisms → Bacteria | 801
3300009171|Ga0105101_10311142 | All Organisms → cellular organisms → Bacteria | 762
3300009795|Ga0105059_1005430 | All Organisms → cellular organisms → Bacteria | 1120
3300009803|Ga0105065_1023575 | All Organisms → cellular organisms → Bacteria | 745
3300009806|Ga0105081_1006412 | All Organisms → cellular organisms → Bacteria | 1219
3300009812|Ga0105067_1068196 | Not Available | 595
3300009820|Ga0105085_1043685 | All Organisms → cellular organisms → Bacteria | 804
3300010359|Ga0126376_10097023 | Not Available | 2237
3300010366|Ga0126379_10971611 | Not Available | 953
3300010391|Ga0136847_11814901 | All Organisms → cellular organisms → Bacteria | 684
3300010400|Ga0134122_10700717 | All Organisms → cellular organisms → Bacteria | 952
3300010403|Ga0134123_10049238 | All Organisms → cellular organisms → Bacteria | 3163
3300011119|Ga0105246_11069353 | All Organisms → cellular organisms → Bacteria | 735
3300011406|Ga0137454_1031169 | All Organisms → cellular organisms → Bacteria | 778
3300011406|Ga0137454_1041405 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 709
3300011436|Ga0137458_1070054 | All Organisms → cellular organisms → Bacteria | 970
3300012034|Ga0137453_1110565 | Not Available | 545
3300012096|Ga0137389_11135801 | All Organisms → cellular organisms → Bacteria | 669
3300012203|Ga0137399_10486182 | All Organisms → cellular organisms → Bacteria | 1035
3300012225|Ga0137434_1017582 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 906
3300012355|Ga0137369_10765296 | All Organisms → cellular organisms → Bacteria | 661
3300012360|Ga0137375_10089192 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3184
3300012685|Ga0137397_10059994 | All Organisms → cellular organisms → Bacteria | 2743
3300012925|Ga0137419_11114544 | All Organisms → cellular organisms → Bacteria | 658
3300012929|Ga0137404_10078980 | All Organisms → cellular organisms → Bacteria | 2604
3300012929|Ga0137404_11565762 | All Organisms → cellular organisms → Bacteria | 611
3300013308|Ga0157375_11624508 | All Organisms → cellular organisms → Bacteria | 764
3300014873|Ga0180066_1001335 | All Organisms → cellular organisms → Bacteria | 3137
3300014877|Ga0180074_1030442 | All Organisms → cellular organisms → Bacteria | 1091
3300014881|Ga0180094_1093592 | All Organisms → cellular organisms → Bacteria | 686
3300014884|Ga0180104_1039994 | All Organisms → cellular organisms → Bacteria | 1233
3300014885|Ga0180063_1022111 | All Organisms → cellular organisms → Bacteria | 1742
3300015245|Ga0137409_10044810 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4227
3300017939|Ga0187775_10211681 | Not Available | 725
3300017959|Ga0187779_10267241 | Not Available | 1086
3300017959|Ga0187779_10661630 | All Organisms → cellular organisms → Bacteria | 703
3300017961|Ga0187778_10036325 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → Candidatus Sulfotelmatomonas → Candidatus Sulfotelmatomonas gaucii | 3009
3300017973|Ga0187780_10090625 | All Organisms → cellular organisms → Bacteria | 2105
3300017974|Ga0187777_10003729 | All Organisms → cellular organisms → Bacteria | 9728
3300017997|Ga0184610_1002903 | All Organisms → cellular organisms → Bacteria | 3900
3300017999|Ga0187767_10370415 | Not Available | 509
3300018000|Ga0184604_10004350 | All Organisms → cellular organisms → Bacteria | 2517
3300018027|Ga0184605_10159917 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1016
3300018031|Ga0184634_10044111 | All Organisms → cellular organisms → Bacteria | 1820
3300018031|Ga0184634_10280272 | All Organisms → cellular organisms → Bacteria | 765
3300018051|Ga0184620_10152062 | All Organisms → cellular organisms → Bacteria | 749
3300018054|Ga0184621_10037792 | All Organisms → cellular organisms → Bacteria | 1571
3300018058|Ga0187766_11319093 | Not Available | 527
3300018059|Ga0184615_10131389 | All Organisms → cellular organisms → Bacteria | 1413
3300018060|Ga0187765_10545845 | Not Available | 740
3300018063|Ga0184637_10065849 | All Organisms → cellular organisms → Bacteria | 2217
3300018071|Ga0184618_10001359 | All Organisms → cellular organisms → Bacteria | 6023
3300018074|Ga0184640_10051318 | All Organisms → cellular organisms → Bacteria | 1706
3300018074|Ga0184640_10211464 | All Organisms → cellular organisms → Bacteria | 877
3300018075|Ga0184632_10387229 | All Organisms → cellular organisms → Bacteria | 590
3300018076|Ga0184609_10504818 | All Organisms → cellular organisms → Bacteria | 550
3300018079|Ga0184627_10066865 | All Organisms → cellular organisms → Bacteria | 1876
3300018422|Ga0190265_10022079 | All Organisms → cellular organisms → Bacteria | 5104
3300018422|Ga0190265_10027482 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4664
3300018422|Ga0190265_10240895 | All Organisms → cellular organisms → Bacteria | 1850
3300018422|Ga0190265_10245762 | All Organisms → cellular organisms → Bacteria | 1833
3300018429|Ga0190272_10038539 | All Organisms → cellular organisms → Bacteria | 2666
3300018429|Ga0190272_10150134 | All Organisms → cellular organisms → Bacteria | 1603
3300019249|Ga0184648_1032124 | All Organisms → cellular organisms → Bacteria | 574
3300019254|Ga0184641_1465701 | All Organisms → cellular organisms → Bacteria | 644
3300019360|Ga0187894_10021295 | All Organisms → cellular organisms → Bacteria | 4559
3300019377|Ga0190264_11556848 | All Organisms → cellular organisms → Bacteria | 578
3300019458|Ga0187892_10009175 | All Organisms → cellular organisms → Bacteria | 13128
3300019458|Ga0187892_10086669 | All Organisms → cellular organisms → Bacteria | 1923
3300019458|Ga0187892_10105669 | All Organisms → cellular organisms → Bacteria | 1671
3300019487|Ga0187893_10345630 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1033
3300019487|Ga0187893_10345634 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1033
3300019872|Ga0193754_1004571 | All Organisms → cellular organisms → Bacteria | 1345
3300019879|Ga0193723_1008245 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3399
3300019880|Ga0193712_1138320 | All Organisms → cellular organisms → Bacteria | 506
3300019881|Ga0193707_1067897 | All Organisms → cellular organisms → Bacteria | 1111
3300019882|Ga0193713_1016392 | All Organisms → cellular organisms → Bacteria | 2212
3300019883|Ga0193725_1086010 | All Organisms → cellular organisms → Bacteria | 759
3300019886|Ga0193727_1048016 | All Organisms → cellular organisms → Bacteria | 1388
3300019997|Ga0193711_1027054 | All Organisms → cellular organisms → Bacteria | 723
3300019998|Ga0193710_1000961 | All Organisms → cellular organisms → Bacteria | 3198
3300020002|Ga0193730_1151194 | All Organisms → cellular organisms → Bacteria | 613
3300020060|Ga0193717_1117426 | All Organisms → cellular organisms → Bacteria | 821
3300020063|Ga0180118_1233975 | All Organisms → cellular organisms → Bacteria | 653
3300020068|Ga0184649_1214622 | All Organisms → cellular organisms → Bacteria | 776
3300021051|Ga0206224_1038092 | All Organisms → cellular organisms → Bacteria | 621
3300021078|Ga0210381_10171918 | All Organisms → cellular organisms → Bacteria | 745
3300021090|Ga0210377_10259338 | All Organisms → cellular organisms → Bacteria | 1080
3300021972|Ga0193737_1052491 | All Organisms → cellular organisms → Bacteria | 583
3300022756|Ga0222622_10533275 | All Organisms → cellular organisms → Bacteria | 842
3300024330|Ga0137417_1093520 | All Organisms → cellular organisms → Bacteria | 1029
3300025160|Ga0209109_10102055 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1479
3300025165|Ga0209108_10034514 | All Organisms → cellular organisms → Bacteria | 2843
3300025558|Ga0210139_1005570 | All Organisms → cellular organisms → Bacteria | 2312
3300025904|Ga0207647_10617693 | All Organisms → cellular organisms → Bacteria | 597
3300025907|Ga0207645_10029587 | All Organisms → cellular organisms → Bacteria | 3531
3300025910|Ga0207684_10109141 | All Organisms → cellular organisms → Bacteria | 2368
3300025912|Ga0207707_10728585 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Acetobacteraceae | 831
3300025917|Ga0207660_10225324 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1472
3300025922|Ga0207646_10028165 | All Organisms → cellular organisms → Bacteria | 5116
3300025933|Ga0207706_10434964 | All Organisms → cellular organisms → Bacteria | 1136
3300025955|Ga0210071_1005596 | All Organisms → cellular organisms → Bacteria | 1431
3300025957|Ga0210089_1011230 | All Organisms → cellular organisms → Bacteria | 969
3300025981|Ga0207640_10657372 | All Organisms → cellular organisms → Bacteria | 894
3300026035|Ga0207703_12158609 | All Organisms → cellular organisms → Bacteria | 533
3300026285|Ga0209438_1001018 | All Organisms → cellular organisms → Bacteria | 9074
3300026351|Ga0257170_1004772 | All Organisms → cellular organisms → Bacteria | 1558
3300026354|Ga0257180_1028564 | All Organisms → cellular organisms → Bacteria | 750
3300026535|Ga0256867_10033738 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2133
3300027388|Ga0208995_1024981 | All Organisms → cellular organisms → Bacteria | 1039
3300027815|Ga0209726_10327774 | All Organisms → cellular organisms → Bacteria | 737
3300027846|Ga0209180_10002136 | All Organisms → cellular organisms → Bacteria | 9858
3300027862|Ga0209701_10481808 | Not Available | 677
3300027882|Ga0209590_10788208 | All Organisms → cellular organisms → Bacteria | 604
3300027947|Ga0209868_1022251 | All Organisms → cellular organisms → Bacteria | 653
3300027952|Ga0209889_1066174 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 743
3300027961|Ga0209853_1104674 | Not Available | 715
3300028380|Ga0268265_10112435 | All Organisms → cellular organisms → Bacteria | 2226
3300028715|Ga0307313_10107617 | All Organisms → cellular organisms → Bacteria | 849
3300028792|Ga0307504_10034720 | All Organisms → cellular organisms → Bacteria | 1359
3300028792|Ga0307504_10046947 | All Organisms → cellular organisms → Bacteria | 1217
3300028793|Ga0307299_10328112 | All Organisms → cellular organisms → Bacteria | 574
3300028807|Ga0307305_10108402 | All Organisms → cellular organisms → Bacteria | 1283
3300028819|Ga0307296_10132125 | All Organisms → cellular organisms → Bacteria | 1345
3300028824|Ga0307310_10271123 | All Organisms → cellular organisms → Bacteria | 820
3300028878|Ga0307278_10158526 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1013
3300030006|Ga0299907_10272675 | All Organisms → cellular organisms → Bacteria | 1385
3300030619|Ga0268386_10006707 | All Organisms → cellular organisms → Bacteria | 8688
3300030619|Ga0268386_10033529 | All Organisms → cellular organisms → Bacteria | 4063
3300030620|Ga0302046_10299700 | All Organisms → cellular organisms → Bacteria | 1322
3300030987|Ga0308155_1038468 | All Organisms → cellular organisms → Bacteria | 500
3300030990|Ga0308178_1105760 | All Organisms → cellular organisms → Bacteria | 603
(restricted) 3300031150|Ga0255311_1038037 | All Organisms → cellular organisms → Bacteria | 1008
(restricted) 3300031150|Ga0255311_1040708 | All Organisms → cellular organisms → Bacteria | 975
(restricted) 3300031150|Ga0255311_1079319 | All Organisms → cellular organisms → Bacteria | 702
(restricted) 3300031197|Ga0255310_10147917 | Not Available | 645
(restricted) 3300031197|Ga0255310_10217153 | All Organisms → cellular organisms → Bacteria | 537
3300031229|Ga0299913_10249238 | All Organisms → cellular organisms → Bacteria | 1759
(restricted) 3300031248|Ga0255312_1081075 | All Organisms → cellular organisms → Bacteria | 785
3300031720|Ga0307469_10069700 | All Organisms → cellular organisms → Bacteria | 2323
3300031720|Ga0307469_10419185 | All Organisms → cellular organisms → Bacteria | 1149
3300031740|Ga0307468_101367807 | Not Available | 649
3300031949|Ga0214473_10024045 | All Organisms → cellular organisms → Bacteria | 7130
3300031949|Ga0214473_10343285 | All Organisms → cellular organisms → Bacteria | 1696
3300032174|Ga0307470_10096915 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1677
3300033513|Ga0316628_102827663 | All Organisms → cellular organisms → Bacteria | 638
3300034165|Ga0364942_0022061 | All Organisms → cellular organisms → Bacteria | 2023
3300034178|Ga0364934_0340735 | All Organisms → cellular organisms → Bacteria | 568

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
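
Each scaffold identifier above has the form `<Taxon OID>|<scaffold name>`, and the part before the `|` matches the Taxon OID column of the Associated Samples table. A minimal Python sketch, assuming that layout holds for every row, splits an identifier and groups scaffolds by dataset. `ScaffoldRef` and `parse_scaffold` are hypothetical helper names, not part of NMPFamsDB or IMG/M.

```python
from collections import defaultdict
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # dataset identifier, as in the "Taxon OID" column
    scaffold: str     # scaffold name within that dataset
    restricted: bool  # True when the row carries the "(restricted)" prefix


def parse_scaffold(cell: str) -> ScaffoldRef:
    """Split a Scaffold cell such as '3300004463|Ga0063356_100063264'."""
    cell = cell.strip()
    restricted = cell.startswith("(restricted)")
    if restricted:
        cell = cell[len("(restricted)"):].strip()
    taxon_oid, sep, scaffold = cell.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {cell!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


# A few identifiers copied from the Associated Scaffolds table.
cells = [
    "3300000891|JGI10214J12806_10538818",
    "3300004463|Ga0063356_100063264",
    "3300004463|Ga0063356_101298590",
    "(restricted) 3300031150|Ga0255311_1038037",
]

# Group scaffold names by the dataset they come from.
by_dataset = defaultdict(list)
for cell in cells:
    ref = parse_scaffold(cell)
    by_dataset[ref.taxon_oid].append(ref.scaffold)

for taxon_oid, scaffolds in by_dataset.items():
    print(taxon_oid, scaffolds)
```

Keeping the restricted flag on the parsed record makes it straightforward to filter out datasets that fall under the JGI data usage policy before any downstream use.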




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 18.82%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.22%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 8.60%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 5.38%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 4.84%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.30%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 4.30%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 3.76%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 3.23%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 3.23%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 2.69%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.15%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.15%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 2.15%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 1.61%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.61%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 1.61%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.61%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.08%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.08%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.08%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.08%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 1.08%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.08%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 1.08%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 0.54%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.54%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.54%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.54%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.54%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.54%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.54%
Deep Subsurface Sediment | Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Deep Subsurface Sediment | 0.54%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 0.54%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.54%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.54%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.54%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.54%
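
Each Taxonomy value in the habitat table is an arrow-separated path, so rows can be rolled up to a coarser level by summing their distributions. The Python sketch below groups a handful of rows copied from the table by the first two path elements. The function name `rollup` and the choice of two levels are assumptions made for illustration, and the totals cover only the sample rows shown, not the whole table.

```python
from collections import defaultdict

# (habitat, taxonomy path, distribution in %) copied from the habitat table.
rows = [
    ("Soil",
     "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil",
     18.82),
    ("Vadose Zone Soil",
     "Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil",
     10.22),
    ("Groundwater Sediment",
     "Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment",
     8.60),
    ("Switchgrass Rhizosphere",
     "Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere",
     2.15),
    ("Populus Rhizosphere",
     "Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere",
     1.61),
]


def rollup(rows, depth=2):
    """Sum distribution percentages for rows sharing the first `depth` path elements."""
    totals = defaultdict(float)
    for _habitat, path, pct in rows:
        levels = tuple(part.strip() for part in path.split("→"))
        totals[levels[:depth]] += pct
    return dict(totals)


totals = rollup(rows)
# Print groups from largest to smallest share.
for key, pct in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(" → ".join(key), f"{pct:.2f} %")
```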




Associated Samples


Taxon OID | Sample Name | Habitat Type
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental
3300004019 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 | Environmental
3300004058 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 | Environmental
3300004062 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 | Environmental
3300004156 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300004643 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3 | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005530 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005547 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-3 metaG | Environmental
3300005617 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 | Host-Associated
3300005844 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 | Host-Associated
3300005937 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S3T2R1 | Host-Associated
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006845 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD5 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009053 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009795 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 | Environmental
3300009803 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_40_50 | Environmental
3300009806 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60 | Environmental
3300009812 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N2_50_60 | Environmental
3300009820 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010403 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3 | Environmental
3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated
3300011406 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT539_2 | Environmental
3300011436 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT642_2 | Environmental
3300012034 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT526_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012225 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2 | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014877Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT366_16_10DEnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017959Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_10_MGEnvironmentalOpen in IMG/M
3300017961Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_20_MGEnvironmentalOpen in IMG/M
3300017973Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP10_20_MGEnvironmentalOpen in IMG/M
3300017974Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_10_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300017999Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP10_10_MGEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018027Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018051Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1EnvironmentalOpen in IMG/M
3300018054Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1EnvironmentalOpen in IMG/M
3300018058Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018059Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coexEnvironmentalOpen in IMG/M
3300018060Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018079Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019249Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019254Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019872Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a1EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019880Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m1EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020060Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2EnvironmentalOpen in IMG/M
3300020063Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT730_16_1Ra (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300020068Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300021051Subsurface sediment microbial communities from Mancos shale, Colorado, United States - Mancos A1EnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2EnvironmentalOpen in IMG/M
3300021972Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2m2EnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025160Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2EnvironmentalOpen in IMG/M
3300025165Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1EnvironmentalOpen in IMG/M
3300025558Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025955Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleA_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025957Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025973Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025981Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026354Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-BEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300027388Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027947Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_40_50 (SPAdes)EnvironmentalOpen in IMG/M
3300027952Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_0_10 (SPAdes)EnvironmentalOpen in IMG/M
3300027961Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_30_40 (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028715Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_203EnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028807Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_186EnvironmentalOpen in IMG/M
3300028819Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300030620Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111EnvironmentalOpen in IMG/M
3300030987Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_144 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030990Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M
3300034165Sediment microbial communities from East River floodplain, Colorado, United States - 19_s17EnvironmentalOpen in IMG/M
3300034178Sediment microbial communities from East River floodplain, Colorado, United States - 27_j17EnvironmentalOpen in IMG/M

Geographical Distribution
(Interactive map of sample locations, powered by OpenStreetMap)




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI10214J12806_1053881843300000891SoilMASPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGSAAR*
Ga0055439_1000715543300004019Natural And Restored WetlandsMASPSTVYTVGDFIRRVVDSLLHGECRGQLLCARCLVKLTKDHLDRSYAKPEVTRVLDDIFASPGSLTLASASTCGLCARKKVPCLGGSAPR*
Ga0055498_1002148433300004058Natural And Restored WetlandsMALPSTVYTAGDFIRRVVDSLLHGERRGQFLCARCLVKIAKDNLDRSYMKPDIARAMEDIFASPGSLTLAPASLCALCARKKVACLGVSTSR*
Ga0055500_1000223543300004062Natural And Restored WetlandsMASPSTVYTVGDFIRRVVDSLLHGECRGQLLCARCLVKLTKDHLDRSYAKPEVTRVMDDIFASPGSLTLADASTCGLCARKKVPCLGGSAPR*
Ga0062589_10010305843300004156SoilMSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR*
Ga0063356_10006326453300004463Arabidopsis Thaliana RhizosphereMTSPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGSAAR*
Ga0063356_10129859013300004463Arabidopsis Thaliana RhizosphereMALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMDDIFATPGTLTLAPASLCALCARKKVACLGASPPR*
Ga0062592_10022134913300004480SoilMSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTC
Ga0062591_10011285833300004643SoilMSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR*
Ga0065707_1021250833300005295Switchgrass RhizosphereMVSPATVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR*
Ga0065707_1087451913300005295Switchgrass RhizosphereMSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEEIFANPGALALAPASLCALCARKKV
Ga0070680_10024078043300005336Corn RhizosphereRPSMASPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGSAAR*
Ga0070680_10121932033300005336Corn RhizosphereRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR*
Ga0070705_10037464233300005440Corn, Switchgrass And Miscanthus RhizosphereMSSPTTVYTVGDFIRRVVDSLLNGECRGQALCVRCLVKLTKDNLDRSYAKPEIARVMDEIFAAPDPLTLTPASTCGLCSRKKVACLGGSPAR*
Ga0070708_10001399373300005445Corn, Switchgrass And Miscanthus RhizosphereMRGQETVYTIGDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMEDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS*
Ga0070706_10052838133300005467Corn, Switchgrass And Miscanthus RhizosphereMRGQETVYTISDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMEDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS*
Ga0070699_10018213753300005518Corn, Switchgrass And Miscanthus RhizosphereMRGQETVYTIGDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMDDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS*
Ga0070679_10177957523300005530Corn RhizosphereMSLPSTVYTAGDFIRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR*
Ga0070696_10025178013300005546Corn, Switchgrass And Miscanthus RhizosphereMISPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALIVTPAATCGVCARKKVACLGGSAAR*
Ga0070693_10006186113300005547Corn, Switchgrass And Miscanthus RhizosphereMSLPSTVYTAGDFIRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTC
Ga0068859_10266758023300005617Switchgrass RhizosphereMSFPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR*
Ga0068862_10014837953300005844Switchgrass RhizosphereGPGPPPRTPRPEGAAREAVNALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR*
Ga0081455_1007912413300005937Tabebuia Heterophylla RhizosphereMGGHEPVYTVSDFVRRVIDSLLRGDCRGQFLCARCLIKLAKDHLDRSYVKADIARVIGDIFNAPGPITHAPAATCALCTKKKTPCLGVPTSES*
Ga0075417_1025148313300006049Populus RhizosphereHRDPVRHGPGPPPCTPRPEGPAREEVNALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR*
Ga0075421_10069574923300006845Populus RhizosphereMPAPDTVYTAGDFIRRVVESLLHGECRGLSLCVRCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPATTCARCARKKVSCLGGSASR*
Ga0099791_1026069533300007255Vadose Zone SoilMSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALMLAPASLCALCARKKVTCLGGSGPR*
Ga0099791_1065601923300007255Vadose Zone SoilMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGPLTLAPASLCALCARKKVSCLGASASR*
Ga0099794_1028314013300007265Vadose Zone SoilMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR*
Ga0105095_1067113913300009053Freshwater SedimentVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMDDIFATPGTLTLAPASICALCARKKVACLGASPPR*
Ga0105106_1082216613300009078Freshwater SedimentVDSLLRGECRGQFLCARCLVKLTKDNLDRSYMKPDIARVMDDIFATPGALTLVSSSLCALCARKKVPCLGASAPR*
Ga0099830_1093487123300009088Vadose Zone SoilMRGQGTVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYMKADIARVMDDIFNTPGPITHAAASTCALCARKKTPCLGVPTSVS*
Ga0099828_1183711613300009089Vadose Zone SoilMRGQGTVYTISDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYMKADIARVMDDIFNTPGPITHAAASTCALCARKKTPCLG
Ga0114129_1004089253300009147Populus RhizosphereMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR*
Ga0105092_1038135833300009157Freshwater SedimentGVLMALTSTVYTAGDFIRRVVDSLLHGERRGQFLCARCLVRLAKDNLDRSYMKPDIARAMEDIFAAPGALTLAPASLCAQCARKKVACLGASAPR*
Ga0105101_1031114213300009171Freshwater SedimentLSVSSSSTVYTAGDFIRRVVDSLLRGECRGQFLCARCLVKLTKDNLDRSYMKPDIARVMDDIFATPGTLTLAPASLCALCARKKVACLGASPPR*
Ga0105059_100543023300009795Groundwater SandMRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMDDIFNTPGPITHAPASTCALCARKKMLCLGVPIFVS*
Ga0105065_102357513300009803Groundwater SandMRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMEDIFNAPGPITHAPVSTCALCAKKKTPCLGVPTSVS*
Ga0105081_100641243300009806Groundwater SandMRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADITRVMEDIFNAPGPITHAPVSTCALCAKKKTPCLGVPTSVS*
Ga0105067_106819613300009812Groundwater SandMRGQETVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYVKADIARVMDDIFNAPGPITHAPVSTCALCAKKKTPCLGVPTSLS*
Ga0105085_104368523300009820Groundwater SandMRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMDDIFNTPGPITHAPASTCALCARKKTPCLGVPTSVS*
Ga0126376_1009702323300010359Tropical Forest SoilMGGHEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLVTLAKDHLDKSYVKAEIARVISDIFNAPGPITHAPAATCALCAKKQTPCLGVPTSVS*
Ga0126379_1097161123300010366Tropical Forest SoilMGGHEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLVTLAKDHLDKSYVKAEIARVIGDIFNAPGPITHAATATCALCAKKQTPCLGVPTSVS*
Ga0136847_1181490113300010391Freshwater SedimentMALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMDDIFATPGALTLAPASLCALCARKKVACLGASAAR*
Ga0134124_1011072313300010397Terrestrial SoilMSLPSTVYTAGDFIRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLADYAAE
Ga0134122_1070071733300010400Terrestrial SoilMTSPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGPAAR*
Ga0134123_1004923843300010403Terrestrial SoilMSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALILAPASLCALCARKKVTCLGGSGPR*
Ga0105246_1106935323300011119Miscanthus RhizosphereMSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEEIFANPGALALAPASLCALCARKKVTCLGGSGPR*
Ga0137454_103116913300011406SoilMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFVILGLFMFVLVSMCALC
Ga0137454_104140523300011406SoilRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMDDIFATPGPLTLAPASLCALCARKKVACLGASAAR*
Ga0137458_107005433300011436SoilMALPSTVFTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASLCALCARKKVACLGASAAR*
Ga0137453_111056513300012034SoilMVLPSTVFTAGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMEDIFATPGSLTLAPASLCALCARKKVACLGAS
Ga0137389_1113580123300012096Vadose Zone SoilMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASG*
Ga0137388_1165494223300012189Vadose Zone SoilMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPAS
Ga0137399_1048618213300012203Vadose Zone SoilMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGAPASR*
Ga0137434_101758213300012225SoilDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASLCALCARKKVSCLGGSASR*
Ga0137369_1076529633300012355Vadose Zone SoilMTSPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDHLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCAR
Ga0137375_1008919243300012360Vadose Zone SoilMTSPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDHLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR*
Ga0137397_1005999413300012685Vadose Zone SoilMSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALILAPASLCALCARKKVTCLGGSGPR*
Ga0137419_1111454433300012925Vadose Zone SoilMSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLC
Ga0137404_1007898033300012929Vadose Zone SoilMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGALASR*
Ga0137404_1156576213300012929Vadose Zone SoilRTPRPEGAAREEVSALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR*
Ga0157375_1162450833300013308Miscanthus RhizosphereMSLPSTVYTAGDFTRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR*
Ga0180066_100133543300014873SoilMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTPAPASTCALCARKKVSCLGGSASR*
Ga0180074_103044223300014877SoilMASPSTVFTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFANPGSLALAPASLCALCARKKVACLGASAPR*
Ga0180094_109359213300014881SoilMPVPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGPLTLAPASTCALCARKKVSCLGGSASR*
Ga0180104_103999423300014884SoilMPAPATVYTVGDFIRRVVDSLLHGECRSQLLCARCLVKLTKDNLDRSYAKPEIARVMDDIFATPGPLTLAPASTCALCARKKVSCLGGSASR*
Ga0180063_102211143300014885SoilAPATVYTVGDFIRRVVDSLLHGECRSQLLCARCLVKLTKDNLDRSYAKPDIARVMDDIFANPGSLALAPASLCALCARKKVACLGASAPR*
Ga0137409_1004481033300015245Vadose Zone SoilMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKVTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGALASR*
Ga0187775_1021168113300017939Tropical PeatlandRRVVDSLLRGDRRGQFLCARCLVKLAKGNLDRSYVKSDVEQAMDDIFDAPGSLTHAPESSCAMCAKKKTRCLGVTIS
Ga0187779_1026724113300017959Tropical PeatlandMRGQEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLVRLAKDHLDRSYVKAEIARAIGDIFNAPGPITHAPAATCALCAKKQTPCLGVPTSLS
Ga0187779_1066163023300017959Tropical PeatlandMERLDTVYTVDDFVRRVVESLLRARRGTFLCSRCLVKLTKENLDNSYARPDIVRVIAGIFDAPGWITHVPASRCAQCEKKKTSCLGVSLSSKEPK
Ga0187778_1003632543300017961Tropical PeatlandMRGQEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLIRLAKDHLDKSYAKAEIARAIGDIFNAPGPITHAPAATCALCAKKQTPCLGVPTSLS
Ga0187780_1009062543300017973Tropical PeatlandMRGQEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLIRLAKDHLDKSYAKAEIARAIGDIFNAPGPITHAPASTCALCAKKQTPCLGVPTSLS
Ga0187777_10003729133300017974Tropical PeatlandMRGQEPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLVRLAKDHLDRSYVKAEIARAIGDIFNAPGPITHAPAATCAVCAKKQTPCLGVPTSLS
Ga0184610_100290353300017997Groundwater SedimentMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARRKVSCLGGSASR
Ga0187767_1037041523300017999Tropical PeatlandDFVRRVVDSLLRGDCRGQFLCARCLIRLAKDHLDKSYAKAEIARAIGDIFNAPGPITHAPASTCALCAKKQTPCLGVPTSLS
Ga0184604_100043504 | 3300018000 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPA
Ga0184605_101599173 | 3300018027 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLC
Ga0184634_100441114 | 3300018031 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0184634_102802722 | 3300018031 | Groundwater Sediment | MHQQATVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYAKPDIMRVMDDIFNTPGPITHAPRSTCGLCARKK
Ga0184620_101520621 | 3300018051 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASICALCARKKVSCLGASVSR
Ga0184621_100377923 | 3300018054 | Groundwater Sediment | MALPATVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGALASR
Ga0184621_101590981 | 3300018054 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAP
Ga0187766_113190932 | 3300018058 | Tropical Peatland | ACGKKLPGPVLGHFVRRVVDSLLRGDCRGQFLCARCLIKLAKDHLDKSYVKADIARGVGDIFNAPGPITHAPAATCALCAKKQTPCLGVPTSVS
Ga0184615_101313892 | 3300018059 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLSKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSALR
Ga0187765_105458452 | 3300018060 | Tropical Peatland | MPAQAPVYTVSDFVRRVVDSLLRGDCRGQFLCARCLIKLAKDHLDKSYVKADIARGVGDIFNAPGPITHAPAATCALCAKKQTPCLGVPTSVS
Ga0184637_100658492 | 3300018063 | Groundwater Sediment | MRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMDDIFSTPGPITHAPASTCALCARKKTPCLGVPTSVS
Ga0184618_100013597 | 3300018071 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASVSR
Ga0184640_100513183 | 3300018074 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0184640_102114641 | 3300018074 | Groundwater Sediment | MHQQATVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYAKPDIMRVMDDIFNTPGPITHAPRSTCGLCARKKIPCLGVPLS
Ga0184632_103872292 | 3300018075 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSALR
Ga0184609_105048181 | 3300018076 | Groundwater Sediment | PVRHGPGPPPRTPRPEGAAREEVSALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLKASASR
Ga0184627_100668652 | 3300018079 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARRKVSCLRGSASR
Ga0190265_100220796 | 3300018422 | Soil | MALPSTVYTASDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYAKPDVARVMDDIFASPGSLVLAPASLCALCARKKVSCLGASAPR
Ga0190265_100274824 | 3300018422 | Soil | VSSPATVYTVGDFIRRVVESLLHGECRGQSLCARCLVKLTKDNLDRSYSKPDILRVMEDIFANPDPLTLAPESTCALCARKKVSCLGGSASR
Ga0190265_102408953 | 3300018422 | Soil | MAMPPTVYTVGDFVRRVVDSLLRGECRGQSLCARCLVKLTRDNLDRSYSKPDITQVMDDIFADPGAFTLAPVSTCALCARKKVSCLSVALNP
Ga0190265_102457624 | 3300018422 | Soil | MALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARAMDEIFAEPGALTLAPASLCAQCARKKVACLGASPPR
Ga0190272_100385394 | 3300018429 | Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDHLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0190272_101501344 | 3300018429 | Soil | MALPSTVYTAGDFIRRVVDSLLHGDHRGQFLCARCLVKLAKDNLDRSYMEPDSARVMDDIFATPGLLTLAPASLCALCARKKVACLGASLR
Ga0184648_10321241 | 3300019249 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTPAPASTCALCARKKVSCLGGSASR
Ga0184641_14657011 | 3300019254 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0187894_100212951 | 3300019360 | Microbial Mat On Rocks | MPAPDAVYTAGDFIRRVVESLLHGECRGSLLCVRCLVKLTKDNLDRSYAKPDIARVMEDIFATPGALTLAPATTCARCARKKVSCLGGSAAR
Ga0190264_115568482 | 3300019377 | Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYSKPDILRVMEDIFANPEPLTLAPESTCALCARKKVSCLGGSASR
Ga0187892_1000917513 | 3300019458 | Bio-Ooze | MRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMEDIFNAPGPITHAPVSTCALCAKKKTPCLGVPTSVS
Ga0187892_100866694 | 3300019458 | Bio-Ooze | VYTAGDFIRRVVESLLHGECRGSLLCVRCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPATTCARCARKKVSCLGGSVSR
Ga0187892_101056692 | 3300019458 | Bio-Ooze | VSGQPVVYTIGDFVRRVVGSLLQGEWRGRFLCARCLVKLTRGNMDRSYAKPDIARVMDGIFADPGAITLAPASACAHCTRKKVSCLGVPLSR
Ga0187893_103456304 | 3300019487 | Microbial Mat On Rocks | VESLLHGECRGSLLCVRCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPATTCARCARKKVSCLGGSVSR
Ga0187893_103456344 | 3300019487 | Microbial Mat On Rocks | VESLLHGECRGSLLCVRCLVKLTKDNLDRSYAKPDIARVMEDIFATPGALTLAPATTCARCARKKVSCLGGSAAR
Ga0193754_10045714 | 3300019872 | Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEEIFANPGALALAPASLCALCARKKVTCLGGSGPR
Ga0193723_10082456 | 3300019879 | Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEEIFANPGALALAPASLCALCARKKVTCLGGSGSR
Ga0193712_11383201 | 3300019880 | Soil | YTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASVSR
Ga0193707_10678972 | 3300019881 | Soil | MALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0193713_10163924 | 3300019882 | Soil | MALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASVSR
Ga0193725_10860101 | 3300019883 | Soil | MALPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGPLTLAPASLCALCARKKVSCLGASVSR
Ga0193727_10480162 | 3300019886 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGALASR
Ga0193711_10270542 | 3300019997 | Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEDIFANPGALALAPASLCALCARKKVTCLGGSGPR
Ga0193710_10009614 | 3300019998 | Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGPLTLAPASLCALCARKKVTCLGGSGPR
Ga0193730_11511942 | 3300020002 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASASR
Ga0193717_11174261 | 3300020060 | Soil | MALPSTVYTAGDFIRRVVDSLLHGDHRGQFLCARCLVKLAKDNLDRSYMKPDIARVMDDIFATPGLLTLAPAS
Ga0180118_12339752 | 3300020063 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRSQLLCARCLVKLTKDNLDRSYAKPEIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0184649_12146223 | 3300020068 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLSKDNLDRSYAKPDIARVMDDIFATPGSLPLAPASTCALCARKKVSCLGGSALR
Ga0206224_10380923 | 3300021051 | Deep Subsurface Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALC
Ga0210381_101719181 | 3300021078 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLVPASLC
Ga0210377_102593382 | 3300021090 | Groundwater Sediment | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLSKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSALR
Ga0193719_100417911 | 3300021344 | Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMEEIFANPGALALALA
Ga0193737_10524911 | 3300021972 | Soil | YTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGALASR
Ga0222622_105332752 | 3300022756 | Groundwater Sediment | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGLLTLAPASLCALCARKKVSCLGASASR
Ga0137417_10935204 | 3300024330 | Vadose Zone Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGAPASR
Ga0209109_101020552 | 3300025160 | Soil | MPAPQTVYTISDFVRRVVESLLRGECRGQSLCARCLVKLTRDNLDRSYAKPDVTQVMDEIFADPRALTLAPASTCALCARKKVSCLSVAPHP
Ga0209108_100345143 | 3300025165 | Soil | MPAPQTVYTISDFVRRVVESLLRGECRGQSLCARCLVKLTRDNLDRSYAKPDVTQVMDEIFADPRALMLAPASTCALCARKKVSCLSVAPHP
Ga0210139_10055704 | 3300025558 | Natural And Restored Wetlands | MASPSTVYTVGDFIRRVVDSLLHGECRGQLLCARCLVKLTKDHLDRSYAKPEVTRVLDDIFASPGSLTLASASTCGLCARKKVPCLGGSAPR
Ga0207647_106176933 | 3300025904 | Corn Rhizosphere | MSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKK
Ga0207645_100295874 | 3300025907 | Miscanthus Rhizosphere | MSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR
Ga0207684_101091414 | 3300025910 | Corn, Switchgrass And Miscanthus Rhizosphere | MRGQETVYTISDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMEDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS
Ga0207707_107285852 | 3300025912 | Corn Rhizosphere | MTSPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGSAAR
Ga0207660_102253243 | 3300025917 | Corn Rhizosphere | MASPTTVFTVGDFIRRVVDSLLSGECRGQSLCARCLIKLTKDNLDRSYAKPEIVRVMDEIFAAPGALTVTPAATCGVCARKKVACLGGSAAR
Ga0207646_100281657 | 3300025922 | Corn, Switchgrass And Miscanthus Rhizosphere | MRGQETVYTIGDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMEDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS
Ga0207706_104349641 | 3300025933 | Corn Rhizosphere | MSLPSTVYTAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR
Ga0210071_10055962 | 3300025955 | Natural And Restored Wetlands | MASPSTVYTVGDFIRRVVDSLLHGECRGQLLCARCLVKLTKDHLDRSYAKPEVTRVMDDIFASPGSLTLADASTCGLCARKKVPCLGGSAPR
Ga0210089_10112302 | 3300025957 | Natural And Restored Wetlands | MALPSTVYTAGDFIRRVVDSLLHGERRGQFLCARCLVKIAKDNLDRSYMKPDIARAMEDIFASPGSLTLAPASLCALCARKKVACLGVSTSR
Ga0210145_10156391 | 3300025973 | Natural And Restored Wetlands | MASPSTVYTVGDFIRRVVDSLLHGECRGQLLCARCLVKLTKDHLDRSYAKPEVTRVMDDIFASPGSLTLADAS
Ga0207640_106573723 | 3300025981 | Corn Rhizosphere | MSLPSTVYTAGDFIRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR
Ga0207703_121586091 | 3300026035 | Switchgrass Rhizosphere | TAGDFVRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFSNPGALILAPASLCALCARKKVTCLGGSGPR
Ga0209438_10010186 | 3300026285 | Grasslands Soil | MSLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALILAPASLCALCARKKVTCLGGSGPR
Ga0257170_10047724 | 3300026351 | Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVACLGGSASG
Ga0257180_10285643 | 3300026354 | Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSC
Ga0256867_100337384 | 3300026535 | Soil | VSALRPSSSSTVYTVGDFVRRVVASLLHGEFRGRFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFADPGPAITLVLASACALCARKKVSCLGVAPPR
Ga0208995_10249812 | 3300027388 | Forest Soil | MTSPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVRLTKDNLDRSYAKPDVARVMDDIFAAPGSLTLAPASLCALCARKKVSCLGASVSR
Ga0209726_103277741 | 3300027815 | Groundwater | SLSRAGKRGPARGERLMPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0209180_1000213612 | 3300027846 | Vadose Zone Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASG
Ga0209701_104818081 | 3300027862 | Vadose Zone Soil | MRGQGTVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYMKADIARVMDDIFNTPGPITHAAASTCALCARKKTPCLGVPTSVS
Ga0209590_107882081 | 3300027882 | Vadose Zone Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCAQ
Ga0209868_10222511 | 3300027947 | Groundwater Sand | MRGQETVYTISDFVRRVVDSLLRGDCRGQFLCSRCLIKLAKDNLDKSYVKADIARVMDDIFNTPGPITHAPASTCALCARKKTPCLGVPTSVS
Ga0209889_10661741 | 3300027952 | Groundwater Sand | RRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYVKADIARVMDDIFNAPGPITHAPVSTCALCAKKKTPCLGVPTSLS
Ga0209853_11046742 | 3300027961 | Groundwater Sand | MRGQETVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYVKADIARVMDDIFNAPSPIAHAPASTCALCARTNMPGLGVPLS
Ga0268265_101124352 | 3300028380 | Switchgrass Rhizosphere | MSLPSTVYTAGDFIRRVVESLLRGECRGQFLCARCLVKLTKDNLDRSYAKPDVARAMEDIFANPGALTLAPASLCALCARKKVTCLGGSGPR
Ga0307313_101076172 | 3300028715 | Soil | DPVWHGPGPPPRTPRPEGAAREEVSALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0307504_100347203 | 3300028792 | Soil | MEKLATVYTVDDFVRRVVDSLLRGGERRGLFLCSRCLVKLTRENLDKSYAKPEIARVMDDIFDAPGSITHEPTCACAGCGRKKVRCLGVPLP
Ga0307504_100469474 | 3300028792 | Soil | MRGQETVYTISDFVRRVVDSLLRGECRGQFLCYRCLVKLAKDNLDKSYMKADIMRVMDDIFRAPGPITHAPASTCALCAKKKTACLGVPTSLS
Ga0307299_103281121 | 3300028793 | Soil | SPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0307305_101084021 | 3300028807 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCL
Ga0307296_101321251 | 3300028819 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVSRVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0307310_102711232 | 3300028824 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGALTLAPASLCALCARKKVSCLGASLSR
Ga0307278_101585263 | 3300028878 | Soil | IRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASAAR
Ga0299907_102726752 | 3300030006 | Soil | VSALRPSSSTTVYTVGDFVRRVVASLLHGEFRGRFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFADPGPAITLVPASACALCARKKVSCLGVAPPR
Ga0268386_100067073 | 3300030619 | Soil | VSALRPSSSTTVYTVGDFVRRVVASLLHGEFRGRFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFADPGPAITLVLASACALCARKKVSCLGVAPPR
Ga0268386_100335291 | 3300030619 | Soil | VSALRPSSSSTVYTVGDFVRRVVASLLHGEFRGRFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFAEPGPAITLVLASA
Ga0302046_102997004 | 3300030620 | Soil | MPAPATVYTVGDFIRRVVDSLLHGECRGQFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFADPGPAITLVPASACALCARKKVSCLGVAPPR
Ga0308155_10384681 | 3300030987 | Soil | RDPVRHGPGPPPRTPRPEGAAREEVSALMASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGSLTLAPASLCALCARKKVSCLGASLSR
Ga0308178_11057603 | 3300030990 | Soil | MASPSTVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDVARVMDDIFATPGLLTLAPASLCALCARKKVSCL
(restricted) Ga0255311_10380372 | 3300031150 | Sandy Soil | MTLPSTVYTAGDFIRRVVESLLHGECRGQFLCARCLVKLTKDNLDRSYTKPDVARVMDDIFANPGALALAPASLCGLCARKKVTCLGGGAPR
(restricted) Ga0255311_10407081 | 3300031150 | Sandy Soil | VISSSTVYTAGDFIRRVVDSLLRGECRGQFLCARCLVKLTKDNLDRSYMKPDIARVMDEIFATPGALTLVSSSLCALCARKKVPCLGAAAPR
(restricted) Ga0255311_10793192 | 3300031150 | Sandy Soil | MEKLATVYTVDDFVRRVVDSLLRGGERRGLFLCSRCLVKLTRENLDKSYAKPEIARVMDDIFDAPGSITHEPTCACAGCGRKKVR
(restricted) Ga0255310_101479171 | 3300031197 | Sandy Soil | MRGQETVYTISDFVRRVVDSLLRGECRGQFLCYRCLVKLAKDNLDKSYVKADIMRVMDDIFSAPGPITHAPASTCALCAKKKTACLGVPTSLS
(restricted) Ga0255310_102171532 | 3300031197 | Sandy Soil | SLPPASSGAPGNGDRLSMEKLATVYTVDDFVRRVVDSLLRGGERRGLFLCSRCLVKLTRENLDKSYAKPEIARVMDDIFDAPGSITHEPTCACAGCGRKKVRCLGVPLP
Ga0299913_102492384 | 3300031229 | Soil | VSALRPSSSTAVYTVGDFVRRVVASLLHGEFRGQFLCARCLVKLTKENLDRSYTKPDVTMVMDEIFADPGPAITLVPASACALCARKKVSCLGVAPPR
(restricted) Ga0255312_10810752 | 3300031248 | Sandy Soil | VYTAGDFIRRVVDSLLRGECRGQFLCARCLVKLTKDNLDRSYMKPDIARVMDEIFATPGALTLVSSSLCALCARKKVPCLGAAAPR
Ga0307469_100697004 | 3300031720 | Hardwood Forest Soil | MRGQETVYTIGDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDKSYVKADIMRVMDDIFSAPGPITHAPASTCALCAKKKTACLGVPTSVS
Ga0307469_104191852 | 3300031720 | Hardwood Forest Soil | MSSPTTVYTVGDFIRRVVDSLLNGECRGQALCVRCLVKLTKDNLDRSYAKPEIARVMDEIFAAPDPLTLTPASTCGLCSRKKVACLGGSPAR
Ga0307468_1013678071 | 3300031740 | Hardwood Forest Soil | MALPSTVYTASDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMEDIFATPGSLTLAPASLCSQCARKKVACLGASAPR
Ga0214473_100240454 | 3300031949 | Soil | MRGQETVYTISDFVRRVVDSLLRGECRGQFLCSRCLVKLAKDNLDRSYMKADIARVMDDIFNAPGPITRAPVSTCALCAKKKTPCLGVPTSVG
Ga0214473_103432854 | 3300031949 | Soil | MAAPATVYTAGDFIRRVVDSLLHGECRGQFLCARCLVKLTKDNLDRSYAKPDIARVMDDIFATPGSLTLAPASTCALCARKKVSCLGGSASR
Ga0307470_100969151 | 3300032174 | Hardwood Forest Soil | MALPSTVYTASDFIRRVVDSLLHGECRGQFLCARCLVKLAKDNLDRSYMKPDIARVMEDIFATPGSLTLAPASLCSQCARKKVACLGAKAPR
Ga0316628_1028276631 | 3300033513 | Soil | PHAPEPQDKAREEVSVISSSTVYTAGDFIRRVVDSLLRGECRGQFLCARCLVKLTKDNLDRSYMKPDIARVMDEIFATPGALTLVSSSLCALCARKKVPCLGASAPR
Ga0364942_0022061_793_1113 | 3300034165 | Sediment | LAGRRLRGGHRDGDSMHQQATVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYAKPDIMRVMDDIFNTPGPITHAPRSTCGLCARKKIPCLGVPLP
Ga0364934_0340735_175_450 | 3300034178 | Sediment | MHQQATVYTVSDFVRRVVDSLLRGDCRGQFLCSRCLVKLAKDNLDKSYAKPDIMRVMDDIFNTPGPITHAPRSTCGLCARKKVPCLGVPLS


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice