NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome Family F044355



Overview

Basic Information
Family ID F044355
Family Type Metagenome
Number of Sequences 154
Average Sequence Length 97 residues
Representative Sequence LTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK
Number of Associated Samples 129
Number of Associated Scaffolds 154

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 73.86 %
% of genes near scaffold ends (potentially truncated) 24.03 %
% of genes from short scaffolds (< 2000 bps) 76.62 %
Associated GOLD sequencing projects 119
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.64

Hidden Markov Model (sequence logo, rendered with Skylign)

Most Common Taxonomy
Group Bacteria (63.636 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(20.779 % of family members)
Environment Ontology (ENVO) Unclassified
(37.662 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(37.013 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix 31.15%, β-sheet 18.03%, Coil/Unstructured 50.82%

Predicted 3D Structure

pTM-score: 0.64

Low Quality Model:

This family has a low-confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
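
As a minimal sketch of the cutoff stated above, the following Python snippet flags a model as low confidence when its pTM-score falls below 0.7. The constant and function names are hypothetical and are not part of NMPFamsDB; only the 0.7 threshold and the 0.64 score are taken from this page.

```python
# Hypothetical helper: the threshold value comes from the note above,
# the names are illustrative assumptions.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm_score: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score is strictly below the threshold."""
    return ptm_score < threshold


if __name__ == "__main__":
    # pTM-score reported for family F044355 on this page.
    print(is_low_confidence(0.64))
```

A strict comparison is used because the note states "pTM < 0.7"; a score of exactly 0.7 would not be flagged under this reading.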



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 154 Family Scaffolds
PF06052 | 3-HAO | 3.90
PF00300 | His_Phos_1 | 3.25
PF01636 | APH | 3.25
PF02698 | DUF218 | 2.60
PF06348 | DUF1059 | 2.60
PF00072 | Response_reg | 1.95
PF01208 | URO-D | 1.95
PF08241 | Methyltransf_11 | 1.95
PF03030 | H_PPase | 1.30
PF13649 | Methyltransf_25 | 1.30
PF03435 | Sacchrp_dh_NADP | 1.30
PF00535 | Glycos_transf_2 | 1.30
PF00578 | AhpC-TSA | 1.30
PF00702 | Hydrolase | 0.65
PF01841 | Transglut_core | 0.65
PF02416 | TatA_B_E | 0.65
PF07690 | MFS_1 | 0.65
PF09851 | SHOCT | 0.65
PF02518 | HATPase_c | 0.65
PF13683 | rve_3 | 0.65
PF00699 | Urease_beta | 0.65
PF13751 | DDE_Tnp_1_6 | 0.65
PF13541 | ChlI | 0.65
PF00239 | Resolvase | 0.65
PF01135 | PCMT | 0.65
PF00589 | Phage_integrase | 0.65
PF03992 | ABM | 0.65
PF11249 | DUF3047 | 0.65
PF07238 | PilZ | 0.65
PF12570 | DUF3750 | 0.65
PF08281 | Sigma70_r4_2 | 0.65
PF13276 | HTH_21 | 0.65
PF04055 | Radical_SAM | 0.65
PF00583 | Acetyltransf_1 | 0.65
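
The frequencies above can be read back as scaffold counts. The sketch below assumes that each percentage is a count divided by the 154 family scaffolds and rounded to two decimals; this is an interpretation of the table, not something the page states explicitly. The function name is hypothetical.

```python
# Number of family scaffolds, as given in the table header.
FAMILY_SCAFFOLDS = 154


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Convert a rounded percentage back to the nearest whole scaffold count."""
    return round(percent * total / 100)


if __name__ == "__main__":
    # A few (Pfam ID, % frequency) pairs copied from the table above.
    for pfam_id, pct in [("PF06052", 3.90), ("PF00300", 3.25), ("PF00583", 0.65)]:
        print(pfam_id, percent_to_count(pct))
```

Under this assumption, the lowest frequency in the table (0.65 %) corresponds to a single scaffold.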

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 154 Family Scaffolds
COG1434 | Lipid carrier protein ElyC involved in cell wall biogenesis, DUF218 family | Cell wall/membrane/envelope biogenesis [M] | 2.60
COG2949 | Uncharacterized periplasmic protein SanA, affects membrane permeability for vancomycin | Cell wall/membrane/envelope biogenesis [M] | 2.60
COG0407 | Uroporphyrinogen-III decarboxylase HemE | Coenzyme transport and metabolism [H] | 1.95
COG3808 | Na+ or H+-translocating membrane pyrophosphatase | Energy production and conversion [C] | 1.30
COG0832 | Urease beta subunit | Amino acid transport and metabolism [E] | 0.65
COG1826 | Twin-arginine protein secretion pathway components TatA and TatB | Intracellular trafficking, secretion, and vesicular transport [U] | 0.65
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.65
COG2226 | Ubiquinone/menaquinone biosynthesis C-methylase UbiE/MenG | Coenzyme transport and metabolism [H] | 0.65
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.65
COG2518 | Protein-L-isoaspartate O-methyltransferase | Posttranslational modification, protein turnover, chaperones [O] | 0.65
COG2519 | tRNA A58 N-methylase Trm61 | Translation, ribosomal structure and biogenesis [J] | 0.65
COG4122 | tRNA 5-hydroxyU34 O-methylase TrmR/YrrM | Translation, ribosomal structure and biogenesis [J] | 0.65



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 63.64 %
Unclassified | root | N/A | 36.36 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000891|JGI10214J12806_11110157 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1553
3300001661|JGI12053J15887_10330828 | Not Available | 740
3300001661|JGI12053J15887_10529300 | Not Available | 562
3300002121|C687J26615_10053802 | Not Available | 1001
3300002122|C687J26623_10015017 | Not Available | 1992
3300002245|JGIcombinedJ26739_101224755 | All Organisms → cellular organisms → Bacteria | 640
3300003994|Ga0055435_10066840 | Not Available | 899
3300004463|Ga0063356_101016693 | Not Available | 1185
3300005174|Ga0066680_10262009 | Not Available | 1100
3300005294|Ga0065705_10321121 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1010
3300005295|Ga0065707_10573312 | Not Available | 706
3300005332|Ga0066388_100192676 | All Organisms → cellular organisms → Bacteria | 2654
3300005332|Ga0066388_101408380 | All Organisms → cellular organisms → Bacteria | 1211
3300005406|Ga0070703_10158036 | Not Available | 857
3300005406|Ga0070703_10549343 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 526
3300005440|Ga0070705_100096874 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1853
3300005445|Ga0070708_100073099 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3090
3300005467|Ga0070706_100030703 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 4952
3300005467|Ga0070706_100040079 | All Organisms → cellular organisms → Bacteria | 4323
3300005467|Ga0070706_100336484 | All Organisms → cellular organisms → Bacteria | 1407
3300005471|Ga0070698_100636263 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1008
3300005471|Ga0070698_101694803 | Not Available | 584
3300005536|Ga0070697_100010922 | All Organisms → cellular organisms → Bacteria | 7093
3300005546|Ga0070696_100436567 | All Organisms → cellular organisms → Bacteria | 1031
3300005549|Ga0070704_100037221 | All Organisms → cellular organisms → Bacteria | 3322
3300005880|Ga0075298_1000598 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1780
3300006050|Ga0075028_100629150 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 640
3300006806|Ga0079220_10632808 | All Organisms → cellular organisms → Bacteria | 769
3300007255|Ga0099791_10000228 | All Organisms → cellular organisms → Bacteria | 19651
3300007258|Ga0099793_10060010 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1698
3300009038|Ga0099829_10000663 | All Organisms → cellular organisms → Bacteria | 18224
3300009038|Ga0099829_10020213 | All Organisms → cellular organisms → Bacteria | 4552
3300009088|Ga0099830_10002643 | All Organisms → cellular organisms → Bacteria | 9552
3300009088|Ga0099830_10024758 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 3970
3300009090|Ga0099827_10120428 | All Organisms → cellular organisms → Bacteria | 2113
3300009147|Ga0114129_10030022 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 7697
3300009174|Ga0105241_11939900 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 578
3300010359|Ga0126376_10268038 | All Organisms → cellular organisms → Bacteria | 1464
3300010360|Ga0126372_10294780 | All Organisms → cellular organisms → Bacteria | 1422
3300010360|Ga0126372_12940608 | All Organisms → cellular organisms → Bacteria | 528
3300010366|Ga0126379_10683044 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1119
3300010400|Ga0134122_10597116 | All Organisms → cellular organisms → Bacteria | 1019
3300012174|Ga0137338_1146172 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 516
3300012189|Ga0137388_10133710 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2176
3300012199|Ga0137383_10499916 | Not Available | 890
3300012205|Ga0137362_10943246 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 736
3300012207|Ga0137381_10993976 | Not Available | 724
3300012209|Ga0137379_10342611 | Not Available | 1404
3300012211|Ga0137377_10003324 | All Organisms → cellular organisms → Bacteria | 12426
3300012360|Ga0137375_10449408 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 1111
3300012361|Ga0137360_10660727 | All Organisms → cellular organisms → Bacteria | 896
3300012582|Ga0137358_10028026 | All Organisms → cellular organisms → Bacteria | 3678
3300012918|Ga0137396_10673373 | Not Available | 764
3300012923|Ga0137359_10612719 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 954
3300012929|Ga0137404_10012260 | Not Available | 5971
3300012929|Ga0137404_11282298 | Not Available | 675
3300012930|Ga0137407_10036178 | All Organisms → cellular organisms → Bacteria | 3903
3300012944|Ga0137410_10017769 | All Organisms → cellular organisms → Bacteria | 4865
3300013297|Ga0157378_13024588 | Not Available | 522
3300014881|Ga0180094_1009444 | All Organisms → cellular organisms → Bacteria | 1784
3300014884|Ga0180104_1046192 | All Organisms → cellular organisms → Bacteria | 1158
3300014885|Ga0180063_1057255 | All Organisms → cellular organisms → Bacteria | 1131
3300015170|Ga0120098_1019617 | Not Available | 822
3300015245|Ga0137409_10227881 | Not Available | 1665
3300015264|Ga0137403_11184654 | Not Available | 610
3300017939|Ga0187775_10004953 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3198
3300017997|Ga0184610_1127178 | Not Available | 825
3300018000|Ga0184604_10068495 | Not Available | 1030
3300018028|Ga0184608_10116629 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1129
3300018028|Ga0184608_10247215 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 785
3300018028|Ga0184608_10490783 | Not Available | 526
3300018031|Ga0184634_10045521 | All Organisms → cellular organisms → Bacteria | 1796
3300018032|Ga0187788_10249878 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 703
3300018052|Ga0184638_1029486 | All Organisms → cellular organisms → Bacteria | 1972
3300018052|Ga0184638_1046754 | All Organisms → cellular organisms → Bacteria | 1575
3300018053|Ga0184626_10029101 | Not Available | 2274
3300018061|Ga0184619_10070939 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1539
3300018064|Ga0187773_10009696 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 3988
3300018066|Ga0184617_1110945 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 776
3300018071|Ga0184618_10109774 | Not Available | 1092
3300018071|Ga0184618_10386581 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 594
3300018075|Ga0184632_10486839 | Not Available | 507
3300018076|Ga0184609_10172596 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1002
3300018084|Ga0184629_10071631 | Not Available | 1638
3300018089|Ga0187774_11041474 | Not Available | 574
3300018422|Ga0190265_10096050 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2762
3300019458|Ga0187892_10006853 | All Organisms → cellular organisms → Bacteria | 16625
3300019487|Ga0187893_10260250 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla | 1269
3300019789|Ga0137408_1017163 | Not Available | 1624
3300019879|Ga0193723_1036077 | Not Available | 1479
3300019881|Ga0193707_1031030 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1747
3300019883|Ga0193725_1028133 | All Organisms → cellular organisms → Bacteria | 1511
3300019883|Ga0193725_1035162 | Not Available | 1324
3300019885|Ga0193747_1143038 | All Organisms → cellular organisms → Bacteria | 549
3300019886|Ga0193727_1157645 | Not Available | 609
3300019998|Ga0193710_1017949 | Not Available | 728
3300020001|Ga0193731_1097871 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 757
3300020002|Ga0193730_1075422 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 957
3300020004|Ga0193755_1000844 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 9056
3300020004|Ga0193755_1160031 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 676
3300020015|Ga0193734_1049397 | Not Available | 775
3300020021|Ga0193726_1126048 | All Organisms → cellular organisms → Bacteria | 1136
3300021073|Ga0210378_10196806 | Not Available | 770
3300021078|Ga0210381_10207997 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 685
3300021080|Ga0210382_10554521 | Not Available | 509
3300021086|Ga0179596_10068891 | All Organisms → cellular organisms → Bacteria | 1529
3300021088|Ga0210404_10677035 | Not Available | 588
3300021344|Ga0193719_10035090 | All Organisms → cellular organisms → Bacteria | 2168
3300021344|Ga0193719_10272098 | Not Available | 713
3300021432|Ga0210384_10094634 | Not Available | 2682
3300021560|Ga0126371_11503937 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 801
3300022694|Ga0222623_10360017 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 555
3300022756|Ga0222622_10800047 | Not Available | 689
3300025324|Ga0209640_10027230 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 4981
3300025885|Ga0207653_10076013 | All Organisms → cellular organisms → Bacteria | 1155
3300025910|Ga0207684_10011899 | All Organisms → cellular organisms → Bacteria | 7584
3300025910|Ga0207684_10073537 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 2903
3300025910|Ga0207684_10229371 | Not Available | 1602
3300025922|Ga0207646_10081117 | All Organisms → cellular organisms → Bacteria | 2900
3300025922|Ga0207646_10146397 | All Organisms → cellular organisms → Bacteria | 2129
3300025999|Ga0208417_105679 | All Organisms → cellular organisms → Bacteria | 614
3300026340|Ga0257162_1000168 | All Organisms → cellular organisms → Bacteria | 5644
3300026351|Ga0257170_1003293 | All Organisms → cellular organisms → Bacteria | 1771
3300026371|Ga0257179_1001097 | Not Available | 1853
3300026371|Ga0257179_1013937 | Not Available | 876
3300026446|Ga0257178_1041638 | Not Available | 589
3300026481|Ga0257155_1041517 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 705
3300026481|Ga0257155_1062937 | All Organisms → cellular organisms → Bacteria | 590
3300026497|Ga0257164_1004897 | Not Available | 1494
3300026507|Ga0257165_1009341 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1508
3300026535|Ga0256867_10181026 | Not Available | 778
3300026557|Ga0179587_10284933 | Not Available | 1062
3300026557|Ga0179587_10518846 | Not Available | 782
3300027388|Ga0208995_1058693 | Not Available | 676
3300027846|Ga0209180_10386076 | All Organisms → cellular organisms → Bacteria | 795
3300027882|Ga0209590_10113125 | All Organisms → cellular organisms → Bacteria | 1639
3300027903|Ga0209488_10833746 | All Organisms → cellular organisms → Bacteria | 651
3300027903|Ga0209488_10930640 | Not Available | 607
3300028047|Ga0209526_10332125 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1022
3300028673|Ga0257175_1012776 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1301
3300028793|Ga0307299_10064019 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1362
3300028811|Ga0307292_10496733 | Not Available | 523
3300028814|Ga0307302_10294203 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 798
3300028824|Ga0307310_10330371 | Not Available | 747
3300028828|Ga0307312_10493535 | Not Available | 808
3300028878|Ga0307278_10436872 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria | 574
3300030619|Ga0268386_10402608 | Not Available | 962
(restricted) 3300031248|Ga0255312_1006486 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 2787
3300031720|Ga0307469_10021383 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 3527
3300031720|Ga0307469_10402579 | Not Available | 1169
3300032174|Ga0307470_10298866 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1091
3300032180|Ga0307471_100383293 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1528
3300032770|Ga0335085_12529617 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 509
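
Each scaffold identifier in this table appears to join a numeric prefix and a scaffold name with a `|` character, and the prefix matches the Taxon OID values listed under Associated Samples. The Python sketch below splits such an identifier and computes the share of scaffolds shorter than 2000 bp, the cutoff named in the Quality Assessment section. The function names are hypothetical, and the three rows are a small sample copied from the table, so the result does not reproduce the family-wide 76.62 % figure.

```python
# Cutoff for "short scaffolds" as stated in the Quality Assessment section.
SHORT_SCAFFOLD_BP = 2000


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split 'TaxonOID|ScaffoldName' into (taxon_oid, scaffold_name)."""
    taxon_oid, sep, scaffold_name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


def short_fraction(lengths: list[int], cutoff: int = SHORT_SCAFFOLD_BP) -> float:
    """Return the fraction of scaffold lengths strictly below the cutoff."""
    if not lengths:
        raise ValueError("no scaffold lengths given")
    return sum(1 for n in lengths if n < cutoff) / len(lengths)


if __name__ == "__main__":
    # (scaffold identifier, length in bp) pairs copied from the table above.
    rows = [
        ("3300000891|JGI10214J12806_11110157", 1553),
        ("3300007255|Ga0099791_10000228", 19651),
        ("3300012174|Ga0137338_1146172", 516),
    ]
    for scaffold_id, length in rows:
        print(split_scaffold_id(scaffold_id), length)
    print(short_fraction([length for _, length in rows]))
```

`str.partition` is used rather than `split` so that only the first `|` is treated as the separator.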

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 20.78%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 15.58%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 11.69%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 10.39%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.79%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.90%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 3.25%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 2.60%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.60%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 2.60%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.95%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 1.95%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.30%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.30%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 1.30%
Rice Paddy Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil | 1.30%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.30%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.65%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.65%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.65%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.65%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.65%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.65%
Fossill | Environmental → Terrestrial → Soil → Fossil → Unclassified → Fossill | 0.65%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.65%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 0.65%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 0.65%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.65%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.65%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.65%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300000891 | Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soil | Environmental
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002121 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 | Environmental
3300002122 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300003994 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300012174 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300013297Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaGHost-AssociatedOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015170Fossil microbial communities from human bone sample from Teposcolula Yucundaa, Mexico - TP48EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018032Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_BV01_MP10_20_MGEnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018061Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_b1EnvironmentalOpen in IMG/M
3300018064Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_10_MGEnvironmentalOpen in IMG/M
3300018066Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_b1EnvironmentalOpen in IMG/M
3300018071Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_b1EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018089Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP05_20_MGEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2EnvironmentalOpen in IMG/M
3300019886Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2EnvironmentalOpen in IMG/M
3300019998Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m1EnvironmentalOpen in IMG/M
3300020001Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2EnvironmentalOpen in IMG/M
3300020015Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m1EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021078Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_coex redoEnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021344Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2EnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021560Tropical forest soil microbial communities from Panama - MetaG Plot_4EnvironmentalOpen in IMG/M
3300022694Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_coexEnvironmentalOpen in IMG/M
3300022756Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_5_b1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025885Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025999Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026351Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-05-BEnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026446Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-11-BEnvironmentalOpen in IMG/M
3300026481Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-AEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026507Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-BEnvironmentalOpen in IMG/M
3300026535Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027388Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM2_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300027616Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028793Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_159EnvironmentalOpen in IMG/M
3300028811Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_149EnvironmentalOpen in IMG/M
3300028814Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_183EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028878Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_117EnvironmentalOpen in IMG/M
3300030619Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (Novaseq)EnvironmentalOpen in IMG/M
3300031248 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032770Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]



Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any features of the restricted sequences listed below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
JGI10214J12806_1111015713300000891SoilMCIVRAASRTIIALCCLVAPLAPPASAEAAWVLWLGTGTTYTPFGAYGGATGEKDCKEAAAQLMTDMKKDAKQLGEFLRSSSRYLCLPDTVDPRGPKSK*
JGI12053J15887_1033082823300001661Forest SoilMRAVIASILLLLVSVSVAHAQGAWVLWLATGTTYTPFGAYGGAIGEKECKEAVAQVMTEMSKDAKQMTEFLKASSRYLCLPDTVDPRGPKGAK*
JGI12053J15887_1052930013300001661Forest SoilMRAVIASILLLLVIVSVAHAEGAWVLWLGTGTTYTPFGAYGGNTGEKECKEAATQLMTDMSKDAKQMSEFLKASSRYLCLPDTVDPRGPKGMK*
C687J26615_1005380213300002121SoilMTHPMTSVLVVLCWLLAFATSAHAECAWVLWLGTGSTYTPFGAFGGNTAEKDCKESATQLMTDMRKNPKQLGEFLKSSSRYL
C687J26623_1001501733300002122SoilMTHPMTSVLVVLCWLLAFATSAHAECAWVLWLGTGSTYTPFGAFGGNTAEKDCKESATQLMTDMRKNPKQLGEFLKSSSRYLCLPDTVDPRGPKGK*
JGIcombinedJ26739_10122475513300002245Forest SoilVDADAVGVPERRLSRVTPLLALLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK*
Ga0055435_1006684033300003994Natural And Restored WetlandsMTAAISARILLLMCCLLALATSAAAECAWVLWLGTGSTYTPFGAYGGSNGEKTCQETAAQLTTGVVKDPTQRTEFLKSSSRYLCLPDTIDPRGPKGK*
Ga0063356_10101669323300004463Arabidopsis Thaliana RhizosphereRTIIALCCLVAPLAPPASAEAAWVLWLGTGTTYTPFGAYGGATGEKDCKEAAAQLMTDMKKDARQLGEFLRSSSRYLCLPDTVDPRGPKSK*
Ga0066680_1026200933300005174SoilVDPDAVRVSERRLTRATPLLALLGLLALATSASAVDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0065705_1032112123300005294Switchgrass RhizosphereMIAAGYAGKGTLVLLGLLFVSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0065707_1057331223300005295Switchgrass RhizosphereMIPAVYTRTMVAVLCCVLAFTTSASAESAWVLWFGSGTTYIAFGAYGGATGEKECKEAAAQLMASMSKDGKQLTEFLKSSSRYVCLPDTVDPRGPKAK*
Ga0066388_10019267633300005332Tropical Forest SoilMITIRRLLLVSFGLLAAATPARADCAWVLWLGTGSAYTPFGAYGASTGERDCKEAAAQLMTELQKDAKQLREFLKSSSRY
Ga0066388_10140838023300005332Tropical Forest SoilMITIRCLLFVSLGLLAAATPARADCAWVLWLGTGSTYTPFGAYGASTGERDCKEAAAQLMTDLQKDAKQLREFLKSSSRYVCLPDTVDPRGPKAK*
Ga0070703_1015803613300005406Corn, Switchgrass And Miscanthus RhizosphereVDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0070703_1054934313300005406Corn, Switchgrass And Miscanthus RhizosphereMRVVIASILLLLVIVSVARAECAWVLWLGIGTTYTAFGAYGANAGERECKEAITQLMTDMRKDAKQLGEFLRSSSRYLCLPDTVDPRGPKEKGAK*
Ga0070705_10009687423300005440Corn, Switchgrass And Miscanthus RhizosphereVDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0070708_10007309923300005445Corn, Switchgrass And Miscanthus RhizosphereVDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGTKGK*
Ga0070706_10003070363300005467Corn, Switchgrass And Miscanthus RhizosphereMRVVFALICVLALATFASAESAWVLWLGTGTRYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0070706_10004007923300005467Corn, Switchgrass And Miscanthus RhizosphereMTIRWPSIVVTLLCCLLAFTTSASAECAWVLWLGTGASYTPFGAYGGTTGEKEFQEASTQLMTGMKKNSKELAEFLKSSSRYLCLPDTVDPRGPKVK*
Ga0070706_10033648413300005467Corn, Switchgrass And Miscanthus RhizosphereVDPDAVRVSERSLTRATPLLALLGLLALATSASAGDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0070698_10063626323300005471Corn, Switchgrass And Miscanthus RhizosphereKGTLVLLGLLFGSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0070698_10169480313300005471Corn, Switchgrass And Miscanthus RhizosphereMRVLFALICVLALATFAFAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRG
Ga0070697_10001092243300005536Corn, Switchgrass And Miscanthus RhizosphereMRVVIASILLLLVIVSVARAECAWVLWLGIGTTYTAFGAYGANAGERECKEAITQLMTDMRKDAKQLAEFLRSSSRYLCLPDTVDPRGPKEKGAK*
Ga0070696_10043656743300005546Corn, Switchgrass And Miscanthus RhizosphereLSRVTPLLALLGTLTLATVAAAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGPKRK*
Ga0070704_10003722123300005549Corn, Switchgrass And Miscanthus RhizosphereMIAARYAHKGTLVLLGLLFVSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0075298_100059823300005880Rice Paddy SoilMRDTRRVWTREDLLRTAIPRWRRVFLTALCCFLAPATSASAECAWVLWLGTGTGYTPFGAYGGATGEKDCKEASAQLVTAMKENPKALSEFLKSSSRYLCLPDTVDPRGPKRK*
Ga0075028_10062915013300006050WatershedsVDADAIGVPERRLSRVTPLLALLGVLTLPTSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK*
Ga0079220_1063280823300006806Agricultural SoilMITTRCVLLALLGLLAAATPARAECAWVLWLGTGSTYTPFGAYGPSTGERECKEAAAQLMTELRKDGKQLGEFLKSSSRYICLPDTVDPRGPKAK*
Ga0099791_10000228203300007255Vadose Zone SoilLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK*
Ga0099793_1006001043300007258Vadose Zone SoilLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0099829_10000663173300009038Vadose Zone SoilMRAVIASILLLLVIVSAAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG*
Ga0099829_1002021373300009038Vadose Zone SoilLTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0099830_10002643113300009088Vadose Zone SoilMRAVIASILLLLVIVSAAHAECAWVLWLGTGLTYTPFGACGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLYLPDTVDPRGPKGGKG*
Ga0099830_1002475863300009088Vadose Zone SoilLTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRRNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0099827_1012042833300009090Vadose Zone SoilMIAAIYARVLGAALCCLLALATSASAECAWVLWLGIGMTYTALGAYGANTSEKDCKEAVAQLMTDMRKDAKRLGEFLKSSSRYLCLPDTVDPRGPKGTK*
Ga0114129_1003002293300009147Populus RhizosphereMGHAAMIAASYAHKGTLVLLGLLFVSTSASAEGAWVLWLGTGTTYTPFGAYGGNMGEKDCKEAAAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0105241_1193990023300009174Corn RhizosphereMSTVRAASRTVAALCFVIALALPASAESAWVLWLGTGTTYTPFGAYGGATGEKDCKEAAAQLMTDMKKDARQLGEFLRSSSRYLCLPDTVDPRGPKSK*
Ga0126376_1026803833300010359Tropical Forest SoilMITIRRLLVVILGLVAAATPARADCAWVLWLGTGSTYTPFGAYGPSTGERDCKEAAAQLMTEMRKDAKQLREFLKSSSRYVCLPDTVDPRGPKAK*
Ga0126372_1029478023300010360Tropical Forest SoilMITIRCLLFVSLGLLATATPARADCAWVLWLGTGSTYTPFGAYGASTGERDCKEAAAQLMTDLQKDAKQLREFLKSSSRYVCLPDTVDPRGPKAK*
Ga0126372_1294060813300010360Tropical Forest SoilMITIRRLLLVSFGLLAAATPARADCAWVLWLGTGSAYTPFGAYGASTGERDCKEAAAQLMTELQKDAKQLREFLKSSSRYICLPDTVDPRGPKAK*
Ga0126379_1068304413300010366Tropical Forest SoilMITIRRLLVVILGLLAAATPARADCAWVLWLGTGSTYTPFGAYGPSTGERDCKEAAAQLMTELQKDAKQLREFLKSSSRYVCLPDTVDPRGPKAK*
Ga0134122_1059711613300010400Terrestrial SoilMSIARAASRTITVLCGLLAVVAPASAESAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAAAQLMADMKKDAKQLGEFLRSSSRYLCLPDTVDPRGPKPK*
Ga0137338_114617223300012174SoilAQALPQTELLPMTRRWPGIVVAILCGLLTLTTSSSAESAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAATQLMADMKKDTKQLSEFLKSSSRYLCLPETVDPRGPKGK*
Ga0137388_1013371013300012189Vadose Zone SoilVDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK*
Ga0137383_1049991643300012199Vadose Zone SoilLTRATPLLALLGLLALATSASAVDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK
Ga0137362_1094324623300012205Vadose Zone SoilLTRATPLLALLGLLALATSASAEDAWVLWLGTGTAYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0137381_1099397613300012207Vadose Zone SoilLTRATPLLALLGLLALATSASAVDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPAT
Ga0137379_1034261123300012209Vadose Zone SoilLTRATPLLALLGLLALATSASAVDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0137377_10003324203300012211Vadose Zone SoilDAVRVSERLTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK*
Ga0137375_1044940823300012360Vadose Zone SoilMGHAAMIAASYARKGTLVLLGLLVSSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAAAQLMTSMSKDAKQLADFLKASSRYLCLPDTVDPRGPKGTK*
Ga0137360_1066072733300012361Vadose Zone SoilLTRATPPLALLGLLTLATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK*
Ga0137358_1002802663300012582Vadose Zone SoilVDPDAVRVSERRLTRATPPLALLGLLTLATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0137396_1067337313300012918Vadose Zone SoilMRVVIASILLLLIIVSVAHAEGAWVLWLGTGTTYTPFGAYGGNTGEKECKEAATQLMTDMRKDAKQMTEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0137359_1061271923300012923Vadose Zone SoilLTRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKDTAQLNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0137404_1001226083300012929Vadose Zone SoilMIAAIYARIVGPVLCWLLAFTTSASAEGAWVLWLGTGTIYTPFGAYGGNTGEKDCKEAAAQLMTSMGKDAKQMSDFLKASSRYLCLPEAVDPRGPKGTK*
Ga0137404_1128229813300012929Vadose Zone SoilVDPDAVRVSERRLTRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0137407_1003617813300012930Vadose Zone SoilMGHAAMIAASYARKGTLVLLGLLFGSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0137410_1001776953300012944Vadose Zone SoilLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0157378_1302458813300013297Miscanthus RhizosphereGLLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK*
Ga0180094_100944423300014881SoilMPQTELLPMTRRRPGIVVAILCGLLTLTTSSSAESAWVLCLGTGTTYTPFGAYGGNTGEKDCKEAATQLMADMKKDTKQLSEFLKSSSRYLCLPETVDPRGPKGK*
Ga0180104_104619233300014884SoilMPQTELLPMTRRWPGIVVAILCGLLTLTTSSSAESAWVLCLGTGTTYTPFGAYGGNTGEKDCKEAATQLMADMKKDTKQLSEFLKSSSRYLCLPETVDPRGPKGK*
Ga0180063_105725523300014885SoilMPQTELLPMTRRWPGIVVAILCGLLTLTTSSSAESAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAATQLMADMKKDTKQLSEFLKSSSRYLCLPETVDPRGPKGK*
Ga0120098_101961723300015170FossillVSTSVVHAECAWVLWLSTGTTYTPIVAYGGNTGEKECKEAVAQLMTDMSKDAKQFGKFLKASSRYLCLPDNVQPRGPKGTK*
Ga0137409_1022788123300015245Vadose Zone SoilMRVVIASILLLLIIVSVAHAEGAWVLWLGTGTTYTPFGAYGGNTGEKECKEAATQLMTDMSKDAKQMTEFLKASSRYLCLPDTVDPRGPKGTK*
Ga0137403_1118465413300015264Vadose Zone SoilLTRATPPLALLGLLALATFASAESAWVLRLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKGK*
Ga0187775_1000495313300017939Tropical PeatlandMTTRWPSVVVTTLCCLLALAPSAAAECAWVLWFGTGTGFTPMRAYGSSTGEKACQAASAQMLADLQKDPKQLTEFLKSSSRYICLPDTVDPRGPKGK
Ga0184610_112717823300017997Groundwater SedimentMRAIIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184604_1006849513300018000Groundwater SedimentMIILASILLLLVSAPAAHADCAWVLWLGTGTTYTPFVAYGANTGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGQKGTK
Ga0184608_1011662923300018028Groundwater SedimentMIILASILLLLVSAPAAHAECAWVLWLSTGTTYTPFVAYGANTGEKECKDAVAQLMTDMSNDAMKLSEFLKASSRYVCLPDTVEPRGPKGTK
Ga0184608_1024721513300018028Groundwater SedimentPRLRPRMGHTAMIAASYAHKGTLVLLGLLSVSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAFAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK
Ga0184608_1049078323300018028Groundwater SedimentKDPMRAVIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGKFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184634_1004552123300018031Groundwater SedimentMRAVIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0187788_1024987823300018032Tropical PeatlandALAPSAAAECAWVLWFGTGTGFTPMRAYGSSTGEKACQAASAQMLADLQKDPKQLTEFLKSSSRYICLPDTVDPRGPKGK
Ga0184638_102948613300018052Groundwater SedimentLLVIVSVAHAECAWVLWLGTGLTYTPFGAHGANTGEKDCKEAVPQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184638_104675443300018052Groundwater SedimentDPMRAVIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGASTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184626_1002910143300018053Groundwater SedimentMRAIIASILLLLGIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184619_1007093933300018061Groundwater SedimentMIILASILLLLVSAPAAHAECAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSNDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0187773_1000969633300018064Tropical PeatlandMTTRWPSVVVTTLCCLLALAPSAAAECAWVLWFGTGTGFTPMGAYGSSTGEKACQAASAQMLADLQKDPKQLTEFLKSSSRYICLPDTVDPRGPKGK
Ga0184617_111094523300018066Groundwater SedimentMIILASILLLLVSAPAAHADCAWVLWLGTGTTYTPFVAYGANTGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGAKGTK
Ga0184618_1010977433300018071Groundwater SedimentMIILASILLLLVSAPAAHAECAWVLWLGTGTTYTPFVAYGANTGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGSKGTK
Ga0184618_1038658113300018071Groundwater SedimentMRAVLASILLLLVIVSVAHAEGAWVLWLGTGTTYTPFGAYGGNTGENECKEAATQLMIDMSKDAKQMSEFLKASSRYLCLPDTVDPRGPKGTK
Ga0184632_1048683913300018075Groundwater SedimentMRVIIASILLLLVIVSVAHAECAWVLWLGTGMTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0184609_1017259623300018076Groundwater SedimentMIILASILLLLVSAPAAHAECAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0184629_1007163133300018084Groundwater SedimentMRAVIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMMDMSKNAKQLAEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0187774_1104147413300018089Tropical PeatlandTTLCCLLALAPSAAAECAWVLWFGTGTGFTPMGAYGSSTGEKACQAASAQMLADLQKDPKQLTEFLKSSSRYICLPDTVDPRGPKGK
Ga0190265_1009605033300018422SoilVTTRLPGIAIAILSGLLAVTTSSSAQSAWVLWLGTGTTYTPFGAYGGATGEKDCKEAANQLMADMRKDAKQLGEFLKSSSRYLCLPETVDPRGPKPK
Ga0187892_10006853143300019458Bio-OozeMKAVIASILLLLVIVSAAHAECAWVLWLGTGTTYTPFGAYGASTGEQACKEAVTQLMTAMRKDSKQLTEFLKSSSRYLCLPDTVDPRGPKERGAK
Ga0187893_1026025023300019487Microbial Mat On RocksMPTSSMITRSPGIIVLILCGLLALATPASAESAWVLWLGTGTTYTPFGAYGGATGEKDCKEAANQLMTEMRKDARLLGEFLKSSSRYICLPETVDPRGPKAK
Ga0137408_101716313300019789Vadose Zone SoilMIAASYARKGTLVLLGLLFGSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK
Ga0193723_103607733300019879SoilMIPVVYTRTMVAVLCCVLAFTTSASAESAWVLWFGTGTTYTAFGAYGGATGEKECKEAAAQLMADMSKDGKRLAEFLKSSSRYVCLPDTVDPRGPKAK
Ga0193707_103103013300019881SoilRHRGHRVDADAVGVPERRLSRVTPLLALLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0193725_102813343300019883SoilMIPAVYTRTMVAVLCCVLAFTTSASAESAWVLWFGTGTTYTAFGAYGGATGEKECKEAAAQLMADMSKDGKRLAEFLKSSSRYVCLPDTVDPRGPKGK
Ga0193725_103516213300019883SoilMRAVIASILLLLVIVSAAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0193747_114303813300019885SoilVDADAVGVPERRLSRVTPLLALLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0193727_115764523300019886SoilMIILASILLLLISAPAAHADCAWVLWLGTATTYTPFVAYGANTGEKECKDAVAQLMTDMSNDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0193710_101794913300019998SoilMIPVVYTRTMVAVLCCVLAFTTSASAESAWVLWLGTGTTYTALGAYGGATGERECKEAAAQLMANMSKDGKQLTEFLKSSSRYVCLPDTVDPRGPKAK
Ga0193731_109787113300020001SoilAQARRHRGHRLDADAVGVPERRLSRVTPLLALLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0193730_107542213300020002SoilMIILASILLLLVSAPAAHADCAWVLWLETGTTYTPFVAYGANTGEKECKDAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGPKGTK
Ga0193755_100084483300020004SoilMIPAVYTRTMVAVLCCVLAFTTSASAESAWVLWFGTGTTYTAFGAYGGATGEKECKEAAAQLMADMSKDGKRLAEFLKSSSRYVCLPDTVDPRGPKAK
Ga0193755_116003113300020004SoilMIILASILLLLVSAPAAHAECAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGLKGTK
Ga0193734_104939713300020015SoilVDADAVGVPERRLSRITPLLALLGGLTLATSASADGAWVLWLGTGTTYTPFGAYGGVSGERECKESVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRAPKGK
Ga0193726_112604833300020021SoilRVDTDAVGVPERRVSRVTPLLALLGWLAIATSASAEGAWVLWLGTGTTYTPFGAFGGATGERECKESVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGPKGK
Ga0210378_1019680613300021073Groundwater SedimentILLLLVNVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0210381_1020799723300021078Groundwater SedimentMIILASILLLLVSAPAAHAECAWVLWLGTGTTYTPFVAYGANTGEKECKDAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0210382_1055452113300021080Groundwater SedimentMIILASILLLLVSAPAAHADCAWVLWLETGTTYTPFVAYGANTGEKECKEAVAQLMTDMSTDAMKLSEFLKASSRYVCLPD
Ga0179596_1006889133300021086Vadose Zone SoilVDPDAVRVSERRLTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0210404_1067703513300021088SoilMTRRSPLIGVAALGCLWAAVTSASAECAWVLWLGTGTTYTPFGAYGGSTGEKTCQEAAAQLVASMGKDPKQLTEFLKSSSRYLCLPDTVDPRRPKETK
Ga0193719_100350901 3300021344 Soil MIILASILLLLVSAPAAHADCAWVLWLETGTTYTPFVAYGANTGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0193719_102720981 3300021344 Soil AKGQREQMIPVVYTRTMVAVLCCVLAFTTSASAESAWVLWFGTGTTYTAFGAYGGATGEKECKEAAAQLMADMSKDGKRLAEFLKSSSRYVCLPDTVDPRGPKAK
Ga0210384_100946343 3300021432 Soil MVAALARTLTGQCIPLVLLCLLVLTTTASADGAWVLWLGTGTGYTPFGAYGGATGEKDCKEASAQLMTSMKENPKALSEFLKSSSRYICLPDTIDPRGPKGK
Ga0126371_115039371 3300021560 Tropical Forest Soil MITIRRLLVEILGLLAAATPARADCAWVLWLGTGSAYTPFGAYGASTGERDCKEAAAQLMTELQKDAKQLREFLKSSSRYICLPDTVDPRGPKVK
Ga0222623_103600172 3300022694 Groundwater Sediment MIILASILLLLVSAPAAHAECAWVLWLGTRTTYTPFVAYGANTGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGPKETK
Ga0222622_108000471 3300022756 Groundwater Sediment PRLRPRMGHTAMIAASYAHKGTLVLLGLLSVSTSASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGK
Ga0209640_100272306 3300025324 Soil MTHPMTSVLVVLCWLLAFATSAHAECAWVLWLGTGSTYTPFGAFGGNTAEKDCKESATQLMTDMRKNPKQLGEFLKSSSRYLCLPDTVDPRGPKGK
Ga0207653_100760131 3300025885 Corn, Switchgrass And Miscanthus Rhizosphere MRVVIASILLLLVIVSVARAECAWVLWLGIGTTYTAFGAYGANAGERECKEAITQLMTDMRKDAKQLGEFLRSSSRYLCLPDTVDPRGPKEKGAK
Ga0207684_100118996 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere MRVVFALICVLALATFASAESAWVLWLGTGTRYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK
Ga0207684_100735375 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere VDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKGK
Ga0207684_102293713 3300025910 Corn, Switchgrass And Miscanthus Rhizosphere VDPDAVRVSERSLTRATPLLALLGLLALATSASAGDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRKNTTQLSEFLKSSSRYICLPDTVDPRGPKGK
Ga0207646_100811173 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere VDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQLNEFLKSSSRYICLPDTLDPRGPKGK
Ga0207646_101463971 3300025922 Corn, Switchgrass And Miscanthus Rhizosphere MTTRWPSIVVTLLCCLLAFTTSASAECAWVLWLGTGASYTPFGVYGGTTGEKEFQEASTQLMTGMKKNSKELAEFLKSSSRYLCLPDTVDPRGPTVK
Ga0208417_1056792 3300025999 Rice Paddy Soil MIGRSPCILVTTLCCVLALATSASAECAWVLWLGTGTGYTPFGAYGGATGEKDCKEASAQLVTAMKENPKALSEFLKSSSRYLCLPDTVDPRGPKRK
Ga0257162_10001689 3300026340 Soil VDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0257170_10032935 3300026351 Soil PLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0257179_10010973 3300026371 Soil MRAVIASILLLLVIVSAAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNSKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0257179_10139372 3300026371 Soil VDPDAVRVSERRLTRATPLLALLGLLALATSASAEDAWVLWLGTGTTYTPFGAYGGQTGERECKEAVAQLMTEMRRNTTQLSEFLKSSSRYICLPDTVDPRGPKGK
Ga0257178_10416381 3300026446 Soil MRAVIASILLLLVIVSVAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNSKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0257155_10415171 3300026481 Soil EQARRHRGHRVDADAVGVPERRLSRVTPLLALLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0257155_10629372 3300026481 Soil PDAVRVSERRLTRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0257164_10048972 3300026497 Soil VDPDAVRVSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAVAQLMTEMRRNTTQLSEFLKSSSRYICLPDTLDPRGPKGK
Ga0257165_10093413 3300026507 Soil MRAVIASILLLLVIVSAAYAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNSKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0256867_101810261 3300026535 Soil MKVAIAGILLVLVIVSVAHAECAWVLWLGTGTTYTPFGAYGGNAGERECKAAATQLMTDMSKDAKQLAEFLKASSRYLCLPDTVEPRGPKGTK
Ga0179587_102849333 3300026557 Vadose Zone Soil VSERRLIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0179587_105188463 3300026557 Vadose Zone Soil VDADAVGVPERRLSRVTPLLAVLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSECLKSSSRYICLPDTVDPRGSKGK
Ga0208995_10586932 3300027388 Forest Soil MRAVIASILLLLVSVSVAHAQGAWVLWLATGTTYTPFGAYGGAMGEKECKEAVAQVMTEMSKDAKQMTEFLKASSRYLCLPDTVDPRGPKGK
Ga0209106_10563043 3300027616 Forest Soil VIVSVAHAEGAWVLWLGTGTTYTPFGAYGGNTGEKECKEAATQLMTDMSKDAKQMSEFLKASSRYLCLPDTVDPRGPKGMK
Ga0209180_103860762 3300027846 Vadose Zone Soil MTIRWPSIVVTLLCCLLAFTTSASAECAWVLWLGTGASYTPFGAYGGTTGEKEFQEASTQLMTGMKKNSKELAEFLKSSSRYLCLPDTVDPRGPKVK
Ga0209590_101131253 3300027882 Vadose Zone Soil AGYPSGRAFRASDPAMIAAIYARVLGAALCCLLALATSASAECAWVLWLGIGMTYTALGAYGANTSEKDCKEAVAQLMTDMRKDAKRLGEFLKSSSRYLCLPDTVDPRGPKGTK
Ga0209488_108337462 3300027903 Vadose Zone Soil LIRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAIAQLMTEMRKNTTQQNEFLKSSSRYICLPDTLDPRGPKAK
Ga0209488_109306402 3300027903 Vadose Zone Soil VPEEVGEDPMRAVIASILLLLVIVSAAHAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0209526_103321251 3300028047 Forest Soil GEEEKARRHRGHRVDADAVGVPERRLSRVTPLLAVLGVLTLATSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0257175_10127761 3300028673 Soil VDPDAVRVSERRLTRATPPLALLGLLALATFASAESAWVLWLGTGTGYTPFGAYGGQTGERECKEAVAQLMTEMRRNTTQLSEFLKSSSRYICL
Ga0307299_100640192 3300028793 Soil MIILASILLLLVSAPAAHAECAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGSKGTK
Ga0307292_104967331 3300028811 Soil MIILASILLLLVSAPAAHAECAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVE
Ga0307302_102942032 3300028814 Soil MIILASILLLLVSAPAAHADCAWVLWLETGTTYTPFVAYGANAGEKECKEAVAQLMTDMSKDAMKLSEFLKASSRYVCLPDTVEPRGLKGTK
Ga0307310_103303711 3300028824 Soil MDADAVGVPERRLSRITPLLALLGGLTLATSASADGAWVLWLGTGTTYTPFGAYGGVSGERECKESVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRAPKGK
Ga0307312_104935351 3300028828 Soil MRAVIASILLLLVIVSAAYAECAWVLWLGTGLTYTPFGAYGANTGEKDCKEAVTQLMTDMSKNAKQLGEFLKSSSRYLCLPDTVDPRGPKGGKG
Ga0307278_104368722 3300028878 Soil PPRLRPRVGHAAMIAASYAREGTLVLLGLLFVSTPASAEGAWVLWLGTGTTYTPFGAYGGNTGEKDCKEASAQLMTSMSKDAKQLAEFLKASSRYLCLPDTVDPRGPKGTK
Ga0268386_104026082 3300030619 Soil MTILASILLLLVIVSVARAQSAWVLWLGTGTTYTPFGAYGGNTAEKDCKEAATQLMTNMGKDAKQMSEFLKASSRYLCLPEAIDPRGPKGTK
(restricted) Ga0255312_10064863 3300031248 Sandy Soil MTHPMTSALVALCCLLAFATSAQAECAWVLWLGTGSTYTPFGAFGGNTAEKDCKESATQLMTDMRKNPKQLGEFLKSSSRYLCLPDTVDPRGPKGK
Ga0307469_100213832 3300031720 Hardwood Forest Soil MSIARAASRTITVLCGLLAVVAPASAESAWVLWLGTGTTYTPFGAYGGNTGEKDCKEAAAQLMADMKKDAKQLGEFLRSSSRYLCLPDTVDPRGPKPK
Ga0307469_104025792 3300031720 Hardwood Forest Soil MIPAVYTRTMVAVLCCVLAFTTSASAESAWVLWFGSGTTYVAFGAYGGATGEKECKEAAAQLMASMSKDGKQLAEFLKSSSRYVCLPDTVDPRGPKAK
Ga0307470_102988662 3300032174 Hardwood Forest Soil VDADAVGVPERRLSRVTPLLALLGVLTLPTSASAEGAWVLWLGTGTTYTPFGAYGGATGERECKEAVAQLMTEMRKNSTQLSEFLKSSSRYICLPDTVDPRGSKGK
Ga0307471_1003832932 3300032180 Hardwood Forest Soil MIILASLMLVLISTSVARAECAWVLWLGTGTTYTPFGAYGGNTGERECKEAVTQLMTDMRKDSKQLGEFLKASSRYLCLPDTVEPRGPKGLK
Ga0335085_125296172 3300032770 Soil MLLALFCLLALATSASAESAWVLWLGTGTTYTPFGAYGGQTGEKECKEAATQLMTEMNKDTKQLSEFLKAGSRYICLPDTVDPRGPKGK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"