NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F071279

Metagenome Family F071279

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F071279
Family Type Metagenome
Number of Sequences 122
Average Sequence Length 129 residues
Representative Sequence VTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYY
Number of Associated Samples 98
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 70.49 %
% of genes near scaffold ends (potentially truncated) 99.18 %
% of genes from short scaffolds (< 2000 bps) 95.08 %
Associated GOLD sequencing projects 93
AlphaFold2 3D model prediction Yes
3D model pTM-score0.41

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (66.393 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil
(13.115 % of family members)
Environment Ontology (ENVO) Unclassified
(39.344 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(43.443 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 12.42%    β-sheet: 46.58%    Coil/Unstructured: 40.99%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.41
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 122 Family Scaffolds
PF08713DNA_alkylation 5.74
PF13672PP2C_2 4.10
PF13602ADH_zinc_N_2 3.28
PF07228SpoIIE 2.46
PF00582Usp 1.64
PF04909Amidohydro_2 1.64
PF12704MacB_PCD 0.82
PF02518HATPase_c 0.82
PF09411PagL 0.82
PF00155Aminotran_1_2 0.82
PF00174Oxidored_molyb 0.82
PF04237YjbR 0.82
PF00682HMGL-like 0.82
PF01261AP_endonuc_2 0.82
PF13466STAS_2 0.82
PF00270DEAD 0.82
PF08448PAS_4 0.82
PF00106adh_short 0.82

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 122 Family Scaffolds
COG49123-methyladenine DNA glycosylase AlkDReplication, recombination and repair [L] 5.74
COG2041Molybdopterin-dependent catalytic subunit of periplasmic DMSO/TMAO and protein-methionine-sulfoxide reductasesEnergy production and conversion [C] 0.82
COG2315Predicted DNA-binding protein with ‘double-wing’ structural motif, MmcQ/YjbR familyTranscription [K] 0.82
COG3915Uncharacterized conserved proteinFunction unknown [S] 0.82


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A66.39 %
All OrganismsrootAll Organisms33.61 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2088090005|LWSO_GLQAYWI02HUXADNot Available504Open in IMG/M
3300000033|ICChiseqgaiiDRAFT_c0462473Not Available583Open in IMG/M
3300000956|JGI10216J12902_100830013All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium804Open in IMG/M
3300000956|JGI10216J12902_116094178Not Available529Open in IMG/M
3300000956|JGI10216J12902_121409648Not Available614Open in IMG/M
3300003319|soilL2_10293383Not Available1245Open in IMG/M
3300004153|Ga0063455_101489898Not Available527Open in IMG/M
3300004157|Ga0062590_101463896Not Available683Open in IMG/M
3300004463|Ga0063356_104913507Not Available574Open in IMG/M
3300004480|Ga0062592_101237617Not Available700Open in IMG/M
3300005093|Ga0062594_101696015All Organisms → cellular organisms → Bacteria660Open in IMG/M
3300005093|Ga0062594_102351645Not Available581Open in IMG/M
3300005295|Ga0065707_10474741Not Available736Open in IMG/M
3300005338|Ga0068868_101231222Not Available693Open in IMG/M
3300005354|Ga0070675_100698131Not Available924Open in IMG/M
3300005441|Ga0070700_101218042Not Available629Open in IMG/M
3300005444|Ga0070694_100283295All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1264Open in IMG/M
3300005456|Ga0070678_101999771Not Available548Open in IMG/M
3300005457|Ga0070662_101390651Not Available604Open in IMG/M
3300005459|Ga0068867_101963587Not Available553Open in IMG/M
3300005526|Ga0073909_10206964All Organisms → cellular organisms → Bacteria → Acidobacteria851Open in IMG/M
3300005545|Ga0070695_100701681Not Available803Open in IMG/M
3300005617|Ga0068859_101660253Not Available706Open in IMG/M
3300005719|Ga0068861_100239462All Organisms → cellular organisms → Bacteria → Acidobacteria1543Open in IMG/M
3300005833|Ga0074472_10797073Not Available610Open in IMG/M
3300005842|Ga0068858_100329932All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Comamonadaceae → Ramlibacter → Ramlibacter agri1459Open in IMG/M
3300005842|Ga0068858_101745196Not Available615Open in IMG/M
3300006358|Ga0068871_102409674Not Available502Open in IMG/M
3300006755|Ga0079222_10883862Not Available746Open in IMG/M
3300006847|Ga0075431_100367178All Organisms → cellular organisms → Bacteria → Acidobacteria1445Open in IMG/M
3300006847|Ga0075431_101627076All Organisms → cellular organisms → Bacteria603Open in IMG/M
3300009094|Ga0111539_11570656Not Available763Open in IMG/M
3300009100|Ga0075418_10804440All Organisms → cellular organisms → Bacteria → Acidobacteria1015Open in IMG/M
3300009609|Ga0105347_1240361Not Available741Open in IMG/M
3300010037|Ga0126304_10272721All Organisms → cellular organisms → Bacteria → Acidobacteria1117Open in IMG/M
3300010048|Ga0126373_13231327Not Available507Open in IMG/M
3300010399|Ga0134127_10959007All Organisms → cellular organisms → Bacteria913Open in IMG/M
3300010399|Ga0134127_13659234Not Available504Open in IMG/M
3300010403|Ga0134123_11933073Not Available647Open in IMG/M
3300011441|Ga0137452_1153281Not Available775Open in IMG/M
3300012166|Ga0137350_1101364Not Available583Open in IMG/M
3300012232|Ga0137435_1007685All Organisms → cellular organisms → Bacteria3035Open in IMG/M
3300012885|Ga0157287_1114252Not Available517Open in IMG/M
3300012948|Ga0126375_10221096All Organisms → cellular organisms → Bacteria1260Open in IMG/M
3300012955|Ga0164298_10884494Not Available648Open in IMG/M
3300012960|Ga0164301_11674489Not Available530Open in IMG/M
3300012971|Ga0126369_10107433All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2541Open in IMG/M
3300013306|Ga0163162_13376866Not Available510Open in IMG/M
3300013308|Ga0157375_11383885Not Available829Open in IMG/M
3300014326|Ga0157380_10076492All Organisms → cellular organisms → Bacteria2724Open in IMG/M
3300014326|Ga0157380_10983775All Organisms → cellular organisms → Bacteria → Acidobacteria876Open in IMG/M
3300014876|Ga0180064_1108020Not Available588Open in IMG/M
3300014879|Ga0180062_1118290Not Available608Open in IMG/M
3300015255|Ga0180077_1047988Not Available842Open in IMG/M
3300015257|Ga0180067_1163106Not Available510Open in IMG/M
3300015373|Ga0132257_103981114Not Available537Open in IMG/M
3300015373|Ga0132257_104249744Not Available521Open in IMG/M
3300015374|Ga0132255_101071144Not Available1211Open in IMG/M
3300015374|Ga0132255_103192583Not Available699Open in IMG/M
3300015374|Ga0132255_104624338Not Available583Open in IMG/M
3300016445|Ga0182038_11674223Not Available573Open in IMG/M
3300017965|Ga0190266_10816703All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium600Open in IMG/M
3300017965|Ga0190266_10937582Not Available572Open in IMG/M
3300018422|Ga0190265_10807683All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Suessiales → Symbiodiniaceae → Symbiodinium → Symbiodinium microadriaticum1061Open in IMG/M
3300018469|Ga0190270_12175702Not Available615Open in IMG/M
3300018476|Ga0190274_12497120Not Available614Open in IMG/M
3300018481|Ga0190271_13264825Not Available544Open in IMG/M
3300020000|Ga0193692_1064720Not Available814Open in IMG/M
3300020005|Ga0193697_1085150Not Available766Open in IMG/M
3300020020|Ga0193738_1023898Not Available1917Open in IMG/M
3300024232|Ga0247664_1138580Not Available569Open in IMG/M
3300025923|Ga0207681_10461684All Organisms → cellular organisms → Bacteria1034Open in IMG/M
3300025923|Ga0207681_10691842All Organisms → cellular organisms → Bacteria847Open in IMG/M
3300025923|Ga0207681_11053247Not Available683Open in IMG/M
3300025930|Ga0207701_10244465All Organisms → cellular organisms → Bacteria1567Open in IMG/M
3300025930|Ga0207701_10758923All Organisms → cellular organisms → Bacteria819Open in IMG/M
3300025934|Ga0207686_11847702Not Available500Open in IMG/M
3300025937|Ga0207669_11181145Not Available648Open in IMG/M
3300025938|Ga0207704_10431365All Organisms → cellular organisms → Bacteria → Acidobacteria1047Open in IMG/M
3300025941|Ga0207711_10695452All Organisms → cellular organisms → Bacteria948Open in IMG/M
3300025961|Ga0207712_11709489Not Available564Open in IMG/M
3300025972|Ga0207668_11528421Not Available602Open in IMG/M
3300026023|Ga0207677_12130389Not Available522Open in IMG/M
3300026035|Ga0207703_10529132All Organisms → cellular organisms → Bacteria1110Open in IMG/M
3300026035|Ga0207703_11035435Not Available788Open in IMG/M
3300026118|Ga0207675_100046768All Organisms → cellular organisms → Bacteria4043Open in IMG/M
3300026118|Ga0207675_100706777Not Available1017Open in IMG/M
3300027533|Ga0208185_1147912All Organisms → cellular organisms → Bacteria542Open in IMG/M
(restricted) 3300027799|Ga0233416_10340046Not Available517Open in IMG/M
3300027821|Ga0209811_10262742Not Available660Open in IMG/M
(restricted) 3300027995|Ga0233418_10300343Not Available559Open in IMG/M
3300028379|Ga0268266_11174329Not Available742Open in IMG/M
3300028380|Ga0268265_10067389All Organisms → cellular organisms → Bacteria2771Open in IMG/M
3300028380|Ga0268265_10798713All Organisms → cellular organisms → Bacteria919Open in IMG/M
3300028587|Ga0247828_10117677All Organisms → cellular organisms → Bacteria → Acidobacteria1285Open in IMG/M
3300028592|Ga0247822_11524246Not Available565Open in IMG/M
3300028812|Ga0247825_10132065All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → candidate division NC101705Open in IMG/M
3300028889|Ga0247827_10287161All Organisms → cellular organisms → Bacteria → Acidobacteria954Open in IMG/M
3300030336|Ga0247826_11384199Not Available568Open in IMG/M
3300031538|Ga0310888_10684945Not Available628Open in IMG/M
3300031547|Ga0310887_10138203All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes1262Open in IMG/M
3300031547|Ga0310887_10152516All Organisms → cellular organisms → Bacteria1213Open in IMG/M
3300031547|Ga0310887_10497923Not Available733Open in IMG/M
3300031562|Ga0310886_10367580All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium840Open in IMG/M
3300031740|Ga0307468_100879439All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium774Open in IMG/M
3300031740|Ga0307468_101107863Not Available706Open in IMG/M
3300031744|Ga0306918_11180334Not Available591Open in IMG/M
3300031770|Ga0318521_10317427All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales919Open in IMG/M
3300031796|Ga0318576_10630167Not Available505Open in IMG/M
3300031908|Ga0310900_10433666Not Available1006Open in IMG/M
3300031908|Ga0310900_10676126All Organisms → cellular organisms → Bacteria → Acidobacteria824Open in IMG/M
3300031908|Ga0310900_11767957Not Available526Open in IMG/M
3300031910|Ga0306923_10347233All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Bryobacterales1692Open in IMG/M
3300031944|Ga0310884_10942155All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium534Open in IMG/M
3300031947|Ga0310909_10048389All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes3253Open in IMG/M
3300032075|Ga0310890_11339874Not Available586Open in IMG/M
3300032075|Ga0310890_11760647Not Available515Open in IMG/M
3300032144|Ga0315910_11153244Not Available605Open in IMG/M
3300032180|Ga0307471_102909680Not Available608Open in IMG/M
3300032180|Ga0307471_103314904Not Available571Open in IMG/M
3300034147|Ga0364925_0325779Not Available577Open in IMG/M
3300034148|Ga0364927_0071328Not Available935Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural → Soil13.11%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil10.66%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere8.20%
SoilEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Soil7.38%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere4.10%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere4.10%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil3.28%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.28%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil3.28%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil3.28%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere3.28%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere3.28%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil2.46%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil2.46%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil2.46%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere2.46%
SedimentEnvironmental → Aquatic → Freshwater → Lake → Sediment → Sediment1.64%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil1.64%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Corn, Switchgrass And Miscanthus Rhizosphere1.64%
SedimentEnvironmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment1.64%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.64%
Switchgrass RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere1.64%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere1.64%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Lentic → Sediment → Freshwater Sediment0.82%
Sediment (Intertidal)Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal)0.82%
Serpentine SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Serpentine Soil0.82%
Switchgrass RhizosphereEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere0.82%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.82%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.82%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.82%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.82%
Switchgrass RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere0.82%
Miscanthus RhizosphereHost-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere0.82%
Miscanthus RhizosphereHost-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere0.82%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.82%
Miscanthus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere0.82%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere0.82%

Visualization
Powered by ApexCharts



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2088090005Sediment microbial communities from Lake Washington, Seattle, for Methane and Nitrogen Cycles, original sample replicate 1EnvironmentalOpen in IMG/M
3300000033Soil microbial communities from Great Prairies - Iowa, Continuous Corn soilEnvironmentalOpen in IMG/M
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300004153Grasslands soil microbial communities from Hopland, California, USA (version 2)EnvironmentalOpen in IMG/M
3300004157Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - - Combined assembly of AARS Block 2EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004480Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4EnvironmentalOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005338Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2Host-AssociatedOpen in IMG/M
3300005354Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M4-3 metaGHost-AssociatedOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005456Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaGHost-AssociatedOpen in IMG/M
3300005457Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaGHost-AssociatedOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005526Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1EnvironmentalOpen in IMG/M
3300005545Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaGEnvironmentalOpen in IMG/M
3300005617Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2Host-AssociatedOpen in IMG/M
3300005719Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2Host-AssociatedOpen in IMG/M
3300005833Microbial communities from Cathlamet Bay sediment, Columbia River estuary, Oregon - S.174_CBKEnvironmentalOpen in IMG/M
3300005842Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2Host-AssociatedOpen in IMG/M
3300006358Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2Host-AssociatedOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006847Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5Host-AssociatedOpen in IMG/M
3300009094Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)Host-AssociatedOpen in IMG/M
3300009100Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD2Host-AssociatedOpen in IMG/M
3300009609Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT890EnvironmentalOpen in IMG/M
3300010037Serpentine soil microbial communities from UC McLaughlin Reserve, CA, USA - Plot25EnvironmentalOpen in IMG/M
3300010048Tropical forest soil microbial communities from Panama - MetaG Plot_11EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010403Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3EnvironmentalOpen in IMG/M
3300011441Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT513_2EnvironmentalOpen in IMG/M
3300012166Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT660_2EnvironmentalOpen in IMG/M
3300012232Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT100_2EnvironmentalOpen in IMG/M
3300012885Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S104-311B-1EnvironmentalOpen in IMG/M
3300012948Tropical forest soil microbial communities from Panama - MetaG Plot_14EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012960Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MGEnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300013306Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaGHost-AssociatedOpen in IMG/M
3300013308Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M3-5 metaGHost-AssociatedOpen in IMG/M
3300014326Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S3-5 metaGHost-AssociatedOpen in IMG/M
3300014876Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200_16_10DEnvironmentalOpen in IMG/M
3300014879Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT45_16_10DEnvironmentalOpen in IMG/M
3300015255Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT466_16_10DEnvironmentalOpen in IMG/M
3300015257Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231_16_10DEnvironmentalOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300015374Col-0 rhizosphere combined assemblyHost-AssociatedOpen in IMG/M
3300016445Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108EnvironmentalOpen in IMG/M
3300017965Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 220 TEnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018469Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 320 TEnvironmentalOpen in IMG/M
3300018476Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 531 TEnvironmentalOpen in IMG/M
3300018481Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 TEnvironmentalOpen in IMG/M
3300020000Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3a1EnvironmentalOpen in IMG/M
3300020005Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L3m2EnvironmentalOpen in IMG/M
3300020020Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2a1EnvironmentalOpen in IMG/M
3300024232Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK05EnvironmentalOpen in IMG/M
3300025923Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025930Switchgrass rhizosphere bulk soil microbial communities from Kellogg Biological Station, Michigan, USA, with PhiX - S5 (SPAdes)EnvironmentalOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025937Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025938Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025941Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025972Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026023Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026035Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S1-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300027533Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700 (SPAdes)EnvironmentalOpen in IMG/M
3300027799 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MGEnvironmentalOpen in IMG/M
3300027821Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027995 (restricted)Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_1_MGEnvironmentalOpen in IMG/M
3300028379Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028587Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3EnvironmentalOpen in IMG/M
3300028592Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30EnvironmentalOpen in IMG/M
3300028812Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48EnvironmentalOpen in IMG/M
3300028889Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2EnvironmentalOpen in IMG/M
3300030336Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1EnvironmentalOpen in IMG/M
3300031538Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D1EnvironmentalOpen in IMG/M
3300031547Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D4EnvironmentalOpen in IMG/M
3300031562Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D3EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031744Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)EnvironmentalOpen in IMG/M
3300031770Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f17EnvironmentalOpen in IMG/M
3300031796Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24EnvironmentalOpen in IMG/M
3300031908Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - C24D1EnvironmentalOpen in IMG/M
3300031910Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2)EnvironmentalOpen in IMG/M
3300031944Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T60D1EnvironmentalOpen in IMG/M
3300031947Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000HEnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032144Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soilEnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300034147Sediment microbial communities from East River floodplain, Colorado, United States - 44_j17EnvironmentalOpen in IMG/M
3300034148Sediment microbial communities from East River floodplain, Colorado, United States - 18_j17EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID Sample Taxon ID Habitat Sequence
LWSO_066018202088090005Freshwater SedimentMELFLELNRLLTATVLLVMAPVAAAHAQGVEGTRSATIFVGTGVGLSGNAINEAVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVAGFPLLARLSNADAFDIEGGL
ICChiseqgaiiDRAFT_046247313300000033SoilLKTLRPWTSALLLVLVSVTSAAAQGVEGTRSVSVFVGTGVGLSGNAIEEAVGSIEGTPSVFVEQSIGNHFSDAFRLRFGGSYGLDYNKEIFATFANXXLXYNGTERIVGSVGGYPLYTRFSNADAFDIEGG
JGI10216J12902_10083001313300000956SoilMMWWTATNHGVILTFNRLLAAAALFVLIPLGTARAQGVEGMRSVTLFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYRKEVFATFAYGKYNGTERIVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGP
JGI10216J12902_11609417813300000956SoilLIWNRLLIAAVLLVITPVAAVHAQGVEGTRSATLFVGTGLSLSGNAINEGVGTIDGRPSVFVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIV
JGI10216J12902_12140964813300000956SoilVTLNRLLAATLLAVSGPIGTAHAQGVEGVRSATLSVGTGLSLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNKEVFATFGYGKYNGTERVVGSVSGYPLL
soilL2_1029338313300003319Sugarcane Root And Bulk SoilVTSPRLLTVTILLLIAPAAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIQGRPAVLVEQAMSNHFSDALRLRFTGALGLDYNKEVFATFAYGKYNGTHRIVGSVSGY
Ga0063455_10148989813300004153SoilMGVLSTRTRLLAAIIVSVTAPVTAARAQGVEGMRSATLFVGTGISLAGNVSNEGVGTIDGKPSVIVEQAFSNHFSDALRLRFTGSLGLDYNKEAFATLAYGKYNGTERIIG
Ga0062590_10146389613300004157SoilLTLKRCFAVTVLIALAPVAAARAQGVEGTRSATIFVGTGLSLSGTAINEGVGTIDGKPSVFVEQALSNHFSEGLRLRFTGSKGLDYNKEAFATLAYGNYNGTERIVGSVSGFPLLARL
Ga0063356_10491350713300004463Arabidopsis Thaliana RhizosphereLTLNRMLAASVLLIMAPVAAHAQGVEGTRSATLFVGTGLSLAGNAINEGVGSIDGRPSVLVEQALSNHFSDALKIRFTGSQGLDYNKEAFATFAFGKFNGTERIVGSVSGYP
Ga0062592_10123761713300004480SoilLKTLRPWTSALLLVLVSVTSTAAQGVEGTRSVSVFVGTGVGLTGNAIEEAVGSIEGTPSVFVEQSIGNHFSDAFRLRFGGSYGLDYNKEIFATFAYGRYNGTERIVGSVGGYPLYTRFSNADAFDIDGGLRYY
Ga0062594_10169601523300005093SoilLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVG
Ga0062594_10235164513300005093SoilLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATFAYGKYNGTERIVGSVSGF
Ga0065707_1047474113300005295Switchgrass RhizosphereMALAPVAAARAQGVEGTRSATIFVGTGLTLSGNAINEGVGTIDGKPSVLVEQGLSNHFSDALKIRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPL
Ga0068868_10123122213300005338Miscanthus RhizosphereLTLNRLFAATVLIAVAPLAAADAQGVEGTWSATIFVGTAVSLSGNAIEEGVGTINGNPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEVFATLAYGKYNGTERIVGSVSGFPLRARLSNADAFDV
Ga0070675_10069813113300005354Miscanthus RhizosphereLILNRLLAATVLLVLAPAAAAHAQGVEGTRSATIFVGTGISLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSKGLDYNKEAYATLAYGKYNGTER
Ga0070700_10121804213300005441Corn, Switchgrass And Miscanthus RhizosphereLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLTARFSNPDAFDIEGGLRYYLRP
Ga0070694_10028329513300005444Corn, Switchgrass And Miscanthus RhizosphereLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLTARFSNPDAFDIEGGLRYYL
Ga0070678_10199977113300005456Miscanthus RhizosphereLLLKRLLAATVLVVIASAPAAHAQGVEGQRSLTLFVGTGLSLAGNAINEAVGTIDGKPSVFVEQALSNHFSDGLKLRFSGGLGLDFNKEVFATFAYGKYNGTHRIVGSVSGYPLLARLSNADAFDFEGGLRYYLRPE
Ga0070662_10139065123300005457Corn RhizosphereLKLHRLVTVTVLLVLAPVAGARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTERIVGSVSGF
Ga0068867_10196358713300005459Miscanthus RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYNREVFATFAYGKYNGTERI
Ga0073909_1020696413300005526Surface SoilMMWQDIESRITGRPPVQPNGAFLTLKRLLAATVLLVVAPVAVARAQGVEGVRSVTLFVGTGLSLAGNAINEGVGTIDNKPSVIVEQSISNHFSDGLRLRFTGSTGLDYNKEAFVTFAYGKYNGTHRIV
Ga0070695_10070168113300005545Corn, Switchgrass And Miscanthus RhizosphereLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLTARFSNPDAFDIEGGLRYYLRPEGNLRTYVAGAAGL
Ga0068859_10166025313300005617Switchgrass RhizosphereLILNRLLAATVLLVLAPAAAAHAQGVEGTRSATIFVGTGISLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSKGLDYNKEAYATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLR
Ga0068861_10023946223300005719Switchgrass RhizosphereLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSIGLDYNKEAFATFAYGKYNGTERIVGSVSGFP
Ga0074472_1079707313300005833Sediment (Intertidal)MLRLLTIGVLVGLSPVAAAHAQGVEGTRSITGSIGTGLGLAGNAINEATGTIQGKPSVFVEQAMSNHYSDALRLKATYSMGLDYNKEVFGTFAYGKYNGTERLVGSVAGYPLYVRFQNADAIDLEGGLRYYLRPE
Ga0068858_10032993213300005842Switchgrass RhizosphereMPLRTPRRKTTRLAGSCFSHGVFLTLHRLFAATALIALVPVAAARAQGVEGTRSATIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDGLRLRFTGSKGLDYNKEAYLTLAYGKFNGTERIVGSVSGFPLLARLSNT
Ga0068858_10174519623300005842Switchgrass RhizosphereLPLKRLLAAAVLLVIAVIVPVGAAHAQGVEGTRSVTLFVGTGLSLAGNAINEAVGTIDGKPSVFVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTE
Ga0068871_10240967413300006358Miscanthus RhizosphereMAPAAAARAQGVEGVRSVTLFVGTGISLAGNAINEAVGTIDGKPSVFVEQALSNHFSDALKLRFTGSLGLDYNKEAFATFAYGKYNGTERIVGSVSGYPLLARLSNADAFDFEGGLRYYLRPEGPIRTYVA
Ga0079222_1088386213300006755Agricultural SoilVTLRRLLAAVLVASIPTAAHAQGVEGTRSISVNVGTALSLAGNAIEEGVGTIDGRPSVLVEQSLSNHFSDALKLRFTGSLGLDYRREVFATFGWGKYNGVVWLLKNHRDWIDAEYCVNEGGW
Ga0075431_10036717823300006847Populus RhizosphereLPLHRLFTATVLLVLAPVAAAHAQGVEGTRSATIFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSKGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYVA
Ga0075431_10162707613300006847Populus RhizosphereLNSTRLLTAGALLLLSTSVHAQGVEGTRSVSVFLGTGFGLSGNAIEEAVGTINGTPSVFVEQGIGNHFSDALRLRFVGSYGLDYNKEAFATFAYGRYNGTERIVGSVGGYPLYT
Ga0111539_1157065613300009094Populus RhizosphereLTLPRLLTVTVLLLITPVAVAHGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGALGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLTARFSNPDAFDVEG
Ga0075418_1080444023300009100Populus RhizosphereVFLPLHRLFTATVLLVLAPVAAAHAQGVEGTRSATIFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSKGLDYNKEAFATLAYGKYNGTERIVGSVS
Ga0105347_124036113300009609SoilLTFKQLLAVIVLFFVIAPVSAARAQGVEGIRSVTLSVGTGISLAGNAINEGVGTIDGKPAVLVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAGAAGLRFL
Ga0126304_1027272113300010037Serpentine SoilVTIFFGTGLRLAGNAINEGIGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYL
Ga0126373_1323132713300010048Tropical Forest SoilMAAVFVLVISPLAVAHAQGVEGVRAVAVSVGTGVSLAGNAINEGAGTINGQPSVLVEQALSNHFSDALKLRFTGSQGLDYNKEAFVTFAWGKYNGTERIVGSVAGYPLRARLSNTDALDLEGGLRYYFRPEGP
Ga0134127_1095900723300010399Terrestrial SoilVFVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGG
Ga0134127_1365923413300010399Terrestrial SoilVNRLFAATVLIALVPVAAARAQGVEGTRSATIFVGTGLSLSGTAINEGVGTIDGKPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEAFATLAYGNYNGTERIVGSVSGFPLLARLSNADAFDIEGGVRYYL
Ga0134123_1193307313300010403Terrestrial SoilVFLILNRLLAATVLLVLAPAAAAHAQGVEGTRSATIFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSKGLDYNKEAYATLAYGKYNGTERIVGSVSGYPLVVRLSNGDAFDLEGGLRYYLRPEGPIRTYVAGAL
Ga0137452_115328123300011441SoilMTLRRWLAATVLLVIAPVAAAHAQGVEGIRSATLFVGTGLSLAGNAINEAVGTIDGKPAVFVEQALSNHFSDGLRLRFTGSLGLDYKKEVFATFAYGKYNGTHRIVGSVSGYPLLARFSNADAFDIEGGPAHAGAP*
Ga0137350_110136413300012166SoilMAPVAAAHAQGVEGIRSVTLFVGTGLGLAGNAINEAVGTIDDKPSVFVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTERVVGSVSGYPLLARLSNADAFDIEG
Ga0137435_100768513300012232SoilMIWEEIKSRITGRSASNAGLAHGAFLRLRRLLATTVLLLTAPVAAARAQGVEGMRSVTLFVGTGLSLAGNAINEGVGTIDGKPSVIVEQSISNHFSDALRLRFTGSKGLDYNKEVFATFAYGKYNGTHRTVGSISGYPLVARFSNPDAFDIEGGLRYY
Ga0157287_111425213300012885SoilLTLKQLLAATLLFVTAPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGFPLLARL
Ga0126375_1022109623300012948Tropical Forest SoilMAPVAARAQGVEGTRSVTLFVGTGIGLAGNAIEEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSLGLDYRREVFGTFGWGKYNGTERIVGSVAGYPLRARLSNSDAFDFEGGLRYYVRPEGPLRTYVAAAAGLRYLLATDATFRV
Ga0164298_1088449413300012955SoilVFLTLNRLFAATVLIAVAPLAAADAQGVEGTWSATIFVGTAVSLSGNAIEEGVGTINGNPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEVFATLAYGKYNGTERIVGSVSGFPLRARLSNADAFD
Ga0164301_1167448913300012960SoilVSLPLIRLLAATVLMVIAPVAGAHAQGVEGTRSVTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSKGLDYNKEAFATLAYGKFNGT
Ga0126369_1010743313300012971Tropical Forest SoilMAPSAAHAQGVEGTRSVTVFVGTGLSLAGNAIEEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSLGLDYRREVFGTFGWGKYNGTERIVGSVAGYPLRARLSNSDAFDFEGGLRYYLRPEGPIRTYVAAAAGLRYL
Ga0163162_1337686613300013306Switchgrass RhizosphereVFLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIAGQPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYVAGAL
Ga0157375_1138388513300013308Miscanthus RhizosphereVFLTLNRLFAATVLIAVAPLAAADAQGVEGTWSATIFVGTAVSLSGNAIEEGVGTINGNPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEVFATLAYGKYNGTERIVGSVSGFPLRARLSNADAFDVEGGLRYYLRPEGPIRTYVAGAAGLRYLQ
Ga0157380_1007649253300014326Switchgrass RhizosphereVFVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLRARLSNADAFDI
Ga0157380_1098377523300014326Switchgrass RhizosphereVFLKLHRLVTVTVLLVIAPVAGARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSMGLDFNKEAFATFAYGKYNGTERIVGSVSGFPLRARLSNADAFDI
Ga0180064_110802013300014876SoilLTFKQLLAVIVLFFVIAPVSAARAQGVEGIRSVTLSVGTGISLAGNAINEGVGTIDGKPAVLVEQALSNHFSDALRLRFTGSLGLDYKKEVFATFGYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPI
Ga0180062_111829023300014879SoilMAPVAAAHAQGVEGIRSATLSVGTGISLAGNAINEAVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGSHRIVGSVSGYPLLARFSNADAFD
Ga0180077_104798823300015255SoilMAPVAAAHAQGVEGVRSATLSVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAGAAGLRFLQ
Ga0180067_116310613300015257SoilLTFKQLLAVIVLFVIAPVSAARAQGVEGIRSVTLSVGTGISLAGNAINEGVGTIDGKPAVLVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTERIVGSVAGYPLLARFSNADAFD
Ga0132257_10398111413300015373Arabidopsis RhizosphereVTLNRLLAFTLLVVSAPIATAHAQGVEGVRSVTLSVGTGVSLAGNAINEGAGTIDGKPAVLVEQALSNHFSDALRVRFTGSLGLDYNKEVFATFAYSKFNGSHRIVGSISGY
Ga0132257_10424974413300015373Arabidopsis RhizosphereLAATVLFVLAPVAAARAQGVEGVRSVTVFVGTGVGLSGNAINEGVGTINDKPAVIVEQAFSNHFSDALRLRGSYALGLDYNKEVFATFAWGKYNGTHRIVGSVSGYPLTARFSNPDAI
Ga0132255_10107114413300015374Arabidopsis RhizosphereMSPVAASAQGVEGTRSVTVYVGTGLSLAGNAFEEGVGTIDGKPSVLVEQALSNHFSDALKLRFTGSLGLDYRREIFATFGWGKYNGTERIVGSVAGYPLRARLSNADA
Ga0132255_10319258313300015374Arabidopsis RhizosphereLSLKRLLAATVRLVIAVIVPVAAAHAQGVEGTRSVTLFVGTGLSLAGNAINEAVGTIDGKPSVFVEQALSNHFSDALKLRFTGSLGLDYNKEVFATFAYGKYNGTERIVGSVSGYPLLARFSNADAFDIEGGLRYYLRPEGPIRTYV
Ga0132255_10462433813300015374Arabidopsis RhizosphereMGALLTLKRRLAASVFLVISPVVAHAQGVEGTRSATIFVGTGLSVHGNAINEGVGTIDGRPSVLVEQALSNHFSDALRLRFSGGVGLDYNKEVYATVAYAKYNGTERIVGSVSGYP
Ga0182038_1167422313300016445SoilMPLKLNRRLAAALLIVIAPVAVAHAQGVEGTKSVTLFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRIRFTGSVGLDYNKEAYATVAYGKFNGTER
Ga0190266_1081670313300017965SoilLTLKRLLAAAVLLVIAPPAAAHAQGVEGMRSAMFSVGTGVSLAGNAINEGAGTIAGRPSVLVEQSLSNHFSDALRLRFTGSIGLDYNKEMFATFGYGKYNGTERIVGSVSGYPLL
Ga0190266_1093758213300017965SoilLTLKRLLAATLLFVTAPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNREVFATFGYGKYNGTERIVGSVAGYPLLARFSNADAFDIE
Ga0190265_1080768323300018422SoilLTSKRLLAATVLFVIASLNAAHAQGVEGTRSVTLYVGTGISLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYKKEVFATFAYGKYNGTERIVGSVSGY
Ga0190270_1217570213300018469SoilVTLKRLLAATLLFVTAPVADAHAQGVEGVRSITLSVGTGVSLAGQAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYRREVFATFAYGKYNGTERIVGSVSGYPLLARLSNADAFDIEG
Ga0190274_1249712023300018476SoilMVLAPAAAARAQGVEGTRSGTIFVGTGISLAGNAINEGVGTIDGKPSVIVEQALSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTERIIGSVAGYPLLARLSNVDAFDIEGGLRYYLKPEGPIRTYVAGAVGLRFLQATDATFR
Ga0190271_1326482523300018481SoilLNLKRLSAATALFVATAVAAAHAQGVEGTRSVTLSVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTERIVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAAA
Ga0193692_106472013300020000SoilMGGLSTRNQLLAATIVLVMAPVAGARAQGVEGTRSATIFVGTGISLAGNAINEGVGTIDGKPSVIVEQALSNHFSDALRLRFTGSLGLDYNKEAYATLAYGKYNGTERIIGSVAGYPLLARLSNVDAFDIEGGLRYYLKPEGPIRTYVAGAVGLRFLQA
Ga0193697_108515013300020005SoilLTLKQLLAATVLLVIAPVAAAHAQGVEGTRSVTLSVGTGLSLAGNAINEGAGTIEGKPAVFVEQALSNHFSDALRVRFTGSLGLDYNKEAFATFAYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAG
Ga0193738_102389813300020020SoilLNLKRLSAATALFVATAVAAAHAQGVEGTRSVTLSVGTGISLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTERIVGSVAGYPLLARLSNADAFDIEGGIRYYL
Ga0247664_113858013300024232SoilMPQYQSMPQPRTHTTRLAGSRFSHRVCLTLNRLFAATVLIALAPVAVARAQGVEGTRSATLFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDGLRLRITGSKGLDYDKEAFLTLAYGKFNGTERIVGSVSGF
Ga0207681_1046168423300025923Switchgrass RhizosphereMALAPVAAARAQGVEGTRSATIFVGTGLTLSGNAINEGVGTIDGKPSVLVEQGLSNHFSDALKIRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGF
Ga0207681_1069184223300025923Switchgrass RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYY
Ga0207681_1105324713300025923Switchgrass RhizosphereLTLKRLLAATLLFVTAPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNREVFATFGYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAGAAGVRF
Ga0207701_1024446523300025930Corn, Switchgrass And Miscanthus RhizosphereLTLKRLLAATLLFVTTPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNREVFATFGYGKYNGTERVVGSVSGYPLLARLSNADA
Ga0207701_1075892323300025930Corn, Switchgrass And Miscanthus RhizosphereVTLNRLLAATLLVVSAPIATAHAQGVEGVRSVTLSVGTGLSLAGNAINEGAGTIDGKPAVLVEQALSNHFSDALRVRFTGNLGLDYNKEVFATFGYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYL
Ga0207686_1184770213300025934Miscanthus RhizosphereMVPVAAAHAQGVEGTRSVTLFVGTGLSLAGNAINEAVGTIDGKPSVFVEQALNNHFSDALRLRFTGSLGLDYNREVFATFAYGKYNGTERIVGSVSGYPGGDSRT
Ga0207669_1118114513300025937Miscanthus RhizosphereLTLKRLLAATLLFVTTPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNREVFATFGYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVAGAAGVRFLQAT
Ga0207704_1043136513300025938Miscanthus RhizosphereLPLHRLLTATVLLLLAPAAAARAQGVEGTRTATIFVGTGLSLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSKGLDYNKEAFATLAYGKYNGTERIIGSV
Ga0207711_1069545213300025941Switchgrass RhizosphereVTSRRLLAATVLLVIAPAAVAHAQGVEGVRSVTLFVGTGLSLSGNAINEAVGTIDGKPSVFVEQALSNHFSDALKLRFTGSLGLDYNKEAFATFAYGKYNGTERIVG
Ga0207712_1170948923300025961Switchgrass RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSATIFVGTGLTLSGNAINEGVGTIDGKPSVLVEQGLSNHFSDALKIRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGFPLLARLSNADAFDIE
Ga0207668_1152842113300025972Switchgrass RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYV
Ga0207677_1213038913300026023Miscanthus RhizosphereLTLNRLFAATVLIAVAPLAAADAQGVEGTWSATIFVGTAVSLSGNAIEEGVGTINGNPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEVFATLAYGKYNGTERIVGSVSGFPLRARLSNADAFDVEGGLRYYL
Ga0207703_1052913213300026035Switchgrass RhizosphereVTSKRLLAATVLLVIVPVAVAHAQGVEGVRSVTLFVGTGLNLSGNAINEAVGTIDGKPSVFVEQALSNHFSDALKLRFSGSLGLDYNKEAFATFAYGKYNGTERIVGSVSGYPLLARFSNADAFDFEGGLRYY
Ga0207703_1103543523300026035Switchgrass RhizosphereLTLHRLFAATALIALVPVAAARAQGVEGTRSATIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDGLRLRFTGSKGLDYNKEAYLTLAYGKFNGTERIVGSVSGFPLLARLSNTNAFDIEGGLRYYLRPEGPLRTYVAGAA
Ga0207675_10004676853300026118Switchgrass RhizosphereMALAPVAAARAQGVEGTRSATIFVGTGLTLSGNAINEGVGTIDGKPSVLVEQGLSNHFSDALKIRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLR
Ga0207675_10070677723300026118Switchgrass RhizosphereLPLKRLLAAAVLLVIAVIVPVGAAHAQGVEGTRSVTLFVGTGLSLAGNAINEAVGTIDGKPSVFVEQAFNNHFSDALRLRFTGSLGLDYNREVFATFAYGKYNGTERIVGSVSGYPLLARFSNADAFDFEGGLRYYLRPEGPIRTYVA
Ga0208185_114791213300027533SoilVTLFVGTGIGLAGNATNEAVGTIDGKPSVFVEQALSNHFSDGLRLRFTGSLGLDYNKEVFATFAYGKYNGTHRVVGSVSGYPLLARFSNADAFDIEGGLRYYLRPEGPIRTYVAGAAGLRFLQATDVTFVV
(restricted) Ga0233416_1034004613300027799SedimentMGDFLTLKRRLAAAVLLVMAPVAAAHAQGVEGTRSATLFVGTGLSLAGNAINEGVGTIDGRPSVLVEQALSNHFSDALRLRFAGGLGLDYNKEVYATFAYGKFNGTERIVGSVAGYPLRARLSNTDAFD
Ga0209811_1026274213300027821Surface SoilMGGLSTRNQLLAATLVLVMAPAAGARAQGVEGTRSATLFVGTGISLAGDAINEGVGTIDGKPSVIVEQALSNHFSDALRLRFTGSLGLDYNKEAYATLAYGKYNGTERIIGSVAGYPLLARLSNVDAFDIEGGL
(restricted) Ga0233418_1030034313300027995SedimentMVPVAAAHAQGVEGLRSATLFVGTGLSLYGNAINEGVGTIDGKPSVLVEQALSNHFSDALKLRFSGSLGLDYNKEVFATFAYGKYNGTERIVGSVAGYPLRARLSNADAFDIEGGLRYYFRPEGPIRTYLAGAAGVRFFQATDVT
Ga0268266_1117432913300028379Switchgrass RhizosphereLTLNRLFAATVLIAVAPLAAADAQGVEGTWSATIFVGTAVSLSGNAIEEGVGTINGNPSVFVEQALSNHFSDALRLRFTGSKGLDYNKEVFATLAYGKYNGTERIVGSVSGFPLRARLSNADAFDVEGGLRYYLRPE
Ga0268265_1006738953300028380Switchgrass RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSGTIFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATLAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYVA
Ga0268265_1079871313300028380Switchgrass RhizosphereVTLHRLFAATVLMALAPVAAARAQGVEGTRSATIFVGTGLTLSGNAINEGVGTIDGKPSVLVEQGLSNHFSDALKIRFTGSMGLDYNKEAFATLAYAKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYVAGALG
Ga0247828_1011767723300028587SoilLKLHRLVTVTILLVIAPVAAAHAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSIGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLRARLSNADAFDIEGGLRYYLRPEGPLRT
Ga0247822_1152424613300028592SoilLTLKRLLAATLLLVTAPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNREVFATFGYGKYNGTERVVGS
Ga0247825_1013206523300028812SoilLTLKRLLAATVIFVIAQVAAAHAQGVEGIRSATLFVGTGISLSGNAINEGVGTIDGKPAVLVEQSLSNHFSDALRLRFSGGLGLDYNKEAFATFAYGKYNGTHRTVGSVSGYPLVARFSNADAFDIEGVSIG
Ga0247827_1028716123300028889SoilLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSIGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLRARFPTPSC
Ga0247826_1138419913300030336SoilLKLHRLVTVTILLVIAPVAAAHAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSIGLDYNKEAFATFAYGKYNGTER
Ga0310888_1068494513300031538SoilMMWRDIESRITGRPESAAGPANGALLTSKRLLAAAVLLVIAPVAAAHAQGVEGVRSVTLFVGTGLGLAGNAINEAVGTIDGKPSVFVEQALSNHFSDALRLRFTGSLGLDYNKEAFATFAYSKFNGSHRIVGSIAGYPLLARLSNADAFDIEGGLRYYLRPEGPIRT
Ga0310887_1013820323300031547SoilLTSLRLLIVTILLLITPVAVARAQGVEGMRSITVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTHRIVGS
Ga0310887_1015251623300031547SoilLTFKQLLAVIVLFFVIAPVSAARAQGVEGIRSVTLSVGTGISLAGNAINEGVGTIDGKPAVLVEQALSNHFSDALRLRFTGSLGLDYNKEVFATFAYGKYNGTERVVGSVSGYPLLARLSNAD
Ga0310887_1049792323300031547SoilLKTLRPWTSALLLVLVSVTSAAAQGVEGTRSVSVFVGTGVGLSGNAIEEAVGSIEGTPSVFVEQSIGNHFSDAFRLRFGGSYGLDYNKEIFATFAYGRYNGTERIVGSVGGYPLYTRFSNADAFDIEGGLRYYF
Ga0310886_1036758023300031562SoilLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVGSVS
Ga0307468_10087943913300031740Hardwood Forest SoilLTLPRLLTVTILLVIVPVAVARGQGVEGMRSLTVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTHRIVGSVSGYPLTARFSNPDAFDIEGGLRYYLRPEGNLRTYVAGAA
Ga0307468_10110786313300031740Hardwood Forest SoilVTLTRRLAATVLMVIAPIAAAHAQGVEGTRSATLFVGTGLSLSGNAINEGVGTIDGRPSVLVEQALSNHFSDALRIRFTGSLGLDYNNEAFATLAYGKFNGTERIVGSVSGFPLRARLSNTDAFDIEGGLRHYLRPE
Ga0306918_1118033413300031744SoilMPLKLNRRLAAALLIVIAPVAVAHAQGVEGTKSVTLFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRIRFTGSVGLDYNKEAYATVAYGKFNGTERIVGSVSGFPLLARLSNTNAFDIEGGLRYYLRPEGPLRTYVAGALGLRFLQ
Ga0318521_1031742713300031770SoilMPLKLNRRLAAALLIVIAPVAVAHAQGVEGTKSVTLFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRIRFTGSAGLDYNKEAYATVAYGKFNGTERIVGSVSGFPLLARLSNTNAFDIEGGLRYYL
Ga0318576_1063016713300031796SoilLTTDRLVAAVFVLVMSPLAVAHAQGLEGVRAVAISVGTGVSLAGDAINEGVGTIDGKPSVLVEQALSNHFSDALKLRLTGSLGLDYNKEAFVTFAWGKYN
Ga0310900_1043366623300031908SoilLTSLRLLIVTILLLITPVAVARGQGVEGMRSITVFVGTGVSLSGNAINEGVGTIEGKPAVLVEQAMSNHFSDALRLRFTGSLGLDYNKEAFATFAYGKYNGTHRIVGSVSGYPLTARFSN
Ga0310900_1067612613300031908SoilLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSMGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLLARLSNADAFDIEGGLRYYLRPEGPLRTYVA
Ga0310900_1176795713300031908SoilLTLKRLLAATLLFVTAPVAAAHAQGVEGVRSVTLSVGTGISLAGNAINEGSGTIDGKPSVLVEQALSNHFSDALRVRFTGSLGLDYNKEVFATFGYGKYNGTERVVGSV
Ga0306923_1034723313300031910SoilMPLKLNRRLAAALLIVIAPVAVAHAQGVEGTKSVTLFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRIRFTGSVGLDYNKEAYATVAYGKFNGTERIVGSVSGFPLLARLSNTNAFDI
Ga0310884_1094215523300031944SoilLKLLRPLTVTILLVTTPVAVARAQGLEGMRSLTVFVGTGVSLSGNAINEGVGTIQGKPAVLVEQAMSNHFSDALRLRFTGALGLDYNKEVFATFAYGKYNGTH
Ga0310909_1004838933300031947SoilMPLKLNRRLAAALLIVIAPVAVAHAQGVEGTKSVTLFVGTGLSLSGNAINEGVGTIDGKPSVLVEQALSNHFSDALRIRFTGSAGLDYNKEAYATVAYGKFNGTERIVGSVSGFPLLARLSNTNAFDIEGGLRYY
Ga0310890_1133987413300032075SoilLKLHRLVTVTVLLVLAPVAAARAQGVEGTRSGTLFVGTGLSLAGNAINEGIGTIDGQPSVLVEQALSNHFSDALRLRFTGSIGLDYNKEAFATFAYGKYNGTERIVGSVSGFPLRARLSN
Ga0310890_1176064713300032075SoilRWRCGHGTAGGSRDGLNRNMMWRDIESRLTGRPNSNVGPANGALLTLKRLLAATLLLVIAPIAAAHAQGVEGMRSVTLFVGTGIGLAGNATNEAVGTIDGKPSVFVEQALSNHFSDGLRLRFTGSLGLDYNKEVFATFAYGKYNGTHRVVGSVSGYPLLARFSNADAFDIE
Ga0315910_1115324413300032144SoilMMWRHRIAITGCPQSTPVQPLEFLLTLNRLLAVTLFFFLAPAAAVRAQGVEGMRSVSVSLGTGISLAGNAINEGVGTIDGKPSVLVEQALSNHFSDALRLRFTGGLGLDYNKEVFATFAYGKYNGTERIVGSVSGYPLLARLSNADAFDIEGGLRYYLRPEGPIRTYVA
Ga0307471_10290968013300032180Hardwood Forest SoilLTLKRLLATALLFIAPVAVAHAQGVEGTRSVTVFVGTGLSLAGNAINEGVGTIEGKPSVIVEQSISNHFSDGLRLRFTGSLGLDYNKEAFVTFAYGKYNGTHRIVGSISGYPLVARFS
Ga0307471_10331490413300032180Hardwood Forest SoilMTLRRLLAATVLLVIAPVAAAHAQGVEGIRSATLFVGTGLSLAGNAINEAVGTIDGKPAVFVEQALSNHFSDGLRLRFTGSLGLDYNKEVFATFAYSKFNGTHRIVGSVSGYPLLARFSNADAFDIEGGLRYYL
Ga0364925_0325779_3_4433300034147SedimentLTLKRLLAATVLLVMAPVAAAHAQGVEGMRSVTLFVGTGLSLAGNAINEAAGTIDGKPSIFVEQALSNHFSDGLRLRFTGSLGLDYNKEAFVTFAYGKYNGSHRVVGSVAGYPLLARFSNADAFDIEGGLRYYLRPEGPIRTYVAGA
Ga0364927_0071328_3_4103300034148SedimentLTFKQLLAVIVLFVIAPVSAARAQGVEGIRSVTLSVGTGISLAGNAINEGVGTIDGKPAVLVEQALSNHFSDALRLRFTGSLGLDYKKEVFATFGYGKYNGTERVVGSVSGYPLLARLSNADAFDIEGGLRYYLRP


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.