NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F040717


Overview

Basic Information
Family ID: F040717
Family Type: Metagenome / Metatranscriptome
Number of Sequences: 161
Average Sequence Length: 80 residues
Representative Sequence: MIRRLGKTLALTCAGIVLGTTLATAAMVTGTVTSLDDKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGMTTCKAM
Number of Associated Samples: 109
Number of Associated Scaffolds: 161

Quality Assessment
Transcriptomic Evidence: Yes
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 79.50 %
% of genes near scaffold ends (potentially truncated): 24.22 %
% of genes from short scaffolds (< 2000 bps): 94.41 %
Associated GOLD sequencing projects: 103
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.47
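
The percentages above are fractions of the 161 family members. As a rough illustration, the sketch below reproduces them from gene counts. The counts 128, 39 and 152 are not stated on this page; they are assumptions back-calculated from the reported percentages, and the function name is hypothetical.

```python
def percent_of_family(count: int, total: int = 161) -> str:
    """Format count/total as a percentage with two decimals."""
    if total <= 0:
        raise ValueError("total must be positive")
    return f"{100 * count / total:.2f} %"


# Hypothetical gene counts, back-calculated from the reported percentages
# (assumptions for illustration, not values published by the database).
assumed_counts = {
    "valid RBS motif": 128,
    "near scaffold end": 39,
    "short scaffold (< 2000 bp)": 152,
}

for label, count in assumed_counts.items():
    print(f"{label}: {count}/161 = {percent_of_family(count)}")
```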


Most Common Taxonomy
Group: Unclassified (75.155 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (23.603 % of family members)
Environment Ontology (ENVO): Unclassified (28.571 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (32.298 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 0.00%, β-sheet: 30.19%, Coil/Unstructured: 69.81%

Predicted 3D Structure

pTM-score: 0.47

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
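
The low-confidence rule quoted above can be written as a simple threshold check. This is a minimal sketch: only the 0.7 cutoff and the 0.47 score come from this page, while the function and constant names are hypothetical and do not describe how the database itself is implemented.

```python
LOW_CONFIDENCE_PTM = 0.7  # cutoff quoted on this page (pTM < 0.7)


def is_low_confidence(ptm_score: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a pTM-score falls strictly below the cutoff."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return ptm_score < threshold


# The score reported for family F040717.
print(is_low_confidence(0.47))
```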



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 161 Family Scaffolds
PF11127	DUF2892	4.35
PF04008	Adenosine_kin	1.86
PF05532	CsbD	1.24
PF00441	Acyl-CoA_dh_1	1.24
PF00296	Bac_luciferase	1.24
PF13359	DDE_Tnp_4	1.24
PF09832	DUF2059	0.62
PF12762	DDE_Tnp_IS1595	0.62
PF13751	DDE_Tnp_1_6	0.62
PF05239	PRC	0.62
PF13384	HTH_23	0.62
PF01609	DDE_Tnp_1	0.62
PF00196	GerE	0.62
PF13613	HTH_Tnp_4	0.62
PF09369	MZB	0.62
PF02805	Ada_Zn_binding	0.62
PF02237	BPL_C	0.62
PF00933	Glyco_hydro_3	0.62
PF00933Glyco_hydro_3 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 161 Family Scaffolds
COG1839	Adenosine/AMP kinase	Nucleotide transport and metabolism [F]	1.86
COG1960	Acyl-CoA dehydrogenase related to the alkylation response protein AidB	Lipid transport and metabolism [I]	1.24
COG2141	Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)	Coenzyme transport and metabolism [H]	1.24
COG3237	Uncharacterized conserved protein YjbJ, UPF0337 family	Function unknown [S]	1.24
COG0340	Biotin-protein ligase	Coenzyme transport and metabolism [H]	0.62
COG1472	Periplasmic beta-glucosidase and related glycosidases	Carbohydrate transport and metabolism [G]	0.62
COG2169	Methylphosphotriester-DNA--protein-cysteine methyltransferase (N-terminal fragment of Ada), contains Zn-binding and two AraC-type DNA-binding domains	Replication, recombination and repair [L]	0.62
COG3039	Transposase and inactivated derivatives, IS5 family	Mobilome: prophages, transposons [X]	0.62
COG3293	Transposase	Mobilome: prophages, transposons [X]	0.62
COG3385	IS4 transposase InsG	Mobilome: prophages, transposons [X]	0.62
COG5421	Transposase	Mobilome: prophages, transposons [X]	0.62
COG5433	Predicted transposase YbfD/YdcC associated with H repeats	Mobilome: prophages, transposons [X]	0.62
COG5659	SRSO17 transposase	Mobilome: prophages, transposons [X]	0.62



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
Unclassified	root	N/A	75.16 %
All Organisms	root	All Organisms	24.84 %


Associated Scaffolds


Scaffold	Taxonomy	Length
3300000363|ICChiseqgaiiFebDRAFT_13791123	Not Available	561
3300000559|F14TC_101771624	Not Available	585
3300004463|Ga0063356_100643452	Not Available	1444
3300004463|Ga0063356_100790014	All Organisms → cellular organisms → Bacteria	1322
3300004463|Ga0063356_100951637	Not Available	1220
3300004463|Ga0063356_102813651	Not Available	749
3300004633|Ga0066395_10158181	Not Available	1154
3300005180|Ga0066685_10506176	Not Available	836
3300005289|Ga0065704_10481361	Not Available	670
3300005294|Ga0065705_10178668	Not Available	1590
3300005332|Ga0066388_103205865	Not Available	836
3300005332|Ga0066388_103282449	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium	827
3300005544|Ga0070686_100070042	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	2292
3300005562|Ga0058697_10456702	Not Available	644
3300005577|Ga0068857_100143417	All Organisms → cellular organisms → Bacteria	2160
3300005713|Ga0066905_102154372	Not Available	519
3300005719|Ga0068861_101410364	All Organisms → cellular organisms → Bacteria	681
3300005764|Ga0066903_100687626	All Organisms → cellular organisms → Bacteria	1799
3300005886|Ga0075286_1071694	Not Available	508
3300005981|Ga0081538_10029645	All Organisms → cellular organisms → Bacteria → Proteobacteria	3733
3300005981|Ga0081538_10392364	Not Available	500
3300006049|Ga0075417_10235573	Not Available	874
3300006049|Ga0075417_10435131	All Organisms → cellular organisms → Bacteria	653
3300006194|Ga0075427_10001076	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales	3423
3300006844|Ga0075428_101874262	Not Available	623
3300006846|Ga0075430_101773082	Not Available	506
3300006852|Ga0075433_11880133	Not Available	514
3300006853|Ga0075420_100689369	All Organisms → cellular organisms → Bacteria	881
3300006853|Ga0075420_100762893	Not Available	833
3300006854|Ga0075425_102901368	Not Available	526
3300006854|Ga0075425_102918669	All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Nitrospinae → unclassified Nitrospinota → Nitrospinae bacterium SCGC AAA008-D05	524
3300006931|Ga0097620_102889289	Not Available	526
3300006935|Ga0081246_1241915	Not Available	540
3300006938|Ga0081245_1343473	Not Available	503
3300007255|Ga0099791_10444150	Not Available	627
3300007265|Ga0099794_10419569	Not Available	700
3300009012|Ga0066710_100189316	All Organisms → cellular organisms → Bacteria → Proteobacteria	2915
3300009089|Ga0099828_10714739	Not Available	899
3300009094|Ga0111539_12652460	Not Available	581
3300009143|Ga0099792_10133924	All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium	1346
3300009147|Ga0114129_10053518	All Organisms → cellular organisms → Bacteria	5661
3300009147|Ga0114129_11164140	Not Available	962
3300009147|Ga0114129_13354070	Not Available	516
3300009610|Ga0105340_1556401	Not Available	512
3300009792|Ga0126374_10442307	Not Available	921
3300009792|Ga0126374_10817832	Not Available	714
3300009811|Ga0105084_1014919	Not Available	1240
3300009818|Ga0105072_1132588	Not Available	516
3300010046|Ga0126384_11118683	Not Available	723
3300010047|Ga0126382_10135367	Not Available	1662
3300010047|Ga0126382_10308732	Not Available	1192
3300010047|Ga0126382_10464944	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1007
3300010102|Ga0127453_1076945	Not Available	671
3300010114|Ga0127460_1091823	Not Available	640
3300010133|Ga0127459_1009130	Not Available	534
3300010137|Ga0126323_1029648	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium	594
3300010145|Ga0126321_1100774	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium	659
3300010145|Ga0126321_1320064	All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium	1204
3300010145|Ga0126321_1372861	Not Available	530
3300010147|Ga0126319_1016282	Not Available	600
3300010147|Ga0126319_1552804	All Organisms → cellular organisms → Bacteria → unclassified Bacteria → bacterium	601
3300010154|Ga0127503_10675394	Not Available	695
3300010154|Ga0127503_10895618	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium	954
3300010154|Ga0127503_11106743	Not Available	503
3300010360|Ga0126372_10605371	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1052
3300010362|Ga0126377_13283938	Not Available	523
3300011332|Ga0126317_10278347	Not Available	518
3300011332|Ga0126317_10563710	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → unclassified Anaerolineae → Anaerolineae bacterium	795
3300012206|Ga0137380_10728450	Not Available	859
3300012212|Ga0150985_102133661	Not Available	941
3300012212|Ga0150985_105038346	Not Available	884
3300012212|Ga0150985_106623910	Not Available	607
3300012212|Ga0150985_106707501	Not Available	584
3300012212|Ga0150985_115941860	Not Available	502
3300012212|Ga0150985_117096626	Not Available	612
3300012212|Ga0150985_121290443	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1623
3300012362|Ga0137361_11188602	Not Available	685
3300012393|Ga0134052_1192793	Not Available	797
3300012396|Ga0134057_1191883	Not Available	739
3300012397|Ga0134056_1247040	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1322
3300012406|Ga0134053_1017043	All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium	1427
3300012469|Ga0150984_108356633	Not Available	513
3300012469|Ga0150984_120542023	Not Available	611
3300012918|Ga0137396_10347692	Not Available	1097
3300012929|Ga0137404_10094753	All Organisms → cellular organisms → Bacteria	2396
3300013306|Ga0163162_10029832	All Organisms → cellular organisms → Bacteria	5399
3300015053|Ga0137405_1383343	Not Available	1093
3300015371|Ga0132258_10467527	All Organisms → cellular organisms → Bacteria	3147
3300017792|Ga0163161_11449229	Not Available	601
3300018433|Ga0066667_11856403	Not Available	548
3300019208|Ga0180110_1147654	Not Available	588
3300019228|Ga0180119_1119852	All Organisms → cellular organisms → Bacteria	642
3300019233|Ga0184645_1061864	Not Available	544
3300019233|Ga0184645_1154162	Not Available	557
3300019233|Ga0184645_1178910	Not Available	517
3300019254|Ga0184641_1171447	Not Available	643
3300019254|Ga0184641_1312358	Not Available	522
3300019254|Ga0184641_1360157	Not Available	555
3300019259|Ga0184646_1427078	All Organisms → cellular organisms → Bacteria	735
3300019259|Ga0184646_1474980	Not Available	518
3300019269|Ga0184644_1027799	Not Available	1164
3300019269|Ga0184644_1641381	Not Available	941
3300019279|Ga0184642_1667592	Not Available	567
3300020065|Ga0180113_1351189	Not Available	605
3300020065|Ga0180113_1436201	All Organisms → cellular organisms → Bacteria	510
3300021286|Ga0179583_1034350	Not Available	504
3300021951|Ga0222624_1406999	All Organisms → cellular organisms → Bacteria	873
3300025961|Ga0207712_10262725	All Organisms → cellular organisms → Bacteria	1401
3300027056|Ga0209879_1037620	Not Available	797
3300027379|Ga0209842_1051680	Not Available	743
3300027511|Ga0209843_1056004	Not Available	692
3300027577|Ga0209874_1031363	Not Available	1455
3300027671|Ga0209588_1227863	Not Available	574
3300027907|Ga0207428_10182507	All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4	1585
3300027954|Ga0209859_1056023	Not Available	635
3300028381|Ga0268264_11863514	Not Available	611
3300030570|Ga0247647_1211805	Not Available	558
3300030570|Ga0247647_1237478	Not Available	535
3300030829|Ga0308203_1059508	Not Available	595
3300030829|Ga0308203_1093013	Not Available	509
3300030830|Ga0308205_1019333	All Organisms → cellular organisms → Bacteria	770
3300030830|Ga0308205_1040635	Not Available	596
3300030902|Ga0308202_1105926	Not Available	586
3300030902|Ga0308202_1131672	Not Available	543
3300030902|Ga0308202_1134860	Not Available	538
3300030903|Ga0308206_1003564	All Organisms → cellular organisms → Bacteria	1875
3300030903|Ga0308206_1012170	Not Available	1313
3300030903|Ga0308206_1049634	Not Available	828
3300030903|Ga0308206_1072124	Not Available	728
3300030903|Ga0308206_1142651	Not Available	571
3300030904|Ga0308198_1056591	Not Available	618
3300030905|Ga0308200_1101985	Not Available	612
3300030905|Ga0308200_1147375	Not Available	541
3300030987|Ga0308155_1036830	Not Available	507
3300030989|Ga0308196_1040588	All Organisms → cellular organisms → Bacteria	616
3300030990|Ga0308178_1004933	Not Available	1623
3300030990|Ga0308178_1138664	Not Available	550
3300030993|Ga0308190_1047461	Not Available	818
3300030993|Ga0308190_1199742	Not Available	503
3300031039|Ga0102760_10959687	Not Available	534
3300031089|Ga0102748_11157130	All Organisms → cellular organisms → Bacteria	527
3300031091|Ga0308201_10178876	Not Available	686
3300031092|Ga0308204_10129858	All Organisms → cellular organisms → Bacteria → Elusimicrobia → Elusimicrobia	727
3300031092|Ga0308204_10177593	Not Available	650
3300031092|Ga0308204_10214935	Not Available	606
3300031092|Ga0308204_10223823	Not Available	598
3300031092|Ga0308204_10363835	Not Available	501
3300031093|Ga0308197_10243036	All Organisms → cellular organisms → Bacteria	635
3300031093|Ga0308197_10299331	Not Available	592
3300031094|Ga0308199_1122686	Not Available	594
3300031094|Ga0308199_1195158	Not Available	507
3300031094|Ga0308199_1201468	Not Available	502
3300031097|Ga0308188_1037270	Not Available	520
3300031098|Ga0308191_1012815	Not Available	776
3300031114|Ga0308187_10367945	Not Available	559
3300031421|Ga0308194_10352071	Not Available	525
3300031422|Ga0308186_1032309	Not Available	547
3300031424|Ga0308179_1057230	Not Available	518
3300032075|Ga0310890_10301179	Not Available	1153
3300032159|Ga0268251_10234239	Not Available	735
3300034664|Ga0314786_168825	Not Available	524
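
Each identifier in the Scaffold column has the form `<Taxon OID>|<scaffold name>`, and the numeric prefix matches a Taxon OID in the Associated Samples table. A minimal sketch of splitting an identifier so that scaffolds can be matched to samples is given below; the class and function names are hypothetical, and the format check reflects only the identifiers seen on this page.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # numeric sample identifier, e.g. "3300004463"
    scaffold_name: str   # scaffold name within that sample


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' at the first '|'."""
    taxon_oid, sep, scaffold_name = identifier.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300004463|Ga0063356_100643452")
print(ref.taxon_oid, ref.scaffold_name)
```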




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil	23.60%
Groundwater Sediment	Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment	9.94%
Populus Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere	9.32%
Soil	Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Soil	6.83%
Vadose Zone Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil	6.83%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil	4.97%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil	4.35%
Groundwater Sand	Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand	4.35%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere	4.35%
Tropical Forest Soil	Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil	3.11%
Soil	Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil	2.48%
Arabidopsis Thaliana Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere	2.48%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	1.86%
Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil	1.24%
Grasslands Soil	Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil	1.24%
Soil	Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil	1.24%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere	1.24%
Tabebuia Heterophylla Rhizosphere	Host-Associated → Plants → Roots → Rhizosphere → Unclassified → Tabebuia Heterophylla Rhizosphere	1.24%
Avena Fatua Rhizosphere	Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere	1.24%
Agave	Host-Associated → Plants → Phylloplane → Unclassified → Unclassified → Agave	1.24%
Tropical Rainforest Soil	Environmental → Terrestrial → Soil → Unclassified → Tropical Rainforest → Tropical Rainforest Soil	1.24%
Soil	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil	0.62%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere	0.62%
Rice Paddy Soil	Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil	0.62%
Soil	Environmental → Terrestrial → Soil → Loam → Grasslands → Soil	0.62%
Switchgrass Rhizosphere	Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere	0.62%
Arabidopsis Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere	0.62%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Switchgrass Rhizosphere	0.62%
Corn Rhizosphere	Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere	0.62%
Switchgrass Rhizosphere	Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere	0.62%




Associated Samples

Taxon OID	Sample Name	Habitat Type
3300000363	Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil	Environmental
3300000559	Amended soil microbial communities from Kansas Great Prairies, USA - control no BrdU total DNA F1.4 TC clc assemly	Environmental
3300004463	Combined assembly of Arabidopsis thaliana microbial communities	Host-Associated
3300004633	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio	Environmental
3300005180	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134	Environmental
3300005289	Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2	Host-Associated
3300005294	Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil	Environmental
3300005332	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)	Environmental
3300005544	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3L metaG	Environmental
3300005562	Agave microbial communities from Guanajuato, Mexico - As.Ma.e	Host-Associated
3300005577	Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2	Host-Associated
3300005713	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 36 (version 2)	Environmental
3300005719	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2	Host-Associated
3300005764	Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)	Environmental
3300005886	Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_205	Environmental
3300005981	Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1	Host-Associated
3300006049	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1	Host-Associated
3300006194	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1	Host-Associated
3300006844	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2	Host-Associated
3300006846	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4	Host-Associated
3300006852	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2	Host-Associated
3300006853	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD4	Host-Associated
3300006854	Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4	Host-Associated
3300006931	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S2-2 (version 2)	Host-Associated
3300006935	Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome P72I A10 (Metagenome Metatranscriptome) (version 2)	Environmental
3300006938	Tropical rainforest soil microbial communities from the Amazon Forest, Brazil, analyzing deforestation - Metatranscriptome P72I A001 (Metagenome Metatranscriptome) (version 2)	Environmental
3300007255	Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1	Environmental
3300007265	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1	Environmental
3300009012	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159	Environmental
3300009089	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG	Environmental
3300009094	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (version 2)	Host-Associated
3300009143	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2	Environmental
3300009147	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)	Host-Associated
3300009610	Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT700	Environmental
3300009792	Tropical forest soil microbial communities from Panama - MetaG Plot_12	Environmental
3300009811	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30	Environmental
3300009818	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_30_40	Environmental
3300010046	Tropical forest soil microbial communities from Panama - MetaG Plot_36	Environmental
3300010047	Tropical forest soil microbial communities from Panama - MetaG Plot_30	Environmental
3300010102	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_4_1 metaT (Metagenome Metatranscriptome)	Environmental
3300010114	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome)	Environmental
3300010133	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_8_2 metaT (Metagenome Metatranscriptome)	Environmental
3300010137	Soil microbial communities from California, USA to study soil gas exchange rates - SR-CA-SC1 metaT (Metagenome Metatranscriptome)	Environmental
3300010145	Soil microbial communities from Hawaii, USA to study soil gas exchange rates - KP-HI-INT metaT (Metagenome Metatranscriptome)	Environmental
3300010147	Soil microbial communities from California, USA to study soil gas exchange rates - BB-CA-RED metaT (Metagenome Metatranscriptome)	Environmental
3300010154	Soil microbial communities from Willow Creek, Wisconsin, USA - WC-WI-TBF metaT (Metagenome Metatranscriptome)	Environmental
3300010360	Tropical forest soil microbial communities from Panama - MetaG Plot_6	Environmental
3300010362	Tropical forest soil microbial communities from Panama - MetaG Plot_22	Environmental
3300011332	Soil microbial communities from California, USA to study soil gas exchange rates - SR-CA-SC2 metaT (Metagenome Metatranscriptome)	Environmental
3300012206	Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG	Environmental
3300012212	Combined assembly of Hopland grassland soil	Host-Associated
3300012362	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG	Environmental
3300012393	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome)	Environmental
3300012396	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_2 metaT (Metagenome Metatranscriptome)	Environmental
3300012397	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome)	Environmental
3300012406	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_4_1 metaT (Metagenome Metatranscriptome)	Environmental
3300012469	Combined assembly of Soil carbon rhizosphere	Host-Associated
3300012918	Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG	Environmental
3300012929	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)	Environmental
3300013306	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S5-5 metaG	Host-Associated
3300015053	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (PacBio error correction)	Environmental
3300015371	Combined assembly of cpr5 and col0 rhizosphere and soil	Host-Associated
3300017792	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S4-5 metaG	Host-Associated
3300018433	Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116	Environmental
3300019208	Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT231_16_1Ra (Metagenome Metatranscriptome)	Environmental
3300019228	Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT790_16_1Ra (Metagenome Metatranscriptome)	Environmental
3300019233	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60 (Metagenome Metatranscriptome)	Environmental
3300019254	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5 (Metagenome Metatranscriptome)	Environmental
3300019259	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome)	Environmental
3300019269	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5 (Metagenome Metatranscriptome)	Environmental
3300019279	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30 (Metagenome Metatranscriptome)	Environmental
3300020065	Metatranscriptome of soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaT ERMLT499_16_1Ra (Metagenome Metatranscriptome)	Environmental
3300021286	Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_08_16RNAfungal (Metagenome Metatranscriptome)	Environmental
3300021951	Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30 (Metagenome Metatranscriptome)	Environmental
3300025961	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)	Host-Associated
3300027056	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 (SPAdes)	Environmental
3300027379	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_10_20 (SPAdes)	Environmental
3300027511	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_20_30 (SPAdes)	Environmental
3300027577	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N1_20_30 (SPAdes)	Environmental
3300027671	Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)	Environmental
3300027907	Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD1 (SPAdes)	Host-Associated
3300027954	Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)	Environmental
3300028381	Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes)	Host-Associated
3300030570	Metatranscriptome of soil fungal communities from truffle orchard in Rollainville, France - Cnb12 (Eukaryote Community Metatranscriptome)	Environmental
3300030829	Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_357 (Metagenome Metatranscriptome)	Environmental
3300030830	Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_368 (Metagenome Metatranscriptome)	Environmental
3300030902Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_356 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030903Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_369 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030904Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_202 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030905Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_204 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030987Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_144 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030989Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_197 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030990Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_149 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300030993Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_185 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031039Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PI 6C (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031089Forest soil microbial communities from USA, for metatranscriptomics studies - Jemez Pines PI 2B (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031091Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_355 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031092Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_367 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031093Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_198 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031094Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_203 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031097Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_183 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031098Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_186 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031114Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_182 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031421Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031422Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_181 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031424Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_150 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300032075Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3EnvironmentalOpen in IMG/M
3300032159Agave microbial communities from Guanajuato, Mexico - As.Ma.e (v2)Host-AssociatedOpen in IMG/M
3300034664Metatranscriptome of lab incibated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20R3 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map, powered by OpenStreetMap]




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiFebDRAFT_137911232 | 3300000363 | Soil | MLRMLSKYVGLSLAVLCLATPLAMAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGTTSCKAM*
F14TC_1017716242 | 3300000559 | Soil | MLRMLSKYVGLSLAVLCLATPLAMAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGMTSCKAM*
Ga0063356_1006434521 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MIRKIGKTFAMSCASLILGTTLAFAGGMKGTVTSIDDKGMATVRTTDGKEHKVKGEGWQVGAQVECEQKEGTTSCKAAKM*
Ga0063356_1007900143 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MRRKLSTIVACTCASVVLGATLTFAGGMMKGRVTAIDDQSMATVLAEDGKEYKVKGEGWKVGTLVECDMKEGMTACKAASTAPKM*
Ga0063356_1009516373 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MIRKLGKTFALSCASLVLGTTLAFAGGMKGTVTSIDDKGMATVKTTDGKEHKVKGEGWQVGAQVECEQKEGTTSCKAGKM*
Ga0063356_1028136511 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MIRQLGKSFALTCASLVLGTTLAFAGGMMKGKVVAIDANGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0066395_101581812 | 3300004633 | Tropical Forest Soil | MIRKLGTSFALTCASLVLGTTLAFAGGMMNGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGT*
Ga0066685_105061761 | 3300005180 | Soil | MLRTLGKTIALTCASLVLGTTLATAAMMTGTVTTIDEKGMATVKMEDGKEHKVKGEGWKVGAKVSCEMKEGKTECKAM*
Ga0065704_104813611 | 3300005289 | Switchgrass Rhizosphere | MIQQLGKSFALTCASLVLGTTLAFAGGMMKGKVVEIDANGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0065705_101786682 | 3300005294 | Switchgrass Rhizosphere | MIQQLGKSFALTCASLVLGTTLAFAGGMMKGKVVAIDANGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0066388_1032058651 | 3300005332 | Tropical Forest Soil | MIRRLGKTLALTCAGIVLGTTLATAVMINGTVTDVDEKGMATVKTDDGKQHKVKGEGWKVGAKVECETKEGATMCKATH*
Ga0066388_1032824491 | 3300005332 | Tropical Forest Soil | KEIDMLLRKTLMLVCAGLVLSTTLATAATMQGIVTAINADGIATVKTEDGKEHKVKGEGWKVGAKVDCEIKEGKTACKAM*
Ga0070686_1000700425 | 3300005544 | Switchgrass Rhizosphere | LALTCAGLALGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM*
Ga0058697_104567021 | 3300005562 | Agave | MIRNLGKSFAFTCASVILGTTLAFAGGTMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVECEEKEGMTSCKAAM*
Ga0068857_1001434173 | 3300005577 | Corn Rhizosphere | MIRKLGKTLALTCAGLALGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM*
Ga0066905_1021543721 | 3300005713 | Tropical Forest Soil | MLRTLGKTMTLACAGLVLSATLATAGAMKGTVMGVDDNGMATVKTEDGKEHKVKGEGWKAGAKVECETKEGKTACKAT*
Ga0068861_1014103642 | 3300005719 | Switchgrass Rhizosphere | FTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0066903_1006876263 | 3300005764 | Tropical Forest Soil | MIRKLGTSFALTCASLVLGTTLAFAGGMMNGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGS*
Ga0075286_10716942 | 3300005886 | Rice Paddy Soil | MGLAFSRTASWAGACAGLVLSITLATAATVQSTVTEVDSRGMATVKTEDDKEHKVKGEGWKTGARVEYANKEGKTERKAM*
Ga0081538_100296452 | 3300005981 | Tabebuia Heterophylla Rhizosphere | MIRELGKTLALTCASLILGTTLAFAGGMKGTVTNIDDKGMATVKTEDGKEHKVKGEGWKVGAIVECEQKEGMTSCKAAKM*
Ga0081538_103923641 | 3300005981 | Tabebuia Heterophylla Rhizosphere | MIRKISKTFALSCASFILGTTLAFAGGMKGTVTSIDDKGMATVRTTDGKEHKVKGEGWQVGAQVECEQKEGMTSCKAAKM*
Ga0075417_102355731 | 3300006049 | Populus Rhizosphere | MLRMLSRYLGLSVAVLCFVATLATAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGTTSCKAM*
Ga0075417_104351312 | 3300006049 | Populus Rhizosphere | MKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0075427_100010763 | 3300006194 | Populus Rhizosphere | MLRMLSRYLGLSVAVLCFVATLATAQTMTGTVTTIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGTTSCKAM*
Ga0075428_1018742621 | 3300006844 | Populus Rhizosphere | MRRKLSTIVACTCASVVLGATLTFAGGMMKGRVTAIDDQSMATVLAEDGKEYKVKGDGWKVGTLVECDMKEGMTACKAASTAPK
Ga0075430_1017730821 | 3300006846 | Populus Rhizosphere | MKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTS
Ga0075433_118801331 | 3300006852 | Populus Rhizosphere | MRTLSRYLGLSVAVLCLVTTLAMAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECTTKEGTTSCKAM*
Ga0075420_1006893692 | 3300006853 | Populus Rhizosphere | MSAPFFYGHEQRINNRMEDIMKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0075420_1007628931 | 3300006853 | Populus Rhizosphere | MRRKLSTIVACTCASVVLGATLTFAGGMMKGRVTAIDDQSMATVLAEDGKEYKVKGDGWKVGTLVECDMKEGMTACKAASTAPKM*
Ga0075425_1029013681 | 3300006854 | Populus Rhizosphere | MIRKLSKSFAFTCATLVLGTTLAFAGGTMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVECEEKEGMTSCKASM*
Ga0075425_1029186691 | 3300006854 | Populus Rhizosphere | MKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSC
Ga0097620_1028892891 | 3300006931 | Switchgrass Rhizosphere | MIRTIGKTLALTCAGLVLGTTLAFAGDMKGTVTQIDDKGMATVKMDDGKETKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0081246_12419152 | 3300006935 | Tropical Rainforest Soil | MLRTLGKTLMLMCAGLVLSTTLATAATMHGTVTAIASDGMATVKTDDGKEYRVKGEGWKVGAKVECEIKEGKTECKAM*
Ga0081245_13434731 | 3300006938 | Tropical Rainforest Soil | MLQHLGKTLALACAGFVLSTTLATAAMTMGTVMSVDDRGMATVKTEDGKEHKVKGEGWKPGAKVECESKEGKTECKAVQ*
Ga0099791_104441502 | 3300007255 | Vadose Zone Soil | MVRRLGKTLALTCAGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0099794_104195691 | 3300007265 | Vadose Zone Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0066710_1001893161 | 3300009012 | Grasslands Soil | MLRTLGKTIALTCASLVLGTTLATAAMMTWTVTTIDEKGMATVKMEDGKEHKVKGEGWKVGAKVSCEMKEGKTECKAM
Ga0099828_107147392 | 3300009089 | Vadose Zone Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI*
Ga0111539_126524601 | 3300009094 | Populus Rhizosphere | MLLGTGYASDTTALLGKETRMMRMLSRYLGLSAAVLCFVATLATAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECTTKEGTTSCKAM*
Ga0099792_101339242 | 3300009143 | Vadose Zone Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKVGATVQCETKEGKTECKAI*
Ga0114129_100535184 | 3300009147 | Populus Rhizosphere | MEDIMKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0114129_111641401 | 3300009147 | Populus Rhizosphere | MIRTLGKTLALTCAGLVLSTTLATAAMMMGTVESVDDKGMATVKTEDGKEHKVKGEGWKAGAKVECETKEGKTDCKAVK*
Ga0114129_133540701 | 3300009147 | Populus Rhizosphere | MIRTLGKTLALTCAGLVLSTTLATAAMMMGTVESVDDKGMATVKTEDGKEHKVKGEGWKAGAKVEC
Ga0105340_15564011 | 3300009610 | Soil | MRRKLSTIVACTCASVALGATLTFAGGMMKGRVTAIDDQSMATVLAEDGKEYKVKGDGWKVGTLVECDMKAGMSACKAASTAPKM*
Ga0126374_104423071 | 3300009792 | Tropical Forest Soil | MIRKLGTSFALTCASLVLGTTLAFAGGMMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVECEEKGCTFGAIAGKK*
Ga0126374_108178321 | 3300009792 | Tropical Forest Soil | VEDIYMLRKLGNSFALTCASLVLGTTLAFAGGMMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGT*
Ga0105084_10149191 | 3300009811 | Groundwater Sand | MQMIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMEDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM*
Ga0105072_11325881 | 3300009818 | Groundwater Sand | PSLQRNLFSEEDMQMIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMEDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM*
Ga0126384_111186831 | 3300010046 | Tropical Forest Soil | MIRKLGKSFALTCASLVLGTTLAFAGGTMKGKVVGIDDKGMATVKTDDGKEHKVKGEGWKVGATVECEEKEGMTSCKAVAM*
Ga0126382_101353673 | 3300010047 | Tropical Forest Soil | MRVWRIIYMIRKLGKSFALTCASLVLGTTLAFAGGMMKGKVTAIDDKGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0126382_103087323 | 3300010047 | Tropical Forest Soil | MIRRLGKTLALTCAGIVLGTTLATAATMNGTVTDVNDKGMATVKTDDGKEHKVKGEGWKVGAKVECETKEGATMCKATH*
Ga0126382_104649442 | 3300010047 | Tropical Forest Soil | MLRKLGNSFALTCASLVLGTTLAFAGGMMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGT*
Ga0127453_10769451 | 3300010102 | Grasslands Soil | MIRRLGKTLALTCAGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0127460_10918231 | 3300010114 | Grasslands Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSLDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0127459_10091301 | 3300010133 | Grasslands Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKGHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0126323_10296481 | 3300010137 | Soil | ACAGLVLSITLATAATAQSTVTEVDSRGIATVKTEDDKEHKVKGEGWKTGARIECANKEGKAERKTI*
Ga0126321_11007741 | 3300010145 | Soil | MRRTFGKTLTLACAGLVLGTTLATAAMMQGTVTDIDDKGMATVKTEDGKDHKVKGEGWKVGAKVECETKEGKTACKAM*
Ga0126321_13200642 | 3300010145 | Soil | MIRQLGKSFALTCASLVLGTTLAFAGGTMKGKVTAIDDKGMATVKTDDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGM*
Ga0126321_13728611 | 3300010145 | Soil | MRRTLGQTLALTCASLVLGTTLATAAMMTGTVTSIDDKGMATVKMEDGKEHKVKGEGWKTGAKVECEMKEGKTECKAVQ*
Ga0126319_10162821 | 3300010147 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMKGTVTEVDANGMATVKTEDGKEHKVKGEGWKAGAKVECETKEGKTECKAT*
Ga0126319_15528042 | 3300010147 | Soil | MKTVAMAFAGLLLSTTLATAGGMKGTVTAVDDQGVATVKMENGNEYKVKGEGWKPGAKVDCDAREGKTE
Ga0127503_106753941 | 3300010154 | Soil | MIRRLGKTLALTCAGIVLGATLATAAMMNGTVTAIDDTGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM*
Ga0127503_108956182 | 3300010154 | Soil | MLQTLGKTLTLACAGLVLSPTLATAAMMNGTVTNIDDKGMATVKTEDGKEHKVKGEGWKVGATVQCETKEGKTECKAI*
Ga0127503_111067431 | 3300010154 | Soil | SGHEQRINKRMEDMMKIREIGKTLAFTCASLLLGATLTFAGDMKGTVTQIDDKGMATVKTEDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM*
Ga0126372_106053711 | 3300010360 | Tropical Forest Soil | LTCASLVLGTTLAFAGGMMNGKVTGIDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGMTSCKAVGS*
Ga0126377_132839381 | 3300010362 | Tropical Forest Soil | MIRKLGKSFALTCASLVLGTTLAFAGGTMKGKVTAIDDKGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0126317_102783471 | 3300011332 | Soil | MIRKLGKTFALSCASLILGTTLAFAGGMKGTVTKIDDNGMATVKTEDGKEHKVKGEGWQVGAQVECEQKEGMTSCKAAKM*
Ga0126317_105637101 | 3300011332 | Soil | GACAGLVLSITLATAATAQSTVTEVDSRGIATVKTEDDKEHKVKGEGWKTGARIECANKEGKAERKTI*
Ga0137380_107284501 | 3300012206 | Vadose Zone Soil | PMIRTLGTYLGLSVAVLCLATSLVTAGERGMKGTVTAIDDKGMAIVKTEDGKEHKVKGEGWKVGATVECKLKEDTTSCKAAM*
Ga0150985_1021336611 | 3300012212 | Avena Fatua Rhizosphere | MIQQLGKSFALTCASLVLGTTLAFAGGMMKGKVVAIDVNGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0150985_1050383461 | 3300012212 | Avena Fatua Rhizosphere | MIRKLSKSFAFTCASFVLGTTLAFAGGTMKGKVTAIDANGMATVKTEDGKEHKVKGEGWKVGANVECEQKEGMTSCKAVM*
Ga0150985_1066239101 | 3300012212 | Avena Fatua Rhizosphere | MVRRLGKTLALTCAGIVLGTTLATAAMVNGTVTDIDDKGMATVKTEDGQQHKVKGEGWKVGAKVQCDTKEGKTECKAM*
Ga0150985_1067075011 | 3300012212 | Avena Fatua Rhizosphere | VGDIDMIRKLGTTIAFTCASLVLGATLTFAGGMMKGTVTNIDDKGMATVKTEDGKEHKVKGEGWKVGAKVECEEKEGMTSCKAAM*
Ga0150985_1159418601 | 3300012212 | Avena Fatua Rhizosphere | MIRKLGKTLALTCAGLASGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM*
Ga0150985_1170966261 | 3300012212 | Avena Fatua Rhizosphere | VSNISRYVEDIVMIRKIGTTIAFTCASLVLSATLTFAGGMMKGKVVDVDDKGMATVKTEDGKEHKVKGEGWKVGANVQCEEKEGTTSCKAVGM*
Ga0150985_1212904433 | 3300012212 | Avena Fatua Rhizosphere | MIRRLGKTLALGCAGIVLGTTLATAAMMTGTVMSMDDKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGATMCKATQ*
Ga0137361_111886021 | 3300012362 | Vadose Zone Soil | HSVEDIRMIRKLGKTLALTCAGLVLGTTLAFAGDMKGTVTAIDDKGMATVKTEDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM*
Ga0134052_11927931 | 3300012393 | Grasslands Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAQVECETKEGTTMCKAM*
Ga0134057_11918831 | 3300012396 | Grasslands Soil | LSPALQYCYWWDSGGEIYMVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSLDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0134056_12470402 | 3300012397 | Grasslands Soil | YMIRRLGKTLALTCVGIVLGTTLATAAMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0134053_10170432 | 3300012406 | Grasslands Soil | MIRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSLDEKGMATVKTEDGKEHKVKGEGWKVGAQVECETKEGTTMCKAM*
Ga0150984_1083566331 | 3300012469 | Avena Fatua Rhizosphere | MIRRLGKTLALSCAGIVLGTTLATAAMMTGTVVSMDEKGMATVKTDDGKEHKVKGEGWKVGAKVECEAKEGATMCKATQ*
Ga0150984_1205420231 | 3300012469 | Avena Fatua Rhizosphere | ISWDVSKLSRKCGGYMIQQLGKSFALTCASLVLGTTLAFAGGMMKGKVVAIEANGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM*
Ga0137396_103476921 | 3300012918 | Vadose Zone Soil | MLQTLGRYLGLSVAVLLLATSLGIAGEKGMKGTVTAIDDKGIATVKTEDGKEHKVKGEGWKVGAKVECALKEGTTSCKAVM*
Ga0137404_100947532 | 3300012929 | Vadose Zone Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSMDENGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM*
Ga0163162_100298326 | 3300013306 | Switchgrass Rhizosphere | MIRKLGKTLALTCAGLALGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAAPQM*
Ga0137405_13833431 | 3300015053 | Vadose Zone Soil | MQEKYLTPAFSLSMMPPSLQRNLFSEEDMQMIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMDDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM*
Ga0132258_104675273 | 3300015371 | Arabidopsis Rhizosphere | MIRKLGKILALTCAGLALGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM*
Ga0163161_114492292 | 3300017792 | Switchgrass Rhizosphere | DCPTWVGPCGTSQRSCGGYRKIQRRGKTFALSCASFILGTILACAGGMKGMVTSIDDKGMATVRTEDGKEHKVKGEGWQVGAQVECEQKEGTTSCKAAKM
Ga0066667_118564031 | 3300018433 | Grasslands Soil | MIRKLSKSFAFTCASFVLGTTLAFAGGTMKGKVTAIDANGMATVKTEDGKEHKVKGEGWKVGANVECEQKEGMT
Ga0180110_11476541 | 3300019208 | Groundwater Sediment | MLRTLGKTLTLACAGLVLSTTLATAGMMKGTVMGVDDNGMATVKTEDGKEHKVKGESWKAGAKVECETKEGKTACKAI
Ga0180119_11198522 | 3300019228 | Groundwater Sediment | MIRTVTKTIAMAFAGLLLSTTLATAGGMKGTVTAVDNQGVATVKTDDGKEYKVRGEGWKPGAKVDCEAREGKTECKATQ
Ga0184645_10618641 | 3300019233 | Groundwater Sediment | MVRRLGKTLALTCAGIVLSTTLATAAMMNGTVTDIDDKGMATVKTEDGQQHKVKGEGWKVGAKVQCDIKDGKTDCKAT
Ga0184645_11541621 | 3300019233 | Groundwater Sediment | MIRKLGKTFALTCAGLVLGTTLAFAGDMKGTVTAIDDKGMATVKTEDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM
Ga0184645_11789101 | 3300019233 | Groundwater Sediment | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0184641_11714471 | 3300019254 | Groundwater Sediment | MVRRLGKTLALTCAGIVLGTTLATAGMMTGTVTSVDDKGMATVKTDDGKEHKVKGEGWKAGAKVECEMKEGKTECKAM
Ga0184641_13123581 | 3300019254 | Groundwater Sediment | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTAIDDKGMATVKTDDGKEHKVKGEGWKVGATVQCEIKEGKTECKAI
Ga0184641_13601571 | 3300019254 | Groundwater Sediment | MIRRLGKTLALGCAGIVLGTTLATAAMMTGTVVSMDDKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGATMCKATQ
Ga0184646_14270782 | 3300019259 | Groundwater Sediment | MIRTLGRGVVFVLAVLCLVTSLATAAGTTGTVTKIDDKGMATVKMDDGKEHVVKGEGWKVGAKVECNVKEGKTACKAM
Ga0184646_14749801 | 3300019259 | Groundwater Sediment | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTDIYDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0184644_10277993 | 3300019269 | Groundwater Sediment | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTAIDDKGIATVKTDDGKEHKVKGEGWKVGATVQCEIKEGKTECKAM
Ga0184644_16413812 | 3300019269 | Groundwater Sediment | MVRRLGKTLALTCAGIVLGTTLATAGMMTGTVTSVDDKGMATVKTDDGKEHKVKGEGWKAGAKVECESKEGKTDCKAAK
Ga0184642_16675921 | 3300019279 | Groundwater Sediment | MRRTWGKTLALTCASLVLGTTLATAAMMTGTVTDIDSKGMATVKTEDGKEHKVKGEGWTVGAKVSCEIKEGKTECKAM
Ga0180113_13511891 | 3300020065 | Groundwater Sediment | MIRKLGKTFALSCASLVLGTTLAFAGGMKGTVTSIDDKGMATVKTDDGKEHKVKGEGWQVGAAVECEQKEGMTSCKAAKKM
Ga0180113_14362011 | 3300020065 | Groundwater Sediment | LFVEEDMRMIRTLGKYVGLSVAGLCLVTSLALAGGTTGTVTKIDDKGMATVTMGDGKEHVVKGEGWKVGAKVECEVKEGKTACKAM
Ga0179583_10343501 | 3300021286 | Vadose Zone Soil | MIRRLGKTLALTCAGIVLGATLATAAMMNGTVTAIDDTGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM
Ga0222624_14069992 | 3300021951 | Groundwater Sediment | MLRRLGKTLALTCAGIVLGATLATAAMMNGTVTAIDDAGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM
Ga0207712_102627252 | 3300025961 | Switchgrass Rhizosphere | MIQQLGKSFALTCASLVLGTTLAFAGGMMKGKVVAIDANGMATVKTDDGKEHKVKGEGWKVGANVECEEKEGMTSCKAVAM
Ga0209879_10376201 | 3300027056 | Groundwater Sand | MIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMEDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM
Ga0209842_10516801 | 3300027379 | Groundwater Sand | MVRRLGKTLALTCAGIVLSTTLATAAMMTGTVTSVDDKGMATVKTEDGKEHKVKGEGWKTGAKVECEMKEGKTECKAM
Ga0209843_10560041 | 3300027511 | Groundwater Sand | TLALTCAGIVLSTTLATAAMMTGTVTSVDDKGMATVKMEDGKEHKVKGEGWKTGAKVECEMKEGKTECKAM
Ga0209874_10313631 | 3300027577 | Groundwater Sand | MIRTLGKYVGLSVAVFCLMTSLAMAGGMTGTVTQINDKGMATVKMADGKEHVVKGEGWKVGATVECEVKEGNTVCKAK
Ga0209588_12278631 | 3300027671 | Vadose Zone Soil | MVRRLGKTLALTCVGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM
Ga0207428_101825073 | 3300027907 | Populus Rhizosphere | MKIREIGKTLAFTCASLVLGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQTKEGTTSCKAASQM
Ga0209859_10560231 | 3300027954 | Groundwater Sand | MVRRLGKTLALTCAGIVLSTTLATAAMMTGTVTSVDDKGMATVKMEDGKEHKVKGEGWKTGAKVECEMKEGKTECKAM
Ga0268264_118635141 | 3300028381 | Switchgrass Rhizosphere | MIRKLGKTLALTCAGLALGATLTFAGDMKGTVTQIDDKGMATVKTDDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM
Ga0247647_12118051 | 3300030570 | Soil | GYEMIRKLGKTFALSCASLVLATTLAFAGGMKGTVTSIDDKGMATVRTTDGKEHKVKGEGWQVGAQVECEQKEGMTSCKAAKM
Ga0247647_12374781 | 3300030570 | Soil | MIRRFGKTFALSCASLILGTTLVFAGGMKGTVTSIDDKGMATVKTDDGKEHKVKGEGWKVGATVECEQKEGMTSCKAAKM
Ga0308203_10595082 | 3300030829 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCEIKEGKTECKAM
Ga0308203_10930131 | 3300030829 | Soil | MVRRLGKTLALTCAGIVLGTTLATAAMMNGTVTAIDDKGMATVKMEDGQTHKVKGEGWTVGAKVQCDMKEGKTECKAM
Ga0308205_10193332 | 3300030830 | Soil | MLRTLGKTLTLACAGLVLSTALATAATMQGTVTAIDANGMATVKTDDGKEHKVKGEGWKVGAKVDCESKEGKTECKAM
Ga0308205_10406351 | 3300030830 | Soil | MVRRLGKTLALICAGIVLGTTLATAAMVNGTVTDIDDKGMATVKTEDGQTHKVKGEGWTVGAKVQCDTKEGKTACKAM
Ga0308202_11059261 | 3300030902 | Soil | MYMIRRLGKTLALTCAGIVLGATLATAAMMNGTVTAIDDAGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM
Ga0308202_11316721 | 3300030902 | Soil | MIRRLGKTLALACAGIVLGTTLATAAMVTGTVTSLDDKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGMTTCKAM
Ga0308202_11348602 | 3300030902 | Soil | MVRRLGKTLALTCAGIVLGTTLATAAMMNGTVTAIDDKGMATVKMEDGQTHKVKGEGWTVGAKVQC
Ga0308206_10035642 | 3300030903 | Soil | MVRRLGKTLALTCAGIVLGTTLATAAMMTGTVVSMDEKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGTTMCKAM
Ga0308206_10121701 | 3300030903 | Soil | MLRRLGKTLALTCGGIVLGATLATAAMMNGTVTAIDDAGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM
Ga0308206_10496341 | 3300030903 | Soil | MIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKLDDKGMATVKMDDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM
Ga0308206_10721241 | 3300030903 | Soil | MVRRLGKTLALTCAGIMLATTLATAGMMTGTVTSVDDKGMATVKTDDGKEHKVKGEGWKAGAKVECEMKE
Ga0308206_11426511 | 3300030903 | Soil | METTLTKGHNMLRTLGKTLALTCAGLVLSTTLATAAMMMGTVESVDDKGMATVKMDDGKEHKVKGEGWKVGAKVECEMKEGKTE
Ga0308198_10565911 | 3300030904 | Soil | MIRILGKYVGLSVAVLCLITSLAMAGGTTGTVTKVDDKGMATVKMADGKEHVVKGEGWKVGATVECDVKEGNTVCKAK
Ga0308200_11019851 | 3300030905 | Soil | MIRRLGKTLALTCAGIVLGTTLATAAMVTGTVTSLDDKGMATVKTEDGKEHKVKGEGWKVGAKVECETKEGMTTCKAM
Ga0308200_11473751 | 3300030905 | Soil | MIRRLGKTLALTCAGIVLGATLTTAAMMNGTVTAIDDSGMATVKMEDGQTHKVKGEGWTVGAKVQCDTKEGKTACKAM
Ga0308155_10368301 | 3300030987 | Soil | MLRRLGKTLALTCAGIVLGATLATAAMMNGTVTAIDDAGMATVKMEDGQTHKVKGEGWTVGAKVQCDMKEGKTECKAM
Ga0308196_10405882 | 3300030989 | Soil | MIRTLGKYVGLSVAVLCLVASLALAGGMTGTVTKVDEKGMATVKMEDGKEHQVKGEGWKVGAKVECAMKEGTTSCKAM
Ga0308178_10049332 | 3300030990 | Soil | MIRKLGKTFALTCAGLVLGSTLAFAGDMKGTVTAIDDKGMATVKTEDGKEHKVKGEGWKVGAKVDCQAKEGTTTCKAASQM
Ga0308178_11386641 | 3300030990 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECK
Ga0308190_10474611 | 3300030993 | Soil | MLRTLGKSLTLACAGLVLSTTLATAAMMNGTVTAIDDKGMATVKTDDGKEHKVKGEGWKVGATVQCEIKEGKTECKAM
Ga0308190_11997421 | 3300030993 | Soil | LFSEEDMQMIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMDDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM
Ga0102760_109596871 | 3300031039 | Soil | MLRMLSKYVGLSLAVLCLATPLAMAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGMTSCKAM
Ga0102748_111571301 | 3300031089 | Soil | IRTLGKYVGLSVAVLCLVASLALAGGMTGTVTKVDEKGMATVKMEDGKEHQVKGEGWKVGAKVECAMKEGTTSCKAM
Ga0308201_101788761 | 3300031091 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTDIDNKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0308204_101298582 | 3300031092 | Soil | EDMQMIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKVDEKGMATVKMEDGKEHQVKGEGWKVGAKVECAMKEGTTSCKAM
Ga0308204_101775932 | 3300031092 | Soil | MLRTLGKTLTLACAGLVLSTTLATAATMQGTVTAIDANGMATVKTDDGKEHKVKGEGWKVGAKVDCESKEGKTECKAM
Ga0308204_102149352 | 3300031092 | Soil | METTLTKGHNMLRTLGKTLALTCAGLVLSTTLATAAMMMGTVESVDDKGMATVKMDDGKEHKVKGEGWKVGAKVECEMK
Ga0308204_102238231 | 3300031092 | Soil | MIRRLGKTLALTCAGVVLGASLATAGMMTGTVVGMDEKGMATVKTDDGKEHKVKGEGWKVGAKVECETKEGATMCKATQ
Ga0308204_103638352 | 3300031092 | Soil | GMLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTEIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0308197_102430362 | 3300031093 | Soil | MIRTLGRYVGLSVAVLCLITSLAMAGGTTGTVTKIDDKGMATVKMEDGKEHQVKGEGWKVGAKVECEVKEGKTSCKAM
Ga0308197_102993311 | 3300031093 | Soil | DTTALLGKETRMLRMLSRYLGLSVAVLCFVATLATAQTMTGTVTKIDDKGMATVKTEDGKEHMVKGEGWKVGAKVECMTKEGTTSCKAM
Ga0308199_11226861 | 3300031094 | Soil | MIRTLGRGVVFVLAVLCLVTSLATAAGMTGTVTKIDDNGMATVKMADGKEHVVKGEGWKVGATVECDVKEGNTVCKAK
Ga0308199_11951581 | 3300031094 | Soil | MIRTLGKYVGLSVAVLCLVTSLALAGGMTGTVTKIDDKGMATVKMDDGKEHQVKGEGWKVGAKVECDVKEGKTACKAM
Ga0308199_12014681 | 3300031094 | Soil | MVRRLGKTLALTCAGIVLGTTLATAAMMNGTVTAIDDKGMATVKMEDGQTHKVKGEGWTVGAKVQCETKEGKTACKAM
Ga0308188_10372701 | 3300031097 | Soil | MVRRLGKTLALTCSGIVLGTTLATAGMMTGTVTSVDDKGMATVKTDDGKEHKVKGEGWKAGAKVECESKEGKTDCKAAK
Ga0308191_10128151 | 3300031098 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTDIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0308187_103679451 | 3300031114 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTNIDDKGIATVKTDDGKEHKVKGEGWKVGATVQCEIKEGKTECKAM
Ga0308194_103520712 | 3300031421 | Soil | MKIREIGKTLAFTCASLLLGATLTFAGDMKGTVTQIDDKGMATVKTEDGKEHKVKGKGWKVGAKVECALKEGTTSCKVAM
Ga0308186_10323091 | 3300031422 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTAIDDKGMATVKTDDGKEHKVKGEGWKAGAKVQCETKEGKTECKAI
Ga0308179_10572301 | 3300031424 | Soil | MLRTLGKTLTLACAGLVLSTTLATAAMMNGTVTDIDDKGMATVKTEDGKEHKVKGEGWKAGAKVQCETKEGKTECK
Ga0310890_103011792 | 3300032075 | Soil | MRRKLSTIVACTCASVVLGATLTFAGGMMKGRVTAIDDQSMATVLAEDGKEYKVKGDGWKVGTLVECDMKEGMTACKAASTAPKM
Ga0268251_102342392 | 3300032159 | Agave | MIRNLGKSFAFTCASVILGTTLAFAGGTMKGKVTAIDDKGMATVKTEDGKEHKVKGEGWKVGANVECEEKEGMTSCKAAM
Ga0314786_168825_204_446 | 3300034664 | Soil | MIRRLGKTIALSCASLILGTTLVFAGGMKGTVTSIDDKGMATVKTDDGKEHKVKGEGWKVGATVECEQKEGMTSCKAAKM




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice