NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102986

Metagenome Family F102986

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102986
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 196 residues
Representative Sequence MRKTGFALAFLLLSTMPAIAQTSEFGLLIGGSKRLISHSDQAQGLGISDNFKFSNSNREIFYAIQVDPGTFFRIKGGQIEGPVAFQFTDAAGHRARTDVPKGKVEHVDALIDYRFSEAFGSTGLFAGVGLYRQRATLNDLAVPAVQRGNQTETNYGFQGGVNGDFPMTRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF
Number of Associated Samples 78
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 47.52 %
% of genes near scaffold ends (potentially truncated) 43.56 %
% of genes from short scaffolds (< 2000 bps) 79.21 %
Associated GOLD sequencing projects 68
AlphaFold2 3D model prediction Yes
3D model pTM-score0.74

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(26.733 % of family members)
Environment Ontology (ENVO) Unclassified
(36.634 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.485 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Transmembrane (beta-barrel) Signal Peptide: Yes Secondary Structure distribution: α-helix: 9.17%    β-sheet: 58.52%    Coil/Unstructured: 32.31%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.74
Powered by PDBe Molstar

Structural matches with SCOPe domains

SCOP familySCOP domainRepresentative PDBTM-score
f.4.1.1: Outer membrane proteind1p4ta_1p4t0.73465
f.4.1.1: Outer membrane proteind1qjpa_1qjp0.67916
f.4.1.0: automated matchesd3qraa_3qra0.64421
f.4.1.4: PsbO-liked5b5eo_5b5e0.62764
f.4.4.0: automated matchesd2x55a12x550.62696


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF00571CBS 26.73
PF13505OMP_b-brl 9.90
PF01244Peptidase_M19 7.92
PF00156Pribosyltran 5.94
PF03279Lip_A_acyltrans 1.98
PF01619Pro_dh 1.98
PF01070FMN_dh 0.99
PF13462Thioredoxin_4 0.99
PF01425Amidase 0.99
PF00356LacI 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG2355Zn-dependent dipeptidase, microsomal dipeptidase homologPosttranslational modification, protein turnover, chaperones [O] 7.92
COG0506Proline dehydrogenaseAmino acid transport and metabolism [E] 1.98
COG1560Palmitoleoyl-ACP: Kdo2-lipid-IV acyltransferase (lipid A biosynthesis)Lipid transport and metabolism [I] 1.98
COG4261Predicted acyltransferase, LPLAT superfamilyGeneral function prediction only [R] 1.98
COG0069Glutamate synthase domain 2Amino acid transport and metabolism [E] 0.99
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 0.99
COG1304FMN-dependent dehydrogenase, includes L-lactate dehydrogenase and type II isopentenyl diphosphate isomeraseEnergy production and conversion [C] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300004463|Ga0063356_100164383All Organisms → cellular organisms → Bacteria2556Open in IMG/M
3300005167|Ga0066672_10415482All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium880Open in IMG/M
3300005171|Ga0066677_10076319All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1732Open in IMG/M
3300005171|Ga0066677_10148085All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1284Open in IMG/M
3300005174|Ga0066680_10078963All Organisms → cellular organisms → Bacteria1983Open in IMG/M
3300005177|Ga0066690_10024276All Organisms → cellular organisms → Bacteria3433Open in IMG/M
3300005178|Ga0066688_10265213All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1102Open in IMG/M
3300005178|Ga0066688_10611737All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium700Open in IMG/M
3300005178|Ga0066688_10978992All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium517Open in IMG/M
3300005184|Ga0066671_10173151All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1277Open in IMG/M
3300005186|Ga0066676_10897516All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium595Open in IMG/M
3300005187|Ga0066675_11017472All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium622Open in IMG/M
3300005450|Ga0066682_10382813All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium903Open in IMG/M
3300005451|Ga0066681_10378620All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium870Open in IMG/M
3300005467|Ga0070706_100021700All Organisms → cellular organisms → Bacteria5914Open in IMG/M
3300005468|Ga0070707_100298063All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1567Open in IMG/M
3300005468|Ga0070707_101206793All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium723Open in IMG/M
3300005468|Ga0070707_101241476All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium711Open in IMG/M
3300005471|Ga0070698_100148440All Organisms → cellular organisms → Bacteria2293Open in IMG/M
3300005471|Ga0070698_101168687All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium718Open in IMG/M
3300005529|Ga0070741_10047289All Organisms → cellular organisms → Bacteria5327Open in IMG/M
3300005529|Ga0070741_10358288All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1351Open in IMG/M
3300005534|Ga0070735_10014759All Organisms → cellular organisms → Bacteria5812Open in IMG/M
3300005537|Ga0070730_10202496All Organisms → cellular organisms → Bacteria → Proteobacteria1324Open in IMG/M
3300005552|Ga0066701_10176200All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1301Open in IMG/M
3300005552|Ga0066701_10259102All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1076Open in IMG/M
3300005557|Ga0066704_10212256All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1311Open in IMG/M
3300005557|Ga0066704_10781347All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium595Open in IMG/M
3300005559|Ga0066700_10048934All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2588Open in IMG/M
3300005559|Ga0066700_10423252All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium937Open in IMG/M
3300005569|Ga0066705_10305529All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1008Open in IMG/M
3300005575|Ga0066702_10160642All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1337Open in IMG/M
3300005891|Ga0075283_1053748All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium697Open in IMG/M
3300005902|Ga0075273_10009867All Organisms → cellular organisms → Bacteria1544Open in IMG/M
3300005903|Ga0075279_10038391All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium762Open in IMG/M
3300005903|Ga0075279_10087012All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium564Open in IMG/M
3300006046|Ga0066652_101274730All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium694Open in IMG/M
3300006791|Ga0066653_10320264All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium781Open in IMG/M
3300006854|Ga0075425_100953477All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium980Open in IMG/M
3300006903|Ga0075426_11176519All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium581Open in IMG/M
3300007255|Ga0099791_10287649All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium782Open in IMG/M
3300009012|Ga0066710_100324406All Organisms → cellular organisms → Bacteria2266Open in IMG/M
3300009012|Ga0066710_101770017All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium935Open in IMG/M
3300009012|Ga0066710_104686318All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium511Open in IMG/M
3300009038|Ga0099829_10898698All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium735Open in IMG/M
3300009088|Ga0099830_10121814All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1975Open in IMG/M
3300009089|Ga0099828_10039921All Organisms → cellular organisms → Bacteria → Proteobacteria3838Open in IMG/M
3300009089|Ga0099828_10070127All Organisms → cellular organisms → Bacteria2965Open in IMG/M
3300009089|Ga0099828_10898743All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium791Open in IMG/M
3300009089|Ga0099828_10926829All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium778Open in IMG/M
3300009137|Ga0066709_103682020All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium556Open in IMG/M
3300010336|Ga0134071_10488610All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium635Open in IMG/M
3300010337|Ga0134062_10044297All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1796Open in IMG/M
3300011998|Ga0120114_1004844All Organisms → cellular organisms → Bacteria3404Open in IMG/M
3300012004|Ga0120134_1012515All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1304Open in IMG/M
3300012019|Ga0120139_1024034All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1402Open in IMG/M
3300012096|Ga0137389_10200462All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1661Open in IMG/M
3300012096|Ga0137389_11201779All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium649Open in IMG/M
3300012198|Ga0137364_10555330All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium864Open in IMG/M
3300012204|Ga0137374_10020987All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria7400Open in IMG/M
3300012208|Ga0137376_10379208All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1228Open in IMG/M
3300012208|Ga0137376_11272223All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium626Open in IMG/M
3300012209|Ga0137379_10086804All Organisms → cellular organisms → Bacteria3010Open in IMG/M
3300012209|Ga0137379_11286495All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium637Open in IMG/M
3300012211|Ga0137377_10833127All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium855Open in IMG/M
3300012211|Ga0137377_11449733All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium614Open in IMG/M
3300012285|Ga0137370_10568363All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium699Open in IMG/M
3300012350|Ga0137372_10637098All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium778Open in IMG/M
3300012353|Ga0137367_10071920All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2566Open in IMG/M
3300012363|Ga0137390_10693881All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium981Open in IMG/M
3300012532|Ga0137373_10048390All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3969Open in IMG/M
3300012922|Ga0137394_10524136All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1006Open in IMG/M
3300012923|Ga0137359_11114993All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium675Open in IMG/M
3300013764|Ga0120111_1007237All Organisms → cellular organisms → Bacteria → Proteobacteria3658Open in IMG/M
3300013770|Ga0120123_1010217All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1798Open in IMG/M
3300015371|Ga0132258_10034967All Organisms → cellular organisms → Bacteria11295Open in IMG/M
3300017656|Ga0134112_10126929All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium971Open in IMG/M
3300018075|Ga0184632_10170594All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium960Open in IMG/M
3300018433|Ga0066667_12324820All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium502Open in IMG/M
3300018482|Ga0066669_11481276All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium616Open in IMG/M
3300020579|Ga0210407_10173749All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1668Open in IMG/M
3300021432|Ga0210384_10691642All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium913Open in IMG/M
3300025910|Ga0207684_10893256All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium747Open in IMG/M
3300025922|Ga0207646_10452538All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1158Open in IMG/M
3300025922|Ga0207646_10465417All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1140Open in IMG/M
3300025922|Ga0207646_11265498All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium645Open in IMG/M
3300026010|Ga0207999_1006875All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium962Open in IMG/M
3300026328|Ga0209802_1075806All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1576Open in IMG/M
3300026331|Ga0209267_1061247All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1668Open in IMG/M
3300026527|Ga0209059_1096249All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1219Open in IMG/M
3300026530|Ga0209807_1351040All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium503Open in IMG/M
3300026550|Ga0209474_10612951All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Thermales → Thermaceae → Meiothermus → Meiothermus silvanus557Open in IMG/M
3300026551|Ga0209648_10014720All Organisms → cellular organisms → Bacteria → Proteobacteria6826Open in IMG/M
3300027748|Ga0209689_1075709All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1796Open in IMG/M
3300027857|Ga0209166_10135915All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1348Open in IMG/M
3300027875|Ga0209283_10081769All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2093Open in IMG/M
3300027875|Ga0209283_10150048All Organisms → cellular organisms → Bacteria1545Open in IMG/M
3300027986|Ga0209168_10058209All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium2051Open in IMG/M
3300031753|Ga0307477_10084279All Organisms → cellular organisms → Bacteria2205Open in IMG/M
3300031962|Ga0307479_10241550All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1782Open in IMG/M
3300032180|Ga0307471_102109539All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium709Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil26.73%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil25.74%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere9.90%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil6.93%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil5.94%
PermafrostEnvironmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost4.95%
Rice Paddy SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil4.95%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.97%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.97%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil1.98%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil1.98%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere1.98%
Groundwater SedimentEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment0.99%
Arabidopsis RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere0.99%
Arabidopsis Thaliana RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005171Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005184Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120EnvironmentalOpen in IMG/M
3300005186Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005450Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131EnvironmentalOpen in IMG/M
3300005451Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130EnvironmentalOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005471Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005537Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005575Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151EnvironmentalOpen in IMG/M
3300005891Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_304EnvironmentalOpen in IMG/M
3300005902Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_5C_80N_102EnvironmentalOpen in IMG/M
3300005903Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_0N_303EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010336Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015EnvironmentalOpen in IMG/M
3300010337Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015EnvironmentalOpen in IMG/M
3300011998Permafrost microbial communities from Nunavut, Canada - A30_35cm_6MEnvironmentalOpen in IMG/M
3300012004Permafrost microbial communities from Nunavut, Canada - A30_5cm_6MEnvironmentalOpen in IMG/M
3300012019Permafrost microbial communities from Nunavut, Canada - A7_5cm_12MEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012353Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300013764Permafrost microbial communities from Nunavut, Canada - A28_35cm_6MEnvironmentalOpen in IMG/M
3300013770Permafrost microbial communities from Nunavut, Canada - A15_5cm_18MEnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300017656Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026010Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_103 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026527Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027748Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)EnvironmentalOpen in IMG/M
3300027857Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
Ga0063356_10016438333300004463Arabidopsis Thaliana RhizosphereMRIVPRMIRFKFILALSLLAAAPAFAQSSEFGILLGGSKRLISSKDEEAGLGVSDNFKFTNSVREIYYSVSLDPGTRFRIKAGQITAPVAFQLTTATGNVRRDASKGTIDHIDGLIDYKFSETFGSTGLFAGVGLYRQTGTVSDPNLGVGRTSRQEETNYGFSGGVNGDFPLSRRAGVIVEAAYHWVNYHYKPRYVTLSGGLRFSF*
Ga0066672_1041548213300005167SoilMRIVTGMRNTAFVLAFFLASAASAVAQTSEFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTLFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITKRTGFIVEATYHWINYNYKVRYITLGGGLRFSF*
Ga0066677_1007631923300005171SoilMRKTAFLLAFLLVSTMPALAQTSEVGIEIGGSKRLISRTDQARGLGISDNFKFSNSVREAFYAVQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLTDTAIPALQRGNQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNYKVRYLTLSGGLRFSF*
Ga0066677_1014808523300005171SoilMRIVTAMRKPAFLLAILIISATSAMAQTSQIGFLIGGSKRLISRSDQAQGLGISDNFKFSNSNREIFYGVQLDPGTFFKIRAGQIEGPLAFQYNTASGTARTDIPKGKVEHVDAVVDYRFSEPFGTTGIFAGVGLYRQSGTITDTTVPTEQRGNQAETNYGLLGGINGDFPITKRTGFIAEVT
Ga0066680_1007896323300005174SoilMRNRAFLLVCLLASAAPAVAQTSEFGITVGGSKRLISRRDEAQGIGISDNFKFSNSDREIYYGVQIEPATFFKIRAGQIEAPLAFQYTNSANQKARTDIRKGKVEHADAVVDYRFSEPFGTTGLFAGVGLYRQRGTLTDTAIPVEQRGNQTETNYGFLGGVNGDFPITRRTGFIVEMTYHWIAYNYKVRYVTLGGGLRFSF*
Ga0066690_1002427643300005177SoilMRIVTGMRNTAFVLAFFLASAASAVAQTSEFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITRRTGFIVEATYHWINYNYKVRYITLGGGLRFSF*
Ga0066688_1026521313300005178SoilMRIVTAMRKFAFAPAFLLLSAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGVSDNFKFSNSVREVFYAVQLDPGTFFKVKGGQIEAPVAFQFINAAGDKTRTDVPKGKVEHIDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPDVQRGNQTETNYGFQGGVNGDFPITRRSGFIAELAYHWINYNYKVRYLTLSGGLRFSF*
Ga0066688_1061173713300005178SoilLTPTESRIVTGMRKTAFLLAFLLVSTMPALAQTSEVGIEIGGSKRLISRTDQARGLGISDNFKFSNSVREAFYAVQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLTDTAIPALQRGNQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNY
Ga0066688_1097899213300005178SoilQTSEFGILVGGSKRLISHSDQAAGIGVSDKFKFGNSVREIYYAVQLDPGTNFKIKAGQIEGPVAFQYLNGNTKARTDIAKGKVEHVDGIIDYRFSEAFGSTGLFGGVGLYRQRGSITDQAVPVEQRGSTEETNYGFQGGVNGDFPLSRRVGFVAEVAYHWINFNYKPRYIT
Ga0066671_1017315113300005184SoilMRIVTAMRKTGFLLAFLLISAAPAVAQTSQIGFLIGGSKRLISHSDQAAGRGISDNFRFSNSDRELFYAVQLDPGTFFKIRAGQIEGPVAFQYTAAGAKARTDIPKGKVEHLDAVVDYRFSEPFGTTGLFAGLGLYRQSGTLTDTAVPVDQRGSASETNYGALGGVNGDFPITRRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF*
Ga0066676_1089751613300005186SoilTRMRKIAFPLAFLLLTAMPAIAQTSEFGFLIGGSKRLISKSDQARGLGISDNFKFSNSDREIYYGVQLDPGTFFKIKAAQIEGPVAFQYQTDTGKARTDIKKGKVEHVDAVVDYRFSEPFGSTGIFAGVGLYRQRGSITDTAVPVEQRGNQTETNYGFLGGINGDFPITRRTGFIAELTYHWINYDYKVRYLTLGGGL
Ga0066675_1101747213300005187SoilNGKRRERGDRLTPTESRIVTGMRKTAFLLAFLLVSTMPALAQTSEVGIEIGGSKRLISRTDQARGLGISDNFKFSNSVREAFYAVQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLTDTAIPALQRGNQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNY
Ga0066682_1038281313300005450SoilMRIVPRMFRSILFAALSLLVAASALAQTSELGVLFGGSKRLISHSDEAAGLGISDNFKFGNSVREMYYSLELDPGTRFKIKAGEIEAPVAFQFATASGKVRSDLPKGKVEHADALIDYRFSEAFGSTGIFAGVGLYRQRGNVTDDRVPADLRGQQEETNYGFSGGVNGDFPITRRSGVIVEATYHWINYHYRPRYVTLSAGLRFSF*
Ga0066681_1037862013300005451SoilMRIVTAMRKPAFLLAILFIPATSAMAQTSQIGFLIGGSKRLISRSDQAQGLGISDNFKFSNSNREIFYGVQLDPGTFFKIRAGQIEGPLAFQYNTGSGTARTDIKKGKVEHIDAVVDYRFSEPFGTTGIFAGLGLYRQSGTIIDTAVPTEQRGNQAETNYGLLGGVNGDFPITKRTGFIAEVTYHWINYNFKVRYLTLSGGLRFSF*
Ga0070706_10002170023300005467Corn, Switchgrass And Miscanthus RhizosphereMRIVTGMRKNTLVLAILFASAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDHFKFSNSVREVFYAVQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDAAVPAGQRGNQTETNYGFQGGVNSDFPITRRAGFIAELAYHWINYHYKVRYLTLSGGLRFSF*
Ga0070707_10029806333300005468Corn, Switchgrass And Miscanthus RhizosphereMRIVTAMRNTAFLLAFFLLSAAPAVAQTSQIGFLIGGSKRLISHSDQAKGLGISDKFKFSNSDREVYYAVQLDPGTFFKVRAGQIEGPLAFQFMNGGTKARTDIPKGKVEHIDAVVDYRFSEPFGTTGLFAGLGLYRQSGTLTDAAVPVEQRGRTSETNYGALGGVNGDFPITKRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF*
Ga0070707_10120679313300005468Corn, Switchgrass And Miscanthus RhizosphereSTRIVKPMRKTAFALALLAFSAMPAAAQTSEIGFLLGGTKRLISHSDQAKGLGISDSFKFSNSDREIYYGIQLDPGTWFKVKAAQIEGPLAFQYSTASGGKARTDIKKGKIEHVDAVIDYRFSEPFGATGLFAGVGLYRQRGSLTDAAIPADQRGTQTETNYGFLGGVNGDFPITRRTGFIVEATYHWIAYNYKVRYVTLGGGLRFSF*
Ga0070707_10124147613300005468Corn, Switchgrass And Miscanthus RhizosphereMRIVTGMRKNTLVLAILFASAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDHFKFSNSVREVFYAVQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPAGQRGNQTETNYGFQGGVNSDFPITRRAGFIAELAYHWINYHYKVRYLTLSGGLRFSF*
Ga0070698_10014844033300005471Corn, Switchgrass And Miscanthus RhizosphereMRIVTGMRKNTLVLAILFASAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDHFKFSNSVREVFYAVQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDISKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDAAVPAGQRGNQTETNYGFQGGVNSD
Ga0070698_10116868713300005471Corn, Switchgrass And Miscanthus RhizosphereMRIVTRMRKTGFALAFLLLLAVPAFAQTSEFGLLIGGSKRLISHTDQAQGLGISDHFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGSRARTDVPKGKVEHVDALIDYRFSEAFGSTGLFAGVGLYRQRATLSDLSIPAVQRGDQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0070741_1004728953300005529Surface SoilMRIVPGMTKPAWMLIILLLAAAPLAAQTNEVGIMIGGTKRLISHTDQARGIGVSDNFKFSNSDREIFWGTQLDPGSFFKIQGGQISGPAAFQYTNAGGQKARTDISKGTIEHVDGTVDYRFSEPFGSTGLFAGVGLYRQRGSITDTAVPTEQRGNQTETNYGFVGGVNTDFPITRRTGFMLELAYHWINFNYKVRYLTLGGGLRFSF*
Ga0070741_1035828823300005529Surface SoilMRIVTAMRTTLFLLAISLVCAAPALAQTSEVGFLIGGTKRLISHTDQANGLGISDNFKFSNSDREIFYGVQLDPGTFFKVRLGQIEGPLAFQYMPAGGGHARTDIRKGTIEHADAVIDYRFSEPFGTTGLFAGLGLYRQRGTLSDQAVPVEQRGVQTETNYGVLGGVNGDFPITKSVGFIAELTYHWINYNYKVRYLTLGG
Ga0070735_1001475933300005534Surface SoilMRIVTAMRTSALFLPIFLVCAAPAFAQTSEVGILIGASKRLISHTDQANGLGISDNFKFSNSDREIFYAVQLDPGTFFKIRVGQIEGPLAFQYTPAAGGRARTDISKGTVEHADAVVDYRISEPFGTTGLFAGLGLYRQRGTLSDQAVPVEQRGVQTETNYGVLGGVNGDFPITKHTGFIAELTYHWINYNYKVRYLTLGGGLRFSF*
Ga0070730_1020249623300005537Surface SoilMGIVPRMFRTTFAAALALLVSAPAFAQTSEFGVLVGGSKRLISHSDQAAGLGISDNFKLSNSVRELFYSVEIDPGTRFKIKAGQITAPVAFQFTSPTGPKRTDVAKGTVDHVDGVIDYRFSEPFGSTGLFAGIGLYRQSGTVTDTAVPVEQRGRAEETNYGFSGGVNGDFPMTRRSGIVVEVTYHAINYHYKVRYVTATGGLRFSF*
Ga0066701_1017620013300005552SoilLNTTKSQRIVTGMRNRAFLLVCLLASAAPAVAQTSEFGITVGGSKRLISRRDEAQGIGISDNFKFSNSDREIYYGVQIEPATFFKIRAGQIEAPLAFQYTNSANQKARTDIRKGKVEHADAVVDYRFSEPFGTTGLFAGVGLYRQRGTLTDTAIPVEQRGNQTETNYGFLGGVNGDFPITRRTGFIVEMTYHWIAYNYKVRYVTLGGGLRFSF*
Ga0066701_1025910213300005552SoilMRIVTAMRKFAFALAFLLLSAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGISDNFKFSNSVREVFYAVQLDPGTFFKVKGGQIEAPVAFQFINGAGVKTRTDVPKGKVEHIDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGNQTETNYGFQGGVNGDFPITRRSGFIAELAYHWINYNYKVRYLTLS
Ga0066704_1021225623300005557SoilMRIVTAMRKFAFAPAFLLLSAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGVSDNFKFSNSVREVFYAVQLDPGTFFKVKGGQIEAPVAFQFINGAGVKTRTDVPKGKVEHIDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDVAIPELQRGNQTETNYGLQGGVNGDFPITRRSGFIAELVYHWINYNYKVRY
Ga0066704_1078134713300005557SoilILFAATPAFAQTSEFGILLGGSKRLISHSDQAAGIGVSDNFKFGNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQYTTPTGTARTDSAKGTVEHVDGLIDYRFSEAFGSTGLFAGVGLYRQRGTITDQAVPLVQRGTQTETNYGFQGGVNGDFPLTRRVGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0066700_1004893413300005559SoilIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITRRTGFIVEATYHWINYNYKVRYITLGGGLRFSF*
Ga0066700_1042325223300005559SoilVIRMIKNLLGLMLLLATPALAQTSQFGILIGGSKRLISHSDQAAGLGISDNFRFSNSVREIYYAVQLDPGTNFKIKAGQIEGPVAFQYATTTGTARTDAAKGTIEHVDGLIDYRFSEVFGSTGLFAGVGLYRQRGTITDQAVPLVQRGTQTETNYGFQGGVNGDFPLTRRVGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0066705_1030552913300005569SoilMRIVTGMRNTAFVLAFFLASAASAVAQTSEFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITKRTGFIVEA
Ga0066702_1016064223300005575SoilMRIVTAMRNTALLLAFLLISAAPAVGQTSQIGFLIGGSKRLISHSDQAAGRGISDNFRFSNSDRELFYAVQLDPGTFFKIRAGQIEGPVAFQYTAAGAKARTDIPKGKVEHLDAVVDYRFSEPFGTTGLFAGLGLYRQSGTLTDTAVPVDQRGSTSETNYGALGGVNGDFPITRRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF*
Ga0075283_105374823300005891Rice Paddy SoilLMIGGSKRLISRSDQAAGIGVSDNFRFSNSDREAFYGVQVDPGTFFKVQIGQIEGPLAFQYTTPEGATARTDIRKGTLEHVSGTIDYRFSEAFGSTGLFAGVGLYRQHGSVTDTAVPDAQRGNESETNYGFVGGVNADFPITRRTGFIAELAYHWVNFHYKVRYLTLGGGLRFSF*
Ga0075273_1000986713300005902Rice Paddy SoilFGLMIGGSKRLISRSDQAAGIGVSDNFRFSNSDREAFYGVQVDPGTFFKVQIGQIEGPLAFQYTTPEGATARTDIRKGTLEHVSGTIDYRFSEAFGSTGLFAGVGLYRQHGSVTDTAVPDAQRGNESETNYGFVGGVNADFPITRRTGFIAELAYHWVNFHYKVRYLTLGGGLRFSF*
Ga0075279_1003839113300005903Rice Paddy SoilPLAAQNTSEFGLMIGGSKRLISRSDQAAGIGVSDNFRFSNSDREAFYGVQVDPGTFFKVQIGQIEGPLAFQYTTPEGATARTDIRKGTLEHVSGTIDYRFSEAFGSTGLFAGVGLYRQHGSVTDTAVPDAQRGNETETNYGFVGGVNADFPITRRTGFIAELAYHWVNFHYKVRYLTLGGGLRFSF*
Ga0075279_1008701213300005903Rice Paddy SoilLLIAAPAFAQVSEFGVLVGGSKRLISHSDEAAGLGISDNFKFSNSVREIYYSIMLDPGTRFRIKAGQITAPVAFQFTAADGSNIRSDLQKGKVEHIDGIIDYRFPEAFGATGIFAGIGLYRQSGTVTDENVPADQRGRQEETNYGFSGGVNGDFPLSRHSGVIVEATYHWINYHYRPRYITLTGGLR
Ga0066652_10127473013300006046SoilKTGFALAFLLLATTPAFAQTSEFGLLIGGSKRLISRSDQAQGLGISDNFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGNAARTDVPKGKVEHVDALIDYRFSEPFGATGLFAGVGLYRQRGTLVDLSIPAVQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0066653_1032026413300006791SoilIQQVGFLLGGSKRLISQTDQSQGIGVSDNFSFSNSNRELFYGVQLDPGTWFKLRFGQIEGPVAFAYNTAGGKARTDIAKGQLEDIATVIDYRFSEPFGSTGLFGGLGYYRQNGTITDPAVPVEQHGRVSETNYGFLGGVNTDFPITKRTGFMAELTYHWINMDYKVRYLTLSGGLRFSF*
Ga0075425_10095347713300006854Populus RhizosphereMRIVTGMRNCAFLLVFLLASAAPAVAQTSEFGLTIGGSKRLISRRDQAQGIAVSDNFKFSNSDREIYYGVQIEPGTFFKIRAGQIEAPLAFQYTTAANEKARTDIRKGKVEHVDAVVDYRFSEPFGTTGLFAGVGLYRQRGTLTDKAIPLEQRGNQTETNYGFLGGVNGDFPLTRRTGFIVEAAYHWIAYNYKVRFVTLGGGL
Ga0075426_1117651913300006903Populus RhizosphereSEVGILIGASKRLISHTDQANGLGISDNFKFSNSDREIFYAVQLDPGTFFKIRVGQIEGPLAFQYTPAGGGRARTDINKGTVEHADAVVDYRISEPFGTTGLFAGLGLYRQRGTLRDQAVPVEQRGVQTETNYGVLGGVNGDFPITKHTGFIAELTYHWINYNYKVRYLTLGGGLRFTF*
Ga0099791_1028764913300007255Vadose Zone SoilMRKTGFSLAFLLLFTVPAIAQTSEFGLLIGGSKRLISHTDQARGLGISDNFKFSNSNREVYYAIQVDPGTWFRIKGGQIEGPVAFRYTTAVGTPARTDVPKGKIEHVNALIDYRFSEPFGSTGLFAGVGLYRQQATLNDVAIPVVQRGNQTETNYGFQGGVNGDFPMSRRTGFIAELAYHWINYNYKVRYVTLSGGVRFS
Ga0066710_10032440623300009012Grasslands SoilMRIVTAMRTFAFALAFLLLSAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGISDNFKFSNSVREVFYAVQLDPGTFFKIKGGQIEAPVAFQFLNAAGDKTRTDVPKGKVEHVDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGNQTETNYGFQGGVNGDFPITRRSGFIAELAYHWINYNYKVRYLTLSGGLRFSF
Ga0066710_10177001723300009012Grasslands SoilMRIVTGMRNTAFVLAFFLASAVPAVAQTSQFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGRARTDLPKGKVEHIDAVVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPAEQRGVQTETNYGFLGGVNGDFPITKRTGFIVEATYHWINYNYKVRYITLGGGLR
Ga0066710_10468631813300009012Grasslands SoilLFAFSAMPAAAQTSQIGFLLGGTKRLISHTDQARGLGISDNFKFSNSDREIYYGIQLDPGTWFKIKGAQIEGPLAFQYTNAAGAKARTDIKKGKIEHVDAVIDYKFSEPFGSTGLFAGIGLYRQRGSLTDAAIPSDQRGTQTETNYGFLGGVNGDFPITTRTGFIVEATY
Ga0099829_1089869813300009038Vadose Zone SoilMRIVTGMRKNNLALAILLVSAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDNFKFSNSVREVFYAIQLDPGTFFKIKGGQIEGPAAFQYRADNGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPAGQRGNQTETNYGFQGGVNSDFPITRRTGFIAELAYHWINYHYKVRYLTLSGGFRFSF*
Ga0099830_1012181423300009088Vadose Zone SoilMRIVTGMRKNALVLAILLVSAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDNFKFGNSVREVFYAIQLDPGTFFKIKAGQIEGPAAFQYRAENGGLARTDISKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPVGQRGNQTETNYGFQGGVNSDFPITRRTGFIAELAYHWINYHYKVRYLTLSGGLRFSF*
Ga0099828_1003992133300009089Vadose Zone SoilMRKFVFALAFLLLSAVPAMAQTSEFGLTIGGSKRLISHRDQAKGLGVSDSFKFSNSVRELFYAVQLDPGTFFKIKGGQIEAPVAFQFVNAAGAKARTDVSKGKVEHIDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGDQTETNYGFQGGVNGDFPITRRTGFIAELAYHWINYNYKVRYVTLSGGLRFSF*
Ga0099828_1007012723300009089Vadose Zone SoilMRIVTGMRKNALVLAILLVSAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDNFKFSNSVREVFYAIQLDPGTFFKIKAGQIEGPAAFQYRAENGGLARTDISKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPVGQRGNQTETNYGFQGGVNSDFPITRRTGFIAELAYHWINYHYKVRYLTLSGGLRFSF*
Ga0099828_1089874313300009089Vadose Zone SoilMRKIALPLACLLLSAMPAMAQTSEFGLLIGGSKRLISHSDQASGLGISDNFKFSNSDREVYYGVQLDPGTFFKIKAGQIEGPVAFQYTTAAGNARTDVKKGKVEHVDAVVDYRFSEPFGSTGIFAGVGLYRQRGTLSDTAIPVEQRGNQTEANYGFLGGINGDFPITRRTGFIVEVTYHWINYNYKVRYVTLGGGLRFAF*
Ga0099828_1092682913300009089Vadose Zone SoilMKTAFALLMLAAAPAFAQTSQFGILIGGSKRLISHSDQAAGIGVSDKFKFGNSVREIYYTVQLDPGTNFKIKAGQIEGPVAFQYLNGNTKARTDIAKGKVEHVDGIVDYRFSEAFGSTGLFGGVGLYRQRGSITDQAVPVEQRGSTEETNYGFQGGVNGDFPLSRRIGFVAEV
Ga0066709_10368202013300009137Grasslands SoilDEAKGLGISDNFKFSNSVREVFYAVQLDPGTFFKIKGGQIEAPVAFQFLNAAGDKTRTDVPKGKVEHVDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGNQTETNYGFQGGVNGDFPITRRSGFIAELAYHWINYNYKVRYLTLSGGLRFSF*
Ga0134071_1048861013300010336Grasslands SoilMRKIAFPLAFLLLTAMPAIAQTSEFGFLIGGSKRLISKSDQARGLGISDNFKFSNSDREIYYGVQLDPGTFFKIKAAQIEGPVAFQYQTDTGKARTDIKKGKVEHVDAVVDYRFSEPFGSTGIFAGVGLYRQRGSITDTAVPVEQRGNQTETNYGFLGGINGDFPITRRTGFIAELTYHWIN
Ga0134062_1004429733300010337Grasslands SoilMAQIQQVGFLLGGSKRLISQTDQSQGIGVSDNFSFSNSNRELFYGVQLDPGTWFKLRFGQIEGPVAFAYNTAGGKARTDIAKGQLEHIDTVIDYRFSEPFGSTGLFGGLGYYRQNGTITDPAVPVEQHGRVSETNYGFLGGVNTDFPITKRTGFMAELTYHWINMDYKVRYLTLSGGLRFSF*
Ga0120114_100484443300011998PermafrostMKKLLLLMLFAATPAFAQTSEFGILLGGSKRLISHSDQAAGLGVSDNFRFSNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQFTTPTGTARTDSAKGTVEHVDGIIDYRFSEAFGSTGLFAGVGLYRQRGTINDQAVPLEQRGTQTETNYGFQGGVNGDFPLTRRAGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0120134_101251523300012004PermafrostMEKLLLLMLFAATPAFAQTSEFGILLGGSKRLISHSDQAAGLGVSDNFRFSNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQYTTPTGTARTDSAKGTVEHVDGIIDYRFSEAFGSTGLFAGVGLYRQRGTISDQAVPLVQRGTLTETNYGFQGGVNGDFPLTRRVGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0120139_102403433300012019PermafrostMKKLLLLMLFAATPAFAQTSEFGILLGGSKRLISHSDQAAGLGVSDNFRFSNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQYTTPTGTARTDSAKGTVEHVDGIIDYRFSEAFGSTGIFAGVGLYRQRGTISDQAVPLVQRGTLTETNYGFQGGVNGDFPLTRRAGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0137389_1020046223300012096Vadose Zone SoilMRKTGFSLAFLLLFTVPAIAQTSEFGLLIGGSKRLISHTDQARGLGISDSFKFSNSNREVYYAIQVDPGTWFRIKGGQIEGPVAFRYTTAAGTPARTDVPKGKIEHVNALIDYRFSEPFGSTGLFAGVGLYRQQATLNDVAIPVVQRGNQTETNYGFQGGVNGDFPMSRRTGFIAELAYHWINYNYKVRYVTLSGGVRFSF*
Ga0137389_1120177913300012096Vadose Zone SoilMRKNALVLAILLVSAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDNFKFSNSVREVFYAIQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPVGQRGNQTETNYGFQGGVNSDFPITRRTGFIAELTYHWINYHYKVRYLTLSGGLR
Ga0137364_1055533013300012198Vadose Zone SoilMRKPAFLLAILFIPATSAMAQTSQIGFLIGGSKRLISRSDQAQGLGISDNFKFSNSNREIFYGVQLDPGTFFKIRAGQIEGPLAFQYNTGSGTARTDLPKGKVEHIDAVVDYRFSEPFGTTGIFAGVGLYRQSGTITDTAVPTEQRGNQAETNYGLLGGINGDFPITKRTGFIAEVTYHWINYNFKVRYLTLSGGLRFSF*
Ga0137374_1002098753300012204Vadose Zone SoilMRNTAFLLAFLIVSAAPAVAQTSQVGLLIGGTKRLISHTDQARGLGISDNFKFSNSDREIFYGVQLDPGTWFKIRGGQIEGPVAFQYTTGGGTARTDIPKGKVEHIDAVIDYRFSEPFGTTGLFAGAGLYRQRATLTAAAIPVEQRGVQTETNYGFLGGVNGDFPITRRTGFIVEATYHWINYNYKVRYVTLGGGLRFTF*
Ga0137376_1037920823300012208Vadose Zone SoilMRKTGFALAFLLLSTMPAIAQTSEFGLLIGGSKRLISHSDQAQGLGISDNFKFSNSNREIFYAIQVDPGTFFRIKGGQIEGPVAFQFTDAAGHRARTDVPKGKVEHVDALIDYRFSEAFGSTGLFAGVGLYRQRATLNDLAVPAVQRGNQTETNYGFQGGVNGDFPMTRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0137376_1127222313300012208Vadose Zone SoilMTKTGFALALLLLSTTPAFAQTSEFGLLIGGSKRLISHTVQARGLGISDNFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGIATRTDVPKGKVEHVDALIDYRFSEAFGATGLFAGVGLYRQRATLSDLAIPAVQRGDQTETNYGFSGGVNGDFPI
Ga0137379_1008680433300012209Vadose Zone SoilMRNTAFVLAFFLASAAPAVAQTSEFGVLIGGSKRLISRRDEAKGVGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAVVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPAEQRGVQTETNYGFLGGVNGDFPITKRTGFIVEATYHWINYNYKVRYITLGGGLRFSF*
Ga0137379_1128649513300012209Vadose Zone SoilMTKLAFALAFLLLSTMPAIAQTSEFGILIGGSKRLISHSDEARGLGISDSFKFSNSVREIYYAVQVDPGTFFRIKGGQIEGPTAFQFTTAAGTRARTDVPKGHIEHIDALIDYRFSEPFGATGLFAGVGLYRQRATLSDLAVPEVQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLR
Ga0137377_1083312723300012211Vadose Zone SoilMTKLAFALAFLLLSTMPAIAQTSEFGILIGGSKRLISHSDEARGLGISDSFKFSNSVREVYYAVQVDPGTFFRIKGGQIEGPTAFQFTTAAGARARTDVPKGHIEHIDALIDYRFSEPFGATGLFAGVGLYRQRATLSDLAVPEVQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0137377_1144973313300012211Vadose Zone SoilRTLYFAESDSMRIVTGMRKTAFLLAFLLVSTMPALAQTSEVGIEIGGSKRLISRTDQARGLGISDNFKFSNSVREVFYAIQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLSDTAIPAVQRGDQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNYKVR
Ga0137370_1056836323300012285Vadose Zone SoilGFLIGGSKRLISRSDQAQGLGISDNFKFSNSNREIFYGVQLDPGTFFKIRAGQIEGPLAFQYNTGSGTARTDIPKGKVEHIDAVVDYRFSEPFGTTGIFAGLGLYRQTGTITDTAVPTEQRGNQAETNYGLLGGVNGDFPITKRTGFIAEVTYHWINYNFKVRYLTLSGGLRFSF*
Ga0137372_1063709823300012350Vadose Zone SoilMRKTGFALAFLLLATTPALAQTSEFGLLIGGSKRLISRSDQAQGLGISDNFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGNAARTDVAKGKVEHIDALIDYRFSEPFGATGLFAGVGLYRQRATLVDLSIPAVQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0137367_1007192023300012353Vadose Zone SoilMRNTAFLLAFLIVSAAPAVAQTSQVGLLIGGTKRLISHTDQARGLGISDNFKFSNSDREIFYGVQLDPGTWFKIRGGQIEGPVAFQYTTGGGTARTDIPKGKVEHIDAVIDYRFSEPFGTTGLFAGAGLYRQRATLADAAIPVEQRGVQTETNYGFLGGVNGDFPITRRTGFIVEATYHWINYNYKVRYVTLGGGLRFTF*
Ga0137390_1069388113300012363Vadose Zone SoilMNIRKPKPGQLPEREVASEPIYHARRRFLQAAGIGVSDKFKFGNSVREIYYAVQLDPGTNFKIKAGQIEGPVAFQYLNGNTKARTDIAKGKVEHVDGIVDYRFSEAFGSTGLFGGVGLYRQRGSITDQAVPVEQRGSTEETNYGFQGGVNGDFPLSRRIGFVAEVAYHWINLHYKTRYITLSGGFRFSF*
Ga0137373_1004839023300012532Vadose Zone SoilMRNTAFLLAFLIVSAAPAVAQTSQVGLLIGGTKRLISHTDQARGLGISDNFKFSNSDREIFYGVQLDPGTWFKIRGGQIEGPVAFQYTTGGGTARTDIPKGKVEHIDAVIDYRFSEPFGTTGLFAGAGLYRQRATLTAAAIPVEQRGVQTATNYGFLGGVNGDFPITRRTGFIVEATYHWINYNYKVRYVTLGGGLRFTF*
Ga0137394_1052413623300012922Vadose Zone SoilMRKTGFALAFLILLTVPAFAQTSEFGLLIGGSKRLISHTDEAQGLGISDNFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGTKARTDVPKGKVEHVDALIDYRFSEPFGSTGLFAGVGLYRQRGTLSDLSIPPVQRGDQTETNYGFSGGVNGDFPITR
Ga0137359_1111499313300012923Vadose Zone SoilMRKTGFTLAFLLLSTVPAMAQTSEFGILIGGSKRLISHTDEAQGLGISDSFKFSNSVREIYYAVQVDPGTFFRIKGGQIEGPVAFQYMTAAGTRARTDVPKGHIEHVDALVDYRFSEPFGATGLFAGVGLYRQRATLSDLAIPEGQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYNYKVRYLTLSGGLRFQF*
Ga0120111_100723753300013764PermafrostVTGMKKLLLLMLFAATPAFAQTSEFGILLGGSKRLISHSDQAAGLGVSDNFRFSNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQFTTPTGTARTDSAKGTVEHVDGIIDYRFSEAFGSTGLFAGVGLYRQRGTINDQAVPLEQRGTQTETNYGFQGGVNGDFPLTRRAGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0120123_101021743300013770PermafrostGSKRLISHSDQAAGLGVSDNFRFSNSVREIFYAVQLDPGTNFKIKAGQIEGPVAFQFTTPTGTARTDSAKGTVEHVDGIIDYRFSEAFGSTGLFAGVGLYRQRGTISDQAVPLVQRGTLTETNYGFQGGVNGDFPLTRRAGFIAEVAYHWINYHYKPRYVTLTGGLRLSF*
Ga0132258_10034967103300015371Arabidopsis RhizosphereMAQIQQVGFFLGGSKRLISQTDQSQGIGVSDNFSFSNSNRELFYGVQLDPGTWFKIRLGQIEGPVAFAYNTAGGKARTDIAKGQVEHIDTVIDYRFSEPFGTTGLFGGIGYYRQNGTISDPAVPVEQHGRVSETNYGLLGGVNTDFPITKRTGFMAELAYHWINMEYKVRYLTLSGGLRFSF*
Ga0134112_1012692923300017656Grasslands SoilAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGISDNFKFSNSVREVFYAVQLDPGTFFKIKGGQIEAPVAFQFLNAAGDKTRTDVPKGKVEHVDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGNQTETNYGFQGGVNGDFPITRRSGFIAELAYHWINYNYKVRYLTLSGGLRFSF
Ga0184632_1017059423300018075Groundwater SedimentMRIVTRMRKTGFALAFLLLLAVPAFAQTSEFGLLIGGTKRLISRTDQAQGLGISDNFKFSNSNREVYYAIQLDPGTWFRIKGGQIEGPVAFQFTNAAGNPARTDVPKGKVEHVDALIDYRFSEPFGATALFAGVGLYRQRATLNDLAIPAAQRGNQTETNYGFSGGVNGDFPITRRTGFIAELAYHWINYDYKVRYLTLSGGLRFQF
Ga0066667_1232482013300018433Grasslands SoilDDGIEIGGSKRHIARTDQARGLGISDNFKFSNSVREAFYAVQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLTDTAIPALQRGDQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNYKVRY
Ga0066669_1148127613300018482Grasslands SoilLLSAMPAIAQTSQFGLLIGGSKRLISRRDEAKGLGISDNFKFSNSVREVFYAVQLDPGTNFKIKAGQIEGPVAFQYLNGNTKARTDIAKGKVEHVDGIIDYRFSEVFGSTGLFGGVGLYRQRGSITDQAVPVEQRGSTEETNYGFQGGVNGDFPLSRRVGFVAEVAYHWINFNYKPRYITLSGGFRFSF
Ga0210407_1017374923300020579SoilMRIVTAMRNTGVLLAFLLLSAAPAVAQTSEIGFLIGGSKRLISHSDQAAGLGISDHFKFSNSDRELFYAVQLDPGTFFKIRAGQIEGPLAFQYTTPGGKARTDIPKGRVEHVDAVIDYRFSEPFGTTGIFAGVGLYRQSGTLTDAAIPVEQRGTTSETNYGALGGVNGDFPITKRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF
Ga0210384_1069164223300021432SoilVRYMMKSGFALAILLLSTVPAFAQTSEFGLLIGGSKRLISHSDQAQGLGISDNFKFSNSVREVYYAVQLDPGTWFRIKGGQIEGPVAFQFTNAAGTKARTDVPKGSIEHVDALIDYRFSEAFGATGLFAGVGLYRQRATLSDLSIPEVQRGNQTETNFGFSGGVNGDFPITRHTGFIAELAYHWINYNYKVRYLTLGGGLRFTF
Ga0207684_1089325623300025910Corn, Switchgrass And Miscanthus RhizosphereMRIVTGMRKNTLVLAILFASAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDHFKFSNSVREVFYAVQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDAAVPAGQRGNQTETNYGFQGGVNSD
Ga0207646_1045253813300025922Corn, Switchgrass And Miscanthus RhizosphereMRIVTAMRNTAFLLAFFLLSAAPAVAQTSQIGFLIGGSKRLISHSDQAKGLGISDKFKFSNSDREVYYAVQLDPGTFFKVRAGQIEGPLAFQFMNGGTKARTDIPKGKVEHIDAVVDYRFSEPFGTTGLFAGLGLYRQSGTLTDAAVPVEQRGRTSETNYGALGGVNGDFPITKRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF
Ga0207646_1046541713300025922Corn, Switchgrass And Miscanthus RhizosphereMRIVTGMRKNTLVLAILFASAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDHFKFSNSVREVFYAVQLDPGTFFKIKGGQIEGPAAFQYRAENGGLARTDVSKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPAGQRGNQTETNYGFQGGVNSDFPITRRAGFIAELAYHWINYHYKVRYLTLSGGLRFSF
Ga0207646_1126549813300025922Corn, Switchgrass And Miscanthus RhizosphereWRYQPLTANRQPLKGTALNQRTSTRIVTSMKPAFALLLLAAAPVFAQTSEFGILIGGSKRLISHSDQAAGIGVSDKFKFGNSVREIYYAVQLDPGTNFKIKAGQIEGPVAFQYLNGNTKARTDIAKGKVEHVDGIVDYRFSEAFGSTGLFGGVGLYRQRGSITDQAVPVEQRGSTEETNYGFQGGVNGDFPLSRRVGFVAEVAYHWINFKYKPR
Ga0207999_100687513300026010Rice Paddy SoilMTKPALTLTLVLLAAMPLAAQNTSEFGLMIGGSKRLISRSDQAAGIGVSDNFRFSNSDREAFYGVQVDPGTFFKVQIGQIEGPLAFQYTTPEGATARTDIRKGTLEHVSGTIDYRFSEAFGSTGLFAGVGLYRQHGSVTDTAVPDAQRGNESETNYGFVGGVNADFPITRRTGFIAELAYHWVNFHYK
Ga0209802_107580623300026328SoilMRNRAFLLVCLLASAAPAVAQTSEFGITVGGSKRLISRRDEAQGIGISDNFKFSNSDREIYYGVQIEPATFFKIRAGQIEAPLAFQYTNSANQKARTDIRKGKVEHADAVVDYRFSEPFGTTGLFAGVGLYRQRGTLTDTAIPVEQRGNQTETNYGFLGGVNGDFPITRRTGFIVEMTYHWIAYNYKVRYVTLGGGLRFSF
Ga0209267_106124733300026331SoilMRIVTGMRNTAFVLAFFLASAASAVAQTSEFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITRRTGFIVEATYHWINYNYKVRYITLGGGLRFSF
Ga0209059_109624923300026527SoilMRIVTAMRNTALLLAFLLISAAPAVGQTSQIGFLIGGSKRLISHSDQAAGRGISDNFRFSNSDRELFYAVQLDPGTFFKIRAGQIEGPVAFQYTAAGAKARTDIPKGKVEHLDAVVDYRFSEPFGTTGLFAGLGLYRQSGTLTDTAVPVDQRGSTSETNYGALGGVNGDFPITRRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF
Ga0209807_135104013300026530SoilSAASAVAQTSEFGVLIGGSKRLISRRDEKQGIGVSDNFKFSNSDREIFYGVELDPGTFFKIRAGQIEAPVAFQYSTASGTARTDLPKGKVEHIDAAVDYRFSEPFGSTGLFAGVGLYRQRGTLTDTAIPTEQRGVQTETNYGFLGGFNGDFPITKRTGFIVEATYHW
Ga0209474_1061295113300026550SoilLGLLIGGSKRLISRSDQAQGLGISDNFKFSNSNREVYYAIQLDPGTFFRIKGGQIEGPVAFQFTNAAGTRARTDVPKGKVEHVDALIDYRFSEAFGATGLFAGLGLYRQRGTLSDLSIPAVQRGDQTETNYGFSGGVNGDFPITRRTGFIATGPDAHGLYPSRDARYLYVTNRGNGSISVIAFAA
Ga0209648_1001472083300026551Grasslands SoilMRIVTDMRKTGFALAFLLFSTVPAFAQTSEFGLLIGGSKRLISHTDQARGIGVSDSFKFSNSVREVYYAVQLDPGTFFRIKGGQIEGPVAFQFTNAAGAHARTDVPKGKVEHIDALIDYRFSEAFGATGLFAGVGLYRQRATLNDLSVPVVQRGNQTETNYGFQGGVNGDFPITRRTGFIAELAYHWINYNYKVRYVTLSGGLRFSF
Ga0209689_107570923300027748SoilMRKTAFLLAFLLVSTMPALAQTSEVGIEIGGSKRLISRTDQARGLGISDNFKFSNSVREAFYAVQLDPGTFFKIKGGQIEGPVAFQYTSGTAKARTDVPKGTIEHIDGLIDYRFSEPFGSTGLYAGVGLYRQRATLTDTAIPALQRGNQTETNYGLQGGVNTDFPITRRAGFIAELAYHWINYNYKVRYLTLSGGLRFSF
Ga0209166_1013591523300027857Surface SoilMGIVPRMFRTTFAAALALLVSAPAFAQTSEFGVLVGGSKRLISHSDQAAGLGISDNFKLSNSVRELFYSVEIDPGTRFKIKAGQITAPVAFQFTSPTGPKRTDVAKGTVDHVDGVIDYRFSEPFGSTGLFAGIGLYRQSGTVTDTAVPVEQRGRAEETNYGFSGGVNGDFPMTRRSGIVVEVTYHAINYHYKVRYVTATGGLRFSF
Ga0209283_1008176923300027875Vadose Zone SoilMRIVTAMRKFVFALAFLLLSAVPAMAQTSEFGLTIGGSKRLISHRDQAKGLGVSDSFKFSNSVRELFYAVQLDPGTFFKIKGGQIEAPVAFQFVNAAGAKARTDVSKGKVEHIDAIIDYRFSEPFGSTGLFAGAGLYRQRATLSDLAIPEVQRGDQTETNYGFQGGVNGDFPITRRTGFIAELAYHWINYNYKVRYVTLSGGLRFSF
Ga0209283_1015004833300027875Vadose Zone SoilMRIVTGMRKNALVLAILLVSAMPALAQTSQFGLTIGGSKRLISHTDQARGIGVSDNFKFSNSVREVFYAIQLDPGTFFKIKAGQIEGPAAFQYRAENGGLARTDISKGTIEHIDGLIDYRFSEAFGSTGLFAGAGLYRQRGNLTDTAVPVGQRGNQTETNYGFQGGVNSDFPITRRTGFIAELAYHWINYHYKVRYLTLSGGLRFSF
Ga0209168_1005820913300027986Surface SoilMRIVTAMRTSALFLPIFLVCAAPAFAQTSEVGILIGASKRLISHTDQANGLGISDNFKFSNSDREIFYAVQLDPGTFFKIRVGQIEGPLAFQYTPAAGGRARTDISKGTVEHADAVVDYRISEPFGTTGLFAGLGLYRQRGTLSDQAVPVEQRGVQTETNYGVLGGVNGDFPITKHTGFIAELTYHWINYNYKVRYLTLGGGLRFSF
Ga0307477_1008427933300031753Hardwood Forest SoilMRIVTAMRNTGFLLAFLLLSAAPAVAQTSEIGFLIGGSKRLISRSDQAAGLGISDHFKFSNSDRELFYGVQLDPGTFFKIRAGQIEGPLAFQYTTASGKARTDIPKGRVEHVDAVIDYRFSEPFGTTGLFAGVGLYRQSGTLTDAAIPAEQRGPTSETNYGALGGVNGDFPITKRTGFIAELTYHWINYNYKVRY
Ga0307479_1024155023300031962Hardwood Forest SoilMRIVTAMRNTGFLLAFFLVSAAPAVAQTSEIGFLIGGSKRLISRSDQAAGLGISDHFKFSNSDRELFYAVQLDPGTFFKIRAGQIEGPLAFQYTTASGKARTDIPKGKVEHVDAVIDYRFSEPFGTTGLFAGVGLYRQSGTLTDAAIPAEQRGPTSETNYGALGGVNGDFPITKRTGFIAELTYHWINYNYKVRYLTLAGGLRFSF
Ga0307471_10210953913300032180Hardwood Forest SoilAFLLAFLLASAAPAVAQTSEFGITIGGSKRLISKSDQAQGIGISDHFKFSNSDREVYYGVQVEPGTFFKIRAGQITAPAAFQYTNASNEKARTDIPKGKVEHVDAVVDYRFSEPFGSTGIFAGVGLYRQRGTLTDMAIPVEQRGNQTETNYGVLGGVNGDFPITRRTGFIVEAVYHWINYAYKVRYVTLGGGLRISF


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.