NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F043873

Metagenome / Metatranscriptome Family F043873

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F043873
Family Type Metagenome / Metatranscriptome
Number of Sequences 155
Average Sequence Length 136 residues
Representative Sequence REKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Number of Associated Samples 107
Number of Associated Scaffolds 155

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Archaea
% of genes with valid RBS motifs 0.65 %
% of genes near scaffold ends (potentially truncated) 96.13 %
% of genes from short scaffolds (< 2000 bps) 86.45 %
Associated GOLD sequencing projects 90
AlphaFold2 3D model prediction No

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Archaea (93.548 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(39.355 % of family members)
Environment Ontology (ENVO) Unclassified
(65.161 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(68.387 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 71.64%    β-sheet: 0.00%    Coil/Unstructured: 28.36%
Feature Viewer
Powered by Feature Viewer


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 155 Family Scaffolds
PF00355Rieske 58.71
PF07681DoxX 15.48
PF07690MFS_1 2.58
PF01253SUI1 1.29
PF01638HxlR 0.65
PF13187Fer4_9 0.65
PF00296Bac_luciferase 0.65
PF00113Enolase_C 0.65
PF03352Adenine_glyco 0.65
PF13365Trypsin_2 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 155 Family Scaffolds
COG2259Uncharacterized membrane protein YphA, DoxX/SURF4 familyFunction unknown [S] 15.48
COG4270Uncharacterized membrane proteinFunction unknown [S] 15.48
COG0023Translation initiation factor 1 (eIF-1/SUI1)Translation, ribosomal structure and biogenesis [J] 1.29
COG0148EnolaseCarbohydrate transport and metabolism [G] 0.65
COG1733DNA-binding transcriptional regulator, HxlR familyTranscription [K] 0.65
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 0.65
COG28183-methyladenine DNA glycosylase TagReplication, recombination and repair [L] 0.65


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
All OrganismsrootAll Organisms100.00 %
UnclassifiedrootN/A0.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300002558|JGI25385J37094_10178272All Organisms → cellular organisms → Archaea569Open in IMG/M
3300002558|JGI25385J37094_10195711All Organisms → cellular organisms → Archaea541Open in IMG/M
3300002558|JGI25385J37094_10214369All Organisms → cellular organisms → Archaea514Open in IMG/M
3300002560|JGI25383J37093_10019101All Organisms → cellular organisms → Archaea2291Open in IMG/M
3300002561|JGI25384J37096_10096645All Organisms → cellular organisms → Archaea1038Open in IMG/M
3300002562|JGI25382J37095_10120845All Organisms → cellular organisms → Archaea894Open in IMG/M
3300002908|JGI25382J43887_10141299All Organisms → cellular organisms → Archaea1233Open in IMG/M
3300002911|JGI25390J43892_10024199All Organisms → cellular organisms → Archaea1464Open in IMG/M
3300002911|JGI25390J43892_10026952All Organisms → cellular organisms → Archaea1386Open in IMG/M
3300002912|JGI25386J43895_10185271All Organisms → cellular organisms → Archaea522Open in IMG/M
3300002914|JGI25617J43924_10046456All Organisms → cellular organisms → Archaea1575Open in IMG/M
3300002916|JGI25389J43894_1024183All Organisms → cellular organisms → Archaea1052Open in IMG/M
3300005166|Ga0066674_10287499All Organisms → cellular organisms → Archaea773Open in IMG/M
3300005166|Ga0066674_10313247All Organisms → cellular organisms → Archaea737Open in IMG/M
3300005167|Ga0066672_11010385All Organisms → cellular organisms → Archaea507Open in IMG/M
3300005172|Ga0066683_10614806All Organisms → cellular organisms → Archaea656Open in IMG/M
3300005172|Ga0066683_10739477All Organisms → cellular organisms → Archaea578Open in IMG/M
3300005174|Ga0066680_10491449All Organisms → cellular organisms → Archaea772Open in IMG/M
3300005174|Ga0066680_10736994All Organisms → cellular organisms → Archaea600Open in IMG/M
3300005176|Ga0066679_10500728All Organisms → cellular organisms → Archaea792Open in IMG/M
3300005176|Ga0066679_10848001All Organisms → cellular organisms → Archaea579Open in IMG/M
3300005176|Ga0066679_10925282All Organisms → cellular organisms → Archaea547Open in IMG/M
3300005180|Ga0066685_11134778All Organisms → cellular organisms → Archaea509Open in IMG/M
3300005181|Ga0066678_10101034All Organisms → cellular organisms → Archaea1746Open in IMG/M
3300005446|Ga0066686_10786372All Organisms → cellular organisms → Archaea634Open in IMG/M
3300005447|Ga0066689_10583915All Organisms → cellular organisms → Archaea705Open in IMG/M
3300005552|Ga0066701_10000817All Organisms → cellular organisms → Archaea9859Open in IMG/M
3300005552|Ga0066701_10656658All Organisms → cellular organisms → Archaea633Open in IMG/M
3300005553|Ga0066695_10257972All Organisms → cellular organisms → Archaea1100Open in IMG/M
3300005554|Ga0066661_10664714All Organisms → cellular organisms → Archaea613Open in IMG/M
3300005557|Ga0066704_10266446All Organisms → cellular organisms → Bacteria1162Open in IMG/M
3300005557|Ga0066704_10293222All Organisms → cellular organisms → Bacteria1100Open in IMG/M
3300005558|Ga0066698_10803919All Organisms → cellular organisms → Archaea609Open in IMG/M
3300005559|Ga0066700_10333137All Organisms → cellular organisms → Bacteria1071Open in IMG/M
3300005559|Ga0066700_10871900All Organisms → cellular organisms → Archaea600Open in IMG/M
3300005559|Ga0066700_10966408All Organisms → cellular organisms → Archaea562Open in IMG/M
3300005569|Ga0066705_10619638All Organisms → cellular organisms → Archaea661Open in IMG/M
3300005569|Ga0066705_10940666All Organisms → cellular organisms → Archaea512Open in IMG/M
3300005587|Ga0066654_10274631All Organisms → cellular organisms → Archaea895Open in IMG/M
3300005598|Ga0066706_11423914All Organisms → cellular organisms → Archaea522Open in IMG/M
3300006034|Ga0066656_10587199All Organisms → cellular organisms → Archaea722Open in IMG/M
3300006034|Ga0066656_10718165All Organisms → cellular organisms → Archaea642Open in IMG/M
3300006046|Ga0066652_100171116All Organisms → cellular organisms → Archaea1842Open in IMG/M
3300006755|Ga0079222_11151335All Organisms → cellular organisms → Archaea688Open in IMG/M
3300006791|Ga0066653_10199495All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Parcubacteria1013Open in IMG/M
3300006794|Ga0066658_10088335All Organisms → cellular organisms → Archaea1435Open in IMG/M
3300006794|Ga0066658_10501429All Organisms → cellular organisms → Archaea660Open in IMG/M
3300006796|Ga0066665_10255351All Organisms → cellular organisms → Archaea1389Open in IMG/M
3300006796|Ga0066665_10990575All Organisms → cellular organisms → Archaea645Open in IMG/M
3300006797|Ga0066659_10266576All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Parcubacteria group → Candidatus Parcubacteria1291Open in IMG/M
3300006797|Ga0066659_10701922All Organisms → cellular organisms → Archaea827Open in IMG/M
3300006800|Ga0066660_11007878All Organisms → cellular organisms → Archaea668Open in IMG/M
3300007255|Ga0099791_10200336All Organisms → cellular organisms → Archaea941Open in IMG/M
3300007255|Ga0099791_10415071All Organisms → cellular organisms → Archaea649Open in IMG/M
3300007258|Ga0099793_10018677All Organisms → cellular organisms → Bacteria2816Open in IMG/M
3300007258|Ga0099793_10308894All Organisms → cellular organisms → Archaea769Open in IMG/M
3300009012|Ga0066710_104840692All Organisms → cellular organisms → Archaea504Open in IMG/M
3300009089|Ga0099828_10391694All Organisms → cellular organisms → Archaea1253Open in IMG/M
3300009137|Ga0066709_103380337All Organisms → cellular organisms → Archaea580Open in IMG/M
3300009137|Ga0066709_104137910All Organisms → cellular organisms → Archaea528Open in IMG/M
3300010122|Ga0127488_1016091All Organisms → cellular organisms → Archaea735Open in IMG/M
3300010127|Ga0127489_1069006All Organisms → cellular organisms → Archaea875Open in IMG/M
3300010304|Ga0134088_10315408All Organisms → cellular organisms → Archaea756Open in IMG/M
3300010321|Ga0134067_10085672All Organisms → cellular organisms → Archaea1061Open in IMG/M
3300010326|Ga0134065_10016638All Organisms → cellular organisms → Archaea2028Open in IMG/M
3300010329|Ga0134111_10040932All Organisms → cellular organisms → Archaea1652Open in IMG/M
3300010329|Ga0134111_10289357All Organisms → cellular organisms → Archaea680Open in IMG/M
3300010333|Ga0134080_10170716All Organisms → cellular organisms → Archaea930Open in IMG/M
3300010333|Ga0134080_10401830All Organisms → cellular organisms → Archaea634Open in IMG/M
3300010335|Ga0134063_10241402All Organisms → cellular organisms → Archaea858Open in IMG/M
3300010358|Ga0126370_12358265All Organisms → cellular organisms → Archaea528Open in IMG/M
3300010361|Ga0126378_10477941All Organisms → cellular organisms → Archaea1361Open in IMG/M
3300011269|Ga0137392_10697618All Organisms → cellular organisms → Archaea840Open in IMG/M
3300011269|Ga0137392_10773629All Organisms → cellular organisms → Archaea793Open in IMG/M
3300011271|Ga0137393_10539507All Organisms → Viruses → Predicted Viral1001Open in IMG/M
3300012096|Ga0137389_10323970All Organisms → cellular organisms → Archaea1307Open in IMG/M
3300012198|Ga0137364_10025546All Organisms → cellular organisms → Archaea3701Open in IMG/M
3300012199|Ga0137383_11171430All Organisms → cellular organisms → Archaea554Open in IMG/M
3300012200|Ga0137382_10903009All Organisms → cellular organisms → Archaea636Open in IMG/M
3300012201|Ga0137365_11215441All Organisms → cellular organisms → Archaea538Open in IMG/M
3300012203|Ga0137399_10019699All Organisms → cellular organisms → Archaea4385Open in IMG/M
3300012203|Ga0137399_10471607All Organisms → cellular organisms → Archaea1051Open in IMG/M
3300012204|Ga0137374_10029133All Organisms → cellular organisms → Archaea6068Open in IMG/M
3300012206|Ga0137380_11587001All Organisms → cellular organisms → Archaea539Open in IMG/M
3300012207|Ga0137381_10163545All Organisms → cellular organisms → Archaea1919Open in IMG/M
3300012207|Ga0137381_10313269All Organisms → cellular organisms → Archaea1367Open in IMG/M
3300012208|Ga0137376_10607367All Organisms → cellular organisms → Archaea946Open in IMG/M
3300012209|Ga0137379_10451999All Organisms → cellular organisms → Archaea1193Open in IMG/M
3300012209|Ga0137379_11017298All Organisms → cellular organisms → Archaea734Open in IMG/M
3300012209|Ga0137379_11237209All Organisms → cellular organisms → Archaea653Open in IMG/M
3300012209|Ga0137379_11415750All Organisms → cellular organisms → Archaea598Open in IMG/M
3300012349|Ga0137387_10713935All Organisms → cellular organisms → Archaea725Open in IMG/M
3300012349|Ga0137387_11121946All Organisms → cellular organisms → Archaea559Open in IMG/M
3300012350|Ga0137372_10234620All Organisms → cellular organisms → Archaea1450Open in IMG/M
3300012351|Ga0137386_10856827All Organisms → cellular organisms → Archaea653Open in IMG/M
3300012351|Ga0137386_10926106All Organisms → cellular organisms → Archaea623Open in IMG/M
3300012357|Ga0137384_11000460All Organisms → cellular organisms → Archaea672Open in IMG/M
3300012357|Ga0137384_11475717All Organisms → cellular organisms → Archaea529Open in IMG/M
3300012359|Ga0137385_11144321All Organisms → cellular organisms → Archaea639Open in IMG/M
3300012359|Ga0137385_11340901All Organisms → cellular organisms → Archaea578Open in IMG/M
3300012361|Ga0137360_11792536All Organisms → cellular organisms → Archaea519Open in IMG/M
3300012362|Ga0137361_10152013All Organisms → cellular organisms → Bacteria2069Open in IMG/M
3300012362|Ga0137361_10561263All Organisms → cellular organisms → Archaea1047Open in IMG/M
3300012363|Ga0137390_10934274All Organisms → cellular organisms → Archaea821Open in IMG/M
3300012363|Ga0137390_11575880All Organisms → cellular organisms → Archaea595Open in IMG/M
3300012363|Ga0137390_11857490All Organisms → cellular organisms → Archaea532Open in IMG/M
3300012918|Ga0137396_10853266All Organisms → cellular organisms → Archaea669Open in IMG/M
3300012925|Ga0137419_11257961All Organisms → cellular organisms → Archaea621Open in IMG/M
3300012927|Ga0137416_10924950All Organisms → cellular organisms → Archaea776Open in IMG/M
3300012927|Ga0137416_11460156All Organisms → cellular organisms → Archaea620Open in IMG/M
3300012971|Ga0126369_12764201All Organisms → cellular organisms → Archaea574Open in IMG/M
3300012975|Ga0134110_10182703All Organisms → cellular organisms → Archaea875Open in IMG/M
3300015358|Ga0134089_10295854All Organisms → cellular organisms → Archaea671Open in IMG/M
3300017657|Ga0134074_1024235All Organisms → cellular organisms → Archaea2013Open in IMG/M
3300018006|Ga0187804_10375558All Organisms → cellular organisms → Archaea627Open in IMG/M
3300018431|Ga0066655_10067880All Organisms → cellular organisms → Archaea1900Open in IMG/M
3300018431|Ga0066655_10231658All Organisms → cellular organisms → Archaea1166Open in IMG/M
3300018468|Ga0066662_11883637All Organisms → cellular organisms → Archaea626Open in IMG/M
3300018468|Ga0066662_11924907All Organisms → cellular organisms → Archaea619Open in IMG/M
3300025922|Ga0207646_10270896All Organisms → cellular organisms → Archaea1535Open in IMG/M
3300026297|Ga0209237_1020951All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota3726Open in IMG/M
3300026297|Ga0209237_1048406All Organisms → cellular organisms → Archaea2156Open in IMG/M
3300026298|Ga0209236_1034213All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2682Open in IMG/M
3300026298|Ga0209236_1120444All Organisms → cellular organisms → Archaea1158Open in IMG/M
3300026301|Ga0209238_1132030All Organisms → cellular organisms → Archaea805Open in IMG/M
3300026309|Ga0209055_1138415All Organisms → cellular organisms → Archaea874Open in IMG/M
3300026313|Ga0209761_1035819All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota2924Open in IMG/M
3300026313|Ga0209761_1112877All Organisms → cellular organisms → Archaea1334Open in IMG/M
3300026317|Ga0209154_1064133All Organisms → cellular organisms → Archaea1610Open in IMG/M
3300026318|Ga0209471_1196155All Organisms → cellular organisms → Archaea773Open in IMG/M
3300026324|Ga0209470_1085359All Organisms → cellular organisms → Archaea1439Open in IMG/M
3300026325|Ga0209152_10000880All Organisms → cellular organisms → Archaea12618Open in IMG/M
3300026328|Ga0209802_1015356All Organisms → cellular organisms → Archaea4352Open in IMG/M
3300026328|Ga0209802_1072226All Organisms → cellular organisms → Archaea1627Open in IMG/M
3300026328|Ga0209802_1130842All Organisms → cellular organisms → Archaea1089Open in IMG/M
3300026331|Ga0209267_1027608All Organisms → cellular organisms → Archaea2756Open in IMG/M
3300026331|Ga0209267_1300118All Organisms → cellular organisms → Archaea534Open in IMG/M
3300026335|Ga0209804_1009369All Organisms → cellular organisms → Archaea5335Open in IMG/M
3300026514|Ga0257168_1131468All Organisms → cellular organisms → Archaea558Open in IMG/M
3300026528|Ga0209378_1002864All Organisms → cellular organisms → Archaea11772Open in IMG/M
3300026532|Ga0209160_1124920All Organisms → cellular organisms → Archaea1254Open in IMG/M
3300026538|Ga0209056_10245997All Organisms → cellular organisms → Archaea1274Open in IMG/M
3300026538|Ga0209056_10492831All Organisms → cellular organisms → Archaea648Open in IMG/M
3300026547|Ga0209156_10274579All Organisms → cellular organisms → Archaea770Open in IMG/M
3300026548|Ga0209161_10112909All Organisms → cellular organisms → Archaea1621Open in IMG/M
3300026548|Ga0209161_10322586All Organisms → cellular organisms → Archaea716Open in IMG/M
3300026548|Ga0209161_10341728All Organisms → cellular organisms → Archaea677Open in IMG/M
3300026550|Ga0209474_10471494All Organisms → cellular organisms → Archaea636Open in IMG/M
3300026551|Ga0209648_10062239All Organisms → cellular organisms → Archaea3162Open in IMG/M
3300026552|Ga0209577_10614529All Organisms → cellular organisms → Archaea644Open in IMG/M
3300026552|Ga0209577_10736368All Organisms → cellular organisms → Archaea554Open in IMG/M
3300027643|Ga0209076_1005624All Organisms → cellular organisms → Bacteria2994Open in IMG/M
3300028536|Ga0137415_10252418All Organisms → cellular organisms → Archaea1572Open in IMG/M
3300031720|Ga0307469_10037561All Organisms → cellular organisms → Bacteria2900Open in IMG/M
3300032205|Ga0307472_102048339All Organisms → cellular organisms → Archaea574Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil39.35%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil29.03%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil17.42%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil8.39%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.94%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil1.29%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment0.65%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil0.65%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.65%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.65%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cmEnvironmentalOpen in IMG/M
3300002561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cmEnvironmentalOpen in IMG/M
3300002562Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002912Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005174Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129EnvironmentalOpen in IMG/M
3300005176Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005447Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138EnvironmentalOpen in IMG/M
3300005552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150EnvironmentalOpen in IMG/M
3300005553Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005557Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005559Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300005587Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103EnvironmentalOpen in IMG/M
3300005598Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006755Agricultural soil microbial communities from Georgia to study Nitrogen management - GA PlitterEnvironmentalOpen in IMG/M
3300006791Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102EnvironmentalOpen in IMG/M
3300006794Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010122Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_4_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010127Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Met_40_2_8_2 metaT (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300010304Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015EnvironmentalOpen in IMG/M
3300010321Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015EnvironmentalOpen in IMG/M
3300010326Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010329Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015EnvironmentalOpen in IMG/M
3300010333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaGEnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300010358Tropical forest soil microbial communities from Panama - MetaG Plot_3EnvironmentalOpen in IMG/M
3300010361Tropical forest soil microbial communities from Panama - MetaG Plot_23EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012201Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012204Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012206Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaGEnvironmentalOpen in IMG/M
3300012207Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012209Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012350Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012357Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaGEnvironmentalOpen in IMG/M
3300012359Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012971Tropical forest soil microbial communities from Panama - MetaG Plot_1EnvironmentalOpen in IMG/M
3300012975Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015EnvironmentalOpen in IMG/M
3300015358Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300017657Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015EnvironmentalOpen in IMG/M
3300018006Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026297Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026309Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026317Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026324Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)EnvironmentalOpen in IMG/M
3300026325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026532Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI25385J37094_1017827223300002558Grasslands SoilTLVIDITGAVREHRPIPREVLSLHIEKLKTAQSDLEVLLGVYNSVLDVETTRLTSDIILQIEHLQEDFEYLAELHPRPPTLSHASHIEDLILRAVRITKEELVALGTDNQQIRALEDWLIQYTKNRRPQERRQQPIEVKGGHQIA*
JGI25385J37094_1019571123300002558Grasslands SoilDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLLKKEEPIAVRGGHTI*
JGI25385J37094_1021436913300002558Grasslands SoilLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
JGI25383J37093_1001910143300002560Grasslands SoilTLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
JGI25384J37096_1009664513300002561Grasslands SoilLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI*
JGI25382J37095_1012084533300002562Grasslands SoilLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI*
JGI25382J43887_1014129933300002908Grasslands SoilVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
JGI25390J43892_1002419913300002911Grasslands SoilEQVPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
JGI25390J43892_1002695243300002911Grasslands SoilVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
JGI25386J43895_1018527113300002912Grasslands SoilVLSLHIEKLKTAQSDLEVLLGVYNSVLDVETTRLTSDIILQIEHLQEDFEYLAELHPRPPTLSHASHIEDLILRAVRITKEELVALGTDNQQIRALEDWLIQYTKNRRPQERRQQPIEVKGGHQIA*
JGI25617J43924_1004645643300002914Grasslands SoilIPKDVLMLHIEKLKTAQSDLETLLGVYXNVLDVEIARXTSDXVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLAKKEQPISVRGGHTI*
JGI25389J43894_102418333300002916Grasslands SoilKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066674_1028749913300005166SoilVGFLTTFLITLVIDVTSAVREQAPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0066674_1031324713300005166SoilERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066672_1101038513300005167SoilEKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALENWLTQYTRERRPLPKKEPIAVRGGHTI*
Ga0066683_1061480613300005172SoilEKLKTAQSDLEVLLGVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRVRRPVQRREQPIEVTGGHTIS
Ga0066683_1073947723300005172SoilKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066680_1049144923300005174SoilEKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066680_1073699413300005174SoilKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066679_1050072813300005176SoilDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066679_1084800113300005176SoilKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPSESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066679_1092528223300005176SoilLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKSELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066685_1113477813300005180SoilVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLRKKEEPIAVRGGHTI*
Ga0066678_1010103413300005181SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066686_1078637213300005446SoilKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNSQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066689_1058391513300005447SoilAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066701_1000081713300005552SoilRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNTQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0066701_1065665813300005552SoilKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLLKKEEPIAVRGGHTI*
Ga0066695_1025797233300005553SoilIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066661_1066471423300005554SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPSESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066704_1026644633300005557SoilGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066704_1029322233300005557SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066698_1080391913300005558SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKEELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0066700_1033313733300005559SoilVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKEELIALGTDNAQIHALEDWLTQYTRERRSMPTKKEPIAVRGGHTI*
Ga0066700_1087190013300005559SoilKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066700_1096640823300005559SoilSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNTQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0066705_1061963823300005569SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALENWLTQYTRERRPLPKKEPIAVRGGHTI*
Ga0066705_1094066613300005569SoilASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066654_1027463113300005587SoilAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKSELIALGTDNVQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066706_1142391413300005598SoilLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066656_1058719913300006034SoilKSAQSDLETLLGVYSNVLDVETARLTSDVVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKEELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0066656_1071816523300006034SoilQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066652_10017111643300006046SoilERLAKERLRSKLGVGFLTTFLITLVIDVTSAVREQAPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0079222_1115133523300006755Agricultural SoilFLTTFLITLVIDVTSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTTDVVLQIEHLQEDFEYLAETQPRPPTESHASHIEEVLLKAVHLTKGELIALGTDNAQIRALEDWLTQYTQEKRPLPRKEQPIAVRGGHTIR*
Ga0066653_1019949513300006791SoilHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066658_1008833543300006794SoilAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066658_1050142923300006794SoilIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066665_1025535113300006796SoilKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066665_1099057513300006796SoilEKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI*
Ga0066659_1026657613300006797SoilLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKGEPIAVRGGHTI*
Ga0066659_1070192213300006797SoilKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0066660_1100787823300006800SoilIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLLKKEEPIAVRGGHTI*
Ga0099791_1020033613300007255Vadose Zone SoilSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHAWHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLTKKEQPIAVRGGHTI*
Ga0099791_1041507123300007255Vadose Zone SoilAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPRDVLLLHIEKLKTAQSDLETLLGIYSNVLDVEIARLTSDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERKPLPTKEQPIAVRGGHTI*
Ga0099793_1001867713300007258Vadose Zone SoilKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPRDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0099793_1030889413300007258Vadose Zone SoilERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLAKKEQPISVRGGHTI*
Ga0066710_10484069223300009012Grasslands SoilIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLLKKEEPIAVRGGHTI
Ga0099828_1039169413300009089Vadose Zone SoilTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLPKKEQPISVRGGHTI*
Ga0066709_10338033723300009137Grasslands SoilLHIEKLKTAQSDLETLLGVYSSVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDSAQIRALEDWLTQYTRERRPLPKKEEPISVRGGHTI*
Ga0066709_10413791023300009137Grasslands SoilPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0127488_101609123300010122Grasslands SoilGVGFLTTFLITLVIDVTSAVREQAPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0127489_106900613300010127Grasslands SoilGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0134088_1031540813300010304Grasslands SoilLITLVIDITSASREKSPIPKDVLMLHIEKLKSAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKEELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0134067_1008567233300010321Grasslands SoilEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0134065_1001663843300010326Grasslands SoilVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0134111_1004093243300010329Grasslands SoilLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0134111_1028935713300010329Grasslands SoilLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0134080_1017071623300010333Grasslands SoilLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0134080_1040183013300010333Grasslands SoilLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0134063_1024140213300010335Grasslands SoilVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0126370_1235826513300010358Tropical Forest SoilHIEKLKTAQSDLEVLLGIYNSVLDVETARLTSDVILQIEHLQEDFEYLAEMPTRPPTQSHASHIEQLLLRTVQMTKEELVALGTDNQQIRALEEWLIQFTRDKRPAQKLEQPIEVKGGHTIR*
Ga0126378_1047794113300010361Tropical Forest SoilGVGFLTTFLITLVIDITTAVREQAPIPKEVLTLHIEKLKTAQSDLEVLLGIYNSVLDVEIARLTSDVILQIEHLQEDFEYLAEMPMRPPTQSHASHIEQLLLRTVQMTKEELVALGTDNQQIRALEEWLIQFTRDKKPAQKLEQPIEVKGGHTIR*
Ga0137392_1069761823300011269Vadose Zone SoilPVEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHAWHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLKKKEQPIAVRGGHTI*
Ga0137392_1077362913300011269Vadose Zone SoilKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKEELIALGTDNAQIRALEDWLTQYTREMRPLAKKEQPISVRGGHTI*
Ga0137393_1053950713300011271Vadose Zone SoilHIEKLKTAQSDLEVLLGVYNSVLGIEITRLTSDIILQIEHLQEDFEYLAELHPRPPTLTHASHIEDLLLRAVSITKEELVALGTDNQQIRALEDWLTQFTKNRRSRDRREEPIEVKGGHQIT*
Ga0137389_1032397013300012096Vadose Zone SoilSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLPKKEQPIPVRGGHTI*
Ga0137364_1002554673300012198Vadose Zone SoilKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137383_1117143013300012199Vadose Zone SoilAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLRKKEEPIAVRGGHTI*
Ga0137382_1090300923300012200Vadose Zone SoilLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTEAELIALGTDNAQIRALEDWLTQHTRERRPLPKKEEPIAVRGGHTI*
Ga0137365_1121544113300012201Vadose Zone SoilLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137399_1001969983300012203Vadose Zone SoilGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLRAVHLTKEELVALGTDNAQIRALEDWLTQYTRERRPLQKKEQPIAVRGGHTI*
Ga0137399_1047160723300012203Vadose Zone SoilLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0137374_10029133103300012204Vadose Zone SoilGIYNNVLDVEIERLTSDVILQIEHLQEDFEYLAETQPRPPSESHAAHIEQLLLRTVHLTEEELIALGTDNQQIRALEEWLTQYTKERGPATKREQPIEVRGGHTIS*
Ga0137380_1158700123300012206Vadose Zone SoilEVLLGVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0137381_1016354513300012207Vadose Zone SoilTAQSDLEVLLGVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALENWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0137381_1031326943300012207Vadose Zone SoilTAQSDLEVLLGVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0137376_1060736713300012208Vadose Zone SoilPIEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137379_1045199923300012209Vadose Zone SoilGPIEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNTQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0137379_1101729823300012209Vadose Zone SoilGVGFLTTFLITLVIDITSAVREQAPIPREVLLLHIEKLKTAQSDLEVLLGVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0137379_1123720923300012209Vadose Zone SoilREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS*
Ga0137379_1141575013300012209Vadose Zone SoilIEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKNPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAEIHALEDWLTQYTRERRPLLKKEEPIAVRGGHTI*
Ga0137387_1071393523300012349Vadose Zone SoilARLAKERLRSKLGVGFLTTFLITLVIDITGAVREHRPIPREVLSLHIEKLKTAQSDLEVLLGVYNSVLDVETTRLTSDIIVQIEHLQEDFEYLAELHPRPPTLSHASHIEDLILRAVRITKEELVALGTDNQQIRALEDWLIQYTKNRRPQERRQEPIEVKGGHQIA*
Ga0137387_1112194623300012349Vadose Zone SoilVYSNVLDVEIARMTNDVIRQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPMQKKEEPIAVRGGHTI*
Ga0137372_1023462033300012350Vadose Zone SoilVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137386_1085682723300012351Vadose Zone SoilASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLRKKEEPIAVRGGHTI*
Ga0137386_1092610623300012351Vadose Zone SoilDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI*
Ga0137384_1100046013300012357Vadose Zone SoilLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLRKKEEPIAVRGGHTI*
Ga0137384_1147571713300012357Vadose Zone SoilLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137385_1114432123300012359Vadose Zone SoilSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI*
Ga0137385_1134090113300012359Vadose Zone SoilDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI*
Ga0137360_1179253613300012361Vadose Zone SoilVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKEELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0137361_1015201313300012362Vadose Zone SoilPVEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKQELIALGTDNAQIRALEDWLTQYTREMRPLPKKEQPIAVRGGHTI*
Ga0137361_1056126313300012362Vadose Zone SoilIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNTQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI*
Ga0137390_1093427413300012363Vadose Zone SoilQSDLETLLGVYSNVLDVEIARMTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLPKKEQPIPVRGGHTI*
Ga0137390_1157588023300012363Vadose Zone SoilRSKLGVGFLTTFLITLVIDITSASREKSPIPRDVLLLHIEKLKTAQSDLETLLGIYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKADLIALGTDNVQIRALEDWLTQYTRERRPLSKKEQPIAVRGGHTI*
Ga0137390_1185749023300012363Vadose Zone SoilSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLAKKEQPISVRGGHTI*
Ga0137396_1085326613300012918Vadose Zone SoilASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLRAVHLTKEELVALGTDNAQIRALEDWLTQYTRERRPLQKKEQPIAVRGGHTI*
Ga0137419_1125796123300012925Vadose Zone SoilDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLSKKEQPIPVRGGHTI*
Ga0137416_1092495013300012927Vadose Zone SoilERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLSKKEQPIPVRGGHTI*
Ga0137416_1146015613300012927Vadose Zone SoilKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRETRSLPKKEQPISVRGGHTI*
Ga0126369_1276420113300012971Tropical Forest SoilIDITSASREGSPIPKDVLLLHVEKLKTAQSDLETLLGIYSNVLDVEVGRLTSDVILQIEHLQEDFEYLAETQPRPPTPSHAAHIEWLLLKTVHLTKEELISLGTENHQIRALEDWLTQYTKDKRPYARGEERVVVSGGHALS*
Ga0134110_1018270323300012975Grasslands SoilLLLHIEKQKTLQSDLELLLEVYNNVLDVEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS*
Ga0134089_1029585423300015358Grasslands SoilLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKEELIALGTDNAQIHALEDWLTQYTRERRSMPTKKEPIAVRGGHTI*
Ga0134074_102423513300017657Grasslands SoilLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS
Ga0187804_1037555813300018006Freshwater SedimentFLITLVIDITSASREKSPIPKDVLLMHIEKLKTAQTDLETLLGVYSNVLDVDVARLTSDVILQIEHLQEDFEYLAEIQPRPPTQSHATHIEQLLLRTVHLTKEELVALGTDNQQIDALEDWLTDYTRQRKTPPAREQPIEVRGGHTIS
Ga0066655_1006788013300018431Grasslands SoilLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIAHMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0066655_1023165833300018431Grasslands SoilLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0066662_1188363713300018468Grasslands SoilFLTTFLITLVIDVTSAVREQVPIPREVLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS
Ga0066662_1192490713300018468Grasslands SoilIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0207646_1027089613300025922Corn, Switchgrass And Miscanthus RhizosphereSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIALLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLAKKEQPISVRGGHTI
Ga0209237_102095113300026297Grasslands SoilRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209237_104840623300026297Grasslands SoilLLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS
Ga0209236_103421363300026298Grasslands SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209236_112044413300026298Grasslands SoilDLEVLLGVYNSVLDVETTRLTSDIILQIEHLQEDFEYLAELHPRPPTLSHASHIEDLILRAVRITKEELVALGTDNQQIRALEDWLIQYTKNRRPQERRQQPIEVKGGHQIA
Ga0209238_113203013300026301Grasslands SoilDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIPQRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTVYTKERRPVQKREQPIEVRGGHTIS
Ga0209055_113841523300026309SoilSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209761_103581913300026313Grasslands SoilLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209761_111287733300026313Grasslands SoilIEKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209154_106413313300026317SoilASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209471_119615523300026318SoilSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209470_108535913300026324SoilITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209152_1000088013300026325SoilTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209802_101535673300026328SoilLIALVIDITSASREKSPIPKDVLLLHIEKLKTAQSELETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209802_107222613300026328SoilEKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209802_113084223300026328SoilTSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNAQIRALEDWLTQYTREMRPLPKKEQPIAVRGGHTI
Ga0209267_102760813300026331SoilTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHMTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209267_130011823300026331SoilGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI
Ga0209804_100936913300026335SoilKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0257168_113146823300026514SoilLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQILLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLTKKEQPIAVRGGHTI
Ga0209378_1002864183300026528SoilKLAKERLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209160_112492013300026532SoilVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209056_1024599713300026538SoilLETLLGVYSNVLDVEIARMTNDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDDAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209056_1049283123300026538SoilVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI
Ga0209156_1027457913300026547SoilLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIHALEDWLTQYTRERRPLRKKEEPIAVRGGHTI
Ga0209161_1011290943300026548SoilLLHIEKLKTAQSDLEVLLGIYNNVLDIEIELLTSDVILQIEHLQEDFEYLAEIQPRPPTESHASHIEQLLLRTVHLTKEELIALGTDNQQIRALEDWLTEYTRERRPVQRREQPIEVTGGHTIS
Ga0209161_1032258623300026548SoilVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALENWLTQYTRERRPLPKKEPIAVRGGHTI
Ga0209161_1034172823300026548SoilVIDITSASREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLLKKEEPIAVRGGHTI
Ga0209474_1047149423300026550SoilQSDLETLLGVYSNVLDVETARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209648_1006223913300026551Grasslands SoilSREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDIVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLAKKEQPISVRGGHTI
Ga0209577_1061452913300026552SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPSESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLPKKEEPIAVRGGHTI
Ga0209577_1073636813300026552SoilREKSPIPKDVLLLHIEKLKTAQSDLETLLGVYSNVLDVEIARMTNDVILQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKAELIALGTDNAQIRALEDWLTQYTRERRPLQKKEEPIAVRGGHTI
Ga0209076_100562413300027643Vadose Zone SoilTTFLITLVIDITSASREKSPIPRDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKQELIALGTDNAQIRALEDWLTQYTRETRPLPKKEQPIAVRGGHTI
Ga0137415_1025241813300028536Vadose Zone SoilRLRSKLGVGFLTTFLITLVIDITSASREKSPIPKDVLMLHIEKLKTAQSDLETLLGVYSNVLDVEIARLTSDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKAVHLTKEELIALGTDNAQIRALEDWLTQYTRERRPLSKKEQPIPVRGGHTI
Ga0307469_1003756113300031720Hardwood Forest SoilLLGVYSNVLDVEIARMTNDIVLQIEHLQEDFEYLAETQPRPPTQSHASHMEQVLLKTVHLTKEELIALGTDNAQIHALEEWLTQYTRETRPLAKREQPIAVRGGHTI
Ga0307472_10204833913300032205Hardwood Forest SoilLLGVYSNVLGVDIARMTTDVVLQIEHLQEDFEYLAETQPRPPTESHASHMEQVLLKTVHLTKGELIALGTDDAQIRALEDWLTQYTRERRPSPKETPIAVRGGHTI


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.