NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026009

3300026009: Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_401 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026009
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0111376 | Gp0116050 | Ga0208530
Sample Name: Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_10C_80N_401 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: Y
Use Policy: Open

Dataset Contents
Total Genome Size: 47811615
Sequencing Scaffolds: 41
Novel Protein Genes: 42
Associated Families: 42

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 14
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1
All Organisms → cellular organisms → Bacteria | 6
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 2
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 7
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → environmental samples → uncultured Thermoleophilia bacterium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia | 1
All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Hafniaceae → Hafnia → Hafnia alvei | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil → Natural And Restored Wetland Microbial Communities From The San Francisco Bay, California, Usa, That Impact Long-Term Carbon Sequestration

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, paddy field, paddy field soil
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: USA: Twitchell Island, California
Coordinates: Lat. (°) 38.1087 | Long. (°) -121.653 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001212 | Metagenome / Metatranscriptome | 746 | Y
F001248 | Metagenome / Metatranscriptome | 737 | Y
F001931 | Metagenome / Metatranscriptome | 615 | Y
F004686 | Metagenome / Metatranscriptome | 428 | Y
F007491 | Metagenome / Metatranscriptome | 350 | Y
F007880 | Metagenome / Metatranscriptome | 343 | N
F009820 | Metagenome | 312 | Y
F009994 | Metagenome / Metatranscriptome | 310 | Y
F010008 | Metagenome / Metatranscriptome | 310 | Y
F011243 | Metagenome / Metatranscriptome | 293 | Y
F011711 | Metagenome / Metatranscriptome | 288 | Y
F012266 | Metagenome / Metatranscriptome | 282 | Y
F013130 | Metagenome / Metatranscriptome | 274 | Y
F024508 | Metagenome / Metatranscriptome | 205 | Y
F026365 | Metagenome / Metatranscriptome | 198 | Y
F028203 | Metagenome / Metatranscriptome | 192 | Y
F032743 | Metagenome / Metatranscriptome | 179 | Y
F033082 | Metagenome / Metatranscriptome | 178 | Y
F034308 | Metagenome / Metatranscriptome | 175 | Y
F035429 | Metagenome / Metatranscriptome | 172 | Y
F038329 | Metagenome / Metatranscriptome | 166 | Y
F045169 | Metagenome / Metatranscriptome | 153 | N
F047144 | Metagenome / Metatranscriptome | 150 | Y
F047804 | Metagenome / Metatranscriptome | 149 | Y
F053350 | Metagenome / Metatranscriptome | 141 | Y
F064065 | Metagenome | 129 | Y
F074535 | Metagenome | 119 | Y
F076547 | Metagenome / Metatranscriptome | 118 | Y
F079119 | Metagenome / Metatranscriptome | 116 | Y
F080009 | Metagenome / Metatranscriptome | 115 | N
F081174 | Metagenome / Metatranscriptome | 114 | Y
F082662 | Metagenome / Metatranscriptome | 113 | Y
F083445 | Metagenome / Metatranscriptome | 113 | Y
F085485 | Metagenome / Metatranscriptome | 111 | Y
F087354 | Metagenome / Metatranscriptome | 110 | N
F091808 | Metagenome / Metatranscriptome | 107 | Y
F092455 | Metagenome / Metatranscriptome | 107 | Y
F097106 | Metagenome | 104 | Y
F097920 | Metagenome / Metatranscriptome | 104 | Y
F101110 | Metagenome / Metatranscriptome | 102 | Y
F101118 | Metagenome / Metatranscriptome | 102 | N
F105144 | Metagenome / Metatranscriptome | 100 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0208530_1001544 | Not Available | 1462
Ga0208530_1001633 | Not Available | 1433
Ga0208530_1001963 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium | 1329
Ga0208530_1002728 | All Organisms → cellular organisms → Bacteria | 1173
Ga0208530_1002895 | Not Available | 1146
Ga0208530_1003036 | All Organisms → cellular organisms → Bacteria | 1127
Ga0208530_1003752 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1040
Ga0208530_1004000 | Not Available | 1010
Ga0208530_1004379 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 976
Ga0208530_1004503 | All Organisms → cellular organisms → Bacteria | 965
Ga0208530_1004747 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 945
Ga0208530_1005259 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 910
Ga0208530_1005349 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 904
Ga0208530_1005693 | Not Available | 881
Ga0208530_1005720 | All Organisms → cellular organisms → Bacteria | 879
Ga0208530_1006289 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 847
Ga0208530_1006607 | All Organisms → cellular organisms → Bacteria | 831
Ga0208530_1007542 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 789
Ga0208530_1008669 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 746
Ga0208530_1008972 | Not Available | 737
Ga0208530_1009664 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia → environmental samples → uncultured Thermoleophilia bacterium | 715
Ga0208530_1010741 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 685
Ga0208530_1010743 | Not Available | 685
Ga0208530_1011031 | Not Available | 677
Ga0208530_1011488 | Not Available | 666
Ga0208530_1013463 | Not Available | 624
Ga0208530_1014012 | All Organisms → cellular organisms → Bacteria | 614
Ga0208530_1014360 | Not Available | 608
Ga0208530_1016145 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 580
Ga0208530_1017219 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Thermoleophilia | 565
Ga0208530_1017296 | Not Available | 564
Ga0208530_1018150 | Not Available | 554
Ga0208530_1019873 | All Organisms → cellular organisms → Bacteria → environmental samples → uncultured bacterium | 533
Ga0208530_1019954 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 532
Ga0208530_1019971 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Enterobacterales → Hafniaceae → Hafnia → Hafnia alvei | 532
Ga0208530_1021288 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 518
Ga0208530_1021429 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 517
Ga0208530_1021539 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 516
Ga0208530_1021676 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 514
Ga0208530_1021693 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 514
Ga0208530_1022583 | Not Available | 506

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0208530_1001544 | Ga0208530_10015441 | F033082 | MPSSPASSSSVAARCLGCGREFETEQVVLLGQTFAAERYCGMCRATEQVNAEEQKVNARWRRVMVPSAYADCSFANF
Ga0208530_1001633 | Ga0208530_10016333 | F012266 | LSFCFEVGALSGSLVSPAPEGPCDDMCRLNALSSESDGDAADFLD
Ga0208530_1001963 | Ga0208530_10019631 | F076547 | LIRPALEPGEWKERRSGPISLDVGDTFDYVVIEGPSGETVTVSGADEIFALISLANDALPLDDPRKITHGWLARVEAIRDMLLEMAAEGEESLERDRRELAKGWDQMVQVLSAILPPDEPHDERDDYER
Ga0208530_1002728 | Ga0208530_10027283 | F001212 | MWEECIRCNEAPAVDEQGYCGHCHWAVRAEIEEGFYALREYLRAWARYADWCDARGLSTV
Ga0208530_1002895 | Ga0208530_10028953 | F045169 | VTEWGHNGDLSAPAVVAAPALQLTEADHIDEIPRLTRNQRAVMDWLNEGRDVSMRDLAKQLPRAPVTVKSLFHRGLIGYGQSWEEANGGPDDDRTVRLTPRGRSVMELLQARDHESA
Ga0208530_1003036 | Ga0208530_10030361 | F083445 | SAVHELAHLRARLVDLQSELLNRMHERRQIIQRVAEQDLERQQEGGRSSRTAAMQRARASEEVRSFDQNVAALREKIRRLEIEIERDRLSIEVAIAGATDRTSGLVAQRGAA
Ga0208530_1003752 | Ga0208530_10037521 | F009994 | MKRVALLCMGLAFGAANVAEAQLTMQMSNGWAFTFSGNVNAFMIYQTSKTAGG
Ga0208530_1004000 | Ga0208530_10040002 | F092455 | RPRRGACVNTFHDAFLDARRRIESGADAEQIVPALLKLAEAADEIELAQGLYEDEDKDEEPG
Ga0208530_1004379 | Ga0208530_10043793 | F032743 | RMVDGERVRVEVGFEGGQVISGFVDPSSADQLERALHEDGPRVVVLQSEDGPYHVVVPRVAYFKRVVRAGRVGFGSG
Ga0208530_1004503 | Ga0208530_10045031 | F011711 | MGDKRQKNQLQMVLAFAVEGRSEAPKARREGTESSTAKRECESPAIPD
Ga0208530_1004747 | Ga0208530_10047471 | F091808 | VRVKLGNPAHVGRLLAYLAFDPTVVAHRIADDEIEVSFLGSFNADAQQMQTELRLRAWIAANRE
Ga0208530_1005259 | Ga0208530_10052593 | F097920 | LVVPMHYRTGAVNFLEPPDVFLEALGARVERLETSETEVEALLGTAAEPVVALLAPPV
Ga0208530_1005349 | Ga0208530_10053492 | F009820 | MRLRLALPAILALLVAGPARAQVQVRVGDPSRATYSELHEALREGSPAADSVRAVLNIRKPARLWDYLGAAIDGTGDWNSGLIALTHLAELRSAAYADSAARLKQRLAAVEGVPFPQNPGLKAEDVEPSLQAILLERRRAVAGDSAVLADILARIPARDYDHGDAWVLGRLGAGAADSVAGRFLSAKDQEFKVRYLTLLSYFTDPTLIPLVSRIYAAPDSFGVPPRMAIRASDALLWIGTRESLQALLDARALAR
Ga0208530_1005693 | Ga0208530_10056932 | F101110 | MTEQRRSAASRLLPFLKVERELADERMQRAVAAVCARHPRLAEVEVGWFGHRLQGDAGGRYHALGFLSDPDRGIEDGRGFVVDVVAGHVLRELPLERRRDLPADITALA
Ga0208530_1005720 | Ga0208530_10057202 | F024508 | MYTRIQSRAFVALSLGLVPIGSSLNGQSVARADEPSQDNKIQLSYHVKAITRDSAGQYTMSGTVNGEVQGRA
Ga0208530_1006289 | Ga0208530_10062892 | F038329 | MTRLEPPYETNGSGIPVSGASPSTAARLIAAWPQTSAVIPAARRLPNGSLQEMASRRP
Ga0208530_1006607 | Ga0208530_10066071 | F053350 | VLGLLDLRERGERLEPSRFEIDPTVTDTVRGILADVRDGGDRVLAELALRFDGADLSASGLVV
Ga0208530_1007542 | Ga0208530_10075421 | F085485 | GSLDGGVGAVWSARQAWYVSGGLRHTKALPEIYNPAMAQQWAELARIARGRYHRALHFAGVMTQGTSSCNCGLRPPAAHRVLVRALDAQGLGHIPLPVGGTNIIG
Ga0208530_1008669 | Ga0208530_10086692 | F047144 | AAGVANVPRSLDPADPLANLTGNPQTGWKRRALETLVERATAALA
Ga0208530_1008972 | Ga0208530_10089722 | F035429 | MDAGTERDRDASPFDEQGRVDVPGAPGWRYLPISPSGNDSLAVTVAGPEDVRLSFTVPGFLSRGDELGEVTRVVIRAWERMDRASGLGA
Ga0208530_1009664 | Ga0208530_10096641 | F101118 | MQGRDDAYAIRIGLLAVVAFMSWPVWLAIVSGQHTFTTAVGLSALVPLALTLMVPAFAFHAGGHMHRALARLEHRLHIDSFIDHHLHRH
Ga0208530_1010741 | Ga0208530_10107411 | F080009 | MATRKQQRRKYQRAVAHARHLDADIADERDEGKTKPERKQQARPVRGAPVPPSALRTAKRAVLFATLFYLVITFTSSAISPGIKLANTAVMFVLFWSMGWMVENFVWRRYQKKHG
Ga0208530_1010743 | Ga0208530_10107432 | F026365 | VRRLLASLSAVALTLALPVAAYAADSADTGKPTGHDWTLDSIFVIALSIPAILVILTLIDIARGKHTERHDH
Ga0208530_1011031 | Ga0208530_10110311 | F028203 | MTDTTTAKLETGKARVQDMGERTQDYLDRAAQSASSGMDRMTDSARRSVDTAAESAKAGLDWATEKASVLRDRNTALMNTVTDTVTQRPLVAIGVAAAVGYLLGRLMHSSD
Ga0208530_1011488 | Ga0208530_10114881 | F013130 | MASKSATEDKRRRTTMVPVTTMEEIPLLSDQERADLLKSLKEAEARAIAGEGVDYDSEAFKARLLEIYRGRKP
Ga0208530_1013463 | Ga0208530_10134632 | F001248 | RIRIGVPDDSYVDRRDRDTVDVELWDEGRGEHLAAVSTVLEPDQDGEARELLREVVAGLESGELEPTAAALEPLADRVR
Ga0208530_1014012 | Ga0208530_10140121 | F079119 | RVMRLLDRLGFLYNGDGIGGEPRRATATGKPLRHWTIPVTLSGPRTIPFLEYHGARGTPEAEVLRLLDEHLESRELVVLYGHPCYEGVHEALLRKVFAAVIERGFRFVTLQAVAEQLQATAAAR
Ga0208530_1014360 | Ga0208530_10143602 | F034308 | MRLSFRQRILLILIGLGAVPTAVAVLGWGFTVRSTTPAAGARRAMEAVGSSGRTLLQTLDSTRLRPQERRALADHAAKLNAAIGSFQRAE
Ga0208530_1016145 | Ga0208530_10161452 | F087354 | MTDQTQIRRDPLVEHAVPLVAKAMYLADDANDLWEYFRDRVDAERHRIVEDAIREAIEPLEERFAIDDIARVASAMAAEHRPHGD
Ga0208530_1017219 | Ga0208530_10172191 | F081174 | YILSVVPSELPRADSLRFVPQVALRMYDAFWRTFANSSRLEKKLEDGRYHGINVTVLEPSPETGMFDPLAMLNAHPGKSKKLLWEGYRDTERALHGRPAEGTRRPR
Ga0208530_1017296 | Ga0208530_10172961 | F011243 | LPSPSPDSRFLALRFAAQEATDFGMNDDAVRRFAELRDRGASDAEIAGELRVPEDVVAALVKADAAQALARRIAAGEEPMYPVPEPEHRVFDTRSGSSAVPLAVLIAVLAGTI
Ga0208530_1018150 | Ga0208530_10181501 | F105144 | LSELDVAYFSDEPLREAAGHMRRLLAGEGETEDAERWAPLMAELNAVAAREAPSESALEELYWKLHLYRTEDELKTLRQNADLELSRQQELQRLEELRLRLLATLEAVRAHAPDR
Ga0208530_1019273 | Ga0208530_10192731 | F097106 | RIPVARARTNLADACRPPHLRKLMKPGSLRVVTIDDQRWVVRAVRRLDQSVPPLRFFNGAECRYVAEYPEDWPALPESELTSIFQRAHRGA
Ga0208530_1019873 | Ga0208530_10198731 | F064065 | FDVLLVRSPERRIDLLPQLDQFGALILDTYDFQDENKALAELQSFAQNHTLILLSSDAPLFSRIKSAVPRAVVIERSREDRGDWIAAQEPAEHYNSSSIRNTWKAIRTVLDRDKVAVASSEQSAPLLIAQTFHPKWVRSDNQPLYAATPFFTFGFFDKKPEMNFQRSRYDRGAVWFS
Ga0208530_1019954 | Ga0208530_10199541 | F010008 | VYHAYNDGELVATGRLTLDGPPQVGEEVRLNGRAHVVRDVGYGADAPVLTLEPR
Ga0208530_1019971 | Ga0208530_10199711 | F047804 | MRNAGVLAVLVIALSSFSTAQNKTSRPLDVAPEQIQRVEILYFPERVLVRAALTPERLEQLYQYKLELRDIRESPEWERLLPMLRQTAVTASGRNYDLRTAVLLFDKSGRRIASLYFDQFGAGGTINGQSGTISGGIYRWAKSLLKGVAG
Ga0208530_1021288 | Ga0208530_10212882 | F082662 | STTSSLAAISREEREALFTMVRPLLEGTYRLPLKHELTWTRLA
Ga0208530_1021429 | Ga0208530_10214291 | F007491 | TLAMTLTLLSGCTGGQEGSAAEEQRPRKNPIVTSSAEAQAPQVDVRQQLKSIGHAIPIYAGAKYRDDLTRRDAVMIRDQYGADAQVYTMATDDSYPQVYHYYTTYLSQFRAFQPGPPYPPTQNWRSMEVQLNEAMQDPFIPGDTIKSGEKNVLLQIAETEAEPKTVIRYIV
Ga0208530_1021539 | Ga0208530_10215391 | F001931 | ILSSVVQTLLRELKEGNGDKDRRRQVEEWMRVLAEKYPEFKIENGLRDYYLAEANRLRVDFENASDLTEKLSLGRHIESFLDRAAEYDRRIAER
Ga0208530_1021676 | Ga0208530_10216761 | F004686 | VDVETEGELTAGATLGYSPVAGDLRRRPGMEQRAGMNFAIRGSAPTLAGTRTSPVIRDKFVPNAKVAVDVDSAKFFELLIGRLSGK
Ga0208530_1021693 | Ga0208530_10216931 | F074535 | MGRVDAVTKAGWTRQHLRHGAHMLRYGPHPAATVYDSIGEDFFIALAPGWLNLGLWEGDGSD
Ga0208530_1022583 | Ga0208530_10225831 | F007880 | LCRRHLLVGEPARLYQDPSSKRFAKVCPLCYERAERRGWRADGRPIVAVHANPPADQALRERESLIDRLRGQLQSVEFDLDQVRNALAKAEQQAAELRGTKRELKELAGELKRREREIKSLTDDRRRADERAAEAEAAHRAEIARHHAVAAQLDERAAEAGRVNSRMA

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"