Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

blooming taxa analysis #39

Open
cuttlefishh opened this issue Dec 11, 2015 · 15 comments
Open

blooming taxa analysis #39

cuttlefishh opened this issue Dec 11, 2015 · 15 comments
Assignees
Labels

Comments

@cuttlefishh
Copy link
Collaborator

@rob-knight suggested: we should definitely do some blooming taxa analysis as per the Shade paper and maybe encourage Ashley to do that.

Ashley Shade is at Michigan State.

@ashleyshade
Copy link

Sure let me know how I can help

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Dec 10, 2015 7:41 PM, "Luke Thompson" [email protected] wrote:

@rob-knight https://github.com/rob-knight suggested: we should
definitely do some blooming taxa analysis as per the Shade paper and maybe
encourage Ashley to do that

Ashley Shade is at Michigan State


Reply to this email directly or view it on GitHub
#39.

@cuttlefishh
Copy link
Collaborator Author

Thanks, Ashley! Can you send a link to the manuscript Rob is referring to?

Luke

On Dec 10, 2015, at 5:13 PM, Ashley Shade [email protected] wrote:

Sure let me know how I can help

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Dec 10, 2015 7:41 PM, "Luke Thompson" [email protected] wrote:

@rob-knight https://github.com/rob-knight suggested: we should
definitely do some blooming taxa analysis as per the Shade paper and maybe
encourage Ashley to do that

Ashley Shade is at Michigan State


Reply to this email directly or view it on GitHub
#39.


Reply to this email directly or view it on GitHub #39 (comment).

@rob-knight
Copy link

A meta-analysis of changes in bacterial and archaeal communities with time. http://www.ncbi.nlm.nih.gov/pubmed/23575374
Conditionally rare taxa disproportionately contribute to temporal changes in microbial diversity. http://www.ncbi.nlm.nih.gov/pubmed/25028427

On Dec 11, 2015, at 10:01 AM, Luke Thompson [email protected] wrote:

Thanks, Ashley! Can you send a link to the manuscript Rob is referring to?

Luke

On Dec 10, 2015, at 5:13 PM, Ashley Shade [email protected] wrote:

Sure let me know how I can help

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Dec 10, 2015 7:41 PM, "Luke Thompson" [email protected] wrote:

@rob-knight https://github.com/rob-knight suggested: we should
definitely do some blooming taxa analysis as per the Shade paper and maybe
encourage Ashley to do that

Ashley Shade is at Michigan State


Reply to this email directly or view it on GitHub
#39.


Reply to this email directly or view it on GitHub #39 (comment).


Reply to this email directly or view it on GitHub #39 (comment).

@cuttlefishh
Copy link
Collaborator Author

Thanks, Rob!

On Dec 11, 2015, at 10:02 AM, Rob Knight [email protected] wrote:

A meta-analysis of changes in bacterial and archaeal communities with time. http://www.ncbi.nlm.nih.gov/pubmed/23575374
Conditionally rare taxa disproportionately contribute to temporal changes in microbial diversity. http://www.ncbi.nlm.nih.gov/pubmed/25028427

On Dec 11, 2015, at 10:01 AM, Luke Thompson [email protected] wrote:

Thanks, Ashley! Can you send a link to the manuscript Rob is referring to?

Luke

On Dec 10, 2015, at 5:13 PM, Ashley Shade [email protected] wrote:

Sure let me know how I can help

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Dec 10, 2015 7:41 PM, "Luke Thompson" [email protected] wrote:

@rob-knight https://github.com/rob-knight suggested: we should
definitely do some blooming taxa analysis as per the Shade paper and maybe
encourage Ashley to do that

Ashley Shade is at Michigan State


Reply to this email directly or view it on GitHub
#39.


Reply to this email directly or view it on GitHub #39 (comment).


Reply to this email directly or view it on GitHub #39 (comment).


Reply to this email directly or view it on GitHub #39 (comment).

@ashleyshade
Copy link

http://mbio.asm.org/content/5/4/e01371-14
http://www.cell.com/trends/microbiology/abstract/S0966-842X%2815%2900018-9


Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
Biomedical Physical Sciences rm 6179
517-884-5399 tel:517-884-5399
ShadeLab http://ashley17061.wix.com/shadelab

On Dec 11, 2015, at 1:01 PM, Luke Thompson [email protected] wrote:

Thanks, Ashley! Can you send a link to the manuscript Rob is referring to?

Luke

On Dec 10, 2015, at 5:13 PM, Ashley Shade [email protected] wrote:

Sure let me know how I can help

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Dec 10, 2015 7:41 PM, "Luke Thompson" [email protected] wrote:

@rob-knight https://github.com/rob-knight suggested: we should
definitely do some blooming taxa analysis as per the Shade paper and maybe
encourage Ashley to do that

Ashley Shade is at Michigan State


Reply to this email directly or view it on GitHub
#39.


Reply to this email directly or view it on GitHub #39 (comment).


Reply to this email directly or view it on GitHub #39 (comment).

@ashleyshade
Copy link

Just want to reiterate - would love to contribute to this! let me know if and how we can!

@cuttlefishh
Copy link
Collaborator Author

Ashley, we would love to have you help also! We are working toward a paper on 20k samples from the EMP, focusing on those with the best metadata plus a few other factors. We are currently working to identify the final set of 20k samples, and meanwhile putting together a set of 2k samples for testing analysis methods. Most of the samples in the EMP do not have a specific temporal component. Do you have some thoughts for how your work on conditionally rare taxa (CRT) would translate to a dataset without a major temporal component?

@ashleyshade
Copy link

The temporal component is key for the "blooming" interpretation. However, the algorithm is generic in that it could easily be applied over space instead of time, but then the interpretation would concern the biogeography of populations that are typically rare but in some localities abundant. In these cases, we would hypothesize that contextual data would explain any local high abundance of otherwise rare taxa. In any one particular habitat, this would be a bit boring (the local environment drives differences in community structure - duh) but with the large EMP dataset it may be interesting to determine macroscale patterns of rarity. Perhaps we could use a model of the biogeographic distributions (based on the whole dataset) to test if these occurrences are different than expected given neutral assumptions.

@cuttlefishh
Copy link
Collaborator Author

Hi @ashleyshade -- we are mobilizing forces to analyze the first 20k EMP samples (28,685 samples with >30,000 reads per sample -> 20,347 samples). How do the next few weeks look for you? Your idea to look at macroscale patterns of rarity sounds promising. Is there a version of the BIOM table that you'd prefer to work with? We have closed-reference (Greengenes and Silva), open-reference (Greengenes), "deblurred" actual sequences, and hope to also have de novo (Swarm).

@cuttlefishh
Copy link
Collaborator Author

One clarification: The sequences per sample will vary depending on the OTU picking method. The 30k seqs/sample is for the closed-ref GG table. We are comparing GG to Silva, and also looking at the other OTU tables, to decide on a reasonable set of ~20k samples to include in all analyses.

@ashleyshade
Copy link

Hi Luke,
Awesome!! I am traveling but send the biom along with the metadata and I'll
have a start as I am able.

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Jun 16, 2016 21:34, "Luke Thompson" [email protected] wrote:

Hi @ashleyshade https://github.com/ashleyshade -- we are mobilizing
forces to analyze the first 20k EMP samples (28,685 samples with >30,000
reads per sample -> 20,347 samples). How do the next few weeks look for
you? Your idea to look at macroscale patterns of rarity sounds promising.
Is there a version of the BIOM table that you'd prefer to work with? We
have closed-reference (Greengenes and Silva), open-reference (Greengenes),
"deblurred" actual sequences, and hope to also have de novo (Swarm).


You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#39 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/ACLJlW5hXdF22dz-Q5Puzmqz0kBGCs8rks5qMaU2gaJpZM4GzOX2
.

@ashleyshade
Copy link

O yeah and I think closed reference is best for this.

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Jun 17, 2016 12:15, "Ashley Shade" [email protected] wrote:

Hi Luke,
Awesome!! I am traveling but send the biom along with the metadata and
I'll have a start as I am able.

Ashley Shade, Ph.D.
Michigan State University
Microbiology and Molecular Genetics
On Jun 16, 2016 21:34, "Luke Thompson" [email protected] wrote:

Hi @ashleyshade https://github.com/ashleyshade -- we are mobilizing
forces to analyze the first 20k EMP samples (28,685 samples with >30,000
reads per sample -> 20,347 samples). How do the next few weeks look for
you? Your idea to look at macroscale patterns of rarity sounds promising.
Is there a version of the BIOM table that you'd prefer to work with? We
have closed-reference (Greengenes and Silva), open-reference (Greengenes),
"deblurred" actual sequences, and hope to also have de novo (Swarm).


You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#39 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/ACLJlW5hXdF22dz-Q5Puzmqz0kBGCs8rks5qMaU2gaJpZM4GzOX2
.

@cuttlefishh
Copy link
Collaborator Author

OK great! We will get the closed-reference OTU table on the FTP site for you (probably Silva bc it assigns more reads than GG). I will email a condensed version of the mapping file, and you can let me know if it has what you need (the full one is 117MB or ~6x larger than this one).

@cuttlefishh
Copy link
Collaborator Author

Ashley has posed these questions:

  1. Are there taxa that are universally rare? If so, who are they and is there anything known about their biology that informs their ecology (e.g. spore formers/known dormant states)? Is their rarity an artifact of detection?
  2. Are there taxa that are consistently rare in some biomes but prevalent in others (suggesting connectivity/seeding between habitats as observed for aquatic systems by terrestrial systems by Del Giorgio’s group recently)
  3. Is there phylogenetic signal among rare taxa, within a biome? Is the composition of rare taxa within a biome different than the composition of the prevalent taxa?
  4. CRT - can we detect “blooming” populations over space or time?

@cuttlefishh
Copy link
Collaborator Author

I have passed to @ashleyshade the closed-ref table with 5000 samples plus mapping file of those samples only. The files are on my Dropbox: https://www.dropbox.com/sh/pd5zjrfb0i7li6r/AAA1Wj7fnEQPucAWxE9QdDE-a?dl=0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants