Skip to content

THOMASELOVE/432-sources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

432-sources

Sources of Reading Material for PQHS/CRSP 432 with Dr. Love

This is a repository of things I may use or refer to during the semester. If you're looking for additional things to read, this is the right place.

Some items are password-protected, which if you click them will only let you download them from Github, and then open with a password on your own machine. The password is revealed in the first class session.

All of these items will be linked through Class READMEs, as well, as they become useful to us.

Course Notes

  • The 431 notes are here and will remain there until 2025-06-01.
  • The 432 notes are here and will remain there until 2025-06-01.

The Books We've Read in 431 or Will Read in 432 include:

  • (431) David Spiegelhalter The Art of Statistics available at Amazon and other retailers.
  • (432) Jeffrey Leek How to be a Modern Scientist available via Leanpub.

YouTube Video Series Recommended by Dr. Love

  1. Frank Harrell's Biostatistics for Biomedical Research (BBR) Course includes a series of lectures on many of the topics we'll be discussing in 432, in addition to several late-breaking items. Details on the course are available here and the notes are linked in the Statistics and Modeling books below.
  2. Richard McElreath Statistical Rethinking (Winter 2023) lecture series is the best introduction to Bayesian ideas and methods available.

Other Books or Book-Length Documents Recommended by Dr. Love

On Statistics and Modeling

  1. Frank E. Harrell and Chris Slaughter Biostatistics for Biomedical Research Notes (pdf) - also see the YouTube course above.
  2. Frank E. Harrell Regression Modeling Strategies, 2nd Edition, 2015.
  3. Max Kuhn and Julia Silge Tidy Modeling with R
  4. Paul Roback and Julie Legler Beyond Multiple Linear Regression: Applied Generalized Linear Models and Multilevel Models in R
  5. A Solomon Kurz Statistical rethinking with brms, ggplot2, and the tidyverse
  6. Julian J. Faraway Practical Regression and Anova using R, 2002.
  7. David G. Kleinbaum and Mitchel Klein Logistic Regression: A Self-Learning Text, 3rd Edition, 2010.
  8. Simon J. Sheather A Modern Approach to Regression with R, 2009.
  9. Eric Vittinghoff, David V. Glidden, Stephen C. Shiboski and Charles E. McCulloch Regression Methods in Biostatistics, 2nd Edition, 2012.
  10. Rob J Hyndman and George Athanasopoulos Forecasting: Principles and Practice

On Visualization and R

  1. Winston Chang R-Graphics Cookbook, version 2.0
  2. Kieran Healy Data Visualization: A practical introduction
  3. Claus O. Wilke Fundamentals of Data Visualization
  4. Rob Kabacoff Data Visualization with R

On Using R and Related Tools

  1. Hadley Wickham and Garrett Grolemund R for Data Science, 2nd edition
  2. Max Kuhn and Julia Silge Tidy Modeling with R (Yes, I'm listing it twice, because it's useful in both contexts.)
  3. Carrie Wright, Shannon Ellis, Stephanie Hicks, and Roger D. Peng Tidyverse Skills for Data Science in R.
  4. Chester Ismay and Albert Y. Kim Statistical Inference via Data Science: A Modern Dive into R and the Tidyverse
  5. Yihui Xie, Christophe Dervieux, Emily Riederer R Markdown Cookbook
  6. Yihui Xie, Amber Thomas, Alison Presmanes Hill blogdown: Creating Websites with R Markdown
  7. Yihui Xie, J. J. Allaire, Garrett Grolemund R Markdown: The Definitive Guide
  8. Hadley Wickham and Jenny Bryan R Packages
  9. Peter D. R. Higgins Reproducible Medical Research with R

Key Articles

  1. Statistical Inference in the 21st Century: A World Beyond p < 0.05 from 2019 in The American Statistician
  2. The American Statistical Association's 2016 Statement on p-Values: Context, Process and Purpose.
  3. Frank Harrell and colleagues' Glossary of Statistical Terms (pdf)
  4. Project-oriented workflow at tidyverse.org from Jenny Bryan.
  5. From the Ten Simple Rules series at PLOS Computational Biology:
  6. Karl W. Broman & Kara H. Woo (2018) Data Organization in Spreadsheets, The American Statistician, 72:1, 2-10, DOI: 10.1080/00031305.2017.1375989
  7. Min Q. Wang, Alice F. Yan and Ralph V. Katz (2018) Researcher Requests for Inappropriate Analysis and Reporting: A U.S. Survey of Consulting Biostatisticians Annals of Internal Medicine https://doi.org/10.7326/M18-1230.
  8. Peter C. Austin and Ewout W. Steyerberg (2015) The number of subjects per variable required in linear regression analyses J Clinical Epidemiology 68: 627-636.
  9. Richard D Riley, Joie Ensor, Kym I E Snell et al. Calculating the sample size required for developing a clinical prediction model (pdf) BMJ 2020; 368:m441. Link at BMJ.
  10. Andrew Gelman and John Carlin Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors

Miscellany

  • Andrew Gelman (and others) blog: Statistical Modeling, Causal Inference, and Social Science
  • datamethods "is a place where statisticians, epidemiologists, informaticists, machine learning practitioners, and other research methodologists communicate with themselves and with clinical, translational, and health services researchers to discuss issues related to data: research methods, quantitative methods, study design, measurement, statistical analysis, interpretation of data and statistical results, clinical trials, journal articles, statistical graphics, causal inference, medical decision making, and more." This new (2018-) resource's rationale is here.
  • RStudio Community "is a community for all things R and RStudio."
  • RStudio Cheat Sheets which have expanded enormously in recent years.
  • Dr. Love's favorite list of Colors in R.

Learning about Quarto (and making the switch from R Markdown)

  1. Virtually any code you have written in R Markdown can be run using Quarto instead, by simply switching the file extension from .Rmd to .qmd.
  2. It's still worth it to learn about how Quarto works, and why it differs from R Markdown when it does.

Here are some suggestions:

Need to have a tough talk with someone about statistical significance, and/or p values?

A Few Full-Text Open Source Papers (since 2018) that Explain or Use 432 Methods

I've separated this into its own page now.

About

Sources of Reading Material for PQHS/CRSP 432 with Dr. Love

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published