Working with Data for REU Students, 2023
A four week workshop presented by Liz Dobbins
(liz@axiomdatascience.com)
Introduction
This four week workshop will introduce undergraduate students to current best practices regarding reproducible scientific research and data. Whereas the previous version of this workshop included Carpentries lessons for hands on practice, four weeks might not be long enough to to include those. Instead, the emphasis is on motivations and developing the culture of Open Science. This will hopefully enable further exploration of these topics by the students on their own. As such, this workshop is meant to be an introduction to topics students will confront if they continue as a graduate student or scientist.
Topic Overview
Week 1: Open Science is Better Science
- Introductions
- Open Science (Slide)
- Mistakes
- Growth Mindset
- Scaffolding to enable Open Science
- Scripting
- Version control
- Data Life Cycle
Introduction of a Possible Team Project
Week 2: Understanding Code
- Interactive intro to Python (Colab Notebook)
- Understanding code (Slides)
- Learning code: why and how
- Where we run into trouble
- 6 Tips and Tricks
- Practice best practices (Colab Notebook)
Week 3: Data Life Cycle
- Pandas demo using GAK1 temperature (Colab Notebook)
- Tidy data (Slides)
- Data Carpentry’s Improving Messy Data (Tidy Data)
- Data sources and archives (Slides)
- DataONE data discovery activity
Week 4: TBD
- FAIR
- Findable
- Accessible
- Interoperable
- Reproducible
Instructor information:
Liz Dobbins has been working with oceanographic data for more than 30 years in both academia and the private sector. She has collected data at sea, processed sensor data, mapped assets, utilized numerical models, and used open-source Python tools to ingest data into a public-facing data portal. She is a certified Carpentries instructor and is eager to talk about best practices regarding scientific computing.