Link Search Menu Expand Document

Research Data Bootcamp Overview

Welcome to the Research Data Bootcamp for incoming CU Boulder graduate students. The Bootcamp is sponsored and organized by the Center for Research Data and Digital Scholarship (CRDDS), which is a collaboration between CU Boulder University Libraries, and Research Computing. You can view the CRDDS website here.

As incoming graduate students, it is likely that you will work with research data at some point during the course of your respective programs. The goal of this Bootcamp is to provide you with an overview of some of the skills, strategies, and best practices that will help you to get started in pursuing this kind of work. The Bootcamp does not presuppose any prior experience in working with research data, and all of the sessions are introductory in nature. Our purpose is not to help you master any given topic, but to provide a useful and productive starting point for further exploration and learning. As you explore further after the Bootcamp and into the semester, we hope you’ll make use of CRDDS’s many resources, such as opportunities for individual consultations and workshops (many of which will cover the material introduced here in greater depth). More information about these opportunities is provided on the CRDDS website. For related offerings, please also see the website of our collaborators at the Laboratory for Interdisciplinary Statistical Analysis (LISA).

In thinking about the various skills and tasks that are relevant to projects that involve research data, librarians and data professionals often think in terms of a heuristic called the research data lifecycle:

Research Lifecycle

The lifecycle model specifies, in broad terms, the distinct yet interconnected stages of a data-intensive research project; each stage involves a variety of more specific tasks and competencies, as represented in the following graphic from the CRDDS website:

Research Lifecycle Tasks

By attending the Bootcamp, our hope is that you will be empowered to navigate the research data lifecycle with respect to your own work, in a deliberate and intentional way.

The different sessions of the Bootcamp address themselves to various stages of this research data lifecycle (italicized words correspond to the stage(s) of the research data lifecycle that the session will emphasize).

  • The first session, on Data Discovery and Ethics, will be taught by Liz Novosel (Subject Librarian at CU Boulder Libraries) and Nickoal Eichmann-Kalwara (Digital Scholarship Librarian at CRDDS). It will provide you with an introduction to basic strategies for finding data that is relevant to your research needs, and explore basic principles of data ethics that you should consider when working with research data. (Plan; Develop; Publish; Preserve)
  • Our second session, on Data Management, will be taught by Matthew Murray (Data Librarian at CRDDS). It will explore basic principles of data management, and introduce tools for data management and storage. (Plan; Develop; Preserve).
  • Our third session, on Qualitative and Humanities Data will be taught by Nickoal Eichmann-Kalwara (Digital Scholarship Librarian at CU Boulder Libraries). It will provide an overview of different data types, with a particular focus on tools and principles for working with qualitative data. (Plan; Develop; Publish; Preserve)
  • Our fourth session, on Data Publishing and CU Scholar, taught by Melissa Cantrell (Scholarly Communication Librarian at CRDDS) and Matthew Murray (Data Librarian at CRDDS), will discuss basic principles of scholarly communication as it relates to data research, and best practices for data archiving (Plan; Publish; Preserve)
  • Our fifth session, on Data Analysis in R, will be taught by Scott Nordstrom (PhD Student in the Ecology and Evolutionary Biology department at CU Boulder, and CRDDS Statistics Consultant ). It aims to introduce R novices to useful principles, tools, and packages for data-based research in R/R Studio (Develop).
  • The sixth and seventh sessions will focus on the Python programming language, and will be taught by Tim Dunn (Visualization and Research Computing Specialist at CRDDS). The first Python session will focus on the fundamentals of the Python programming language, while the second will focus on using Python for data-based research (Develop).
  • The eighth session, on GitHub and GitHub Pages, will be taught by Phil White (Earth Sciences and GIS Librarian at CU Boulder Libraries). It will explore how you can use GitHub and GitHub pages as platforms to collaborate with others, and to share and disseminate your work (Publish; Preserve)

We will conclude the Bootcamp by discussing how you can continue your learning about the various topics introduced at the Bootcamp by participating in CRDDS programs and activities during the fall semester and beyond. We will also discuss the details of the Bootcamp microcredential, which you can earn by completing a small project that demonstrates your familiarity with the topics presented at the Bootcamp.