Data Discovery Using DataONE

Hands-on Activity

Introduction

The Long-Term Ecological Research (LTER) program requires all sites to publish their data online within two years of collection. However, they do not specify where data must be published. The Northern Gulf of Alaska (NGA LTER) has chosen to publish our data via Axiom Data Science’s Research Workspace (RW). Most other LTER sites submit their data to the Environmental Data Initiative (EDI). How could an investigator find datasets from multiple LTER sites if they are archived in different repositories?

DataONE (Data Observation Network for Earth) was established by the National Science Foundation as a network of data repositories. Data collections are held in separate “member” repositories but are searchable via a central interface. A list of member institutions is found at https://www.dataone.org/network/. Both Research Workspace and EDI are members of DataONE, so it would be a good search interface for investigators hoping to integrate data from several LTER sites.

This hands-on activity provides a chance to explore the DataONE portal and to practice searching for data using their interface.

Instructions

Plan your search

Think about the research you are doing. Think of a way you could expand it to include existing datasets. Some examples might include:

Comparing your data to other LTER sites such as:
- California Curent (LTER Site CCE)
- Beaufort Lagoon (BLE)
- Terrestrial Alaskan sites like Arctic LTER (ARC) or Bonanza Creek (BNZ)
Comparing your data to other research programs or organizations in the Gulf of Alaska, such as:
- GLOBEC
- Alaska Department of Fish and Game
- USGS
- Exxon Valdeze Oil Spill Trustee Council (EVOS)
- NOAA
Data of the same type or collected using the same instrument, using terms like:
- glider
- grazing rates
- trace metals

Before you start searching, take a moment to describe what you are looking for:

And underline keywords that might be useful.

Look for Datasets

Go to https://search.dataone.org/data and experiment with different searches:

Try searching by Location, Data Attribute, Member Node, etc.
Try using the free text search
Use the author names from one search result to perhaps turn up more relevant records

Result Evaluation

During your search, pick 3 results and record relevant information about them here:

Dataset 1

Dataset title

Authors/Organization

Description

Download format (if available)

How did you find it?

Dataset 2

Dataset title

Authors/Organization

Description

Download format (if available)

How did you find it?

Dataset 3

Dataset title

Authors/Organization

Description

Download format (if available)

How did you find it?

Final Questions for Discussion

Could you find what you wanted?
What metadata was important in your search?
Did you end up focusing on a single repository? Would it have been faster to go directly there?