Skip to content

Data Steward Job Simulation

The Data Steward should be skilled in data cleaning, analysis and management. This job simulation is designed to test the skills of applicants to the Data Steward position. The full exercise can be found here. Full instructions are included with the exercise. The following is a brief summary of the tasks required of the applicant.

Task Overview

Task 1

This task reflects a core job requirement for the Data Steward to collaborate with partners to improve data collection, storage, and management. The applicant will review a realistic but manufactured dataset that was compiled from two different partners. The data reflects attendance at each partner's event. In this task, the Data Steward will review the data for data quality issues and recommend best practices to help the partners collect data in a more systematic way.

Task 2

This task is intended to evaluate proficiency with SQL and knowledge of database normalization. The applicant will normalize the data to prepare it for efficient storage in a relational database management system. The applicant will use SQL to write the Data Definition Language (DDL) to create one or more tables necessary to store the data as efficiently as possible. The applicant is expected to normalize data to Boyce and Codd Normal Form.

Task 3

This task is intended to evaluate basic proficiency with JavaScript. Example code is provided that converts a simplified dataset to JSON format with JavaScript with some coding errors. The applicant is expected to find and fix errors to produce the desired output.

Rubric

The rubric for scoring is available here (please request access to view).