While the case study reference Hiive, the scenarios, processes, and data are fictitious. This case study is the property of The Hiive Company Limited, is confidential and is not to be disseminated without Hiive’s prior consent.
This repo is a starting code base for the Hiive Analytics Engineering case study. The tasks to complete for the case study were provided to you separately by the Hiive recruitment team.
The case study is designed to test your ability to work with data and dbt code. The case study is designed to be completed in 2-3 hours.
This repo uses DuckDB as a database, but you can use any database you like (see section below for other databases).
To start working on the case study you need to do the following:
-
Fork this repo into your own account
-
Clone the repo to your local machine
git clone https://github.com/hiivemarkets/data-tech-interview -
Install the requirements
# create a virtual environment python3 -m venv venv source venv/bin/activate # install the packages pip install -r requirements.txt -
Check that everything is working by running:
dbt debugYou should see a message
All checks passed!. -
Seed the starting data:
dbt seedThose will be materialized in main schema in the database, e.g
main.transaction_termination_reasons_seed.
If you want to work with other databases. You can do that by changing two things:
-
Install required adapter, e.g. for Postgres:
pip install dbt-postgres -
Change the database connection in the
profiles.ymlfile. You can find more information on how to do this in the dbt documentation.
⚠️ Don't commit your database credentials to the repository!
You can read more about the input data in the data description file.