Contributors: Jade Chen, Jessica Luo, Heidi Lantz, Nazia Edroos
Our dataset, obtained from FiveThirtyEight, contains the results of a nationwide survey that collected information about drug use and health.
We use this data to predict an individual’s age group as Youth (Under 21) or Adult (21 & Over) based on their history of reported substance use.
We created and ran three different models using a train/test split on our data. All three models were unable to completely correctly classify youth vs. adult due to some limitations with our dataset, mostly a limited sample size and imbalance in how data was categorized.
Use the following steps to reproduce the analysis in a containerized environment:
Pull the latest pre-built Docker image from Docker Hub:
docker pull jadeeechen/dsci-310-group-7-data-dudes:latest
docker-compose.yaml
file in this repository to start the container.
git clone https://github.com/DSCI-310-2025/dsci-310-group-7-data-dudes.git
in your terminaldocker-compose up
make all
In your terminal, use the following bash command to clone this repository from GitHub to your local machine.
git clone https://github.com/DSCI-310-2025/dsci-310-group-7-data-dudes.git
Set this newly cloned repository as your local working directory.
cd dsci-310-group-7-data-dudes/
Open the Docker application.
docker run hello-world
You should see something like this if you were successful:
Unable to find image 'hello-world:latest' locally
latest: Pulling from library/hello-world
1b930d010525: Pull complete
Digest: sha256:451ce787d12369c5df2a32c85e5a03d52cbcef6eb3586dd03075f3034f10adcd
Status: Downloaded newer image for hello-world:latest
Hello from Docker!
This message shows that your installation appears to be working correctly.
To generate this message, Docker took the following steps:
1. The Docker client contacted the Docker daemon.
2. The Docker daemon pulled the "hello-world" image from the Docker Hub.
(amd64)
3. The Docker daemon created a new container from that image which runs the
executable that produces the output you are currently reading.
4. The Docker daemon streamed that output to the Docker client, which sent it
to your terminal.
To try something more ambitious, you can run an Ubuntu container with:
$ docker run -it ubuntu bash
Share images, automate workflows, and more with a free Docker ID:
https://hub.docker.com/
For more examples and ideas, visit:
https://docs.docker.com/get-started/
Within your local computer, set up the environment using Docker image:
Windows users, run the following:
docker build -t data-dudes-analysis .
docker run -it --rm -p 8787:8787 -e PASSWORD="sushi" data-dudes-analysis
Mac users, run the following:
docker build --platform=linux/amd64 -t data-dudes-analysis .
docker run --platform=linux/amd64 -it --rm -p 8787:8787 -e PASSWORD="sushi" data-dudes-analysis
Access the analysis
project
folder in the bottom right panel to view all files.Run the analysis script
cd project
to set the correct working directorymake all
, which:
View the results
index.html
fileAttribution: DSCI310 Group 02 instructions
System Dependencies
R Dependencies
pkg.drugage
: 1.0.0 (Instructions to install at DSCI-310-2025/pkg.drugage)docopt
: 0.7.1dplyr
: 1.1.4parsnip
: 1.3.1pointblank
: 0.12.2readr
: 2.1.5recipes
: 1.1.1rsample
: 1.2.1tidyr
: 1.3.1workflows
: 1.2.0These are the licences contained in this repository: