Add dataset + Sweetviz reports; update notebooks

Add raw and cleaned Git classroom dataset (claude/dataset_D_git_classroom_activity.csv and _clean.csv) and two Sweetviz HTML reports. Update several lab notebooks (lab02_task1_datasets.ipynb, lab02_task1_datasets_v2b.ipynb, lab02_task2_telemetry_v4.ipynb, lab02_task3_git_activity_solutions.ipynb) and deploy/lab01_task2_telemetry.ipynb to incorporate the new data, cleanup steps, and EDA outputs.
This commit is contained in:
2026-03-02 16:12:22 +00:00
parent d689ada45e
commit cf6be3275a
9 changed files with 52607 additions and 85 deletions

View File

@@ -2,7 +2,7 @@
"cells": [
{
"cell_type": "code",
"execution_count": null,
"execution_count": 1,
"id": "d321d996",
"metadata": {},
"outputs": [],
@@ -610,7 +610,7 @@
"name": "stderr",
"output_type": "stream",
"text": [
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_58292\\2163207487.py:2: FutureWarning: DataFrameGroupBy.apply operated on the grouping columns. This behavior is deprecated, and in a future version of pandas the grouping columns will be excluded from the operation. Either pass `include_groups=False` to exclude the groupings or explicitly select the grouping columns after groupby to silence this warning.\n",
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_34380\\2163207487.py:2: FutureWarning: DataFrameGroupBy.apply operated on the grouping columns. This behavior is deprecated, and in a future version of pandas the grouping columns will be excluded from the operation. Either pass `include_groups=False` to exclude the groupings or explicitly select the grouping columns after groupby to silence this warning.\n",
" correlation = df.groupby('dataset').apply(lambda g: g['x'].corr(g['y'])).round(2)\n"
]
}
@@ -986,12 +986,12 @@
"name": "stderr",
"output_type": "stream",
"text": [
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_58292\\1218167074.py:4: FutureWarning: \n",
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_34380\\1218167074.py:4: FutureWarning: \n",
"\n",
"Passing `palette` without assigning `hue` is deprecated and will be removed in v0.14.0. Assign the `x` variable to `hue` and set `legend=False` for the same effect.\n",
"\n",
" sns.boxplot(data=df_focus, x='dataset', y=var, ax=ax,\n",
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_58292\\1218167074.py:4: FutureWarning: \n",
"C:\\Users\\sss\\AppData\\Local\\Temp\\ipykernel_34380\\1218167074.py:4: FutureWarning: \n",
"\n",
"Passing `palette` without assigning `hue` is deprecated and will be removed in v0.14.0. Assign the `x` variable to `hue` and set `legend=False` for the same effect.\n",
"\n",