This project analyzes the impact of mental health factors on students’ academic performance (CGPA) using a large dataset. It includes:
- 📊 Data visualization (scatter plots, heatmaps)
- 🤖 Predictive modeling (Linear Regression)
- 🔬 Statistical hypothesis testing (p-values)
- 📄 Automated PDF research report generation
To evaluate how mental health indicators such as:
- Stress
- Anxiety
- Depression
- Burnout
- Overall Mental Health Index
affect students’ academic performance (academic_performance as proxy for CGPA).
-
File:
student_mental_health_burnout_1M.csv -
Contains:
- Mental health metrics
- Academic performance indicators
-
Identified relevant features:
- Independent variables: mental health indicators
- Dependent variable: academic performance
Generated:
- Scatter plots for each feature vs CGPA
- Correlation heatmap
-
Model: Linear Regression
-
Train-test split: 80/20
-
Metrics:
- R² Score
- RMSE
-
Pearson correlation test used
-
Computed p-values for each variable
-
Significance threshold:
p < 0.05
-
Generated a PDF research report
-
Includes:
- Model performance
- Coefficients
- Statistical results
- Visualizations
- Python 🐍
- Pandas
- NumPy
- Matplotlib
- SciPy
- Scikit-learn
- ReportLab
pip install pandas numpy matplotlib scipy scikit-learn reportlab- Place dataset in project folder:
student_mental_health_burnout_1M.csv- Run the script:
python main.py- Output:
- 📊 Charts (PNG files)
- 📄
mental_health_research.pdf
- Mental health variables show weak correlation with academic performance
- Predictive model shows low explanatory power
- Suggests other factors influence CGPA more strongly
-
Correlation does not imply causation
-
No time-series analysis
-
Missing external factors:
- Study habits
- Sleep patterns
- Socioeconomic background
- Use advanced ML models (Random Forest, XGBoost)
- Feature engineering
- Add more behavioral data
- Time-series or longitudinal analysis
- Interactive dashboards (Plotly / Power BI)
mental_health_research.pdf- Scatter plots
- Correlation heatmap
Feel free to fork and improve:
- Add new models
- Improve visualizations
- Enhance statistical rigor
This project is for educational and research purposes.
Shivakumar Shivampeta