Diabetes Risk Classification
Coursework comparison between XGBoost and logistic regression on imbalanced healthcare data.
Trained and benchmarked XGBoost and Logistic Regression on an imbalanced diabetes dataset. Achieved 75.3% with XGBoost and 81.2% with Logistic Regression, recommending the simpler model.
Key Outcomes
Context
Problem and Context
Trained and benchmarked XGBoost and Logistic Regression on an imbalanced diabetes dataset. Achieved 75.3% with XGBoost and 81.2% with Logistic Regression, recommending the simpler model.
Approach
Approach and Architecture
Model comparison on imbalanced data to select a simpler, robust classifier.
Implementation
Implementation Details
Results
Results and Tradeoffs
This project is presented as a concise technical overview rather than a full-length narrative case study.
Explore More
Related Projects
Browse adjacent work from the same archive group or jump back to the project archive.
NLP
Multilingual Sentiment & Keyword Analyzer
NLP coursework project combining multilingual transformers, sentiment analysis, and keyword extraction.
Open case studyComputer Vision
Swimming Pool Detection
Computer vision coursework project focused on transfer learning and aerial object detection.
Open case studyCoursework
Used Vehicle Pricing & Valuation Model
Regression case study using feature engineering and tree ensembles on cross-country car listings.
Open case study