ai

Mental Health Screening

ML-based depression detection system using text analysis and Random Forest classification

About the Project

Study project for learning machine learning and data science fundamentals
Text-based depression detection using Random Forest classifier trained on mental health dataset
Character-level TF-IDF features (7-grams) for robust text analysis without extensive preprocessing
Comprehensive model comparison: tested 10 algorithms including Naive Bayes, SVM, and ensemble methods
Flask web application with real-time prediction API deployed on PythonAnywhere
Systematic hyperparameter tuning with results exported to Excel for analysis
Mental Health Screening - Image 1

Key Highlights

Trained Random Forest classifier achieving high accuracy on mental health text classification
Implemented character 7-gram TF-IDF vectorization for feature extraction from raw text
Conducted extensive hyperparameter tuning across 10 ML algorithms with grid search
Built Flask REST API for real-time depression risk prediction from user text input
Applied text length filtering (60-3000 characters) and stratified train-test splitting
Evaluated models using accuracy, precision, recall, F1-score, and confusion matrices
Deployed production model with joblib persistence for consistent inference pipeline

Technologies Used

Backend

FlaskFlaskPythonPython

ML / Data Science

scikit-learnscikit-learnNLTKNLTKpandaspandasnumpynumpyJupyterJupyter

Cloud

PythonAnywherePythonAnywhere