Parkisons-Disease

Parkinson’s Disease Detection System 🧠🎙️

Welcome to the Parkinson’s Disease Detection Machine Learning Project. This tool leverages voice biomarkers—such as fundamental vocal frequencies, jitter, shimmer, and harmonic noise—to accurately predict the presence of Parkinson’s Disease using state-of-the-art machine learning algorithms.


🚀 Quick Start & Guides

If you’re eager to get the project running or want a deeper understanding of the code, please refer to the dedicated guide files in this repository:

  1. Setup & Execution Guide (SETUP_AND_RUN.md)
    • Read this first.
    • Detailed, step-by-step instructions on Python installation, downloading module dependencies (pip install -r requirements.txt), and executing the data pipeline perfectly.
  2. Technical Explanation (HOW_IT_WORKS.md)
    • Read this to understand the logic and math.
    • A deep-dive into the machine learning engineering used here: explaining data leakage prevention (Train/Test splitting), SMOTE balancing, hyperparameter fine-tuning via GridSearchCV, and Min-Max Feature Scaling.

📁 Repository Structure


📊 Evaluation Overview

Our completely balanced pipeline evaluates both strict, standard baselines and powerful ensemble techniques.

Rigorous testing guarantees the data is un-leaked before modeling. As shown in the generated model_metrics_comparison.csv, our most accurate combinations currently yield:

This establishes a very strong non-invasive metric for assessing early progression. Enjoy the project!