Budget: 6000 UAH
Deadline: 7 days
Hello. The category "Databases and SQL" does not quite fit this task. SQL is suited mainly to extracting and aggregating data, whereas correlation analysis and mathematical modeling belong to data science and medical statistics.
I am ready to implement both stages of your project in Python (Pandas, SciPy, Scikit-learn) or R:
Statistical analysis: I will clean the data, calculate descriptive statistics, and compute correlations (Pearson/Spearman), reporting the statistical significance (p-value) of each one.
Predictive model: I will build an ML model (for example, logistic regression, Random Forest, or XGBoost) to classify cases and estimate the risk of developing pathologies from the input factors.
Validation: I will evaluate the model on held-out data using ROC-AUC, precision, and recall to confirm that it has real clinical and predictive value, rather than just being overfitted to the training data.
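As a rough illustration of the steps above, here is a minimal Python sketch on synthetic data. Everything specific in it is an assumption made for the example: the column names (`age`, `bmi`, `noise`, `pathology`), the helper functions, the 75/25 split, and the choice of logistic regression. The real pipeline would depend on the structure of your dataset.

```python
import numpy as np
import pandas as pd
from scipy import stats
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import precision_score, recall_score, roc_auc_score
from sklearn.model_selection import train_test_split


def make_synthetic_data(n=400, seed=0):
    """Synthetic stand-in for the real data; all column names are hypothetical."""
    rng = np.random.default_rng(seed)
    age = rng.normal(55, 12, n)
    bmi = rng.normal(27, 4, n)
    noise = rng.normal(0, 1, n)  # deliberately unrelated to the outcome
    # Outcome probability rises with age and BMI.
    logit = 0.08 * (age - 55) + 0.15 * (bmi - 27)
    pathology = rng.binomial(1, 1 / (1 + np.exp(-logit)))
    return pd.DataFrame(
        {"age": age, "bmi": bmi, "noise": noise, "pathology": pathology}
    )


def correlation_table(df, target):
    """Pearson and Spearman correlation of each feature with the target, with p-values."""
    rows = []
    for col in df.columns.drop(target):
        r, p_r = stats.pearsonr(df[col], df[target])
        rho, p_rho = stats.spearmanr(df[col], df[target])
        rows.append(
            {
                "feature": col,
                "pearson_r": r,
                "pearson_p": p_r,
                "spearman_rho": rho,
                "spearman_p": p_rho,
            }
        )
    return pd.DataFrame(rows)


def fit_and_validate(df, target, seed=0):
    """Fit logistic regression on a training split and score it on a held-out split."""
    X = df.drop(columns=[target])
    y = df[target]
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )
    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    proba = model.predict_proba(X_test)[:, 1]  # predicted risk of the positive class
    pred = model.predict(X_test)               # class labels at the default 0.5 threshold
    return {
        "roc_auc": roc_auc_score(y_test, proba),
        "precision": precision_score(y_test, pred, zero_division=0),
        "recall": recall_score(y_test, pred, zero_division=0),
    }


if __name__ == "__main__":
    data = make_synthetic_data()
    print(correlation_table(data, "pathology"))
    print(fit_and_validate(data, "pathology"))
```

The metrics are computed only on rows the model never saw during fitting, which is what separates a genuine predictive signal from overfitting. On a small dataset, cross-validation would likely be a better choice than a single split.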
Please send me a private message so we can discuss the structure of your data (volume, number of variables, presence of missing values). After that I can start working immediately.