DATA2902: Data Analytics — Learning from Data (Advanced)
University of Sydney• 2025 S2
In Progress
R
MachineLearning
Statistical Modelling
Develops advanced data analytics skills using R, focusing on data acquisition, integration, statistical modelling, machine learning, and reproducible reporting with version control.
Learning Outcomes
Problem formulation: Frame domain-specific questions and select appropriate statistical analyses.
Data acquisition & integration: Extract, join, and harmonise data from multiple sources.
Summarisation & EDA: Build numerical and graphical summaries for varied—and large/complex—datasets.
Version control: Use a software version-control system (e.g., Git) effectively in analytics workflows.
Statistical inference: Choose, justify, and implement suitable parametric or non-parametric tests.
Linear modelling: Formulate, fit, and interpret linear models for multi-factor relationships.
Statistical ML: Apply a given classifier, design cross-validation, and report predictive accuracy.
Reproducible reporting: Produce reproducible reports (R Markdown/Quarto) that communicate results clearly to stakeholders.