Using data to understand the world, uncover patterns, and drive insights.
SUDATA × SUBAA Datathon (24h)
24-hour optimization sprint on a supply-chain case blending MIP, clustering, and time-series signals.
Meituan Business Analytics Challenge
Causal inference on subsidy batches to estimate true incremental GMV and reallocate budget to higher-ROI segments.
CFA Institute Research Challenge
Empirical valuation lens linking innovation-heavy productive forces to market repricing with panel-style evidence.
Course · Medical imaging
CV pipeline from HOG/KNN baselines toward CNN/ResNet-style models with explainability hooks.
Citi Global Market Challenge 2026
A multi-asset portfolio strategy balancing alpha generation, transaction costs, and risk management across equities, fixed income, commodities, FX, and cash.
Creator growth analytics
Clustered thousands of AI-creator uploads to find better time-slot × scale combinations and quantify view lift.
Course · Risk & policy
Hazard exposure scoring with socio-economic layers and parametric insurance recommendations.
Course · Survey analysis
Cleaned Likert survey data on study habits and expectations with clear reporting-ready visuals.
Short drama training corpus
JSON→tabular pipeline with dialogue metrics for rhythm, diversity, and engagement in role-play training data.
Course · Time series
Univariate and multivariate forecasting experiments to relate AI-cycle narratives to NVDA price dynamics.
Course · Econometrics
Compared regression and nonlinear forecast specs to test robustness and omitted-variable bias on earnings drivers.
Course · Statistical modeling
Red vs white Vinho Verde modeling with selection, stability checks, and interpretable drivers of perceived quality.
Course · ML classifiers
Explored regional employer–income patterns with tree ensembles, KNN, NB, and regression baselines.
Course · Spatial scoring
Composite accessibility index with normalization, weighting, and defensible ranking across SA2 units.
Course · Model comparison
Accuracy vs runtime trade-offs across classical learners with stratified CV on medical vs sensor data.
YouTube comments × ELM + Gemini
Large-scale comment mining to trace sentiment shifts, persuasion routes, and herding in a major pop-culture event.
Course · Regression suite
Predictive modeling for quick-service daily sales with elastic net, ridge/lasso, KNN, and rigorous CV.
Asia-Pacific Mathematical Contest in Modeling
My APMCM paper: optimization-first model, parameter sensitivity, and a tight writeup for our problem track.
COMAP Mathematical Contest in Modeling
Our MCM/ICM entry: clear assumptions, statistical core models, and evidence-backed answers for the chosen problem.