Statistical Methods
Causal Inference (DiD, RCT design, causal DAGs, wait-list controlled trials, synthetic controls, propensity-score matching, Rosenbaum sensitivity bounds, uplift modelling),
A/B Testing & Experimentation (variant design, power analysis, SRM detection, CUPED variance reduction, multiple-testing correction),
Bayesian Statistics (PyMC 5, NUTS, hierarchical models, posterior-predictive checks, PSIS-LOO),
Survival Analysis (Cox PH, Kaplan-Meier, Random Survival Forest),
Hypothesis Testing (t-test, Wilcoxon signed-rank, Mann-Whitney U, chi-square, Fisher z),
Stochastic Processes & Sequential Modeling (Hidden Markov Models, time-series, state-space methods),
SHAP, permutation importance, Brier, isotonic and LOOCV calibration,
bootstrap & Hodges-Lehmann confidence intervals, Effect-size estimation (Cohen’s d, rank-biserial).
Machine Learning, Deep Learning & GenAI
Supervised/Unsupervised Learning, Gradient Boosting (XGBoost, LightGBM),
Anomaly & Rare-Event Detection,
Neural Networks (CNNs, RNNs, LSTMs, Transformers, Two-Tower),
NLP, Computer Vision, Recommendation Systems (implicit feedback, negative sampling, embeddings & vector search),
Reinforcement Learning (Q-learning, DQN, PPO + GAE, Actor-Critic),
Probabilistic Graphical Models, LLMs, Prompt Engineering, RAG,
OpenAI API, Anthropic Claude tool-use, Zod structured outputs, LLM-as-judge evaluation,
LLM Fine-Tuning, RLHF, Agentic AI (LangChain, LlamaIndex), Generative AI Applications.
Programming & Databases
Python (NumPy, pandas, scikit-learn, TensorFlow, PyTorch, statsmodels, SciPy, XGBoost, LightGBM, PyMC),
TypeScript, C++, R, SQL, BigQuery, Spark/PySpark, DuckDB,
React 19, Vite, Vercel Serverless, Supabase (Postgres).
MLOps, Cloud & Visualisation
Google Cloud Platform (BigQuery, Vertex AI), Docker, Kubernetes,
MLOps (CI/CD, Model Deployment, Feature Pipelines, ETL/Airflow),
GitHub Actions, pytest, Vitest, ruff, Git/GitHub, Jupyter, LaTeX,
Tableau, Power BI, matplotlib, seaborn, Plotly.
Spoken Languages
English (Fluent), Russian (Native),
Ukrainian (Native), Belarusian (Fluent),
Spanish (Professional Working).
Extracurricular Coursework
Stanford CS229 Machine Learning, Stanford CS230 Deep Learning,
MIT RES.6-012 Introduction to Probability, Stanford EE178 Probabilistic Systems Analysis,
Imperial College Mathematics for Machine Learning, IBM Applied Data Science Specialization,
Statistical Rethinking 2026 (R. McElreath, Max Planck).