This page contains a list of coursework and serves to summarize major topics we have learned throughout my time here at OSU. If / when time permits, I wish to turn them into posts.

Overview

First year of Master's at OSU, major parts of our coursework focused on building the fundamentals: statistical methods and theories. Second year of Master's, we expanded beyond statistical methods and theories to include additional statistical models (e.g. GLMs) and simulation / optimization methods (e.g. EM). I also gained experience working with big (> 100 GB) data using cloud computing platforms (e.g. GCP, PySpark / Apache Spark).

Third year of PhD's also at OSU, we dived into advanced topics in statistics (e.g. decision theory, theories for GLMs/GLMMs). I also took a sneak peek of advanced techniques (e.g. reduced-rank regression, spatial GLMMs, INLA, MCMC), while completing course & independent projects.

Fourth year of PhD's, I spend most time in Bayesian spatial models such as Gaussian processes (GPs), variant GPs, NNGPs and NNMPs, and Markovian time series models such as MTDs.

(some of the major topics discussed)
[some of the work using R / Python / Query Language]

Scheduled coursework

PhD, Statistics, Oregon State Univeristy, 2021 — Present

Research

  • Dissertation (Bayesian Network, Finite Mixture Model, Copula Model and Dependence, Bayesian Hierarchical Model, MCMC Algorithms)

Completed coursework

PhD, Statistics, Oregon State Univeristy, 2021 — Present

Theories

  • Advanced Theory of Statistics II (Minimax, Admissibility, Shrinkage, Hypothesis Testing: UMP, MLR, UMPU)
  • Advanced Theory of Statistics III (Large Sample Theory; First-order and Higher-order Asymptotics, Edgeworth and Saddlepoint Approximations, M- and Z-Estimation)
  • Probability Theory (Measure Theory, Convergence, Central Limit Theorem, Characteristic Functions, Martingales)
  • Linear Models Theory I (Least Squares Estimation, Best Linear Unbiased Estimation BLUE, Multivariate Normal Distributions, Random Vectors and Matrices)
    Project Title: [Reduced-Rank Regression Model: A Review]
  • Linear Models Theory II (Generalized linear models GLM, Generalized linear mixed-effects models GLMM)
    Project Title: [Reparameterized SGLMM (Spatial Generalized Linear Mixed Model): A Review]

Methods

CS

  • Data Structures (Time Complexity Analysis, Aynamic Array and Bag ADT, Linked List, Stack and Queue ADT, BST/AVL Tree, MinHeap, HashMap)
    Programming Assignments: [Data-Structures-2023]

MS, Statistics, Oregon State Univeristy, 2019 — 2021

Research

  • Master’s Project (Counting Process, Poisson Process, Cox and Cluster Process, Hawkes Process, Spatio-Temporal Self-Exciting Process, Thinning Algorithm, 1D & 2D Simulations)

More Theories…

  • Advanced Theory of Statistics I (Measure, Decision Theory, Loss Functions, Exponential Families, Unbiasedness, Equivariance, Bayes)
  • Advanced Theory of Statistics II (Minimax, Admissibility, Shrinkage, Hypothesis Testing: UMP, MLR, UMPU)

Data Science

  • Data Science Tools and Programming / Big Data (Relational Databases: SQL; Non-Relational Databases: Spark; BigQuery, Data Wrangling, Data Visualization)
    using Cloud Computing Services, R, Python and Query Language [8 HWs] [See my project]

Simulation

  • Probability, Computing, and Simulation / Computational Statistics (Monte Carlo methods, Bootstrap estimates, Variance Reduction techniques, Optimization techniques)
    using R and GitHub [8 HWs] [See my project]

Methods

  • Generalized Regression Models / GLMs I (Exponential Families, Logistic Regression, Regression-type models for binomial, multinomial and Poisson data, Log-linear models for multidimensional contingency tables, Multinomial / Softmax Regression)
    using R [6 HWs] [See my project]
  • Spatial Statistics (Geostatistics, Markov Random Field, Poisson Process)
    pending/auditing
  • Time Series (Autoregressive Integrated Moving Average (ARIMA) model, Forecasting, Spectral Analysis)
    using R [6 HWs] [See our project]
  • Statistical Methods (Parametric tests: t-test, χ2-test, F-test; Nonparametric tests, Tests for Proportions, Tests for Binary Data)
    using R [6 HWs]
  • Linear Models (Simple & Multiple Linear Regression, Weighted Regression, Nonlinear Regression, Linear Models for Binary Data)
    using R [8 HWs]
  • Designs of Experiments (Factorial Designs, Random & Mixed Effect models, ANCOVA)
    using SAS [7 HWs]

Theories

  • Theory of Statistics I (Probability Distributions, Exponential Families, Functions of Random Variable: Transformations using cdf, pdf & mgf, Order Statistics)
  • Theory of Statistics II (Convergence, Sufficiency, Point Estimation: Maximum Likelihood, Bayes, UMVUEs, Rao-Blackwell, Lehmann-Scheffe theorem)
  • Theory of Statistics III (MP & UMP tests, CI, Large-Sample tests: LRT, Wald, score test)

Consulting

  • Consulting Practicum (Cases: GLMs, Occupancy Modeling, Meta-survey Analysis)
    2022, 2021, 2021, and more [See more details]

Others (Sampling Methods, Statistical / Machine Learning, etc)

  • Sampling Methods (Stratified Sampling, Cluster Sampling, Systematic Sampling, Multistage Sampling)
    using R [7 HWs + 1 Project]
  • Statistical Methods for Large and Complex Data Sets / Statistical Learning (PCA, Penalized Logistic Regression, Random Forests, Ensemble Methods)
    using R [See our project]
  • Multivariate Analysis (Multidimensional Scaling (MDS), Principal Components Analysis (PCA), Classification & Discriminant Analysis, Clustering)
    using R [6 HWs] [See my project]
  • Probability and Mathematical Statistics I, II

MAS (deferred), Applied Statistics, Colorado State Univeristy, 2019

  • Data Visualization Methods (Exploratory Data Analysis (EDA), Multipanel Conditioning, 3D Plotting)
    using R (base R, lattice, ggplot2) [See my project]
  • Quantitative Reasoning (Confounding; Interaction; Biases & Statistical Fallacies: Regression Toward the Mean, Data Snooping, Post-Hoc Hypothesis Testing)
    using R (base R, car, ggplot2) [3 HWs & 1 quiz]
  • Design and Data Analysis for Researchers II (Multiple Regression: ANCOVA, Model Selection & Diagnostics, Logistic Regression; Experimental Design: Fixed & Mixed Effects models)
    using R (base R, dplyr, car, ggplot2) [See my project]
  • Linear Algebra

Non-matriculated Status, Mathematics and CS, University of Washington, 2016 — 2018

  • Real Analysis (Advanced Calculus) I, Mathematical Reasoning, Linear Algebra, Differential Equations, Multivariate Calculus
  • Computer Programming I using Java

Other relevant coursework

BS, Earth and Space Sciences, University of Washington

  • Geographic Information Systems (GIS) using ArcGIS, Geoscience Communication, Statistical Methods using R, and Calculus I, II