Advanced Data Analysis

Course #OS4106

Est.imated Completion Time: 3 months


This course moves beyond the ordinary linear model to other types of statistical models that will be appropriate in different circumstances. Students are first introduced to supervised models, including logistic regression and "generalized linear models" (GLM). The importance of complexity control and a training-set/test-set division is emphasized. Non-parametric models are introduced through classification and regression trees. Classification performance assessment is discussed. Unsupervised models, to include clustering and principal components are presented. Throughout the course, examples are drawn from practical experience with conducting research and solving problems for Navy and DoD customers.

Included in degrees & certificates

  • 268
  • 367


  • OA3103

Learning Outcomes

Upon successful completion of this course, you should be able to:
• Distinguish between supervised and unsupervised methods
• Implement linear regression models
• Implement logistic regression models
• Implement random forest models
• Utilize regularization (Ridge, Lasso, ElasticNet)
• Distinguish between classification and regression
• Define and distinguish various classification metrics
• Utilize validation techniques to assess model performance and avoid overfitting
• Implement clustering models
• Reduce the dimensionality of your data by using principal component analysis
• Utilize exponential smoothing and ARIMA models for time series data
Offerings database access
Asset Publisher

Application Deadlines

  •  08 Jul 2024

    Fall Quarter applications due

Asset Publisher

Academic Calendar

No upcoming events.