Project Autodidact Overview

LEGAL RULES AND ETHICAL PRINCIPLES REGARDING DATA ACCESS AND USE

I will always be mindful of legal rules and ethical principles regarding data access and use.  Because the goal of Project Autodidact is to develop specific knowledge and skills, I will only use publicly available non-governmental data and synthetic data for Project Autodidact.

GOAL: DEVELOP SPECIFIC KNOWLEDGE AND SKILLS

Project Autodidact’s goal is to develop the knowledge and skills to use data science and artificial intelligence to create predictive and forecasting tools for multiple purposes described below.

OVERVIEW: FUNDAMENTALS

I’m a disciplined and motivated autodidact.  I created a self-study plan to achieve the goal of Project Autodidact.  I’m a strong believer in the importance of fundamentals in any endeavor, so the self-study plan begins with a deep review of applicable mathematics, probability, and statistics starting from basic arithmetic.  Most of these topics are a review for me.  Math is very easy for me.  I earned top grades in advanced/gifted math classes from elementary school through high school.  I completed calculus classes at a local junior college as a high school senior.  I earned a grade of A in calculus as a college freshman, which was a review of calculus that I had already learned in high school.  I identify the problem type (e.g., pattern recognition) and apply relevant rules and procedures to solve it.

OVERVIEW: STAGES AND MODULES

Project Autodidact is structured into four stages.  Each stage is a set of modules.  Each individual module is planned to be completed in one week, but the one week is flexible based on my available time.  There are a total of 116 modules among all four stages, including modules for mini-projects and rest (i.e., rest weeks).  Stage 1 is 3 modules.  Stage 2 is 74 modules.  Stage 3 is 3 modules.  Stage 4 is 36 modules.

If I maintain a pace of one module per week, I will complete the total project in 2 years and 12 weeks (116 weeks).

PROGRESS REPORTS

Specific details of each module will be shared in progress reports at www.InsightsBySE.com

TEMPORARY PAUSE AND RESUMPTION

Project Autodidact is designed for temporary pause and resumption if necessary to focus on another short project, such as a comedy festival or acting role.

ADAPTATION

I stay current on developments in the use of data science and artificial intelligence to create predictive and forecasting tools.  Project Autodidact will be adapted to address any significant developments, especially new methods or technologies.

CONTINUOUS LEARNING AFTER COMPLETION OF PROJECT AUTODIDACT

I’ve created a continuous learning plan to begin after the completion of Project Autodidact.

PRIMARY PROFESSIONAL GOAL AFTER COMPLETION OF PROJECT AUTODIDACT

Project Autodidact’s goal is to develop the knowledge and skills to use data science and artificial intelligence to create predictive and forecasting tools for the purposes described below.

My primary professional goal after completion of Project Autodidact is to use data science and artificial intelligence to save and improve veterans’ lives.  I will present proposals to government and private entities to create the following tools based on the following priority that may change based on circumstances and available resources:

  1. Predictive tools for military and veteran mental health (e.g., predict probability of PTSD during and after service)
  2. Forecasting tools for veterans’ mental health (e.g., forecast veterans’ mental health issues in the next decade)
  3. Predictive tools for the socioeconomic effects of proposed government policies on veterans (e.g., predict probability of veteran homelessness)
  4. Forecasting tools for the socioeconomic effects of proposed government policies on veterans (e.g., forecast labor participation of veterans over the next decade)

I’m fully aware of the institutional and political realities affecting the achievement of these ambitious goals.  I need to build coalitions of partners with shared values and goals to achieve these goals.  I can be very persuasive and influential if necessary.

Given my limited time and energy, I will prioritize projects based on circumstances and available resources.  I’ll focus on completing the highest priority project before starting the next project.

SECONDARY PROFESSIONAL GOAL AFTER COMPLETION OF PROJECT AUTODIDACT

My secondary professional goal after completion of Project Autodidact is to start a professional consulting business to leverage my domain knowledge in accounting, finance, and general business to help businesses reach their full potential.

Again, given my limited time and energy, I will prioritize projects based on circumstances and available resources.  I’ll focus on completing the highest priority project before starting the next project.

PROJECT AUTODIDACT DETAILS

STAGE 1 (3 MODULES): REVIEW OF APPLICABLE MATHEMATICS, PROBABILITY, AND STATISTICS FOR DATA SCIENCE

Module S1-1: Foundations of Arithmetic and Algebra

Module S1-2: Statistics, Probability, and Advanced Algebra

Module S1-3: Linear Algebra, Calculus, and Applications

STAGE 2 (74 MODULES): DATA SCIENCE

Stage 2: Cluster 1: Introduction to Data Science

Module S2-1: Cluster Preparation

Module S2-2: Data Science Tools

Module S2-3: Data Ethics

Module S2-4: Mini-Project

Module S2-5: Rest Week

Stage 2: Cluster 2: Python Programming Fundamentals

Module S2-6: Cluster Preparation

Module S2-7: Python Basics

Module S2-8: Python Data Structures

Module S2-9: Python Libraries

Module S2-10: Mini-Project

Module S2-11: Rest Week

Stage 2: Cluster 3: R Programming Fundamentals

Module S2-12: Cluster Preparation

Module S2-13: R Basics

Module S2-14: R Data Manipulation

Module S2-15: R Visualization

Module S2-16: Mini-Project

Module S2-17: Rest Week

Stage 2: Cluster 4: Julia Programming Fundamentals

Module S2-18: Cluster Preparation

Module S2-19: Julia Basics

Module S2-20: Julia DataFrames

Module S2- 21: Julia Performance

Module S2-22: Mini-Project

Module S2-23: Rest Week

Stage 2: Cluster 5: Databases and SQL Fundamentals

Module S2-24: Cluster Preparation

Module S2-25: SQL Basics

Module S2-26: SQL Joins

Module S2-27: Advanced SQL

Module S2-28: Mini-Project

Module S2-29: Rest Week

Stage 2: Cluster 6: Descriptive Statistics

Module S2-30: Cluster Preparation

Module S2-31: Central Tendency and Dispersion

Module S2-32: Data Distributions

Module S2-33: Mini-Project

Module S2-34: Rest Week

Stage 2: Cluster 7: Inferential Statistics

Module S2-35: Cluster Preparation

Module S2-36: Hypothesis Testing

Module S2-37: ANOVA and Chi-Square Tests

Module S2-38: Mini-Project

Module S2-39: Rest Week

Stage 2: Cluster 8: Regression Analysis

Module S2-40: Cluster Preparation

Module S2-41: Linear Regression

Module S2-42: Multiple Regression

Module S2-43: Mini-Project

Module S2-44: Rest Week

Stage 2: Cluster 9: Probability Fundamentals

Module S2-45: Cluster Preparation

Module S2-46: Probability Distributions

Module S2-47: Conditional Probability

Module S2-48: Bayesian Probability

Module S2-49: Mini-Project

Module S2-50: Rest Week

Stage 2: Cluster 10: Linear Algebra Fundamentals

Module S2-51: Cluster Preparation

Module S2-52: Vectors and Matrices

Module S2-53: Linear Transformations

Module S2-54: Eigenvalues and Eigenvectors

Module S2-55: Mini-Project

Module S2-56: Rest Week

Stage 2: Cluster 11: Calculus Fundamentals

Module S2-57: Cluster Preparation

Module S2-58: Limits and Derivatives

Module S2-59: Integrals

Module S2-60: Applications of Calculus

Module S2-61: Mini-Project

Module S2-62: Rest Week

Stage 2: Cluster 12: Data Visualization

Module S2-63: Cluster Preparation

Module S2-64: Basic Visualizations

Module S2-65: Advanced Visualizations

Module S2-66: Mini-Project

Module S2-67: Rest Week

Stage 2: Cluster 13: Machine Learning Fundamentals

Module S2-68: Cluster Preparation

Module S2-69: Supervised Learning

Module S2-70: Unsupervised Learning

Module S2-71: Mini-Project

Module S2-72: Rest Week

Stage 2: Capstone Project

Module S2-73: Capstone Project Development

Module S2-74: Capstone Presentation and Reflection

STAGE 3 (3 MODULES): REVIEW OF APPLICABLE MATHEMATICS, PROBABILITY, AND STATISTICS FOR ARTIFICIAL INTELLIGENCE

Module S3-1: Foundations of Arithmetic and Algebra

Module S3-2: Precalculus and Introductory Statistics

Module S3-3: Calculus, Linear Algebra, and Optimization

STAGE 4 (36 MODULES): ARTIFICIAL INTELLIGENCE

Stage 4: Cluster 1: Advanced Machine Learning

Module S4-1: Cluster Preparation

Module S4-2: Ensemble Methods

Module S4-3: Hyperparameter Tuning

Module S4-4: Model Interpretability

Module S4-5: Mini-Project

Module S4-6: Rest Week

Stage 4: Cluster 2: Deep Learning Fundamentals

Module S4-7: Cluster Preparation

Module S4-8: Neural Network Basics

Module S4-9: Convolutional Neural Networks

Module S4-10: Regularization and Optimization

Module S4-11: Mini-Project

Module S4-12: Rest Week

Stage 4: Cluster 3: Natural Language Processing

Module S4-13: Cluster Preparation

Module S4-14: Text Preprocessing

Module S4-15: Word Embeddings

Module S4-16: Sequence Models

Module S4-17: Mini-Project

Module S4-18: Rest Week

Stage 4: Cluster 4: Time Series Analysis

Module S4-19: Cluster Preparation

Module S4-20: Time Series Basics

Module S4-21: Advanced Time Series Models

Module S4-22: Time Series Decomposition

Module S4-23: Mini-Project

Module S4-24: Rest Week

Stage 4: Cluster 5: Data Engineering and Deployment

Module S4-25: Cluster Preparation

Module S4-26: Data Pipelines

Module S4-27: Model Deployment

Module S4-28: Mini-Project

Module S4-29: Rest Week

Stage 4: Cluster 6: Stakeholder Communication and Ethics

Module S4-30: Cluster Preparation

Module S4-31: Data Storytelling

Module S4-32: Ethical AI for Stakeholders

Module S4-33: Mini-Project

Module S4-34: Rest Week

Stage 4: Capstone Project

Module S4-35: Capstone Project Development

Module S4-36: Capstone Presentation

Never Doubt A Marine On A Mission