Introduction
- Data Science and analytics need data (not to mention Big-Data)
- What if you don't have data
- Creating Data and analyzing it (sometimes rolled into the same grand problem statement)
- Online vs Offline context of crating data
- Online gets covered in Reinforcement Learning
- In Offline we will discuss Design of Experiments (DOE) and Active Learning
- Critical difference between observational data and offline experimental data in DOE
- The operation of system can be conceptualized as a combination of some inputs, which when used together, result in outputs
- Formal experimentation involves systematic, purposeful changes to input variables in an attempt to gain knowledge about the system and/or find the ideal setting that result in the best output.
- The problems with adaptive One-Factor-At-a-Time (aOFAT)
- The discrete case
- Alternative is Orthogonal arrays. An illustration through the Full factorial.
Analysing Designed Experiments
- Classical Analysis
- The Take-The-Best Heuristic
- Where would we use Classical?
- Where would we use TTB?
- The statistical way
Sequential Experimentation and Active Learning
- Sequential Experimentation
- Active Learning as semi-supervised learning or optimal experimental design
- Strategies in Active Learning:
- Query by committee
- Expected model change
- Expected error reduction and variance reduction
No comments:
Post a Comment