diff --git a/regression/moneyball-motivation.Rmd b/regression/moneyball-motivation.Rmd
index c7d12c87..8a26a7d9 100644
--- a/regression/moneyball-motivation.Rmd
+++ b/regression/moneyball-motivation.Rmd
@@ -37,7 +37,7 @@ h |> html_nodes("table") |>
-### Sabermetics
+### Sabermetrics
 Statistics have been used in baseball since its beginnings. The dataset we will be using, included in the __Lahman__ library, goes back to the 19th century. For example, a summary statistics we will describe soon, the _batting average_, has been used for decades to summarize a batter's success. Other statistics^[http://mlb.mlb.com/stats/league_leaders.jsp] such as home runs (HR), runs batted in (RBI), and stolen bases (SB) are reported for each player in the game summaries included in the sports section of newspapers, with players rewarded for high numbers. Although summary statistics such as these were widely used in baseball, data analysis per se was not. These statistics were arbitrarily decided on without much thought as to whether they actually predicted anything or were related to helping a team win.