Commit

added tour-level doubles 2000-20
JeffSackmann committed Mar 13, 2020
1 parent ca3ce9a commit 980a85a
Showing 23 changed files with 26,438 additions and 4 deletions.
10 changes: 7 additions & 3 deletions README.md
@@ -18,11 +18,15 @@ MatchStats are included where I have them. In general, that means 1991-present f

There are some tour-level matches with missing stats. Some are missing because ATP doesn't have them. Others I've deleted because they didn't pass some sanity check (loser won 60% of points, or match time was under 20 minutes, etc.). Also, Davis Cup matches are included in the tour-level files, but there are no stats for Davis Cup matches until the last few seasons.

---
# Doubles

Please read, understand, and abide by the license below. It seems like a reasonable thing to ask, given the hundreds of hours I've put into amassing and maintaining this dataset. Unfortunately, a few bad apples have violated the license, and when people do that, it makes me considerably less motivated to continue updating. That's one reason why I no longer keep these files completely up-to-date.
I've added tour-level doubles back to 2000. Filenames follow the convention `atp_matches_doubles_yyyy.csv`. I may eventually be able to add tour-level doubles from before 2000, as well as lower-level doubles for some years. Most of the columns are the same, though in a different order.
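
As a rough illustration of the filename convention, the sketch below builds the expected names for 2000 through 2020 and reads whichever files are present. It reads rows by column name rather than by position, which sidesteps the different column order. The function names, the `data_dir` parameter, and the UTF-8 encoding are assumptions for this example, not part of the dataset.

```python
import csv
from pathlib import Path


def doubles_filenames(first_year=2000, last_year=2020):
    """Build filenames following the atp_matches_doubles_yyyy.csv convention."""
    return [
        f"atp_matches_doubles_{year}.csv"
        for year in range(first_year, last_year + 1)
    ]


def load_doubles(data_dir=".", first_year=2000, last_year=2020):
    """Read every doubles file found in data_dir into a list of dicts.

    Rows are keyed by column name, so column order does not matter.
    Files that are not present are skipped silently.
    """
    rows = []
    for name in doubles_filenames(first_year, last_year):
        path = Path(data_dir) / name
        if not path.exists():
            continue
        # Encoding is an assumption; adjust if the files use another one.
        with path.open(newline="", encoding="utf-8") as handle:
            rows.extend(csv.DictReader(handle))
    return rows
```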

---
# Attention

Please read, understand, and abide by the license below. It seems like a reasonable thing to ask, given the hundreds of hours I've put into amassing and maintaining this dataset. Unfortunately, a few bad apples have violated the license, and when people do that, it makes me considerably less motivated to continue updating.

Also, if you're using this for academic/research purposes (great!), take a minute and cite it properly. It's not that hard, it helps others find a useful resource, and let's face it, you should be doing it anyway.

# License

1,430 changes: 1,430 additions & 0 deletions atp_matches_doubles_2000.csv

Large diffs are not rendered by default.

1,394 changes: 1,394 additions & 0 deletions atp_matches_doubles_2001.csv

1,333 changes: 1,333 additions & 0 deletions atp_matches_doubles_2002.csv

1,261 changes: 1,261 additions & 0 deletions atp_matches_doubles_2003.csv

1,299 changes: 1,299 additions & 0 deletions atp_matches_doubles_2004.csv

1,261 changes: 1,261 additions & 0 deletions atp_matches_doubles_2005.csv

1,270 changes: 1,270 additions & 0 deletions atp_matches_doubles_2006.csv

1,285 changes: 1,285 additions & 0 deletions atp_matches_doubles_2007.csv

1,281 changes: 1,281 additions & 0 deletions atp_matches_doubles_2008.csv

1,282 changes: 1,282 additions & 0 deletions atp_matches_doubles_2009.csv

1,296 changes: 1,296 additions & 0 deletions atp_matches_doubles_2010.csv

1,282 changes: 1,282 additions & 0 deletions atp_matches_doubles_2011.csv

1,301 changes: 1,301 additions & 0 deletions atp_matches_doubles_2012.csv

1,261 changes: 1,261 additions & 0 deletions atp_matches_doubles_2013.csv

1,276 changes: 1,276 additions & 0 deletions atp_matches_doubles_2014.csv

1,318 changes: 1,318 additions & 0 deletions atp_matches_doubles_2015.csv

1,355 changes: 1,355 additions & 0 deletions atp_matches_doubles_2016.csv

1,314 changes: 1,314 additions & 0 deletions atp_matches_doubles_2017.csv

1,286 changes: 1,286 additions & 0 deletions atp_matches_doubles_2018.csv

1,364 changes: 1,364 additions & 0 deletions atp_matches_doubles_2019.csv

271 changes: 271 additions & 0 deletions atp_matches_doubles_2020.csv

12 changes: 11 additions & 1 deletion matches_data_dictionary.txt
@@ -86,4 +86,14 @@ winner_rank
winner_rank_points
- number of ranking points, where available
loser_rank
loser_rank_points

* _doubles_ files notes

The matches_doubles files have similar columns, though not all in the same order.

The identifying information for each player refers to 'winner1', 'winner2', 'loser1', and 'loser2'. The labels 1 and 2 are not assigned for any particular reason.
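
Because the 1 and 2 labels are arbitrary, the same pair can appear as (A, B) in one row and (B, A) in another. One way to compare teams across rows is to sort the two identifiers into a canonical key. This is only a sketch: the `_id` suffix on the column names (for example `winner1_id`) is an assumption about the file layout, and `team_key` and `match_teams` are hypothetical helper names.

```python
def team_key(player_a, player_b):
    """Return an order-independent key for a doubles pair."""
    return tuple(sorted((player_a, player_b)))


def match_teams(row, id_suffix="_id"):
    """Return (winning_team, losing_team) keys for one doubles row.

    Column names such as 'winner1_id' are assumed; pass a different
    id_suffix if the identifying columns are named otherwise.
    """
    winners = team_key(row["winner1" + id_suffix], row["winner2" + id_suffix])
    losers = team_key(row["loser1" + id_suffix], row["loser2" + id_suffix])
    return winners, losers
```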

In general, the tournament IDs for doubles results are the same as for singles results (so, for instance, you can see which players entered both draws at the same event), though this is not guaranteed for every single tournament, since some of the data came from different sources.
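
A minimal sketch of the "entered both draws" lookup follows. It assumes the tournament ID column is named `tourney_id`, that singles rows identify players with `winner_id` and `loser_id`, and that doubles rows use `winner1_id`, `winner2_id`, `loser1_id`, and `loser2_id`; all of those names are assumptions to check against the actual headers. Since matching IDs are not guaranteed for every tournament, an empty result may mean the IDs differ rather than that nobody played both events.

```python
# Assumed column names; verify against the real file headers.
SINGLES_COLUMNS = ("winner_id", "loser_id")
DOUBLES_COLUMNS = ("winner1_id", "winner2_id", "loser1_id", "loser2_id")


def players_in_event(rows, tourney_id, player_columns, id_column="tourney_id"):
    """Collect every player ID that appears in rows for one tournament."""
    players = set()
    for row in rows:
        if row.get(id_column) != tourney_id:
            continue
        for column in player_columns:
            value = row.get(column)
            if value:  # skip missing or empty IDs
                players.add(value)
    return players


def entered_both_draws(singles_rows, doubles_rows, tourney_id):
    """Return the player IDs found in both the singles and doubles draws."""
    singles = players_in_event(singles_rows, tourney_id, SINGLES_COLUMNS)
    doubles = players_in_event(doubles_rows, tourney_id, DOUBLES_COLUMNS)
    return singles & doubles
```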

The stats columns ('w_ace' etc) are per *team*, not per player. That's a function of how tennis stats are typically recorded, not a decision on my part.
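
Since the stats are per team, any aggregation should be keyed by the pair, not by an individual player. The sketch below totals `w_ace` for each winning team and skips rows where the stat is missing. The `_id` suffix on the player columns and the function name `team_ace_totals` are assumptions for illustration.

```python
from collections import defaultdict


def team_ace_totals(rows, id_suffix="_id"):
    """Sum 'w_ace' per winning team across doubles rows.

    The value is a team total; it cannot be split between the two
    partners. Rows with a missing or empty 'w_ace' are skipped.
    """
    totals = defaultdict(int)
    for row in rows:
        raw = row.get("w_ace")
        if raw is None or raw == "":
            continue
        # Sort the pair so (A, B) and (B, A) count as the same team.
        team = tuple(sorted((row["winner1" + id_suffix], row["winner2" + id_suffix])))
        totals[team] += int(float(raw))  # tolerate values like "5" or "5.0"
    return dict(totals)
```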
