...ble_players = filter(players, to_year >= first_year_of_data) names_table = table(available_players$name) dupe_names = names(names_table[which(names_table > 1)]) available_players$name[available_players$name %in% dupe_names] = paste( available_players$name[availab...
ballr/players_data.R at master · toddwschneider/ballr
... historical analyses on-demand, so I had to write additional R scripts, but a potential future improvement might be to create a backend that caches the shot data and exposes additional endpoints that aggregate data across seasons, teams, or maybe even the whole league.
BallR: Interactive NBA Shot Charts with R and Shiny - Todd W. Schneider
Hexagonal charts, popularized by Kirk Goldsberry at Grantland, group shots into hexagonal regions, then calculate aggregate statistics within each hexagon. Hexagon sizes and opacities are proportional to the number of shots taken within each hexagon, while the color scale represents a metric of your choice, which can be one of:
BallR: Interactive NBA Shot Charts with R and Shiny - Todd W. Schneider
The NBA’s Stats API provides data for every single shot attempted during an NBA game since 1996, including location coordinates on the court. I built a tool called BallR, using R’s Shiny framework, to explore NBA shot data at the player-level.
BallR: Interactive NBA Shot Charts with R and Shiny - Todd W. Schneider
The down round is a “lemon” signal to the market that the company’s business plan is not working out. And one of the thorniest issues in dealing with down rounds is how a former unicorn keeps its employees after destroying the value of their shares...
Expect Some Unicorns to Lose Their Horns, and It Won’t Be Pretty - The New York Times
I make no claim that it’s a perfect model—it uses imperfect data, has some smelly features and omissions, and all of the usual correlation/causation caveats apply—but it seems to do at least an okay job quantifying the impact of temperature, rain, and snow on Citi Bike ridership.
A Tale of Twenty-Two Million Citi Bikes: Analyzing the NYC Bike Share System - Todd W. Schneider
The model’s root-mean-square error is 4,138, and residuals appear to be at least roughly normally distributed. Residuals appear to exhibit some heteroscedasticity, though, as the residuals have lower variance on dates with fewer trips.
A Tale of Twenty-Two Million Citi Bikes: Analyzing the NYC Bike Share System - Todd W. Schneider
62,714