Download and unzip the NYC taxi dataset from Cyrille Rossant on GitHub: https://github.com/ipython-books/minibook-2nd-data (Links to an external site.)
Open the notebook file attached below. You will be adding your code (make sure you add headers and comments) to the existing code, and make sure your code is well organized.
Please upload the data and display data columns, number of rows, variable types, and numeric statistics + categorical variable frequencies.
Display a scatter plot of pick up locations. For which vendor is it easiest to find a cab?
Display a histogram of trip distances. What is the most common trip distance?
Display a histogram of the fare total amounts. What can you say about the data?
How many unusually long trips (of greater than 100 miles) do you see?
NY Taxi Notebook Assignment
July 4th, 2020