Hosted on: https://used-vehicles.onrender.com
Please refer to this GitHub repository: https://github.com/peachykeenani/sprint_6_app
This web application explores the dataset vehicles_us.csv. This project contains the following visualizations:
- Data viewer
- Vehicle condition by manufacturer
- Sum of sales by vehicle condition
- Vehicle price and condition comparison
- Price distribution of fuel type by transmission
- Price distribution of 4WD vs. non-4WD by transmission
- Price and days listed distribution
Data Preprocessing: Filled in missing values for the following columns:
- model_year
  - grouped by 'model'
  - missing values replaced with median of year (2011)
- cylinders
  - grouped by 'model'
  - missing values replaced with median of cylinders (6)
- odometer
  - grouped by 'model_year' and 'model'
  - missing values replaced with mean of odometer as an integer (115,553)
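The group-wise filling described above can be sketched with pandas. This is a minimal illustration, not the project's actual code: the toy DataFrame and its values are made up, and the use of `groupby(...).transform(...)` with `fillna` is an assumption about how such a step might be written.

```python
import pandas as pd

# Toy stand-in for vehicles_us.csv; all values are invented for illustration.
df = pd.DataFrame({
    "model": ["f-150", "f-150", "f-150", "civic", "civic", "civic"],
    "model_year": [2010.0, 2010.0, None, 2012.0, 2012.0, None],
    "cylinders": [8.0, None, 8.0, 4.0, None, 4.0],
    "odometer": [120000.0, None, 100000.0, 90000.0, None, 95001.0],
})

# model_year and cylinders: fill gaps with the median of the same model.
for col in ["model_year", "cylinders"]:
    per_model_median = df.groupby("model")[col].transform("median")
    df[col] = df[col].fillna(per_model_median)

# odometer: fill gaps with the mean of the same (model_year, model) group.
group_mean = df.groupby(["model_year", "model"])["odometer"].transform("mean")
df["odometer"] = df["odometer"].fillna(group_mean)

# Cast to integer (truncates the fractional part). This only works once no
# missing values remain; a group with no known odometer would stay NaN.
df["odometer"] = df["odometer"].astype(int)

print(df)
```

Filling `model_year` first matters in this sketch, because the odometer grouping uses `model_year` as a key and rows with a missing key would otherwise be left out of every group.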
Note Regarding scatter plots:
- Trendline is set to expanding mean trend.
- Trendline scope is set to overall.
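To make the expanding mean trend concrete, here is a small pandas sketch of the underlying calculation: each trendline point is the mean of all y-values seen so far, moving left to right along the x-axis. The toy data, and the choice of `days_listed` as x and `price` as y, are assumptions for illustration. If the plots are built with Plotly Express (also an assumption), the settings above would correspond to `trendline="expanding"` and `trendline_scope="overall"` in `px.scatter`.

```python
import pandas as pd

# Toy data, invented for illustration only.
df = pd.DataFrame({
    "days_listed": [30, 10, 20, 40],
    "price": [8000, 12000, 10000, 6000],
})

# Order points along the x-axis, then take the running (expanding) mean of y.
ordered = df.sort_values("days_listed").reset_index(drop=True)
ordered["expanding_mean_price"] = ordered["price"].expanding().mean()

print(ordered)
```

An "overall" scope means a single trendline is computed across all points rather than one per color group.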
- The top two manufacturers with the most sales were Ford and Chevrolet (histograms: Vehicle types by manufacturer and Vehicle condition by manufacturer).
- The conditions with the most sales came from excellent and good vehicles (Total sales by vehicle condition histogram).
- 2013 models sold the most (Histogram of condition vs model_year).
- There is an inverse correlation between the vehicle price and days listed.
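One way to check an inverse relationship like the one between price and days listed is a Pearson correlation coefficient, where a negative value indicates that one variable tends to fall as the other rises. The sketch below uses invented toy data and assumed column names (`price`, `days_listed`); it does not reproduce the project's results.

```python
import pandas as pd

# Toy data, invented for illustration only.
df = pd.DataFrame({
    "price": [5000, 9000, 14000, 21000, 30000],
    "days_listed": [70, 55, 40, 35, 15],
})

# Pearson correlation; a value below zero indicates an inverse relationship.
r = df["price"].corr(df["days_listed"])
print(f"Pearson r = {r:.2f}")
```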