View on GitHub

Predict-Sales-by-Advertising-Ads

Objective : To predict sales for given budget spend on TV, Radio and Newspaper in dollars by Multiple Linear Regression model with statistical Analysis done from coefficients, p value, R² and Adj. R² value and F-statistic

Predict-Sales-by-Advertising-Ads


Project Overview


About Project

Machine Learning Regression model is developed to predict sales based on budgesting spends on various platforms for marketing adn advertising.The features are TV, Radio and Newspaper marketing spend in thousands of dollars.


Code and Resources used


Web Scraping

Dataset URL: https://www.kaggle.com/ashydv/advertising-dataset/notebooks

Data fields

Features:

Target:


Data Cleaning

There is no missing values in data.


EDA

I looked at the distributions of the data and the value counts for the various numerical features. Below are a few highlights :


Model Building

Multiple Linear Regression

Simple linear regression can easily be extended to include multiple features. This is called multiple linear regression:

y=β0+β1x1+…+βnxn

Each x represents a different feature, and each feature has its own coefficient. In this case:

y=β0+β1×TV+β2×Radio+β3×Newspaper

Advantages:

Model Prediction


Conclusion

To summarise, we have performed a multiple linear regression and have covered some basic introductory statistics as well. This is by no means a comprehensive analysis of the marketing data set but simply an example of how to perform and interpret a mulitple linear regression. It’s a good starting point, especially when attempting to understand the relevance of important statistical concepts like t-statistic, p-value and standard error.