Regression Analysis of Television Channels
Essay Preview: Regression Analysis of Television Channels
Report this essay
Table of ContentsIntroduction        Processing the dataset        ▪        The Dataset        ▪        Descriptive Statistics        ▪        Null Hypothesis Testing        ▪        Correlation between the Variables        ▪        Regression Model of the Dataset        ▪        Residuals Testing        ▪        Predicting with regression model        Conclusion – Key findings        Table of Figures:Figure 1: Sale Price vs. Market Retail Sales        Figure 2: Sale Price Vs. Market Buying Income        Figure 3: Sale Price Vs. TV Homes per Station        Figure 4: Sale Price vs. Network Hourly Rate        Figure 5: Sale Price Vs. National Spot Rate        Figure 6: Sale Price Vs. Age of Station        Figure 7: Sale Price Vs. Number of ties with major Networks        Figure 8: Sale Price Vs. Percent of Market Population in Urban Areas        Tables:Table 1: Variables of the dataset        Table 2: Descriptive statistics of the dataset        Table 3: STATA result for the two-sample t-test statistic for the mean of selling price depending on the age of the station        Table 4: Correlation coefficient table between each variable of the dataset        Table 5: Regression Results in STATA for regression of Sale Price with National Spot Rate and Market Buying Income        Table 6: Regression Result in STATA of sale price with national spot rate        Table 7: Regression results in STATA for Sale Price against National Spot Rate Market Buying Income, with no constant        Table 8: Results in STATA for Breusch Pagan test        Table 93: Results in STATA for White’s Test for homoscedasticity        Table 10: Results in STATA for Breusch-Godfrey LM test for autocorrelation        Table 11: Results in STATA of VIF (variance inflation factor) multicollinearity test        Table 12: Results in STATA for Ramsey RESET Test        Table 13: Results in STATA for Skewness/Kurtosis Tests for Normality        IntroductionIn the present project, we examine a data set with variables referring to the operation of 31 regional TV stations in the USA. Initially, we produce the descriptive statistics for these variables. Afterwards, a linear regression model is built based on the principle of independent and dependent variable, where the dependent variable is the selling price of each station.

The selection of the independent variables is based at first on the backward selection method, which means that the regression model starts by including all independent variables in its equation. The significance of each variable is evaluated based on its p-value as well as the effect on the value of R-squared, in order to reduce the number of independent variables included in the model. As a result, the final regression model has the least amount of significant variables as possible.However, in order to verify the model, the forward selection method is also performed. In this case, the regression modelling process begins by using each variable independently and then step-by-step slowly building the model, depending on the significance of the coefficient of each variable on the selling price of the TV stations. By following this selection process, we can be sure that the regression model includes every significant variable. The result however is the same with the backward selection process. Furthermore, the residual diagnostic checks are performed, in order to determine if all conditions required for the assumptions of the linear regression model are satisfied.Once the regression model is checked and verified, we then go on to predict the selling price of a hypothesized TV station, with its own values of independent variables. All of the above steps, (including regression and residual tests) are performed using STATA 14.1. Every command used during the process is shown, with its accompanying result, with the purpose to show the methodology we followed.Processing the datasetThe DatasetThe dataset selected for the present project[1] was obtained from the internet and refers to regional television stations, including their selling prices, in the United States of America.As mentioned before, it contains details for 31 TV stations, accompanied with 10 columns. The first column is the index name of each TV station, the second is the selling price which will be used as the dependent variable of the regression model, and the 8 that follow are the model’s independent variables.The table below shows in greater detail the data contained in the dataset, including the variable names, type and measurement units.DescriptionNameTypeMeasurement UnitStation Call lettersstationcallString-Sale PricesalepriceInteger$ (in thousands)Market Retail SalesmarketretailsalesInteger$ (in millions)Market Buying IncomemarketbuyingincomeInteger$ (in millions)TV Homes/stationtvhomesstationInteger$ (in thousands)Network Hourly RatenetworkhourlyrateInteger$ per HourNational Spot RatenationalspotrateInteger$ per HourAge of StationageofstationByte0 = Before 1952,1 = After 1952Number of ties with major networksnumberoftieswithmajornetworksByte0, 1, 2Percent of Market Population in Urban areaspercentofmarketpopulationinurbanFloatPercentage

Get Your Essay

Cite this page

Table Of Contentsintroduction And Sale Price. (June 17, 2021). Retrieved from https://www.freeessays.education/table-of-contentsintroduction-and-sale-price-essay/