StudentShare solutions
Triangle menu

Logistic regression classifier for the churn Data - Coursework Example

Not dowloaded yet

Extract of sample
Logistic regression classifier for the churn Data

The programming code is as follows: LOGISTIC REGRESSION VARIABLES good_bad   /METHOD=ENTER checking duration history purpose amount savings employed installp marital coapp resident property age other housing     existcr job depends telephon foreign   /CONTRAST (purpose)=Indicator   /CLASSPLOT   /PRINT=CORR   /CRITERIA=PIN(0.05) POUT(0.10) ITERATE(20) CUT(0.5). Then the analysis is presented below: Case Processing Summary Unweighted Cases N Percent Selected Cases Included in Analysis 964 96.4 Missing Cases 36 3.6 Total 1000 100.0 Unselected Cases 0 .0 Total 1000 100.0 a. If weight is in effect, see classification table for the total number of cases. Dependent Variable Encoding Original Value Internal Value Bad 0 Good 1 Categorical Variables Codings Frequency Parameter coding (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) purpose 3 1.000 .000 .000 .000 .000 .000 .000 .000 .000 .000 0 225 .000 1.000 .000 .000 .000 .000 .000 .000 .000 .000 1 100 .000 .000 1.000 .000 .000 .000 .000 .000 .000 .000 2 174 .000 .000 .000 1.000 .000 .000 .000 .000 .000 .000 3 268 .000 .000 .000 .000 1.000 .000 .000 .000 .000 .000 4 12 .000 .000 .000 .000 .000 1.000 .000 .000 .000 .000 5 22 .000 .000 .000 .000 .000 .000 1.000 .000 .000 .000 6 47 .000 .000 .000 .000 .000 .000 .000 1.000 .000 .000 8 9 .000 .000 .000 .000 .000 .000 .000 .000 1.000 .000 9 94 .000 .000 .000 .000 .000 .000 .000 .000 .000 1.000 X 10 .000 .000 .000 .000 .000 .000 .000 .000 .000 .000 Beginning block Classification Table Observed Predicted good_bad Percentage Correct bad good Step 0 good_bad bad 0 292 .0 good 0 672 100.0 Overall Percentage 69.7 Variables in the Equation B S.E. Wald df Sig. Exp(B) Step 0 Constant .834 .070 141.414 1 .000 2.301 Variables not in the Equation Score df Sig. Step 0 Variables checking 119.858 1 .000 duration 40.086 1 .000 History 48.045 1 .000 purpose 39.421 10 .000 purpose(1) 6.926 1 .008 purpose(2) 9.752 1 .002 purpose(3) 9.334 1 .002 purpose(4) .361 1 .548 purpose(5) 12.039 1 .001 purpose(6) .053 1 .817 purpose(7) .393 1 .531 purpose(8) 4.846 1 .028 purpose(9) 1.583 1 .208 purpose(10) .694 1 .405 amount 18.355 1 .000 savings 30.125 1 .000 employed 14.071 1 .000 installp 5.548 1 .019 marital 8.537 1 .003 coapp .419 1 .518 resident .000 1 .996 property 20.211 1 .000 age 7.933 1 .005 other 10.626 1 .001 housing .146 1 .703 existcr 2.184 1 .139 job .426 1 .514 depends .067 1 .797 telephon 2.137 1 .144 foreign 8.114 1 .004 a. Residual Chi-Squares are not computed because of redundancies. Block 1: Method = Enter Omnibus Tests of Model Coefficients Chi-square df Sig. Step 1 Step 299.197 29 .000 Block 299.197 29 .000 Model 299.197 29 .000 Model Summary Step -2 Log likelihood Cox & Snell R Square Nagelkerke R Square 1 883.255a .267 .378 a. Estimation terminated at iteration number 20 because maximum iterations has been reached. Final solution cannot be found. The sensitivity and specificity analysis can be done as follows: Classification Table Observed Predicted good_bad Total Good Bad good_bad Good 596 (TP) 76 (FP) 672 Bad 140 (FN) 152 (TN) 292 Total 736 (Sensitivity) 228 (Specificity) 964 TP: True Positive; TN: True Negative; FP: False Positive; FN: False Negative Sensitivity=TP/(TP+FN)=596/(596+140)=0.812 or 81,7% ...Show more

Summary

Question 1: For the churn data the package used for analysis is SPSS because it is more versatile and conversant. The churn variable is considered as output, good/bad and the other variables are treated as independent. The independent variables are checking, duration, history, purpose, amount, savings, employed, install, marital, coapp, resident, property, age, other, housing, existcr, job, depends, telephon and foreign…
Author : michaelmuller
Logistic regression classifier for the churn Data essay example
Read Text Preview
Save Your Time for More Important Things
Let us write or edit the coursework on your topic
"Logistic regression classifier for the churn Data"
with a personal 20% discount.
Grab the best paper

Related Essays

Procurement logistic and supply chain management
Successful supply chain management requires cross functional integration and marketing plays a critical role (Lambert and Cooper, 2000). Procurement is extension of supply chain management allows the smooth functioning of departments engaged in the management process.
8 pages (2000 words) Coursework
Data Analysis
Data for unemployed persons was obtained from the Annual Population Survey (APS). The APS is conducted annually during which information on the number and percentage of people who are employed, economically active, unemployment rate and economically inactive is collected.
7 pages (1750 words) Coursework
Statistics 401 Mod 4 Case - Regression Analysis
Each of the corresponding values of X and Y designate a specific point on the graph and therefore when the points are all plotted they form scattered dots on the surface of the graph. This so formed pattern of plotted points scattered on the graph are attributed to the name of the graph, scatter plots.
4 pages (1000 words) Coursework
Statistics 401 Mod 4 SLP - Regression Analysis
In an effort to establish this, I formed a simple regression in excel to try and put the presumption verses factual nature of the issue to rest. I related the variable values of SAL and the variable values of the DJIA. In say a typical line graph, the values of these variables in my opinion would form a pattern when plotted on the graph that can be pinned down to a specific mathematical formula.
3 pages (750 words) Coursework
Statistics 401 Mod 5 Case - Multiple Regression Analysis
38.1 16832.4 9.216641 7.03E-06 117060.6 193215.7 117060.6 193215.7 Interest Rate -1203318 141354.8 -8.51275 1.34E-05 -1523085 -883552 -1523085 -883552 Price Per Board Foot -17836.8 9105.462 -1.95891 0.081788 -38434.7 2761.226 -38434.7 2761.226 Multiple regression analysis is a data analysis procedure which is used to establish the relationship between a given variable and a set of other two variables.
4 pages (1000 words) Coursework
Multiple Regression
Below is the data that I have been collecting to date. SEX AGE SAL (K) EDU 1 39 23 14 2 29 33 16 2 18 32 16 1 21 54 12 1 50 48 18 2 49 37 16 1 62 70 15 2 23 23 12 1 20 36 13 1 30 35 14 2 32 21 11 1 48 55 16 There is a difference between simple regression and multiple regression.
2 pages (500 words) Coursework
Data base
B. Who manufactures the DBMS Oracle and what is Oracle’s current version number? Oracle Corporation manufactures oracle DBMS and its current version number is 12c Release 1: 12.1.0.1.2 (Oracle, Introduction to Oracle Database). D. What
1 pages (250 words) Coursework
Multiple regression exercise
To determine the relationship, we conducted a multiple regression test and the results are shown in the preceding sections below; The above table gives the summary statistics from the regression output. The value of R2 is given as 0.448 which means that
2 pages (500 words) Coursework
Solve a regression problem using SPSS
Figure 1 below shows R Squared = 0.965 (96.6%). However, ‘Adjusted R Square” is a robust diagnostic tool for multiple regressions since it takes into consideration the sample size and the explanatory variables. Since our
4 pages (1000 words) Coursework
Data Collection
systematic approach of gathering and measuring information on different variables so as to establish answers to research questions evaluate outcomes and test the hypotheses. A myriad of businesses across the globe depend on the collection of data for a number of reasons. Most
8 pages (2000 words) Coursework
Get a custom paper written
by a pro under your requirements!
Win a special DISCOUNT!
Put in your e-mail and click the button with your lucky finger
Your email
YOUR PRIZE:
Apply my DISCOUNT
Comments (0)
Rate this paper:
Thank you! Your comment has been sent and will be posted after moderation