Question? Call Us  +1 (817) 546-4770

INF30030 Business Analytics

Academic Anxiety?

Get an original paper within hours and nail the task

156 experts online

Free Samples

INF30030 Business Analytics

.cms-body-content table{width:100%!important;} #subhidecontent{ position: relative;
overflow-x: auto;
width: 100%;}

INF30030 Business Analytics

0 Download8 Pages / 1,863 Words

Course Code: INF30030
University: Swinburne University Of Technology

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: Australia

Question:

Response does not Objectives or submitted response address or partially doesn’t addressed the addresses assignment max assessment task requirements in terms of identifying the business objectives of the project.
Response is brief and lacks development Contains little evidence that the student has read, viewed, and consideredthe requirements of the assignment.
Complete and correct predictions. Writing is crisp, clear, and succinct. Choice of language enhances effectiveness of the document. Uses a preponderance of original language. Ideas are supported with evidence. No spelling, grammar, or punctuation errors are made. Demonstrates evidence of editing.
Model-Building Strategies and Methods for Logistic Regression in Applied Logistic Regression.
Data Mining and Business Analytics with R, Data Mining and Business Analytics with R.

Question:

Introduction
The Titanic tragedy was one of the most devastating and deadliest events that ever happened in modern history. Prediction models have been developed to estimate the probability of survival among the passengers in the liner, in consideration to factors such as class, gender and age among others. Lots of machine learning activities and predictive methods have been tried to develop a model with the highest predictive power of survivability in the incident.

Defining Business Objectives

This paper is focused on developing a predictive model to predict the probability that an individual would have survived the accident given different factors, which affected the victims differently. The passenger liner was divided into 3 classes –first class being in the topmost, second class in the middle and third class being at the bottom. This already shows that people in the third class were more likely to die compared to the other classes. However, it is important to prove this hypothesis, hence supporting our ideas and theories.
It has been documented that most people die because there were no enough lifesaving jackets, which rendered most of the people who could have survived death. Due to theories of nature, scarcity of the lifesaver jackets exposed men more compared to the other groups – women and children. In addition, this effect would have been affected by levels of class. It would be hypothesized that men in the first class were more romantic compared to those in second and third classes. Therefore, the trends of survivability would vary between class for men and women.  In an ideal situation, men and women in the third class would have struggled in the same manner to save their lives.
It is possible to predict their survivability based on the dynamic structure of the catastrophe. As much as the survivability levels would have been due to chance, these dynamics can explain to some level of confidence. Exploratory data analysis will be conducted to identify the predictive variables for survivability. Therefore, a model will be developed to explain the probability of survival using the provided variables explained in the metadata below.
Preparing Data
Survival, ticket class and port of embarkation were recorded as categorical variables using the factor() function for ease of analysis. Using the number of siblings and the number of parents, family size was calculated. Also, a large family was defined as which has more than three individuals. Extraction of individuals’ titles was done to generate other categorical variables which would possibly contribute in the model development. For instance, men were differentiated from male kids by extracting ‘Mr.’ titles. Subsets of the data were created to effectively analyse the data for insights into the model development stage.
Table 1: Data dictionary

Variable

Definition

Key

survival

Survival

0 = No, 1 = Yes

pclass

Ticket class

1 = 1st (Upper)
2 = 2nd (Middle)
3 = 3rd (Lower)

sex

Sex

0 = females, 1= males

Age

Age in years

 

sibsp

Number of siblings/spouses aboard the Titanic

 

parch

Number  of parents/children aboard the Titanic

 

fare

Passenger fare

 

embarked

Port of Embarkation

C = Cherbourg, Q = Queenstown, S = Southampton

Exploratory Data Analysis
According to our data set, 62.3% died and 37.7% survived. Among the males, 87.1% died while 17.4% died among the females. On average, those who survived had paid double as much fare as the survivors.
According to the figure below, a higher proportion of males died as compared to the females. More males in the middle and lower classes died as compared to those in the upper class. Amongst the females, the survival rate among those in the lower class was smaller compared to those in the upper and middle-class category (Jordan and Kleinberg, 2006).
Figure 1: Distribution of survival by gender and ticket class
As shown in the figure below, few passengers who had “miss” and “Mrs” titles died in upper class compared to middle and lower class categories.
Figure 2: Survival distribution by title and ticket class
On average, the survivors had larger families. Some extreme values are observed, indicating that few individuals had more than family members on board.
Figure 3: Survival by family size
More male died in all the classes than females and the proportions of females who died in the three class reduce significantly from third class to first class.
Data Sampling
Using the caret’s package function, createDataPartition (), the train and test datasets were created a 70 to 30 ratio respectively.  
set.seed(999)train.samples <-  createDataPartition(y = TitanicData$Survived, p = .70,list = FALSE)train <- TitanicData[train.samples, ]test <- TitanicData[-train.samples, ] The Logistic Model According to the data exploration performed in this paper, the best model includes ticket class, sex, age, passengers with "Mr." initials and family size. The model output is shown in the table below. ## glm(formula = Survived ~ Pclass + Sex + Age + Mr + Family.size, ##     family = "binomial", data = traing ## Deviance Residuals: ##     Min       1Q   Median       3Q      Max   ## -2.4310  -0.5103  -0.3149   0.5270   2.6117   ## Coefficients: ##  Estimate Std. Error z value Pr(>|z|)    
## (Intercept)   4.615628   0.527027   8.758  < 2e-16 *** ## PclassMiddle -1.559324   0.317087  -4.918 8.76e-07 *** ## PclassLower  -2.433104   0.317273  -7.669 1.74e-14 *** ## Sexmale      -2.337843   0.388411  -6.019 1.76e-09 *** ## Age         -0.033125   0.008768  -3.778 0.000158 *** ## Mr           -1.509350   0.403529  -3.740 0.000184 *** ## Family.size  -0.221536   0.081176  -2.729 0.006351 ** ## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## (Dispersion parameter for binomial family taken to be 1) ##     Null deviance: 987.05  on 731  degrees of freedom ## Residual deviance: 550.91  on 725  degrees of freedom ##   (185 observations deleted due to missingness) ## AIC: 564.91 ## (Intercept)  101.05126328 37.12359554 293.8110424 ## PclassMiddle   0.21027808  0.11159168   0.3876449 ## PclassLower    0.08776403  0.04631873   0.1610626 ## Sexmale        0.09653569  0.04421031   0.2039411 ## Age            0.96741808  0.95063377   0.9839373 ## Mr             0.22105368  0.10026833   0.4913842 ## Family.size    0.80128703  0.68099737   0.9369452 All the variables included in the model are statistically significant with 95% confidence level (Elliott and Woodward, 2007; McCluskey and Lalkhen, 2007; Ledolter, 2013). The table above includes exponents of the coefficient in the model, which indicate that all the predictor variables were associated with lower odds of survival.  Individuals in the middle class were less likely to survive by 21.03% compared to those in the first class keeping the other factors constant. Similarly, those in the lower class were less likely to survive by 91% compared to those in the upper-class category. Male individuals in the passenger liner were less likely to survive by approximately 90%, by controlling for the other variables in the model. Increasing age by 1 year reduces the odds of surviving by around approximately 3%. Males with "Mr." initials in their names were less likely to survive by approximately 78% after controlling for the other variables in the model. Finally, increasing family size by one member led to approximately 20% reduced chance of survival (Hosmer, Lemeshow and Sturdivant, 2013; Ledolter, 2013). Model Validation Figure 4: Model ROC plot According to the ROC plot shown above, the best threshold to be used in the prediction will be 0.54. The area under the curve is approximately 90.29%, showing that the model is very good. Model Prediction table(test$Pred_Survived) ##     Pred_Died Pred_Survived ##           209           105 prop.table(table(test$Pred_Survived)) ##     Pred_Died Pred_Survived ##     0.6656051     0.3343949 table(test$Pred_Survived, test$Sex)             ##   female male##   Pred_Died          7  202##   Pred_Survived    101    4 prop.table(table(test$Pred_Survived, test$Sex),2)       ##                     female       male##   Pred_Died     0.06481481 0.98058252##   Pred_Survived 0.93518519 0.01941748 table(test$Pred_Survived, test$Pclass)               ##                 Upper Middle Lower##   Pred_Died        37     52   120##   Pred_Survived    41     31    33 table(test$Pred_Survived, test$Survived)            ##                 Died Survived##   Pred_Died      181       28##   Pred_Survived   11       94 prop.table(table(test$Pred_Survived, test$Survived), 2)            ##                       Died   Survived##   Pred_Died     0.94270833 0.22950820##   Pred_Survived 0.05729167 0.77049180 33.63% (112) were predicted to have survived in the test dataset and 66.37% (221) to have died. Of those who survived, 16% (18) were men and 84% (94) were women. 15% of the survivors were from the third class, 26.6% from second class and 56.25% from first class (Michael, 2001; Sainani, 2013). Conclusion In conclusion, gender, age, ticket class, family size and having a "Mr" initial effectively predicts the probability of survival using the Titanic data set. The model's overall accuracy is 90.29%, indicating that it can accurately classify survival and deaths 90% of the times per 100 persons. According to the ROC curve, we can conclude that the best threshold to predict survival is around 0.54. Using this threshold, the model has a sensitivity of 77.8% and specificity of 94.27%. References Elliott, A. C. and Woodward, W. a. (2007) 'Analysis of Categorical Data', Statistical Analysis Quick Reference Guidebook, pp. 113–150. doi: 10.1007/SpringerReference_60770. Hosmer, D., Lemeshow, S. and Sturdivant, R. X. (2013) 'Model-Building Strategies and Methods for Logistic Regression', in Applied Logistic Regression, pp. 89–151. doi: 10.1002/0471722146.ch4. Jordan, M. and Kleinberg, J. (2006) 'Information Science and Statistics', Pattern Recognition, 4(356), pp. 791–799. doi: 10.1641/B580519. Ledolter, J. (2013) Data Mining and Business Analytics with R, Data Mining and Business Analytics with R. doi: 10.1002/9781118596289. McCluskey, A. and Lalkhen, A. G. (2007) 'Statistics III: Probability and statistical tests', Continuing Education in Anaesthesia, Critical Care and Pain, 7(5), pp. 167–170. doi: 10.1093/bjaceaccp/mkm028. Michael, R. S. (2001) 'Crosstabulation and Chi-square', Indiana University Retrieved, pp. 1–8. Sainani, K. L. (2013) 'Understanding linear regression', PM and R, 5(12), pp. 1063–1068. doi: 10.1016/j.pmrj.2013.10.002. Free Membership to World's Largest Sample Bank To View this & another 50000+ free samples. Please put your valid email id. E-mail Yes, alert me for offers and important updates Submit  Download Sample Now Earn back the money you have spent on the downloaded sample by uploading a unique assignment/study material/research material you have. After we assess the authenticity of the uploaded content, you will get 100% money back in your wallet within 7 days. UploadUnique Document DocumentUnder Evaluation Get Moneyinto Your Wallet Total 8 pages PAY 5 USD TO DOWNLOAD *The content must not be available online or in our existing Database to qualify as unique. Cite This Work To export a reference to this article please select a referencing stye below: APA MLA Harvard OSCOLA Vancouver My Assignment Help. (2021). Business Analytics. Retrieved from https://myassignmenthelp.com/free-samples/inf30030-business-analytics/probability-and-statistical-tests.html. "Business Analytics." My Assignment Help, 2021, https://myassignmenthelp.com/free-samples/inf30030-business-analytics/probability-and-statistical-tests.html. My Assignment Help (2021) Business Analytics [Online]. Available from: https://myassignmenthelp.com/free-samples/inf30030-business-analytics/probability-and-statistical-tests.html[Accessed 18 December 2021]. My Assignment Help. 'Business Analytics' (My Assignment Help, 2021) accessed 18 December 2021.

My Assignment Help. Business Analytics [Internet]. My Assignment Help. 2021 [cited 18 December 2021]. Available from: https://myassignmenthelp.com/free-samples/inf30030-business-analytics/probability-and-statistical-tests.html.

×
.close{position: absolute;right: 5px;z-index: 999;opacity: 1;color: #ff8b00;}

×

Thank you for your interest
The respective sample has been mail to your register email id

×

CONGRATS!
$20 Credited
successfully in your wallet.
* $5 to be used on order value more than $50. Valid for
only 1
month.

Account created successfully!
We have sent login details on your registered email.

User:

Password:

Are you pursuing a PhD course program? It is common among the research scholars to struggle with their dissertation topics at the initial stage. In such a situation, getting assistance from the experts at MyAssignmenthelp.com is a wise decision. The service offers you all the necessary support on the dissertation structure and the dissertation methodology. Unfortunately, most students are perplexed about the exact format of the dissertation. However, academic experts at MyAssignmenthelp.com are ready to provide you with dissertation assistance with relevant content and format.

Latest Management Samples

div#loaddata .card img {max-width: 100%;
}

MPM755 Building Success In Commerce
Download :
0 | Pages :
9

Course Code: MPM755
University: Deakin University

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: Australia

Answers:
Introduction
The process of developing a successful business entity requires a multidimensional analysis of several factors that relate to the internal and external environment in commerce. The areas covered in this current unit are essential in transforming the business perspective regarding the key commerce factors such as ethics, technology, culture, entrepreneurship, leadership, culture, and globalization (Nzelibe, 1996; Barza, 2…
Read
More

SNM660 Evidence Based Practice
Download :
0 | Pages :
8

Course Code: SNM660
University: The University Of Sheffield

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: United Kingdom

Answers:
Critical reflection on the objective, design, methodology and outcome of the research undertaken Assessment-I
Smoking and tobacco addiction is one of the few among the most basic general restorative issues, particularly to developed nations such as the UK. It has been represented that among all risk segments smoking is the fourth driving purpose behind infections and other several ailments like asthma, breathing and problems in the l…
Read
More
Tags:
Australia Maidstone Management Business management with marketing University of New South Wales Masters in Business Administration 

BSBHRM513 Manage Workforce Planning
Download :
0 | Pages :
20

Course Code: BSBHRM513
University: Tafe NSW

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: Australia

Answer:
Task 1
1.0 Data on staff turnover and demographics
That includes the staffing information of JKL industries for the fiscal year of 2014-15, it can be said that the company is having problems related to employee turnover. For the role of Senior Manager in Sydney, the organization needs 4 managers; however, one manager is exiting. It will make one empty position which might hurt the decision making process. On the other hand, In Brisba…
Read
More

MKT2031 Issues In Small Business And Entrepreneurship
Download :
0 | Pages :
5

Course Code: MKT2031
University: University Of Northampton

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: United Kingdom

Answer:
Entrepreneurial ventures
Entrepreneurship is the capacity and willingness to develop, manage, and put in order operations of any business venture with an intention to make profits despite the risks that may be involved in such venture. Small and large businesses have a vital role to play in the overall performance of the economy. It is, therefore, necessary to consider the difference between entrepreneurial ventures, individual, and c…
Read
More
Tags:
Turkey Istanbul Management University of Employee Masters in Business Administration 

MN506 System Management
Download :
0 | Pages :
7

Course Code: MN506
University: Melbourne Institute Of Technology

MyAssignmentHelp.com is not sponsored or endorsed by this college or university

Country: Australia

Answer:
Introduction
An operating system (OS) is defined as a system software that is installed in the systems for the management of the hardware along with the other software resources. Every computer system and mobile device requires an operating system for functioning and execution of operations. There is a great use of mobile devices such as tablets and Smartphones that has increased. One of the widely used and implemented operating syste…
Read
More
Tags:
Australia Cheltenham Computer Science Litigation and Dispute Management University of New South Wales Information Technology 

Next

Need an essay written specifically to meet your requirements?

Choose skilled experts on your subject and get an original paper within your deadline

156 experts online

Your time is important. Let us write you an essay from scratch

Tips and Tricks from our Blog

PROJ6016 Employer Based Project

Free Samples PROJ6016 Employer Based Project .cms-body-content table{width:100%!important;} #subhidecontent{ position: relative; overflow-x: auto; width: 100%;} PROJ6016 Employer

Read More »

PPMP20009 Marking Rubric

Free Samples PPMP20009 Marking Rubric .cms-body-content table{width:100%!important;} #subhidecontent{ position: relative; overflow-x: auto; width: 100%;} PPMP20009 Marking Rubric

Read More »