Limited Time OfferFLAT 20% off & $20 bonus sign up. Order Now
New! Hire Essay Assignment Writer Online and Get Flat 20% Discount!!Order Now
The dataset EastWestAirlinesCluster.xls contains information on 3999 passengers who belong to an airline’s frequent flier program. For each passenger the data include information on their mileage history and on different ways they accrued or spent miles in the last year. The goal is to try to identify clusters of passengers that have similar characteristics for the purpose of targeting different segments for different types of mileage offers.
a) Apply hierarchical clustering with Euclidean distance and Ward's method. Make sure to normalize the data first. How many clusters appear
b) What would happen if the data were not normalized
c) Compare the cluster centroid to characterize the different clusters, and try to give each cluster a label.
d) Use K-means clustering with the number of clusters that you found above. Does the same picture emerge
e) Which clusters would you target for offers, and what types of offers would you target to customers in that cluster.
XLMiner is used applying association rule on data variables. The inputs are as highlighted below:
Number of transaction = 500
Number of variables = 7
Minimum confidence = 0.5 or 50%
XLMiner has generated the input and the part of lists of rules is as shown below:
Rule 1 – As per first association rule, it would be fair to conclude that the concerned person who first purchases brush would then purchase nail polish with an estimated confidence of 100%.
Rule 2 - As per second association rule, it would be fair to conclude that the concerned person who first purchases nail polish would then purchase brush with an estimated confidence of 63.22%.
Rule 3 - As per third association rule, it would be fair to conclude that the concerned person who first purchases nail polish would then purchase bronzer with an estimated confidence of 59.20%.
Additionally, the utility of the association rule lies in the fact that they can enable identification of hidden associations prevalent in consumer buying behavior. However, in order to use the same a balance between support and confidence is required. This is because if the support is high, then the rules regarding rare items rule are not displayed. However, if the support is kept at a low value, hence the rules generated are quite more which tends to undermine the end utility of these and hence not recommended (Liebowitz, 2015).
Minimum confidence = 0.75 or 75%
XLMiner has generated the input and the part of lists of rules is as shown below.
It would be fair to conclude that increase in the confidence percentage from 0.5 to 0.75, the number of list of association rules displayed is decreased. This is because only the association rules which has fall in the range above the selected minimum confidence percentage would appear. Hence, the rules which display lower confidence percentage as compared with minimum confidence percentage would be removed automatically through XLMiner. This could be problematic since the rules not displayed may have high support levels and hence significant (Ragsdale, 2014).
The above output was generated using the Ward method whereby three clusters were chosen which have been obtained and the relevant output confirms the same if a horizontal line is drawn at a distance of about 990, the dendogram would be intersected at three unique points reflecting that three clusters have been obtained.
Key Observations:
Key Observations:
There is parity in terms of the number of cluster formed. However, the characteristics on close scrutiny would indicate that the underlying difference.
In line with the above observations, it would be fair to consider that this cluster would be labeled as “High Networth Flyers”. However, the comparison with hierarchical clustering output clearly indicates the difference as in that case, it was cluster 2 that comprise of these flyers. Thus, it would be fair to assume that the output of K-Means Clustering varies with hierarchical clustering (Grossmann & Rinderle-Ma, 2015).
For cluster 3, the incentive in the form of higher rewards point needs to be outlined so that the customer uses frequent flyer card and the bonus points utilized for flight transactions.
Abramowics, W. (2013) Business Information Systems Workshops: BIS 2013 International Workshops (5th ed.). New York: Springer.
Ana, A. (2014) Integration of Data Mining in Business Intelligence System (4th ed.). Sydney: IGA Global.
Grossmann, W. & Rinderle-Ma, S. (2015) Fundamentals of Business Intelligence (2nd ed.). New York: Springer.
Liebowitz, J. (2015) Business Analytics: An Introduction (2nd ed.). New York: CRC Press.
Ragsdale, C. (2014) Spread sheet Modelling and Decision Analysis: A Practical Introduction to Business Analytics (7th ed.). London: Cengage Learning.
Zaki, M.J.(2000), Generating non-redundant association rules. In: Proceeding of the ACM SIGKDD, pp. 34–43.
No matter how close the deadline is, you will find quick solutions for your urgent assignments.
All assessments are written by experts based on research and credible sources. It also quality-approved by editors and proofreaders.
Our team consists of writers and PhD scholars with profound knowledge in their subject of study and deliver A+ quality solution.
We offer academic help services for a wide array of subjects.
We care about our students and guarantee the best price in the market to help them avail top academic services that fit any budget.
15,000+ happy customers and counting!