Academic Integrity: tutoring, explanations, and feedback — we don’t complete graded work or submit on a student’s behalf.

After a few beers your CIO invited his buddy from Blue Moon consulting to propos

ID: 3713989 • Letter: A

Question

After a few beers your CIO invited his buddy from Blue Moon consulting to propose a project using data mining to improve the targeting of the new service that you have been a principal in developing. The service has been quite successful so far, being marketed over the last 6 months via your ingenious, and very inexpensive, word-of-mouth campaign. You've already garnered a pretty large customer base without any targeting, and you've been seeing this success as your best stepping stone to bigger and better things in the firm. After some reflection, you've decided that your best course of action is to play a key role in ensuring the success of the data mining project as well. You agree with your CIO's statement in a meeting with Blue Moon, that very accurate targeting might cost-effectively expand your audience to consumers that word-of-mouth would not reach.

Identify the four most serious flaws in this abridged version of Blue Moon's proposal, and suggest how to ameliorate them. You can accept that Blue Moon has accurate information about the service.

"We will build a logistic regression (LR) model to predict service uptake for a consumer, based on the data on your existing customers, including their demographics and their usage of the service. We believe that logistic regression is the best choice of method because it is a tried and true statistical technique, and we can interpret the coefficients of the model to infer whether the attributes are statistically significant, and whether they make sense. If they are and they do, then we can have confidence that the model will be accurate in predicting service uptake. We will apply the model to our (Blue Moon's) large database of consumers, and select out those whom the LR model predicts to be the most likely to subscribe. It is a fixed-price, fixed-cost, fixed-term service, so this also will in effect rank them by expected profit as well."

Explanation / Answer

Ans: The most serious flaws are

1) Firstly, on target variable there isn't any past data, so the solution for this is to collect all this new data.

2) The model's accuracy need to be based on how good is prediciting for target varible. But not whether an attribute significant statistically.

3) The evaluation strategy is very unclear as we cant predict that the accuracy method is the best one. So we need to look precisly for this one.

4) Training and testing the data should be performed inorder to ensure that isn't overfit and well generalizes.

5) what percentage of the consumers would fall under "more likely to subscribe" should be established.

//// Thank You /////

Hire Me For All Your Tutoring Needs
Integrity-first tutoring: clear explanations, guidance, and feedback.
Drop an Email at
drjack9650@gmail.com
Chat Now And Get Quote