Welcome to Machine Learning for Business, a hands-on workshop. We'll explore why data science and ML have become so important in the new era of business.
Retail Battle: real-life examples
Two well-known examples show how data science can become a competitive advantage for a business.
Walmart predicting demand before Hurricane Frances
Hurricane Frances struck in 2004, causing massive damage in Florida. The hurricane had been forecast a couple of weeks in advance. In that time, Walmart had to decide what items to ship to Florida to stock their stores. Some items are expected to have higher demand during catastrophic weather events: batteries, water, flashlights, etc.
But Walmart was digging for something more: what else do people usually buy in these circumstances? To answer that question, they relied on data. According to the NY Times, Walmart had 460 terabytes of data (as of 2004) stored about their customers, employees, purchases and sales. That is a huge volume of data, and Walmart knew they could take advantage of it. After some data mining, they found two items in particular that would be in high demand because of the hurricane:
- Strawberry Pop-Tarts 🍓
- Beer 🍺
Who would have thought, right? This is the key. With data, we can rely on real behavior and past events, and not so much on human intuition, which can be flawed or biased. The result is that we can anticipate events and improve our decision making. This way, we can:
"Start predicting what's going to happen, instead of waiting for it to happen" -- Linda Dillman, Walmart's ex-CIO
Key takeaway from Walmart's example: Decision making can be greatly improved with data.
(Read more about Walmart's case in the original NY Times piece)
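To make the idea concrete, here is a minimal sketch of the kind of question Walmart was asking: which items sell unusually well before a hurricane compared with a normal period? This is not Walmart's actual method. The `SALES` records, the period labels and the `demand_lift` function are all invented for illustration, and the numbers are toy values.

```python
from collections import defaultdict

# Toy, made-up sales records: (item, period, units_sold).
# "pre_hurricane" stands for the days before a past storm,
# "baseline" for a comparable normal period.
SALES = [
    ("batteries", "baseline", 100), ("batteries", "pre_hurricane", 300),
    ("pop_tarts", "baseline", 50), ("pop_tarts", "pre_hurricane", 350),
    ("beer", "baseline", 200), ("beer", "pre_hurricane", 500),
    ("shampoo", "baseline", 80), ("shampoo", "pre_hurricane", 80),
]

def demand_lift(sales):
    """Return {item: pre-hurricane units / baseline units}."""
    totals = defaultdict(lambda: defaultdict(int))
    for item, period, units in sales:
        totals[item][period] += units
    return {
        item: periods["pre_hurricane"] / periods["baseline"]
        for item, periods in totals.items()
        if periods["baseline"] > 0  # skip items with no baseline sales
    }

lift = demand_lift(SALES)
# Rank items from the largest to the smallest increase in demand.
ranked = sorted(lift.items(), key=lambda kv: kv[1], reverse=True)
for item, ratio in ranked:
    print(f"{item}: {ratio:.1f}x")
```

Items with a ratio well above 1 are candidates for extra stock. Items with a ratio close to 1 (shampoo, in this toy data) sell about the same whether or not a storm is coming.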
Target knows when you're pregnant
(even before your father does)
This is a creepier story, but still very interesting. The year is 2012, and Andrew Pole was, at the time at least, a statistician at Target. He talked to the NY Times and outlined the process they used to identify not just whether a woman was pregnant, but also her due date to within a small window. 🤰🍼
The key was to analyze the purchase history of women known to be pregnant and identify the items they bought more often during those months.
Using that data, Target could identify a customer who was pregnant and, even more importantly, offer coupons timed to very specific stages of her pregnancy.
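A rough sketch of how such a signal could be turned into an automatic decision is shown below. It is only an illustration, not Target's model: the item names, the point values in `ITEM_POINTS`, the threshold and both function names are hypothetical. In a real system the weights would be learned from the purchase histories of customers known to be pregnant.

```python
# Hypothetical points: how strongly each item is assumed to signal pregnancy.
# These values are invented for illustration, not taken from any real model.
ITEM_POINTS = {
    "unscented_lotion": 3,
    "prenatal_vitamins": 5,
    "cotton_balls": 1,
}

def pregnancy_score(basket, points=ITEM_POINTS):
    """Sum the points of the distinct signal items found in a basket."""
    return sum(points.get(item, 0) for item in set(basket))

def should_send_coupon(basket, threshold=6):
    """Decide per purchase, with no human involved, whether to send a coupon."""
    return pregnancy_score(basket) >= threshold

basket_a = ["unscented_lotion", "prenatal_vitamins", "bread"]
basket_b = ["bread", "cotton_balls"]
print(should_send_coupon(basket_a), should_send_coupon(basket_b))
```

Because the rule is just a function of the basket, it can run on every purchase as it happens.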
There's a key difference between this second story and Walmart's: its "real time" nature. After all, Target's example is set in 2012, ages into the technological future compared to Walmart's. In Walmart's case, the decision was made "offline", probably took a couple of weeks and involved some human-driven decision making. In Target's case, the decision is made by a machine, in real time. This is the second important aspect of Data Science in business:
Key takeaway from Target's example: Being able to make decisions and take action in real time, much faster than anything driven by humans.
_(Read more about Target's case in the original NY Times piece)_