Customer-Segmentation

Customer segmentation is the subdivision of a market into discrete customer groups that share similar characteristics. It can be a powerful means of identifying unsatisfied customer needs. Using this insight, companies can then outperform the competition by developing uniquely appealing products and services.

The most common ways in which businesses segment their customer base are:

Demographic information, such as gender, age, familial and marital status, income, education, and occupation.
Geographical information, which differs depending on the scope of the company. For localized businesses, this information might pertain to specific towns or counties. For larger companies, it might mean a customer’s city, state, or even country of residence.
Psychographics, such as social class, lifestyle, and personality traits.
Behavioral data, such as spending and consumption habits, product/service usage, and desired benefits.

Advantages of Customer Segmentation

Determine appropriate product pricing.
Develop customized marketing campaigns.
Design an optimal distribution strategy.
Choose specific product features for deployment.
Prioritize new product development efforts.

The Challenge

You own a supermarket mall and, through membership cards, you have some basic data about your customers: customer ID, age, gender, annual income, and spending score. You want to understand your customers, in particular who the target customers are, so that this insight can be passed to the marketing team, which can then plan its strategy accordingly.

K Means Clustering Algorithm

Specify number of clusters K.
Initialize centroids by first shuffling the dataset and then randomly selecting K data points for the centroids without replacement.
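Keep iterating until there is no change to the centroids, i.e. the assignment of data points to clusters isn’t changing. In each iteration, assign every data point to its nearest centroid, then recompute each centroid as the mean of the data points assigned to it.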
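
The steps above can be sketched in plain NumPy. This is a minimal illustration rather than the notebook's actual code: the function name `k_means`, its parameters, and the two synthetic blobs of points are all assumptions made for the example.

```python
import numpy as np


def k_means(points, k, max_iter=100, seed=0):
    """Cluster `points` (n_samples x n_features) into k groups (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    points = np.asarray(points, dtype=float)

    # Pick K distinct data points as the initial centroids. Sampling indices
    # without replacement has the same effect as shuffling and taking the first K.
    centroids = points[rng.choice(len(points), size=k, replace=False)]

    for _ in range(max_iter):
        # Assign each point to its nearest centroid (squared Euclidean distance).
        distances = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = distances.argmin(axis=1)

        # Recompute each centroid as the mean of its assigned points;
        # keep the old centroid if a cluster happens to be empty.
        new_centroids = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])

        # Stop once the centroids no longer change.
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids

    return labels, centroids


# Illustrative data only: two well-separated blobs, not the mall dataset.
rng = np.random.default_rng(42)
data = np.vstack([
    rng.normal(loc=(20, 80), scale=2.0, size=(50, 2)),
    rng.normal(loc=(80, 20), scale=2.0, size=(50, 2)),
])
labels, centroids = k_means(data, k=2)
```

The `max_iter` cap is a safety net added for the sketch; with well-separated data the loop normally stops much earlier because the centroids settle.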

Data

This project is a part of the Mall Customer Segmentation Data competition held on Kaggle.

The dataset can be downloaded from the Kaggle website.

Environment and tools

scikit-learn
seaborn
numpy
pandas
matplotlib
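
As a rough sketch of how these tools can fit together, the example below clusters a small synthetic table with pandas, NumPy, and scikit-learn. The column names `annual_income` and `spending_score`, the generated values, and the choice of two clusters are assumptions made for illustration; they are not taken from the notebook or from the actual CSV headers.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the membership-card data; values are illustrative only.
# Column names are assumptions and may differ from the real CSV headers.
rng = np.random.default_rng(0)
customers = pd.DataFrame({
    "annual_income": np.concatenate([rng.normal(30, 5, 40), rng.normal(90, 5, 40)]),
    "spending_score": np.concatenate([rng.normal(80, 5, 40), rng.normal(20, 5, 40)]),
})

# Scale the features so that neither one dominates the distance computation.
features = StandardScaler().fit_transform(
    customers[["annual_income", "spending_score"]]
)

# n_clusters=2 matches the synthetic data; on real data K has to be chosen.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
customers["segment"] = model.fit_predict(features)

# Average profile of each segment.
summary = customers.groupby("segment")[["annual_income", "spending_score"]].mean()
print(summary)
```

A per-segment summary like this is the kind of output that could be handed to a marketing team, since it describes each group by its typical income and spending level.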

https://github.com/SARTHAK-27/Customer-Segmentation

Conclusions

K means clustering is one of the most popular clustering algorithms and is usually the first thing practitioners apply to a clustering task to get an idea of the structure of the dataset. The goal of K means is to group data points into distinct, non-overlapping subgroups. One of its major applications is customer segmentation, which gives a company a better understanding of its customers and can in turn be used to increase revenue.

