Have The Chance That You Deserve

chance

What’s up ? This is THE stephane ANDRE !!! I watched an Olivier Roland’s video  and I learned some good stuff.

We’re lucky that we deserve. It’s true that there a bit of luck in everyone’s success. If I take my case, I recognize that I was lucky to be born in an industrialized country with internet and a good education. But I also created some of the luck that allows me to have my lifestyle today.

We aren’t all born under the same star or not with the same cards, that we can’t change. The thing we can do is to do with what we have to get something better. Having this state of mind is essential to having a better lifestyle. It’s important to be proactive. Whatever the bad luck or the handicap, you have the possibility to realize your dreams. It will be harder than other people but it’s possible with perseverance and patience.

Made with what you have

use what you got to have what you want

Here’s what you have to say to yourself : « My dream is that, maybe I’m not very lucky to succeed and maybe people will think I’m crazy, but I’m going to try to do something. I’m not going to do like the majority of people, I’m going to act, I’m going to do different and do interesting things ».

The truth is that the greatest craziness is to believe that by always doing the same thing, we will have different results. By always doing the same thing, we always have the same results. If you think you deserve better than that in your life, change your comfort, change your actions.

Seneca says something interesting in « Moral Letter to Lucilius ». He talks about people who commit suicide. He says it’s a paradox. It’s a paradox to give up life. What Seneca means is that it’s more constructive to try to change the actions that led to the results of being disgusted with life rather than leaving life.

It’s true that there are also elements that are outside our control or elements like psychology, depression, but whatever your despair, you can choose to change your actions to have another result. I will not lie to you, it will not be easy and it will not happen overnight. The main thing is that you have to try new things, experiment new things, get out of your comfort zone. It’s like this quote : « It’s when we walk that we risk failing ».

Go to another world

comfort zone

The majority of people are in a frozen environment. They’ve been doing the same things for years, they’ve been thinking the same way for years and their lifestyle is stagnating. Get out of this environment to experiment other things and meet people who think differently but share the same values as you. It’s these types of people that will motivate you to do things you never thought you’d do.

When you dream of something, it always seems impossible to you. Publish a book, start a company, change job because it’s frustrating, all that, you feel like it’s outside of your world. It’s true that it’s outside of your world but you have the possibility of making it a reality. Look, imagine that you are on the beach and you have to go to the mountain but you don’t know the way. The majority of people will stay on the beach because it’s more comfortable and safer.

It’s true that it’s safer to stay on the beach but over time, it becomes boring. The best plan is to go towards the mountain and on the way, you will find the first step. At this point, you can ask how to go to the second stage. At the second stage, you can ask how to go to the third stage, etc. In this way, gradually, you reach the mountain learning a lot of new things.

To start a company, it’s the same plan. You find an idea. Then you write a 2-3 pages project that explains how this idea can become a company. Then you look for how to finance this company. Then you create a product or service and you contact prospects, etc. It’s obvious that when you start these projects, you don’t really know the stages. Do this plan now with the knowledge you have, this will allow you to demystify the process and find the first stage.

We must stop dwelling. You have goals, you have frustrations, you want to do things to improve your lifestyle, run the plan. Go to the first stage and get information for the next stage. There is never a perfect time to start (that’s an excuse to procrastinate). You have to start imperfectly and do it anyway. It’s the only way to get the chance you deserve.

Subscribe to my newsletter and share this article if you think it can help someone you know. Thank you.

-Steph

P.S. If you’re in Miami and you like Caribbean food, go to my cousin’s bistro to eat Haitian food. Click here

Validate Data Mining In Tableau With A Chi-Square Test

validate validation

I have just enrolled in a Data Science course on Udemy  and I learned good stuff.

In this article we will start using statistics. Don’t worry we’ll do something simple, we’ll use the Chi-square test in a basic way. There is a special section to learn how to do statistics at an advanced level.

I’ll explain why we’re going to learn how to use the Chi-square test. The results we have with theses 2 bar charts are good. We see on theses 2 bar charts that age has a significant impact on the rate of client leaving the bank. We also see in which age groups the clients leaves the bank the most and which age groups the clients leave the bank the least. With that we have good insights.

In the A/B test « Gender », we can see that there is a correlation between the male and female sex and the choice to leave the bank. But as I said before, this A/B test is basic. The results of a basic A/B test visually shows us what is probably happenning in reality but we aren’t 100% sure of these results. To validate these results, we need do to use statistical tests like Chi-square test.

Doing a report based on basic A/B test is very risky and you can have completely false insights. I don’t advise you to do it (unless you want to leave your job). It’s for this reason that using Chi-square will help us to have strong insights.

Chi-square will allow us to know if our results are statistically significant. Our results are based on a sample of 10 000 clients and Chi-square test will tell us if these results are due to chance effects or if these results can represent all the client of the bank.

For example in our A/B test « Gender », we observed that in our sample of 10 000 clients, women are more likely to leave the bank compared to men.

tableau data mining science chi square test a/b test

Now, we aren’t sure if the results of this sample represent the behavior of all the bank’s clients.

To use basic Chi-square test, we use an online tool. Click here  .

tableau data mining science chi square test a/b test

On internet, there are plenty of websites to do a Chi-square test but we’ll use this one so that you can understand how it works. To do a Chi-square test, we need to use absolute values and in our A/B test we have percentage.

Let’s go back to Tableau. We’ll create a new tab with a version of A/B test with absolute values. In this way, we keep the A/B test with the percentages. Do a right-click on the « Gender » tab and select « Duplicate ».

tableau data mining science chi square test a/b test

Name the new tab « Gender Actual » to specify that it’s absolute values.

tableau data mining science chi square test a/b test

To have the absolute values, move « Number of Records » in « Measures » to the « Marks » area and put it over top of « SUM(Number of Records ».

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Move « Number of Records » in « Measures » to « Rows » over « SUM(Number of Records ».

tableau data mining science chi square test a/b test

Cool, we have our absolute values.

tableau data mining science chi square test a/b test

We also need total absolute values, which means the total number of men and women. There is a very fast way to get that. Right-click on the vertical axis and select « Add Reference Line ».

tableau data mining science chi square test a/b test

Then in « Value », click on the drop-down on the right and select « Sum » to have the total sum of the observations.

tableau data mining science chi square test a/b test

And in « Scope », you select « Per Cell » option to specify that you want the total sums for each category, male and female.

tableau data mining science chi square test a/b test

Now, we have the total sum at the top of the bars. We will modify labels to have the absolute values. In « Label », we will change « Computation » to « Value » and click on the « OK » button.

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Perfect, we have the total amount of observation at the top of each bar : 4543 women and 5457 men. We have what we need to use our online tool.

tableau data mining science chi square test a/b test

OK, I’ll explain how this tool works. « Sample1 » and « Sample2 » correspond to the independent variable « Gender ». You choose in which order you enter the data, « Sample1 » for men or the opposite. In our case, we use « Sample1 » for women and « Sample2 » for men.

« #success » corresponds to the result Y=1, which means in our case « yes, the client left the bank ».

« #trials » is the total number of observations, which means the total number of women in « Sample1 » and the total number of men « Sample2 ».

That’s how you enter the data :

  • For « Sample1 » in #success, you enter 1139 because there are 1139 women who left the bank. For « Sample1 » in #trials, you enter 4543 because there are 4543 women in total.

 

  • For « Sample2 » in #success, you enter 898 because there are 898 men who left the bank. For « Sample2 » in #trials, you enter 5457 because there are 5457 men in total.

tableau data mining science chi square test a/b test

Here is the verdict : « Sample1 is more successful ». « Sample1 » corresponds to women and #success is :« yes, the client left the bank ». This verdict means that of all the bank’s client, women are more likely to leave the bank than men. And look, there is something important, it’s « p<0.001 ». This means that the « p » is strictly less than 0.001.

tableau data mining science chi square test a/b test

« p » is the value that indicates whether an independent variable has a statistically significant effect on a dependent variable. In our case, the independent variable is « Gender » and the dependent variable is « Exited », which is : « yes, the client left the bank ». So « p » is strictly less than 0.001, which means that the independent variable « Gender » has a statistically significant effect on the dependent variable « Exited ». This shows us that out of the total number of bank’s clients, women are more likely to leave the bank than men.

This is how we use Chi-square test with this online tool. This is the same principle on all online tools that you can find on Google or DuckDuckGo . You can repeat these instructions that I gave you with other tools, you will get the same results.

It’s cool with the Chi-square we validated the A/B test and to specify that this A/B test is validated, we’ll color the tab in green.

Right-click on the tab, select « Color » and select « Green ».

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Perfect, now we’ll validate another A/B test. Selects « HasCreditCard » tab.

tableau data mining science chi square test a/b test

We’re going to create an A/B test « HasCreditCard » only with absolute values. To save time, right-click on « Gender Actual » tab and select « Duplicate ».

tableau data mining science chi square test a/b test

We’ll remove the green color on the tab « Gender Actual (2) ». Right-click on the tab and select « Color » and « None ».

tableau data mining science chi square test a/b test

You rename the tab « HasCreditCard Actual ».

tableau data mining science chi square test a/b test

Move the variable « HasCrCard » over « Gender » in « Columns ».

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Excellent, everything is ready to do a Chi-square test. We’ll remove « Exited » labels to better see the absolutes values. Make a click and drag out.

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Perfect, let’s go back to our online tool. In this case, « Sample1 » is « no », which means client who don’t have credit card and « Sample2 » for « yes », which means clients who have a credit card.

That’s how you enter the data :

  • For « Sample1 » in #success, you enter 613 because there are 613 clients who left the bank. For « Sample1 » in #trials, you enter 2945 because there are 2945 clients who don’t have a credit card.
  • For « Sample2 » in #success, you enter 1424 because there are 1424 clients who left the bank. For « Sample2 » in #trials, you enter 7055 because there are 7055 clients who have a credit card.

tableau data mining science chi square test a/b test

Let’s look at the verdict, it’s « No significant difference ». « p » value is very high, it’s above 5%. This confirms that the independent variable « HasCrCard » has no statistically significant effect on the dependent variable « Exited ». That was the conclusion we had made when we had done the A/B test with percentages.

We had seen that there was 21% of « Exited » (clients who left the bank) in the category « no » and 20% in the category « yes ». With these results we concluded that most likely the variable « HasCrCard » had no impact on the rate of clients who left the bank. Chi-square test confirms our conclusion and we can put the tab « HasCrCard » in green to say that it’s OK.

Right-click on the tab « HasCreditCard » => « Color » => « Green ».

tableau data mining science chi square test a/b test

tableau data mining science chi square test a/b test

Excellent, now, you can do a statistical A/B test with 2 categories. Soon, we will do statistical A/B tests with more than 2 categories.

Share this article if you think it can help someone you know. Thank you.

-Steph

Bank Online Peer-To-Peer (Part 1)

digital online bank money currency account

What’s up ? This is THE stephane ANDRE !

Today, in my workout program (tracked with JeFit  ), I did shoulders, traps, calves and forearms. My muscles gains in forearms are slow but it’s cool, it’s the process. I didn’t use weight to train my flexors and extensors of my forearms so it take time. I started it since for 1 month. I hurt my right knee during a box sparring 1 month ago and now I can lift heavyweight on my left. Yeah back to do squat, leg press with one leg and deadlift. I’m happy.

Cryptocrurrency

cryptocurrency

Since 4 months, I think to buy cryptocurrency, espacially Ethereum . I’m more interested by Ethereum than Bitcoin, because with Ethereum you can make smart contract. By example, you and your friend make a bet on a match (basket, soccer, box, whatever). You don’t want to make a bet with a bookmaker because the charge are too high. With the smart I’ve seen, you can program something like this : « If Team A wins, the loser’s money is automatically transferred to the winner’s Ethereum account at the end of the match ». All without charge. In the smart contract that I saw, the 2 people had an Ethereum account.

The principle of cryptocurrency is to transfer money or buy something without a third part (bank or financial service) because charges increase year after year. In few months, I will buy a Ledger Nano S  (usb wallet for cryptocurrency)

Digital Bank

digital online bank money currency account

Because of cryptocurrency, I thought : « Is it possible to have a bank account and transfer money without a third part ? ». I have a Paypal account but I mean a real bank account with a debit card, you dig ? After watched an Oliver Roland’s video , I discovered 2 bank online with debit card or precisely pre-paid debit card. It’s Transferwise  and Revolut .

I read articles about the difference on the 2 and Revolut is excellent for people who travel a lot to avoid high currency exchange fees (even cash machine) when abroad and Transferwise is excellent for international money transfer with the lowest charge. Transferwise and Revolut have app on GooglePlay/AppStore and a deal with MasterCard for the pre-paid debit card.

2 days ago, I opened a Transferwise’s account (multi-currency) without meet somebody, I did everything online. I create an account like the majority of web services with a form and I gave them a scan of my ID and a scan of a bank statement to confirm my address. I received a confirmation by email less than 5 work day week. I’ll put money on this account in few days and I will ask to have a pre-paid debit card.

By looking a little more on Transferwise (because this company created in 2011), I understood that it could be beneficial for people who use Western Union or Ria to send money to parents living in the native country. My parents did it. You see, if you’re an immigrant or an expat it could be more avantageous about international money transfer fees. A story about the co-founder Kristo Käärmann (from Estonia) is he was working in the UK and was losing a lot of money transferring cash back home to pay for a mortgage. It can be also interesting in this case.

In January 2018, Transferwise launched the Mastercard pre-paid debit card, I really curious to use it to see how it work. As I told you, it’s been 2 days that I have my Transferwise’s account, that’s why this article is Part 1. I’ll give my point of view on Transferwise soon. And I think to open a Revolut’s account in few months.

Share this article if you think it can help someone you know. Thank you.

-Steph

Work Effectively And Earn More (Part 2)

work effectively effective

I watched an Olivier Roland’s video  and I learned good stuff.

If you don’t have read Part 1, click here .

5 actions to be effective

5

Optimize your working time

Use Pareto’s Law by focusing on the 20% of your actions that contribute 80% of your results and using Parkinson’s Law to determine how long to complete a task.

Here are other actions to put in place to optimize your time :

  • Don’t disperse yourself

  • Stop multitasking – This has been scientifically proven to be a waste of time and productivity. Read this scientific study .

  • Stop interruptions – Things like smartphone notifications, emails or messages.

  • Group actions.

  • Remove unnecessary tasks – To find out if you’re doing a useless task, ask yourself this question from Peter Drucker : « Why am I doing this ? Is it necessary ? » With this question, you can easily delete unnecessary tasks. Exceptionally, you can use a notification on your smartphone that displays this question every 30 minutes. It’s a type of reminder all day long.

  • Identify the 20% of things and people that cause 80% of your problems and delete them. If it’s someone in your family, talk to that person 2-3 times a week instead of every day.

Automate everything you can

Many tasks can be automated in companies. For example to send messages on social media (I use Buffer  ). It’s possible to automate a sale on internet, it’s the customer who does everything. The customer looks for a product, uses his/her credit card by filing out the payment form of the website, and the bill created automatically based on the information provided by the customer, etc.

It’s also possible to automate a company, this is the case of Drop Shipping. Drop Shipping is when you sell products that you don’t have in stock and that are sent directly from the supplier to the customers. Amazon offers this type of service too, you can put in their catalogs products that you sell and entrust to Amazon for the stock’s management, sending and returns of products. I wrote an article on Amazon’s drop shipping, here.

There is also the case of muses that explains Tim Ferriss in his book « 4 hours workweek ».

Delegate

Focus on your strengths and delegate the rest. Create a list of tasks that you want to delegate with instructions. Then gives these tasks to a team by assigning each type of task to a specialist.

Duplicate

There is no point in reinventing the wheel. You can duplicate the recipes of your mentors success and use that in your own company.

Recycle

A job that you did can be reused in a different form. For example, articles from a blog can be used to make a book, a podcast or a video.

4 actions to earn more

4

Determine your goal and strategy

Determine your goal, your process to reach it and the strategy to put in place. Here are some examples of strategies for developing your wealth :

  • Replace your salary with real estate income and start your own company.

  • Keep your work as employee and invest a maximum on stock market to create passive income.

  • Create a company to have a complementary income like a blog, a podcast or a Youtube channel.

  • Buy a piece of land and build several apartments (condos).

  • Etc.

Optimize your management to spend less money

  • Analyse the things you have to pay to eliminate waste : unnecessary subscription, insurance too expensive, etc.

  • Print your bank statement and analyze it

  • Seeking a way to achieve the same result by spending less : compare, buy cheaper, negotiate to save money for the things you really need.

  • Optimize your taxation by reducing your taxes.

Recycle your skills and your work

You can work on something once and get paid several times. You can create a seminar, keep 3 children instead of 1, you walk 5 dogs instead of 1, etc.

You can also use a job you have already done to create complementary income. Foe example, if you like to take picture, you can put them in stock photos on internet.

Duplicate the processes known to create wealth

  • Pay yourself first

  • Make money work for you by saving at least 10% of your income to invest them.

  • Invest in yourself with training to learn new skills

Here are the options you can use to create a company that serves your life (and not your life serving your company). With internet it’s easier to use these levers with a blog, podcast or a Youtube channel by creating content.

Share this article if you think it can help someone you know. Thank you.

-Steph

Look For Anomalies

anomaly

I have just enrolled in a Data Science course on Udemy  and I learned good stuff.

We’ll learn how to duplicate a bar char to create a new A/B test. We’ll create several A/B test to look for anomalies.

But before that, we’ll name the sheet. Right-click on the tabe and select « Rename Sheet ».

tableau a/b test tableau dataset anomalies

Rename the sheet « Gender ».

tableau a/b test tableau dataset anomalies

Now right-click on the « Gender » tab and select « Duplicate ».

tableau a/b test tableau dataset anomalies

Rename this new tab « Country ».

tableau a/b test tableau dataset anomalies

We’ll do an A/B test with the countries and we’ll reuse everything we did with the A/B test « Gender » to save time.

As you can see « Gender » is in « Columns ».

tableau a/b test tableau dataset anomalies

To use this A/B test with a variable other than « Gender », move the variable you want on top of « Gender » in « Columns ».

Go, go ! There is « Geography » in « Dimensions », takes « Geography » and puts it on « Gender ».

tableau a/b test tableau dataset anomalies

Boom with 1 click we have our A/B test for countries.

tableau a/b test tableau dataset anomalies

We have the percentage of clients who left and stayed in the bank for each country (Germany, Spain and France).

In this A/B test we can see that in Germany, many clients left the bank with a rate of 32%. For Spain and France, the rate of clients who left the bank is below the average departure rate (20%), 17% for Spain and 16% for France.

Already, we have interesting insigns. We can find out if in Germany there is a new aggressive competitor with more interesting offers or if there is a new law unfavorable to the bank’s offers that has been voted. It’s necessary to do reseach in Germany to find the reason for this high rate of departure.

You have seen, usually an A/B test has 2 categories but in our case, there are 3 categories. We could call it an A/B/C test but it’s a bit bizarre. When there are more than 2 categories, we call it a classification test.

In this article, I will continue to use the term A/B test but remember the term classification test for the next time.

Let’s do another A/B test quickly.

Duplicate this A/B test by right-clicking on the « Country » tab and selecting « Duplicate ».

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

This time we will study the variable « Has Cr Card ». This variable is « 1 » if the client has a credit card and « 0 » if the client doesn’t have a credit card.

You saw ? This variable is a categorical variable because it is binary « 1 » and « 0 » but it is in « Measures ». Since this variable is categorical, it should be in « Dimensions » so we will move the variable « Has Cr Card » from « Measure » to « Dimensions ».

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

Now that it’s done, move « Has Cr Card » over « Geography » in « Columns ».

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

It’s cool, we have a new A/B test for credit cards. What we can observe in this A/B test is that there is not a big difference between the departure rate of clients who don’t have a credit card (21%) and the departure rate of clients who have a credit card (20%).

It’s time to create aliases for this A/B test. Right-click on « Has Cr Card » and select « Alias…. ».

tableau a/b test tableau dataset anomalies

To start, « 0 » means that the clients don’t have a credit card so in « Value », you write « No ». « 1 » means that the clients has a credit card so in « Value », you write « Yes ». Then you click on the « OK » button.

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

That’s it, the bar chart is easy to read now. We understand that among clients who don’t have a credit card, 21% left the bank and among clients who have a credit card, 20% left the bank. We can conclude that having or not having a credit card doesn’t have a significant impact on the decision to leave the bank.

It’s time to rename this tab. Right-click on the « Sheet4 » tab and select « Rename Sheet ». Name the sheet « HasCreditCard ».

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

Let’s go, let’s do another A/B test with another variable. Let’s look at « Measure » and study the variable « IsActiveMember ».

The variable « IsActiveMember » is « 1 », if the client is active and « 0 » it the client is inactive. It’s necessary to detail the definition of IS ACTIVE. IS ACTIVE depends on the criteria of the bank. For example, it could be : « Did the client log in at least once to their bank account last month ? » or « Has the client made at least one banking transaction last month ? », etc.

As you can see, the variable « IsActiveMember » is a categorical variable (binary 1 and 0) so it’s a variable to move to « Dimensions ».

Here’s another way to move a variable from « Measures » to « Dimensions ». Right-click on « IsActiveMember » and select « Convert to Dimensions ».

tableau a/b test tableau dataset anomalies

Perfect, the variable « IsActiveMember » is in « Dimensions ».

tableau a/b test tableau dataset anomalies

We will duplicate our « HasCreditCard » sheet. Right-click on « HasCreditCard » tab and select « Duplicate ».

tableau a/b test tableau dataset anomalies

Renamce this tab « IsActiveMember ».

tableau a/b test tableau dataset anomalies

Since we have diplucted what we did with « HasCreditCard », we simply need to take the variable « IsActiveMember » from « Dimensions » and more that over « HasCrCard » in « Columns ».

tableau a/b test tableau dataset anomalies

tableau a/b test tableau dataset anomalies

Let’s create aliases to make reading this bar chart easier. Right-click on « IsActiveMember » and select « Aliases… ».

tableau a/b test tableau dataset anomalies

For « 0 », we put « No » because the client is not active and for « 1 », we put « Yes » because the client is active. Click on the « OK » button.

tableau a/b test tableau dataset anomalies

Here is what we can see with this A/B test « IsActiveMember ». Among inactive clients, 27% left the bank. Among active clients, 14% left the bank. This show is that clients who are not active are more likely to leave the bank than active clients.

Indeed, a client who is active means that he/she uses his/her bank account and products of the bank so an active client is satisfied with the bank. It’s possible that some clients leave the bank because of external factors such as a competitor, new regulations or elements of the private life of the client.

It’s cool, we created 4 A/B tests in a few minutes.

  1. An A/B test « Gender » that allowed us to see that women were more likely to leave the bank.

  2. An A/B test « Country » that allowed us to see that it is in Germany that clients are most likely to leave the bank.

  3. An A/B test « HasCreditCard » which allowed us to see that having or not having a credit card didn’t have a significant impact on the descision to leave the bank.

  4. An A/B test « IsActive Member » allows us to see that client who aren’t active are more likely to leave the bank .

I will leave you a homework. You’ll do an A/B test with the variable « Number Of Product » which is still a category variable. The variable « Number Of Products » indicates the number of product that the client has in the bank. Add aliases to make reading the bar chart easier.

I trust you I’ll give you the answer in th next article,

Share this article if you think you can help someone you know. Thank you.

-Steph

Dataset For Data Mining

dataset data mining

I have just enrolled in a Data Science course on Udemy  and I learned good stuff.

To have the dataset to do Data Mining, you need to go to the superdatascience website . In « Part.1 Visualization », you see the section « How to use Tableau for Data Mining ». Click on « Churn Modeling » to download the file.

dataset data mining

Once you have downloaded the file, move the file to the directory you created for the course. In this directory, create a new directory (unless you already do it) named « 2.Chunk investigation ».

dataset data mining

dataset data mining

Open this fiel with Excel or with other spreadsheet software.

dataset data mining

Know that we use this dataset for the visualization part but we will also use this dataset for the modeling part.

Let’s analyze the data of this dataset.

This dataset is quite large because it contains 10 000 lines and a few columns. This is the list of a bank’s client. The client information is :

  • Customer id (login)

  • Surname (last name)

  • Credit score ( is the measure that indicates the client’s ability to borrow)

  • Geography (client’s country)

  • Gender (male or female)

  • Age

  • Tenure -(the number of years the client is in the bank)

  • Balance (balance of the client’s bank account)

  • NumOfProduct (number of product that the client has in the bank – credit card, contract, account)

  • HasCrCard (does the client have a credit card ?)

  • IsActiveMember (did the client use his/her credit card during the last month ?)

  • EstimatedSalary (the bank’s estimate of the client’s annual salary)

  • Exited (did the client leave the bank ?)

Now, I will explain the context related to this dataset. This bank has branches in several countries like Germany, Spain and France. This bank noticed that lately there were many clients who left the bank. The bank has a report called « churn rate » which is the customers rate who leave the bank and for a few months the « churn rate » is really higher than usual. It’s for this reason that the bank needs a data scientist (you) to find the problem and propose solutions.

This dataset is a small sample of clients bank. These are 10 000 randomly selected client.

The column « Exited » is a column that didn’t exist before. This column has created when the bank realized that there was an abnormal number of client who were leaving the bank.

dataset data mining

Then the bank observed these clients for 6 months to see which client left the bank.

dataset data mining

In the « Exited » column, the number « 1 » means that the client left the bank and the number « 0 » means that the client stayed in the bank.

To analyze this dataset, you’ll need to do A/B Tests. For exemple, a classic A/B Test is to see if women are more likely to left the bank than men. That’s means, see the number of men who left the bank, see the number of women who left the bank and then normalize by the total number of clients. It’s important to normalize the number of clients because there are not the same proportions of women as men. Next, based on the last column « Exited », you’ll find out if it’s the men or women who are likely to left the bank.

Once you have relevant results, you can show your report to the bank. And with this report you should be able to propose solutions to the bank. For example, if the report says that women leave the bank in bulk, it’s because there is a problem and it’s necessary to see whether the bank is offering women something right. Or it’s possible that another bank offers a much more attractive offer for women or something else.

You will learn how to investigate in the dataset and find answer through client information with A/B tests.

Share this article if you think it can help someone you know. Thank you.

-Steph