How to Become a Data Scientist - Amazon S3 · reading, storing and manipulating large quantities of...

Preview:

Citation preview

set of laws

Framework

share with you a structured learning planthat you can start using right away after this webinar to accelerate your path into data science.

skills experience you need tools

make you Irresistible to all data science Employers

EVERYONE will want YOU

YES CHAT

• You are in High demand

• You make a lot of money ($110,000 Median Base Salary)

• You get to work on interesting and challenging problems

• You will be able to have a massive impact on the places where

you work (and others)

• You have High Security (which we know is rare now a days).

The "PHD" System

P - Program

H - Hack

D - Deliver

Statistical Analysis

P - Program

H - Hack

D - Deliver

Machine Learning

• Cool Tip #1:

• Commercial Software for Data Analysis is:

• Expensive:

• Inconvenient:

• Limiting:

1) Coding Saves Money

2) You Can Do Whatever you want:(You’re not restricted by the software’s capabilities)

3) Very Convenient: (You’ll be able to work Anywhere & for Anyone!)

The "PHD" System

P - Program

H - Hack

D - Deliver

Statistical Analysis

P - Program

H - Hack

D - Deliver

Machine Learning

• Remember:

YES CHAT

only 5 Families

Each Algorithm, is just ONE OPTION.

YOU DON’T NEED THEM ALL!

Machine Learning

Do these group

together?

Is this type A or B? or..etc?

Is this unusual?

How can I make this simpler?

How much – or – How

many?

Machine Learning

Clustering Algorithms

Classification Algorithms

Anomaly detection

Algorithms

Dimensionality Reduction Algorithms

Regression Algorithms

But no-one ever tells you!

most important

Step-by-StepSystem

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

The 6 Step Statistical Analysis Process

1) Determine Study’s properties

2) Set your significance Level

3) Investigate Data’s properties

4) Determine the appropriate statistical test

5) Run the test

6) Make a Conclusion to answer the study’s question

WHATWHEN HOW

The "PHD" System

P - Program

H - Hack

D - Deliver

Statistical Analysis

P - Program

H - Hack

D - Deliver

Machine Learning

The "PHD" System

P - Program

H - Hack

D - Deliver

Statistical Analysis

P - Program

H - Hack

D - Deliver

Machine Learning

Structured Learning Process

YES CHAT

learning to code

not using Specialised Commercial Software?

Or

The Great Debate:

• First Appeared in August 1993 thanks to Ross Ihaka and Robert Gentleman• R is a language used specifically for Statistical Analysis and Data Science.

Pros:• It has a strong community• Built by statisticians

Cons:• Steep Learning Curve• Standalone Application (less flexible)• Becoming less popular

• First Appeared in February 1991 thanks to Guido van Rossum.• Python is a general purpose programming language.

Pros:• Very easy to learn and use• Cleaner code• Can do pretty much anything.• Allows easy data science integration

with web apps and production databases.

Cons:• Not as Specialised.

easiest flexible brighter future

• “Pandas” is a python library that is designed to make reading, storing and manipulating large quantities of data super easy.

• Pandas is incredibly powerful, and can easily read/writing data to/from spreadsheet or csv files.

• Pandas is your “backbone” for data analysis of all kinds.

• Scipy is a module that provides many scientific computing functions for Python.

• Scipy.stats is the place where hundreds of statistical tests are stored.

• Scipy will become your go-to tool for doing any data science ESPECIALLY statistical analysis.

• Scikit learn is python’s 1-stop-shop for machine learning!

• It is incredibly popular and can do anything!

1) Clustering

2) Classification

3) Regression

4) Anomaly Detection

5) Dimensionality Reduction

And MUCH More!

Module Purpose Documentation

Pandas Data Storing Powerhouse http://pandas.pydata.org/pandas-docs/stable/

Scipy Scientific Functions for Python https://docs.scipy.org/doc/

Scikit-Learn Machine Learning For Python http://scikit-learn.org/stable/documentation.html

Numpy Precursor to Pandas https://docs.scipy.org/doc/numpy/reference/

statsmodels More Scientific Functions for Python.

http://statsmodels.sourceforge.net/stable/

matplotlib Allow you to plot graphs in Python http://matplotlib.org/contents.html

seaborn Makes it easier to plot graphs in Python.

http://seaborn.pydata.org/

Other Useful Python Packages

Amazon

Picture Removed for Privacy Purposes

Amazon

Picture Removed for Privacy Purposes

• Top Tip:

kaggle.com

Great...

YOU YOUR circumstances.

Or

Law #4: The Law of Personalisation

Employment Freelance

• Employment is where you take a full (or part) time job at a company

Pros:• Consistent Income• Colleagues to Learn From• Structured Career Progression

Cons:• Less Control Over Your Future• Less Freedom Over Your Day• Relatively Fixed Income

Employment

Indeed.comGlassdoor.comCareerbuilder.comDice.comIdealist.org (For internships)LinkedIn.comMonster.com

Specific Company Websites

Sites to Find Jobs

• Freelance Work is where you acquire clients for yourself and complete work for them on a project-by-project basis.

Pros:• You’re the boss• Choose who you work with• Set your own standards• Can Work Internationally using

Internet

Cons:• Less Security• Highly dependent on market fluctuations• Competition on Freelance Platforms

Freelance

Upwork.comFiverr.comPeopleperhour.comFreelancer.com

Set up your own website

Sites to Find Freelance Work

Even if you want to get an employed job, get on as many freelance sites as you can!

You can start pitching for jobs and projects and then add them to your CV!

This is actually how a lot of Data Scientists get started! :D

Super Ninja Tip!

So far did we do a good job with just part 1 that even if you left this room right now… (but don’t) you could use some of our advice to accelerate your path into data science starting Right now?

Let me know in the chat by typing Yes (or no)

So let me ask you…

“ I Love Data Science Launch! You guys make it all so easy! I thought to myself there is NO WAY it’s that Easy!

Its amazing work you guys are doing and I can’t wait to start working with clients.”

Picture Removed for Privacy Purposes

2 Training Courses

How to analyse Data Using the 5 Families of Machine Learning Techniques.

The Step by Step Proven Process for Analysing Data Perfectly Every Time Using Statistics

• Our complete Step-by-Step System for statistical analysis.

• Designed to take you by the hand (as in step1, step2, step 3) to analyse data using statistical methods. No ambiguity or confusion.

What exactly is it?

• 4 part step-by-step digital course that walks you through the process of analysing data using statistical methods.

• Full 1080p HD Video with theory and practice lectures so nothing is left out (7 hours of valuable content)

What exactly is it?

• 5 Part course that walks you each of the 5 families of Machine Learning Algorithms, showing you the most important algorithms from each!

• Practicals and Theory Lectures + Projects for you to try out!

$497 Value

Bonus #1

•Preparation Checklist + Roadmaps

The Statistical Analysis Preparation Checklist

• It has 8 Simple questions in it that you will answer using the videos in part 1 of the course.

• The answers to these questions determine the properties of your data.

• The properties of your data will inform which statistical test you need to run to analyse it.

• This guide will walk you though this process so you don’t miss anything and will match perfectly with the course

The Test Selector Roadmaps

• Once you know your data’s properties you need to select the correct test.

• There are hundreds of statistical tests and memorising the use-case for all of them is a nightmare!

The Test Selector Roadmaps

• So what you do instead is use the roadmaps!

• The roadmaps are sets of Yes or No questions about your data’s properties.

• “Does it have property X, Yes or No?”

The Test Selector Roadmaps

• Use the checklist to answer them!

• This will show you the exact tests you need to run

• No need to memorise all the tests.

• Just use the roadmaps, watch the appropriate video and you’re done!

$497 Value

$297 Value

Bonus #2

• Code Swipes

$497 Value

$297 Value

$197

Bonus #3

• The Professional Report Template

$497 Value

$297 Value

$197

$97 Value

Bonus #4

•Epic Project Pack

•PROJECT BASED LEARNING

$497 Value

$297 Value

$197

$97 Value

$97 Value

Bonus #5

•The Perfect Cover Letter

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

Bonus Training Course: The Python Bible

The Python Bible is Ziyad’s famous Python Course that teaches you the fundamentals of python programming.

The course has over 15,000 students in 143 countries around the world.

Bonus Training Course: The Python Bible

Will take you from absolute NO programming experience (like ever) all the way to confidently writing your own programs

You will build 11 python projects and be able to program confidently in Python by the end of the course EVEN IF you have never coded before IN YOUR LIFE.

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

$197 Value

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

$197 Value

Total Value: $1,676

TYPE IT IN THE CHAT

$9167

$1,676

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

$197 Value

$197 Value

Total Value: $1,676 - SAVE: $1,179FINAL SALE PRICE (Public) $497

Save $200 TODAY on this Webinar

Special Today Only: $297

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

$197 Value

Value: $1,676 - SAVE: $1,179

Normal Price: $497This Event Only: Save: $200

Special Today Only: $297

Here is What to do now

Click Here

Scroll Down + Click Here

Step 3: Create an Account

Not Done Yet ;)

Step 5: Enjoy!

Inside the Course

Inside the Course

want to work with You.

communication skills

amazing at analysing datacan’t explain the results

LOVE

$497 Value

$297 Value

$197

$97 Value

$97 Value

$97 Value

$197 Value

$497 Value

$297 Value$197

$97 Value

$97 Value$97 Value

$197 Value

$197 Value

Value: $1,676 - SAVE: $1,179

Normal Price: $497This Event Only: Save: $200

Special Today Only: $297

Here is What to do now

Click Here

Scroll Down + Click Here

Step 3: Create an Account

Not Done Yet ;)

Step 5: Enjoy!

Inside the Course

Inside the Course

Recommended