An Introduction to Statistics with Python: With Applications in the Life Sciences

Ebook434 pages3 hours

An Introduction to Statistics with Python: With Applications in the Life Sciences

Name: An Introduction to Statistics with Python: With Applications in the Life Sciences
Author: Thomas Haslwanter
ISBN: 9783319283166

By Thomas Haslwanter

Rating: 0 out of 5 stars

()

Read preview

About this ebook

This textbook provides an introduction to the free software Python and its use for statistical data analysis. It covers common statistical tests for continuous, discrete and categorical data, as well as linear regression analysis and topics from survival analysis and Bayesian statistics. Working code and data for Python solutions for each test, together with easy-to-follow Python examples, can be reproduced by the reader and reinforce their immediate understanding of the topic. With recent advances in the Python ecosystem, Python has become a popular language for scientific computing, offering a powerful environment for statistical data analysis and an interesting alternative to R. The book is intended for master and PhD students, mainly from the life and medical sciences, with a basic knowledge of statistics. As it also provides some statistics background, the book can be used by anyone who wants to perform a statistical data analysis.

Skip carousel

LanguageEnglish

PublisherSpringer

Release dateJul 20, 2016

ISBN9783319283166

Author

Thomas Haslwanter

Related authors

Skip carousel

Related to An Introduction to Statistics with Python

Titles in the series (2)

Skip carousel

An Introduction to Statistics with Python: With Applications in the Life Sciences
Ebook
An Introduction to Statistics with Python: With Applications in the Life Sciences
byThomas Haslwanter
Rating: 0 out of 5 stars
0 ratings
R for SAS and SPSS Users
Ebook
R for SAS and SPSS Users
byRobert A. Muenchen
Rating: 4 out of 5 stars
4/5

Related ebooks

Skip carousel

A Python Data Analyst’s Toolkit: Learn Python and Python-based Libraries with Applications in Data Analysis and Statistics
Ebook
A Python Data Analyst’s Toolkit: Learn Python and Python-based Libraries with Applications in Data Analysis and Statistics
byGayathri Rajagopalan
Rating: 0 out of 5 stars
0 ratings
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
Ebook
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
byTaweh Beysolow II
Rating: 0 out of 5 stars
0 ratings
R High Performance Programming
Ebook
R High Performance Programming
byAloysius Lim
Rating: 4 out of 5 stars
4/5
Instant Heat Maps in R How-to
Ebook
Instant Heat Maps in R How-to
bySebastian Raschka
Rating: 0 out of 5 stars
0 ratings
Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python
Ebook
Hands-On Data Analysis with Pandas: Efficiently perform data collection, wrangling, analysis, and visualization using Python
byStefanie Molin
Rating: 0 out of 5 stars
0 ratings
Mastering Python Data Analysis
Ebook
Mastering Python Data Analysis
byMagnus Vilhelm Persson
Rating: 0 out of 5 stars
0 ratings
Text Analytics with Python: A Practitioner's Guide to Natural Language Processing
Ebook
Text Analytics with Python: A Practitioner's Guide to Natural Language Processing
byDipanjan Sarkar
Rating: 0 out of 5 stars
0 ratings
Learning Data Mining with Python - Second Edition
Ebook
Learning Data Mining with Python - Second Edition
byRobert Layton
Rating: 0 out of 5 stars
0 ratings
Python
Ebook
Python
byjustin wokocha
Rating: 0 out of 5 stars
0 ratings
R Object-oriented Programming
Ebook
R Object-oriented Programming
byKelly Black
Rating: 3 out of 5 stars
3/5
Learning NumPy Array
Ebook
Learning NumPy Array
byIvan Idris
Rating: 0 out of 5 stars
0 ratings
Mastering Python Regular Expressions
Ebook
Mastering Python Regular Expressions
byVictor Romero
Rating: 5 out of 5 stars
5/5
Introduction to Python 2018 Edition
Ebook
Introduction to Python 2018 Edition
byMark Lassoff
Rating: 4 out of 5 stars
4/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
Advanced Python Development: Using Powerful Language Features in Real-World Applications
Ebook
Advanced Python Development: Using Powerful Language Features in Real-World Applications
byMatthew Wilkes
Rating: 0 out of 5 stars
0 ratings
Python from the Very Beginning
Ebook
Python from the Very Beginning
byJohn Whitington
Rating: 0 out of 5 stars
0 ratings
Web Scraping with Python
Ebook
Web Scraping with Python
byRichard Lawson
Rating: 4 out of 5 stars
4/5
Bayesian Analysis with Python
Ebook
Bayesian Analysis with Python
byOsvaldo Martin
Rating: 5 out of 5 stars
5/5
Learn Data Analysis with Python: Lessons in Coding
Ebook
Learn Data Analysis with Python: Lessons in Coding
byA.J. Henley
Rating: 0 out of 5 stars
0 ratings
Haskell from Another Site
Ebook
Haskell from Another Site
byJagoda Górska
Rating: 0 out of 5 stars
0 ratings
Visualizing Graph Data
Ebook
Visualizing Graph Data
byCorey Lanum
Rating: 0 out of 5 stars
0 ratings
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
Ebook
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
byPramod Singh
Rating: 0 out of 5 stars
0 ratings
Advanced Analytics with Transact-SQL: Exploring Hidden Patterns and Rules in Your Data
Ebook
Advanced Analytics with Transact-SQL: Exploring Hidden Patterns and Rules in Your Data
byDejan Sarka
Rating: 0 out of 5 stars
0 ratings
Simple Data Science (R)
Ebook
Simple Data Science (R)
byNarayana Nemani
Rating: 5 out of 5 stars
5/5
Think Like a Data Scientist: Tackle the data science process step-by-step
Ebook
Think Like a Data Scientist: Tackle the data science process step-by-step
byBrian Godsey
Rating: 0 out of 5 stars
0 ratings
Visual Studio Code for Python Programmers
Ebook
Visual Studio Code for Python Programmers
byApril Speight
Rating: 0 out of 5 stars
0 ratings
Learn Data Science Using SAS Studio: A Quick-Start Guide
Ebook
Learn Data Science Using SAS Studio: A Quick-Start Guide
byEngy Fouda
Rating: 0 out of 5 stars
0 ratings
Practical Data Science with Python 3: Synthesizing Actionable Insights from Data
Ebook
Practical Data Science with Python 3: Synthesizing Actionable Insights from Data
byErvin Varga
Rating: 0 out of 5 stars
0 ratings
Data Visualization: Representing Information on Modern Web
Ebook
Data Visualization: Representing Information on Modern Web
byAndy Kirk
Rating: 5 out of 5 stars
5/5
Swarm Intelligence
Ebook
Swarm Intelligence
byRussell C. Eberhart
Rating: 4 out of 5 stars
4/5

Applications & Software For You

Skip carousel

Adobe Illustrator: A Complete Course and Compendium of Features
Ebook
Adobe Illustrator: A Complete Course and Compendium of Features
byJason Hoppe
Rating: 0 out of 5 stars
0 ratings
The Best Hacking Tricks for Beginners
Ebook
The Best Hacking Tricks for Beginners
byRAJ TYAGI
Rating: 4 out of 5 stars
4/5
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
Blender 3D Basics Beginner's Guide Second Edition
Ebook
Blender 3D Basics Beginner's Guide Second Edition
byGordon Fisher
Rating: 5 out of 5 stars
5/5
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
Ebook
Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1
byKevin Clark
Rating: 5 out of 5 stars
5/5
Learn Power BI: A beginner's guide to developing interactive business intelligence solutions using Microsoft Power BI
Ebook
Learn Power BI: A beginner's guide to developing interactive business intelligence solutions using Microsoft Power BI
byGreg Deckler
Rating: 5 out of 5 stars
5/5
Adobe Photoshop: A Complete Course and Compendium of Features
Ebook
Adobe Photoshop: A Complete Course and Compendium of Features
byStephen Laskevitch
Rating: 5 out of 5 stars
5/5
Six Figure Blogging In 3 Months
Ebook
Six Figure Blogging In 3 Months
byShekhar Mishra
Rating: 4 out of 5 stars
4/5
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
Ebook
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
byAlex Parkinson
Rating: 4 out of 5 stars
4/5
Adobe InDesign CC: A Complete Course and Compendium of Features
Ebook
Adobe InDesign CC: A Complete Course and Compendium of Features
byStephen Laskevitch
Rating: 0 out of 5 stars
0 ratings
How Do I Do That In InDesign?
Ebook
How Do I Do That In InDesign?
byDave Clayton
Rating: 5 out of 5 stars
5/5
Audio Engineering: Know It All
Ebook
Audio Engineering: Know It All
byDouglas Self
Rating: 5 out of 5 stars
5/5
Hacks for TikTok: 150 Tips and Tricks for Editing and Posting Videos, Getting Likes, Keeping Your Fans Happy, and Making Money
Ebook
Hacks for TikTok: 150 Tips and Tricks for Editing and Posting Videos, Getting Likes, Keeping Your Fans Happy, and Making Money
byKyle Brach
Rating: 5 out of 5 stars
5/5
iPhone Photography For Dummies
Ebook
iPhone Photography For Dummies
byMark Hemmings
Rating: 0 out of 5 stars
0 ratings
Essential Affinity Photo 2
Ebook
Essential Affinity Photo 2
byRobin Whalley
Rating: 0 out of 5 stars
0 ratings
The Most Concise Step-By-Step Guide To ChatGPT Ever
Ebook
The Most Concise Step-By-Step Guide To ChatGPT Ever
byG.A. Pimpleton
Rating: 3 out of 5 stars
3/5
Hands-On Motion Graphics with Adobe After Effects CC: Develop your skills as a visual effects and motion graphics artist
Ebook
Hands-On Motion Graphics with Adobe After Effects CC: Develop your skills as a visual effects and motion graphics artist
byDavid Dodds
Rating: 0 out of 5 stars
0 ratings
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
Ebook
Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer.
byGwendolyn Faraday
Rating: 5 out of 5 stars
5/5
Mastering QuickBooks 2020: The ultimate guide to bookkeeping and QuickBooks Online
Ebook
Mastering QuickBooks 2020: The ultimate guide to bookkeeping and QuickBooks Online
byCrystalynn Shelton
Rating: 0 out of 5 stars
0 ratings
FL Studio Cookbook
Ebook
FL Studio Cookbook
byShaun Friedman
Rating: 4 out of 5 stars
4/5
Hilarious Jokes for Minecrafters: Mobs, Creepers, Skeletons, and More
Ebook
Hilarious Jokes for Minecrafters: Mobs, Creepers, Skeletons, and More
byMichele C. Hollow
Rating: 1 out of 5 stars
1/5
iPhone Photography: A Ridiculously Simple Guide To Taking Photos With Your iPhone
Ebook
iPhone Photography: A Ridiculously Simple Guide To Taking Photos With Your iPhone
byScott La Counte
Rating: 0 out of 5 stars
0 ratings
Synthesizer Cookbook: How to Use Filters: Sound Design for Beginners, #2
Ebook
Synthesizer Cookbook: How to Use Filters: Sound Design for Beginners, #2
byScreech House
Rating: 3 out of 5 stars
3/5
GarageBand Basics: The Complete Guide to GarageBand: Music
Ebook
GarageBand Basics: The Complete Guide to GarageBand: Music
byAventuras De Viaje
Rating: 0 out of 5 stars
0 ratings
iPhone 14 Pro Max User Guide for Beginners and Seniors
Ebook
iPhone 14 Pro Max User Guide for Beginners and Seniors
byCharles J. Jones
Rating: 0 out of 5 stars
0 ratings
Logic Pro X For Dummies
Ebook
Logic Pro X For Dummies
byGraham English
Rating: 0 out of 5 stars
0 ratings
Vocal Rescue: Rediscover the Beauty, Power and Freedom in Your Singing
Ebook
Vocal Rescue: Rediscover the Beauty, Power and Freedom in Your Singing
byLois Alba
Rating: 4 out of 5 stars
4/5
Matlab: A Practical Introduction to Programming and Problem Solving
Ebook
Matlab: A Practical Introduction to Programming and Problem Solving
byDorothy C. Attaway
Rating: 4 out of 5 stars
4/5
GarageBand For Dummies
Ebook
GarageBand For Dummies
byBob LeVitus
Rating: 5 out of 5 stars
5/5
Memes for Music Producers: Top 100 Funny Memes for Musicians With Hilarious Jokes, Epic Fails & Crazy Comedy (Best Music Production Memes, EDM Memes, DJ Memes & FL Studio Memes 2021)
Ebook
Memes for Music Producers: Top 100 Funny Memes for Musicians With Hilarious Jokes, Epic Fails & Crazy Comedy (Best Music Production Memes, EDM Memes, DJ Memes & FL Studio Memes 2021)
byScreech House
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

#70 Beyond the Language Wars: R & Python for the Modern Data Scientist
Podcast episode
#70 Beyond the Language Wars: R & Python for the Modern Data Scientist
byDataFramed
0 ratings
0% found this document useful
#122 How Organizations Can Bridge the Data Literacy Gap
Podcast episode
#122 How Organizations Can Bridge the Data Literacy Gap
byDataFramed
0 ratings
0% found this document useful
It’s Not a Data Science Problem, It’s a Data Engineering Problem with Laurie Voss: Laurie Voss is a senior data analyst at Netlify, makers of a serverless platform designed to help teams build, deploy, and collaborate on web apps more effectively. Previously, Laurie worked as Chief Data Officer at npm, Inc., co-founded Snowball Factory,
Podcast episode
It’s Not a Data Science Problem, It’s a Data Engineering Problem with Laurie Voss: Laurie Voss is a senior data analyst at Netlify, makers of a serverless platform designed to help teams build, deploy, and collaborate on web apps more effectively. Previously, Laurie worked as Chief Data Officer at npm, Inc., co-founded Snowball Factory,
byScreaming in the Cloud
0 ratings
0% found this document useful
#121 — ChatGPT and How Generative AI is Augmenting Workflows
Podcast episode
#121 — ChatGPT and How Generative AI is Augmenting Workflows
byDataFramed
0 ratings
0% found this document useful
126 | FlowingData with Nathan Yau
Podcast episode
126 | FlowingData with Nathan Yau
byData Stories
0 ratings
0% found this document useful
#51 Francois Chollet - Intelligence and Generalisation
Podcast episode
#51 Francois Chollet - Intelligence and Generalisation
byMachine Learning Street Talk (MLST)
0 ratings
0% found this document useful
Unlocking The Power of Data Lineage In Your Platform with OpenLineage: An interview with Julien Le Dem about the OpenLineage specification and the opportunity that it offers for simplifying the tracking and analysis of data lineage across your data platform.
Podcast episode
Unlocking The Power of Data Lineage In Your Platform with OpenLineage: An interview with Julien Le Dem about the OpenLineage specification and the opportunity that it offers for simplifying the tracking and analysis of data lineage across your data platform.
byData Engineering Podcast
0 ratings
0% found this document useful
Working with Code: How Does a Coder at NASA Do His Job?
Podcast episode
Working with Code: How Does a Coder at NASA Do His Job?
byWorking
0 ratings
0% found this document useful
Getting Technical about the Data Center Revolution with Jonathan Friedmann, CEO of Speedata
Podcast episode
Getting Technical about the Data Center Revolution with Jonathan Friedmann, CEO of Speedata
byMaking Data Simple
0 ratings
0% found this document useful
386 The Top 10 Books To Learn Python - Simple Programmer Podcast: Have you ever wondered what are the best books to learn Python? "Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built in data structures, combined with dynamic typing and dynamic...
Podcast episode
386 The Top 10 Books To Learn Python - Simple Programmer Podcast: Have you ever wondered what are the best books to learn Python? "Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built in data structures, combined with dynamic typing and dynamic...
bySimple Programmer Podcast
0 ratings
0% found this document useful
Harnessing Python for Research: Scientific Applications of Python with Michael Kennedy: Still scrabbling with Excel? Consider Python language uses, says programmer and podcaster Michael Kennedy. A general programming language that is easy to use in multiple environments, Python programming is limitless and has numerous open source...
Podcast episode
Harnessing Python for Research: Scientific Applications of Python with Michael Kennedy: Still scrabbling with Excel? Consider Python language uses, says programmer and podcaster Michael Kennedy. A general programming language that is easy to use in multiple environments, Python programming is limitless and has numerous open source...
byFinding Genius Podcast
0 ratings
0% found this document useful
Supercharging Your Process Mining with Python
Podcast episode
Supercharging Your Process Mining with Python
byMining Your Business
0 ratings
0% found this document useful
Episode 77 - Open Source
Podcast episode
Episode 77 - Open Source
byThe Structural Engineering Podcast
0 ratings
0% found this document useful
#110 - Dane Hillard on Python packaging and effective developer tooling
Podcast episode
#110 - Dane Hillard on Python packaging and effective developer tooling
byPybites Podcast
0 ratings
0% found this document useful
91: Python 3.8 - there's a lot more new than most people are talking about: Python 3.8.0 final is live and ready to download. This episode covers what's new, including new language features, standard library changes, and optimizations. Not just the big stuff everyone's already talking about. But also some little things that will make programming Python even more fun and easy.
Podcast episode
91: Python 3.8 - there's a lot more new than most people are talking about: Python 3.8.0 final is live and ready to download. This episode covers what's new, including new language features, standard library changes, and optimizations. Not just the big stuff everyone's already talking about. But also some little things that will make programming Python even more fun and easy.
byTest and Code
0 ratings
0% found this document useful
756 Is PYTHON The FUTURE Of Programming? (With Rafeh Qazi From Clever Programmer) - Simple Programmer Podcast
Podcast episode
756 Is PYTHON The FUTURE Of Programming? (With Rafeh Qazi From Clever Programmer) - Simple Programmer Podcast
bySimple Programmer Podcast
0 ratings
0% found this document useful
Catching Up With Pyre, A Fast Type Checker For Python: An interview with Shannon Zhu about the Pyre project, and how to decide which type checker is right for you
Podcast episode
Catching Up With Pyre, A Fast Type Checker For Python: An interview with Shannon Zhu about the Pyre project, and how to decide which type checker is right for you
byThe Python Podcast.__init__
0 ratings
0% found this document useful
#043 - Becoming a prolific Python content provider
Podcast episode
#043 - Becoming a prolific Python content provider
byPybites Podcast
0 ratings
0% found this document useful
Four Most Commonly Asked Questions About AI with Dr. Jerry Smith: Dr. Jerry Smith welcomes you to another episode of AI Live and Unbiased to explore the breadth and depth of Artificial Intelligence and to encourage you to change the world, not just observe it! Dr. Jerry is talking today about questions and...
Podcast episode
Four Most Commonly Asked Questions About AI with Dr. Jerry Smith: Dr. Jerry Smith welcomes you to another episode of AI Live and Unbiased to explore the breadth and depth of Artificial Intelligence and to encourage you to change the world, not just observe it! Dr. Jerry is talking today about questions and...
byAI Live & Unbiased
0 ratings
0% found this document useful
#039 - Tackling big challenges and overcoming decision fatigue with Anthony Shaw
Podcast episode
#039 - Tackling big challenges and overcoming decision fatigue with Anthony Shaw
byPybites Podcast
0 ratings
0% found this document useful
#5 How to use Bayes in the biomedical industry, with Eric Ma
Podcast episode
#5 How to use Bayes in the biomedical industry, with Eric Ma
byLearning Bayesian Statistics
0 ratings
0% found this document useful
#041 - Letting the tire hit the road ... how much software planning do you really need?
Podcast episode
#041 - Letting the tire hit the road ... how much software planning do you really need?
byPybites Podcast
0 ratings
0% found this document useful
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
Podcast episode
Reflecting On The Past 6 Years Of Data Engineering: This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming changes.
byData Engineering Podcast
0 ratings
0% found this document useful
DIY AI-based image analysis for pathology. How DeePathology incorporated respect for pathologists' time into their software w/ Chen Sagiv
Podcast episode
DIY AI-based image analysis for pathology. How DeePathology incorporated respect for pathologists' time into their software w/ Chen Sagiv
byDigital Pathology Podcast
0 ratings
0% found this document useful
Bringing Pure Python to Apache Kafka (with Tomáš Neubauer)
Podcast episode
Bringing Pure Python to Apache Kafka (with Tomáš Neubauer)
byDeveloper Voices
0 ratings
0% found this document useful
QuPath - open source quantitative pathology not only for pathologists w/ Pete Bankhead, University of Edinburgh
Podcast episode
QuPath - open source quantitative pathology not only for pathologists w/ Pete Bankhead, University of Edinburgh
byDigital Pathology Podcast
0 ratings
0% found this document useful
Practical Advice On Using Python To Power A Business: An interview with Chris Moffitt about his work on the Practical Business Python site and his experiences using and teaching Python for automating business processes.
Podcast episode
Practical Advice On Using Python To Power A Business: An interview with Chris Moffitt about his work on the Practical Business Python site and his experiences using and teaching Python for automating business processes.
byThe Python Podcast.__init__
0 ratings
0% found this document useful
Py4Inf-08-Lists.mp3
Podcast episode
Py4Inf-08-Lists.mp3
byPython for Informatics's official Podcast.
0 ratings
0% found this document useful
15: Lean Software Development: An introduction to Lean Software Development
Podcast episode
15: Lean Software Development: An introduction to Lean Software Development
byTest and Code
0 ratings
0% found this document useful
Podcasting Continues to Grow 22% in Two Years: Because of My Podcast I'm on the Cover of a Comic Book Jeremy Dennis explains how he was able to commission a custom comic book cover thanks to his supporters. New Edison Research on Podcasting Edison Research did a telephone survey of 2000...
Podcast episode
Podcasting Continues to Grow 22% in Two Years: Because of My Podcast I'm on the Cover of a Comic Book Jeremy Dennis explains how he was able to commission a custom comic book cover thanks to his supporters. New Edison Research on Podcasting Edison Research did a telephone survey of 2000...
bySchool of Podcasting - Plan, Launch, Grow and Monetize Your Podcast
0 ratings
0% found this document useful

Skip carousel

Scikit-Learn: The Ultimate Python Library
APC
Article
Scikit-Learn: The Ultimate Python Library
Jul 15, 2019
4 min read
Rokoko Studio 2.0
3D World
Article
Rokoko Studio 2.0
Feb 23, 2021
1 min read
Want A Job In Data Science? You Might Have To Take A Standardized Test When Applying
Chicago Tribune
Article
Want A Job In Data Science? You Might Have To Take A Standardized Test When Applying
Jul 10, 2018
3 min read
Think Computers Are Less Biased Than People? Think Again.
The Christian Science Monitor
Article
Think Computers Are Less Biased Than People? Think Again.
Oct 3, 2018
Artificial intelligence is often billed as the answer to biased decisionmaking. But as long as people write that code, humans will have to wrestle with their own biases.
4 min read
A Quick Reference To… Symlinks
Linux Format
Article
A Quick Reference To… Symlinks
May 4, 2021
1 min read
33 Layer Tricks
Practical Photoshop
Article
33 Layer Tricks
Sep 28, 2020
6 min read
Install These Essential Apps
TechLife
Article
Install These Essential Apps
Nov 15, 2021
1 min read
Manipulate Data Like A Pro With Pandas
Linux Format
Article
Manipulate Data Like A Pro With Pandas
Jul 27, 2021
7 min read
2 The Use of Python in AI and ML
Techfastly
Article
2 The Use of Python in AI and ML
Nov 30, 2020
3 min read
Family History In The AI Era
Family Tree UK
Article
Family History In The AI Era
Apr 12, 2024
7 min read
This PC Does Not Exist
Maximum PC
Article
This PC Does Not Exist
May 23, 2023
7 min read
DJANGO Create A Database-driven Website
Linux Format
Article
DJANGO Create A Database-driven Website
Jun 4, 2019
The Django web framework was named after the famous guitarist Django Reinhardt and was first created by web developers at a small newspaper in Kansas. The main goals of Django is to enable fast development of complex websites with database needs. It
7 min read
Family History Software: An Introduction
Family Tree UK
Article
Family History Software: An Introduction
Feb 11, 2020
5 min read
Mailserver
Linux Format
Article
Mailserver
Jun 27, 2023
4 min read
ChatGPT Changed Everything. Now Its Follow-Up Is Here.
The Atlantic
Article
ChatGPT Changed Everything. Now Its Follow-Up Is Here.
Mar 14, 2023
6 min read
Make AI Work For You
Linux Format
Article
Make AI Work For You
Apr 2, 2024
8 min read
Mantis X8 Shooting Analysis System
Bow International
Article
Mantis X8 Shooting Analysis System
Aug 21, 2020
The Mantis Shooting System is an small device that mounts on your bow, records your movements while taking a shot, and plots them on a graph. That simple, huh? It's a little more complex than that. As we all know, archery is a game of consistent repe
3 min read
Overall Usefulness
Linux Format
Article
Overall Usefulness
Sep 22, 2020
3 min read
Polyfoto
Linux Format
Article
Polyfoto
Sep 21, 2021
1 min read
A Quick Reference To… Pip
Linux Format
Article
A Quick Reference To… Pip
Jul 2, 2019
Distros have package managers and repositories to make it easy for you to install software, but did you know that some programming languages also do this? Python is increasingly popular on Linux systems and it has a package manager called pip. This m
1 min read
PyScript – Bring Python Coding To The Web
APC
Article
PyScript – Bring Python Coding To The Web
Aug 8, 2022
4 min read
Why We Need To Fear The Risk Of AI Model Collapse
Evening Standard
Article
Why We Need To Fear The Risk Of AI Model Collapse
Dec 17, 2023
4 min read
As AI Language Skills Grow, So Do Scientists' Concerns
The Independent
Article
As AI Language Skills Grow, So Do Scientists' Concerns
Jul 17, 2022
5 min read
Note-taking Applications For Family History
Family Tree UK
Article
Note-taking Applications For Family History
Mar 10, 2023
7 min read
What Have Humans Just Unleashed?
The Atlantic
Article
What Have Humans Just Unleashed?
Mar 16, 2023
9 min read
Investigating with AI
Writing Magazine
Article
Investigating with AI
Jan 4, 2024
3 min read
Sigil
Linux Format
Article
Sigil
Oct 18, 2022
2 min read
ChatGPT Now Has A ‘Memory’
The Independent
Article
ChatGPT Now Has A ‘Memory’
Apr 30, 2024
1 min read
Workflow
Linux Format
Article
Workflow
Nov 17, 2020
3 min read
Django Packages
Linux Format
Article
Django Packages
Jun 4, 2019
Django Packages (https://djangopackages.org) is a site where web developers and enthusiasts can share their tools and creations. Before you start any project, go there and pick the closest thing to what you are trying to create. These packages are no
1 min read

Related categories

Skip carousel

Reviews for An Introduction to Statistics with Python

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

An Introduction to Statistics with Python - Thomas Haslwanter

The first part of the book presents an introduction to statistics based on Python. It is impossible to cover the whole language in 30 or 40 pages, so if you are a beginner, please see one of the excellent Python introductions available in the internet for details. Links are given below. This part is a kick-start for Python; it shows how to install Python under Windows, Linux, or MacOS, and goes step-by-step through documented programming examples. Tips are included to help avoid some of the problems frequently encountered while learning Python.

Because most of the data for statistical analysis are commonly obtained from text files, Excel files, or data preprocessed by Matlab, the second chapter presents simple ways to import these types of data into Python.

The last chapter of this part illustrates various ways of visualizing data in Python. Since the flexibility of Python for interactive data analysis has led to a certain complexity that can frustrate new Python programmers, the code samples presented in Chap. 3 for various types of interactive plots should help future Pythonistas avoid these problems.

Thomas HaslwanterAn Introduction to Statistics with PythonStatistics and Computing10.1007/978-3-319-28316-6_1

1. Why Statistics?

Thomas Haslwanter¹

(1)

School of Applied Health and Social Sciences, University of Applied Sciences Upper Austria, Linz, Austria

Statistics is the explanation of variance in the light of what remains unexplained.

Every day we are confronted with situations with uncertain outcomes, and must make decisions based on incomplete data: Should I run for the bus? Which stock should I buy? Which man should I marry? Should I take this medication? Should I have my children vaccinated? Some of these questions are beyond the realm of statistics (Which person should I marry?), because they involve too many unknown variables. But in many situations, statistics can help extract maximum knowledge from information given, and clearly spell out what we know and what we don’t know. For example, it can turn a vague statement like This medication may cause nausea, or You could die if you don’t take this medication into a specific statement like Three patients in one thousand experience nausea when taking this medication, or If you don’t take this medication, there is a 95 % chance that you will die.

Without statistics, the interpretation of data can quickly become massively flawed. Take, for example, the estimated number of German tanks produced during World War II, also known as the German Tank Problem. The estimate of the number of German tanks produced per month from standard intelligence data was 1,550; however, the statistical estimate based on the number of tanks observed was 327, which was very close to the actual production number of 342 (http://en.wikipedia.org/wiki/German_tank_problem).

Similarly, using the wrong tests can also lead to erroneous results.

In general, statistics will help to

Clarify the question.

Identify the variable and the measure of that variable that will answer that question.

Determine the required sample size.

Describe variation.

Make quantitative statements about estimated parameters.

Make predictions based on your data.

Reading the Book Statistics was originally invented—like so many other things—by the famous mathematician C.F. Gauss, who said about his own work, Ich habe fleissig sein müssen; wer es gleichfalls ist, wird eben so weit kommen. (I had to work hard; if you work hard as well, you, too, will be successful.). Just as reading a book about playing the piano won’t turn you into a great pianist, simply reading this book will not teach you statistical data analysis. If you don’t have your own data to analyze, you need to do the exercises included. Should you become frustrated or stuck, you can always check the sample Solutions provided at the end of the book.

Exercises Solutions to the exercises provided can be found at the end of the book. In my experience, very few people work through large numbers of examples on their own, so I have not included additional exercises in this book.

If the information here is not sufficient, additional material can be found in other statistical textbooks and on the web:

Books There are a number of good books on statistics. My favorite is Altman (1999): it does not dwell on computers and modeling, but gives an extremely useful introduction to the field, especially for life sciences and medical applications. Many formulations and examples in this manuscript have been taken from that book. A more modern book, which is more voluminous and, in my opinion, a bit harder to read, is Riffenburgh (2012). Kaplan (2009) provides a simple introduction to modern regression modeling. If you know your basic statistics, a very good introduction to Generalized Linear Models can be found in Dobson and Barnett (2008), which provides a sound, advanced treatment of statistical modeling.

WWW In the web, you will find very extensive information on statistics in English at

http://www.statsref.com/

http://www.vassarstats.net/

http://www.biostathandbook.com/

http://onlinestatbook.com/2/index.html

http://www.itl.nist.gov/div898/handbook/index.htm

A good German web page on statistics and regulatory issues is http://www.reiter1.com/.

I hope to convince you that Python provides clear and flexible tools for most of the statistical problems that you will encounter, and that you will enjoy using it.

References

Altman, D. G. (1999). Practical statistics for medical research. New York: Chapman & Hall/CRC.

Dobson, A. J., & Barnett, A. (2008). An introduction to generalized linear models (3rd ed.). Boca Raton: CRC Press.

Kaplan, D. (2009). Statistical modeling: A fresh approach. St Paul: Macalester College.

Riffenburgh, R. H. (2012). Statistics in medicine (3rd ed.). Amsterdam: Academic Press.

Thomas HaslwanterAn Introduction to Statistics with PythonStatistics and Computing10.1007/978-3-319-28316-6_2

2. Python

Thomas Haslwanter¹

(1)

School of Applied Health and Social Sciences, University of Applied Sciences Upper Austria, Linz, Austria

Python is a very popular open source programming language. At the time of writing, codeeval was rating Python the most popular language for the fourth year in a row (http://blog.codeeval.com/codeevalblog). There are three reasons why I have switched from other programming languages to Python:

It is the most elegant programming language that I know.

It is free.

It is powerful.

2.1 Getting Started

2.1.1 Conventions

In this book the following conventions will be used:

Text that is to be typed in at the computer is written in Courier font, e.g., plot(x,y).

Optional text in command-line entries is expressed with square brackets and underscores, e.g., [_InstallationDir_]\bin. (I use the underscores in addition, as sometimes the square brackets will be used for commands.)

Names referring to computer programs and applications are written in italics, e.g., IPython.

I will also use italics when introducing new terms or expressions for the first time.

Code samples are marked as follows:

A369821_1_En_2_Figa_HTML.gif Python code samples.

All the marked code samples are freely available, under http://www.quantlet.de.

Additional Python scripts (the listings of complete programs, as well as the Python code used to generate the figures) are available at github: https://github.com/thomas-haslwanter/statsintro_python.git, in the directory ISP (for Introduction to Statistics with Python). ISP contains the following subfolders:

Exercise_Solutions

contains the solutions to the exercises which are presented at the end of most chapters.

Listings

contains programs that are explicitly listed in this book.

Figures

lists all the code used to generate the remaining figures in the book.

Code_Quantlets

contains all the marked code samples, grouped by book-chapter.

Packages on github are called repositories , and can easily be copied to your computer: when git is installed on your computer, simply type

git clone [_RepositoryName_]

and the whole repository—code as well as data—will be cloned to your system.

(See Sect. 2.4.4 for more information on git, github and code-versioning.)

2.1.2 Distributions and Packages

2.1.2.1 a) Python Packages for Statistics

The Python core distribution contains only the essential features of a general programming language. For example, it does not even contain a specialized module for working efficiently with vectors and matrices! These specialized modules are being developed by dedicated volunteers. The relationship of the most important Python packages for statistical applications is delineated in Fig. 2.1.

A369821_1_En_2_Fig1_HTML.gif

Fig. 2.1

The structure of the most important Python packages for statistical applications

To facilitate the use of Python, the so-called Python distributions collect matching versions of the most important packages, and I strongly recommend using one of these distributions when getting started. Otherwise one can easily become overwhelmed by the huge number of Python packages available. My favorite Python distributions are

WinPython recommended for Windows users. At the time of writing, the latest version was 3.5.1.3 (newer versions also ok).https://winpython.github.io/

Anaconda by Continuum. For Windows, Mac, and Linux. Can be used to install Python 2.x and 3.x, even simultaneously! The latest Anaconda version at time of writing was 4.0.0 (newer versions also ok).https://store.continuum.io/cshop/anaconda/

Neither of these two distributions requires administrator rights. I am presently using WinPython, which is free and customizable. Anaconda has become very popular recently, and is free for educational purposes.

Unless you have a specific requirement for 64-bit versions, you may want to install a 32-bit version of Python: it facilitates many activities that require compilation of module parts, e.g., for Bayesian statistics (PyMC), or when you want to speed up your programs with Cython. Since all the Python packages required for this course are now available for Python 3.x, I will use Python 3 for this book. However, all the scripts included should also work for Python 2.7. Make sure that you use a current version of IPython/Jupyter (4.x), since the Jupyter Notebooks provided with this book won’t run on IPython 2.x.¹

The programs included in this book have been tested with Python 2.7.10 and 3.5.1, under Windows and Linux, using the following package versions:

ipython 4.1.2 … For interactive work.

numpy 1.11.0 … For working with vectors and arrays.

scipy 0.17.1 … All the essential scientific algorithms, including those for basic statistics.

matplotlib 1.5.1 … The de-facto standard module for plotting and visualization.

pandas 0.18.0 … Adds DataFrames (imagine powerful spreadsheets) to Python.

patsy 0.4.1 … For working with statistical formulas.

statsmodels 0.8.0 … For statistical modeling and advanced analysis.

seaborn 0.7.0 … For visualization of statistical data.

In addition to these fairly general packages, some specialized packages have also been used in the examples accompanying this book:

xlrd 0.9.4 … For reading and writing MS Excel files.

PyMC 2.3.6 … For Bayesian statistics, including Markov chain Monte Carlo simulations.

scikit-learn 0.17.1 … For machine learning.

scikits.bootstrap 0.3.2 … Provides bootstrap confidence interval algorithms for scipy.

lifelines 0.9.1.0 … Survival analysis in Python.

rpy2 2.7.4 … Provides a wrapper for R-functions in Python.

Most of these packages come either with the WinPython or Anaconda distributions, or can be installed easily using pip or conda. To get PyMC to run, you may need to install a C-compiler. On my Windows platform, I installed Visual Studio 15, and set the environment variable SET VS90COMNTOOLS=%VS14COMNTOOLS%.

To use R-function from within Python, you also have to install R. Like Python, R is available for free, and can be downloaded from the Comprehensive R Archive Network, the latest release at the time of writing being R-3.3.0 (http://cran.r-project.org/).

2.1.2.2 b) PyPI: The Python Package Index

The Python Package Index (PyPI) (Currently at https://pypi.python.org/pypi, but about to migrate to https://pypi.io) is a repository of software for the Python programming language. It currently contains more than 80,000 packages!

Packages from PyPI can be installed easily, from the Windows command shell (cmd) or the Linux terminal, with

pip install [_package_]

To update a package, use

pip install [_package_] -U

To get a list of all the Python packages installed on your computer, type

pip list

Anaconda uses conda, a more powerful installation manager. But pip also works with Anaconda.

2.1.3 Installation of Python

2.1.3.1 a) Under Windows

Neither WinPython nor Anaconda require administrator rights for installation.

WinPython

In the following, I assume that [_WinPythonDir_] is the installation directory for WinPython.

Tip: Do NOT install WinPython into the Windows program directory (typically C:\Program Files or C:\Program Files (x86)), because this typically leads to permission problems during the execution of WinPython.

Download WinPython from https://winpython.github.io/.

Run the downloaded .exe-file, and install WinPython into the [_WinPythonDir_] of your choice.

After the installation, make a change to your Windows Environment, by typing Win -> env -> Edit environment variables for your account:

Add[_WinPythonDir_]\python-3.5.1;[_WinPythonDir_] \python-3.5.1\Scripts\; to your PATH. (This makes Python and ipython accessible from the standard Windows command-line.)²

If you do have administrator rights, you should activate [_WinPythonDir_]\WinPython Control Panel.exe -> Advanced -> Register Distribution.(This associates .py-files with this Python distribution.)

Anaconda

Download Anaconda from https://store.continuum.io/cshop/anaconda/.

Follow the installation instructions from the webpage. During the installation, allow Anaconda to make the suggested modifications to your environment PATH.

After the installation: in the Anaconda Launcher, click update (besides the Apps), in order to ensure that you are running the latest version.

Installing Additional Packages

Important Note: When I have had difficulties installing additional packages, I have been saved more than once by the pre-compiled packages from Christoph Gohlke, available under http://www.lfd.uci.edu/~gohlke/pythonlibs/: from there you can download the [_xxx_x].whl file for your current version of Python, and then install it simply with pip install [_xxx_].whl.

2.1.3.2 b) Under Linux

The following procedure worked on Linux Mint 17.1:

Download Anaconda for Python 3.5 (I used the 64 bit version, since I have a 64-bit Linux Mint Installation).

Open terminal, and navigate to the location where you downloaded the file to.

Install Anaconda with bash Anaconda3-4.0.0-Linux-x86.sh

Update your Linux installation with sudo apt-get update

Notes

You do NOT need root privileges to install Anaconda, if you select a user writable install location, such as ~/Anaconda.

After the self extraction is finished, you should add the Anaconda binary directory to your PATH environment variable.

As all of Anaconda is contained in a single directory, uninstalling Anaconda is easy: you simply remove the entire install location directory.

If any problems remain, Mac and Unix users should look up Johansson’ installations tips:(https://github.com/jrjohansson/scientific-python-lectures).

2.1.3.3 c) Under Mac OS X

Downloading Anaconda for Mac OS X is simple. Just

go to continuum.io/downloads

choose the Mac installer (make sure you select the Mac OS X Python 3.x Graphical Installer), and follow the instructions listed beside this button.

After the installation: in the Anaconda Launcher, click update (besides the Apps), in order to ensure that you are running the latest version.

After the installation the Anaconda icon should appear on the desktop. No admin password is required. This downloaded version of Anaconda includes the Jupyter notebook, Jupyter qtconsole and the IDE Spyder.

To see which packages (e.g., numpy, scipy, matplotlib, pandas, etc.) are featured in your installation look up the Anaconda Package List for your Python version. For example, the Python-installer may not include seaborn. To add an additional package, e.g., seaborn, open the terminal, and enter pip install seaborn.

2.1.4 Installation of R and rpy2

If you have not used R previously, you can safely skip this section. However, if you are already an avid R used, the following adjustments will allow you to also harness the power of R from within Python, using the package rpy2.

2.1.4.1 a) Under Windows

Also R does not require administrator rights for installation. You can download the latest version (at the time of writing R3.0.0) from http://cran.r-project.org/, and install it into the [_RDir_] installation directory of your choice.

With WinPython

After the installation of R, add the following two variables to your Windows Environment, by typing Win -> env -> Edit environment variables for your account:

R_HOME=[_RDir_]\R-3.3.0

R_USER=[_YourLoginName_]

The first entry is required for rpy2. The last entry is not really necessary, just better style.

With Anaconda

Anaconda comes without rpy2. So after the installation of Anaconda and R, you should:

Get rpy2 from http://www.lfd.uci.edu/~gohlke/pythonlibs/: Christoph Gohlkes Unofficial Windows Binaries for Python Extension Packages are one of the mainstays of the Python community—Thanks a lot, Christoph!

Open the Anaconda command prompt

Install rpy2 with pip. In my case, the command was pip rpy2-2.6.0-cp35-none-win32.whl

2.1.4.2 b) Under Linux

After the installation of Anaconda, install R and rpy2 with conda install -c \url{https://conda.binstar.org/r} rpy2

2.1.5 Personalizing IPython/ Jupyter

When working on a new problem, I always start out with the Jupyter qtconsole (see Sect. 2.3). Once I have the individual steps working, I use the IPython command %history to get the sequence of commands I have used, and switch to an IDE (integrated development environment), typically Wing or Spyder (see below).

In the following, [_mydir_] has to be replaced with your home-directory (i.e., the directory that opens up when you run cmd in Windows, or terminal in Linux). And [_myname_] should be replaced by your name or your userID.

To start up IPython in a folder of your choice, and with personalized startup scripts, proceed as follows.

2.1.5.1 a) In Windows

Type Win+R, and start a command shell with cmd

In the newly created command shell, type ipython. (This will launch an ipython session, and create the directory [_mydir_]\.ipython).

Add the Variable IPYTHONDIR to your environment (see above), and set it to [_mydir_]\.ipython. This directory contains the startup-commands for your ipython-sessions.

Into the startup folder [_mydir_].ipython\profile_default\startup place a file with, e.g., the name 00_[_myname_].py, containing the startup commands that you want to execute every time that you launch ipython. My personal startup file contains the following lines:

import pandas as pd

import os

os.chdir(r’C:\[_mydir_]’)

This will import pandas, and start you working in the directory of your choice.

Note: since Windows uses \ to separate directories, but \ is also the escape character in strings, directory paths using a simple backslash have to be preceded by r, indicating raw strings.

Generate a file ipy.bat in mydir, containing

jupyter qtconsole

To see all Jupyter Notebooks that come with this book, for example, do the following:

Type Win+R, and start a command shell with cmd

Run the commands

cd [_ipynb-dir_]

jupyter notebook

Again, if you want, you can put this command sequence into a batch-file.

2.1.5.2 b) In Linux

Start a Linux terminal with the command terminal

In the newly created command shell, execute the following command

ipython

(This generates a folder. ipython)

Into the sub-folder .ipython/profile_default/startup, place a file with e.g., the name 00[_myname_].py, containing the lines

import pandas as pd

import os

os.chdir([_mydir_])

In your .bashrc file (which contains the startup commands for your shell-scripts), enter the lines

alias ipy=’jupyter qtconsole’

IPYTHONDIR=’~/.ipython’

To see all Jupyter Notebooks, do the following:

Go to [_mydir_]

Create the file ipynb.sh, containing the lines

#!/bin/bash

cd [wherever_you_have_the_ipynb_files]

jupyter notebook

Make the file executable, with chmod 755 ipynb.sh

Now you can start your IPython by just typing ipy, and the Jupyter Notebook by typing ipynb.sh

2.1.5.3 c) In Mac OS X

Start the Terminal either by manually opening Spotlight or the shortcut CMD + SPACE and entering Terminal and search for Terminal.

In Terminal,

Enjoying the preview?

Page 1 of 1

An Introduction to Statistics with Python: With Applications in the Life Sciences

About this ebook

Thomas Haslwanter

Related authors

Related to An Introduction to Statistics with Python

Titles in the series (2)

Related ebooks

Applications & Software For You

Related podcast episodes

Related articles

Related categories

Reviews for An Introduction to Statistics with Python

What did you think?

Book preview

An Introduction to Statistics with Python - Thomas Haslwanter

1. Why Statistics?

2. Python

2.1 Getting Started

2.1.1 Conventions

2.1.2 Distributions and Packages

2.1.3 Installation of Python

2.1.4 Installation of R and rpy2

2.1.5 Personalizing IPython/ Jupyter