Reading and Writing Data Part I

Roger D. Peng, Associate Professor of Biostatistics

Johns Hopkins Bloomberg School of Public Health
Reading Data
There are a few principal functions for reading data into R.
read.table, read.csv, for reading tabular data
readLines, for reading lines of a text file
source, for reading in R code files (inverse of dump)
dget, for reading in R code files (inverse of dput)
load, for reading in saved workspaces
unserialize, for reading single R objects in binary form

Writing Data
There are analogous functions for writing data to files:
write.table
writeLines
dump
dput
save
serialize
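
The pairing between the reading and writing functions can be sketched as a set of round trips. This is a minimal illustration only: the data frame, the vector x, and the temporary file names are hypothetical and are not part of the lecture.

```r
# Hypothetical example data; object and file names are illustrative only.
df <- data.frame(id = 1:3, score = c(2.5, 3.0, 4.25))

# write.table / read.table: tabular text round trip.
tab_file <- tempfile(fileext = ".txt")
write.table(df, tab_file, row.names = FALSE)
df_tab <- read.table(tab_file, header = TRUE)

# writeLines / readLines: plain lines of text.
txt_file <- tempfile(fileext = ".txt")
writeLines(c("first line", "second line"), txt_file)
lines_back <- readLines(txt_file)

# dput / dget: a single R object stored as R code.
dput_file <- tempfile(fileext = ".R")
dput(df, file = dput_file)
df_dput <- dget(dput_file)

# dump / source: one or more named objects stored as R code.
dump_file <- tempfile(fileext = ".R")
x <- c(a = 1, b = 2)
dump(c("x", "df"), file = dump_file)
rm(x)
source(dump_file, local = TRUE)   # recreates x (and df) in this environment

# save / load: binary storage of named objects.
rda_file <- tempfile(fileext = ".rda")
save(df, x, file = rda_file)
rm(x)
load(rda_file)                    # restores x (and df) by name

# serialize / unserialize: one object as a raw (binary) vector.
raw_bytes <- serialize(df, connection = NULL)
df_raw <- unserialize(raw_bytes)
```

Note that dget and unserialize return a value that you assign yourself, whereas source and load recreate objects under the names they were stored with.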

Reading Data Files with read.table
The read.table function is one of the most commonly used functions for reading data. It has a few
important arguments:
file, the name of a file, or a connection
header, logical indicating if the file has a header line
sep, a string indicating how the columns are separated
colClasses, a character vector indicating the class of each column in the dataset
nrows, the number of rows in the dataset
comment.char, a character string indicating the comment character
skip, the number of lines to skip from the beginning
stringsAsFactors, logical indicating whether character variables should be coded as factors (the default is FALSE since R 4.0.0)
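
As a rough sketch, the call below sets every argument listed above explicitly. The file contents, the ";" separator, and the "%" comment character are made up for this illustration.

```r
# Build a small illustrative file (hypothetical contents).
path <- tempfile(fileext = ".txt")
writeLines(c(
  "exported by a hypothetical tool",   # junk first line, skipped below
  "name;age;height",
  "% this line is a comment",
  "ann;34;1.70",
  "bob;28;1.82"
), path)

people <- read.table(
  file = path,                # file name (a connection also works)
  header = TRUE,              # first line read holds the column names
  sep = ";",                  # columns are separated by semicolons
  colClasses = c("character", "integer", "numeric"),
  nrows = 2,                  # number of data rows to read
  comment.char = "%",         # lines starting with % are ignored
  skip = 1,                   # skip the junk line at the top
  stringsAsFactors = FALSE    # keep character columns as character
)
str(people)
```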

read.table
For small to moderately sized datasets, you can usually call read.table without specifying any other
arguments:
data <- read.table("foo.txt")
R will automatically
skip lines that begin with a #
figure out how many rows there are (and how much memory needs to be allocated)
figure out what type of variable is in each column of the table
Telling R all these things directly makes R run faster and more efficiently.
read.csv is identical to read.table except that its defaults suit CSV files: most notably, the
separator is a comma and header is TRUE.
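
The default behaviour can be checked on a tiny file. This is a sketch with made-up file contents, not an example from the lecture.

```r
# A small whitespace-separated file with a comment line (hypothetical data).
path <- tempfile(fileext = ".txt")
writeLines(c(
  "# a comment line, skipped by default",
  "1 2.5 a",
  "2 3.5 b"
), path)

data <- read.table(path)
dim(data)             # the comment line is not counted as a row
sapply(data, class)   # classes inferred per column

# read.csv compared with read.table given matching arguments.
csv_path <- tempfile(fileext = ".csv")
writeLines(c("x,y", "1,2", "3,4"), csv_path)
from_csv   <- read.csv(csv_path)
from_table <- read.table(csv_path, header = TRUE, sep = ",")
all.equal(from_csv, from_table)
```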

Reading in Larger Datasets with read.table
With much larger datasets, doing the following things will make your life easier and will prevent R
from choking.
Read the help page for read.table, which contains many hints.
Make a rough calculation of the memory required to store your dataset. If the dataset is larger
than the amount of RAM on your computer, you can probably stop right here.
Set comment.char = "" if there are no commented lines in your file.

Use the colClasses argument. Specifying this option instead of using the default can make
read.table run MUCH faster, often twice as fast. In order to use this option, you have to know the
class of each column in your data frame. If all of the columns are numeric, for example, then
you can just set colClasses = "numeric". A quick and dirty way to figure out the classes of
each column is the following:

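# Read only the first 100 rows and record the class R infers for each column
initial <- read.table("datatable.txt", nrows = 100)
classes <- sapply(initial, class)
# Read the full file, supplying the classes so R does not have to infer them
tabAll <- read.table("datatable.txt",
                     colClasses = classes)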
Set nrows. This doesn't make R run faster but it helps with memory usage. A mild overestimate
is okay. You can use the Unix tool wc to calculate the number of lines in a file.
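
The hints above can be combined as in the following sketch. The helper count_lines is hypothetical and stands in for wc on systems that lack it; the generated file is only a small stand-in for a genuinely large dataset.

```r
# Illustrative file standing in for a large dataset (hypothetical contents).
path <- tempfile(fileext = ".txt")
write.table(data.frame(a = 1:1000, b = runif(1000), c = "x"),
            path, row.names = FALSE)

# Hypothetical helper: count lines in chunks so the whole file is never
# held in memory at once.
count_lines <- function(path, chunk = 10000L) {
  con <- file(path, open = "r")
  on.exit(close(con))
  n <- 0L
  repeat {
    got <- length(readLines(con, n = chunk))
    if (got == 0L) break
    n <- n + got
  }
  n
}

n_lines <- count_lines(path)   # includes the header line

# Infer column classes from the first 100 rows only.
initial <- read.table(path, header = TRUE, nrows = 100)
classes <- sapply(initial, class)

# Full read with classes, row count, and comment handling stated up front.
tabAll <- read.table(path, header = TRUE,
                     colClasses = classes,
                     nrows = n_lines,      # mild overestimate (header counted)
                     comment.char = "")    # file has no commented lines
dim(tabAll)
```

Inferring classes from only the first 100 rows assumes those rows are representative; a column that looks integer at the top of the file but contains decimals or text further down will cause the full read to fail.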

Know Thy System
In general, when using R with larger datasets, it's useful to know a few things about your system.
How much memory is available?
What other applications are in use?
Are there other users logged into the same system?
What operating system?
Is the OS 32 or 64 bit?
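
Some of these questions can be answered from inside R, as in this minimal sketch. It covers only the operating system and the 32/64-bit question; available memory, other running applications, and other logged-in users are best checked with your operating system's own tools, which this sketch does not attempt.

```r
# Operating system and machine details as reported to R.
info <- Sys.info()
info[c("sysname", "release", "machine")]

# Pointer size in bytes: 8 on a 64-bit build of R, 4 on a 32-bit build.
bits <- 8 * .Machine$sizeof.pointer
bits

# Architecture that this build of R targets.
R.version$arch
```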

Calculating Memory Requirements
I have a data frame with 1,500,000 rows and 120 columns, all of which are numeric data. Roughly,
how much memory is required to store this data frame?
1,500,000 × 120 × 8 bytes/numeric
= 1,440,000,000 bytes
= 1,440,000,000 / 2^20 bytes/MB
= 1,373.29 MB
= 1.34 GB
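
The same arithmetic can be reproduced in R. The variable names are illustrative, and the object.size check on a much smaller data frame is an added sanity check rather than part of the original calculation.

```r
n_rows <- 1500000
n_cols <- 120
bytes_per_numeric <- 8

bytes <- n_rows * n_cols * bytes_per_numeric   # raw storage for the values
mb <- bytes / 2^20                             # bytes per MB
gb <- mb / 2^10                                # MB per GB
round(c(MB = mb, GB = gb), 2)

# Sanity check on a much smaller numeric data frame (1,000 rows x 120 columns):
# the reported size should be close to 1000 * 120 * 8 bytes plus some overhead.
small <- as.data.frame(matrix(0, nrow = 1000, ncol = 120))
object.size(small)
```

This counts only the raw numeric values; the data frame itself carries a small amount of extra overhead for column names and attributes.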
