SlideShare uma empresa Scribd logo
1 de 3
Baixar para ler offline
Introduction to Data Exploration with R
STS 98
This is a gentle course on programming, basic statistics, and working with data.
We’ll begin with an introduction to the R programming language, and learn how to use
it for basic tasks such as loading a data set to inspect its structure. From there, we’ll
discuss what kinds of data are common in practice, what it means for something to be
“random,” and strategies for summarizing data. Visualization is a major component of
exploratory data analysis, so we’ll also spend several weeks learning what qualities make
visualizations effective, using R’s graphics and mapping libraries to practice these skills.
Finally, we’ll examine what makes data “messy” and how to clean it up.
Here’s the schedule of topics by week:
1 git and R 6 Midterm
2 R Navigation and Graphics 7 Messy Data
3 Basic Statistics 8 Maps and Data Shape
4 Grouped Data 9 Advanced Graphics
5 Customized Graphics / Review 10 Advanced Topics / Review
Contacts & Meeting Times
A small army has been mobilized to help you learn:
Name @ucdavis.edu Office Hours
AI Nick Ulle naulle F 1:00–3:00 Shields 360
Lead TA Shaheen Amirebrahimi samirebrahimi W 2:00–4:00 Shields 360
Clark Fitzgerald rcfitzgerald T 4:30–6:30 Shields 360
Shaun Geer sgeer M 5:00–7:00 Shields 360
TA Nick Alonzo nalonzo —
Somayh Saleh somsaleh —
Lectures are TTh 3:10–4:30pm in Wellman 26. Discussions are in Shields 360, at the
following times:
am pm
Monday 10:00 11:00 2:10 4:10
Wednesday 1:10
1
Introduction to Data Exploration with R
Grading
Assignments (50%) There will be two short homeworks and three projects. They will
be graded according to the class rubric, which emphasizes writing and graphics.
Writing is an important step in the data exploration process, to reflect on your
results and to present them to others. Use graphics to motivate and support your
writing. Graphics should never appear alone, without a written interpretation, and
should not be redundant or irrelevant. It’s your responsibility to be familiar with
the rubric.
Assignments will be submitted using git. See the lecture notes for details.
Exams (30%) There will be two exams, both in Wellman 26:
• A midterm on May 3
• A final on June 6 from 10:30am – 12:30pm
They will have written-answer and multiple choice questions based on material
from class. You will not be asked to write code on the exams, so you don’t need to
memorize the details every R function. However, you may be asked to read code
and discuss what it does. You may also be asked to reason about, interpret, or
critique results for a given data set. There will be no make-up exams unless you
have proof of a medical emergency. If you require SDC accommodations, let me
know before the end of week 2.
Participation (20%) It’s critical that you participate in class and on Piazza. Asking
and answering questions is the best way to avoid falling behind and verify your
understanding of the material. We’ll also have peer reviews at the end of some
assignments. These are an opportunity to get additional feedback on your work
and learn new things from your classmates.
Piazza (www.piazza.com) is the class’ online forum. All announcements will be
posted to Piazza. Plan to visit daily. You should also post questions about class
material on Piazza rather than emailing them to me or a TA (emails about
grading or private matters are okay). Typically you’ll get an answer much faster,
because everyone in the class has the opportunity to answer your question. Posting
to Piazza also allows others to benefit from your questions and answers.
Please be polite and respectful to others when using Piazza. Before posting, search
Piazza to see if your question has already been asked, because duplicate postings
make it more difficult to find answers. When you do post a question, explain the
context and, if possible, give a small example of what you mean. Otherwise, you
may be asked to clarify your question before receiving an answer.
2
Introduction to Data Exploration with R
Academic Honesty
Be honest at all times! All writing, graphics, and output must be your own original work.
However, for assignments you may:
• Discuss the problems with your classmates.
• Search for relevant information online.
• Use short pieces of code (≤ 20 lines) you find on Piazza or online, as long as you
cite the source. This does not include your classmates’ GitHub repositories.
The university code of academic conduct (sja.ucdavis.edu/cac.html) applies to this
class. One of the many things listed there is:
Know the rules - ignorance is no defense. Those who violate campus rules
regarding academic misconduct are subject to disciplinary sanctions, including
suspension and dismissal.
If you’re unsure whether something is allowed, please ask!
3

Mais conteúdo relacionado

Mais procurados

Searching internet presentation
Searching internet presentationSearching internet presentation
Searching internet presentationSuzanne Arafat
 
Math problem solving service
Math problem solving serviceMath problem solving service
Math problem solving serviceChaejungMaeng
 
9 17-16 - when recommendation systems go bad - rec sys
9 17-16 - when recommendation systems go bad - rec sys9 17-16 - when recommendation systems go bad - rec sys
9 17-16 - when recommendation systems go bad - rec sysEvan Estola
 
Attracting Women to Computing and Why it Matters
Attracting Women to Computing and Why it MattersAttracting Women to Computing and Why it Matters
Attracting Women to Computing and Why it MattersGail Carmichael
 
The Library And Your Dissertion
The Library And Your DissertionThe Library And Your Dissertion
The Library And Your DissertionClive McGoun
 
Modeling Webinar: Normalization - It's Not Your Friend... or Your Enemy
Modeling Webinar: Normalization - It's Not Your Friend... or Your EnemyModeling Webinar: Normalization - It's Not Your Friend... or Your Enemy
Modeling Webinar: Normalization - It's Not Your Friend... or Your EnemyDATAVERSITY
 
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013Joe Hall
 
Intro to Machine Learning & AI
Intro to Machine Learning & AIIntro to Machine Learning & AI
Intro to Machine Learning & AIMostafa Elsheikh
 
OpenML Reproducibility in Machine Learning ICML2017
OpenML Reproducibility in Machine Learning ICML2017OpenML Reproducibility in Machine Learning ICML2017
OpenML Reproducibility in Machine Learning ICML2017Joaquin Vanschoren
 
Moving Your Machine Learning Models to Production with TensorFlow Extended
Moving Your Machine Learning Models to Production with TensorFlow ExtendedMoving Your Machine Learning Models to Production with TensorFlow Extended
Moving Your Machine Learning Models to Production with TensorFlow ExtendedJonathan Mugan
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learningKoundinya Desiraju
 

Mais procurados (16)

Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)Machine Learning Algorithms (Part 1)
Machine Learning Algorithms (Part 1)
 
Searching internet presentation
Searching internet presentationSearching internet presentation
Searching internet presentation
 
Searching internet presentation
Searching internet presentationSearching internet presentation
Searching internet presentation
 
Math problem solving service
Math problem solving serviceMath problem solving service
Math problem solving service
 
9 17-16 - when recommendation systems go bad - rec sys
9 17-16 - when recommendation systems go bad - rec sys9 17-16 - when recommendation systems go bad - rec sys
9 17-16 - when recommendation systems go bad - rec sys
 
What do you do in class?
What do you do in class?What do you do in class?
What do you do in class?
 
Attracting Women to Computing and Why it Matters
Attracting Women to Computing and Why it MattersAttracting Women to Computing and Why it Matters
Attracting Women to Computing and Why it Matters
 
The Library And Your Dissertion
The Library And Your DissertionThe Library And Your Dissertion
The Library And Your Dissertion
 
Modeling Webinar: Normalization - It's Not Your Friend... or Your Enemy
Modeling Webinar: Normalization - It's Not Your Friend... or Your EnemyModeling Webinar: Normalization - It's Not Your Friend... or Your Enemy
Modeling Webinar: Normalization - It's Not Your Friend... or Your Enemy
 
Msr13 mistake
Msr13 mistake Msr13 mistake
Msr13 mistake
 
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013
Identifying Patterns for Fun, Profit, and Rankings - Search Exchange 2013
 
Computer science fundamental
Computer science fundamentalComputer science fundamental
Computer science fundamental
 
Intro to Machine Learning & AI
Intro to Machine Learning & AIIntro to Machine Learning & AI
Intro to Machine Learning & AI
 
OpenML Reproducibility in Machine Learning ICML2017
OpenML Reproducibility in Machine Learning ICML2017OpenML Reproducibility in Machine Learning ICML2017
OpenML Reproducibility in Machine Learning ICML2017
 
Moving Your Machine Learning Models to Production with TensorFlow Extended
Moving Your Machine Learning Models to Production with TensorFlow ExtendedMoving Your Machine Learning Models to Production with TensorFlow Extended
Moving Your Machine Learning Models to Production with TensorFlow Extended
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 

Destaque

00070942 derechos humanos
00070942 derechos humanos00070942 derechos humanos
00070942 derechos humanosJuan Bermudes
 
algunos problemas con la comida Denisse Gonzalez- KarynCortes
algunos problemas con la comida Denisse Gonzalez- KarynCortesalgunos problemas con la comida Denisse Gonzalez- KarynCortes
algunos problemas con la comida Denisse Gonzalez- KarynCortesdenisselml
 
Elementry sampling.PDF
Elementry sampling.PDFElementry sampling.PDF
Elementry sampling.PDFBarry Reynolds
 
Mobilidade  Urbana Sustentável - Unipalmares
Mobilidade  Urbana Sustentável - UnipalmaresMobilidade  Urbana Sustentável - Unipalmares
Mobilidade  Urbana Sustentável - UnipalmaresChico Macena
 
Will Lau Reference letter Scot Hobart
Will Lau Reference letter Scot HobartWill Lau Reference letter Scot Hobart
Will Lau Reference letter Scot HobartWill Lau
 
Reducción en el presupuesto Estatal para el año 2016
Reducción en el presupuesto Estatal para el año 2016Reducción en el presupuesto Estatal para el año 2016
Reducción en el presupuesto Estatal para el año 2016Miguel Ramírez
 
Calculo Diferencial Final
Calculo Diferencial FinalCalculo Diferencial Final
Calculo Diferencial Finalheribertotm
 
Tayla's letter of recommendation (Borja)
Tayla's letter of recommendation (Borja)Tayla's letter of recommendation (Borja)
Tayla's letter of recommendation (Borja)Tayla Mode
 
Sociedades por acciones simplificadas en Colombia
Sociedades por acciones simplificadas en ColombiaSociedades por acciones simplificadas en Colombia
Sociedades por acciones simplificadas en ColombiaMiguel Ramírez
 
Borrador proyecto final
Borrador proyecto finalBorrador proyecto final
Borrador proyecto finaldroiartzun
 

Destaque (20)

00070942 derechos humanos
00070942 derechos humanos00070942 derechos humanos
00070942 derechos humanos
 
algunos problemas con la comida Denisse Gonzalez- KarynCortes
algunos problemas con la comida Denisse Gonzalez- KarynCortesalgunos problemas con la comida Denisse Gonzalez- KarynCortes
algunos problemas con la comida Denisse Gonzalez- KarynCortes
 
Elementry sampling.PDF
Elementry sampling.PDFElementry sampling.PDF
Elementry sampling.PDF
 
Mobilidade  Urbana Sustentável - Unipalmares
Mobilidade  Urbana Sustentável - UnipalmaresMobilidade  Urbana Sustentável - Unipalmares
Mobilidade  Urbana Sustentável - Unipalmares
 
Moodle 1911 web
Moodle 1911 webMoodle 1911 web
Moodle 1911 web
 
certi 4
certi 4certi 4
certi 4
 
Will Lau Reference letter Scot Hobart
Will Lau Reference letter Scot HobartWill Lau Reference letter Scot Hobart
Will Lau Reference letter Scot Hobart
 
Reducción en el presupuesto Estatal para el año 2016
Reducción en el presupuesto Estatal para el año 2016Reducción en el presupuesto Estatal para el año 2016
Reducción en el presupuesto Estatal para el año 2016
 
Dok1
Dok1Dok1
Dok1
 
Parker letter
Parker letterParker letter
Parker letter
 
MODIRIAT_AMOOZESHGAH
MODIRIAT_AMOOZESHGAHMODIRIAT_AMOOZESHGAH
MODIRIAT_AMOOZESHGAH
 
4 months L&T
4 months L&T4 months L&T
4 months L&T
 
shell
shellshell
shell
 
Lode Profile.jpg
Lode Profile.jpgLode Profile.jpg
Lode Profile.jpg
 
Senior Certificate
Senior CertificateSenior Certificate
Senior Certificate
 
Calculo Diferencial Final
Calculo Diferencial FinalCalculo Diferencial Final
Calculo Diferencial Final
 
Tayla's letter of recommendation (Borja)
Tayla's letter of recommendation (Borja)Tayla's letter of recommendation (Borja)
Tayla's letter of recommendation (Borja)
 
Sociedades por acciones simplificadas en Colombia
Sociedades por acciones simplificadas en ColombiaSociedades por acciones simplificadas en Colombia
Sociedades por acciones simplificadas en Colombia
 
Borrador proyecto final
Borrador proyecto finalBorrador proyecto final
Borrador proyecto final
 
Relax_Overview
Relax_OverviewRelax_Overview
Relax_Overview
 

Semelhante a syllabus

How to start your data career
How to start your data careerHow to start your data career
How to start your data careerAdwait Bhave
 
Crime Benefits and Challenges.pdf
Crime Benefits and Challenges.pdfCrime Benefits and Challenges.pdf
Crime Benefits and Challenges.pdfsdfghj21
 
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the course
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the courseWF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the course
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the coursePenn State University
 
2webquest Srkthomas Statistics
2webquest Srkthomas Statistics2webquest Srkthomas Statistics
2webquest Srkthomas Statisticssrthomas
 
Descriptive Statistics in the NCAA
Descriptive Statistics in the NCAADescriptive Statistics in the NCAA
Descriptive Statistics in the NCAAsrkthomas
 
Webquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAWebquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAsrthomas
 
Three medical technologies essay.docx
Three medical technologies essay.docxThree medical technologies essay.docx
Three medical technologies essay.docxwrite5
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
Landing your first Data Science Job: The Technical Interview
Landing your first Data Science Job: The Technical InterviewLanding your first Data Science Job: The Technical Interview
Landing your first Data Science Job: The Technical InterviewAnidata
 
Technology and.pdf
Technology and.pdfTechnology and.pdf
Technology and.pdfsdfghj21
 
dolan_SAM_syllabus_spring_2017
dolan_SAM_syllabus_spring_2017dolan_SAM_syllabus_spring_2017
dolan_SAM_syllabus_spring_2017Sean Dolan
 
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabusSocial Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabusJakub Ruzicka
 
Web 2.0 lesson plan (t3)
Web 2.0 lesson plan (t3)Web 2.0 lesson plan (t3)
Web 2.0 lesson plan (t3)shamet1
 
Speech Expo Overview
Speech Expo OverviewSpeech Expo Overview
Speech Expo OverviewLaura Rebecca
 
ISSC362Course SummaryCourse ISSC362 Title IT Securit
ISSC362Course SummaryCourse  ISSC362 Title  IT SecuritISSC362Course SummaryCourse  ISSC362 Title  IT Securit
ISSC362Course SummaryCourse ISSC362 Title IT SecuritTatianaMajor22
 
Google Interview Prep Guide Software Engineer
Google Interview Prep Guide Software EngineerGoogle Interview Prep Guide Software Engineer
Google Interview Prep Guide Software EngineerLewis Lin 🦊
 
Modules module5mod5home.htmlmodule 5 homecomparing models
Modules module5mod5home.htmlmodule 5   homecomparing modelsModules module5mod5home.htmlmodule 5   homecomparing models
Modules module5mod5home.htmlmodule 5 homecomparing modelsPOLY33
 
Engaging Students in Distance Learning
Engaging Students in Distance LearningEngaging Students in Distance Learning
Engaging Students in Distance LearningS Gasson
 
Data science presentation
Data science presentationData science presentation
Data science presentationMSDEVMTL
 

Semelhante a syllabus (20)

How to start your data career
How to start your data careerHow to start your data career
How to start your data career
 
Crime Benefits and Challenges.pdf
Crime Benefits and Challenges.pdfCrime Benefits and Challenges.pdf
Crime Benefits and Challenges.pdf
 
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the course
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the courseWF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the course
WF ED 540, Fall Semester 2018, Class Meeting 1 - Intro to the course
 
2webquest Srkthomas Statistics
2webquest Srkthomas Statistics2webquest Srkthomas Statistics
2webquest Srkthomas Statistics
 
Descriptive Statistics in the NCAA
Descriptive Statistics in the NCAADescriptive Statistics in the NCAA
Descriptive Statistics in the NCAA
 
Webquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAWebquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAA
 
Three medical technologies essay.docx
Three medical technologies essay.docxThree medical technologies essay.docx
Three medical technologies essay.docx
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Landing your first Data Science Job: The Technical Interview
Landing your first Data Science Job: The Technical InterviewLanding your first Data Science Job: The Technical Interview
Landing your first Data Science Job: The Technical Interview
 
Technology and.pdf
Technology and.pdfTechnology and.pdf
Technology and.pdf
 
dolan_SAM_syllabus_spring_2017
dolan_SAM_syllabus_spring_2017dolan_SAM_syllabus_spring_2017
dolan_SAM_syllabus_spring_2017
 
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabusSocial Web: (Big) Data Mining | summer 2014/2015 course syllabus
Social Web: (Big) Data Mining | summer 2014/2015 course syllabus
 
Web 2.0 lesson plan (t3)
Web 2.0 lesson plan (t3)Web 2.0 lesson plan (t3)
Web 2.0 lesson plan (t3)
 
Speech Expo Overview
Speech Expo OverviewSpeech Expo Overview
Speech Expo Overview
 
ISSC362Course SummaryCourse ISSC362 Title IT Securit
ISSC362Course SummaryCourse  ISSC362 Title  IT SecuritISSC362Course SummaryCourse  ISSC362 Title  IT Securit
ISSC362Course SummaryCourse ISSC362 Title IT Securit
 
Google Interview Prep Guide Software Engineer
Google Interview Prep Guide Software EngineerGoogle Interview Prep Guide Software Engineer
Google Interview Prep Guide Software Engineer
 
Modules module5mod5home.htmlmodule 5 homecomparing models
Modules module5mod5home.htmlmodule 5   homecomparing modelsModules module5mod5home.htmlmodule 5   homecomparing models
Modules module5mod5home.htmlmodule 5 homecomparing models
 
0-Welcome.ppt
0-Welcome.ppt0-Welcome.ppt
0-Welcome.ppt
 
Engaging Students in Distance Learning
Engaging Students in Distance LearningEngaging Students in Distance Learning
Engaging Students in Distance Learning
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 

syllabus

  • 1. Introduction to Data Exploration with R STS 98 This is a gentle course on programming, basic statistics, and working with data. We’ll begin with an introduction to the R programming language, and learn how to use it for basic tasks such as loading a data set to inspect its structure. From there, we’ll discuss what kinds of data are common in practice, what it means for something to be “random,” and strategies for summarizing data. Visualization is a major component of exploratory data analysis, so we’ll also spend several weeks learning what qualities make visualizations effective, using R’s graphics and mapping libraries to practice these skills. Finally, we’ll examine what makes data “messy” and how to clean it up. Here’s the schedule of topics by week: 1 git and R 6 Midterm 2 R Navigation and Graphics 7 Messy Data 3 Basic Statistics 8 Maps and Data Shape 4 Grouped Data 9 Advanced Graphics 5 Customized Graphics / Review 10 Advanced Topics / Review Contacts & Meeting Times A small army has been mobilized to help you learn: Name @ucdavis.edu Office Hours AI Nick Ulle naulle F 1:00–3:00 Shields 360 Lead TA Shaheen Amirebrahimi samirebrahimi W 2:00–4:00 Shields 360 Clark Fitzgerald rcfitzgerald T 4:30–6:30 Shields 360 Shaun Geer sgeer M 5:00–7:00 Shields 360 TA Nick Alonzo nalonzo — Somayh Saleh somsaleh — Lectures are TTh 3:10–4:30pm in Wellman 26. Discussions are in Shields 360, at the following times: am pm Monday 10:00 11:00 2:10 4:10 Wednesday 1:10 1
  • 2. Introduction to Data Exploration with R Grading Assignments (50%) There will be two short homeworks and three projects. They will be graded according to the class rubric, which emphasizes writing and graphics. Writing is an important step in the data exploration process, to reflect on your results and to present them to others. Use graphics to motivate and support your writing. Graphics should never appear alone, without a written interpretation, and should not be redundant or irrelevant. It’s your responsibility to be familiar with the rubric. Assignments will be submitted using git. See the lecture notes for details. Exams (30%) There will be two exams, both in Wellman 26: • A midterm on May 3 • A final on June 6 from 10:30am – 12:30pm They will have written-answer and multiple choice questions based on material from class. You will not be asked to write code on the exams, so you don’t need to memorize the details every R function. However, you may be asked to read code and discuss what it does. You may also be asked to reason about, interpret, or critique results for a given data set. There will be no make-up exams unless you have proof of a medical emergency. If you require SDC accommodations, let me know before the end of week 2. Participation (20%) It’s critical that you participate in class and on Piazza. Asking and answering questions is the best way to avoid falling behind and verify your understanding of the material. We’ll also have peer reviews at the end of some assignments. These are an opportunity to get additional feedback on your work and learn new things from your classmates. Piazza (www.piazza.com) is the class’ online forum. All announcements will be posted to Piazza. Plan to visit daily. You should also post questions about class material on Piazza rather than emailing them to me or a TA (emails about grading or private matters are okay). Typically you’ll get an answer much faster, because everyone in the class has the opportunity to answer your question. Posting to Piazza also allows others to benefit from your questions and answers. Please be polite and respectful to others when using Piazza. Before posting, search Piazza to see if your question has already been asked, because duplicate postings make it more difficult to find answers. When you do post a question, explain the context and, if possible, give a small example of what you mean. Otherwise, you may be asked to clarify your question before receiving an answer. 2
  • 3. Introduction to Data Exploration with R Academic Honesty Be honest at all times! All writing, graphics, and output must be your own original work. However, for assignments you may: • Discuss the problems with your classmates. • Search for relevant information online. • Use short pieces of code (≤ 20 lines) you find on Piazza or online, as long as you cite the source. This does not include your classmates’ GitHub repositories. The university code of academic conduct (sja.ucdavis.edu/cac.html) applies to this class. One of the many things listed there is: Know the rules - ignorance is no defense. Those who violate campus rules regarding academic misconduct are subject to disciplinary sanctions, including suspension and dismissal. If you’re unsure whether something is allowed, please ask! 3