“An expert is one who knows more and more about less and less until he knows absolutely everything about nothing.”

Reproducible Research…free online course from John Hopkins

Learn the concepts and tools behind reporting modern data analyses in a reproducible manner. This is the fifth course in the Johns Hopkins Data Science Specialization.

About the Course

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them.  The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations. Reproducibility allows for people to focus on the actual content of a data analysis, rather than on superficial details reported in a written summary. In addition, reproducibility makes an analysis more useful to others because the data and code that actually conducted the analysis are available. This course will focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results.

Course Syllabus

In this course you will learn to write a document using R markdown, integrate live R code into a literate statistical program, compile R markdown documents using knitr and related tools, and organize a data analysis so that it is reproducible and accessible to others.

Recommended Background

Course Format

There will be weekly video lectures,  quizzes, peer assessments.
As part of this class you will be required to set up a GitHub and an RPubs account. GitHub is a tool for collaborative code sharing and editing. RPubs is a web site for publishing data analyses from R. During this course and other courses in the Specialization you will be submitting links to files you publicly place in your GitHub account as part of peer evaluation. If you are concerned about preserving your anonymity you will need to set up an anonymous GitHub and RPubs accounts and be careful not to include any information you do not want made available to peer evaluators.


How do the courses in the Data Science Specialization depend on each other?
We have created a handy course dependency chart to help you see how the nine courses in the specialization depend on each other.
Will I get a Statement of Accomplishment after completing this class?
Yes. Students who successfully complete the class will receive a Statement of Accomplishment signed by the instructor.
What resources will I need for this class?
A computer is needed on which the R software environment can be installed (recent Mac, Windows, or Linux computers are sufficient).

How does this course fit into the Data Science Specialization?

This is the fifth course in the Specialization. We strongly recommend that you first takeThe Data Scientist’s Toolbox and R Programming before taking this course.


Post a Comment

About Blogger:

Hi,I,m Basim from Canada I,m physician and I,m interested in clinical research feild and web development.you are more welcome in our professional website.all contact forwarded to basimibrahim772@yahoo.com.

Let's Get Connected: Twitter | Facebook | Google Plus| linkedin

Subscribe to us