Introduction to Data Science

Location type
Logo Coursera
Provider rating: starstarstarstar_borderstar_border 6.3 Coursera has an average rating of 6.3 (out of 4 reviews)

Need more information? Get more details on the site of the provider.

Description

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

Join the data revolution. Companies are searching for data scientists. This specialized field demands multiple skills not easy to obtain through conventional curricula. Introduce yourself to the basics of data science and leave armed with practical experience programming massive databases.

About the Course

Commerce and research is being transformed by data-driven discovery and prediction. Skills required for data analytics at massive levels – scalable data management on and off the cloud, parallel algorithms, statistical modeling, and proficiency with a complex ecosystem of tools and platforms – span a variety of disciplines and are not easy to obtain through conventional curricula. Tour th…

Read the complete description

Frequently asked questions

There are no frequently asked questions yet. Send an Email to info@springest.com

Didn't find what you were looking for? See also: Data Mining, Algorithms, Databases, SQL, and Programming.

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

Join the data revolution. Companies are searching for data scientists. This specialized field demands multiple skills not easy to obtain through conventional curricula. Introduce yourself to the basics of data science and leave armed with practical experience programming massive databases.

About the Course

Commerce and research is being transformed by data-driven discovery and prediction. Skills required for data analytics at massive levels – scalable data management on and off the cloud, parallel algorithms, statistical modeling, and proficiency with a complex ecosystem of tools and platforms – span a variety of disciplines and are not easy to obtain through conventional curricula. Tour the basic techniques of data science, including both SQL and NoSQL solutions for massive data management (e.g., MapReduce and contemporaries), algorithms for data mining (e.g., clustering and association rule mining), and basic statistical modeling (e.g., logistic and non-linear regression).

About the Instructor(s)

Bill Howe is the Director of Research for Scalable Data Analytics at the UW eScience Institute and holds an Affiliate Assistant Professor appointment in Computer Science & Engineering, where he leads a group studying data management, analytics, and visualization systems for science applications. Howe has received awards from Microsoft Research and honors for papers in scientific data management, and serves on a number of program committees, organizing committees, and advisory boards in the area, including the advisory board of the Data Science certificate program at UW. He holds a Ph.D. in Computer Science from Portland State University and a Bachelor's degree in Industrial & Systems Engineering from Georgia Tech.

Course Syllabus

Specific Topics: * Data modeling: relations, key-value, trees, graphs, images, text 
* Relational algebra and parallel query processing 
* NoSQL systems, key-value stores 
* Tradeoffs of SQL, NoSQL, and NewSQL systems 
* Algorithm design in Hadoop (and MapReduce in general) 
* Basic statistical analysis at scale: sampling, regression 
* Introduction to data mining: clustering, association rules, decision trees 
* Case studies in analytics: social networking, bioinformatics, text processing

Recommended Background

You will need basic programming experience with Java or Python, and some familiarity with databases. The target audience is undergraduate students across disciplines who wish to build proficiency working with large datasets to perform predictive analytics.

Suggested Readings

There will be selected readings each week. Students may also find the following textbooks relevant for further reading: Mining of Massive Datasets (http://i.stanford.edu/~ullman/mmds.html), and Professional NoSQL (Wrox Programmer to Programmer).

Course Format

There will be a quiz and a programming assignment each week, as well as two exams. The assignments involving large datasets will be completed using Amazon Web Services or Microsoft Azure cloud services.

Provided by:

University: University of Washington

Instructor(s): Bill Howe

There are no reviews yet.

Share your review

Do you have experience with this course? Submit your review and help other people make the right choice. As a thank you for your effort we will donate $1.- to Stichting Edukans.

There are no frequently asked questions yet. Send an Email to info@springest.com