66 job interview questions for data scientistsįollow us on Twitter: | Popular Content on DSC.
17 short tutorials all data scientists should read (and practice).
#Million song dataset hadoop how to#
How to detect spurious correlations, and how to find the real ones.
Data Science Compared to 16 Analytic Disciplines.
13 New Trends in Big Data and Data Science.
Check out our FREE and successful Hadoop Starter Kit course at Udemy. We have been teaching Hadoop for several years now. We have worked with Hadoop clusters ranging from 50 all the way to 2000 nodes.
4 easy steps to becoming a data scientist All concepts are backed by interesting hands-on projects like analyzing million song dataset to find less familiar artists with hot songs.
10 Modern Statistical Concepts Discovered by Data Scientists.
50 Articles about Hadoop and Related Topics.
Misc: Top Links | Code Snippets | External Resources | Best Blogs | Subscribe | For Bloggers.
Buzz: Business News | Announcements | Events | RSS Feeds.
Knowledge: Research | Competitions | Webinars | Our Book | Members Only | Search DSC.
There was also a Kaggle competition and a Hackathon using it a couple of years ago. Read original article with description for each data repository. Accessing a million songs with Hive and Hadoop on AWS The million song dataset was created a few years ago to help encourage research on algorithms for analysing music related data.
NHS Health and Social Care Information Centre.
Data sets used in our data science apprenticeship - includes both real data and simulated data - and tips to create artificial, rich, big data sets for testing models.
DataScienceCentral selection of big data sets - check out the first itemized bullet list after clicking on this link.
This is an interesting listing created by Bernard Marr.