These 19 ‘sets of data sets’ cover free or public data from various industries, including small and large, structured and unstructured data sets. Hone your data science and machine learning skills on these data sets, or use them for testing algorithms or for benchmarking.
19 data set repositories
- Big data sets available for free
- Great Github list of public data sets
- 10 Great Healthcare Data Sets
- Password and hijacked email dataset
- 20 Big Data Repositories You Should Check Out
- Top 20 Open Data sources
- Great IoT, Sensor and other Data Sets Repositories
- The Free ‘Big Data’ Sources Everyone Should Know
- 100+ Interesting Data Sets for Data Science
- Data sets and other machine learning resources from UC Irvine
- Big data set – 3.5 billion web pages
- The Free ‘Big Data’ Sources Everyone Should Know
- Another large data set – 250 million data points
- Two big datasets to challenge your data science expertise
- 13 Machine Learning Data Set Collections
- Datasets for queueing modelling
- Kaggle Releases Data Sets About Global Warming
- Facebook Shares Large Data Sets to Help Improve its AI
- DSC data sets
More data sets can be found here.
Top DSC Resources
- Article: What is Data Science? 24 Fundamental Articles Answering This Question
- Article: Hitchhiker’s Guide to Data Science, Machine Learning, R, Python
- Tutorial: Data Science Cheat Sheet
- Tutorial: How to Become a Data Scientist – On Your Own
- Categories: Data Science – Machine Learning – AI – IoT – Deep Learning
- Tools: Hadoop – DataViZ – Python – R – SQL – Excel
- Techniques: Clustering – Regression – SVM – Neural Nets – Ensembles – Decision Trees
- Links: Cheat Sheets – Books – Events – Webinars – Tutorials – Training – News – Jobs
- Links: Announcements – Salary Surveys – Data Sets – Certification – RSS Feeds – About Us
- Newsletter: Sign-up – Past Editions – Members-Only Section – Content Search – For Bloggers
- DSC on: Ning – Twitter – LinkedIn – Facebook – GooglePlus
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge