This long article, which includes extensive source code, was posted by Suraj V Vidyadaran. Suraj is pursuing a Master's in Computer Science at Temple University, specializing in data science. His areas of interest are sentiment analysis, data visualization, big data, and machine learning.
The data comes from the UCI Machine Learning Repository. The purpose of the analysis is to evaluate the safety standard of cars based on certain parameters and to classify them accordingly. A detailed description of the dataset is provided in the original article and in Suraj’s GitHub repository. For another article comparing different data science techniques (by a different author), read Performance From Various Predictive Models.
Clustering based on density peaks (source: click here)
This article provides source code and results, for the data set in question, for the following classification techniques:
- Logistic Regression
- Linear Discriminant Analysis
- Mixture Discriminant Analysis
- Quadratic Discriminant Analysis
- Neural Network
- Flexible Discriminant Analysis
- Support Vector Machine
- k-Nearest Neighbors
- Naive Bayes
- Classification and Regression Trees (CART)
- C4.5
- PART
- Bagging CART
- Random Forest
- Gradient Boosted Machine
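To give a flavor of what one of the listed techniques involves, here is a minimal sketch of Naive Bayes on categorical features, written in plain Python rather than the R used in the original article. It is not Suraj's code. The feature names (buying, safety, persons) and class labels (acc, unacc) are assumed to follow the UCI car evaluation data, and the handful of training rows are made up purely for illustration. The function names `fit` and `predict` are hypothetical helpers.

```python
from collections import Counter, defaultdict
import math

# Hypothetical toy rows: (buying, safety, persons) -> acceptability label.
# These rows are invented for illustration; they are NOT taken from the UCI data.
TRAIN = [
    (("low", "high", "4"), "acc"),
    (("low", "high", "more"), "acc"),
    (("med", "high", "4"), "acc"),
    (("med", "med", "4"), "acc"),
    (("high", "low", "2"), "unacc"),
    (("vhigh", "low", "4"), "unacc"),
    (("vhigh", "med", "2"), "unacc"),
    (("high", "low", "more"), "unacc"),
]


def fit(rows):
    """Count class frequencies and per-class frequencies of each feature value."""
    class_counts = Counter(label for _, label in rows)
    value_counts = defaultdict(Counter)  # (feature index, label) -> value counts
    levels = defaultdict(set)            # feature index -> distinct values seen
    for features, label in rows:
        for i, value in enumerate(features):
            value_counts[(i, label)][value] += 1
            levels[i].add(value)
    return class_counts, value_counts, levels


def predict(model, features, alpha=1.0):
    """Return the label with the highest posterior, using Laplace smoothing."""
    class_counts, value_counts, levels = model
    total = sum(class_counts.values())
    scores = {}
    for label, n in class_counts.items():
        # Work in log space: log prior plus the sum of log likelihoods.
        log_p = math.log(n / total)
        for i, value in enumerate(features):
            count = value_counts[(i, label)][value]
            log_p += math.log((count + alpha) / (n + alpha * len(levels[i])))
        scores[label] = log_p
    return max(scores, key=scores.get)


model = fit(TRAIN)
print(predict(model, ("low", "high", "4")))
print(predict(model, ("vhigh", "low", "2")))
```

The smoothing parameter `alpha` keeps a feature value that never co-occurs with a class from driving that class's probability to zero. A real comparison, as in the original article, would train on the full dataset and evaluate each model on held-out data.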
I was surprised to see the overlap with our recent article on top 10 machine learning algorithms. You can read the full article (with voluminous source code in R) here.